JPH10133847A

JPH10133847A - Mobile terminal system for voice recognition, database search, and resource access communications

Info

Publication number: JPH10133847A
Application number: JP8285086A
Authority: JP
Inventors: Toru Yamakita; 徹山北
Original assignee: Casio Computer Co Ltd
Current assignee: Casio Computer Co Ltd
Priority date: 1996-10-28
Filing date: 1996-10-28
Publication date: 1998-05-22

Abstract

PROBLEM TO BE SOLVED: To realize voice recognition, database search, and resource access function as a user interface with practical accuracy and cost in communications environment using a mobile terminal. SOLUTION: In the mobile terminal 101, a voice signal inputted from an input part 109 is transmitted to a PHS network 103 from a control part 110 and a communication part 111, and transmitted to a voice control host unit 108 from there via a control host unit 104 and an internet 105. This voice signal is received by a mobile terminal communication control part 116 via a packet transmitting/receiving part 115 in the same unit, and after recognized in a sentence voice recognizing part 117, its search key word is extracted in a search control part 118, and a search processing is executed to a prescribed database engine. The search result HTML sentence data obtained as a result of that is returned to the mobile terminal 101 and received by the control part 110 via the communication part 111 and displayed at an output part 112. By selecting a hyper text on the displayed search result HTML sentence, a user accesses to an arbitrary resource on the internet 105.

Description

DETAILED DESCRIPTION OF THE INVENTION

【０００１】[0001]

【発明の属する技術分野】本発明は、移動（携帯）端末
装置において入力された通話音声等の音声を認識しその
認識結果に基づいてデータベースを検索する技術、及び
インターネット上等のリソースにアクセスする技術に関
する。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a technology for recognizing speech such as a call speech input in a mobile (portable) terminal device, searching a database based on the result of the recognition, and accessing resources on the Internet or the like. About technology.

【０００２】[0002]

【従来の技術及び発明が解決しようとする課題】音声信
号を認識して、文字データに変換して蓄積したり、認識
結果を種々のサービスの利用に供したりするための音声
認識技術は、様々な産業分野で従来から要請されてい
る。2. Description of the Related Art There are various voice recognition technologies for recognizing voice signals, converting them into character data and storing the data, and providing recognition results to use of various services. It has been demanded in various industrial fields.

【０００３】近年では、音声認識アルゴリズムの発達に
より、メインフレームコンピュータ又はワークステーシ
ョンコンピュータ等を用いた音声認識システムが開発さ
れている。In recent years, with the development of speech recognition algorithms, speech recognition systems using a mainframe computer or a workstation computer have been developed.

【０００４】これらのシステムは、例えば、電話音声を
入力とする銀行の残高照会システムや座席予約システ
ム、作業員の音声を認識して荷物の自動配送を行う配送
荷物の仕分システムを始めとして、種々の産業分野に取
り入られつつある。[0004] These systems include various systems such as a bank balance inquiry system and a seat reservation system using telephone voice as input, and a delivery package sorting system that recognizes workers' voices and automatically delivers packages. It is being adopted by other industrial fields.

【０００５】しかし、このような音声認識システムは、
上述のような大規模なコンピュータシステムの環境のも
とでようやく実用的な認識精度を有するレベルに到達し
たばかりであり、いわゆるパーソナルコンピュータのよ
うな小型のコンピュータシステム環境のもとでは、実用
的な認識精度を有する安価な音声認識システムは未だ実
現されていないのが現状である。However, such a speech recognition system has
Only under the environment of a large-scale computer system as described above has reached a level having practical recognition accuracy, and under the environment of a small-sized computer system such as a so-called personal computer, the practical recognition accuracy has been reached. At present, an inexpensive speech recognition system having recognition accuracy has not been realized yet.

【０００６】一方、上述のような情報処理技術と並行し
て、近年、自動車電話・携帯電話やＰＨＳ（パーソナル
ハンディホンシステム）等の移動端末が、急速に普及し
つつある。[0006] On the other hand, in parallel with the above information processing technology, mobile terminals such as car phones, mobile phones, and PHS (Personal Handy Phone System) have been rapidly spreading in recent years.

【０００７】特に、ＰＨＳは、小型であると同時に、自
動車電話・携帯電話に比較して通話料金が安く、かつ、
「いつでも、どこでも、誰とでも」高い品質で通話がで
きるという特徴を備えており、爆発的に普及しつつあ
る。更に、ＰＨＳは、ＩＳＤＮ（Integrated Services
Digital Network:サービス統合デジタル網）をバックボ
ーンとする公衆網であるため、３２キロビット／秒の伝
送レートでの高速デジタル通信が可能であり、マルチメ
ディア通信分野への応用に対する期待も高まっている。[0007] In particular, the PHS is small in size, and at the same time, the call charge is lower than that of a car phone or a mobile phone.
It has the feature of being able to talk with high quality "anytime, anywhere, with anyone" and is exploding. In addition, PHS is an ISDN (Integrated Services)
Since it is a public network using a Digital Network (Integrated Services Digital Network) as a backbone, high-speed digital communication at a transmission rate of 32 kilobits / second is possible, and expectations for its application to the multimedia communication field are increasing.

【０００８】更には、移動端末の利便性をいかすべく、
携帯電話装置としてだけではなく、携帯情報管理装置と
しても利用できるような、マルチメディア情報管理／通
信端末装置としての実現の可能性も期待されている。具
体的には、このような移動端末は、通話機能／ＦＡＸ機
能を備えることはもちろん、インターネットや社内ネッ
トワークへのアクセス機能としてのホームページアクセ
ス機能や電子メール通信機能を備えることが予想される
ほか、アドレス管理、スケジュール管理、データベース
検索／蓄積等の情報管理機能をも兼ね備えることが期待
される。Further, in order to take advantage of the convenience of the mobile terminal,
It is also expected that it can be used as a multimedia information management / communication terminal device that can be used not only as a mobile phone device but also as a mobile information management device. Specifically, such a mobile terminal is expected to have not only a call function / fax function, but also a homepage access function and an e-mail communication function as an access function to the Internet and a company network. It is expected to have information management functions such as address management, schedule management, and database search / storage.

【０００９】そして、このような移動端末は、人が気軽
に利用できるよう、できる限り人にやさしく自然なユー
ザインタフェースを備えることが要請される。現在実現
されているユーザインタフェースとしては、キーボード
やマウスによる指操作入力、電子ペンによる手書き入力
等が実用化されているが、音声入力等にも対応すること
ができれば、ユーザインタフェースとして理想的であ
る。即ち例えば、基本機能としての通話機能を利用しな
がら通話内容を示す音声信号をデータとして処理し、そ
の処理結果に基づいてインターネット上のデータベース
を検索すること等が可能になれば、移動端末の利便性を
飛躍的に増大させることが可能になる。ここに、移動端
末に対してユーザインタフェースとして音声認識機能を
適用することの価値を見出すことができる。[0009] Such a mobile terminal is required to have a natural user interface that is as gentle as possible to a person so that the user can use it easily. As user interfaces currently realized, finger operation input using a keyboard or a mouse, handwriting input using an electronic pen, and the like have been put to practical use, but if they can respond to voice input and the like, they are ideal as user interfaces. . That is, for example, if it becomes possible to process a voice signal indicating the content of a call as data while using a call function as a basic function, and to search a database on the Internet based on the processing result, etc. It is possible to dramatically increase the performance. Here, the value of applying a voice recognition function as a user interface to a mobile terminal can be found.

【００１０】しかし、移動端末は小型でありその情報処
理能力は限られている反面、前述したように、現在の音
声認識処理では、メインフレームコンピュータ又はワー
クステーションコンピュータクラスの環境のもとでない
と、実用的な認識精度を実現することは困難である。従
って、現状では、移動端末のユーザインタフェースとし
て音声認識機能を実現することは非常に困難である、と
いう問題点を有している。[0010] However, while the mobile terminal is small and has limited information processing capability, as described above, in the current speech recognition processing, the mobile terminal must be installed in a mainframe computer or workstation computer class environment. It is difficult to achieve practical recognition accuracy. Therefore, at present, there is a problem that it is very difficult to realize a voice recognition function as a user interface of a mobile terminal.

【００１１】本発明の課題は、移動端末を用いた通信環
境において、そのユーザインタフェースとしての音声認
識機能を実用的な精度及びコストで実現し、インターネ
ット上でのデータベース検索及び各種リソースへのアク
セスを可能とすることにある。An object of the present invention is to realize a speech recognition function as a user interface with practical accuracy and cost in a communication environment using a mobile terminal, and perform database search and access to various resources on the Internet. To make it possible.

【００１２】[0012]

【課題を解決するための手段】本発明はまず、以下の構
成を含む移動端末を有する。即ち、ホスト接続手段（制
御部１１０、通信部１１１）は、無線網又は有線網の何
れか一方又は両方から構成される中継網（ＰＨＳ網１０
３とインターネット１０５）を介して間接的に又はその
中継網を介さずに直接的にホスト装置である音声制御ホ
スト装置（音声制御ホスト装置１０８）に接続する。The present invention first has a mobile terminal having the following configuration. In other words, the host connection means (the control unit 110 and the communication unit 111) is connected to a relay network (PHS network 10) composed of one or both of a wireless network and a wired network.
3 and the Internet 105) indirectly or directly to a voice control host device (voice control host device 108) as a host device without going through a relay network.

【００１３】音声入力手段（入力部１０９）は、音声を
入力する。音声データ送信手段（制御部１１０、通信部
１１１）は、ホスト接続手段による接続動作の後、音声
入力手段から入力される音声データを音声制御ホスト装
置に送信する。The voice input means (input section 109) inputs voice. After the connection operation by the host connection unit, the audio data transmission unit (the control unit 110 and the communication unit 111) transmits the audio data input from the audio input unit to the audio control host device.

【００１４】検索結果ＨＴＭＬ文章データ受信手段（制
御部１１０、通信部１１１）は、音声制御ホスト装置か
ら返信されハイパーテキストマークアップ言語ＨＴＭＬ
によって記述された検索結果ＨＴＭＬ文章データを受信
する。The search result HTML text data receiving means (the control unit 110 and the communication unit 111) returns the hypertext markup language HTML which is returned from the voice control host device.
Receiving the search result HTML text data described in the above.

【００１５】検索結果ＨＴＭＬ文章データ表示／処理手
段（制御部１１０、出力部１１２）は、その受信された
検索結果ＨＴＭＬ文章データを表示及び処理する。リソ
ースアクセス処理手段（制御部１１０、通信部１１１）
は、表示される検索結果ＨＴＭＬ文章データに含まれか
つユーザが指定したアクセス情報（ハイパーテキストに
含まれるＵＲＬ）に対応するホスト装置上のリソース
（ホームページデータ、Ｊａｖａアプレット、ファイル
データ、そのホスト装置のログインアカウント等）に、
中継網を介して間接的に又は中継網を介さずに直接的に
アクセスして、そのリソースを処理する。より具体的に
は、リソースアクセス処理手段は、上記アクセス情報で
あるＵＲＬにより示されるインターネット等に接続され
るホスト装置上のホームページデータやＪａｖａアプレ
ットやファイルデータやそのホスト装置のログインアカ
ウント等の各種リソースに対し、そのＵＲＬにより指定
されるＨＴＴＰ（Hyper Text Transfer Protocol）（ホ
ームページデータの取得又はＪａｖａアプレットの取得
／実行の場合）、ＦＴＰ（File TransferProtocol）
（ファイルデータの取得の場合）、又はＴＥＬＮＥＴ
（ホスト装置へのログインの場合）等の通信プロトコル
を用いて、アクセスする。この場合、リソースアクセス
処理手段は、上記ＵＲＬによってホームページデータや
Ｊａｖａアプレットやファイルデータ等の取得又は実行
が指定されている場合には、それらを、上記ＨＴＴＰ又
はＦＴＰ等の通信プロトコルを用いてホスト装置から移
動端末に転送させて、表示し（ホームページデータの取
得の場合）、実行し（Ｊａｖａアプレットの取得及び実
行の場合）、又は蓄積する（ファイルデータの取得の場
合）。或いは、リソースアクセス処理手段は、上記ＵＲ
Ｌによってホスト装置に対するログインが指定されてい
る場合は、ＴＥＬＮＥＴプロトコルを用いてそのホスト
装置にログインする。The search result HTML text data display / processing means (control unit 110, output unit 112) displays and processes the received search result HTML text data. Resource access processing means (control unit 110, communication unit 111)
Are resources (homepage data, Java applets, file data, etc.) on the host device corresponding to the access information (URL included in the hypertext) included in the displayed search result HTML text data and specified by the user. Login account)
Access the resource indirectly via the transit network or directly without transiting the transit network to process the resources. More specifically, the resource access processing means includes various resources such as homepage data, Java applets and file data on a host device connected to the Internet or the like indicated by the URL as the access information, and a login account of the host device. HTTP (Hyper Text Transfer Protocol) specified by the URL (in the case of obtaining homepage data or obtaining / executing a Java applet), FTP (File Transfer Protocol)
(For file data acquisition) or TELNET
Access is made using a communication protocol such as (in the case of login to a host device). In this case, when acquisition or execution of homepage data, Java applet, file data, or the like is specified by the URL, the resource access processing unit converts the data into a host device using the communication protocol such as HTTP or FTP. To transfer to a mobile terminal and display (in the case of acquiring homepage data), execute (in the case of acquiring and executing Java applets), or accumulate (in the case of acquiring file data). Alternatively, the resource access processing means includes the UR
When the login to the host device is designated by L, the user logs in to the host device using the TELNET protocol.

【００１６】次に、本発明は、以下の構成を含む音声制
御ホスト装置１０８を有する。即ち、移動端末接続手段
（パケット送受信部１１５、移動端末通信制御部１１
６）は、移動端末内のホスト接続手段による接続動作に
応答して、移動端末を識別して接続する。Next, the present invention has a voice control host device 108 having the following configuration. That is, mobile terminal connection means (packet transmitting / receiving section 115, mobile terminal communication control section 11
6) responds to the connection operation by the host connection means in the mobile terminal, and identifies and connects the mobile terminal.

【００１７】音声データ受信手段（パケット送受信部１
１５、移動端末通信制御部１１６）は、現在接続されて
いる移動端末毎に、音声データを受信する。音声認識手
段（移動端末通信制御部１１６、文音声認識部１１７）
は、現在接続されている移動端末毎に、音声データ受信
手段により受信された音声データに対して音声認識処理
を実行する。Voice data receiving means (packet transmitting / receiving unit 1)
15. The mobile terminal communication control unit 116) receives voice data for each currently connected mobile terminal. Voice recognition means (mobile terminal communication control unit 116, sentence voice recognition unit 117)
Executes voice recognition processing on voice data received by the voice data receiving means for each mobile terminal currently connected.

【００１８】検索制御手段（検索制御部１１８）は、現
在接続されている移動端末毎に、音声認識手段による音
声認識処理によって得られる認識音声データから検索キ
ーワードを抽出し、その検索キーワードに対応するリソ
ースに対するアクセス情報を含む検索結果ＨＴＭＬ文章
データを所定のデータベースシステムから検索する。こ
の所定のデータベースシステムは、例えば、インターネ
ットに接続される所定のホスト装置が提供する。The search control means (search control unit 118) extracts a search keyword from the recognized voice data obtained by the voice recognition processing by the voice recognition means for each currently connected mobile terminal, and corresponds to the search keyword. The search result HTML text data including the access information for the resource is searched from a predetermined database system. This predetermined database system is provided, for example, by a predetermined host device connected to the Internet.

【００１９】また、上述の検索制御手段は、例えば、以
下の検索インデックス作成手段、検索キーワード抽出手
段、及び検索実行手段を含む。即ちまず、検索インデッ
クス作成手段は、現在接続されている移動端末毎に、音
声認識手段による音声認識処理によって得られる認識音
声データを所定の分類規則に従って分類することによ
り、より具体的には、例えば入力されたデータ中に現れ
る各単語を出現回数の多い順に分類することによって、
検索インデックスを作成する。検索キーワード抽出手段
は、現在接続されている移動端末毎に、検索インデック
ス作成手段により作成される検索インデックスから所定
の抽出基準を満たす語句を抽出し、より具体的には、例
えば出現回数が所定回数以上の単語又は所定の出現回数
順位以上の順位の単語を抽出し、その抽出された語句か
ら所定の不要キーワードを除去し、その結果得られる語
句のうち新規のものを検索キーワードとして抽出する。
そして、検索実行手段は、検索キーワードに対応する検
索結果ＨＴＭＬ文章データを所定のデータベースシステ
ムから検索する。なお、検索インデックスの作成処理と
不要キーワードの除去処理は、逆の順序で実行されても
よく、それも本発明の権利範囲である。The above-mentioned search control means includes, for example, the following search index creation means, search keyword extraction means, and search execution means. That is, first, the search index creation unit classifies the recognized speech data obtained by the speech recognition processing by the speech recognition unit for each currently connected mobile terminal according to a predetermined classification rule, and more specifically, for example, By classifying the words that appear in the input data in the order of the number of occurrences,
Create a search index. The search keyword extracting means extracts a phrase satisfying a predetermined extraction criterion from a search index created by the search index creating means for each of the currently connected mobile terminals, and more specifically, for example, when the number of appearances is a predetermined number of times. The above words or words having a rank equal to or higher than a predetermined number of appearances are extracted, a predetermined unnecessary keyword is removed from the extracted phrases, and a new one of the resulting phrases is extracted as a search keyword.
Then, the search execution means searches the predetermined database system for search result HTML text data corresponding to the search keyword. Note that the search index creation processing and the unnecessary keyword removal processing may be performed in the reverse order, which is also within the scope of the present invention.

【００２０】検索結果ＨＴＭＬ文章データ返信手段（移
動端末通信制御部１１６、パケット送受信部１１５）
は、現在接続されている移動端末毎に、検索制御手段に
よる検索処理によって得られる検索結果ＨＴＭＬ文章デ
ータを、それに対応する移動端末に返信する。Means for returning search result HTML text data (mobile terminal communication control unit 116, packet transmission / reception unit 115)
Returns search result HTML text data obtained by the search processing by the search control means for each mobile terminal currently connected to the corresponding mobile terminal.

【００２１】以上の移動端末と音声制御ホスト装置を含
む本発明による通信移動端末音声認識／データベース検
察通信システムにより、移動端末は、高度な音声認識／
データベース検索／リソースアクセス環境を設備する必
要がなく実用的な精度を有する音声認識／データベース
検索機能の提供を低コストで受けることができる。The mobile terminal according to the present invention including the mobile terminal and the voice control host device according to the present invention can perform advanced voice recognition / database detection.
There is no need to provide a database search / resource access environment, and a speech recognition / database search function having practical accuracy can be provided at low cost.

【００２２】上述の発明の構成は、下記の限定を含むこ
とができる。即ち、まず、移動端末は、パーソナルハン
ディホンシステム通信機能（通信部１１１）を備える。The configuration of the invention described above can include the following limitations. That is, first, the mobile terminal has a personal handyphone system communication function (communication unit 111).

【００２３】次に、中継網は、パーソナルハンディホン
システム通信網（ＰＨＳ網１０３）とインターネット
（インターネット１０５）を含む。また、音声制御ホス
ト装置及び前述のアクセス情報に対応するホスト装置
は、インターネットに接続する。Next, the relay network includes a personal handyphone system communication network (PHS network 103) and the Internet (Internet 105). The voice control host device and the host device corresponding to the above-mentioned access information are connected to the Internet.

【００２４】そして、移動端末内のホスト接続手段又は
リソースアクセス処理手段は、パーソナルハンディホン
システム通信網を介して、そのパーソナルハンディホン
システム通信網を含む公衆網とインターネットとの間の
ゲートウエイ機能を有する移動端末制御ホスト装置（移
動端末制御ホスト装置１０４）に発信し接続することに
よって、インターネット上の通信プロトコルを使用し
て、移動端末制御ホスト装置からインターネットを介し
て音声制御ホスト装置又は前述のアクセス情報に対応す
るホスト装置に接続又はアクセスする。The host connection means or the resource access processing means in the mobile terminal has a gateway function between the public network including the personal handyphone system communication network and the Internet via the personal handyphone system communication network. By transmitting and connecting to the mobile terminal control host device (mobile terminal control host device 104), the voice control host device or the aforementioned access information is transmitted from the mobile terminal control host device via the Internet using a communication protocol on the Internet. Or access to the host device corresponding to.

【００２５】この限定構成によって、現在全国的及び全
世界的に普及しつつあるパーソナルハンディホンシステ
ム通信網及びインターネットを経由することにより、実
用的な精度を有する音声認識機能と、ワールドワイドな
データベース検索、及びその検索結果に対応するリソー
スへのアクセス機能の提供を、より低コスト及び手軽に
受けることができると同時に、本発明が提供する機能と
パーソナルハンディホンシステム通話機能及びインター
ネットアクセス機能とを、シームレスに結合することが
できる。With this limited configuration, a voice recognition function having practical accuracy and a world-wide database search can be provided via a personal handyphone system communication network and the Internet, which are currently spreading nationwide and worldwide. , And the provision of an access function to a resource corresponding to the search result can be provided at a lower cost and easily, and at the same time, the function provided by the present invention, the personal handyphone system call function and the Internet access function, Can be seamlessly combined.

【００２６】更に、ホスト接続手段が使用する通信プロ
トコルは、下記の限定を含むことができる。即ち、上記
通信プロトコルは、インターネットプロトコル（ＩＰ）
層及びトランスミッションコントロールプロトコル（Ｔ
ＣＰ）層を含む階層プロトコルである。Further, the communication protocol used by the host connection means can include the following restrictions. That is, the communication protocol is Internet Protocol (IP)
Layer and Transmission Control Protocol (T
This is a layer protocol including a CP) layer.

【００２７】次に、インターネット上を伝送されるイン
ターネットプロトコル層のパケットデータであるインタ
ーネットプロトコルデータグラム（ＩＰデータグラム）
のヘッダ（ＩＰヘッダ）フィールドには、インターネッ
ト上での移動端末及び音声制御ホスト装置のアドレスを
指定する送信元インターネットプロトコルアドレス及び
宛先インターネットプロトコルアドレスが格納され、そ
のインターネットプロトコルデータグラムのデータフィ
ールドには、トランスミッションコントロールプロトコ
ル層のパケットデータであるトランスミッションコント
ロールプロトコルセグメントが格納される。Next, an Internet Protocol datagram (IP datagram) which is packet data of an Internet protocol layer transmitted on the Internet.
The header (IP header) field stores a source Internet protocol address and a destination Internet protocol address designating addresses of a mobile terminal and a voice control host device on the Internet, and the data field of the Internet protocol datagram contains And a transmission control protocol segment which is packet data of the transmission control protocol layer.

【００２８】また、トランスミッションコントロールプ
ロトコルセグメント（ＴＣＰセグメント）のヘッダ（Ｔ
ＣＰヘッダ）フィールドには、音声認識／データベース
検索処理のための通信プロトコルを特定する送信元ポー
ト番号及び宛先ポート番号が格納され、そのトランスミ
ッションコントロールプロトコルセグメントのデータフ
ィールドには、移動端末を識別するための端末識別コー
ド、音声データ、又は検索結果ＨＴＭＬ文章データが格
納される。The transmission control protocol segment (TCP segment) header (T
The CP header field stores a source port number and a destination port number for specifying a communication protocol for speech recognition / database search processing. The data field of the transmission control protocol segment includes a data field for identifying a mobile terminal. Terminal identification code, voice data, or search result HTML text data.

【００２９】この限定構成によって、移動端末と音声制
御ホスト装置とを全世界的に容易に特定できると共に、
音声認識／データベース検索処理サービスと、その検索
結果に基づくリソースへのアクセスサービス、及びその
他の情報処理サービスとの共存を容易に実現できる。With this limited configuration, the mobile terminal and the voice control host device can be easily specified worldwide, and
The coexistence of the voice recognition / database search processing service, the resource access service based on the search result, and other information processing services can be easily realized.

【００３０】ここまでの発明の構成において、音声制御
ホスト装置は、網によって相互に接続され、移動端末接
続手段、音声データ受信手段、音声認識手段、データベ
ース手段、検索制御手段、及び検索結果ＨＴＭＬ文章デ
ータ返信手段に対応する機能を分散して実現する複数の
ホストコンピュータから構成されるように実現すること
ができる。In the configuration of the present invention described above, the voice control host devices are mutually connected by a network, and mobile terminal connecting means, voice data receiving means, voice recognition means, database means, search control means, and search result HTML text are provided. The present invention can be realized by a plurality of host computers that realize the functions corresponding to the data return means in a distributed manner.

【００３１】この限定構成によって、ホスト装置側の負
荷分散を容易に実現できる。なお、上述した移動端末及
び音声制御ホスト装置の単体も、本発明の権利範囲であ
る。With this limited configuration, load distribution on the host device side can be easily realized. Note that the above-described mobile terminal and voice control host device alone are also within the scope of the present invention.

【００３２】[0032]

【発明の実施の形態】以下、図面を参照しながら本発明
の実施の形態について詳細に説明する。本実施の形態で
は、ＰＨＳ機能が組み込まれた移動端末において、通話
時に又はオフライン状態でマイクから入力された音声信
号が、ＰＨＳ網からインターネットを介して特定の音声
サービスプロバイダ内のＬＡＮに接続される音声制御ホ
スト装置に送られ、そこで上記音声信号が認識された
後、その認識結果が自動的に分類され、その分類結果に
基づいてインターネット上の特定のデータベース検索エ
ンジンがアクセスされ、そこでのデータベースが検索さ
れる。この結果、音声制御ホスト装置は、移動端末で入
力された音声の内容に関連するデータベース情報をイン
ターネットから取得し、それをリアルタイムに移動端末
に返送する。このデータベース情報は、移動端末で入力
された音声の内容に関連するキーワードを含むインター
ネット上のホームページ等の各種リソースに対する統一
形式アクセス情報であるＵＲＬが記述されているハイパ
ーテキストを含むＨＴＭＬ文章（検索結果ＨＴＭＬ文章
データ）として表現される。移動端末は、この検索結果
ＨＴＭＬ文章データを受信すると、ホームページの閲覧
のためのブラウザアプリケーションを起動し、上述の検
索結果ＨＴＭＬ文章データをホームページ形式で表示す
る。移動端末のユーザは、ハイパーテキストの一部とし
て表示されたアンカー（テキストアンカー又はグラフィ
ックスアンカー）を電子ペンでタッチ等して選択するこ
とによって、そのアンカーと共にハイパーテキストに含
まれるＵＲＬに対応するインターネット上のホームペー
ジやＪａｖａアプレットやファイルやホスト装置のログ
インアカウント等の各種リソースにアクセスし、移動端
末に転送する必要のある場合には、それらのリソース
を、移動端末１０１に転送した後、表示、実行、又は蓄
積することができる。このように、本実施の形態では、
移動端末は、高度な音声認識環境を設備する必要がなく
実用的な精度を有する音声認識機能の提供を低コストで
受けることができ、かつインターネット上のデータベー
スの対話的検索機能をも装備することができることが、
本発明に関連する大きな特徴である。＜システム構成＞図１は、本発明の実施の形態の全体シ
ステム構成図である。Embodiments of the present invention will be described below in detail with reference to the drawings. In the present embodiment, in a mobile terminal having a built-in PHS function, a voice signal input from a microphone during a call or in an off-line state is connected to a LAN in a specific voice service provider from the PHS network via the Internet. After being sent to the voice control host device, where the voice signal is recognized, the recognition result is automatically classified, and a specific database search engine on the Internet is accessed based on the classification result, and the database there is searched. Searched. As a result, the voice control host device obtains database information related to the content of voice input at the mobile terminal from the Internet, and returns it to the mobile terminal in real time. The database information includes an HTML sentence including a hypertext in which a URL that is a unified format access information for various resources such as a homepage on the Internet including a keyword related to the content of the voice input by the mobile terminal is described (search result). (HTML text data). Upon receiving the search result HTML text data, the mobile terminal activates a browser application for browsing a home page, and displays the above-described search result HTML text data in a home page format. The user of the mobile terminal selects an anchor (text anchor or graphics anchor) displayed as a part of the hypertext by touching the electronic pen or the like with an electronic pen or the like, and the Internet corresponding to the URL included in the hypertext together with the anchor. When it is necessary to access various resources such as the above homepage, Java applet, file, and login account of the host device and transfer them to the mobile terminal, transfer those resources to the mobile terminal 101, and then display and execute the resources. Or can be accumulated. Thus, in the present embodiment,
The mobile terminal must be able to provide a voice recognition function with practical accuracy at a low cost without having to install an advanced voice recognition environment, and also be equipped with an interactive database search function on the Internet. Can do
This is a major feature related to the present invention. <System Configuration> FIG. 1 is an overall system configuration diagram of an embodiment of the present invention.

【００３３】移動端末１０１は、ＰＨＳ端末機能を有し
ており、無線基地１０２を介して、無線通信によってＰ
ＨＳ網１０３に接続される。無線基地１０２は、街路の
公衆電話ボックス、電柱、ビル屋上、地下通路等に設け
られる公衆無線基地、又は加入者宅内の親子電話装置等
である。なお、親子電話装置に接続される場合は、ＰＨ
Ｓ網を介さずに、直接公衆電話網に接続される。なお、
無線基地１０２の代わりに、有線接続装置を介して、有
線通信によってＰＨＳ網１０３又は公衆電話網に接続さ
れるように構成されてもよい。[0033] The mobile terminal 101 has a PHS terminal function.
It is connected to the HS network 103. The wireless base 102 is a public wireless base provided on a public telephone booth, a telephone pole, a building rooftop, an underground passage, or the like on a street, or a parent-child telephone device in a subscriber's house. When connected to the parent-child telephone device, the PH
It is directly connected to the public telephone network without going through the S network. In addition,
Instead of the wireless base 102, a configuration may be adopted in which the wireless base 102 is connected to the PHS network 103 or the public telephone network by wired communication via a wired connection device.

【００３４】ＰＨＳ網１０３は、公衆電話網又はＩＳＤ
Ｎ網と相互接続しており、これらの網には、高速デジタ
ル専用線等によってインターネット１０５に接続してい
る移動端末制御ホスト装置１０４が接続されている。The PHS network 103 is a public telephone network or an ISD
It is interconnected with N networks, and a mobile terminal control host device 104 connected to the Internet 105 by a high-speed digital leased line or the like is connected to these networks.

【００３５】移動端末１０１は、無線基地１０２及びＰ
ＨＳ網１０３を介して、上記公衆電話網又はＩＳＤＮ網
に接続されている移動端末制御ホスト装置１０４に自動
的にダイヤルアップ発信することによって、インターネ
ット１０５に接続することができる。The mobile terminal 101 is connected to the radio base 102 and the P
By automatically dialing up the mobile terminal control host device 104 connected to the public telephone network or the ISDN network via the HS network 103, it is possible to connect to the Internet 105.

【００３６】インターネット１０５には、高速デジタル
専用線等を介して所定の音声サービスプロバイダのＬＡ
Ｎ１０７に接続しているルータ装置１０６が接続されて
いる。ＬＡＮ１０７は、イーサネット方式、ＡＴＭ（As
ynchronous Transfer Mode）方式、又はＦＤＤＩ方式に
よるローカルエリアネットワークである。ＬＡＮ１０７
には、更に音声制御ホスト装置１０８が接続されてい
る。A predetermined voice service provider LA is connected to the Internet 105 via a high-speed digital leased line or the like.
The router device 106 connected to N107 is connected. LAN 107 is an Ethernet system, ATM (As
Synchronous Transfer Mode) or FDDI. LAN 107
Is connected to a voice control host device 108.

【００３７】移動端末１０１は、移動端末制御ホスト装
置１０４に自動的にダイヤルアップ発信した後に、イン
ターネット１０５、ルータ装置１０６、及びＬＡＮ１０
７を介して、音声制御ホスト装置１０８と通信すること
ができる。After automatically dialing up the mobile terminal 101 to the mobile terminal control host device 104, the mobile terminal 101 sends the Internet 105, the router device 106, and the LAN 10
7 can communicate with the voice control host device 108.

【００３８】今、移動端末１０１内の入力部１０９にお
いて、ユーザが、タッチパネルから音声制御ホスト装置
１０８との通信を指示すると、制御部１１０は、通信部
１１１に対して、音声制御ホスト装置１０８との通信開
始を依頼する。Now, when the user instructs communication with the voice control host device 108 from the touch panel on the input unit 109 in the mobile terminal 101, the control unit 110 sends a command to the communication unit 111 with the voice control host device 108. Request to start communication.

【００３９】通信部１１１は、制御部１１０から通信開
始を依頼されると、現在移動端末制御ホスト装置１０４
に接続していなければ、無線基地（又は有線接続装置）
１０２に無線（又は有線）発信してＰＨＳ網１０３に接
続した後、移動端末制御ホスト装置１０４のアクセス電
話番号を指定してダイヤルアップ発信する。When the communication unit 111 is requested by the control unit 110 to start communication, the current mobile terminal control host device 104
If not connected to a wireless base (or wired connection device)
After making a wireless (or wired) call to 102 and connecting to the PHS network 103, a dial-up call is made by specifying the access telephone number of the mobile terminal control host device 104.

【００４０】移動端末制御ホスト装置１０４が着信する
と、移動端末１０１内の通信部１１１は、まず、移動端
末制御ホスト装置１０４内の接続確立部１１３と通信す
ることにより、インターネット１０５上の標準通信プロ
トコルであるＴＣＰ／ＩＰ及びＰＰＰ方式による接続の
確立のためのネゴシエーションを行う。この結果、移動
端末制御ホスト装置１０４から、移動端末１０１内の通
信部１１１に対して、インターネット１０５上の識別ア
ドレスであるＩＰアドレスが付与され、移動端末１０１
は、インターネット１０５へのアクセスが可能となる。When the mobile terminal control host device 104 receives an incoming call, the communication unit 111 in the mobile terminal 101 first communicates with the connection establishment unit 113 in the mobile terminal control host device 104, thereby establishing a standard communication protocol on the Internet 105. Negotiation for establishing a connection by the TCP / IP and PPP methods. As a result, an IP address, which is an identification address on the Internet 105, is assigned from the mobile terminal control host device 104 to the communication unit 111 in the mobile terminal 101, and the mobile terminal 101
Can access the Internet 105.

【００４１】移動端末１０１内の通信部１１１は、既に
移動端末制御ホスト装置１０４に接続していれば、上記
タイヤルアップ発信は省略する。その後、移動端末１０
１内の通信部１１１は、予め設定されている音声制御ホ
スト装置１０８のＩＰアドレスである“宛先ＩＰアドレ
ス”と、移動端末制御ホスト装置１０４から付与された
ＩＰアドレスである“送信元ＩＰアドレス”と、移動端
末１０１を識別するための“端末識別コード”（例えば
ＰＨＳ電話番号）と、ユーザの指定に基づく文音声認識
／データベース検索処理の開始要求コマンド又は文音声
認識／データベース検索処理の終了要求コマンドとが格
納されたＴＣＰ／ＩＰパケットを、インターネット１０
５に向けて送出する。If the communication unit 111 in the mobile terminal 101 is already connected to the mobile terminal control host device 104, the above dial-up transmission is omitted. Then, the mobile terminal 10
1 includes a “destination IP address” that is a preset IP address of the voice control host device 108 and a “source IP address” that is an IP address assigned by the mobile terminal control host device 104. And a "terminal identification code" (for example, a PHS telephone number) for identifying the mobile terminal 101, and a command to start a sentence speech recognition / database search process or a request to end a sentence speech recognition / database search process based on a user's designation. The TCP / IP packet storing the command and the
Send it out to 5.

【００４２】このＴＣＰ／ＩＰパケットは、それに格納
されている“宛先ＩＰアドレス”に基づき、移動端末制
御ホスト装置１０４内のルーティング部１１４とインタ
ーネット１０５内の特には図示しない中継ホスト装置に
よって、音声サービスプロバイダ内のルータ装置１０６
まで転送された後、更に、ＬＡＮ１０７を介して音声制
御ホスト装置１０８内のパケット送受信部１１５まで転
送される。Based on the “destination IP address” stored in the TCP / IP packet, a voice service is provided by a routing unit 114 in the mobile terminal control host device 104 and a relay host device (not shown) in the Internet 105. Router device 106 in the provider
After that, the packet is further transferred to the packet transmitting / receiving unit 115 in the voice control host device 108 via the LAN 107.

【００４３】パケット送受信部１１５は、受信したＴＣ
Ｐ／ＩＰパケットから、“送信元ＩＰアドレス”と、
“端末識別コード”と、文音声認識／データベース検索
処理の開始要求コマンド又は文音声認識／データベース
検索処理の終了要求コマンドとを取り出して、音声制御
ホスト装置１０８内の移動端末通信制御部１１６に引き
渡す。Packet transmitting / receiving section 115 receives the received TC
From the P / IP packet, the “source IP address”
The terminal identification code and the sentence speech recognition / database search processing start request command or the sentence speech recognition / database search processing end request command are extracted and transferred to the mobile terminal communication control unit 116 in the speech control host device 108. .

【００４４】移動端末通信制御部１１６は、引き渡され
た“送信元ＩＰアドレス”と、“端末識別コード”と、
文音声認識／データベース検索処理の開始要求コマンド
又は文音声認識／データベース検索処理の終了要求コマ
ンドに関する情報を後述する処理端末登録テーブル（図
１２）に登録した後、パケット送受信部１１５に対し
て、送信許可データが格納されたＴＣＰ／ＩＰパケット
の移動端末１０１への返信を依頼する。The mobile terminal communication control unit 116 transmits the delivered “source IP address”, “terminal identification code”,
After registering information relating to a start command of the sentence speech recognition / database search process or an end request command of the sentence speech recognition / database search process in a processing terminal registration table (FIG. 12) described later, the information is transmitted to the packet transmitting / receiving unit 115 A request is sent to the mobile terminal 101 to return a TCP / IP packet storing the permission data.

【００４５】パケット送受信部１１５は、対応するＴＣ
Ｐ／ＩＰパケットを、移動端末１０１に対応するＩＰア
ドレスに向けて送信する。このようにして、音声制御ホ
スト装置１０８は、移動端末１０１から転送されてくる
音声データに対して文音声認識／データベース検索処理
を実行することが可能となる。The packet transmitting / receiving unit 115
The P / IP packet is transmitted to the IP address corresponding to the mobile terminal 101. In this way, the voice control host device 108 can execute sentence voice recognition / database search processing on voice data transferred from the mobile terminal 101.

【００４６】移動端末１０１内の通信部１１１は、音声
制御ホスト装置１０８から上記送信許可データが格納さ
れたＴＣＰ／ＩＰパケットを受信すると、それに格納さ
れている送信許可データを制御部１１０に引き渡す。When the communication unit 111 in the mobile terminal 101 receives a TCP / IP packet storing the above-mentioned transmission permission data from the voice control host device 108, it passes the transmission permission data stored therein to the control unit 110.

【００４７】移動端末１０１内の制御部１１０は、上記
送信許可データを引き渡された後、通信部１１１に対し
て、通話動作又はオフライン状態での音声入力動作によ
ってマイクから入力された音声データの音声制御ホスト
装置１０８への送信を依頼する。After the transmission permission data is delivered, the control unit 110 in the mobile terminal 101 sends the voice of the voice data input from the microphone to the communication unit 111 by a call operation or a voice input operation in an offline state. Request transmission to the control host device 108.

【００４８】通信部１１１は、上記音声データが格納さ
れたＴＣＰ／ＩＰパケットを、音声制御ホスト装置１０
８に対応するＩＰアドレスに向けて送信する。このＴＣ
Ｐ／ＩＰパケットは、それに格納されている“宛先ＩＰ
アドレス”に基づき、移動端末制御ホスト装置１０４内
のルーティング部１１４、インターネット１０５内の特
には図示しない中継ホスト装置、音声サービスプロバイ
ダ内のルータ装置１０６、及びＬＡＮ１０７を介して、
音声制御ホスト装置１０８内のパケット送受信部１１５
まで転送される。The communication unit 111 transmits the TCP / IP packet storing the voice data to the voice control host device 10.
8 is transmitted to the IP address corresponding to No. 8. This TC
The P / IP packet has the “destination IP” stored therein.
Based on the address, via the routing unit 114 in the mobile terminal control host device 104, the relay host device (not shown) in the Internet 105, the router device 106 in the voice service provider, and the LAN 107,
Packet transmission / reception unit 115 in voice control host device 108
Transferred to

【００４９】パケット送受信部１１５は、受信したＴＣ
Ｐ／ＩＰパケットに格納されている音声データを取り出
し、それを音声制御ホスト装置１０８内の移動端末通信
制御部１１６に引き渡す。The packet transmitting / receiving unit 115 receives the received TC
The voice data stored in the P / IP packet is extracted and delivered to the mobile terminal communication control unit 116 in the voice control host device 108.

【００５０】移動端末通信制御部１１６は、引き渡され
た音声データを文音声認識部１１７に引き渡す。文音声
認識部１１７は、引き渡された音声データに対し文音声
認識処理を実行し、認識結果である認識音声文章データ
を検索制御部１１８に引き渡す。検索制御部１１８は、
認識音声文章データを、移動端末１０１別にインデック
ス分類し、その結果得られる検索インデックスに基づき
不要キーワード辞書を参照しながら検索キーワードを抽
出する。そして、検索制御部１１８は、インターネット
１０５上の予め登録されている特定のデータベース検索
エンジンに対して、検索キーワードによる問合せを依頼
する。その結果、検索制御部１１８は、データベース検
索エンジンから返される検索結果に基づき検索結果ＨＴ
ＭＬ文章データを生成し、それを移動端末通信制御部１
１６に引き渡す。この検索結果ＨＴＭＬ文章データは、
移動端末１０１で入力された音声の内容に関連する上記
検索キーワードを含むインターネット１０５上の任意の
ホームページ等の各種リソースに対する統一形式アクセ
ス情報であるＵＲＬ（Uniform Resource Locator）が記
述されているハイパーテキストを含むハイパーテキスト
マークアップランゲージＨＴＭＬである。The mobile terminal communication control section 116 delivers the delivered voice data to the sentence voice recognition section 117. The sentence speech recognition unit 117 executes a sentence speech recognition process on the delivered speech data, and delivers the recognized speech sentence data as the recognition result to the search control unit 118. The search control unit 118
Recognized speech text data is classified into indices for each mobile terminal 101, and a search keyword is extracted based on a search index obtained as a result while referring to an unnecessary keyword dictionary. Then, the search control unit 118 requests a specific database search engine registered in advance on the Internet 105 to make an inquiry using the search keyword. As a result, the search control unit 118 uses the search result HT based on the search result returned from the database search engine.
Generates ML text data and transmits it to the mobile terminal communication control unit 1
Hand over to 16. This search result HTML text data is
A hypertext in which a URL (Uniform Resource Locator), which is unified format access information for various resources such as an arbitrary homepage on the Internet 105 including the above-described search keyword related to the content of the voice input by the mobile terminal 101, is described. Includes Hypertext Markup Language HTML.

【００５１】今、例えば、移動端末１０１におけるＰＨ
Ｓ通話において、図１７に示されるような会話がやりと
りされたとする。これに対して、文音声認識部１１７
は、途中経過として、図１８に示されるような認識音声
文章データを出力する。なお、“＊”は、文音声認識部
１１７によって付加される単語の区切りである。この認
識音声文章データを入力した検索制御部１１８は、途中
経過として、例えば、図１９に示されるような検索イン
デックスを作成して、その中で例えば出現回数が２回を
超えた単語“時計”及び“カシオ”を、検索キーワード
として抽出する。そして、この検索キーワード（アンド
条件）によるインターネット１０５上の特定のデータベ
ース検索エンジンに対する問合せの結果として、図２０
に示されるような検索結果ＨＴＭＬ文章データを生成す
る。このＨＴＭＬデータにおいて、例えば、“<A HREF
="http://www.casio.co.jp/">カシオホームページ</A
>”がカシオホームページに対応するハイパーテキスト
で、"http://www.casio.co.jp/"が上記ホームページの
ＵＲＬを示し、“カシオホームページ”がそのＵＲＬに
アクセスするためのテキストアンカーを示している。上
記ＵＲＬは、アドレス情報“www.casio.co.jp/”と、そ
のアドレスのリソースにアクセスするための通信プロト
コル情報“http”とを含む。また、それに続く“<DD>”
以降の文章が、上述のデータベース検索エンジンから自
動的に得られる説明文である。なお、“ <”と“> ”で
囲まれた記号は、表示制御用の記号である。Now, for example, the PH in the mobile terminal 101
It is assumed that a conversation as shown in FIG. 17 has been exchanged in the S call. On the other hand, the sentence speech recognition unit 117
Outputs recognized speech sentence data as shown in FIG. Note that “*” is a word segment added by the sentence speech recognition unit 117. The search control unit 118 that has input the recognized voice sentence data creates a search index as shown in FIG. 19, for example, as the progress, and in the search index, the word “clock” in which the number of appearances exceeds two, for example, And “Casio” are extracted as search keywords. As a result of an inquiry to a specific database search engine on the Internet 105 using the search keyword (and condition), FIG.
The search result HTML text data as shown in FIG. In this HTML data, for example, “<A HREF
= "http://www.casio.co.jp/"> Casio homepage </ A
“>” Is a hypertext corresponding to the Casio homepage, “http://www.casio.co.jp/” indicates the URL of the above homepage, and “Casio homepage” indicates a text anchor for accessing the URL. The URL includes address information “www.casio.co.jp/” and communication protocol information “http” for accessing a resource of the address, and “<DD>” following the communication information.
The following sentence is an explanatory sentence automatically obtained from the above-described database search engine. The symbols enclosed by “<” and “>” are display control symbols.

【００５２】移動端末通信制御部１１６は、検索結果Ｈ
ＴＭＬ文章データが格納されたＴＣＰ／ＩＰパケットの
移動端末１０１への返信を依頼する。パケット送受信部
１１５は、対応するＴＣＰ／ＩＰパケットを、移動端末
１０１に対応するＩＰアドレスに向けて送信する。The mobile terminal communication control unit 116 searches the search result H
A request is sent to the mobile terminal 101 to return a TCP / IP packet storing the TML text data. Packet transmitting / receiving section 115 transmits a corresponding TCP / IP packet to an IP address corresponding to mobile terminal 101.

【００５３】移動端末１０１内の通信部１１１は、音声
制御ホスト装置１０８から上記検索結果ＨＴＭＬ文章デ
ータが格納されたＴＣＰ／ＩＰパケットを受信すると、
それに格納されている検索結果ＨＴＭＬ文章データを制
御部１１０に引き渡す。When the communication unit 111 in the mobile terminal 101 receives a TCP / IP packet storing the above search result HTML text data from the voice control host device 108,
The search result HTML text data stored therein is delivered to the control unit 110.

【００５４】移動端末１０１内の制御部１１０は、ブラ
ウザアプリケーションを起動して、引き渡された検索結
果ＨＴＭＬ文章データを、ホームページ形式でＬＣＤ表
示部に表示する。The control unit 110 in the mobile terminal 101 activates a browser application and displays the delivered search result HTML text data on the LCD display unit in a homepage format.

【００５５】今、例えば、前述の図２０に示される検索
結果ＨＴＭＬ文章データが受信されると、ＬＣＤ表示部
３１１（図２の２０３）には、例えば図２１のように検
索結果が表示される。ここで、下線が付加されたキーワ
ードが、インターネット１０５上のホームページ等の各
種リソースのＵＲＬと共にハイパーテキストに含まれる
テキストアンカーを示している。Now, for example, when the search result HTML text data shown in FIG. 20 is received, the search result is displayed on the LCD display unit 311 (203 in FIG. 2), for example, as shown in FIG. . Here, the underlined keyword indicates the text anchor included in the hypertext together with the URL of various resources such as a homepage on the Internet 105.

【００５６】移動端末１０１のユーザが、上述のように
表示されたアンカーを電子ペンでタッチ等することによ
り選択すると、移動端末１０１は、ブラウザアプリケー
ションの機能により、移動端末制御ホスト装置１０４を
介し、上記アンカーと共にハイパーテキストに含まれる
ＵＲＬにより示されるインターネット１０５に接続され
るホスト装置上のホームページデータやＪａｖａアプレ
ットやファイルデータやホスト装置のログインアカウン
ト等の各種リソースに対し、そのＵＲＬにより指定され
るＨＴＴＰ（Hyper Text Transfer Protocol）（ホーム
ページデータの取得又はＪａｖａアプレットの取得及び
実行の場合）、ＦＴＰ（File TransferProtocol）（フ
ァイルデータの取得の場合）、又はＴＥＬＮＥＴ（ホス
ト装置へのログインの場合）等の通信プロトコルを用い
て、アクセスする。この場合、移動端末１０１は、上記
ＵＲＬによってホームページデータやＪａｖａアプレッ
トやファイルデータ等の取得又は実行が指定されている
場合には、それらを、上記ＨＴＴＰ又はＦＴＰ等の通信
プロトコルを用いてホスト装置から移動端末１０１に転
送させて、ＬＣＤ表示部３１１（図２の２０３）に表示
し（ホームページデータの取得の場合）、ＣＰＵ３１６
に実行させ（Ｊａｖａアプレットの取得及び実行の場
合）、又はＲＡＭ３１７に蓄積する（ファイルデータの
取得の場合）。或いは、移動端末１０１は、上記ＵＲＬ
によってホスト装置に対するログインが指定されている
場合は、ＴＥＬＮＥＴプロトコルを用いてそのホスト装
置にログインする。When the user of the mobile terminal 101 selects the anchor displayed as described above by touching it with an electronic pen or the like, the mobile terminal 101 uses the function of the browser application to transmit the anchor via the mobile terminal control host device 104, For various resources such as homepage data, Java applets and file data on the host device connected to the Internet 105 indicated by a URL included in the hypertext together with the anchor, and a login account of the host device, HTTP specified by the URL is used. Communication such as (Hyper Text Transfer Protocol) (for acquiring homepage data or acquiring and executing a Java applet), FTP (File Transfer Protocol) (for acquiring file data), or TELNET (for logging in to a host device) Access using protocol. In this case, when acquisition or execution of homepage data, Java applet, file data, or the like is specified by the URL, the mobile terminal 101 transmits the data from the host device using the communication protocol such as HTTP or FTP. The data is transferred to the mobile terminal 101 and displayed on the LCD display unit 311 (203 in FIG. 2) (in the case of acquiring homepage data).
(In the case of acquiring and executing a Java applet) or storing it in the RAM 317 (in the case of acquiring file data). Alternatively, the mobile terminal 101 uses the URL
When the login to the host device is designated by the, the login to the host device is performed using the TELNET protocol.

【００５７】今、図２１に示される検索結果の表示画面
上で、ユーザが、例えば、テキストアンカー“WATCH WA
TCHES!”を選択すると、ＵＲＬ“http://www.casio.co.
jp”を有するインターネット１０５上のホスト装置内の
“ww”ディレクトリから、ＨＴＴＰ通信プロトコルを用
いて、図２２に示されるようなホームページが取得さ
れ、ＬＣＤ表示部３１１（図２の２０３）に表示され
る。同様に、ユーザが、例えば、テキストアンカー“カ
シオホームページ”を選択すると、ＵＲＬ“http://ww
w.casio.co.jp”を有するインターネット１０５上のホ
スト装置内の“/ ”ディレクトリ（Ｗｅｂルートディレ
クトリ）から、図２３に示されるようなホームページが
取得され、ＬＣＤ表示部３１１（図２の２０３）に表示
される。＜移動端末１０１の外観構成＞図２は、図１の移動端末
１０１の外観図である。Now, on the search result display screen shown in FIG. 21, the user can input, for example, a text anchor " WATCH WA ".
TCHES! ”, The URL“ http://www.casio.co.
From the “ww” directory in the host device on the Internet 105 having “jp”, a home page as shown in FIG. 22 is obtained using the HTTP communication protocol, and displayed on the LCD display unit 311 (203 in FIG. 2). Similarly, when the user selects, for example, the text anchor “Casio homepage”, the URL “http: // ww
A home page as shown in FIG. 23 is obtained from a “/” directory (Web root directory) in the host device on the Internet 105 having “w.casio.co.jp”, and is displayed on the LCD display unit 311 (203 in FIG. 2). <External Configuration of Mobile Terminal 101> FIG. 2 is an external view of the mobile terminal 101 shown in FIG.

【００５８】移動端末１０１は、コンパクトな携帯情報
管理装置の外観を有し、音声を入力するための送話器を
兼ねたマイク２０１と、本発明には特には関連しないが
画像を入力するためのカメラ２０２と、各種情報を表示
し、またタッチ入力又はペン入力を受け付けるタッチパ
ネル機能を有するＬＣＤ表示部２０３と、音声を出力す
るための受話器を兼ねたスピーカ２０４を有する、ま
た、図１の無線基地１０２に発信するための無線アンテ
ナ２０５と、無線基地１０２の代わりの有線接続装置に
接続するためのソケット２０６を有する。The mobile terminal 101 has the appearance of a compact portable information management device, and has a microphone 201 also serving as a transmitter for inputting voice, and a microphone 201 for inputting an image which is not particularly related to the present invention. 1. The camera 202, an LCD display unit 203 having a touch panel function of displaying various information and receiving a touch input or a pen input, and a speaker 204 also serving as a receiver for outputting voice. It has a wireless antenna 205 for transmitting to the base 102 and a socket 206 for connecting to a wired connection device instead of the wireless base 102.

【００５９】更に、各種ＩＣカードを挿入するためのＩ
Ｃカードスロット２０７と、他の移動端末１０１又はパ
ーソナルコンピュータ等との間で赤外線光通信を行うた
めの光送受信機２０８を有する。Further, an I for inserting various IC cards is provided.
An optical transceiver 208 for performing infrared optical communication between the C card slot 207 and another mobile terminal 101 or a personal computer or the like is provided.

【００６０】スイッチ２０９は、電源スイッチである。＜移動端末１０１の機能ブロック構成＞図３は、移動端
末１０１の機能ブロック図である。The switch 209 is a power switch. <Functional Block Configuration of Mobile Terminal 101> FIG. 3 is a functional block diagram of the mobile terminal 101.

【００６１】移動端末１０１は、図１にも示したよう
に、入力部１０９、制御部１１０、通信部１１１、及び
出力部１１２から構成され、それぞれバス３２６によっ
て相互に接続されている。As shown in FIG. 1, the mobile terminal 101 comprises an input unit 109, a control unit 110, a communication unit 111, and an output unit 112, and are mutually connected by a bus 326.

【００６２】まず、入力部１０９は、音声を入力する部
分と、本発明には特には関連しないが画像を入力する部
分と、出力部１１２の動作において後述するタッチパネ
ル機構の部分とから構成される。First, the input unit 109 includes a part for inputting voice, a part for inputting an image which is not particularly related to the present invention, and a part of a touch panel mechanism described later in the operation of the output unit 112. .

【００６３】音声を入力する部分は、マイク３０１、Ａ
／Ｄ変換部３０２、及びマイク制御部３０３から構成さ
れる。マイク３０１（図２の２０１に対応）は、ＰＨＳ
電話の送話器を兼ねており、ユーザが発声した音声を入
力する。The part for inputting voice is the microphone 301, A
It comprises a / D conversion unit 302 and a microphone control unit 303. The microphone 301 (corresponding to 201 in FIG. 2) is a PHS
Also serves as a telephone transmitter, and inputs a voice uttered by the user.

【００６４】Ａ／Ｄ変換部３０２は、マイク３０１から
入力されたアナログ音声信号をデジタル音声データに変
換し、更にそのデジタル音声データを、ＰＨＳの標準音
声符号化方式であるＡＤＰＣＭ（Adaptive Differentia
l Pulse Code Modulation:適応差分線形パルス符号化）
方式によって符号化する。なお、この部分は、ＰＨＳ端
末を構成するＬＳＩ集積回路として、既に実用化されて
いる。The A / D converter 302 converts an analog audio signal input from the microphone 301 into digital audio data, and further converts the digital audio data into an ADPCM (Adaptive Differential) which is a PHS standard audio encoding system.
l Pulse Code Modulation: Adaptive differential linear pulse coding
Encode according to the method. This part has already been put to practical use as an LSI integrated circuit constituting a PHS terminal.

【００６５】マイク制御部３０３は、上述の符号化され
た音声データを、通話時には、通信部１１１内の通信制
御部３２１に転送して通話チャネルに載せると共に、文
音声認識／データベース検索処理時には、更に制御部１
１０内のＲＡＭ３１７に転送する。The microphone control unit 303 transfers the encoded voice data to the communication control unit 321 in the communication unit 111 during a call and places it on a call channel. Control unit 1
10 to the RAM 317.

【００６６】一方、画像を入力する部分は、ＣＣＤ（Ch
arge Coupled Device ）カメラ３０４、Ａ／Ｄ変換部３
０５、メモリ３０６、及びカメラ制御部３０７から構成
される。On the other hand, a portion for inputting an image is a CCD (Ch
arge Coupled Device) Camera 304, A / D converter 3
05, a memory 306, and a camera control unit 307.

【００６７】ＣＣＤカメラ３０４は、ユーザの操作に基
づいて任意の画像を撮像する。Ａ／Ｄ変換部３０５は、
ＣＣＤカメラ３０４によって撮像されたアナログ映像信
号を、デジタル画像データに変換する。The CCD camera 304 captures an arbitrary image based on a user operation. The A / D conversion unit 305
An analog video signal captured by the CCD camera 304 is converted into digital image data.

【００６８】メモリ３０６は、デジタル画像データをフ
レーム単位で記憶する。カメラ制御部３０７は、ＣＣＤ
カメラ３０４、Ａ／Ｄ変換部３０５、及びメモリ３０６
の動作を制御する。The memory 306 stores digital image data in frame units. The camera control unit 307 includes a CCD
Camera 304, A / D converter 305, and memory 306
Control the operation of.

【００６９】次に、出力部１１２は、音声を出力する部
分と、画像を出力する部分とから構成される。音声を出
力する部分は、スピーカ３０８、Ｄ／Ａ変換部３０９、
及びスピーカ制御部３１０から構成される。Next, the output section 112 includes a section for outputting sound and a section for outputting an image. The part that outputs audio includes a speaker 308, a D / A converter 309,
And a speaker control unit 310.

【００７０】スピーカ制御部３１０は、通信部１１１内
の通信制御部３２１から受信されたＰＨＳ通話音声デー
タ、又は制御部１１０内のＲＡＭ３１７から受信された
合成音声データを、Ｄ／Ａ変換部３０９に転送する。The speaker control section 310 transmits the PHS telephone call voice data received from the communication control section 321 in the communication section 111 or the synthesized voice data received from the RAM 317 in the control section 110 to the D / A conversion section 309. Forward.

【００７１】Ｄ／Ａ変換部３０９は、受信された音声デ
ータを復号し、アナログ音声信号に変換し、それをスピ
ーカ３０８（図２の２０４に対応）から音声として放音
させる。The D / A converter 309 decodes the received audio data, converts it into an analog audio signal, and emits it as sound from the speaker 308 (corresponding to 204 in FIG. 2).

【００７２】画像を出力する部分は、ＬＣＤ表示部２０
３、ＬＣＤドライバ３１２、メモリ３１３、及びＬＣＤ
制御部３１４から構成される。ＬＣＤ制御部３１４は、
制御部１１０内のＲＡＭ３１７から受信された文字デー
タ、イメージデータ、コマンドボタンデータ等の各種画
像データをメモリ３１３にフレーム単位で保持させ、Ｌ
ＣＤドライバ３１２に起動をかける。The part for outputting the image is the LCD display unit 20
3, LCD driver 312, memory 313, and LCD
It comprises a control unit 314. The LCD control unit 314
Various image data such as character data, image data, and command button data received from the RAM 317 in the control unit 110 are stored in the memory 313 in frame units.
The CD driver 312 is started.

【００７３】ＬＣＤドライバ３１２は、メモリ３１３か
らフレーム単位で読み出される画像データを、ＬＣＤ表
示部３１１（図２の２０３に対応）に表示する。なお、
ＬＣＤ表示部３１１（図２の２０３）の表面には、透明
タッチパネルが配設されており、ユーザは、ＬＣＤ表示
部３１１に表示されるコマンドボタンデータ等に従っ
て、タッチパネルに指タッチ又はペンタッチすることに
より、コマンド入力を行うことができる。この入力信号
は、タッチパネル制御部３１５によって制御部１１０内
のＲＡＭ３１７に転送される。The LCD driver 312 displays the image data read from the memory 313 in frame units on the LCD display unit 311 (corresponding to 203 in FIG. 2). In addition,
A transparent touch panel is provided on the surface of the LCD display unit 311 (203 in FIG. 2). The user touches the touch panel with his / her finger or pen in accordance with command button data displayed on the LCD display unit 311. Command input. This input signal is transferred by the touch panel control unit 315 to the RAM 317 in the control unit 110.

【００７４】続いて、制御部１１０は、ＣＰＵ３１６、
ＲＡＭ３１７、及びＲＯＭ３１８と、ＩＣカードインタ
フェース部３１９、及び必要に応じてＩＣカードスロッ
ト２０７（図２）に挿入されるＩＣカード３２０とから
構成される。Subsequently, the control unit 110 controls the CPU 316,
It comprises a RAM 317 and a ROM 318, an IC card interface unit 319, and an IC card 320 inserted into an IC card slot 207 (FIG. 2) as required.

【００７５】ＣＰＵ３１６は、ＲＯＭ３１８に記憶され
た制御プログラムに従って、ＲＡＭ３１７をワークエリ
アとして使用しながら、移動端末１０１全体の動作を制
御する。The CPU 316 controls the operation of the entire mobile terminal 101 according to the control program stored in the ROM 318 while using the RAM 317 as a work area.

【００７６】ＩＣカードインタフェース部３１９は、Ｉ
Ｃカード３２０に対するデータの入出力を制御する。最
後に、通信部１１１は、通信制御部３２１、無線ドライ
バ３２２、無線アンテナ３２３、有線ドライバ３２４、
及びソケット３２５から構成される。The IC card interface unit 319
The input / output of data to / from the C card 320 is controlled. Lastly, the communication unit 111 includes a communication control unit 321, a wireless driver 322, a wireless antenna 323, a wired driver 324,
And a socket 325.

【００７７】通信制御部３２１は、ＰＨＳ通話処理及び
インターネット１０５との間のＴＣＰ／ＩＰ通信処理
（後述する）を実行し、無線ドライバ３２２又は有線ド
ライバ３２４を制御する。The communication control unit 321 executes a PHS call process and a TCP / IP communication process (to be described later) with the Internet 105, and controls the wireless driver 322 or the wired driver 324.

【００７８】無線ドライバ３２２は、無線通信時に、通
信データを、無線アンテナ３２３（図２の２０５に対
応）を介して送受信されるＰＨＳ無線信号との間で相互
変換する。ＰＨＳ無線信号は、１．９ＧＨｚの無線周波
数と、３００ｋＨｚのキャリア周波数間隔と、４チャネ
ル／キャリアのＴＤＭＡ−ＴＤＤ無線アクセス方式と、
π／４シフトＱＰＳＫ変調方式と、３８４ｋｂｉｔｓ／
ｓｅｃの無線伝送速度に基づく無線信号である。The wireless driver 322 mutually converts communication data with a PHS wireless signal transmitted / received via the wireless antenna 323 (corresponding to 205 in FIG. 2) during wireless communication. The PHS radio signal has a radio frequency of 1.9 GHz, a carrier frequency interval of 300 kHz, a TDMA-TDD radio access scheme of 4 channels / carrier,
π / 4 shift QPSK modulation method and 384 kbits /
This is a wireless signal based on the wireless transmission speed of sec.

【００７９】一方、有線ドライバ３２４は、有線通信時
に、通信データを、ソケット３２５（図２の２０６に対
応）を介して送受信される有線信号との間で相互変換す
る。これは、一般的な電話帯域モデム変調信号である。
以上の構成を有する本発明の実施の形態の動作につい
て、以下に詳細に説明する。＜移動端末１０１の処理＞まず、移動端末１０１の処理
について説明する。On the other hand, at the time of wired communication, the wired driver 324 mutually converts communication data with a wired signal transmitted / received via the socket 325 (corresponding to 206 in FIG. 2). This is a typical telephone band modem modulated signal.
The operation of the embodiment of the present invention having the above configuration will be described in detail below. <Process of Mobile Terminal 101> First, the process of the mobile terminal 101 will be described.

【００８０】図４は、図３の制御部１１０内のＣＰＵ３
１６が、電源投入後に、制御部１１０内のＲＯＭ３１８
に記憶されている制御プログラムを実行する動作として
実現される制御動作を示す全体動作フローチャートであ
る。FIG. 4 shows the CPU 3 in the control unit 110 shown in FIG.
16 stores the ROM 318 in the control unit 110 after the power is turned on.
4 is an overall operation flowchart showing a control operation realized as an operation of executing a control program stored in the control program.

【００８１】なお、図４、図５、及び図８の動作フロー
チャートで示される各機能を実現する制御プログラム及
びそれに必要なデータは、例えば、図２に示されるＩＣ
カードスロット２０７に着脱自在なＩＣカード３２０
に、ＣＰＵ３１６が読み取り可能なプログラムコードの
形態で記憶され、そのプログラムコードがＣＰＵ３１６
によって直接実行され、又は、そのプログラムコードが
必要に応じてＲＡＭ３１７又は書込み可能なＲＯＭ３１
８にロードされてＣＰＵ３１６によって実行されるよう
に構成されてもよい。或いは、上述の制御プログラム及
びそれに必要なデータは、無線又は有線の通信回線又は
光送受信機２０８（図２）から通信部１１１を介して他
の機器から受信されて、ＲＡＭ３１７又は書込み可能な
ＲＯＭ３１８にロードされてＣＰＵ３１６によって実行
されるように構成されてもよい。A control program for realizing each function shown in the operation flowcharts of FIGS. 4, 5 and 8 and data necessary for the control program are, for example, ICs shown in FIG.
IC card 320 detachable from card slot 207
Is stored in the form of a program code readable by the CPU 316, and the program code is stored in the CPU 316.
Or the program code can be directly executed by the RAM 317 or the writable ROM 31 as needed.
8 to be executed by the CPU 316. Alternatively, the above-described control program and data necessary for the control program are received from another device via a communication unit 111 from a wireless or wired communication line or an optical transceiver 208 (FIG. 2), and stored in a RAM 317 or a writable ROM 318. It may be configured to be loaded and executed by the CPU 316.

【００８２】まず、ステップ４０１→４０２→４０３→
４０４→４０１の繰返しループにおいては、図３のタッ
チパネル制御部３１５からタッチパネル入力の検出が通
知されたか否かの判定処理（４０１）、音声制御ホスト
装置１０８（図１）から検索結果ＨＴＭＬ文章データが
受信されたか否かの判定処理（４０２）、その他の受信
／表示処理（４０３）、及び必要なデータの送信処理
（４０４）が実行される。First, steps 401 → 402 → 403 →
In the repetition loop of 404 → 401, it is determined whether touch panel input has been detected from the touch panel control unit 315 in FIG. 3 (401), and the search result HTML text data is sent from the voice control host device 108 (FIG. 1). A determination process (402) of whether or not the data has been received, another reception / display process (403), and a necessary data transmission process (404) are executed.

【００８３】タッチパネル制御部３１５からタッチパネ
ル入力の検出が通知されステップ４０１の判定がＹＥＳ
となると、ステップ４０５又は４０６で、上記タッチパ
ネル入力が図３のＣＣＤカメラ３０４（図２の２０２）
の入力指示又は図３のマイク３０１（図２の２０１）の
入力指示であるか否かが、判定される。Touch panel control section 315 notifies of touch panel input detection, and determination in step 401 is YES.
Then, in step 405 or 406, the touch panel input is performed by the CCD camera 304 in FIG. 3 (202 in FIG. 2).
It is determined whether or not the input instruction is the input instruction of the microphone 301 of FIG. 3 (201 of FIG. 2).

【００８４】タッチパネル入力が図３のＣＣＤカメラ３
０４（図２の２０２）の入力指示であってステップ４０
５の判定がＹＥＳとなると、ステップ４０７で、図３の
入力部１０９内のカメラ制御部３０７に対して、例えば
手書き文字画像等の入力処理の開始が指示される。その
後、ステップ４０４の送信処理に進む。画像入力処理
は、本発明には特には関連しないため、その詳細な説明
は省略する。The touch panel input is the CCD camera 3 shown in FIG.
04 (202 in FIG. 2) and the
If the determination at 5 is YES, at step 407, the camera control unit 307 in the input unit 109 in FIG. 3 is instructed to start input processing of, for example, a handwritten character image. Thereafter, the process proceeds to the transmission process of step 404. Since the image input processing is not particularly related to the present invention, a detailed description thereof will be omitted.

【００８５】タッチパネル入力が図３のマイク３０１
（図２の２０１）の入力指示であってステップ４０６の
判定がＹＥＳとなると、ステップ４０８で、図３の入力
部１０９内のマイク制御部３０３に対し、音声入力処理
の開始が指示される。この音声入力処理の開始指示は、
例えばＰＨＳ通話処理の開始指示、又は文音声認識／デ
ータベース検索処理を実行するためのオフライン状態で
の音声入力処理の開始指示である。The touch panel input is the microphone 301 shown in FIG.
If the input instruction is (201 in FIG. 2) and the determination in step 406 is YES, in step 408, the microphone control unit 303 in the input unit 109 in FIG. 3 is instructed to start the voice input process. This voice input processing start instruction is
For example, it is an instruction to start a PHS call process or an instruction to start a speech input process in an offline state for executing a sentence speech recognition / database search process.

【００８６】マイク制御部３０３は、上述のＣＰＵ３１
６からの指示によって、マイク３０１（図２の２０１）
及びＡ／Ｄ変換部３０２に対して、音声入力の開始を指
示する。この結果、Ａ／Ｄ変換部３０２からは、マイク
３０１（図２の２０１）から入力された音声データが出
力される。The microphone control unit 303 is provided with the CPU 31 described above.
6, the microphone 301 (201 in FIG. 2)
And instruct the A / D converter 302 to start voice input. As a result, the audio data input from the microphone 301 (201 in FIG. 2) is output from the A / D converter 302.

【００８７】その後、上述の音声入力処理の開始指示が
ＰＨＳ通話の開始指示である場合には、上述の音声デー
タは、通信制御部３２１の特には図示しない送信処理に
よって、所定の通話チャネルに載せられて通話相手に送
信される。After that, when the above-mentioned voice input processing start instruction is a PHS telephone call start instruction, the above-mentioned voice data is loaded on a predetermined telephone channel by a transmission processing (not shown) of the communication control unit 321. And sent to the other party.

【００８８】また、上述の音声入力処理の開始指示が文
音声認識／データベース検索処理のための音声入力処理
の開始指示を含む場合には、それ以後マイク３０１（図
２の２０１）から入力されマイク制御部３０３から出力
された音声データは、後述するステップ４０４の送信処
理において、そこで音声制御ホスト装置１０８に向けて
送信される。If the above-described voice input processing start instruction includes a voice input processing start instruction for sentence voice recognition / database search processing, thereafter, the microphone 301 (201 in FIG. 2) receives the input from the microphone 301. The audio data output from the control unit 303 is transmitted to the audio control host device 108 in the transmission processing in step 404 described below.

【００８９】タッチパネル入力が図３のＣＣＤカメラ３
０４（図２の２０２）の入力指示でも図３のマイク３０
１（図２の２０１）の入力指示でもない場合には、ステ
ップ４０５及び４０６の判定がＮＯとなって、ステップ
４０９で、他のキー入力処理が実行される。その後、ス
テップ４０４の送信処理に進む。The touch panel input is the CCD camera 3 shown in FIG.
04 (202 in FIG. 2), the microphone 30 in FIG.
If the input instruction is not the input instruction 1 (201 in FIG. 2), the determinations in steps 405 and 406 are NO, and in step 409, another key input processing is executed. Thereafter, the process proceeds to the transmission process of step 404.

【００９０】一方、音声制御ホスト装置１０８（図１）
から通信部１１１を介して制御部１１０内のＲＡＭ３１
７に検索結果ＨＴＭＬ文章データが受信され、ステップ
４０１→４０２→４０３→４０４→４０１の繰返しルー
プにおけるステップ４０２の判定がＹＥＳとなると、ス
テップ４１０において、上記検索結果ＨＴＭＬ文章デー
タがＲＡＭ３１７から出力部１１２内のメモリ３１３に
転送され、ＬＣＤ制御部３１４に対して上記検索結果Ｈ
ＴＭＬ文章データの表示が指示される。On the other hand, the voice control host device 108 (FIG. 1)
From the RAM 31 in the control unit 110 via the communication unit 111
7, the search result HTML text data is received, and if the determination in step 402 in the iteration loop of steps 401 → 402 → 403 → 404 → 401 becomes YES, in step 410, the search result HTML text data is output from the RAM 317 to the output unit 112. The search result H is transferred to the memory 313 in the
Display of the TML text data is instructed.

【００９１】この結果、ＬＣＤ制御部３１４の制御によ
って、メモリ３１３からＬＣＤドライバ３１２を介して
ＬＣＤ表示部３１１（図２の２０３）に、受信された検
索結果ＨＴＭＬ文章データが表示される。As a result, under the control of the LCD control unit 314, the received search result HTML text data is displayed on the LCD display unit 311 (203 in FIG. 2) from the memory 313 via the LCD driver 312.

【００９２】次に、ステップ４０４の送信処理について
説明する。図５は、上記送信処理の詳細を示す動作フロ
ーチャートである。まず、ステップ５０１では、図４の
ステップ４０９の他キー入力処理によって処理されたタ
ッチパネルからのキー入力が送信指示を伴っているか否
かが判定される。この判定がＮＯの場合には、ステップ
５０５の処理へ進む。Next, the transmission process in step 404 will be described. FIG. 5 is an operation flowchart showing details of the transmission processing. First, in step 501, it is determined whether or not a key input from the touch panel processed by the other key input processing in step 409 in FIG. 4 is accompanied by a transmission instruction. If this determination is NO, the process proceeds to step 505.

【００９３】ステップ５０１の判定がＹＥＳの場合に
は、ステップ５０２で、移動端末１０１が現在図１の移
動端末制御ホスト装置１０４に接続中であるか否かが判
定される。If the determination in step 501 is YES, in step 502, it is determined whether or not the mobile terminal 101 is currently connected to the mobile terminal control host device 104 in FIG.

【００９４】移動端末１０１が現在図１の移動端末制御
ホスト装置１０４に接続中でありステップ５０２の判定
がＹＥＳならば、図３の制御部１１０内のＣＰＵ３１６
は、ステップ５０４で、移動端末１０１の“端末識別コ
ード”とキー入力処理に対応するコマンドの送信指示
を、図３の通信部１１１内の通信制御部３２１に対し依
頼する。この結果、通信制御部３２１は、上記“端末識
別コード”とコマンドが格納されたＴＣＰ／ＩＰパケッ
トを生成し、それをインターネット１０５に接続されて
いる所定のホスト（例えば図１の音声制御ホスト装置１
０８）に向け送信する。If mobile terminal 101 is currently connected to mobile terminal control host device 104 in FIG. 1 and the determination in step 502 is YES, CPU 316 in control unit 110 in FIG.
Requests the communication control unit 321 in the communication unit 111 in FIG. 3 to transmit a “terminal identification code” of the mobile terminal 101 and a command corresponding to the key input process in step 504. As a result, the communication control unit 321 generates a TCP / IP packet storing the “terminal identification code” and the command, and transmits the TCP / IP packet to a predetermined host connected to the Internet 105 (for example, the voice control host device in FIG. 1). 1
08).

【００９５】移動端末１０１が現在図１の移動端末制御
ホスト装置１０４に接続中ではなくステップ５０２の判
定がＮＯならば、図３の制御部１１０内のＣＰＵ３１６
は、ステップ５０３で、図３の通信部１１１内の通信制
御部３２１に対して発信処理を依頼してから、ステップ
５０４を実行する。If the mobile terminal 101 is not currently connected to the mobile terminal control host device 104 in FIG. 1 and the determination in step 502 is NO, the CPU 316 in the control unit 110 in FIG.
Requests the communication control unit 321 in the communication unit 111 of FIG. 3 to perform a transmission process in step 503, and then executes step 504.

【００９６】後に詳述するように、ユーザの指定に基づ
く文音声認識／データベース検索処理の開始要求コマン
ドの送信指示及び文音声認識／データベース検索処理の
終了要求コマンドの送信指示は、上述のステップ５０４
において発行される。As will be described in detail later, the transmission instruction of the start request command of the sentence speech recognition / database search process and the transmission instruction of the end request command of the sentence speech recognition / database search process based on the user's designation are made in step 504 described above.
Issued at

【００９７】前述したようにステップ５０１の判定がＮ
Ｏの場合又はステップ５０４の処理の後、ステップ５０
５では、図４のステップ４０８によって、文音声認識／
データベース検索処理のための音声入力処理の開始指示
が実行されており、音声データの音声制御ホスト装置１
０８（図１）への送信指示がなされているか否かが判定
される。As described above, the determination at step 501 is N
In the case of O or after the processing of step 504, step 50
In step 5, in step 408 of FIG.
An instruction to start voice input processing for database search processing has been executed, and the voice control host device 1 for voice data has been executed.
08 (FIG. 1) is determined.

【００９８】この判定がＮＯの場合には、ステップ５１
０の処理へ進む。ステップ５０５の判定がＹＥＳの場合
には、ステップ５０６で、音声制御ホスト装置１０８か
ら文音声認識／データベース検索処理の開始要求コマン
ドに対する応答である送信許可データが既に返信されて
いるか否かが判定される。If this determination is NO, step 51
Proceed to process 0. If the determination in step 505 is YES, in step 506, it is determined whether or not transmission permission data, which is a response to the command to start the sentence voice recognition / database search processing, has already been returned from the voice control host device. You.

【００９９】この判定がＮＯの場合には、音声制御ホス
ト装置１０８がまだ移動端末１０１からの文音声認識／
データベース検索処理の開始要求コマンドに対する準備
が完了していないため、ステップ５１０の処理へ進む。If the determination is NO, the voice control host device 108 still recognizes the sentence voice recognition /
Since the preparation for the database search process start request command has not been completed, the process proceeds to step 510.

【０１００】音声制御ホスト装置１０８から文音声認識
／データベース検索処理の開始要求コマンドに対する応
答である送信許可データが既に返信されておりステップ
５０６の判定がＹＥＳの場合には、更に、ステップ５０
７で、移動端末１０１が現在図１の移動端末制御ホスト
装置１０４に接続中であるか否かが判定される。If transmission permission data, which is a response to the command for requesting the start of sentence voice recognition / database search processing, has already been returned from the voice control host device 108, and if the determination in step 506 is YES, then step 50 is further executed.
At 7, it is determined whether the mobile terminal 101 is currently connected to the mobile terminal control host device 104 of FIG.

【０１０１】移動端末１０１が現在図１の移動端末制御
ホスト装置１０４に接続中でありステップ５０７の判定
がＹＥＳならば、図３の制御部１１０内のＣＰＵ３１６
は、ステップ５０９で、図３に示される入力部１０９内
のマイク制御部３０３から制御部１１０内のＲＡＭ３１
７に転送されてきている音声データの送信指示を、通信
部１１１内の通信制御部３２１に対し依頼する。この結
果、通信制御部３２１は、上記音声データが格納された
ＴＣＰ／ＩＰパケットを生成し、それをインターネット
１０５に接続されている図１の音声制御ホスト装置１０
８に向けて送信する。If the mobile terminal 101 is currently connected to the mobile terminal control host device 104 in FIG. 1 and the determination in step 507 is YES, the CPU 316 in the control unit 110 in FIG.
In step 509, the microphone control unit 303 in the input unit 109 shown in FIG.
The communication control section 321 in the communication section 111 is requested to transmit the voice data transferred to the communication section 7. As a result, the communication control unit 321 generates a TCP / IP packet in which the above-mentioned voice data is stored, and transmits the TCP / IP packet to the voice control host device 10 of FIG.
Send to 8

【０１０２】移動端末１０１が現在図１の移動端末制御
ホスト装置１０４に接続中ではなくステップ５０７の判
定がＮＯならば、図３の制御部１１０内のＣＰＵ３１６
は、ステップ５０８で、図３の通信部１１１内の通信制
御部３２１に対して発信処理を依頼してから、ステップ
５０９を実行する。If mobile terminal 101 is not currently connected to mobile terminal control host device 104 in FIG. 1 and the determination in step 507 is NO, CPU 316 in control unit 110 in FIG.
Requests the communication control unit 321 in the communication unit 111 of FIG. 3 to perform a transmission process in step 508, and then executes step 509.

【０１０３】後に詳述するように、文音声認識／データ
ベース検索処理のための音声データの送信指示は、上述
のステップ５０９において発行される。前述したように
ステップ５０５又は５０６の判定がＮＯの場合又はステ
ップ５０９の処理の後、ステップ５１０では、図４のス
テップ４０７によって、画像入力処理の開始指示が実行
されており、画像データを図１のインターネット１０５
に接続されている特には図示しない画像制御ホスト装置
への送信指示がなされているか否かが判定される。As will be described in detail later, an instruction to transmit voice data for sentence voice recognition / database search processing is issued in step 509 described above. As described above, when the determination in step 505 or 506 is NO or after the processing in step 509, in step 510, a start instruction of the image input processing is executed by step 407 in FIG. The Internet 105
It is determined whether or not a transmission instruction to an image control host device (not shown) connected to the image control host device is transmitted.

【０１０４】この判定がＮＯの場合には、図４のステッ
プ４０４の送信処理を終了する。ステップ５１０の判定
がＹＥＳの場合には、ステップ５１１で、移動端末１０
１が現在図１の移動端末制御ホスト装置１０４に接続中
であるか否かが判定される。If this determination is NO, the transmission processing of step 404 in FIG. 4 ends. If the determination in step 510 is YES, in step 511, the mobile terminal 10
1 is currently connected to the mobile terminal control host device 104 in FIG.

【０１０５】移動端末１０１が現在図１の移動端末制御
ホスト装置１０４に接続中でありステップ５１１の判定
がＹＥＳならば、図３の制御部１１０内のＣＰＵ３１６
は、ステップ５１３で、図３に示される入力部１０９内
のメモリ３０６に得られている画像データの送信指示
を、通信部１１１内の通信制御部３２１に対して依頼す
る。この結果、通信制御部３２１は、上記画像データが
格納されたＴＣＰ／ＩＰパケットを生成し、それをイン
ターネット１０５に接続されている特には図示しない画
像制御ホスト装置１０８に向けて送信する。If mobile terminal 101 is currently connected to mobile terminal control host device 104 in FIG. 1 and the determination in step 511 is YES, CPU 316 in control unit 110 in FIG.
Requests the communication control unit 321 in the communication unit 111 to transmit the image data obtained in the memory 306 in the input unit 109 shown in FIG. As a result, the communication control unit 321 generates a TCP / IP packet storing the image data, and transmits the TCP / IP packet to the image control host device 108 (not shown) connected to the Internet 105.

【０１０６】移動端末１０１が現在図１の移動端末制御
ホスト装置１０４に接続中ではなくステップ５１１の判
定がＮＯならば、図３の制御部１１０内のＣＰＵ３１６
は、ステップ５１２で、図３の通信部１１１内の通信制
御部３２１に対して発信処理を依頼してから、ステップ
５１３を実行する。If mobile terminal 101 is not currently connected to mobile terminal control host device 104 in FIG. 1 and the determination in step 511 is NO, CPU 316 in control unit 110 in FIG.
Requests the communication control unit 321 in the communication unit 111 of FIG. 3 to perform a transmission process in step 512, and then executes step 513.

【０１０７】なお、ステップ５１３の画像データの送信
指示は、本発明には特には関連しないため、その詳細な
説明は省略する。前述したようにステップ５１０の判定
がＮＯの場合又はステップ５１３の処理の後、図４のス
テップ４０４の送信処理を終了する。＜通信データのフォーマット＞図６は、移動端末１０１
と移動端末制御ホスト装置１０４及びインターネット１
０５（音声制御ホスト装置１０８）との間で通信される
通信データのフォーマット図である。Since the image data transmission instruction in step 513 is not particularly related to the present invention, a detailed description thereof will be omitted. As described above, when the determination in step 510 is NO or after the processing in step 513, the transmission processing in step 404 in FIG. 4 ends. <Format of Communication Data> FIG.
And mobile terminal control host device 104 and Internet 1
FIG. 5 is a format diagram of communication data communicated with the MFP 05 (voice control host device 108).

【０１０８】移動端末１０１と移動端末制御ホスト装置
１０４との間では、通信データは、ＰＰＰ（Point-to-P
oint Protocol ）と呼ばれる通信プロトコルに基づき、
図６(a) に示されるＰＰＰフレーム（図の左から右に向
けて転送される）を用いて、ＰＨＳ規格の３２ｋｂｉｔ
ｓ／ｓｅｃの伝送レートを有するデジタル通信チャネル
上を伝送される。Communication data between the mobile terminal 101 and the mobile terminal control host device 104 is PPP (Point-to-P
oint Protocol).
Using the PPP frame shown in FIG. 6A (transferred from left to right in the figure), 32 kbits of the PHS standard
It is transmitted over a digital communication channel having a transmission rate of s / sec.

【０１０９】ＰＰＰフレームを構成する、“フラグ”、
“アドレス”、“コントロール”の各フィールドは、図
６(a) に示される各固定ビット列が設定される。２オク
テットのデータ長を有するＦＣＳは、フレームチェック
シーケンスと呼ばれ、ＰＰＰフレームデータの誤り検出
／訂正用のデータである。移動端末１０１と移動端末制
御ホスト装置１０４との間でＰＰＰリンクが確立した後
に転送されるＰＰＰフレームの“インフォメーション”
フィールド（可変データ長を有する）には、インターネ
ット１０５（図１）上のデータの基本伝送単位であるＩ
Ｐデータグラムが格納され、その場合に、２オクテット
のデータ長を有する“プロトコル”フィールドには、”
インフォメーション”フィールドにＩＰデータグラムが
格納されていることを示す１６進値“0021”が格納され
る。"Flag", which constitutes a PPP frame,
In each of the "address" and "control" fields, each fixed bit string shown in FIG. 6A is set. The FCS having a data length of 2 octets is called a frame check sequence, and is data for error detection / correction of PPP frame data. “Information” of a PPP frame transferred after a PPP link is established between the mobile terminal 101 and the mobile terminal control host device 104
The field (having a variable data length) includes I, which is a basic transmission unit of data on the Internet 105 (FIG. 1).
P datagram is stored, in which case a "protocol" field having a data length of 2 octets contains:
A hexadecimal value “0021” indicating that the IP datagram is stored is stored in the “information” field.

【０１１０】ＰＰＰフレームの“インフォメーション”
フィールドには、上述のようにＩＰデータグラムが格納
される。このＩＰデータグラムは、上述のようにインタ
ーネット１０５上のデータの基本伝送単位である。ＩＰ
データグラムは、インターネットプロトコル（ＩＰ）に
従って規定され、その“データ”フィールドに格納され
たデータをインターネット１０５上の宛先のホスト装置
まで一意に転送するための機能を提供し、インターネッ
ト１０５上でのアドレスを特定する機能、そのＩＰデー
タグラム自身を“宛先ＩＰアドレス”で指定されたホス
トまでインターネット１０５上の一定の経路で転送する
機能、そのＩＰデータグラム自身のフラグメント化（分
割）と再組立てを行う機能等を備える。"Information" of PPP frame
The IP datagram is stored in the field as described above. This IP datagram is a basic transmission unit of data on the Internet 105 as described above. IP
The datagram is defined in accordance with the Internet Protocol (IP), provides a function for uniquely transferring data stored in its “data” field to a destination host device on the Internet 105, and has an address on the Internet 105. , The function of transferring the IP datagram itself to the host specified by the "destination IP address" through a fixed route on the Internet 105, and the fragmentation (division) and reassembly of the IP datagram itself. It has functions and the like.

【０１１１】ＩＰデータグラムは、図６(b) に示される
ように、ＩＰヘッダフィールドとデータフィールドとか
ら構成される。ＩＰヘッダフィールドには、それが含ま
れるＩＰデータグラム自身を配送するために必要な全て
の情報が含まれる。図７(a)は、ＩＰヘッダのフォーマ
ット図である。An IP datagram is composed of an IP header field and a data field as shown in FIG. The IP header field contains all the information necessary to deliver the IP datagram containing it. FIG. 7A is a format diagram of the IP header.

【０１１２】ＩＰヘッダは、３２ビットを１ワードとし
て、５乃至６ワードのデータ長を有し、このデータ長は
第１ワードの“ヘッダ長”フィールドに格納され、ま
た、ＩＰデータグラム全体のデータ長は、第１ワードの
“ＩＰデータグラムの全長”フィールドに格納される。The IP header has a data length of 5 to 6 words, with 32 bits as one word. This data length is stored in the “header length” field of the first word. The length is stored in the first word of the "total length of IP datagram" field.

【０１１３】第１ワードの“バージョン”フィールドに
は、ＩＰデータグラムの転送方法を規定するインターネ
ットプロトコル（ＩＰ）のバージョンが設定され、現在
のバージョンは４である。In the "version" field of the first word, the version of the Internet Protocol (IP) that defines the method of transmitting the IP datagram is set. The current version is 4.

【０１１４】第１ワードの“サービスの種類”フィール
ドには、配送の優先度を表わす情報等が格納されるが、
ここは本発明には特には関連しない。第２ワードの各フ
ィールドは、ＩＰデータグラムがインターネット１０５
上での転送の制約によりフラグメント化（分割）される
場合における制御情報を規定する。まず、“識別番号”
フィールドには、分割されたフラグメントであるこのＩ
Ｐデータグラムが属する分割前のＩＰデータグラムを識
別するための一意な整数が設定される。次に、”フラグ
メントのオフセット”フィールドには、分割されたフラ
グメントであるこのＩＰデータグラムが分割前のＩＰデ
ータグラムのどの部分に相当するかを示すオフセット情
報が設定される。そして、”フラグ列”フィールドに
は、分割されたフラグメントであるこのＩＰデータグラ
ムに、それが属する分割前のＩＰデータグラムを構成す
る他のフラグメントが後続するか否かが設定される。以
上の情報により、インターネット１０５上の中継ホスト
においてＩＰデータグラムがフラグメント化されても、
受信側で分割前のＩＰデータグラムを正確に復元するこ
とができる。In the “service type” field of the first word, information indicating the priority of delivery is stored.
This is not particularly relevant to the present invention. Each field of the second word indicates that the IP datagram is
The control information in the case of fragmentation (division) due to the above-described transfer restriction is defined. First, the “identification number”
The field contains this fragmented I
A unique integer for identifying the undivided IP datagram to which the P datagram belongs is set. Next, in the “fragment offset” field, offset information indicating which part of the IP datagram before the division corresponds to the IP datagram that is the divided fragment is set. In the "flag string" field, it is set whether or not another fragment constituting the undivided IP datagram to which this divided IP datagram belongs is followed by the divided fragment. With the above information, even if the IP datagram is fragmented at the relay host on the Internet 105,
The IP datagram before division can be accurately restored on the receiving side.

【０１１５】第３ワードの“生存期間”（ＴＴＬ：Time
To Live）フィールドには、そのＩＰデータグラムがイ
ンターネット１０５上にどれだけの時間の間存在するこ
とを許すかを示す秒単位の時間情報が設定される。イン
ターネット１０５上の中継ホストは、ＩＰデータグラム
を処理する毎に上記フィールド値を減算し、値が０以下
になったＩＰデータグラムはインターネット１０５上か
ら廃棄する。これにより、インターネット１０５上での
過度なトラヒックの発生が抑制される。なお、廃棄され
たＩＰデータグラムに関する再送制御は、そのＩＰデー
タグラムに格納されるＴＣＰセグメントに対する制御処
理において実行される。The "lifetime" of the third word (TTL: Time
In the “To Live” field, time information in seconds indicating how long the IP datagram is allowed to exist on the Internet 105 is set. The relay host on the Internet 105 subtracts the above field value each time the IP datagram is processed, and discards the IP datagram whose value becomes 0 or less from the Internet 105. This suppresses the occurrence of excessive traffic on the Internet 105. The retransmission control for the discarded IP datagram is executed in a control process for a TCP segment stored in the IP datagram.

【０１１６】第３ワードの“プロトコル”フィールドに
は、そのＩＰデータグラムの“データ”フィールドに格
納されるデータのフォーマットを規定するための整数値
が設定される。本実施の形態の場合には、図６(c) に示
されるように、ＩＰデータグラムの“データ”フィール
ドにはＴＣＰセグメントデータが格納されるため、その
フォーマットを規定する整数値６が設定される。In the "protocol" field of the third word, an integer value for defining the format of data stored in the "data" field of the IP datagram is set. In the case of the present embodiment, as shown in FIG. 6C, since the TCP segment data is stored in the "data" field of the IP datagram, an integer value 6 defining the format is set. You.

【０１１７】第３ワードの“ヘッダのチェックサム”フ
ィールドには、ＩＰヘッダのデータの誤りを検出するた
めのチェックサムデータが設定される。第４ワードに
は、３２ビットの“送信元ＩＰアドレス”が設定され
る。例えばＩＰデータグラムが移動端末１０１から音声
制御ホスト装置１０８へ転送される場合には、“送信元
ＩＰアドレス”としては、後述する発信処理により移動
端末制御ホスト装置１０４から移動端末１０１に対して
付与されたＩＰアドレスが設定される。図１の音声制御
ホスト装置１０８は、この“送信元ＩＰアドレス”を記
憶することにより、インターネット１０５を介して移動
端末１０１に対して、フォーマット文章データ等を返信
することができる。In the “checksum of header” field of the third word, checksum data for detecting an error in the data of the IP header is set. A 32-bit “source IP address” is set in the fourth word. For example, when an IP datagram is transferred from the mobile terminal 101 to the voice control host device 108, the “source IP address” is assigned from the mobile terminal control host device 104 to the mobile terminal 101 by a transmission process described later. The set IP address is set. By storing this “source IP address”, the voice control host device 108 in FIG. 1 can return format text data and the like to the mobile terminal 101 via the Internet 105.

【０１１８】第５ワードには、３２ビットの“宛先ＩＰ
アドレス”が設定される。例えばＩＰデータグラムが移
動端末１０１から音声制御ホスト装置１０８へ転送され
る場合には、“宛先ＩＰアドレス”としては、音声制御
ホスト装置１０８に固定的に割当てられているＩＰアド
レスが設定される。移動端末制御ホスト装置１０４内の
ルーティング部１１４、インターネット１０５上の各中
継ホスト装置、及び音声サービスプロバイダ内のルータ
装置１０６は、受信したＩＰデータグラムに格納されて
いる上記“宛先ＩＰアドレス”を識別することによっ
て、予め各装置が有する経路制御テーブル情報に従っ
て、そのＩＰデータグラムの配送経路を決定し、最終的
にそのＩＰデータグラムを音声サービスプロバイダ内の
音声制御ホスト装置１０８まで転送することができる。The fifth word contains a 32-bit “destination IP”.
For example, when an IP datagram is transferred from the mobile terminal 101 to the voice control host device 108, the “destination IP address” is fixedly assigned to the voice control host device 108. The IP address is set.The routing unit 114 in the mobile terminal control host device 104, each relay host device on the Internet 105, and the router device 106 in the voice service provider are stored in the received IP datagram. By identifying the "destination IP address", the delivery route of the IP datagram is determined in advance according to the routing control table information of each device, and finally the IP datagram is transferred to the voice control host device in the voice service provider. 108.

【０１１９】第６ワードの“ＩＰオプション”フィール
ドは、オプションであり、インターネット１０５を構成
する各ネットワークのテスト又はデバッグのための情報
や、インターネット１０５上での配送経路を制御又は監
視するための制御情報等が設定されるが、ここは本発明
には特には関連しない。The “IP option” field of the sixth word is optional, and is used for information for testing or debugging each network constituting the Internet 105 and control for controlling or monitoring a delivery route on the Internet 105. Information and the like are set, but this is not particularly relevant to the present invention.

【０１２０】第６ワードの“パディング”フィールドに
は、データ長を合わせるためのパディングデータが設定
される。次に、ＩＰデータグラムの“データ”フィール
ドには、ＴＣＰセグメントデータが格納される。このＴ
ＣＰセグメントは、トランスミッションコントロールプ
ロトコル（ＴＣＰ）に従って規定され、その“データ”
フィールドに格納されたデータをインターネット１０５
上の宛先のホスト装置まで正確に適切な順序で配送する
ための機能を備える。ＩＰデータグラムがインターネッ
ト１０５上でのデータの一意な転送の機能のみを提供
し、データの信頼性を確保する機能（再送制御機能等）
を提供しないのに対して、ＴＣＰセグメントは、データ
の信頼性を確保する機能を提供するものである。In the "padding" field of the sixth word, padding data for adjusting the data length is set. Next, TCP segment data is stored in the "data" field of the IP datagram. This T
The CP segment is defined according to the Transmission Control Protocol (TCP), and its "data"
The data stored in the field is transferred to the Internet 105
A function is provided for accurately delivering the packet to the above destination host device in an appropriate order. IP datagrams provide only the unique transfer function of data on the Internet 105, and the function of ensuring data reliability (retransmission control function, etc.)
Is provided, whereas the TCP segment provides a function for ensuring data reliability.

【０１２１】このように、通信データが、（ＰＰＰフレ
ームと）ＩＰデータグラムとＴＣＰセグメントという階
層構造を有するのは、インターネット１０５上ではなる
べく小さい処理負荷のもとで効率良くデータを配送する
必要があり、エンド対エンド間ではできるかぎり信頼性
の高いデータ配送を実現する必要があるという異なる要
請に効率的に対処するためである。これにより、インタ
ーネット１０５上の中継ホスト装置は、ＩＰデータグラ
ムのＩＰヘッダのみを参照することにより、そのＩＰデ
ータグラムの“データ”フィールドに格納された情報
（ＴＣＰセグメント）をできる限り高速かつ効率的に宛
先ホスト装置まで配送することができ、エンド対エンド
（送信元ホスト装置と宛先ホスト装置）間では、ＴＣＰ
セグメントのＴＣＰヘッダを参照することにより、再送
制御等の信頼性の高いデータ通信を実現することができ
るのである。As described above, the communication data has the hierarchical structure of the IP datagram (with the PPP frame) and the TCP segment. Therefore, it is necessary to efficiently deliver the data on the Internet 105 under a processing load as small as possible. Yes, in order to efficiently address the different demands of achieving the most reliable data delivery end-to-end. As a result, the relay host device on the Internet 105 refers to only the IP header of the IP datagram, so that the information (TCP segment) stored in the “data” field of the IP datagram is as fast and efficiently as possible. To the destination host device, and between end-to-end (source host device and destination host device), TCP
By referring to the TCP header of the segment, highly reliable data communication such as retransmission control can be realized.

【０１２２】ＴＣＰセグメントは、図６(b) に示される
ように、ＴＣＰヘッダフィールドとデータフィールドと
から構成される。図７(b) は、ＴＣＰヘッダのフォーマ
ット図である。The TCP segment is composed of a TCP header field and a data field as shown in FIG. FIG. 7B is a format diagram of the TCP header.

【０１２３】ＴＣＰヘッダは、ＩＰヘッダの場合と同様
に、３２ビットを１ワードとして、５乃至６ワードのデ
ータ長を有し、このデータ長は第４ワードの“ヘッダ
長”フィールドに格納され、また、ＩＰデータグラム全
体のデータ長は、第１ワードの“ＩＰデータグラムの全
長”フィールドに格納される。As in the case of the IP header, the TCP header has a data length of 5 to 6 words with 32 bits as one word, and this data length is stored in the “header length” field of the fourth word. Also, the data length of the entire IP datagram is stored in the “total length of IP datagram” field of the first word.

【０１２４】第１ワードの“送信元ポート番号”フィー
ルド及び“宛先ポート番号”フィールドには、文音声認
識／データベース検索処理のための通信プロトコルを特
定する１６ビットの整数値が設定される。In the "source port number" field and the "destination port number" field of the first word, a 16-bit integer value specifying a communication protocol for sentence speech recognition / database search processing is set.

【０１２５】音声制御ホスト装置１０８内のパケット送
受信部１１５（図１）は、文音声認識／データベース検
索処理のための音声データが格納されたＴＣＰセグメン
トのほかにも、電子メールデータを始めとする様々なデ
ータが格納された様々なＴＣＰセグメントを送受信する
ため、受信したＴＣＰセグメントのＴＣＰヘッダに設定
されている“宛先ポート番号”フィールドの値を認識す
ることによって、そのＴＣＰセグメントの“データ”フ
ィールドに格納されているデータを音声制御ホスト装置
１０８で実行されるどのアプリケーションに引き渡すか
を決定することができる。The packet transmission / reception unit 115 (FIG. 1) in the voice control host device 108 includes electronic mail data in addition to TCP segments in which voice data for sentence voice recognition / database search processing is stored. In order to transmit and receive various TCP segments storing various data, by recognizing the value of the “destination port number” field set in the TCP header of the received TCP segment, the “data” field of the TCP segment is recognized. Can be determined to which application executed on the voice control host device 108 the data stored in the voice control host device 108 is to be delivered.

【０１２６】そして、パケット送受信部１１５は、受信
したＴＣＰセグメントのＴＣＰヘッダに設定されている
“宛先ポート番号”フィールドの値が文音声認識／デー
タベース検索処理のための通信プロトコルに対応する値
を示している場合には、そのＴＣＰセグメントの“デー
タ”フィールドに格納されている音声データを移動端末
通信制御部１１６に引き渡すことができる。The packet transmitting / receiving unit 115 sets the value of the “destination port number” field set in the TCP header of the received TCP segment to a value corresponding to the communication protocol for sentence speech recognition / database search processing. In this case, the voice data stored in the "data" field of the TCP segment can be delivered to the mobile terminal communication control unit 116.

【０１２７】同様に、移動端末１０１の通信部１１１内
の通信制御部３２１（図３）も、検索結果ＨＴＭＬ文章
データが格納されたＴＣＰセグメントの他にも、ホーム
ページデータや電子メールデータを始めとする様々なデ
ータが格納された様々なＴＣＰセグメントを送受信する
ため、受信したＴＣＰセグメントのＴＣＰヘッダに設定
されている“宛先ポート番号”フィールドの値を認識す
ることにより、そのＴＣＰセグメントの“データ”フィ
ールドに格納されているデータを移動端末１０１で実行
されるどのアプリケーションに引き渡すかを決定するこ
とができる。Similarly, the communication control unit 321 (FIG. 3) in the communication unit 111 of the mobile terminal 101 also stores homepage data and e-mail data in addition to the TCP segment in which the search result HTML text data is stored. In order to transmit and receive various TCP segments storing various data to be transmitted and received, the value of the “destination port number” field set in the TCP header of the received TCP segment is recognized, so that the “data” of the TCP segment is recognized. It is possible to determine to which application executed on the mobile terminal 101 the data stored in the field is to be delivered.

【０１２８】そして、通信制御部３２１は、受信したＴ
ＣＰセグメントのＴＣＰヘッダに設定されている“宛先
ポート番号”フィールドの値が文音声認識／データベー
ス検索処理のための通信プロトコルに対応する値を示し
ている場合には、制御部１１０（図１、図３）に、文音
声認識／データベース検索処理のためのデータの受信を
通知し、そのＴＣＰセグメントの“データ”フィールド
に格納されている検索結果ＨＴＭＬ文章データを引き渡
すことができる。The communication control unit 321 transmits the received T
If the value of the “destination port number” field set in the TCP header of the CP segment indicates a value corresponding to a communication protocol for sentence speech recognition / database search processing, the control unit 110 (FIG. 1, FIG. 3), the reception of data for sentence speech recognition / database search processing is notified, and the search result HTML text data stored in the “data” field of the TCP segment can be delivered.

【０１２９】更に、音声制御ホスト装置１０８内のパケ
ット送受信部１１５及び移動端末１０１の通信部１１１
内の通信制御部３２１は、受信したＴＣＰセグメントの
ＴＣＰヘッダに設定されている“送信元ポート番号”を
確認することにより、送信元のアプリケーションを確認
することができる。Furthermore, the packet transmission / reception unit 115 in the voice control host device 108 and the communication unit 111 of the mobile terminal 101
The communication control unit 321 in the above can confirm the source application by confirming the “source port number” set in the TCP header of the received TCP segment.

【０１３０】次に、図７に示されるＴＣＰヘッダの第２
ワードの“シーケンス番号”フィールドは、現在のＴＣ
Ｐコネクションにおいて送信側から受信側に送信される
全バイトストリームのうち、このＴＣＰセグメントの
“データ”フィールドに格納されているデータの先頭が
上記全バイトストリームの何バイト目にあたるかを、送
信側から受信側に通知するためのフィールドである。逆
に、第３ワードの“確認応答番号”フィールドは、現在
のＴＣＰコネクションにおいて送信側から受信側に送信
される全バイトストリームのうち、受信側が現在何バイ
ト目までを誤り無く受信したかを、受信側から送信側に
通知するためのフィールドである。これにより、例えば
移動端末１０１から音声制御ホスト装置１０８に対し
て、音声データを正しい順序でかつ高い信頼性のもとで
転送することが可能となる。Next, the second TCP header shown in FIG.
The word "sequence number" field contains the current TC
From the entire byte stream transmitted from the transmission side to the reception side in the P connection, the transmission side determines from which byte of the byte stream the data stored in the "data" field of this TCP segment corresponds. This is a field for notifying the receiving side. Conversely, the “acknowledgement number” field of the third word indicates the number of bytes that the receiving side has received without error in the entire byte stream transmitted from the transmitting side to the receiving side in the current TCP connection. This is a field for notification from the receiving side to the transmitting side. As a result, for example, the voice data can be transferred from the mobile terminal 101 to the voice control host device 108 in the correct order and with high reliability.

【０１３１】第４ワードの“フラグ列”フィールドに
は、ＴＣＰセグメントの種類を示す値が設定される。Ｔ
ＣＰ通信においては、例えばコネクションの開始時又は
終了時等において確認応答のための様々な制御データが
通信されるが、それらの制御データの種類が、“フラグ
列”フィールドに設定される。In the "flag string" field of the fourth word, a value indicating the type of the TCP segment is set. T
In the CP communication, for example, various control data for acknowledgment is transmitted at the start or end of the connection, for example, and the type of the control data is set in the “flag string” field.

【０１３２】第４ワードの“ウインドウ”フィールド
は、受信側が現在何バイトのデータを連続して受信する
ことが可能であるかを示すウインドウデータを、受信側
から送信側に通知するためのフィールドである。これに
より、受信側から送信側に対するデータのフロー制御が
可能となり、例えば音声制御ホスト装置１０８の負荷が
高いような場合には移動端末１０１に対して音声データ
の送信を抑制させる、といようなきめの細かい制御が可
能となる。The "window" field of the fourth word is a field for notifying the receiving side to the transmitting side window data indicating how many bytes of data the receiving side can currently receive continuously. is there. This enables data flow control from the receiving side to the transmitting side. For example, when the load on the voice control host device 108 is high, the transmission of voice data to the mobile terminal 101 is suppressed. Fine control is possible.

【０１３３】第４ワードの“予約済”フィールドは、予
約用のフィールドである。第５ワードの“チェックサ
ム”フィールドには、ＴＣＰヘッダ及び“データ”フィ
ールドに格納されているデータの誤りを検出するための
チェックサムデータが格納される。これにより、例えば
音声制御ホスト装置１０８は、移動端末１０１から音声
データを正確に受信することができる。The “reserved” field of the fourth word is a field for reservation. The “checksum” field of the fifth word stores checksum data for detecting an error in the data stored in the TCP header and the “data” field. Thus, for example, the voice control host device 108 can correctly receive voice data from the mobile terminal 101.

【０１３４】第５ワードの“緊急ポインタ”は、緊急デ
ータ（インタラプトデータやアボートデータ等）を通信
するための制御データであるが、これは本発明には特に
は関連しない。The "urgent pointer" of the fifth word is control data for communicating urgent data (interrupt data, abort data, etc.), but this is not particularly relevant to the present invention.

【０１３５】第６ワードの“オプション”フィールド
は、例えば送受信装置間で通信可能な最大セグメント長
を指定するため等に使用されるが、これは本発明には特
には関連しない。The "option" field of the sixth word is used, for example, to specify the maximum segment length that can be communicated between the transmitting and receiving apparatuses, but this is not particularly relevant to the present invention.

【０１３６】第６ワードの“パディング”フィールドに
は、データ長を合わせるためのパディングデータが設定
される。上述の構成を有するＴＣＰセグメントの通信
（終端）処理機能は、移動端末１０１においては通信部
１１１内の通信制御部３２１（図３）において実現さ
れ、音声制御ホスト装置１０８においてはパケット送受
信部１１５（図１）において実現される。なお、移動端
末１０１においてＣＰＵ３１６が実行する制御プログラ
ムが上記処理機能を実現するように構成されてもよい。＜発信処理＞前述のように、移動端末１０１の制御部１
１０内のＣＰＵ３１６（図３）は、図４のステップ４０
４に対応する図５に示される送信処理のうち、移動端末
１０１が現在図１の移動端末制御ホスト装置１０４に接
続中でなくステップ５０２、５０７、又は５１１の判定
がＮＯである場合には、ステップ５０３、５０８、又は
５１２において、図３の通信部１１１内の通信制御部３
２１に対して発信処理を依頼する。この依頼によって、
通信制御部３２１が実行する発信処理は、図８の動作フ
ローチャートによって示される。In the “padding” field of the sixth word, padding data for adjusting the data length is set. The communication (termination) processing function of the TCP segment having the above-described configuration is realized by the communication control unit 321 (FIG. 3) in the communication unit 111 in the mobile terminal 101, and the packet transmission / reception unit 115 ( 1). Note that the control program executed by the CPU 316 in the mobile terminal 101 may be configured to realize the above processing functions. <Outgoing Call Processing> As described above, the control unit 1 of the mobile terminal 101
The CPU 316 in FIG. 10 (FIG. 3)
5, when the mobile terminal 101 is not currently connected to the mobile terminal control host device 104 in FIG. 1 and the determination in step 502, 507, or 511 is NO, In step 503, 508, or 512, the communication control unit 3 in the communication unit 111 in FIG.
Request the transmission processing to 21. By this request,
The transmission process executed by the communication control unit 321 is shown by the operation flowchart in FIG.

【０１３７】まず、ステップ８０１では、リンク確立フ
ェーズが実行される。このフェーズでは、移動端末制御
ホスト装置１０４のアクセス電話番号に対して自動的に
ダイヤルアップが行われ移動端末制御ホスト装置１０４
が着信した後、リンクコントロールプロトコル（ＬＣ
Ｐ）と呼ばれるプロトコルを使用し、通信に使用される
ＰＰＰフレーム（図６(a) ）の最大データ長の決定、エ
スケープされるべき非透過文字の決定、ＰＰＰフレーム
の“プロトコル”フィールド（図６(a) ）のデータ長を
２オクテットから１オクテットに圧縮することの有無の
決定、ＰＰＰフレームの固定値“11111111”を有する
“アドレス”フィールド（図６(a) ）を省略（圧縮）す
ることの有無の決定等に関するネゴシエーションが、移
動端末制御ホスト装置１０４内の接続確立部１１３（図
１）との間で実行される。この場合、移動端末１０１の
通信部１１１内の通信制御部３２１と移動端末制御ホス
ト装置１０４内の接続確立部１１３との間の通信は、図
６(a) に示されるフォーマットを有するＰＰＰフレーム
を用いて、その“プロトコル”フィールドにＬＣＰを特
定する１６進値“c021”を設定し、その“インフォメー
ションフィールド”に、必要な制御データを設定して、
実行される。First, in step 801, a link establishment phase is executed. In this phase, the access telephone number of the mobile terminal control host device 104 is automatically dialed up and the mobile terminal control host device 104
Is received, the link control protocol (LC
P), a maximum data length of a PPP frame (FIG. 6A) used for communication, a non-transparent character to be escaped, a "protocol" field of the PPP frame (FIG. 6). (a)) Determine whether to compress the data length from 2 octets to 1 octet, and omit (compress) the "address" field (FIG. 6 (a)) having the fixed value "11111111" of the PPP frame. A negotiation regarding the determination of the presence / absence is performed with the connection establishment unit 113 (FIG. 1) in the mobile terminal control host device 104. In this case, communication between the communication control unit 321 in the communication unit 111 of the mobile terminal 101 and the connection establishment unit 113 in the mobile terminal control host device 104 is performed by using a PPP frame having the format shown in FIG. By setting a hexadecimal value “c021” specifying the LCP in the “protocol” field and setting necessary control data in the “information field”,
Be executed.

【０１３８】次に、ステップ８０２においては、認証フ
ェーズが実行される。このフェーズでは、ＰＡＰ（Pass
word Authentication Protocol）又はＣＨＡＰ（Challe
ngeHandshake Authentication Protocol ）と呼ばれる
認証プロトコルを使用し、移動端末１０１を使用するユ
ーザの認証が、移動端末制御ホスト装置１０４内の接続
確立部１１３（図１）から移動端末１０１に対して実行
される。これにより、移動端末制御ホスト装置１０４を
運営するインターネットプロバイダは、移動端末１０１
を使用するユーザが契約されたユーザであるか否かを決
定できる。この場合、移動端末１０１の通信部１１１内
の通信制御部３２１と移動端末制御ホスト装置１０４内
の接続確立部１１３との間の通信は、図６(a) に示され
るフォーマットを有するＰＰＰフレームを用いて、その
“プロトコル”フィールドにＰＡＰを特定する１６進値
“c023”又はＣＨＡＰを特定する１６進値“c223”を設
定し、その“インフォメーションフィールド”に、必要
な認証用データを設定して、実行される。Next, in step 802, an authentication phase is executed. In this phase, PAP (Pass
word Authentication Protocol) or CHAP (Challe
The authentication of the user using the mobile terminal 101 is performed from the connection establishment unit 113 (FIG. 1) in the mobile terminal control host device 104 to the mobile terminal 101 using an authentication protocol called ngeHandshake Authentication Protocol). As a result, the Internet provider operating the mobile terminal control host device 104 becomes the mobile terminal 101
Can be determined whether or not the user who uses is a contracted user. In this case, communication between the communication control unit 321 in the communication unit 111 of the mobile terminal 101 and the connection establishment unit 113 in the mobile terminal control host device 104 is performed by using a PPP frame having the format shown in FIG. A hexadecimal value “c023” specifying PAP or a hexadecimal value “c223” specifying CHAP is set in the “protocol” field, and necessary authentication data is set in the “information field”. Will be executed.

【０１３９】最後に、ステップ８０３では、ネットワー
クレイヤプロトコルフェーズが実行される。本実施の形
態の場合、このフェーズでは、ＩＰコントロールプロト
コル（ＩＰＣＰ）と呼ばれるプロトコルを使用して、Ｔ
ＣＰヘッダ（図７(b) 参照）の圧縮の有無が決定される
と共に、移動端末制御ホスト装置１０４が割当てること
のできる空き（未使用）ＩＰアドレスのうちの１つが移
動端末１０１に対して割り当てられ、加えて、必要な経
路情報が移動端末１０１の通信部１１１内の通信制御部
３２１（図３）と移動端末制御ホスト装置１０４内のル
ーティング部１１４（図１）に設定される。これ以後、
移動端末１０１は、そのＩＰアドレスを使用することに
よって、インターネット１０５に接続される音声制御ホ
スト装置１０８、及びインターネット１０５上のユーザ
が希望する任意のリソースにアクセスすることが可能と
なる。この場合、移動端末１０１の通信部１１１内の通
信制御部３２１と移動端末制御ホスト装置１０４内の接
続確立部１１３との間の通信は、図６(a) に示されるフ
ォーマットを有するＰＰＰフレームを用いて、その“プ
ロトコル”フィールドにＩＰＣＰを特定する１６進値
“8021”を設定し、その“インフォメーションフィール
ド”に、必要なＩＰアドレスのネゴシエーションのため
のデータ等を設定して、実行される。Finally, in step 803, a network layer protocol phase is executed. In the case of the present embodiment, in this phase, a protocol called an IP control protocol (IPCP) is used, and T
Whether to compress the CP header (see FIG. 7B) is determined, and one of the free (unused) IP addresses that can be allocated by the mobile terminal control host device 104 is allocated to the mobile terminal 101. In addition, necessary route information is set in the communication control unit 321 (FIG. 3) in the communication unit 111 of the mobile terminal 101 and the routing unit 114 (FIG. 1) in the mobile terminal control host device 104. After this,
By using the IP address, the mobile terminal 101 can access the voice control host device 108 connected to the Internet 105 and any resource desired by the user on the Internet 105. In this case, communication between the communication control unit 321 in the communication unit 111 of the mobile terminal 101 and the connection establishment unit 113 in the mobile terminal control host device 104 is performed by using a PPP frame having the format shown in FIG. The "protocol" field is used to set a hexadecimal value "8021" for specifying the IPCP, and the "information field" is set with data for negotiating a necessary IP address, and executed.

【０１４０】以上の一連の動作により、移動端末１０１
は、移動端末制御ホスト装置１０４内のルーティング部
１１４との間で通信用のＴＣＰ／ＩＰパケットが格納さ
れたＰＰＰフレームを授受することが可能となり、移動
端末１０１は、インターネット１０５上のリソースに自
由にアクセスすることが可能になる。By the above series of operations, the mobile terminal 101
Can exchange a PPP frame storing a TCP / IP packet for communication with the routing unit 114 in the mobile terminal control host device 104, and the mobile terminal 101 can freely use resources on the Internet 105. Can be accessed.

【０１４１】なお、ＰＨＳ通話時にも音声制御ホスト装
置１０８等へのアクセスを可能とするために、移動端末
１０１は、例えば２チャネル同時通信機能を有するよう
に構成することができる。Note that the mobile terminal 101 can be configured to have, for example, a two-channel simultaneous communication function in order to allow access to the voice control host device 108 and the like even during a PHS call.

【０１４２】また、移動端末１０１の通信部１１１内の
通信制御部３２１（図３）は、一定時間（例えば１０分
間）送受信データを検出しなかった場合に、移動端末制
御ホスト装置１０４との間のＰＰＰリンクを自動的に切
断するように構成することができる。＜文音声認識／データベース検索処理に関する移動端末
１０１の送受信処理の詳細動作＞ユーザが移動端末１０
１のタッチパネルを操作して文音声認識／データベース
検索処理の開始を指示した場合及びそれ以後に移動端末
１０１が実行する送受信処理の詳細な動作について、説
明する。When the communication control unit 321 (FIG. 3) in the communication unit 111 of the mobile terminal 101 does not detect transmission / reception data for a predetermined time (for example, 10 minutes), the communication control unit 321 communicates with the mobile terminal control host device 104. May be configured to automatically disconnect the PPP link. <Detailed operation of transmission / reception processing of mobile terminal 101 related to sentence speech recognition / database search processing>
The detailed operation of the transmission / reception processing executed by the mobile terminal 101 when the start of sentence speech recognition / database search processing is instructed by operating the touch panel 1 and thereafter will be described.

【０１４３】上述のタッチパネルの操作は、図３のタッ
チパネル制御部３１５において検出された後、制御部１
１０内のＣＰＵ３１６（図３）によって、それが実行さ
れる前述した図４の動作フローチャートに対応する制御
動作において、ステップ４０１の判定がＹＥＳ、ステッ
プ４０５及び４０６の判定がＮＯとなって、ステップ４
０９の他キー入力処理が実行されることにより、検出さ
れる。更に、ステップ４０４の送信処理において、前述
した図５のステップ５０１の判定がＹＥＳとなり、必要
に応じてステップ５０３で発信処理が実行された後、ス
テップ５０４において、移動端末１０１の“端末識別コ
ード”と上述の文音声認識／データベース検索処理の開
始指示を示すキー入力処理に対応するコマンドの送信指
示が、図３の通信部１１１内の通信制御部３２１に対し
て依頼される。The operation of the touch panel described above is detected by the touch panel control unit 315 of FIG.
In the control operation corresponding to the above-described operation flowchart of FIG. 4 executed by the CPU 316 (FIG. 3) in the CPU 10, the determination in step 401 is YES, and the determinations in steps 405 and 406 are NO.
09 is detected by executing another key input process. Further, in the transmission processing in step 404, the determination in step 501 in FIG. 5 described above is YES, and the transmission processing is executed in step 503 as necessary. Then, in step 504, the “terminal identification code” of the mobile terminal 101 is transmitted. Then, the communication control unit 321 in the communication unit 111 of FIG. 3 is requested to send a command corresponding to the key input process indicating the start instruction of the sentence speech recognition / database search process.

【０１４４】この結果、通信制御部３２１は、まず、図
６(c) に示されるフォーマットを有するＴＣＰセグメン
トを生成する。この場合、図６(c) 及び図７(b) に示さ
れるフォーマットを有するＴＣＰヘッダにおいて、“送
信元ポート番号”フィールド及び“宛先ポート番号”フ
ィールドには、文音声認識／データベース検索処理のた
めの通信プロトコルを特定する１６ビットの整数値が設
定される。そして、ＴＣＰセグメントの“データ”フィ
ールドには、移動端末１０１を特定する“端末識別コー
ド”（例えばそのＰＨＳ電話番号）と、ユーザの指定に
基づく文音声認識／データベース検索処理の開始要求コ
マンドとが格納される。As a result, the communication control section 321 first generates a TCP segment having the format shown in FIG. In this case, in the TCP header having the format shown in FIGS. 6 (c) and 7 (b), the "source port number" field and the "destination port number" field contain text / speech recognition / database search processing. A 16-bit integer value specifying the communication protocol is set. In the “data” field of the TCP segment, a “terminal identification code” (for example, the PHS telephone number) for identifying the mobile terminal 101 and a command for starting a sentence voice recognition / database search process based on a user's designation are included. Is stored.

【０１４５】次に、通信制御部３２１は、上述のＴＣＰ
セグメントが“データ”フィールドに格納された図６
(b) に示されるフォーマットを有するＩＰデータグラム
を生成する。この場合に、図６(b) 及び図７(a) に示さ
れるフォーマットを有するＩＰヘッダにおいて、“プロ
トコル”フォーマットには、その“データ”フィールド
に格納されるＴＣＰセグメントデータのフォーマットを
規定する整数値６が設定される。また、“送信元ＩＰア
ドレス”フィールドには、既に実行されている発信処理
（図８のステップ８０３の説明を参照）によって移動端
末制御ホスト装置１０４内の接続確立部１１３から移動
端末１０１の通信部１１１内の通信制御部３２１に対し
て付与されたＩＰアドレスが設定される。更に、“宛先
ＩＰアドレス”フィールドには、音声制御ホスト装置１
０８に割り当てられているＩＰアドレスが設定される。Next, the communication control unit 321 executes the above-described TCP
Figure 6 with segments stored in the "data" field
An IP datagram having the format shown in (b) is generated. In this case, in the IP header having the format shown in FIGS. 6 (b) and 7 (a), the "protocol" format includes a format that defines the format of the TCP segment data stored in the "data" field. Numerical value 6 is set. In the “source IP address” field, the connection establishment unit 113 in the mobile terminal control host device 104 transmits the communication unit of the mobile terminal 101 by the transmission processing already executed (see the description of step 803 in FIG. 8). The assigned IP address is set for the communication control unit 321 in 111. Further, the "destination IP address" field contains the voice control host device 1
08 is set.

【０１４６】そして、通信制御部３２１は、上述のＩＰ
データグラムが“インフォメーション”フィールドに格
納され、その”インフォメーション”フィールドにＩＰ
データグラムが格納されていることを示す１６進値“00
21”が“プロトコル”フィールドに格納された図６(a)
に示されるフォーマットを有するＰＰＰフレームを生成
し、通信制御部３２１内に設定されている経路情報（図
８のステップ８０３の説明を参照）に従って、上記ＰＰ
Ｐフレームを移動端末制御ホスト装置１０４に送信す
る。以降、上述のＴＣＰセグメント、ＩＰデータグラ
ム、及びＰＰＰフレームとからなるデータ単位がインタ
ーネット１０５内を転送される場合に、そのデータ単位
を単にＴＣＰ／ＩＰパケットと呼ぶ。Then, the communication control unit 321 transmits the IP
The datagram is stored in the "Information" field, and the IP
Hexadecimal value "00" indicating that the datagram is stored
FIG. 6A in which “21” is stored in the “protocol” field
A PPP frame having the format shown in FIG. 8 is generated, and according to the path information (see the description of step 803 in FIG. 8) set in the communication control unit 321, the PPP frame is generated.
The P frame is transmitted to the mobile terminal control host device 104. Hereinafter, when a data unit including the above-described TCP segment, IP datagram, and PPP frame is transferred in the Internet 105, the data unit is simply referred to as a TCP / IP packet.

【０１４７】このＴＣＰ／ＩＰパケットは、それを構成
するＩＰデータグラムのＩＰヘッダに格納されている
“宛先ＩＰアドレス”に基づいて、移動端末制御ホスト
装置１０４内のルーティング部１１４とインターネット
１０５内の特には図示しない中継ホスト装置によって、
音声サービスプロバイダ内のルータ装置１０６まで転送
された後、更に、ＬＡＮ１０７を介して音声制御ホスト
装置１０８内のパケット送受信部１１５まで転送され
る。This TCP / IP packet is routed to the routing unit 114 in the mobile terminal control host device 104 and the Internet 105 in the Internet 105 based on the “destination IP address” stored in the IP header of the IP datagram constituting the TCP / IP packet. In particular, by a relay host device (not shown),
After being transferred to the router device 106 in the voice service provider, the data is further transferred to the packet transmitting / receiving unit 115 in the voice control host device 108 via the LAN 107.

【０１４８】パケット送受信部１１５は、転送されてき
たＴＣＰ／ＩＰパケットを構成するＩＰデータグラムの
ＩＰヘッダの“宛先ＩＰアドレス”フィールドに自分で
ある音声制御ホスト装置１０８のＩＰアドレスが設定さ
れていることを識別することによって、そのＴＣＰ／Ｉ
Ｐパケットを受信する。The packet transmitting / receiving section 115 has its own IP address set in the “destination IP address” field of the IP header of the IP datagram constituting the transferred TCP / IP packet. That the TCP / I
Receive a P packet.

【０１４９】そして、パケット送受信部１１５は、受信
したＴＣＰ／ＩＰパケットを構成するＴＣＰセグメント
の“宛先ポート番号”フィールド及び“送信元ポート番
号”フィールドに文音声認識／データベース検索処理の
ための通信プロトコルを特定する１６ビットの整数値が
設定されていることを確認することによって、移動端末
通信制御部１１６（図１）に対して受信通知を通知す
る。The packet transmitting / receiving unit 115 stores a communication protocol for text / speech recognition / database search processing in the “destination port number” field and the “source port number” field of the TCP segment constituting the received TCP / IP packet. The mobile terminal communication control unit 116 (FIG. 1) is notified of the reception by confirming that a 16-bit integer value for specifying is set.

【０１５０】この通知と共に、パケット送受信部１１５
は、受信したＴＣＰ／ＩＰパケットを構成するＩＰデー
タグラムのＩＰヘッダから“送信元ＩＰアドレス”を取
り出し、上記ＴＣＰ／ＩＰパケットを構成するＴＣＰセ
グメントの“データ”フィールドから“端末識別コー
ド”と文音声認識／データベース検索処理の開始要求コ
マンドとを取り出して、それらのデータを移動端末通信
制御部１１６に引き渡す。Along with this notification, the packet transmitting / receiving unit 115
Extracts the “source IP address” from the IP header of the IP datagram constituting the received TCP / IP packet, and reads “terminal identification code” from the “data” field of the TCP segment constituting the TCP / IP packet. The voice recognition / database search process start request command is extracted, and the data is transferred to the mobile terminal communication control unit 116.

【０１５１】この結果、後述するようにして音声制御ホ
スト装置１０８から移動端末１０１に対して、送信許可
データが格納されたＴＣＰ／ＩＰパケットが返信され
る。このＴＣＰ／ＩＰパケットは、それを構成するＩＰ
データグラムのＩＰヘッダに格納されている“宛先ＩＰ
アドレス”に基づいて、音声サービスプロバイダ内のル
ータ装置１０６と、インターネット１０５内の特には図
示しない中継ホスト装置によって、移動端末制御ホスト
装置１０４内のルーティング部１１４まで転送された
後、更に、ＰＨＳ網１０３（図１）を介して移動端末１
０１の通信部１１１内の通信制御部３２１（図３）まで
転送される。As a result, a TCP / IP packet storing transmission permission data is returned from the voice control host device 108 to the mobile terminal 101 as described later. This TCP / IP packet is composed of the IP
“Destination IP” stored in the IP header of the datagram
After being transferred to the routing unit 114 in the mobile terminal control host device 104 by the router device 106 in the voice service provider and the relay host device (not shown) in the Internet 105 based on the "address", the PHS network Mobile terminal 1 via 103 (FIG. 1)
01 to the communication control unit 321 (FIG. 3) in the communication unit 111.

【０１５２】移動端末１０１の通信部１１１内の通信制
御部３２１は、転送されてきたＴＣＰ／ＩＰパケットを
構成するＩＰデータグラムのＩＰヘッダの“宛先ＩＰア
ドレス”フィールドに自分である移動端末１０１（に一
時的又は動的）に割当てられているのＩＰアドレスが設
定されていることを識別することによって、そのＴＣＰ
／ＩＰパケットを受信する。The communication control section 321 in the communication section 111 of the mobile terminal 101 stores its own mobile terminal 101 (in the “destination IP address” field of the IP header of the IP datagram constituting the transferred TCP / IP packet. By identifying that the IP address assigned to it (temporarily or dynamically) is set.
/ IP packet is received.

【０１５３】そして、通信制御部３２１は、受信したＴ
ＣＰ／ＩＰパケットを構成するＴＣＰセグメントの“宛
先ポート番号”フィールド及び“送信元ポート番号”フ
ィールドに文音声認識／データベース検索処理のための
通信プロトコルを特定する１６ビットの整数値が設定さ
れていることを確認することにより、移動端末１０１の
制御部１１０内のＣＰＵ３１６に対して受信通知を通知
する。Then, the communication control unit 321 transmits the received T
In the “destination port number” field and the “source port number” field of the TCP segment constituting the CP / IP packet, a 16-bit integer value for specifying a communication protocol for sentence speech recognition / database search processing is set. By confirming this, the CPU 316 in the control unit 110 of the mobile terminal 101 is notified of the reception notification.

【０１５４】この通知と共に、通信制御部３２１は、受
信したＴＣＰ／ＩＰパケットを構成するＴＣＰセグメン
トの“データ”フィールドから送信許可データを取り出
し、それをＣＰＵ３１６に引き渡す。At the same time as this notification, the communication control unit 321 extracts the transmission permission data from the “data” field of the TCP segment constituting the received TCP / IP packet, and delivers it to the CPU 316.

【０１５５】ＣＰＵ３１６は、上述の受信通知と送信許
可データを、前述した図４のステップ４０３で処理し、
その送信許可データをＲＡＭ３１７に記憶する。移動端
末１０１では、ユーザがタッチパネルを操作して文音声
認識／データベース検索処理の開始を指示することによ
って、ＣＰＵ３１６が、前述した図４のステップ４０８
で、図３の入力部１０９内のマイク制御部３０３に対し
て、ＰＨＳ通話処理の開始指示、又は文音声認識／デー
タベース検索処理を実行するためのオフライン状態での
音声入力処理の開始を指示する。これにより、ユーザ
は、通話動作又はオフライン状態での音声入力動作によ
ってマイク３０１（図２の２０１）からの音声の入力を
開始している。The CPU 316 processes the above-described reception notification and transmission permission data in step 403 of FIG.
The transmission permission data is stored in the RAM 317. In the mobile terminal 101, when the user operates the touch panel to instruct the start of the sentence speech recognition / database search processing, the CPU 316 causes the above-described step 408 in FIG.
Then, the microphone control unit 303 in the input unit 109 in FIG. 3 is instructed to start a PHS call process or to start a speech input process in an off-line state for executing a sentence speech recognition / database search process. . As a result, the user has started inputting sound from the microphone 301 (201 in FIG. 2) through a call operation or a sound input operation in an offline state.

【０１５６】これ以後、ＣＰＵ３１６により前述した図
４のステップ４０１→４０２→４０３→４０４→４０１
の繰返しループの１処理として実行されるステップ４０
４の送信処理において、図５のステップ５０５、５０６
の判定がＹＥＳとなり、必要に応じてステップ５０８で
再度の発信処理が実行された後、ステップ５０９で、図
３に示される入力部１０９内のマイク制御部３０３から
制御部１１０内のＲＡＭ３１７に転送されてきている音
声データの送信指示が、通信部１１１内の通信制御部３
２１に対して依頼される。Thereafter, the CPU 316 executes steps 401 → 402 → 403 → 404 → 401 in FIG.
Step 40 executed as one processing of a repetition loop of
4 in steps 505 and 506 in FIG.
Is determined to be YES, and if necessary, the transmission processing is executed again in step 508, and then, in step 509, the data is transferred from the microphone control unit 303 in the input unit 109 to the RAM 317 in the control unit 110 shown in FIG. The transmitted voice data transmission instruction is transmitted to the communication control unit 3 in the communication unit 111.
21 is requested.

【０１５７】この結果、通信制御部３２１は、まず、図
６(c) に示されるフォーマットを有するＴＣＰセグメン
トを生成する。この場合に、図６(c) 及び図７(b) に示
されるフォーマットを有するＴＣＰヘッダにおいて、
“送信元ポート番号”フィールド及び“宛先ポート番
号”フィールドには、文音声認識／データベース検索処
理のための通信プロトコルを特定する１６ビットの整数
値が設定される。そして、ＴＣＰセグメントの“デー
タ”フィールドには、図３に示される入力部１０９内の
マイク制御部３０３から制御部１１０内のＲＡＭ３１７
に転送されてきている音声データが格納される。As a result, the communication control unit 321 first generates a TCP segment having the format shown in FIG. In this case, in the TCP header having the format shown in FIGS. 6 (c) and 7 (b),
In the “source port number” field and the “destination port number” field, a 16-bit integer value specifying a communication protocol for sentence speech recognition / database search processing is set. Then, the “data” field of the TCP segment includes the microphone control unit 303 in the input unit 109 and the RAM 317 in the control unit 110 shown in FIG.
Is stored.

【０１５８】次に、通信制御部３２１は、上述のＴＣＰ
セグメントが“データ”フィールドに格納された図６
(b) に示されるフォーマットを有するＩＰデータグラム
を生成する。この場合に、図６(b) 及び図７(a) に示さ
れるフォーマットを有するＩＰヘッダにおいて、“プロ
トコル”フォーマットには、その“データ”フィールド
に格納されるＴＣＰセグメントデータのフォーマットを
規定する整数値６が設定される。また、“送信元ＩＰア
ドレス”フィールドには、既に実行されている発信処理
（図８のステップ８０３の説明を参照）によって移動端
末制御ホスト装置１０４内の接続確立部１１３から移動
端末１０１の通信部１１１内の通信制御部３２１に対し
て付与されたＩＰアドレスが設定される。更に、“宛先
ＩＰアドレス”フィールドには、音声制御ホスト装置１
０８に割り当てられているＩＰアドレスが設定される。Next, the communication control unit 321 executes the above-described TCP
Figure 6 with segments stored in the "data" field
An IP datagram having the format shown in (b) is generated. In this case, in the IP header having the format shown in FIGS. 6 (b) and 7 (a), the "protocol" format includes a format that defines the format of the TCP segment data stored in the "data" field. Numerical value 6 is set. In the “source IP address” field, the connection establishment unit 113 in the mobile terminal control host device 104 transmits the communication unit of the mobile terminal 101 by the transmission processing already executed (see the description of step 803 in FIG. 8). The assigned IP address is set for the communication control unit 321 in 111. Further, the "destination IP address" field contains the voice control host device 1
08 is set.

【０１５９】そして、通信制御部３２１は、上述のＩＰ
データグラムが“インフォメーション”フィールドに格
納され、その”インフォメーション”フィールドにＩＰ
データグラムが格納されていることを示す１６進値“00
21”が“プロトコル”フィールドに格納された図６(a)
に示されるフォーマットを有するＰＰＰフレームを生成
し、通信制御部３２１内に設定されている経路情報（図
８のステップ８０３の説明を参照）に従って、上記ＰＰ
Ｐフレームを移動端末制御ホスト装置１０４に送信す
る。Then, the communication control unit 321 transmits the IP
The datagram is stored in the "Information" field, and the IP
Hexadecimal value "00" indicating that the datagram is stored
FIG. 6A in which “21” is stored in the “protocol” field
A PPP frame having the format shown in FIG. 8 is generated, and according to the path information (see the description of step 803 in FIG. 8) set in the communication control unit 321, the PPP frame is generated.
The P frame is transmitted to the mobile terminal control host device 104.

【０１６０】このＴＣＰ／ＩＰパケットは、それを構成
するＩＰデータグラムのＩＰヘッダに格納されている
“宛先ＩＰアドレス”に基づいて、移動端末制御ホスト
装置１０４内のルーティング部１１４とインターネット
１０５内の特には図示しない中継ホスト装置によって、
音声サービスプロバイダ内のルータ装置１０６まで転送
された後、更に、ＬＡＮ１０７を介して音声制御ホスト
装置１０８内のパケット送受信部１１５まで転送され
る。The TCP / IP packet is routed to the routing section 114 in the mobile terminal control host device 104 and to the Internet 105 in the Internet 105 based on the “destination IP address” stored in the IP header of the IP datagram constituting the TCP / IP packet. In particular, by a relay host device (not shown),
After being transferred to the router device 106 in the voice service provider, the data is further transferred to the packet transmitting / receiving unit 115 in the voice control host device 108 via the LAN 107.

【０１６１】パケット送受信部１１５は、転送されてき
たＴＣＰ／ＩＰパケットを構成するＩＰデータグラムの
ＩＰヘッダの“宛先ＩＰアドレス”フィールドに自分で
ある音声制御ホスト装置１０８のＩＰアドレスが設定さ
れていることを識別することによって、そのＴＣＰ／Ｉ
Ｐパケットを受信する。そして、パケット送受信部１１
５は、受信したＴＣＰ／ＩＰパケットを構成するＴＣＰ
セグメントの“宛先ポート番号”フィールド及び“送信
元ポート番号”フィールドに文音声認識／データベース
検索処理のための通信プロトコルを特定する１６ビット
の整数値が設定されていることを確認することにより、
移動端末通信制御部１１６（図１）に対して受信通知を
通知する。The packet transmitting / receiving section 115 has its own IP address set in the “destination IP address” field of the IP header of the IP datagram constituting the transferred TCP / IP packet. That the TCP / I
Receive a P packet. Then, the packet transmitting / receiving unit 11
5 is the TCP constituting the received TCP / IP packet
By confirming that a 16-bit integer value specifying a communication protocol for sentence speech recognition / database search processing is set in the “destination port number” field and the “source port number” field of the segment,
The reception notification is notified to the mobile terminal communication control unit 116 (FIG. 1).

【０１６２】この通知と共に、パケット送受信部１１５
は、受信したＴＣＰ／ＩＰパケットを構成するＩＰデー
タグラムのＩＰヘッダから“送信元ＩＰアドレス”を取
り出し、上記ＴＣＰ／ＩＰパケットを構成するＴＣＰセ
グメントの“データ”フィールドから音声データを取り
出して、それらのデータを移動端末通信制御部１１６に
引き渡す。Along with this notification, the packet transmitting / receiving unit 115
Extracts the “source IP address” from the IP header of the IP datagram constituting the received TCP / IP packet, extracts the audio data from the “data” field of the TCP segment constituting the TCP / IP packet, and Is transferred to the mobile terminal communication control unit 116.

【０１６３】この結果、移動端末通信制御部１１６は、
後述するようにして文音声認識／データベース検索処理
の制御を実行し、文音声認識部１１７に対して受信した
音声データの認識処理を実行させ、それによって得られ
る認識音声文章データについて検索制御部１１８に対し
てデータベース検索処理を実行させる。そして、移動端
末通信制御部１１６は、後述するようにして、検索制御
部１１８から得た検索結果ＨＴＭＬ文章データが格納さ
れたＴＣＰ／ＩＰパケットを、移動端末１０１に対して
返信する。As a result, the mobile terminal communication control unit 116
As will be described later, the control of the sentence speech recognition / database search process is executed, the sentence speech recognition unit 117 executes the recognition process of the received speech data, and the search control unit 118 performs the recognition speech sentence data obtained thereby. To execute a database search process. Then, the mobile terminal communication control unit 116 returns a TCP / IP packet storing the search result HTML text data obtained from the search control unit 118 to the mobile terminal 101 as described later.

【０１６４】このＴＣＰ／ＩＰパケットは、それを構成
するＩＰデータグラムのＩＰヘッダに格納されている
“宛先ＩＰアドレス”に基づいて、音声サービスプロバ
イダ内のルータ装置１０６と、インターネット１０５内
の特には図示しない中継ホスト装置によって、移動端末
制御ホスト装置１０４内のルーティング部１１４まで転
送された後、更に、ＰＨＳ網１０３（図１）を介して移
動端末１０１の通信部１１１内の通信制御部３２１（図
３）まで転送される。This TCP / IP packet is based on the “destination IP address” stored in the IP header of the IP datagram that composes the TCP / IP packet. After being transferred by the relay host device (not shown) to the routing unit 114 in the mobile terminal control host device 104, the communication control unit 321 (in the communication unit 111 of the mobile terminal 101) via the PHS network 103 (FIG. 1). It is transferred to Fig. 3).

【０１６５】移動端末１０１の通信部１１１内の通信制
御部３２１は、転送されてきたＴＣＰ／ＩＰパケットを
構成するＩＰデータグラムのＩＰヘッダの“宛先ＩＰア
ドレス”フィールドに自分である移動端末１０１（に一
時的又は動的）に割当てられているのＩＰアドレスが設
定されていることを識別することによって、そのＴＣＰ
／ＩＰパケットを受信する。The communication control unit 321 in the communication unit 111 of the mobile terminal 101 stores its own mobile terminal 101 (in the “destination IP address” field of the IP header of the IP datagram constituting the transferred TCP / IP packet). By identifying that the IP address assigned to it (temporarily or dynamically) is set.
/ IP packet is received.

【０１６６】そして、通信制御部３２１は、受信したＴ
ＣＰ／ＩＰパケットを構成するＴＣＰセグメントの“宛
先ポート番号”フィールド及び“送信元ポート番号”フ
ィールドに文音声認識／データベース検索処理のための
通信プロトコルを特定する１６ビットの整数値が設定さ
れていることを確認することにより、移動端末１０１の
制御部１１０内のＣＰＵ３１６に対して受信通知を通知
する。The communication control unit 321 transmits the received T
In the “destination port number” field and the “source port number” field of the TCP segment constituting the CP / IP packet, a 16-bit integer value for specifying a communication protocol for sentence speech recognition / database search processing is set. By confirming this, the CPU 316 in the control unit 110 of the mobile terminal 101 is notified of the reception notification.

【０１６７】この通知と共に、通信制御部３２１は、受
信したＴＣＰ／ＩＰパケットを構成するＴＣＰセグメン
トの“データ”フィールドから検索結果ＨＴＭＬ文章デ
ータを取り出し、それをＣＰＵ３１６に引き渡す。Along with this notification, the communication control unit 321 extracts the search result HTML text data from the “data” field of the TCP segment constituting the received TCP / IP packet, and transfers it to the CPU 316.

【０１６８】ＣＰＵ３１６は、上述の受信通知と検索結
果ＨＴＭＬ文章データを、前述した図４のステップ４０
２で処理し、ブラウザアプリケーションを起動して、引
き渡された検索結果ＨＴＭＬ文章データを、ハイパーテ
キストの一部であるアンカーを含むホームページ形式で
ＬＣＤ表示部３１１（図２の２０３）に表示する。The CPU 316 converts the above-described reception notification and the search result HTML text data into the above-described step 40 in FIG.
2, the browser application is started, and the delivered search result HTML text data is displayed on the LCD display unit 311 (203 in FIG. 2) in a homepage format including an anchor which is a part of the hypertext.

【０１６９】移動端末１０１のユーザが、上述のように
表示されたホームページ上のアンカーを電子ペンでタッ
チ等することにより選択すると、移動端末１０１は、ブ
ラウザアプリケーションの機能によって、移動端末制御
ホスト装置１０４を介して、上記アンカーと共にハイパ
ーテキストに含まれるＵＲＬにより示されるインターネ
ット１０５に接続されるホスト装置上のホームページデ
ータやＪａｖａアプレットやファイルデータやホスト装
置のログインアカウント等の各種リソースに対して、そ
のＵＲＬによって示されるＨＴＴＰやＦＴＰ等の通信プ
ロトコルを用いて、アクセスする。When the user of the mobile terminal 101 selects an anchor on the home page displayed as described above by touching it with an electronic pen or the like, the mobile terminal 101 uses the function of the browser application to control the mobile terminal control host device 104. The URL of various resources such as homepage data, Java applets and file data on the host device connected to the Internet 105 indicated by the URL included in the hypertext together with the anchor, together with the anchor, and the login account of the host device. Is accessed using a communication protocol such as HTTP or FTP indicated by.

【０１７０】ユーザは、移動端末１０１のタッチパネル
を操作することによって、音声制御ホスト装置１０８に
対して文音声認識／データベース検索処理の終了を示す
ための、文音声認識／データベース検索処理の終了要求
コマンドを指示することができる。The user operates the touch panel of the mobile terminal 101 to request the voice control host device 108 to end the sentence speech recognition / database search processing, thereby requesting the end of the sentence speech recognition / database search processing. Can be indicated.

【０１７１】この場合に、上述のタッチパネルの操作
は、図３のタッチパネル制御部３１５において検出され
た後、制御部１１０内のＣＰＵ３１６（図３）によっ
て、それが実行される前述した図４の動作フローチャー
トに対応する制御動作において、ステップ４０１の判定
がＹＥＳ、ステップ４０５及び４０６の判定がＮＯとな
って、ステップ４０９の他キー入力処理が実行されるこ
とにより、検出される。更に、ステップ４０４の送信処
理において、前述した図５のステップ５０１の判定がＹ
ＥＳとなり、必要に応じてステップ５０３で発信処理が
実行された後、ステップ５０４において、移動端末１０
１の“端末識別コード”と上述の文音声認識／データベ
ース検索処理の終了要求コマンドの送信指示が、図３の
通信部１１１内の通信制御部３２１に対して依頼され
る。In this case, the operation of the touch panel described above is detected by the touch panel control section 315 of FIG. 3 and then executed by the CPU 316 (FIG. 3) of the control section 110. In the control operation corresponding to the flowchart, the determination is made by YES in Step 401 and NO in Steps 405 and 406, and the other key input processing of Step 409 is executed. Further, in the transmission processing of step 404, the determination of step 501 in FIG.
In step 503, the mobile terminal 10 becomes an ES, and if necessary, a calling process is executed in step 503.
The transmission control unit 321 in the communication unit 111 of FIG. 3 is requested to transmit the “terminal identification code” of No. 1 and the above-described sentence speech recognition / database search processing end request command.

【０１７２】この結果、通信制御部３２１は、まず、
“データ”フィールドに移動端末１０１を特定する“端
末識別コード”と文音声認識／データベース検索処理の
終了要求コマンドとが格納された図６(c) に示されるフ
ォーマットを有するＴＣＰセグメントを生成し、次に、
そのＴＣＰセグメントが“データ”フィールドに格納さ
れた図６(b) に示されるフォーマットを有するＩＰデー
タグラムを生成し、更に、そのＩＰデータグラムが“イ
ンフォメーション”フィールドに格納された図６(a) に
示されるフォーマットを有するＰＰＰフレームを生成
し、それらからなるＴＣＰ／ＩＰパケットを送信する。
この場合に、ＴＣＰヘッダ（図６(c) 、図７(b) ）、Ｉ
Ｐヘッダ（図６(b) 、図７(a) ）、及び“プロトコル”
フィールド（図６(a) ）に設定される各情報は、前述の
文音声認識／データベース検索処理の開始要求コマンド
が送信される場合に設定される各情報と同一である。As a result, the communication control unit 321 first
A TCP segment having a format shown in FIG. 6C in which a "terminal identification code" for specifying the mobile terminal 101 and a command for requesting termination of sentence speech recognition / database search processing are stored in a "data" field, next,
An IP datagram having the format shown in FIG. 6B in which the TCP segment is stored in the "data" field is generated, and the IP datagram is stored in the "information" field in FIG. 6A. A PPP frame having the format shown in (1) is generated, and a TCP / IP packet including the PPP frame is transmitted.
In this case, the TCP header (FIG. 6 (c), FIG. 7 (b)), I
P header (Fig. 6 (b), Fig. 7 (a)) and "protocol"
Each piece of information set in the field (FIG. 6 (a)) is the same as each piece of information set when the above-described sentence speech recognition / database search processing start request command is transmitted.

【０１７３】この結果、上述のＴＣＰ／ＩＰパケット
は、前述の文音声認識／データベース検索処理の開始要
求コマンド等が格納されたＴＣＰ／ＩＰパケットの場合
と全く同様にして、インターネット１０５を介して音声
制御ホスト装置１０８内のパケット送受信部１１５まで
転送される。As a result, the above-mentioned TCP / IP packet is transmitted via the Internet 105 in exactly the same manner as the TCP / IP packet storing the above-mentioned sentence speech recognition / database search processing start request command and the like. The packet is transferred to the packet transmission / reception unit 115 in the control host device 108.

【０１７４】パケット送受信部１１５は、前述の文音声
認識／データベース検索処理の開始要求コマンド等が格
納されたＴＣＰ／ＩＰパケットが転送されてきた場合と
全く同様にして、転送されてきたＴＣＰ／ＩＰパケット
を受信し、移動端末通信制御部１１６（図１）に対して
受信通知を通知する。The packet transmitting / receiving unit 115 transmits the transferred TCP / IP in exactly the same manner as when the TCP / IP packet storing the above-described sentence speech recognition / database search processing start request command and the like is transferred. It receives the packet and notifies the mobile terminal communication control unit 116 (FIG. 1) of a reception notification.

【０１７５】この通知と共に、パケット送受信部１１５
は、受信したＴＣＰ／ＩＰパケットを構成するＴＣＰセ
グメントの“データ”フィールドから“端末識別コー
ド”と文音声認識／データベース検索処理の終了要求コ
マンドとを取り出して、それらのデータを移動端末通信
制御部１１６に引き渡す。Along with this notification, the packet transmitting / receiving unit 115
Extracts the “terminal identification code” and the command to end the sentence / speech recognition / database search process from the “data” field of the TCP segment constituting the received TCP / IP packet, and extracts the data from the mobile terminal communication control unit. Hand over to 116.

【０１７６】この結果、移動端末通信制御部１１６は、
後述するようにしてその移動端末１０１に対する文音声
認識／データベース検索処理を終了する。＜移動端末通信制御部１１６、文音声認識部１１７、及
び検索制御部１１８の概略動作＞次に、音声制御ホスト
装置１０８内の移動端末通信制御部１１６、文音声認識
部１１７、及び検索制御部１１８の概略動作について説
明する。As a result, the mobile terminal communication control unit 116
As described later, the sentence speech recognition / database search processing for the mobile terminal 101 ends. <Schematic Operation of Mobile Terminal Communication Control Unit 116, Sentence / Speech Recognition Unit 117, and Search Control Unit 118> Next, the mobile terminal communication control unit 116, sentence / speech recognition unit 117, and search control unit in the voice control host device 108 The schematic operation of 118 will be described.

【０１７７】移動端末通信制御部１１６は、文音声認識
／データベース検索処理の開始要求コマンドを送信した
移動端末１０１に割当てられている“端末識別コード”
（上記コマンドを転送してきたＴＣＰセグメントに格納
されている）毎に、図１２に示されるデータ構造を有す
る処理端末登録テーブルにエントリを登録すると共に、
音声データの受信用のバッファファイル（音声バッファ
ファイル）と、認識音声文章データの一時保存用のバッ
ファファイル（文章バッファファイル）と、検索結果Ｈ
ＴＭＬ文章データの送信用のバッファファイル（検索結
果バッファファイル）、及びその他の必要なバッファフ
ァイルを音声制御ホスト装置１０８が管理するファイル
システム上に作成する。また、移動端末通信制御部１１
６は、上記エントリとファイルの登録に成功すると、上
記コマンドを転送してきたＩＰデータグラムに格納され
ていた“送信元ＩＰアドレス”の移動端末１０１に向け
て、送信許可データを返信する。The mobile terminal communication control unit 116 transmits the “terminal identification code” assigned to the mobile terminal 101 that has transmitted the command to start the sentence speech recognition / database search process.
An entry is registered in the processing terminal registration table having the data structure shown in FIG.
A buffer file for receiving voice data (voice buffer file), a buffer file for temporarily storing recognized voice text data (text buffer file), and a search result H
A buffer file for transmitting TML text data (search result buffer file) and other necessary buffer files are created on a file system managed by the voice control host device 108. Also, the mobile terminal communication control unit 11
6 successfully registers the entry and the file, and returns the transmission permission data to the mobile terminal 101 of the “source IP address” stored in the IP datagram that has transmitted the command.

【０１７８】移動端末通信制御部１１６は、それ以後移
動端末１０１から受信した音声データを、その“送信元
ＩＰアドレス”（それを転送してきたＩＰデータグラム
に格納されている）に対応する処理端末登録テーブルの
エントリから特定される音声バッファファイルに追加書
き込みする。The mobile terminal communication control unit 116 converts the voice data received from the mobile terminal 101 thereafter into the processing terminal corresponding to the “source IP address” (stored in the IP datagram that transferred the data). Write additionally to the audio buffer file specified from the entry in the registration table.

【０１７９】文音声認識部１１７は、図１２に示される
処理端末登録テーブルのエントリ毎に、各エントリから
特定される音声バッファファイルに音声データが受信さ
れていればそれに対して文音声認識処理を実行し、その
結果得られる認識音声文章データを上記各エントリに対
応する文章バッファファイルに追加書き込みする。The sentence / speech recognition unit 117 performs a sentence / speech recognition process on each entry of the processing terminal registration table shown in FIG. 12 if the speech data is received in the speech buffer file specified from each entry. Then, the recognition voice text data obtained as a result is additionally written to the text buffer file corresponding to each of the above entries.

【０１８０】検索制御部１１８（図１）は、図１２に示
される処理端末登録テーブルのエントリ毎に、各エント
リから特定される文章バッファファイルに認識音声文章
データが得られていればそれに対してデータベース検索
処理を実行し、その結果得られる検索結果ＨＴＭＬ文章
データを上記各エントリに対応する検索結果バッファフ
ァイルに追加書き込みする。For each entry in the processing terminal registration table shown in FIG. 12, the search control unit 118 (FIG. 1) responds to the sentence buffer file specified by each entry if the recognized speech sentence data is obtained. A database search process is executed, and the search result HTML text data obtained as a result is additionally written into a search result buffer file corresponding to each entry.

【０１８１】移動端末通信制御部１１６は、処理端末登
録テーブルのエントリ毎に、各エントリから特定される
検索結果バッファファイルに検索結果ＨＴＭＬ文章デー
タが得られていれば、それを各エントリに登録されてい
る“送信元ＩＰアドレス”の移動端末１０１に向けて返
信する。For each entry in the processing terminal registration table, if the search result HTML text data is obtained in the search result buffer file specified from each entry, the mobile terminal communication control unit 116 registers it in each entry. To the mobile terminal 101 of the “source IP address”.

【０１８２】移動端末通信制御部１１６は、文音声認識
／データベース検索処理の終了要求コマンドを受信した
処理端末登録テーブルのエントリ、又は最終アクセス時
刻が現在時刻から一定時間前の時刻よりも更に前の時刻
である処理端末登録テーブルのエントリについて、その
エントリの内容を削除し、それから特定される各バッフ
ァファイルを削除する。＜移動端末通信制御部１１６の詳細動作＞図９〜図１１
は、上記機能を実現するために、移動端末通信制御部１
１６が実行する制御動作を示す動作フローチャートであ
る。この動作フローチャートは、移動端末通信制御部１
１６を制御する特には図示しないプロセッサが、特には
図示しない制御プログラムを実行する動作として実現さ
れる。The mobile terminal communication control unit 116 determines whether the entry in the processing terminal registration table that has received the end request command for the sentence speech recognition / database search processing or the last access time is earlier than the time that is a fixed time before the current time. For the entry in the processing terminal registration table that is the time, the contents of the entry are deleted, and each buffer file specified from that entry is deleted. <Detailed Operation of Mobile Terminal Communication Control Unit 116> FIGS. 9 to 11
Is a mobile terminal communication control unit 1 for realizing the above function.
16 is an operation flowchart illustrating a control operation performed by the control unit 16. This operation flowchart is based on the mobile terminal communication control unit 1.
A processor (not shown) for controlling the CPU 16 is realized as an operation for executing a control program (not shown).

【０１８３】まず、ステップ９０１で、音声制御ホスト
装置１０８内のパケット送受信部１１５（図１）から受
信通知が通知されたか否かが判定される。前述したよう
に、パケット送受信部１１５は、インターネット１０５
から転送されてきたＴＣＰ／ＩＰパケットを構成するＩ
ＰデータグラムのＩＰヘッダの“宛先ＩＰアドレス”フ
ィールドに自分である音声制御ホスト装置１０８のＩＰ
アドレスが設定されていることを識別することにより、
そのＴＣＰ／ＩＰパケットを受信し、かつ、それを構成
するＴＣＰセグメントの“宛先ポート番号”フィールド
及び“送信元ポート番号”フィールドに文音声認識／デ
ータベース検索処理のための通信プロトコルを特定する
１６ビットの整数値が設定されていることを確認するこ
とによって、移動端末通信制御部１１６に対して受信通
知を通知する。この受信通知は、文音声認識／データベ
ース検索処理の開始要求コマンド、文音声認識／データ
ベース検索処理の対象である音声データ、又は文音声認
識／データベース検索処理の終了要求コマンドの何れか
に関する受信通知である。First, in step 901, it is determined whether or not a reception notification has been received from the packet transmitting / receiving unit 115 (FIG. 1) in the voice control host device. As described above, the packet transmitting / receiving unit 115 communicates with the Internet 105
I that constitutes the TCP / IP packet transferred from
In the "destination IP address" field of the IP header of the P datagram, the IP address of the voice control host
By identifying that the address is set,
A 16-bit that receives the TCP / IP packet and specifies a communication protocol for sentence speech recognition / database search processing in the "destination port number" field and the "source port number" field of the TCP segment constituting the TCP / IP packet By confirming that an integer value is set, the mobile terminal communication control unit 116 is notified of the reception notification. This reception notification is a reception notification relating to any one of a sentence speech recognition / database search processing start request command, speech data to be subjected to sentence speech recognition / database search processing, and a sentence speech recognition / database search processing end request command. is there.

【０１８４】パケット送受信部１１５から受信通知が通
知されステップ９０１の判定がＹＥＳとなると、ステッ
プ９０２で、パケット送受信部１１５から受信通知と共
に引き渡されたデータが取り込まれる。この場合に、受
信通知が、文音声認識／データベース検索処理の開始要
求コマンドの受信通知である場合には、“送信元ＩＰア
ドレス”と“端末識別コード”と上記コマンドとが取り
込まれる。また、受信通知が、音声データの受信通知で
ある場合には、“送信元ＩＰアドレス”と音声データと
が取り込まれる。更に、受信通知が、文音声認識／デー
タベース検索処理の終了要求コマンドの受信通知である
場合には、“端末識別コード”とそのコマンドとが取り
込まれる。When the reception notification is notified from the packet transmission / reception unit 115 and the determination in step 901 is YES, in step 902, the data transferred together with the reception notification from the packet transmission / reception unit 115 is fetched. In this case, if the reception notification is a reception notification of a sentence speech recognition / database search process start request command, the “source IP address”, the “terminal identification code”, and the above command are fetched. When the reception notification is a reception notification of audio data, the “source IP address” and the audio data are captured. Further, when the reception notification is a reception notification of a command for requesting termination of sentence speech recognition / database search processing, the “terminal identification code” and the command are fetched.

【０１８５】ステップ９０２の処理の後に、図９のステ
ップ９０３、図１０のステップ９０７、又は図１０のス
テップ９０９の判定が順に検査され、何れかの判定結果
がＹＥＳとなる。即ち、ステップ９０２でパケット送受
信部１１５から引き渡されたデータが、文音声認識／デ
ータベース検索処理の開始要求コマンドに関するもので
ある場合はステップ９０３の判定がＹＥＳとなってステ
ップ９０４〜９０６が実行され、音声データに関するも
のである場合は図１０のステップ９０７の判定がＹＥＳ
となってステップ９０８が実行され、文音声認識／デー
タベース検索処理の終了要求コマンドに関するものであ
る場合には図１０のステップ９０９の判定がＹＥＳとな
ってステップ９１０と９１１が実行される。After the processing of step 902, the judgments of step 903 of FIG. 9, step 907 of FIG. 10, or step 909 of FIG. 10 are sequentially examined, and any judgment result becomes YES. That is, if the data delivered from the packet transmitting / receiving unit 115 in step 902 is related to a command requesting start of sentence speech recognition / database search processing, the determination in step 903 is YES, and steps 904 to 906 are executed. If it is related to voice data, the determination in step 907 of FIG. 10 is YES.
Then, step 908 is executed, and when the command is related to the end request command of the sentence speech recognition / database search processing, the determination in step 909 in FIG. 10 is YES, and steps 910 and 911 are executed.

【０１８６】パケット送受信部１１５から受信通知が通
知されておらずステップ９０１の判定がＮＯの場合、又
は上述の各コマンド又は音声データの受信に対応する処
理の後には、図１１のステップ９１２と９１３で検索結
果ＨＴＭＬ文章データの送信処理が実行され、それに続
くステップ９１４及び９１５で最終アクセス時刻が一定
時間以上前である移動端末１０１との通信を終了させる
ための処理が行われた後、再び図９のステップ９０１の
判定処理に戻る。If the reception notification has not been received from the packet transmission / reception unit 115 and the determination in step 901 is NO, or after the processing corresponding to the reception of each command or voice data described above, steps 912 and 913 in FIG. The transmission processing of the search result HTML text data is executed in Steps 914 and 915. After the processing for terminating the communication with the mobile terminal 101 whose last access time is a predetermined time or more is performed in Steps 914 and 915, the processing shown in FIG. The process returns to the determination process of Step 901 of Step 9.

【０１８７】ステップ９０１の判定がＹＥＳであり、ス
テップ９０２でパケット送受信部１１５から引き渡され
たデータが文音声認識／データベース検索処理の開始要
求コマンドに関するものである場合において、ステップ
９０３の判定がＹＥＳとなって実行されるステップ９０
４〜９０５の処理について説明する。If the determination in step 901 is YES and the data passed from packet transmitting / receiving section 115 in step 902 is related to a command requesting start of sentence speech recognition / database search processing, the determination in step 903 is YES. Step 90 to be executed
The processing of 4-905 will be described.

【０１８８】まず、ステップ９０４では、音声データの
受信用のバッファファイルである音声バッファファイル
と、認識音声文章の一時保存用のバッファファイルであ
る文章バッファファイルと、検索制御部１１８が使用す
る検索済キーワードバッファファイル及び検索インデッ
クスバッファファイルと、検索結果ＨＴＭＬ文章データ
の送信用のバッファファイルである検索結果バッファフ
ァイルとが、音声制御ホスト装置１０８が管理するファ
イルシステム上に作成される。First, in step 904, an audio buffer file which is a buffer file for receiving audio data, a text buffer file which is a buffer file for temporarily storing a recognized voice text, and a search completed file used by the search control unit 118. A keyword buffer file, a search index buffer file, and a search result buffer file that is a buffer file for transmitting search result HTML text data are created on a file system managed by the voice control host device 108.

【０１８９】次に、ステップ９０４では、移動端末通信
制御部１１６内の特には図示しないメモリに記憶される
図１２に示されるデータ構造を有する処理端末登録テー
ブルに、１つのエントリ（横１行のデータ組）が確保さ
れる。そして、そのエントリに、“端末識別コード”
と、“送信元ＩＰアドレス”と、最終アクセス時刻と、
音声バッファファイル名と、文章バッファファイル名
と、検索済キーワードバッファファイル名と、検索イン
デックスバッファファイル名と、検索結果バッファファ
イル名とが、登録される。“端末識別コード”は、ステ
ップ９０２でパケット送受信部１１５から引き渡された
データであり、移動端末１０１から転送されてきたＴＣ
Ｐ／ＩＰパケットを構成するＴＣＰセグメントの“デー
タ”フィールドに格納されていたものである（図６(c)
参照）。“送信元ＩＰアドレス”は、やはりステップ９
０２においてパケット送受信部１１５から引き渡された
データであり、移動端末１０１から転送されてきたＴＣ
Ｐ／ＩＰパケットを構成するＩＰデータグラムのＩＰヘ
ッダに格納されていたものである（図６(b) 、図７(a)
参照）。最終アクセス時刻には、現在時刻が設定され
る。各バッファファイル名は、ステップ９０４で作成さ
れた各ファイルを示すファイル名である。Next, at step 904, one entry (one horizontal row) is stored in the processing terminal registration table having the data structure shown in FIG. 12 stored in the memory (not shown) in the mobile terminal communication control unit 116. Data set) is secured. Then, in the entry, "terminal identification code"
, “Source IP address”, last access time,
An audio buffer file name, a text buffer file name, a searched keyword buffer file name, a search index buffer file name, and a search result buffer file name are registered. The “terminal identification code” is data transferred from the packet transmission / reception unit 115 in step 902, and is the TC transmitted from the mobile terminal 101.
This is stored in the "data" field of the TCP segment constituting the P / IP packet (FIG. 6 (c)).
reference). The “source IP address” is also stored in step 9
02 is the data transferred from the packet transmitting / receiving unit 115, and is the TC transferred from the mobile terminal 101.
This is stored in the IP header of the IP datagram constituting the P / IP packet (FIGS. 6B and 7A).
reference). The current time is set as the last access time. Each buffer file name is a file name indicating each file created in step 904.

【０１９０】ステップ９０５の処理の後、ステップ９０
６では、ステップ９０２でパケット送受信部１１５から
引き渡され処理端末登録テーブルの上記エントリに登録
された“送信元ＩＰアドレス”に向けて、送信許可デー
タが返信される。After the processing of step 905, step 90
In step 6, the transmission permission data is returned to the "source IP address" passed from the packet transmitting / receiving unit 115 in step 902 and registered in the entry of the processing terminal registration table.

【０１９１】具体的には、移動端末通信制御部１１６
は、“送信元ＩＰアドレス”への送信許可データの返信
を、パケット送受信部１１５（図１）に対して依頼す
る。この結果、パケット送受信部１１５は、まず、図６
(c) に示されるフォーマットを有するＴＣＰセグメント
を生成する。この場合、図６(c) 及び図７(b) に示され
るフォーマットを有するＴＣＰヘッダにおいて、“送信
元ポート番号”フィールド及び“宛先ポート番号”フィ
ールドには、文音声認識／データベース検索処理のため
の通信プロトコルを特定する１６ビットの整数値が設定
される。そして、ＴＣＰセグメントの“データ”フィー
ルドには、送信許可データが格納される。More specifically, mobile terminal communication control section 116
Requests the packet transmission / reception unit 115 (FIG. 1) to return transmission permission data to the “source IP address”. As a result, the packet transmitting / receiving unit 115 first
Generate a TCP segment having the format shown in (c). In this case, in the TCP header having the format shown in FIGS. 6 (c) and 7 (b), the "source port number" field and the "destination port number" field contain text / speech recognition / database search processing. A 16-bit integer value specifying the communication protocol is set. Then, transmission permission data is stored in the “data” field of the TCP segment.

【０１９２】次に、パケット送受信部１１５は、上述の
ＴＣＰセグメントが“データ”フィールドに格納された
図６(b) に示されるフォーマットを有するＩＰデータグ
ラムを生成する。この場合に、図６(b) 及び図７(a) に
示されるフォーマットを有するＩＰヘッダにおいて、
“プロトコル”フォーマットには、その“データ”フィ
ールドに格納されるＴＣＰセグメントデータのフォーマ
ットを規定する整数値６が設定される。また、“送信元
ＩＰアドレス”フィールドには、音声制御ホスト装置１
０８に割当てられているＩＰアドレスが設定される。更
に、“宛先ＩＰアドレス”フィールドには、図９のステ
ップ９０２でパケット送受信部１１５から引き渡された
“送信元ＩＰアドレス”が設定される。Next, the packet transmitting / receiving unit 115 generates an IP datagram having the format shown in FIG. 6B in which the above-mentioned TCP segment is stored in the “data” field. In this case, in the IP header having the format shown in FIGS. 6 (b) and 7 (a),
In the “protocol” format, an integer value 6 defining the format of the TCP segment data stored in the “data” field is set. The “source IP address” field contains the voice control host device 1
08 is set. Further, in the “destination IP address” field, the “source IP address” passed from the packet transmission / reception unit 115 in step 902 of FIG. 9 is set.

【０１９３】そして、パケット送受信部１１５は、上述
のＩＰデータグラムが格納されたＬＡＮ１０７上のプロ
トコルに従ったフレームを生成し、それをＬＡＮ１０７
に送出する。例えば、ＬＡＮ１０７がイーサネット方式
によるローカルエリアネットワークであれば、上記フレ
ームは、イーサネットフレームである。Then, the packet transmitting / receiving unit 115 generates a frame according to the protocol on the LAN 107 in which the above-described IP datagram is stored, and transmits the frame to the LAN 107
To send to. For example, if the LAN 107 is a local area network based on the Ethernet system, the frame is an Ethernet frame.

【０１９４】上記フレームとＩＰデータグラムとＴＣＰ
セグメントとから構成されるＴＣＰ／ＩＰパケットは、
それを構成するＩＰデータグラムのＩＰヘッダに格納さ
れている“宛先ＩＰアドレス”に基づいて、ルータ装置
１０６及びインターネット１０５を介して移動端末制御
ホスト装置１０４まで転送された後、更に、ＰＨＳ網１
０３及び無線基地（又は有線接続装置）１０２を介し
て、移動端末１０１の通信部１１１内の通信制御部３２
１（図３）まで転送される。The above frame, IP datagram and TCP
A TCP / IP packet composed of a segment and
After being transferred to the mobile terminal control host device 104 via the router device 106 and the Internet 105 based on the “destination IP address” stored in the IP header of the IP datagram constituting the PHS network,
03 and the communication control unit 32 in the communication unit 111 of the mobile terminal 101 via the wireless base (or wired connection device) 102
1 (FIG. 3).

【０１９５】これ以降、移動端末１０１から音声制御ホ
スト装置１０８へは、前述したようにして、音声データ
が転送されてくる。ステップ９０６の処理の後は、図１
１のステップ９１２と９１３で検索結果ＨＴＭＬ文章デ
ータの送信処理が実行され、それに続くステップ９１４
及び９１５で最終アクセス時刻が一定時間以上前である
移動端末１０１との通信を終了させるための処理が行わ
れた後、再び図９のステップ９０１の判定処理に戻る。Thereafter, voice data is transferred from the mobile terminal 101 to the voice control host device 108 as described above. After the processing of step 906, FIG.
In steps 912 and 913 of step 1, transmission processing of search result HTML text data is executed, and the subsequent step 914
In steps 915 and 915, a process for terminating communication with the mobile terminal 101 whose last access time is a predetermined time or more is performed, and then the process returns to step 901 in FIG.

【０１９６】次に、図９のステップ９０１の判定がＹＥ
Ｓであり、ステップ９０２でパケット送受信部１１５か
ら引き渡されたデータが音声データである場合におい
て、図１０のステップ９０７の判定がＹＥＳとなって実
行されるステップ９０８の処理について説明する。Next, the determination in step 901 in FIG.
Step S 908, which is executed when the determination in step 907 in FIG. 10 is YES when the data delivered from the packet transmitting / receiving unit 115 in step S 902 is voice data, will be described.

【０１９７】即ち、ステップ９０８では、図９のステッ
プ９０２でパケット送受信部１１５から引き渡されたの
と同じ“送信元ＩＰアドレス”が記憶されている処理端
末登録テーブル（図１２）のエントリが検索され、該当
するエントリに記憶されている音声バッファファイル名
に対応する音声バッファファイル（図９のステップ９０
４参照）に、図９のステップ９０２でパケット送受信部
１１５から引き渡された音声データが追加書き込みされ
る。なお、追加書込み時の音声バッファファイルのサイ
ズは、音声制御ホスト装置１０８が管理するファイルシ
ステムによって自動的に調整される。That is, in step 908, an entry in the processing terminal registration table (FIG. 12) in which the same “source IP address” passed from the packet transmitting / receiving section 115 in step 902 in FIG. 9 is stored is searched. The audio buffer file corresponding to the audio buffer file name stored in the corresponding entry (step 90 in FIG. 9)
4), the audio data transferred from the packet transmitting / receiving unit 115 in step 902 of FIG. 9 is additionally written. The size of the audio buffer file at the time of additional writing is automatically adjusted by the file system managed by the audio control host device 108.

【０１９８】また、ステップ９０８では、上記該当する
エントリに記憶されている最終アクセス時刻が、現在時
刻に更新される。このようにして、移動端末１０１毎
（“端末識別コード”毎）の音声バッファファイルを介
して、移動端末通信制御部１１６から文音声認識部１１
７（図１）に音声データが引き渡される。文音声認識部
１１７は、後述するように、図１２に示される処理端末
登録テーブルのエントリ毎に、各エントリから特定され
る音声バッファファイルに音声データが受信されていれ
ばそれに対して文音声認識処理を実行し、その結果得ら
れる認識音声文章データを上記各エントリに対応する文
章バッファファイルに追加書き込みすることになる。更
に、検索制御部１１８（図１）は、後述するように、図
１２に示される処理端末登録テーブルのエントリ毎に、
各エントリから特定される文章バッファファイルに認識
音声文章データが得られていればそれに対してデータベ
ース検索処理を実行し、その結果得られる検索結果ＨＴ
ＭＬ文章データを上記各エントリに対応する検索結果バ
ッファファイルに追加書き込みすることになる。At step 908, the last access time stored in the relevant entry is updated to the current time. In this way, the mobile terminal communication control unit 116 sends the sentence voice recognition unit 11 via the voice buffer file for each mobile terminal 101 (for each “terminal identification code”).
7 (FIG. 1) is delivered. As will be described later, the sentence speech recognition unit 117 performs, for each entry of the processing terminal registration table shown in FIG. 12, the sentence speech recognition if the speech data is received in the speech buffer file specified from each entry. The process is executed, and the resulting recognized voice text data is additionally written to the text buffer file corresponding to each of the entries. Further, as described later, the search control unit 118 (FIG. 1) performs, for each entry of the processing terminal registration table shown in FIG.
If the recognized voice sentence data is obtained in the sentence buffer file specified from each entry, a database search process is executed on the data, and a search result HT obtained as a result is obtained.
The ML text data is additionally written to the search result buffer file corresponding to each of the entries.

【０１９９】ステップ９０８の処理の後は、図１１のス
テップ９１２と９１３で検索結果ＨＴＭＬ文章データの
送信処理が実行され、それに続くステップ９１４及び９
１５で最終アクセス時刻が一定時間以上前である移動端
末１０１との通信を終了させるための処理が行われた
後、再び図９のステップ９０１の判定処理に戻る。After the processing in step 908, transmission processing of the retrieval result HTML text data is executed in steps 912 and 913 in FIG. 11, and the subsequent steps 914 and 9
After the processing for terminating the communication with the mobile terminal 101 whose final access time is a fixed time or more in 15 is performed, the process returns to the determination processing in step 901 in FIG. 9 again.

【０２００】次に、図９のステップ９０１の判定がＹＥ
Ｓであり、ステップ９０２でパケット送受信部１１５か
ら引き渡されたデータが文音声認識／データベース検索
処理の終了要求コマンドに関するものである場合におい
て、図１０のステップ９０９の判定がＹＥＳとなって実
行されるステップ９１０と９１１の処理について説明す
る。Next, the determination in step 901 in FIG.
If it is S and the data transferred from the packet transmitting / receiving unit 115 in step 902 is related to a command for requesting termination of sentence speech recognition / database search processing, the determination in step 909 in FIG. The processing of steps 910 and 911 will be described.

【０２０１】まず、ステップ９１０で、図９のステップ
９０２でパケット送受信部１１５から引き渡されたのと
同じ“端末識別コード”が記憶されている処理端末登録
テーブル（図１２）のエントリの内容が全て削除され
る。First, in step 910, all the contents of the entries in the processing terminal registration table (FIG. 12) storing the same “terminal identification code” passed from the packet transmitting / receiving section 115 in step 902 in FIG. Deleted.

【０２０２】次に、ステップ９１１で、上記エントリに
記憶されていた音声バッファファイル名、文章バッファ
ファイル名、検索済キーワードバッファファイル名、検
索インデックスバッファファイル名、及び検索結果バッ
ファファイル名に対応する各バッファファイルが、音声
制御ホスト装置１０８が管理するファイルシステム上か
ら削除される。Next, in step 911, each of the audio buffer file name, the sentence buffer file name, the searched keyword buffer file name, the search index buffer file name, and the search result buffer file name stored in the above entry is read. The buffer file is deleted from the file system managed by the audio control host device 108.

【０２０３】ステップ９１１の処理の後は、図１１のス
テップ９１２と９１３で検索結果ＨＴＭＬ文章データの
送信処理が実行され、それに続くステップ９１４及び９
１５で最終アクセス時刻が一定時間以上前である移動端
末１０１との通信を終了させるための処理が行われた
後、再び図９のステップ９０１の判定処理に戻る。After the processing of step 911, transmission processing of the retrieval result HTML text data is executed in steps 912 and 913 of FIG.
After the processing for terminating the communication with the mobile terminal 101 whose final access time is a fixed time or more in 15 is performed, the process returns to the determination processing in step 901 in FIG. 9 again.

【０２０４】パケット送受信部１１５から受信通知が通
知されておらず図９のステップ９０１の判定がＮＯの場
合、又は上述の各コマンド又は音声データの受信に対応
する処理の後に実行される、図１１のステップ９１２と
９１３の処理、及びそれに続くステップ９１４と９１５
の処理について説明する。When the reception notification is not notified from the packet transmitting / receiving unit 115 and the determination in step 901 in FIG. 9 is NO, or after the processing corresponding to the reception of each command or voice data described above, FIG. Processing of steps 912 and 913, and subsequent steps 914 and 915
Will be described.

【０２０５】これらの処理において、文音声認識部１１
７から得られている検索結果ＨＴＭＬ文章データの送信
処理が実行される。まず、ステップ９１２では、処理端
末登録テーブル（図１２）において、検索結果バッファ
ファイル名に対応する検索結果バッファファイルに検索
結果ＨＴＭＬ文章データが存在するエントリがあるか否
かが判定される。In these processes, the sentence speech recognition unit 11
7 is transmitted. First, in step 912, it is determined whether or not the search result buffer file corresponding to the search result buffer file name has an entry in which search result HTML text data exists in the processing terminal registration table (FIG. 12).

【０２０６】そのようなエントリが無くステップ９１２
の判定がＮＯの場合には、ステップ９１３での検索結果
ＨＴＭＬ文章データの送信処理は実行されずに、ステッ
プ９１４及び９１５の処理に進む。If there is no such entry, step 912
Is negative, the process of transmitting the search result HTML text data in step 913 is not executed, and the process proceeds to steps 914 and 915.

【０２０７】上述のようなエントリが１つ以上存在しス
テップ９１２の判定がＹＥＳの場合には、ステップ９１
３で、該当するエントリ毎に、そのエントリに記憶され
ている“送信元ＩＰアドレス”に向けて、そのエントリ
に記憶されている検索結果バッファファイル名に対応す
る検索結果バッファファイル内の検索結果ＨＴＭＬ文章
データが送信され、その送信された検索結果ＨＴＭＬ文
章データが上記検索結果バッファファイルから削除され
る。なお、削除時の検索結果バッファファイルのサイズ
は、音声制御ホスト装置１０８が管理するファイルシス
テムによって自動的に調整される。If there is one or more entries as described above and the determination in step 912 is YES, step 91
At 3, the search result HTML in the search result buffer file corresponding to the search result buffer file name stored in the entry is directed to the “source IP address” stored in the entry for each corresponding entry. The sentence data is transmitted, and the transmitted search result HTML sentence data is deleted from the search result buffer file. The size of the search result buffer file at the time of deletion is automatically adjusted by the file system managed by the voice control host device 108.

【０２０８】上述のステップ９１３の処理の後又はステ
ップ９１２の判定がＮＯである場合に、ステップ９１４
が実行される。ここでは、処理端末登録テーブル（図１
２）のエントリのうち、最終アクセス時刻が現在時刻か
ら一定時間前の時刻より更に前の時刻であるエントリが
検出され、そのエントリの内容が全て削除される。After the processing in step 913 or when the determination in step 912 is NO, step 914
Is executed. Here, the processing terminal registration table (FIG. 1)
Of the entries in 2), the entry whose last access time is a time earlier than the current time by a certain time before the current time is detected, and all the contents of the entry are deleted.

【０２０９】また、ステップ９１５で、上記エントリに
記憶されていた音声バッファファイル名、文章バッファ
ファイル名、検索済キーワードバッファファイル名、検
索インデックスバッファファイル名、及び検索結果バッ
ファファイル名に対応する各バッファファイルが、音声
制御ホスト装置１０８が管理するファイルシステム上か
ら削除される。In step 915, each buffer corresponding to the audio buffer file name, text buffer file name, searched keyword buffer file name, search index buffer file name, and search result buffer file name stored in the above entry The file is deleted from the file system managed by the voice control host device 108.

【０２１０】ステップ９１５の処理の後、再び図９のス
テップ９０１の判定処理に戻る。＜文音声認識部１１７の詳細動作＞図１３は、文音声認
識部１１７の機能ブロック図である。After the processing in step 915, the process returns to the determination processing in step 901 in FIG. <Detailed Operation of Sentence Speech Recognition Unit 117> FIG. 13 is a functional block diagram of the sentence speech recognition unit 117.

【０２１１】この文音声認識部１１７は、前述したよう
に、図１２に示される処理端末登録テーブルのエントリ
毎に、各エントリから特定される音声バッファファイル
に音声データが受信されていればそれに対して文音声認
識を実行し、その結果得られる認識音声文章データを上
記各エントリに対応する文章バッファファイルに追加書
き込みする。As described above, this sentence speech recognition unit 117 performs, for each entry in the processing terminal registration table shown in FIG. 12, if speech data is received in a speech buffer file specified from each entry, Then, sentence speech recognition is executed, and the resulting recognized speech sentence data is additionally written into a sentence buffer file corresponding to each of the above entries.

【０２１２】上述のエントリ毎の音声バッファファイル
からの音声データの読出しと文章バッファファイルへの
認識音声文章データの書込みは、図１３の入出力制御部
１３０９が制御する。まず、この入出力制御部１３０９
の制御動作につき説明する。図１４は、入出力制御部１
３０９が実行する制御動作を示す動作フローチャートで
ある。この動作フローチャートは、入出力制御部１３０
９を制御する特には図示しないプロセッサが、特には図
示しない制御プログラムを実行する動作として実現され
る。The reading of the voice data from the voice buffer file for each entry and the writing of the recognized voice text data to the text buffer file are controlled by the input / output control unit 1309 in FIG. First, the input / output control unit 1309
Will be described. FIG. 14 shows the input / output control unit 1
309 is an operation flowchart illustrating a control operation performed by the control unit. This operation flowchart is based on the input / output control unit 130.
9 is realized as an operation of executing a control program (not shown).

【０２１３】まず、ステップ１４０１では、処理端末登
録テーブル（図１２）において、音声バッファファイル
名に対応する音声バッファファイルに音声データが記憶
されているエントリが存在するか否かが判定される。First, in step 1401, it is determined whether or not an entry in which audio data is stored in the audio buffer file corresponding to the audio buffer file name exists in the processing terminal registration table (FIG. 12).

【０２１４】そのようなエントリが存在しステップ１４
０１の判定がＹＥＳならば、ステップ１４０２で、該当
するエントリ毎に、そのエントリに記憶されている“端
末識別コード”と、そのエントリに記憶されている音声
バッファファイル名に対応する音声バッファファイル上
の音声データとが、図１３の入力バッファキュー１３０
１に書き込まれ、その音声データが音声バッファファイ
ルから削除される。If there is such an entry and step 14
If the determination of 01 is YES, in step 1402, for each entry, the "terminal identification code" stored in the entry and the audio buffer file name corresponding to the audio buffer file name stored in the entry are displayed. Of the input buffer queue 130 shown in FIG.
1 and the audio data is deleted from the audio buffer file.

【０２１５】入力バッファキュー１３０１は、それがキ
ューイングしている音声データを、音声区間検出部１３
０２に順次流し込む機能を有する。音声区間検出部１３
０２以降に接続されている音声分析部１３０３、音素認
識部１３０４、単語認識部１３０６、及び文章認識部１
３０７は、データ処理パイプラインを形成しており、相
互に独立して、入力データを処理する機能を有する。ま
た、１３０２〜１３０７の各部分は、現在処理している
音声データに対応する“端末識別コード”（入力バッフ
ァキュー１３０１から入力される）を認識することがで
きる。従って、最終的に文章認識部１３０７から出力バ
ッファキュー１３０８へは、“端末識別コード”と認識
音声文章データとの組が出力されることになる。The input buffer queue 1301 stores the audio data queued in the input buffer queue 1301
02. Voice section detector 13
02, the speech analysis unit 1303, the phoneme recognition unit 1304, the word recognition unit 1306, and the text recognition unit 1
A data processing pipeline 307 has a function of processing input data independently of each other. Each of the parts 1302 to 1307 can recognize the “terminal identification code” (input from the input buffer queue 1301) corresponding to the audio data currently being processed. Therefore, finally, a set of the “terminal identification code” and the recognized voice text data is output from the text recognition unit 1307 to the output buffer queue 1308.

【０２１６】ステップ１４０２の処理の後又はステップ
１４０１の判定がＮＯの場合には、ステップ１４０３
で、図１３の出力バッファキュー１３０８に、“端末識
別コード”と認識音声文章データの組が得られているか
否かが判定される。After the processing in step 1402 or when the determination in step 1401 is NO, step 1403
Then, it is determined whether or not a set of “terminal identification code” and recognized speech text data has been obtained in the output buffer queue 1308 of FIG.

【０２１７】そのような組が得られておりステップ１４
０３の判定がＹＥＳならば、ステップ１４０４で、出力
バッファキュー１３０８内の組毎に、その組の“端末識
別コード”に対応する処理端末登録テーブルのエントリ
について、そのエントリに記憶されている文章バッファ
ファイル名に対応する文章バッファファイルに、出力バ
ッファキュー１３０８内の組の認識音声文章データが追
加書き込みされる。When such a set has been obtained, step 14 is executed.
If the determination in step 03 is YES, in step 1404, for each set in the output buffer queue 1308, for the entry in the processing terminal registration table corresponding to the "terminal identification code" of that set, the text buffer stored in that entry The set of recognized speech text data in the output buffer queue 1308 is additionally written to the text buffer file corresponding to the file name.

【０２１８】ステップ１４０４の処理の後又はステップ
１４０３の判定がＮＯの場合には、再びステップ１４０
１の判定処理が実行される。以上のようにして文音声認
識部１１７は、流れ作業的に効率良く、複数の移動端末
１０１から要求された音声データに対する文音声認識処
理を実行することができる。After the processing in step 1404 or when the determination in step 1403 is NO, step 140
1 is performed. As described above, the sentence / speech recognition unit 117 can efficiently execute the sentence / speech recognition process on the speech data requested by the plurality of mobile terminals 101 in a streamlined manner.

【０２１９】次に、文音声認識処理を実現するための１
３０２〜１３０７の各部分の機能につき、以下に説明す
る。なお、以下に説明する各方式は、例えば、文献「電
子・情報工学入門シリーズ２音響・音声工学」（古井
著、近代科学社）第１４章」を参照することにより、実
現することができる。Next, 1 for realizing the sentence speech recognition processing is described.
The function of each of the parts 302 to 1307 will be described below. Each of the methods described below can be realized by referring to, for example, the document “Electronic / Information Engineering Introduction Series 2, Sound and Speech Engineering” (Furui, Modern Science Co., Chapter 14).

【０２２０】音声区間検出部１３０２は、入力バッファ
キュー１３０１から入力される音声データのサンプル時
系列について、音声が存在する区間を検出する。より具
体的には、音声区間検出部１３０２は、所定サンプル
（例えば８ｋＨｚサンプリングデータについて３２乃至
２５６サンプル）ずつの平均パワー（電力）を計算し、
その平均パワーが所定の閾値を超えた状態が所定回数以
上連続して続く区間を、音声区間として検出する。これ
により、音声が存在しない区間で文音声が誤認識されて
しまうのを防ぐことができる。[0220] Voice section detection section 1302 detects a section in which voice exists, in a sample time series of voice data input from input buffer queue 1301. More specifically, the voice section detection unit 1302 calculates an average power (power) of predetermined samples (for example, 32 to 256 samples for 8 kHz sampling data),
A section in which the state in which the average power exceeds a predetermined threshold continues for a predetermined number of times or more is detected as a voice section. Thereby, it is possible to prevent a sentence voice from being erroneously recognized in a section where no voice exists.

【０２２１】音声分析部１３０３は、音声区間検出部１
３０２から出力される音声データについて、その特徴分
析を行うことによって、特徴量パラメータベクトルを検
出する。音声分析方式としては、以下の周知の分析方式
の何れかを採用することができる。（１）音声データ時系列を入力とする帯域フィルタバン
クの各出力を平滑化し、それらの平滑化された各出力を
特徴量パラメータベクトルの要素とする方式。（２）連続する所定サンプルずつの音声データ時系列を
入力とする高速フーリエ変換（ＦＦＴ）によって計算し
た各短時間スペクトル成分を平滑化し、それらの平滑化
された各成分値を特徴量パラメータベクトルの要素とす
る方式。（３）連続する所定サンプルずつの音声データ時系列を
入力とするケプストラム分析によってケプストラム係数
群を計算し、それらを特徴量パラメータベクトルの要素
とする方式。（４）上記（３）のケプストラム係数群に加えて、それ
らに対するΔ（デルタ）ケプストラム（ケプストラムの
微係数）群を計算し、それらを特徴量パラメータベクト
ルの要素に加える方式。（５）連続する所定サンプルずつの音声データ時系列を
入力とする線形予測分析（ＬＰＣ分析、更に具体的には
線スペクトル対分析：ＬＳＰ分析）によって、ＬＰＣ
（ＬＳＰ）係数群を計算し、それらを特徴量パラメータ
ベクトルの要素とする方式。（６）連続する所定サンプルずつの音声データ時系列を
入力とする自己相関分析によって自己相関関数を計算
し、それらに基づいて検出される音声のピッチ基本周波
数パターンを特徴量パラメータベクトルの１つの要素に
加える方式。次に、音素認識部１３０４は、所定フレーム周期（所定
サンプル）毎に音声分析部１３０３から出力される特徴
量パラメータベクトルと、音素標準パターン辞書１３０
５に蓄積されている各音素の特徴量パラメータベクトル
の標準パターンとの類似度（距離）を計算し、その結果
所定フレーム周期毎に得られる類似度の高い音素の組を
その類似度と共に音素ラティスデータとして出力する。
音素認識部１３０４は、音素の認識誤りの発生を回避す
るために、所定フレーム周期毎に最終的な音素を決定す
ることはせずに、音素候補を表にした音素ラティスデー
タの形式で結果データを出力する。The voice analysis unit 1303 is provided with the voice section detection unit 1
A feature amount parameter vector is detected by performing a feature analysis on the audio data output from 302. Any of the following well-known analysis methods can be adopted as the voice analysis method. (1) A method of smoothing each output of a band filter bank to which a time series of audio data is input, and using each smoothed output as an element of a feature parameter vector. (2) Smoothing each short-time spectrum component calculated by Fast Fourier Transform (FFT) which receives an audio data time series of a predetermined number of continuous samples as input, and converts each smoothed component value into a feature amount parameter vector. Element method. (3) A method in which a cepstrum coefficient group is calculated by cepstrum analysis using a time series of audio data for each successive predetermined sample as an input, and these are used as elements of a feature parameter vector. (4) A method of calculating a Δ (delta) cepstrum (differential coefficient of a cepstrum) group for the cepstrum coefficient group in addition to the cepstrum coefficient group of the above (3), and adding them to the element of the feature amount parameter vector. (5) LPC analysis is performed by linear prediction analysis (LPC analysis, more specifically, line spectrum pair analysis: LSP analysis) that receives a time series of audio data for each successive predetermined sample as input.
(LSP) A method of calculating coefficient groups and using them as elements of a feature parameter vector. (6) An autocorrelation function is calculated by an autocorrelation analysis using a time series of voice data of each successive predetermined sample as an input, and a pitch fundamental frequency pattern of voice detected based on the autocorrelation function is calculated as one element of a feature parameter vector. Method to add to Next, the phoneme recognizing unit 1304 converts the feature parameter vector output from the speech analyzing unit 1303 for each predetermined frame period (predetermined sample) with the phoneme standard pattern dictionary 1303.
5. The similarity (distance) of the feature parameter vector of each phoneme stored in No. 5 with the standard pattern is calculated, and as a result, a set of phonemes with high similarity obtained at every predetermined frame period is determined along with the phonetic lattice along with the similarity. Output as data.
The phoneme recognizing unit 1304 does not determine the final phoneme at every predetermined frame period in order to avoid occurrence of a phoneme recognition error. Instead, the result data in the form of phoneme lattice data in which phoneme candidates are listed. Is output.

【０２２２】単語認識部１３０６は、所定フレーム周期
毎に音素認識部１３０４から出力される音素ラティスデ
ータを入力として、所定フレーム周期毎に単語候補を表
にして単語ラティスデータを出力する。単語認識方式と
しては、以下の周知の分析方式の何れかを採用すること
ができる。（１）単語認識部１３０６は、音素認識部１３０４から
出力される複数のフレーム周期にまたがる音素ラティス
データの時系列と、単語辞書に蓄積されている全音素標
準パターン系列とで、時間正規化（ＤＰマッチング or
ＤＴＷ：DynamicTime Warping）を実行し、単語ラティ
スデータを出力する。この場合も、単語認識部１３０６
は、単語の認識誤りの発声を回避するために、所定フレ
ーム周期毎に最終的な単語を決定することはせずに、単
語候補を表にした単語ラティスデータの形式で結果デー
タを出力する。（２）単語認識部１３０６は、ＨＭＭ（Hidden Markov
Model ）によって、全単語をモデル化し、音素認識部１
３０４から出力される複数のフレーム周期にまたがる音
素ラティスデータの時系列をＨＭＭ分析部に入力し、生
起確率の大きいものから複数個のモデルに対応する各単
語を、単語候補である単語ラティスデータとして出力す
る。最後に、文章認識部１３０７は、その第１段処理とし
て、単語認識部１３０６から出力される単語ラティスデ
ータを順次入力し、日本語（英語でもよい）の文節構造
に関する文節内文法（語順規則）に従って、種々の文節
の可能性を文節ラティスデータとして算出する。そし
て、文章認識部１３０７は、その第２段処理として、文
節間文法に従って文節間の意味的な係り受けを解析し、
認識音声文章データを決定し、それを、入力バッファキ
ュー１３０１から順次伝達されてきた“端末識別コー
ド”と対について、出力バッファキュー１３０８に書き
込む。＜検索制御部１１８の詳細動作＞図１５は、検索制御部
１１８の機能ブロック図である。The word recognizing unit 1306 receives the phoneme lattice data output from the phoneme recognizing unit 1304 at every predetermined frame cycle, and outputs word lattice data with a table of word candidates at every predetermined frame cycle. As the word recognition method, any of the following well-known analysis methods can be adopted. (1) The word recognition unit 1306 performs time normalization on a time series of phoneme lattice data output from the phoneme recognition unit 1304 over a plurality of frame periods and all phoneme standard pattern sequences stored in the word dictionary ( DP matching or
DTW (Dynamic Time Warping) is executed to output word lattice data. Also in this case, the word recognition unit 1306
Outputs the result data in the form of word lattice data in which word candidates are tabulated without determining a final word at every predetermined frame period in order to avoid utterance of a word recognition error. (2) The word recognition unit 1306 uses HMM (Hidden Markov)
Model), all the words are modeled, and the phoneme recognition unit 1
A time series of phoneme lattice data spanning a plurality of frame periods output from 304 is input to the HMM analysis unit, and each word corresponding to a plurality of models from a large occurrence probability is regarded as word lattice data as a word candidate. Output. Finally, the sentence recognizing unit 1307 sequentially inputs the word lattice data output from the word recognizing unit 1306 as a first stage processing, and generates a grammar (phrase order rule) in the bunsetsu regarding the bunsetsu structure of Japanese (or English). , The possibility of various phrases is calculated as phrase lattice data. Then, the sentence recognizing unit 1307 analyzes the semantic dependency between the phrases according to the inter-phrase grammar as the second stage processing,
The recognition voice sentence data is determined, and it is written in the output buffer queue 1308 in combination with the “terminal identification code” sequentially transmitted from the input buffer queue 1301. <Detailed Operation of Search Control Unit 118> FIG. 15 is a functional block diagram of the search control unit 118.

【０２２３】この検索制御部１１８は、前述したよう
に、図１２に示される処理端末登録テーブルのエントリ
毎に、各エントリから特定される文章バッファファイル
に文音声認識部１１７によって認識音声文章データが得
られていればそれに対してデータベース検索処理を実行
し、その結果得られる検索結果ＨＴＭＬ文章データを上
記各エントリに対応する検索結果バッファファイルに追
加書き込みする。As described above, the search control unit 118 stores, for each entry in the processing terminal registration table shown in FIG. 12, the sentence speech recognition unit 117 stores the sentence speech sentence data in the sentence buffer file specified from each entry. If it is obtained, a database search process is executed for the search result, and the search result HTML text data obtained as a result is additionally written to the search result buffer file corresponding to each entry.

【０２２４】上述のエントリ毎の文章バッファファイル
からの認識音声文章データの読出しと検索結果バッファ
ファイルへの検索結果ＨＴＭＬ文章データの書込みは、
図１５の入出力制御部１５０７が制御する。まず、この
入出力制御部１５０７の制御動作につき説明する。図１
６は、入出力制御部１５０７が実行する制御動作を示す
動作フローチャートである。この動作フローチャート
は、入出力制御部１５０７を制御する特には図示しない
プロセッサが、特には図示しない制御プログラムを実行
する動作として実現され、前述した、文音声認識部１１
７内の図１３に示される入出力制御部１３０９と同様の
制御動作を実現する。The reading of the recognized speech text data from the text buffer file for each entry and the writing of the search result HTML text data to the search result buffer file are performed as described above.
The input / output control unit 1507 in FIG. First, the control operation of the input / output control unit 1507 will be described. FIG.
6 is an operation flowchart illustrating a control operation performed by the input / output control unit 1507. This operation flowchart is realized as an operation in which a processor (not shown) for controlling the input / output control unit 1507 executes a control program (not shown).
7 realizes the same control operation as the input / output control unit 1309 shown in FIG.

【０２２５】まず、ステップ１６０１では、処理端末登
録テーブル（図１２）において、文章バッファファイル
名に対応する文章バッファファイルに認識音声文章デー
タが記憶されているエントリが存在するか否かが判定さ
れる。First, in step 1601, it is determined whether or not there is an entry in the processing terminal registration table (FIG. 12) in which the recognized speech text data is stored in the text buffer file corresponding to the text buffer file name. .

【０２２６】そのようなエントリが存在しステップ１６
０１の判定がＹＥＳならば、ステップ１６０２で、該当
するエントリ毎に、そのエントリに記憶されている“端
末識別コード”と、そのエントリに記憶されている文章
バッファファイル名に対応する文章バッファファイル上
の認識音声文章データとが、図１５の入力バッファキュ
ー１５０１に書き込まれ、その認識音声文章データが文
章バッファファイルから削除される。If such an entry exists and step 16
If the determination of 01 is YES, in step 1602, for each entry, the "terminal identification code" stored in the entry and the text buffer file name corresponding to the text buffer file name stored in the entry are displayed. Is written into the input buffer queue 1501 in FIG. 15, and the recognized voice sentence data is deleted from the text buffer file.

【０２２７】入力バッファキュー１５０１は、それがキ
ューイングしている認識音声文章データを、検索インデ
ックス作成部１５０２に順次流し込む機能を有する。検
索インデックス作成部１５０２以降に接続されている検
索キーワード抽出部１５０３及び検索実行部１５０５
は、図１３に示される文音声認識部１１７の構成の場合
と同様に、データ処理パイプラインを形成しており、相
互に独立して、入力データを処理する機能を有する。ま
た、１５０２〜１５０５の各部分は、現在処理している
認識音声文章データに対応する“端末識別コード”（入
力バッファキュー１５０１から入力される）を認識する
ことができる。従って、最終的に検索実行部１５０５か
ら出力バッファキュー１５０６へは、“端末識別コー
ド”と検索結果ＨＴＭＬ文章データとの組が出力される
ことになる。The input buffer queue 1501 has a function of sequentially flowing the recognition voice sentence data queued therein to the search index creation unit 1502. A search keyword extraction unit 1503 and a search execution unit 1505 connected after the search index creation unit 1502
Forms a data processing pipeline similarly to the configuration of the sentence speech recognition unit 117 shown in FIG. 13, and has a function of processing input data independently of each other. Each of the parts 1502 to 1505 can recognize the “terminal identification code” (input from the input buffer queue 1501) corresponding to the currently recognized speech text data. Therefore, finally, a set of the “terminal identification code” and the search result HTML text data is output from the search execution unit 1505 to the output buffer queue 1506.

【０２２８】ステップ１６０２の処理の後又はステップ
１６０１の判定がＮＯの場合には、ステップ１６０３
で、図１５の出力バッファキュー１５０６に、“端末識
別コード”と検索結果ＨＴＭＬ文章データの組が得られ
ているか否かが判定される。After the processing in step 1602 or when the determination in step 1601 is NO, step 1603
Then, it is determined whether a set of “terminal identification code” and search result HTML text data is obtained in the output buffer queue 1506 in FIG.

【０２２９】そのような組が得られておりステップ１６
０３の判定がＹＥＳならば、ステップ１６０４で、出力
バッファキュー１５０６内の組毎に、その組の“端末識
別コード”に対応する処理端末登録テーブルのエントリ
について、そのエントリに記憶されている検索結果バッ
ファファイル名に対応する検索結果バッファファイル
に、出力バッファキュー１５０６内の組の検索結果ＨＴ
ＭＬ文章データが追加書き込みされる。When such a set is obtained, step 16
If the determination in step 03 is YES, in step 1604, for each set in the output buffer queue 1506, for the entry in the processing terminal registration table corresponding to the "terminal identification code" of that set, the search result stored in that entry A set of search results HT in the output buffer queue 1506 is stored in the search result buffer file corresponding to the buffer file name.
ML text data is additionally written.

【０２３０】ステップ１６０４の処理の後又はステップ
１６０３の判定がＮＯの場合には、再びステップ１６０
１の判定処理が実行される。以上のようにして検索制御
部１１８は、文音声認識部１１７の場合と同様に、流れ
作業的に効率良く、複数の移動端末１０１からの要求に
基づいて文音声認識部１１７において得られた認識音声
文章データに対するデータベース検索処理を実行するこ
とができる。After the processing in step 1604 or when the determination in step 1603 is NO, step 160
1 is performed. As described above, similarly to the case of the sentence speech recognition unit 117, the search control unit 118 efficiently performs the workflow and recognizes the recognition obtained by the sentence speech recognition unit 117 based on the requests from the plurality of mobile terminals 101. It is possible to execute a database search process for voice sentence data.

【０２３１】次に、データベース検索処理を実現するた
めの１５０２〜１５０５の各部分の機能につき、以下に
説明する。検索インデックス作成部１５０２は、入力バ
ッファキュー１４０１から順次入力される“端末識別コ
ード”と認識音声文章データとの組のそれぞれについ
て、その組の“端末識別コード”に対応する処理端末登
録テーブルのエントリに記憶されている検索インデック
スバッファファイル名から得られる検索インデックスバ
ッファファイルを使用しながら、移動端末１０１別に認
識音声文章データを構成する単語を一定の基準に従って
分類したリストである検索インデックスを作成し、それ
を上記組の“端末識別コード”と共に検索キーワード抽
出部１５０３に出力する。具体的には、検索インデック
ス作成部１５０２は、例えば、その組の認識音声文章デ
ータを構成する例えば図１８に示されるような各単語の
出現回数をカウントすることにより、出現回数の大きい
順にリスト化された単語表である検索インデックスを例
えば図１９に示されるように作成する。この場合、検索
インデックスバッファファイルには、１つの移動端末１
０１から文音声認識／データベース検索処理の開始要求
コマンドが指定された以後の検索インデックスが蓄積さ
れており、その検索インデックスと今回入力された認識
音声文章データを構成する各単語とに基づいて、新たな
検索インデックスが作成され、それが検索インデックス
バッファファイルに蓄積される。このため、上記コマン
ドの指定以後に１つの移動端末１０１から入力された音
声に現れる単語が、一定の基準で、即ち例えば出現回数
の多い順で、検索インデックス上でリスト化されること
になる。なお、認識音声文章データには、例えば図１８
の“＊”として示されるように単語の区切り情報が含ま
れる。この単語の区切り情報は、文章バッファファイル
及び入力バッファキュー１５０１を介して、文音声認識
部１１７内の文章認識部１４０７から引き渡されるた
め、認識音声文章データ上での各単語の区切りは容易に
識別できる。Next, the function of each of the parts 1502 to 1505 for realizing the database search processing will be described below. For each set of the “terminal identification code” and the recognized speech text data that are sequentially input from the input buffer queue 1401, the search index creation unit 1502 creates an entry in the processing terminal registration table corresponding to the “terminal identification code” of the set. The search index buffer file obtained from the search index buffer file name stored in is used to create a search index, which is a list in which words constituting the recognized speech text data are classified for each mobile terminal 101 according to a predetermined criterion, It is output to the search keyword extraction unit 1503 together with the above-mentioned “terminal identification code”. More specifically, for example, the search index creation unit 1502 counts the number of appearances of each of the words constituting the set of recognized speech text data as shown in FIG. A search index, which is a word table obtained, is created, for example, as shown in FIG. In this case, one mobile terminal 1 is stored in the search index buffer file.
From 01, a search index after the command for starting the sentence speech recognition / database search process is designated is accumulated, and a new search index is stored on the basis of the search index and each word constituting the currently-recognized speech sentence data. A search index is created and stored in a search index buffer file. Therefore, words appearing in the voice input from one mobile terminal 101 after the designation of the command are listed on the search index according to a certain standard, that is, for example, in descending order of the number of appearances. Note that the recognized voice sentence data includes, for example, FIG.
The word delimiter information is included as indicated by “*”. Since the word delimiter information is passed from the text recognition unit 1407 in the text and speech recognition unit 117 via the text buffer file and the input buffer queue 1501, the delimitation of each word on the recognized voice text data is easily identified. it can.

【０２３２】次に、検索キーワード抽出部１５０３は、
検索インデックス作成部１５０２から出力される“端末
識別コード”と検索インデックスとの組のそれぞれにつ
き、その組の検索インデックス中で所定の基準を満たす
単語、例えば出現回数が一定回数以上の（又は一定の出
現回数順位以上の順位の）単語を抽出する。更に、検索
キーワード抽出部１５０３は、抽出された単語のうち、
不要キーワード辞書１５０４に登録されていない単語を
抽出し、更に、上記検索インデックスと共に検索インデ
ックス作成部１５０２から出力されている“端末識別コ
ード”に対応する処理端末登録テーブルのエントリに記
憶されている検索済キーワードバッファファイル名から
得られる検索済キーワードバッファファイルに登録され
ている検索済キーワード以外の単語を抽出し、それを検
索キーワードとして上記組の“端末識別コード”と共に
出力する。また、検索キーワード抽出部１５０３は、そ
の検索キーワードを、上記検索済キーワードバッファフ
ァイルに登録する。Next, the search keyword extraction unit 1503
For each set of the “terminal identification code” and the search index output from the search index creation unit 1502, a word that satisfies a predetermined criterion in the search index of the set, for example, the number of occurrences of which is equal to or more than a certain number (or a certain number) Extract words whose rank is equal to or higher than the rank of appearance frequency. Further, the search keyword extracting unit 1503 selects, among the extracted words,
A word that is not registered in the unnecessary keyword dictionary 1504 is extracted, and a search stored in an entry of the processing terminal registration table corresponding to the “terminal identification code” output from the search index creation unit 1502 together with the search index. Then, a word other than the searched keyword registered in the searched keyword buffer file obtained from the searched keyword buffer file name is extracted, and the extracted word is output as a search keyword together with the “terminal identification code” of the above set. The search keyword extraction unit 1503 registers the search keyword in the searched keyword buffer file.

【０２３３】不要キーワード辞書１５０４には、普通動
詞、形容詞、副詞、助動詞、助詞、接続詞、前置詞等の
単語が登録されている。この辞書が参照されることによ
り、無意味な単語がデータベース検索処理されることを
回避することができ、移動端末１０１に対して有意な検
索結果ＨＴＭＬ文章データのみを提供することができ
る。In the unnecessary keyword dictionary 1504, words such as ordinary verbs, adjectives, adverbs, auxiliary verbs, particles, conjunctions, and prepositions are registered. By referring to this dictionary, meaningless words can be prevented from being searched in the database, and only significant search result HTML text data can be provided to the mobile terminal 101.

【０２３４】また、検索済キーワードバッファファイル
には、１つの移動端末１０１から文音声認識／データベ
ース検索処理の開始要求コマンドが指定された以後にデ
ータベース検索処理された検索キーワードが登録されて
いる。このファイルが参照されることにより、同じ検索
キーワードが重複してデータベース検索処理されること
を回避することができる。[0234] In the searched keyword buffer file, search keywords that have been subjected to database search processing after one mobile terminal 101 has specified a sentence speech recognition / database search processing start request command are registered. By referring to this file, it is possible to prevent the same search keyword from being subjected to database search processing redundantly.

【０２３５】検索実行部１５０５は、検索キーワード抽
出部１５０３から出力される“端末識別コード”と検索
キーワードの組のそれぞれについて、その組の検索キー
ワードを用いて、インターネット１０５上の予め登録さ
れている特定のデータベース検索エンジンに対して、問
合せを依頼する。この場合、複数検索キーワードが例え
ばアンド結合又はオア結合されることによって問合せデ
ータが作成される。そして、この問合せデータは、上述
のデータベース検索エンジンが存在するインターネット
１０５に接続されるホスト装置上のＷｅｂサーバに対す
るＨＴＴＰの通信プロトコルに基づく要求データとし
て、ＴＣＰ／ＩＰパケットに格納されパケット送受信部
１１５（図１）を介して送信される。その結果、検索制
御部１１８は、インターネット１０５上の上記ホスト装
置からルータ装置１０６、ＬＡＮ１０７、及びパケット
送受信部１１５（図１）を介して返される検索結果に基
づいて、図２０に示されるような検索結果ＨＴＭＬ文章
データを生成し、それを上記組の“端末識別コード”と
共に出力バッファキュー１４０８に書き込む。＜他の実施の形態＞以上説明した実施の形態では、移動
端末１０１は、ＰＨＳ端末であって、移動端末１０１と
音声制御ホスト装置１０８とは、ＰＨＳ網１０３とイン
ターネット１０５を介して接続されている。しかし、本
発明は、これに限られるものではなく、無線又は有線に
よって間接的又は直接的に音声制御ホスト装置１０８に
接続される形態であれば、どのような形態であっても本
発明をそれに適用することができる。The search execution unit 1505 is registered in advance on the Internet 105 for each set of the “terminal identification code” and the search keyword output from the search keyword extraction unit 1503, using the set search keyword. Request a query from a specific database search engine. In this case, the inquiry data is created by, for example, AND-joining or OR-joining the plurality of search keywords. The inquiry data is stored in a TCP / IP packet as request data based on an HTTP communication protocol for a Web server on a host device connected to the Internet 105 where the above-described database search engine exists, and is stored in the packet transmission / reception unit 115 ( 1). As a result, based on the search result returned from the host device on the Internet 105 via the router device 106, the LAN 107, and the packet transmission / reception unit 115 (FIG. 1), the search control unit 118 as shown in FIG. The search result HTML text data is generated and written to the output buffer queue 1408 together with the above-mentioned “terminal identification code”. <Other Embodiments> In the embodiment described above, mobile terminal 101 is a PHS terminal, and mobile terminal 101 and voice control host device 108 are connected via PHS network 103 and Internet 105. I have. However, the present invention is not limited to this, and the present invention may be applied to any form connected to the voice control host device 108 indirectly or directly by wireless or wired. Can be applied.

【０２３６】また、本実施の形態では、検索制御部１１
８によって検索されるデータベース検索エンジンは、イ
ンターネット１０５に接続されるホスト装置上のＷｅｂ
サーバが管理するものであるが、本発明はこれに限られ
るものではなく、例えば音声制御ホスト装置１０８内又
はＬＡＮ１０７に接続される他のホスト装置内に検索キ
ーワードに対するホームページ情報を格納したローカル
なデータベースを構築し、検索制御部１１８はそれにア
クセスして検索結果を得るように構成されてもよい。In this embodiment, the search control unit 11
The database search engine searched by the Web server 8 is a Web search engine on a host device connected to the Internet 105.
Although managed by the server, the present invention is not limited to this. For example, a local database storing home page information for a search keyword in the voice control host device 108 or another host device connected to the LAN 107 And the search control unit 118 may be configured to access it and obtain search results.

【０２３７】[0237]

【発明の効果】本発明によれば、移動端末は、高度な音
声認識／データベース検索／リソースアクセス環境を設
備する必要がなく実用的な精度を有する音声認識／デー
タベース検索機能の提供を低コストで受けることが可能
となる。According to the present invention, a mobile terminal can provide a speech recognition / database search function having practical accuracy without providing a sophisticated speech recognition / database search / resource access environment at a low cost. It is possible to receive.

【０２３８】また、本発明によれば、現在全国的及び全
世界的に普及しつつあるパーソナルハンディホンシステ
ム通信網及びインターネットを経由することにより、実
用的な精度を有する音声認識機能と、ワールドワイドな
データベース検索、及びその検索結果に対応するリソー
スへのアクセス機能の提供を、より低コスト及び手軽に
受けることができると同時に、本発明が提供する機能と
パーソナルハンディホンシステム通話機能及びインター
ネットアクセス機能とを、シームレスに結合することが
可能となる。Further, according to the present invention, by using a personal handyphone system communication network and the Internet, which are currently spreading nationwide and worldwide, a speech recognition function having practical accuracy and a world wide A simple database search and a function of accessing a resource corresponding to the search result can be provided at a lower cost and easily, and at the same time, a function provided by the present invention, a call function of a personal handyphone system, and an Internet access function Can be seamlessly combined.

【０２３９】更に、本発明によれば、移動端末と音声制
御ホスト装置とを全世界的に容易に特定できると共に、
音声認識／データベース検索処理サービスと、その検索
結果に基づくリソースへのアクセスサービス、及びその
他の情報処理サービスとの共存を容易に実現することが
可能となる。Further, according to the present invention, a mobile terminal and a voice control host device can be easily specified worldwide,
It is possible to easily realize a speech recognition / database search processing service, a resource access service based on the search result, and other information processing services.

【０２４０】加えて、本発明によれば、ホスト装置側の
負荷分散を容易に実現することが可能となる。In addition, according to the present invention, it is possible to easily realize load distribution on the host device side.

[Brief description of the drawings]

【図１】全システム構成図である。FIG. 1 is an overall system configuration diagram.

【図２】移動端末の外観図である。FIG. 2 is an external view of a mobile terminal.

【図３】移動端末の機能ブロック図である。FIG. 3 is a functional block diagram of a mobile terminal.

【図４】移動端末の処理の全体動作フローチャートであ
る。FIG. 4 is an overall operation flowchart of processing of a mobile terminal.

【図５】送信処理の動作フローチャートである。FIG. 5 is an operation flowchart of a transmission process.

【図６】通信データのフォーマット図である。FIG. 6 is a format diagram of communication data.

【図７】ＩＰヘッダとＴＣＰヘッダのフォーマット図で
ある。FIG. 7 is a format diagram of an IP header and a TCP header.

【図８】ＰＰＰを用いた発信処理の動作フローチャート
である。FIG. 8 is an operation flowchart of a calling process using PPP.

【図９】移動端末通信制御部の動作フローチャート（そ
の１）である。FIG. 9 is an operation flowchart (part 1) of a mobile terminal communication control unit.

【図１０】移動端末通信制御部の動作フローチャート
（その２）である。FIG. 10 is an operation flowchart (part 2) of the mobile terminal communication control unit.

【図１１】移動端末通信制御部の動作フローチャート
（その３）である。FIG. 11 is an operation flowchart (part 3) of the mobile terminal communication control unit.

【図１２】処理端末登録テーブルのデータ構成図であ
る。FIG. 12 is a data configuration diagram of a processing terminal registration table.

【図１３】文音声認識部の構成図である。FIG. 13 is a configuration diagram of a sentence speech recognition unit.

【図１４】文音声認識部内の入出力制御部の動作フロー
チャートである。FIG. 14 is an operation flowchart of an input / output control unit in the sentence speech recognition unit.

【図１５】検索制御部の構成図である。FIG. 15 is a configuration diagram of a search control unit.

【図１６】検索制御部内の入出力制御部の動作フローチ
ャートである。FIG. 16 is an operation flowchart of an input / output control unit in the search control unit.

【図１７】ＰＨＳ会話内容の例を示す図である。FIG. 17 is a diagram illustrating an example of PHS conversation content.

【図１８】認識音声文章データの例を示す図である。FIG. 18 is a diagram illustrating an example of recognized speech sentence data.

【図１９】検索インデックスの例を示す図である。FIG. 19 is a diagram illustrating an example of a search index.

【図２０】検索結果ＨＴＭＬ文章データの例を示す図で
ある。FIG. 20 is a diagram illustrating an example of search result HTML text data.

【図２１】検索結果ＨＴＭＬ文章データの表示画面例を
示す図である。FIG. 21 is a diagram illustrating a display screen example of search result HTML text data.

【図２２】ハイパーリンク先のホームページの表示画面
例（その１）を示す図である。FIG. 22 is a diagram showing an example (part 1) of a display screen of a homepage at a hyperlink destination.

【図２３】ハイパーリンク先のホームページの表示画面
例（その２）を示す図である。FIG. 23 is a diagram showing an example (part 2) of a display screen of a homepage at a hyperlink destination.

[Explanation of symbols]

１０１移動端末１０２無線基地（有線接続装置）１０３ＰＨＳ網（公衆電話網、ＩＳＤＮ網）１０４移動端末制御ホスト装置１０５インターネット１０６ルータ装置１０７ＬＡＮ（ローカルエリアネットワーク）１０８音声制御ホスト装置１０９入力部１１０制御部１１１通信部１１２出力部１１３接続確立部１１４ルーティング部１１５パケット送受信部１１６移動端末通信制御部１１７文音声認識部１１８検索制御部２０１、３０１マイク２０２、３０４カメラ（ＣＣＤカメラ）２０３、３１１ＬＣＤ表示部２０４、３０８スピーカ２０５、３２３無線アンテナ２０６、３２５ソケット（通信用）２０７ＩＣカードスロット２０８光送受信機（光通信用）３０２、３０５Ａ／Ｄ変換部３０３マイク制御部３０６、３１３メモリ３０７カメラ制御部３０９Ｄ／Ａ変換部３１０スピーカ制御部３１２ＬＣＤドライバ３１４ＬＣＤ制御部３１５タッチパネル制御部３１６ＣＰＵ３１７ＲＡＭ３１８ＲＯＭ３１９ＩＣカードインタフェース部３２０ＩＣカード３２１通信制御部３２２無線ドライバ３２４有線ドライバ１４０１、１５０１入力バッファキュー１４０２音声区間検出部１４０３音声分析部１４０４音素認識部１４０５音素標準パターン辞書１４０６単語認識部１４０７文章認識部１４０８、１５０６出力バッファキュー１４０９、１５０７入出力制御部１５０２検索インデックス作成部１５０３検索キーワード抽出部１５０４不要キーワード辞書１５０５検索実行部 Reference Signs List 101 mobile terminal 102 wireless base (wired connection device) 103 PHS network (public telephone network, ISDN network) 104 mobile terminal control host device 105 internet 106 router device 107 LAN (local area network) 108 voice control host device 109 input unit 110 control Unit 111 communication unit 112 output unit 113 connection establishment unit 114 routing unit 115 packet transmission / reception unit 116 mobile terminal communication control unit 117 sentence speech recognition unit 118 search control unit 201, 301 microphone 202, 304 camera (CCD camera) 203, 311 LCD display Units 204, 308 Speakers 205, 323 Wireless antenna 206, 325 Socket (for communication) 207 IC card slot 208 Optical transceiver (for optical communication) 302, 305 A / D conversion unit 303 Microphone Control unit 306, 313 memory 307 Camera control unit 309 D / A conversion unit 310 Speaker control unit 312 LCD driver 314 LCD control unit 315 Touch panel control unit 316 CPU 317 RAM 318 ROM 319 IC card interface unit 320 IC card 321 Communication control unit 322 Wireless driver 324 Wired driver 1401, 1501 Input buffer queue 1402 Voice section detection unit 1403 Voice analysis unit 1404 Phoneme recognition unit 1405 Phoneme standard pattern dictionary 1406 Word recognition unit 1407 Text recognition unit 1408, 1506 Output buffer queue 1409, 1507 Input / output control unit 1502 search index creation unit 1503 search keyword extraction unit 1504 unnecessary keyword dictionary 1505 search execution unit

───────────────────────────────────────────────────── フロントページの続き (51)Int.Cl.⁶ 識別記号ＦＩＨ０４Ｍ 11/08 Ｈ０４Ｂ 7/26 １０９Ｍ ──────────────────────────────────────────────────の Continued on the front page (51) Int.Cl. ⁶ Identification code FI H04M 11/08 H04B 7/26 109M

Claims

[Claims]

1. A communication system in which a mobile terminal communicates with a host device, wherein the mobile terminal indirectly or via a relay network comprising one or both of a wireless network and a wired network. A host connecting means for directly connecting to the voice control host device, which is the host device, without going through a relay network; a voice input means for inputting voice; and a connection operation by the host connecting means, after the voice input means, Voice data transmission means for transmitting input voice data to the voice control host device; and search result HTML text returned from the voice control host device and receiving search result HTML text data described in hypertext markup language HTML. Data receiving means, and search result HTML text data for displaying and processing the received search result HTML text data Display / processing means, indirectly via the relay network or via the relay network to resources on the host device which are included in the displayed search result HTML text data and correspond to the access information specified by the user. Resource access processing means for directly accessing the mobile terminal without processing the resource, in the voice control host device, in response to a connection operation by a host connection means in the mobile terminal, Mobile terminal connection means for identifying and connecting to each other, voice data receiving means for receiving the voice data for each currently connected mobile terminal, and voice data receiving means for each currently connected mobile terminal. Voice recognition means for performing voice recognition processing on received voice data, and voice recognition processing by the voice recognition means for each currently connected mobile terminal. Search control means for extracting a search keyword from the recognized voice data obtained by the search and searching the search result HTML text data including the access information for the resource corresponding to the search keyword from a predetermined database system; For each mobile terminal, the search result HTML text data obtained by the search processing by the search control means is returned to the corresponding mobile terminal by the search result H.
A mobile terminal voice recognition / database search / resource access communication system, comprising: TML text data return means.

2. The mobile terminal used in a communication system in which the mobile terminal communicates with a host device, wherein the mobile terminal is indirectly or via a relay network including one or both of a wireless network and a wired network. Host connection means for directly connecting to the audio control host device which is the host device without passing through the relay network; voice input means for inputting voice; and after the connection operation by the host connection means, the voice input means Voice data transmitting means for transmitting voice data input from the voice control host device to the voice control host device; and search result HTML for receiving search result HTML text data returned from the voice control host device and described in hypertext markup language HTML Sentence data receiving means, and a search result HTML sentence for displaying and processing the received search result HTML sentence data Data display / processing means, and indirectly via the relay network or the relay network to resources on the host device included in the displayed search result HTML text data and corresponding to the access information specified by the user. And a resource access processing means for directly accessing the resource without using the resource and processing the resource.

3. The host device used in a communication system in which a mobile terminal communicates with a host device, wherein the host device is indirectly or via a relay network including one or both of a wireless network and a wired network. Mobile terminal connection means for identifying and connecting to the mobile terminal in response to a connection operation performed by the mobile terminal directly without passing through the relay network; and voice data for each currently connected mobile terminal. Voice data receiving means for receiving voice data, voice recognition means for performing voice recognition processing on voice data received by the voice data receiving means, for each currently connected mobile terminal, For each terminal, a search keyword is extracted from recognition voice data obtained by voice recognition processing by the voice recognition means, and access information for a resource corresponding to the search keyword is extracted. Search control means for searching a predetermined database system for search result HTML text data described in a hypertext markup language HTML, and a search obtained by the search processing by the search control means for each currently connected mobile terminal. A search result H which returns the result HTML text data to the corresponding mobile terminal.
A voice control host device comprising: a TML text data returning means.

4. The mobile terminal has a personal handyphone system communication function, the relay network includes a personal handyphone system communication network and the Internet, and the voice control host device and the host device corresponding to the access information are: Connecting to the Internet, the host connection means or the resource access processing means in the mobile terminal, via the personal handyphone system communication network, between the public network including the personal handyphone system communication network and the Internet The voice control host device or the access information is transmitted from the mobile terminal control host device via the Internet using the communication protocol on the Internet by transmitting and connecting to the mobile terminal control host device having a gateway function of Corresponding to The mobile terminal voice recognition / database search / resource access communication system, mobile terminal, or voice control host device according to any one of claims 1 to 3, wherein the mobile terminal is connected or accessed to a host device.

5. The communication protocol used by the host connection means is a hierarchical protocol including an Internet protocol layer and a transmission control protocol layer, and the Internet protocol is packet data of the Internet protocol layer transmitted on the Internet. In the header field of the datagram, a source Internet protocol address and a destination Internet protocol address that specify the addresses of the mobile terminal and the voice control host device on the Internet are stored, and in the data field of the Internet protocol datagram, Stores a transmission control protocol segment which is packet data of the transmission control protocol layer. In the header field of the transmission control protocol segment, a source port number and a destination port number that specify a communication protocol for the voice recognition / database search processing are stored. In a data field of the transmission control protocol segment, A terminal identification code for identifying the mobile terminal, the voice data, or the search result HTML
The mobile terminal voice recognition / communication according to claim 4, wherein text data is stored.
Database search / resource access communication system, mobile terminal, or voice control host device.

6. The voice control host device is mutually connected by a network, and the mobile terminal connection means, the voice data receiving means, the voice recognition means, the database means, the search control means, and the search result HTML. The mobile terminal voice recognition / database according to any one of claims 1 to 3, further comprising a plurality of host computers that realize functions distributed to the sentence data return means in a distributed manner. Search / resource access communication system or voice control host device.

7. The search control unit creates a search index by classifying recognized speech data obtained by speech recognition processing by the speech recognition unit according to a predetermined classification rule for each of currently connected mobile terminals. A search index creating unit that extracts a phrase that satisfies a predetermined extraction criterion from a search index created by the search index creation unit for each currently connected mobile terminal, and extracts a predetermined unnecessary keyword from the extracted phrase. Search keyword extracting means for extracting a new one of the resulting phrases as a search keyword, and search executing means for searching the predetermined database system for search result HTML text data corresponding to the search keyword. 7. The method according to claim 1, further comprising: Mobile device voice recognition according to one of claims / database search /
Resource access communication system or voice control host device.

8. The search index creating means creates the search index by classifying each word appearing in the input data in the order of the number of appearances, and the search keyword extracting means includes: A word whose appearance number is equal to or more than a predetermined number or a word whose rank is equal to or more than a predetermined appearance number rank, removes a predetermined unnecessary keyword from the extracted words, and replaces a new word among the words obtained as a result. 8. The mobile terminal voice recognition / database search / resource access communication system or voice control host device according to claim 7, wherein the voice terminal is extracted as a search keyword.

9. The mobile terminal according to claim 4, wherein the predetermined database system is provided by a predetermined host device connected to the Internet.
Database search / resource access communication system or voice control host device.