JPH11252283A

JPH11252283A - Portable information terminal, control method for the portable information terminal and storage medium

Info

Publication number: JPH11252283A
Application number: JP10054545A
Authority: JP
Inventors: Atsushi Katayama; 敦之片山
Original assignee: Canon Inc
Current assignee: Canon Inc
Priority date: 1998-03-06
Filing date: 1998-03-06
Publication date: 1999-09-17

Abstract

PROBLEM TO BE SOLVED: To enable a portable information terminal to recognize, through a simple operation, voice information obtained from the other party to a call, and to store that information in a reusable form. SOLUTION: The terminal 101 conducts a call with a voice path control unit 103 connecting a PHS radio control unit 104, a PHS radio unit 105, a speaker 106 and a microphone 107. When the operator operates a key switch 111, the voice path control unit 103 switches the output destination of the other party's voice signal from the speaker 106 to a voice recognition unit 108, which starts recognizing the other party's voice. The timing at which storage of the recognition result in a storage unit 109 begins is determined by detecting prescribed voice information in the voice signal. The end timing of voice recognition is determined either by a second operation of the key switch 111, performed by the operator after viewing the displayed recognition result, or by detecting prescribed voice information in the voice signal.

Description

DETAILED DESCRIPTION OF THE INVENTION

[0001]

[Technical Field of the Invention] The present invention relates to a portable information terminal having a voice communication function and a voice recognition function, to a control method for the terminal, and to a storage medium storing a control program for the terminal.

[0002]

[Prior Art] In recent years, with the arrival of an advanced information society, a wide variety of information processing devices have been developed and the communication networks that carry their information are being built out. Among communication devices such as computers, telephones, and modems, various new kinds of equipment have come into use.

[0003] In particular, mobile phones (including PHS) are spreading rapidly as portable information communication devices. A mobile phone lets the user communicate by voice with a distant party over a wireless link, and as mobile phones spread, the public wireless telephone network is being built out and expanded as social infrastructure.

[0004] Among information processing devices, handy terminals and notebook and sub-notebook personal computers are also spreading rapidly. These devices have shrunk markedly, and portable terminals that are no burden to carry at all times have been commercialized.

[0005] Products that integrate such mobile phones and portable terminals have recently begun to be commercialized. Besides products that simply house the communication and information processing functions in a single housing, various other forms exist, for example supplying a telephone, modem, or communication adapter in the form of a PC card.

[0006]

[Problems to Be Solved by the Invention] Portable terminals currently on the market assume operation and data input by pen or key. Operating the terminal or entering data therefore requires both hands to be free: one hand holds the terminal and the other performs the input.

[0007] This means that the terminal cannot be operated, and data cannot be entered, unless both hands are free. In other words, although a portable terminal can be carried anywhere and users want to operate it and enter data at any time, the situations in which operation and data input are actually possible are limited.

[0008] In view of this problem, a voice input method can be considered. When the operator speaks to the terminal, the terminal recognizes and interprets the input voice and treats it as an operation command or as data. This allows the terminal to be operated and data to be entered even when the operator has only one hand free.

[0009] However, even when voice input is adopted, the following problems remain.

[0010] 1) The operator's voice input allows the portable terminal to be operated and data to be entered. However, when the device also has a communication function, for example a call function, the prior art cannot recognize the voice of the other party during a call. Suppose the user hears from the other party the address and telephone number of a person the user wanted to know about. The user must either memorize them or write them on paper, then launch the address book software and enter the data by pen or key input, or by voice recognition of the operator's own voice as described above. This double effort makes the input operation very cumbersome.

[0011] 2) One conceivable way to solve problem 1) is to recognize the other party's voice during the call and store the voice recognition data as data of the portable terminal. Ideally the voice would be recognized while still being output to the speaker, but the hardware of a portable terminal limits such real-time processing, so the voice would have to be stored first and then output to the speaker. As a result, in a portable information terminal, where miniaturization matters most, the added components make the terminal larger.

[0012] 3) Many portable terminals have no keyboard and use a touch panel integrated with the display. Many also integrate the audio output speaker into the main body to simplify the construction. With such a configuration, and especially with a touch panel, the user must look at the touch panel to operate it. At the moment of starting or ending voice recognition, the user must take the speaker away from the ear, hold the terminal in one hand, and enter input with the other, busily shifting the device between grips. Having to take the device away from the ear is itself a nuisance, and one-handed operation is not easy.

[0013] 4) Conventionally, once voice recognition started, all of the recognition result data was stored in the storage means. This requires a large storage capacity or, where storage capacity is limited, reduces the amount of data that can be processed.

[0014] 5) A user interface that gives commands to the device by voice input is convenient, but it is difficult to combine with processing that recognizes the other party's voice. Specifically, at the end of voice recognition, the other party would indicate by voice to the portable terminal that they have finished speaking the data to be recognized, the terminal would recognize that speech and show the recognition result on the display unit, and the operator would then look at the display and end voice recognition. Such a procedure risks making the user's operation more complicated rather than simpler.

[0015] 6) Connecting to the telephone network by wire is conceivable, but it undermines the convenience that characterizes a portable terminal, namely being able to make a call anytime and anywhere.

[0016] An object of the present invention is to solve the above problems and to provide a portable information terminal that can be built simply and inexpensively and that, through a simple operation, recognizes voice information obtained from the other party to a call and stores it in a reusable form.

[0017]

[Means for Solving the Problems] To solve problem 1, the present invention adopts a configuration in which the start and/or end timing of voice signal recognition by the voice recognition means is determined according to input from a predetermined operation means, the other party's voice data is recognized, and the resulting voice recognition data is stored.

[0018] To solve problem 2, the present invention adopts a configuration in which, when recognition of the other party's voice starts during a call, the output destination of the voice data input from the line connection means is switched from the voice output means to the voice recognition means, so that the call audio is not output to the voice output means.

[0019] To solve problem 3, the present invention adopts a configuration in which, when the other party's voice is recognized during a call and the recognition result data is stored as data of the portable terminal, a key switch is used as the operation means that determines the recognition start timing of the voice signal.

[0020] To solve problem 4, the present invention adopts a configuration in which, after recognition of the voice signal has started, the start timing of the process of storing the voice recognition means' recognition result in the storage means is determined by detecting predetermined voice information in the received voice signal.

[0021] To solve problem 5, the present invention adopts a configuration in which, after recognition of the voice signal has started, the end timing of the process of storing the voice recognition means' recognition result in the storage means is determined by detecting predetermined voice information in the received voice signal.

[0022] To solve problem 6, the present invention adopts a configuration in which the connection to the telephone network is made by a wireless communication system.

[0023]

[Embodiments of the Invention] The present invention is described in detail below on the basis of the embodiments shown in the drawings. The following shows an example of a portable information terminal having a communication function for communicating (voice calls, facsimile, data communication, and so on) over a PHS line.

[0024] (Configuration of the portable information terminal) First, the configuration of the portable information terminal of this embodiment is described. FIG. 1 shows the configuration of the portable information terminal 101. In the figure, reference numeral 102 denotes a system main control unit that controls the operation of the entire portable information terminal of the present invention. The system main control unit 102 consists mainly of a microprocessor and its peripheral chips.

[0025] Reference numeral 103 denotes a voice path control unit. It outputs voice input from the microphone 107 to the PHS radio control unit 104, and outputs data received by the PHS radio unit 105 and voice input from the microphone 107 to the speaker 106. In addition, when the key switch for voice recognition is pressed, it switches the destination of the call voice data received from the PHS radio unit 105 from the speaker to the voice recognition unit.

[0026] Reference numeral 104 denotes a PHS radio control unit that performs input and output of communication data by accessing the PHS wireless telephone network, as well as protocol control. The PHS radio control unit 104 functions as a PHS line interface together with the PHS radio unit 105 described next.

[0027] Reference numeral 105 denotes a PHS radio unit. It receives PHS radio waves from the PHS wireless telephone network and passes the received data to the PHS radio control unit 104, and it converts transmission data sent from the PHS radio control unit 104 into PHS radio waves and transmits them to the PHS wireless telephone network. The PHS radio unit 105 consists of a known transmitting and receiving circuit.

[0028] Reference numeral 106 denotes a speaker, which outputs the electrical voice signal passed from the voice path control unit 103 as audible sound. Reference numeral 107 denotes a microphone, which inputs the operator's voice during a call as an electrical signal.

[0029] Reference numeral 108 denotes a voice recognition unit. It recognizes the operator's live voice input from the microphone 107 (when a function such as a voice memo is implemented) and, when the key switch 111 described later is pressed, recognizes the voice during a call and generates voice recognition data as the recognition result. The voice recognition unit 108 consists of an analog amplifier circuit, A/D and D/A converters, an interface circuit for exchanging data with the system main control unit 102, and so on.

[0030] Reference numeral 109 denotes a storage unit. It is used as a storage area for the various applications of the portable information terminal 101, that is, PIM (Personal Information Manager) software such as an address book and schedule, telephone and e-mail software, extension software and other programs, together with voice recognition result data and user data, and as a working data area for each program. The storage unit 109 consists of elements such as ROM and RAM and a built-in or external storage device, and is also used to store the control program according to the present invention described later.

[0031] Reference numeral 110 denotes a command input unit through which the operator normally enters the various commands used to operate the portable terminal. The command input unit 110 consists of various input devices such as a keyboard, a touch panel that works together with the display unit 112 described later, a mouse, a glide point, a track point, and a joystick.

[0032] Reference numeral 111 denotes a key switch for determining the start and end timing of recognition of the voice during a call. In this embodiment, operating the key switch 111 conveys at least the start timing of recognition of the call voice to the system main control unit 102.

[0033] Reference numeral 112 denotes a display unit that displays the software in the portable terminal, command input results, and voice recognition results. The display unit 112 consists of a display device such as an LCD.

[0034] Reference numeral 113 denotes a power supply unit that supplies power to the portable information terminal from a battery or an AC adapter.

[0035] (Detailed description of operation) Two embodiments of operation on the above hardware configuration are described below.

[0036] [Embodiment 1] First, an embodiment that solves problems 1) to 4) and 6) above is described. This embodiment concerns the operation of the portable information terminal 101 when it recognizes call voice and stores the voice recognition data as data of the portable terminal. FIG. 2 shows the control procedure performed by the system main control unit 102 in that case. The illustrated procedure is stored in the storage unit 109 as a program for the system main control unit 102. FIG. 2 shows only the procedure for voice communication.

[0037] First, the operation up to the point where the portable information terminal is in a call is described (steps S201 to S208).

[0038] When the portable terminal is powered on (step S201), it is next determined whether a call is to be placed (outgoing call) (step S202). To place a call, the operator presses the telephone button of the command input unit 110, which starts the telephone software stored in the storage unit 109 (step S203).

[0039] When the operator enters a telephone number from the command input unit 110 (step S204), the terminal originates a call to the desired party by the PHS wireless communication system via the PHS radio control unit 104 and the PHS radio unit 105 (step S205). When the other party goes off-hook, the call is in progress (step S206).

[0040] When no call is to be placed (the selected operation is not the telephone function), the procedure moves to step S207 and determines whether a call has arrived (incoming call) by the PHS wireless communication system. If no call has arrived, the procedure loops back to step S202 and waits, as above, for an incoming or outgoing call to be triggered.

[0041] When a call arrives, the operator goes off-hook from the command input unit 110 and answers the call (step S208). Off-hook is detected from operation of a predetermined switch of the command input unit 110 or, when the speaker 106 and the microphone 107 are configured as a handset, by literally detecting that the handset has been lifted off the hook.

[0042] Either of the above operations starts the call (step S206). During the call, the voice path control unit 103 outputs voice input from the microphone 107 to the PHS radio control unit 104, and outputs data received by the PHS radio unit 105 and voice input from the microphone 107 to the speaker 106. The call is carried out in this way.
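The branch structure of steps S202 to S208 can be sketched in a few lines of Python. This is an illustrative reading of the flow described in the text, not the program of FIG. 2 itself; the function name `call_setup_steps` and its two boolean parameters are hypothetical.

```python
from typing import List


def call_setup_steps(want_to_dial: bool, incoming_call: bool) -> List[str]:
    """Return the step labels visited in one pass starting at S202.

    A result that does not end in "S206" means no call was set up and the
    procedure would loop back to S202.
    """
    steps = ["S202"]                       # decide whether to place a call
    if want_to_dial:
        steps += ["S203", "S204", "S205"]  # start phone software, enter number, originate
        steps.append("S206")               # other party goes off-hook: call in progress
    else:
        steps.append("S207")               # check for an incoming call
        if incoming_call:
            steps += ["S208", "S206"]      # operator goes off-hook, call in progress
    return steps


outgoing = call_setup_steps(want_to_dial=True, incoming_call=False)
incoming = call_setup_steps(want_to_dial=False, incoming_call=True)
idle = call_setup_steps(want_to_dial=False, incoming_call=False)
```

Both the outgoing and the incoming branch end at S206, which matches the statement that either operation starts the call.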

[0043] Connecting to the telephone network by the PHS wireless communication system in this way means that the connection to the public network is made wirelessly, so the user can connect and talk anytime and anywhere.

[0044] At this point the portable information terminal is in a call. Next, the operation of recognizing the call voice and storing the recognition result data (step S209 onward) is described. The call voice includes the operator's own voice, and the configuration of this embodiment can be used as is for recognizing the operator's voice. The following, however, takes as its example the case of asking the other party for an address or telephone number and entering it as data.

[0045] When, during a call, the operator of the device wants to enter an address or telephone number into the portable information terminal 101 as data (step S209), the operator presses the voice recognition key switch 111 so that the other party's voice is recognized (step S210). The address or telephone number need not be the other party's own; it may be that of a third person known to the other party.

[0046] When the voice recognition key switch 111 is pressed, the voice path control unit 103 switches the output path of the voice data sent from the PHS radio control unit 104 from the speaker 106 to the voice recognition unit 108 (step S211).

[0047] As a result, voice data is no longer passed to the speaker 106, and no sound is output from the speaker. The voice data can therefore be input directly to the voice recognition unit 108 and recognized without first being stored. No extra storage means for holding voice data is needed, and the portable information terminal can be made simple, inexpensive, small, and light.
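The path switch of step S211 can be illustrated with a small Python sketch. It assumes received audio arrives as discrete frames and models the speaker and the recognizer as plain callables; the names `Route`, `VoicePathController`, `switch_to` and `on_received_frame` are hypothetical and are not taken from the patent.

```python
from enum import Enum
from typing import Callable, Dict, List


class Route(Enum):
    SPEAKER = "speaker"
    RECOGNIZER = "recognizer"


class VoicePathController:
    """Forwards each received frame to exactly one sink and keeps no copy."""

    def __init__(self, to_speaker: Callable[[bytes], None],
                 to_recognizer: Callable[[bytes], None]) -> None:
        self._sinks: Dict[Route, Callable[[bytes], None]] = {
            Route.SPEAKER: to_speaker,
            Route.RECOGNIZER: to_recognizer,
        }
        self.route = Route.SPEAKER          # normal call: audio goes to the speaker

    def switch_to(self, route: Route) -> None:
        self.route = route                  # corresponds to the switch in step S211

    def on_received_frame(self, frame: bytes) -> None:
        self._sinks[self.route](frame)      # passed on immediately, nothing buffered


speaker_frames: List[bytes] = []
recognizer_frames: List[bytes] = []
path = VoicePathController(speaker_frames.append, recognizer_frames.append)

path.on_received_frame(b"frame-1")          # heard by the operator
path.switch_to(Route.RECOGNIZER)            # key switch pressed
path.on_received_frame(b"frame-2")          # recognized, not heard
```

Because a frame goes to one sink or the other, the controller itself holds no audio buffer, which is the property the text relies on to avoid extra storage.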

[0048] Voice recognition by the voice recognition unit 108 thus starts (step S212).

[0049] The voice recognition unit 108 detects the data that triggers the start of storage of the recognition result and notifies the system main control unit 102 of that timing. From that point, the system main control unit 102 begins storing in the storage unit 109 the recognized data, such as an address or telephone number, converted into character codes or the like. (Alternatively, the voice recognition unit 108 may simply begin outputting recognition results from the moment it detects the data that triggers the start of storage.)

[0050] The data that triggers the start of storage may be a predetermined word in the other party's voice, for example a spoken word that marks the start of address data, such as "Tokyo" (東京都) or "postal code" (郵便番号), or one that marks the start of a telephone number, such as "03". Alternatively, the start may be detected simply from the amplitude level of the voice exceeding a predetermined threshold.
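Both kinds of trigger can be sketched in Python. The sketch assumes the recognizer yields a sequence of recognized words and that audio is available as frames of floating-point samples; it uses RMS as the amplitude measure, which is one possible choice the text does not specify. The names `START_WORDS`, `keyword_start_index`, `rms` and `first_frame_above` are hypothetical.

```python
import math
from typing import Iterable, Optional, Sequence

# English renderings of the example trigger words given in the text.
START_WORDS = frozenset({"Tokyo", "postal code", "03"})


def keyword_start_index(words: Iterable[str],
                        start_words: frozenset = START_WORDS) -> Optional[int]:
    """Index of the first recognized word that is a start trigger, else None."""
    for i, word in enumerate(words):
        if word in start_words:
            return i
    return None


def rms(frame: Sequence[float]) -> float:
    """Root-mean-square level of one frame (0.0 for an empty frame)."""
    if not frame:
        return 0.0
    return math.sqrt(sum(s * s for s in frame) / len(frame))


def first_frame_above(frames: Iterable[Sequence[float]],
                      threshold: float) -> Optional[int]:
    """Index of the first frame whose RMS level exceeds the threshold, else None."""
    for i, frame in enumerate(frames):
        if rms(frame) > threshold:
            return i
    return None


word_hit = keyword_start_index(["well", "it", "is", "Tokyo", "Chiyoda-ku"])
level_hit = first_frame_above([[0.0, 0.01], [0.5, -0.5], [0.0, 0.0]], threshold=0.1)
```

The threshold value and the sample words are made up for the example; a real device would tune both.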

[0051] The system main control unit 102 recognizes the storage start notification from the voice recognition unit 108 and thereafter stores the voice recognition result data output from the voice recognition unit 108 in the storage unit 109 (step S214). Because recognition results begin to be stored only in response to predetermined data in the other party's voice signal or to the state of that signal, less capacity is needed in the storage unit 109.
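The storage saving can be seen in a minimal Python sketch of gated storage. It assumes the recognizer delivers one word at a time and that the trigger word itself is kept as part of the stored data; that second point is an assumption, since the text does not say whether the trigger is stored. The class name `GatedRecorder` and the sample words are hypothetical.

```python
from typing import Iterable, List


class GatedRecorder:
    """Stores recognized words only after a start trigger has been seen."""

    def __init__(self, start_words: Iterable[str]) -> None:
        self.start_words = set(start_words)
        self.recording = False
        self.stored: List[str] = []

    def on_recognized(self, word: str) -> None:
        if not self.recording and word in self.start_words:
            self.recording = True           # plays the role of the storage start notification
        if self.recording:
            self.stored.append(word)        # corresponds to storing in step S214


recorder = GatedRecorder({"Tokyo"})
spoken = ["um", "let", "me", "see", "Tokyo", "Chiyoda-ku", "1-1"]
for w in spoken:
    recorder.on_recognized(w)
```

Here four of the seven recognized words are never stored, which is the effect the paragraph describes.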

[0052] The end timing of voice recognition is determined by pressing the key switch 111. In this embodiment, however, the speaker 106 is not used for monitoring, as described above, so the end of the other party's speech to be recognized must be detected in some other way.

[0053] In this embodiment, the operator judges the end of recognition by watching the recognition result shown on the display unit 112.

[0054] That is, the recognition result data from the voice recognition unit 108 is sent to the display unit 112 under the control of the system main control unit 102 and displayed (step S215).

[0055] Watching this recognition result on the display unit 112, the operator recognizes that the other party has finished saying the address and telephone number (step S216) and presses the key switch 111 to end voice recognition (step S217). As is clear from the loop from step S216 back to step S213, the recognition result data sent from the voice recognition unit 108 is displayed as it arrives. The operator can judge when recognition of the data is complete by watching the display.

[0056] When the key switch 111 is pressed, the system main control unit 102 restores the voice data transmission path to its original route (PHS radio control unit 104 → voice path control unit 103 → speaker 106), so that the other party's voice is again output from the speaker (step S218).
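The two presses of the key switch 111 behave like a toggle, which the following Python sketch makes explicit. It is a simplified model under the assumption that a single handler receives every press; the class name `RecognitionSession`, its attributes and the event labels are hypothetical.

```python
from typing import List


class RecognitionSession:
    """Toggles between normal call audio and recognition on each key press."""

    def __init__(self) -> None:
        self.route = "speaker"              # normal call path
        self.recognizing = False
        self.events: List[str] = []

    def on_key_switch(self) -> None:
        if not self.recognizing:
            self.route = "recognizer"       # S211: redirect the received voice
            self.recognizing = True         # S212: recognition starts
            self.events.append("start")
        else:
            self.recognizing = False        # S217: operator ends recognition
            self.route = "speaker"          # S218: original path restored
            self.events.append("end")


session = RecognitionSession()
session.on_key_switch()                     # first press
route_after_first = session.route
session.on_key_switch()                     # second press
route_after_second = session.route
```

A single physical key suffices because the meaning of a press depends only on whether recognition is currently running.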

[0057] Starting and ending voice recognition with the key switch 111 in this way allows voice recognition on the portable information terminal to be started and ended with one hand. As described above, the key switch 111 serves as part of the trigger for voice recognition, but the section of the voice signal to be recognized is actually determined by the voice recognition itself. The operator therefore does not have to decide the recognition timing personally while listening to the speaker 106, as in the prior art.

[0058] Stopping the monitor output from the speaker 106 also lets the operator concentrate on operating the key switch 111 and watching the display unit 112, so no unnecessary mental burden is placed on the operator.

[0059] Using a key switch rather than voice input or touch panel input also makes the voice recognition operation very easy and clear. For example, the operator no longer needs to look at the touch panel surface and can operate by feel, so there is no need to shift the device busily between grips as described in problem 3) above.

[0060] In step S219 it is detected whether the call is to be ended. As with off-hook, this is detected from a predetermined key of the command input unit 110 or from a literal on-hook. When the operation to end the call is performed, the PHS radio control unit 104 is controlled via the PHS radio unit 105 and the call connection is released (step S220).

[0061] According to the above embodiment, call voice is recognized during the call and stored as data of the portable information terminal. The operator does not have to enter into the terminal what was heard from the other party, and data can be entered and stored easily.

【００６２】［実施形態２］以上では、音声認識の終了
タイミングを操作者が表示部１１２の表示を見て決定し
ている。しかし、この音声認識の終了タイミングも音声
認識開始と同様に特定の音声中の単語、あるいは特定の
音声信号を検出することで行なうことができる。[Second Embodiment] In the embodiment above, the operator determines the end timing of voice recognition by looking at the display on the display unit 112. However, this end timing can also be determined, in the same way as the start of voice recognition, by detecting a specific word in the voice or a specific voice signal.

【００６３】以下に示す実施形態は、音声認識の終了タ
イミングの決定を特定の音声中の単語、あるいは特定の
音声信号を検出することで行なう例を示すもので、前述
の問題点５を解決するものである。The following embodiment shows an example in which the end timing of voice recognition is determined by detecting a specific word in the voice or a specific voice signal, and it solves problem 5 described above.

【００６４】図３は、図２とほぼ同じ体裁のフローチャ
ートであるが、ステップ番号には３００番台の数字が用
いられており、図中の図２と同じ１０番台のステップ番
号を持つステップは図２の対応するステップとほぼ同様
の処理を行なうものである。以下では、図３中、図２と
異なる処理についてのみ説明する。FIG. 3 is a flowchart in almost the same format as FIG. 2, except that its step numbers are in the 300s. A step in FIG. 3 whose number in the tens range matches that of a step in FIG. 2 performs substantially the same processing as the corresponding step in FIG. 2. Only the processing in FIG. 3 that differs from FIG. 2 is described below.

【００６５】図３で図２と異なるのはステップＳ３１６
の音声認識処理であることと、ステップＳ２１７に相当
するキースイッチ１１１の操作が無い点である。FIG. 3 differs from FIG. 2 in the voice recognition processing of step S316 and in that there is no operation of the key switch 111 corresponding to step S217.

【００６６】本実施形態では、音声認識の終了タイミン
グの決定（ステップＳ３１６）を特定の音声中の単語、
あるいは特定の音声信号を検出することで行なうが、こ
のためには、音声認識部１０８により、音声認識開始の
際と同様に特定の音声中の単語、あるいは特定の音声信
号を検出する。たとえば相手の音声中の「終了しまし
た」、「終り」などの単語を音声認識部１０８で音声認
識した際に、音声認識を終了することが考えられる。ま
た、相手に適当なテンキー操作を依頼して、その周波数
を検出する方法を用いてもよい。あるいは所定時間以上
の無音（ないし音声の振幅レベルが一定以下）状態を検
出することにより音声認識の終了タイミングを検出して
もよい。In this embodiment, the end timing of voice recognition (step S316) is determined by detecting a specific word in the voice or a specific voice signal. For this purpose, the voice recognition unit 108 detects a specific word in the voice or a specific voice signal, in the same manner as at the start of voice recognition. For example, voice recognition may be ended when the voice recognition unit 108 recognizes a word such as “finished” or “end” in the other party's voice. Alternatively, the other party may be asked to perform an appropriate ten-key operation, and its frequency may be detected. The end timing of voice recognition may also be detected by detecting a state of silence (or a voice amplitude level at or below a fixed value) lasting a predetermined time or longer.
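The end-timing decision of step S316 can be sketched as follows. This is only an illustrative sketch, not the patent's implementation: the `Frame` type, the threshold `SILENCE_LEVEL`, and the frame count `SILENCE_FRAMES` are hypothetical names and values, the word list simply reuses the two example words given above, and the ten-key tone variant is not modeled.

```python
from dataclasses import dataclass
from typing import Iterable, Optional

END_WORDS = ("finished", "end")  # example end words from the text
SILENCE_LEVEL = 500              # hypothetical amplitude threshold
SILENCE_FRAMES = 50              # hypothetical "predetermined time", counted in frames


@dataclass
class Frame:
    peak: int                    # peak amplitude of this audio frame
    word: Optional[str] = None   # word recognized in this frame, if any


def find_end_frame(frames: Iterable[Frame]) -> Optional[int]:
    """Return the index of the frame at which recognition should end, or None."""
    quiet = 0  # number of consecutive frames at or below SILENCE_LEVEL
    for i, frame in enumerate(frames):
        # Condition 1: a predetermined end word was recognized.
        if frame.word is not None and frame.word.lower() in END_WORDS:
            return i
        # Condition 2: silence has lasted for the predetermined time.
        quiet = quiet + 1 if frame.peak <= SILENCE_LEVEL else 0
        if quiet >= SILENCE_FRAMES:
            return i
    return None
```

Either condition alone ends recognition, which mirrors the text's presentation of the word-based and silence-based methods as alternatives.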

【００６７】つまり、本実施形態ではステップＳ３１６
の音声認識の終了タイミングの決定をユーザの操作では
なく、認識処理そのもの、つまり相手の音声信号中の所
定のデータによって行なう、あるいは音声信号の状態の
変化に応じて行なう。That is, in this embodiment the end timing of voice recognition in step S316 is determined not by a user operation but by the recognition processing itself, that is, by predetermined data in the other party's voice signal, or in response to a change in the state of the voice signal.

【００６８】音声認識の終了タイミングが検出される
と、システム主制御部１０２は音声認識部１０８の音声
認識を終了させ、音声データ伝送経路を元の経路（ＰＨ
Ｓ無線制御部１０４→音声通路制御部１０３→スピーカ
ー１０６）に戻す。以後の処理は図２と同じである。When the end timing of voice recognition is detected, the system main control unit 102 ends the voice recognition of the voice recognition unit 108 and returns the voice data transmission path to the original path (PHS radio control unit 104 → voice path control unit 103 → speaker 106). Subsequent processing is the same as in FIG. 2.
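The path switching described here can be pictured with a toy model. The class and method names below are hypothetical and stand in for the voice path control unit 103; the sketch assumes only what the text states, namely that the other party's voice goes to the speaker 106 during a normal call, to the voice recognition unit 108 while recognition is active, and back to the speaker once recognition ends.

```python
from enum import Enum
from typing import List


class Sink(Enum):
    SPEAKER = "speaker 106"
    RECOGNIZER = "voice recognition unit 108"


class VoicePathControl:
    """Toy stand-in for the voice path control unit 103 (hypothetical API)."""

    def __init__(self) -> None:
        self.sink = Sink.SPEAKER            # normal call: voice goes to the speaker
        self.speaker_out: List[str] = []    # chunks the operator would hear
        self.recognizer_in: List[str] = []  # chunks handed to recognition

    def start_recognition(self) -> None:
        # Redirect the other party's voice to the recognizer.
        self.sink = Sink.RECOGNIZER

    def end_recognition(self) -> None:
        # Restore the original path to the speaker.
        self.sink = Sink.SPEAKER

    def route(self, chunk: str) -> None:
        # Deliver one chunk of received voice to the currently selected sink.
        if self.sink is Sink.SPEAKER:
            self.speaker_out.append(chunk)
        else:
            self.recognizer_in.append(chunk)


path = VoicePathControl()
path.route("hello")
path.start_recognition()
path.route("03-1234")
path.end_recognition()
path.route("goodbye")
```

In this model each received chunk goes to exactly one sink at a time, which matches the description that the monitor output from the speaker stops while recognition is in progress.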

【００６９】このように、通話相手の音声により音声認
識を終了させることにより、操作者が終了表示を見て音
声認識を終了させる判断すら必要無くなり、明示的な認
識終了操作を行なうことなく、音声認識の終了をさらに
簡単に行うことができるようになる。また、記憶部９の
記憶容量の節約も可能となる。Thus, by ending voice recognition in response to the other party's voice, the operator no longer even needs to look at the end display and decide to end voice recognition, and voice recognition can be ended even more easily, without any explicit recognition end operation. The storage capacity of the storage unit 109 can also be saved.

【００７０】以上の説明において示したハードウェアお
よびソフトウェア構成はあくまでも一例にすぎず、当業
者が本発明の範囲を逸脱しない範囲で種々の変更が可能
であるのはいうまでもない。たとえば、図１に示した構
成の各部は一体の筐体に納められている必要はなく、任
意の一部がオプション部品として供給されていたり、外
付けされていたりするものであってかまわない。通信方
式もＰＨＳ回線を用いるもののほか、通常の有線の公衆
回線、専用回線、あるいは携帯電話回線や自動車の回
線、あるいは衛星回線を用いるものなどであってもよ
い。特に、上記実施形態では無線接続のＰＨＳが前提で
あるが、相手の音声の認識に関しては有線の回線であっ
ても同様の効果を期待できるのはいうまでもない。The hardware and software configurations shown in the above description are merely examples, and it is needless to say that those skilled in the art can make various modifications without departing from the scope of the present invention. For example, each part of the configuration shown in FIG. 1 does not need to be housed in an integrated housing, and any part may be supplied as optional parts or may be externally attached. The communication system may be a system using a PHS line, a normal wired public line, a dedicated line, a mobile phone line, a car line, or a satellite line. In particular, in the above-described embodiment, the PHS of the wireless connection is premised, but it goes without saying that the same effect can be expected even with a wired line in recognizing the voice of the other party.

【００７１】また、装置が通話機能の他にさらにファク
シミリ機能やデータ通信の機能を有していても良い。ま
た、装置は多機能型の携帯電話機として、あるいはモバ
イル型のパーソナルコンピュータなどのどのような形態
を有していてもかまわない。要するに通話機能と、相手
から得た情報をデータとして利用することを目的とする
通信装置であれば本発明が実施できるのはいうまでもな
い。The apparatus may have a facsimile function and a data communication function in addition to the call function. The apparatus may also take any form, such as a multifunctional mobile phone or a mobile personal computer. In short, it goes without saying that the present invention can be implemented in any communication device that has a call function and is intended to use information obtained from the other party as data.

【００７２】[0072]

【発明の効果】以上から明らかなように、本発明によれ
ば、所定操作手段の入力に応じて、音声認識手段による
音声信号の認識開始ないし終了のタイミングを決定し、
相手の音声データを音声認識し、その音声認識データを
記憶させる構成を採用しているので、通話相手の音声を
音声認識し、携帯情報端末のデータとして記憶させるこ
とにより、操作者が通話相手から聞いた音声から自分で
携帯情報端末に入力／記憶させなくて済み、簡単にデー
タの入力／記憶を行うことができる、という優れた効果
がある。As is apparent from the above, the present invention adopts a configuration in which the start or end timing of voice signal recognition by the voice recognition means is determined in accordance with the input of predetermined operation means, the other party's voice data is recognized, and the resulting recognition data is stored. Because the voice of the other party is recognized and stored as data of the portable information terminal, the operator does not have to input and store by himself, in the portable information terminal, what he heard from the other party, and data can be input and stored easily, which is an excellent effect.

【００７３】また、通話中の相手の音声の認識開始時に
回線接続手段から入力される音声データの出力先を前記
音声出力手段から前記音声認識手段へ切り換え、通話中
の音声を音声出力手段に出力させないようにする構成を
採用しているので、音声データ記憶のための余計な記憶
手段が不要であり、携帯情報端末を簡単安価かつ小型軽
量に構成することができる。Further, a configuration is adopted in which, when recognition of the other party's voice during a call starts, the output destination of the voice data input from the line connection means is switched from the voice output means to the voice recognition means so that the voice during the call is not output to the voice output means. Consequently, no extra storage means for storing voice data is required, and the portable information terminal can be made simple, inexpensive, small, and lightweight.

【００７４】また、通話中の相手の音声を音声認識し、
その認識結果データを携帯端末のデータとして記憶させ
る場合、音声信号の認識開始の契機となるタイミングを
決定する操作手段としてキースイッチを用いる構成を採
用することにより、音声入力やタッチパネル入力による
方法よりも音声認識操作が極めて容易かつ判りやすいも
のになり、片手操作で音声認識を開始／終了させること
が可能となる。Further, when the voice of the other party during a call is recognized and the recognition result data is stored as data of the portable terminal, adopting a configuration that uses a key switch as the operation means for determining the timing that triggers the start of voice signal recognition makes the voice recognition operation far easier and clearer than methods based on voice input or touch panel input, and makes it possible to start and end voice recognition with one hand.

【００７５】また、音声信号の認識開始後、受信された
音声信号中に所定の音声情報が検出されることにより、
音声認識手段による音声信号の認識結果を記憶手段に格
納する処理の開始タイミングが決定される構成を採用す
ることにより、認識結果を記憶する記憶手段の記憶容量
を減らすことができ、携帯情報端末を簡単安価かつ小型
軽量に構成することができる。Further, by adopting a configuration in which, after recognition of the voice signal has started, the start timing of the process of storing the voice signal recognition result of the voice recognition means in the storage means is determined by detection of predetermined voice information in the received voice signal, the storage capacity of the storage means that stores the recognition result can be reduced, and the portable information terminal can be made simple, inexpensive, small, and lightweight.

【００７６】さらに、音声信号の認識開始後、受信され
た音声信号中に所定の音声情報が検出されることによ
り、音声認識手段による音声信号の認識結果を記憶手段
に格納する処理の終了タイミングが決定される構成を採
用することにより、認識結果を記憶する記憶手段の記憶
容量の縮減を期待できるとともに、明示的な認識終了操
作が不要となり、装置の操作がより容易になる。Furthermore, by adopting a configuration in which, after recognition of the voice signal has started, the end timing of the process of storing the voice signal recognition result of the voice recognition means in the storage means is determined by detection of predetermined voice information in the received voice signal, a reduction in the storage capacity of the storage means that stores the recognition result can be expected, an explicit recognition end operation becomes unnecessary, and the device becomes easier to operate.
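The storage saving claimed in the two preceding paragraphs comes from storing only what lies between a start trigger and an end trigger in the recognized voice. The following is a minimal sketch under stated assumptions: the function name and the use of single trigger words are hypothetical, and the recognized voice is simplified to a sequence of words.

```python
from typing import Iterable, List


def words_to_store(words: Iterable[str], start_word: str, end_word: str) -> List[str]:
    """Keep only the recognized words between the start and end triggers."""
    stored: List[str] = []
    storing = False
    for word in words:
        if not storing:
            # Nothing is stored until the start trigger is detected.
            if word == start_word:
                storing = True
            continue
        if word == end_word:
            # The end trigger stops storage without any key operation.
            break
        stored.append(word)
    return stored
```

For example, with the hypothetical triggers `"number"` and `"end"`, the input `["hello", "number", "0", "3", "end", "bye"]` yields `["0", "3"]`, so greetings before and after the useful information never reach the storage means.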

【００７７】また、電話回線網との接続を無線通信方式
により行なう構成を採用することにより、いつでもどこ
でも携帯情報端末による通話、および通話で得た音声情
報を音声認識する処理が可能となる。Further, by adopting a configuration in which the connection with the telephone line network is made by the wireless communication system, it is possible to perform a voice communication with the portable information terminal and a voice recognition process of voice information obtained by the voice communication anytime and anywhere.

【００７８】以上のように、本発明によれば、携帯情報
端末において、簡単な操作により通話相手から得た音声
情報を認識し、再利用可能な形で記憶させることができ
る、という優れた効果がある。As described above, the present invention has the excellent effect that a portable information terminal can, with a simple operation, recognize voice information obtained from the other party to a call and store it in a reusable form.

[Brief description of the drawings]

【図１】本発明を採用した携帯情報端末の構成を示した
ブロック図である。FIG. 1 is a block diagram showing a configuration of a portable information terminal employing the present invention.

【図２】図１の装置における通話時の制御手順の第１実
施形態を示したフローチャート図である。FIG. 2 is a flowchart showing a first embodiment of a control procedure at the time of a call in the apparatus of FIG. 1.

【図３】図１の装置における通話時の制御手順の第２実
施形態を示したフローチャート図である。FIG. 3 is a flowchart showing a second embodiment of a control procedure at the time of a call in the apparatus of FIG. 1.

[Explanation of symbols]

１０１携帯情報端末１０２システム主制御部１０３音声通路制御部１０４ＰＨＳ無線制御部１０５ＰＨＳ無線部１０６スピーカー１０７マイク１０８音声認識部１０９記憶部１１０コマンド入力１１１キースイッチ１１２表示部１１３電源部
Reference Signs List
101 portable information terminal
102 system main control unit
103 voice path control unit
104 PHS radio control unit
105 PHS radio unit
106 speaker
107 microphone
108 voice recognition unit
109 storage unit
110 command input
111 key switch
112 display unit
113 power supply unit

Claims

[Claims]

1. A portable information terminal comprising: line connection means connected to a telephone line network for inputting and outputting a voice signal to and from the telephone line network; voice output means for outputting a voice signal input from the line connection means; voice input means for inputting voice and outputting the voice data to the line connection means; voice recognition means for recognizing a voice signal; storage means for storing recognition result data produced by the voice recognition means; operation means for inputting the timing at which recognition of the voice signal by the voice recognition means starts or ends; and control means for determining the start or end timing of voice signal recognition by the voice recognition means in response to the input of the operation means.

2. The portable information terminal according to claim 1, wherein, in response to an input from the operation means, at the timing at which the voice recognition means starts recognizing a voice signal, the control means switches the output destination of the voice data input from the line connection means from the voice output means to the voice recognition means.

3. The portable information terminal according to claim 1, wherein said operation means is a key switch.

4. The portable information terminal according to claim 1, wherein, after the voice recognition means starts recognizing the voice signal, the start timing of the process of storing the voice signal recognition result of the voice recognition means in the storage means is determined by detection of predetermined voice information in the received voice signal.

5. The portable information terminal according to claim 1, wherein, after the voice recognition means starts recognizing the voice signal, the end timing of the process of storing the voice signal recognition result of the voice recognition means in the storage means is determined by detection of predetermined voice information in the received voice signal.

6. The portable information terminal according to claim 1, wherein the connection with the telephone line network is performed by a wireless communication system.

7. A method for controlling a portable information terminal having line connection means connected to a telephone line network for inputting and outputting a voice signal to and from the telephone line network, voice output means for outputting a voice signal input from the line connection means, voice input means for inputting an operator's voice and outputting the voice data to the line connection means, voice recognition means for recognizing a voice signal, and storage means for storing recognition result data produced by the voice recognition means, the method comprising a control step of receiving, from predetermined operation means, an input of the timing at which recognition of the voice signal by the voice recognition means starts or ends, and determining the start or end timing of voice signal recognition by the voice recognition means.

8. The method for controlling a portable information terminal according to claim 7, wherein, in response to an input from the operation means, at the timing at which the voice recognition means starts recognizing a voice signal, the output destination of the voice data input from the line connection means is switched from the voice output means to the voice recognition means.

9. The method according to claim 7, wherein a key switch is used as the operation unit.

10. The method for controlling a portable information terminal according to claim 7, wherein, after recognition of the voice signal by the voice recognition means has started, the start timing of the process of storing the voice signal recognition result of the voice recognition means in the storage means is determined by detection of predetermined voice information in the received voice signal.

11. The method for controlling a portable information terminal according to claim 7, wherein, after recognition of the voice signal by the voice recognition means has started, the end timing of the process of storing the voice signal recognition result of the voice recognition means in the storage means is determined by detection of predetermined voice information in the received voice signal.

12. The method for controlling a portable information terminal according to claim 7, wherein connection to a telephone line network is performed by a wireless communication system.

13. A storage medium storing a control program for a portable information terminal having line connection means connected to a telephone line network for inputting and outputting a voice signal to and from the telephone line network, voice output means for outputting a voice signal input from the line connection means, voice input means for inputting voice and outputting the voice data to the line connection means, voice recognition means for recognizing a voice signal, and storage means for storing recognition result data produced by the voice recognition means, the control program including a control step of receiving, from predetermined operation means, an input of the timing at which recognition of the voice signal by the voice recognition means starts or ends, and determining the start or end timing of voice signal recognition by the voice recognition means.

14. The storage medium according to claim 13, wherein a control step is stored of switching the output destination of the voice data input from the line connection means from the voice output means to the voice recognition means at the timing at which the voice recognition means starts voice recognition in response to an input of the operation means.

15. The storage medium according to claim 13, further comprising a control step of inputting the timing of starting or ending the recognition of a voice signal by said voice recognition means from a key switch as said operation means.

16. The storage medium according to claim 13, further comprising a control step of determining, after the voice recognition means starts recognizing the voice signal, the start timing of the process of storing the voice signal recognition result of the voice recognition means in the storage means by detection of predetermined voice information in the received voice signal.

17. The storage medium according to claim 13, further comprising a control step of determining, after the voice recognition means starts recognizing the voice signal, the end timing of the process of storing the voice signal recognition result of the voice recognition means in the storage means by detection of predetermined voice information in the received voice signal.

18. The storage medium according to claim 13, further comprising a control step of connecting to a telephone line network by a wireless communication system.