JPH02117252A

JPH02117252A - Telephone set

Info

Publication number: JPH02117252A
Application number: JP27157488A
Authority: JP
Inventors: Hiroshi Matsuura; 博松浦; Akira Fukumine; 福嶺　明
Original assignee: Toshiba Corp; Toshiba Computer Engineering Corp
Current assignee: Toshiba Corp; Toshiba Computer Engineering Corp
Priority date: 1988-10-27
Filing date: 1988-10-27
Publication date: 1990-05-01
Anticipated expiration: 2013-01-26
Also published as: JP2703952B2

Abstract

PURPOSE:To prevent erroneous dialing from being performed and to improve operability by issuing a call to a recognition candidate with the highest order of candidate displayed in voice input, stopping the issuing of the call when the selection of another recognition candidate is performed while issuing the call, and issuing the call by a selected recognition candidate. CONSTITUTION:An operator raises a handset 2 and performs the input of an opposite name in voice according to the message of a display part 7, and it is supplied to a vocalizing recognition device 10, and a prescribed number of candidates of the opposite name is found. A CPU 1 retrieves a storage part 6, and reads out the corresponding telephone number of an opponent and attached, information, and displays them on the display part 7, and also, outgoing to the opponent that is a first candidate out of the candidates of the opposite name is started. The opposite name and the attached information on the display part 7 are checked during a line is connected. and when they are wrong, desired opposite name and information out of the opposite names and the attached information can be selected by a selection button. The CPU 1 interrupts the work of line connection under execution while the above selection is being perform ed, and starts the outgoing to the opponent selected newly.

Description

【発明の詳細な説明】［発明の目的コ（産業上の利用分野）本発明は、音声入力による自動ダイヤルがｉ”ｉＪ能な
電話機に関する。DETAILED DESCRIPTION OF THE INVENTION [Object of the Invention (Field of Industrial Application) The present invention relates to a telephone capable of automatic dialing by voice input.

（従来の技術）近年、我々の身の回りには、多くの電子機器が氾濫して
いる。そしてそれらの機器の中には複数の機能または動
作を採り得るものが多い。これは非常に便利である反面
、操作上不便なことも多い。このような観点から種々の
電子機器とこれを利用する人間との関係を考慮すること
が非常に重要となってきている。(Prior Art) In recent years, many electronic devices have become ubiquitous around us. Many of these devices can have multiple functions or operations. Although this is very convenient, it is often inconvenient in operation. From this perspective, it has become very important to consider the relationship between various electronic devices and the people who use them.

これを具体的に説明すると、例えば電話機の場合、自動
電子交換機の登場によりわざわざ交換手による取次ぎ操
作を行うことなく、相手先の電話番号さえダイヤルすれ
ば相手先と通話することが可能となってきた。このよう
な自動機能は非常に便利である。しかし、このような自
動機能を利用し得るのは相手先の電話番号が分っている
時だけであって、相手先の電話番号が分らない時にはこ
の自動機能を利用することが出来ない。勿論、このよう
な場合の為に電話帳等が準備されているのであるが、電
話帳には利用者にとって不必要な情報も多く掲載されて
おり、一般的には使い難い。To explain this in detail, for example, in the case of a telephone, with the advent of automatic electronic exchanges, it has become possible to make a call to the other party by simply dialing the other party's phone number, without having to go through the trouble of having a switchboard handle the transfer operations. Ta. Such automatic functions are very convenient. However, such an automatic function can be used only when the telephone number of the other party is known, and cannot be used when the telephone number of the other party is not known. Of course, telephone directories and the like are prepared for such cases, but telephone directories contain a lot of information that is unnecessary for users and are generally difficult to use.

そこで本発明者等は、相手先を示す入力音声を認識処理
して求められる複数の認識候補の中から、その候補順位
の高いものから順に所定個数求めて表示し、利用者によ
る選択指示により選択された相手先に対して自動ダイヤ
ルする電話機を提案している。Therefore, the inventors of the present invention determined a predetermined number of recognition candidates from among a plurality of recognition candidates obtained by recognition processing of input speech indicating the other party, in descending order of candidate ranking, and selected the candidates according to a selection instruction from the user. We are proposing a telephone that automatically dials the called party.

ところが、この自動ダイヤル電話機では、相手先を複数
の認識候補の中から利用者の選択指示により選択するも
のであるため、音声認識処理から生じる誤ダイヤルを防
止しうる反面、利用者に対して常に選択操作を強いるも
のであるため、利用者にとっては面倒となることも多い
。However, with these automatic dialing telephones, the destination is selected from among multiple recognition candidates according to the user's selection instructions, so while it is possible to prevent incorrect dialing caused by voice recognition processing, it is difficult for the user to always Since it forces a selection operation, it is often troublesome for the user.

（発明が解決しようとする課題）このように上述した自動ダイヤル電話機では、相手先名
を音声入力することにより、該相手先に対して自動ダイ
ヤルすることができるが、音声認識処理から生じる誤ダ
イヤルの防止等の観点から、相手先を複数の認識候補の
中から利用者の選択指示により選択するものであるため
、利用者に対して常に選択操作を強いることとなり、利
用者にとって面倒となることがある。(Problem to be Solved by the Invention) As described above, the above-mentioned automatic dialing telephone can automatically dial a destination by inputting the destination name by voice, but incorrect dialing caused by voice recognition processing can occur. In order to prevent such problems, the user selects the other party from among multiple recognition candidates based on the user's selection instructions, which forces the user to constantly select, which can be troublesome for the user. There is.

本発明はこのような事情を考慮してなされたもので、音
声認識処理を用いた自動ダイヤルによって生じる誤ダイ
ヤルを防止しつつ、操作性の向上を図ることのできる電
話機を提供することを目的としている。The present invention was made in consideration of the above circumstances, and an object of the present invention is to provide a telephone that can improve operability while preventing erroneous dialing caused by automatic dialing using voice recognition processing. There is.

［発明の構成］（課題を解決するための手段）本発明は、音声が入力される音声入力手段と、この音声
入力手段により入力された音声を認識し、この音声に対
応する複数の認識候補およびこれら各認識候補に対応す
る候補順位を出力する音声認識手段と、この音声認識手
段から出力された複数の認識候補のうち、２個以上の認
識候補を表示する表示手段と、前記音声認識手段から出
力された順位候補に基づき、前記表示手段により表示さ
れた認識候補のうち、候補順位がもっとも高い認識候補
に対し発呼を行う第１の発呼手段と、この第１の発呼手
段により発呼が行われた認識候補以外の認識候補を前記
表示手段に表示された結果に基づき選択する選択手段と
、前記第１の発呼手段により発呼動作が行われている最
中に、前記選択手段による選択が行われたとき、前記第
１の発呼手段による発呼動作を停止させ、前記選択手段
により選択された認識候補により発呼を行う第２の発呼
手段とを備えることにより、上述した課題を解決してい
る。[Structure of the Invention] (Means for Solving the Problems) The present invention provides a voice input means into which voice is input, a voice input by the voice input means, and a plurality of recognition candidates corresponding to the voice. and a voice recognition means for outputting a candidate ranking corresponding to each of these recognition candidates, a display means for displaying two or more recognition candidates among the plurality of recognition candidates output from the voice recognition means, and the voice recognition means a first calling means that calls a recognition candidate with the highest candidate ranking among the recognition candidates displayed by the display means based on the ranking candidates output from the first calling means; selecting means for selecting a recognition candidate other than the recognition candidate to which the call was made based on the result displayed on the display means; and while the first calling means is performing the calling operation; and a second calling means that stops the calling operation of the first calling means and makes a call based on the recognition candidate selected by the selecting means when the selection means makes a selection. , which solves the above-mentioned problems.

（作　用）即ち本発明によれば、音声が入力されると、複数の認識
候補およびこれら各認識候補に対応する候補順位が出力
され、複数の認識候補が表示され、まず、候補順位がも
っとも高い認識候補に対し発呼が行われる。そして、こ
の発呼動作が行われている最中に、他の認識候補の選択
が行われると、前記発呼動作が停止され、選択された認
識候補により発呼が行われる。(Function) That is, according to the present invention, when a voice is input, a plurality of recognition candidates and candidate rankings corresponding to these recognition candidates are output, a plurality of recognition candidates are displayed, and first, the candidate ranking is the highest. A call is made to a high recognition candidate. If another recognition candidate is selected while this calling operation is being performed, the calling operation is stopped and a call is made using the selected recognition candidate.

（実施例）以下、図面を参照して本発明の実施例につき説明する。(Example) Embodiments of the present invention will be described below with reference to the drawings.

第１図は本発明の一実施例に係る電話機の概略的な機能
ブロック構成を示す図である。この電話機はＣＰＵから
なる制御装置１を主体とし、後述する各部を備えて構成
される。即ち、この電話機は回線インターフェース８を
介して電話回線に接続されるもので、送受品温２、ダイ
ヤルパッドあるいは選択ボタンｌｌａ、ｌｌｂを含むボ
タン入力部５、相手先名を見出しとしてその電話番号等
を格納した情報記憶部６、液晶デイスプレィあるいは発
信中表示ランプ１２　ａ　ｓ　１２　ｂ　ｓ　１２　ｃ
等からなる表示部７、および音声認識部３と認識辞書４
とからなる音声認識装置１０を備えて構成される。第２
図はこのような機能を備えて構成される電話機の外観構
成を示すものである。FIG. 1 is a diagram showing a schematic functional block configuration of a telephone according to an embodiment of the present invention. This telephone mainly includes a control device 1 consisting of a CPU, and includes various parts described below. That is, this telephone is connected to a telephone line via a line interface 8, and displays information such as the sending/receiving item temperature 2, a button input section 5 including a dial pad or selection buttons lla and llb, and a telephone number with the name of the other party as a heading. an information storage unit 6 storing information, a liquid crystal display or a transmission indicator lamp 12 a s 12 b s 12 c
a display section 7 consisting of a speech recognition section 3 and a recognition dictionary 4;
The speech recognition device 10 is configured to include a speech recognition device 10 consisting of the following. Second
The figure shows the external configuration of a telephone equipped with such functions.

さて音声認識装置１０は前記送受話器２を介して音声入
力された相手先名を示す単語音声の特徴を音声認識部３
にて分析抽出し、認識辞書４に予め登録されている複数
の相手先の各音声的特徴（標準パターン）とそれぞれ照
合して、その入力音声単語（相手名）を認識するもので
ある。Now, the voice recognition device 10 recognizes the characteristics of the word voice indicating the name of the other party inputted via the handset 2 by the voice recognition unit 10.
The input speech word (name of the other party) is recognized by analyzing and extracting the word and comparing it with each of the phonetic characteristics (standard patterns) of a plurality of other parties registered in advance in the recognition dictionary 4.

制御装置１はこの音声認識装置１０にて認識された入力
音声、即ち相手先に対する複数の認識候補（相手先名候
補）のうち、例えば候補順位の高いものから順に所定個
数求めて、後述する選択処理に供している。The control device 1 obtains a predetermined number of recognition candidates (destination name candidates) for the input speech recognized by the speech recognition device 10, for example, from among the plurality of recognition candidates (destination name candidates) for the destination, and makes a selection as described later. Submitted for processing.

一方、情報記憶部６には前述した認識対象とする相手先
に対応して、その相手先の電話番号が予め登録されてい
る。尤もこの情報記憶部６には、必要に応じて相手先に
関する付属情報、例えば所属会者名、役職、住所、氏名
、職業等を記憶しておいてもよい。On the other hand, in the information storage unit 6, the telephone number of the other party to be recognized is registered in advance in correspondence with the other party to be recognized. Of course, the information storage unit 6 may store additional information regarding the other party, such as affiliated member's name, position, address, name, occupation, etc., if necessary.

ボタン入力部５はダイヤルパッドを構成する［１，２、
〜９，０．＊、＃Ｊ等のボタンや選択ボタンｌｌａ、ｌ
ｌｂ等から成る。これらの各ボタンは、押下されること
によりそのボタンに対応した信号を制御装置１に送って
いる。このボタン入力部５は、通常の電話操作（ダイヤ
ル操作：発信）にも用いられるが、この実施例では更に
前記情報記憶部６への情報入力にも利用されるようにな
っている。即ち、単数又は複数のボタンの組合わせに対
して、例えば第３図に示すように日本語のカナ１０１個
を割当てておき、これらボタンの操作により情報入力を
行い得るものとなっている。The button input section 5 constitutes a dial pad [1, 2,
~9,0. *, #J buttons and selection buttons lla, l
Consists of lb etc. Each of these buttons sends a signal corresponding to the button to the control device 1 when pressed. This button input section 5 is also used for normal telephone operations (dial operation: making a call), but in this embodiment, it is also used for inputting information into the information storage section 6. That is, for example, 101 Japanese kana characters are assigned to one or more combinations of buttons as shown in FIG. 3, and information can be input by operating these buttons.

尚、音声認識装置１０に単音節を認識する機能を持たせ
、音声によりカナ単位に情報入力させるようにしても良
い。情報記憶部６への入力に際し、数字情報についても
前記ボタン入力部５または音声認識装置１０を用いても
良いことは勿論のことである。また、選択ボタンｌｌａ
、ｌｌｂは、それぞれ、発呼時に表示部７にて表示され
る第３候補までの相手先名候補のうち第２候補および第
３候補の相手先名候補に対応して設けられている。Note that the speech recognition device 10 may be provided with a function of recognizing monosyllables, and the information may be input in units of kana by voice. Of course, when inputting numerical information to the information storage section 6, the button input section 5 or the voice recognition device 10 may be used. Also, select button lla
, llb are provided corresponding to the second and third destination name candidates among the destination name candidates up to the third candidate displayed on the display unit 7 at the time of making a call.

そして、後述するように、第１候補の相手先名候補に対
する発呼中にこれらの選択ボタンｌｌａ、１１ｂが押下
されると、これに対応する相手先名候補に対する発呼に
切替えられるようになっている。As will be described later, when these selection buttons lla and 11b are pressed during a call to the first destination name candidate, the call is switched to the corresponding destination name candidate. ing.

一方、前記表示部７は液晶デイスプレィ、発信中表水ラ
ンプ１２ａ、１２ｂ、１２ｃ等からなり、制御装置１か
らの指示に従って種々の表示情報を可視化して利用者に
呈示するものである。この表示部７にて前記音声認識装
置１０にて認識された認識候補のうち、所定数の１０手
先名候補とその相手先に対応する電話番号等の情報が項
目別に分類されて表示される。また後述するように、こ
の表示部７にて各種メツセージも表示される。更に、発
信中表水ランプ１２ａ、１２ｂ、１２Ｃは、発呼時に表
示部７にて表示される第３候補までの相手先名候補に対
応して設けられている。そして、発呼時に、発呼相手に
対応する発信中表水ランプ１２ａ、１２ｂ、１２ｃが点
灯あるいは点滅するようになっている。On the other hand, the display section 7 includes a liquid crystal display, transmitting water lamps 12a, 12b, 12c, etc., and visualizes various display information and presents it to the user according to instructions from the control device 1. On the display section 7, a predetermined number of 10 candidate names among the recognition candidates recognized by the voice recognition device 10 and information such as telephone numbers corresponding to the parties are classified and displayed by item. Furthermore, various messages are also displayed on the display section 7, as will be described later. Furthermore, the calling lamps 12a, 12b, and 12C are provided corresponding to the destination name candidates up to the third candidate displayed on the display unit 7 when making a call. When a call is made, the calling lamps 12a, 12b, and 12c corresponding to the called party are turned on or blinking.

また回線インターフェース８は制御装置１の制御の下で
該電話機と回線との信号とのやりとりを行い、該回線を
介して相手先の電話機との間の通話路を形成するもので
ある。The line interface 8 exchanges signals between the telephone and the line under the control of the control device 1, and forms a communication path with the other party's telephone via the line.

次に上記構成の電話機の動作について説明する。Next, the operation of the telephone with the above configuration will be explained.

通話を行おうとする場合、先ず通常通り送受話器２を取
上げる。すると表示部７には、例えば［相手先名を発声
して下さい。」等のメツセージが表示される。操作者はこれに従い相手
先名を音声入力する。このようにして発声入力された音
声が送受話器２を介して音声認識装置１０に与えられ、
所定個数（例えば上位３個）の認識候補（相手先名候補
）が求められる。制御装置１はこれらの各相手先名候補
について前記情報記憶部６を検索し、対応する各相手先
の電話番号および付属情報を読出して表示部７による表
示に供するとともに認識候補（相手先名候補）中、第１
候補の相手先への発信を開始する。即ち、第１候補の相
手先電話番号に対応した信号を回線インターフェース８
を介して回線に送出する。以下は通常の電話機の操作と
同様にして回線接続がなされる。実際に回線が接続され
るまでにはしばらくの時間を要するが操作者はその時間
に表示部７に表示された相手先名または付属情報の確認
を行う。このときもし相手先がまちがっていた場合操作
者は表示部７に表示された残りの認識候補（相手先名候
補）または付属情報のうち、希望のものを選択ボタンｌ
ｌａ、ｌｌｂを用いて選択する。When attempting to make a call, first pick up the handset 2 as usual. Then, the display section 7 will display, for example, ``Please say the name of the other party.'' ” message is displayed. The operator follows this and inputs the name of the other party by voice. The voice input in this way is given to the voice recognition device 10 via the handset 2,
A predetermined number (for example, top three) of recognition candidates (destination name candidates) are obtained. The control device 1 searches the information storage section 6 for each of these destination name candidates, reads out the telephone number and attached information of each corresponding destination, and displays them on the display section 7, and also displays recognition candidates (destination name candidates). ) middle, 1st
Start calling the candidate destination. That is, a signal corresponding to the first candidate's destination telephone number is sent to the line interface 8.
to the line via. The line connection is then made in the same way as normal telephone operations. Although it takes some time until the line is actually connected, the operator confirms the name of the other party or the attached information displayed on the display section 7 during that time. At this time, if the destination is incorrect, the operator selects the desired one from the remaining recognition candidates (destination name candidates) or attached information displayed on the display unit 7 by pressing the button l.
Select using la and llb.

この選択に対して制御装置１は、実行中の回線接続作業
を中止し、新たに選択された相手先への発信を開始する
。In response to this selection, the control device 1 cancels the line connection work in progress and starts calling the newly selected destination.

第４図は以上の動作処理手続きの概要を示す図である。FIG. 4 is a diagram showing an outline of the above operation processing procedure.

この処理手続きは前述の送受話器２から、通話相手先名
（発信先の相手名）を示す音声を入力することから始め
られる（ステップａ）。送受話器２から入力された音声
信号は、例えば図示しないＡ／Ｄ変換器によりディジタ
ル化された後、音声認識部１０に供給される。This processing procedure begins with the input of voice indicating the name of the other party (the name of the destination party) from the above-mentioned handset 2 (step a). A voice signal input from the handset 2 is digitized, for example, by an A/D converter (not shown), and then supplied to the voice recognition section 10.

このようにして送受話器２から音声が入力されると前記
音声認識部１０による音声認識処理が開始される（ステ
ップｂ）。この音声認識処理は、先ず入力信号から音声
区間の切出しを行い、その音声区間の入力信号の音声パ
ワー変化や周波数分析結果の変化を求める等して、その
音響的特徴を入力音声パターンとして抽出することから
行われる。そしてこの音響分析により抽出した入力音声
パターンと、前記認識辞書４に予め登録されている相手
先名の標準パターンとの間で類似度を求める等してそれ
ぞれ照合し、入力音声に対する認識候補を求める。この
入力音声の認識処理に倶される認識辞書４は、例えば多
数の複数の不特定話者から収集された音声データを統計
処理し、認識対象とする複数の相手先名についてそれぞ
れ求められている標準パターンを登録したものである。When the voice is input from the handset 2 in this way, the voice recognition process by the voice recognition section 10 is started (step b). This speech recognition process first extracts a speech section from an input signal, then calculates changes in the audio power and frequency analysis results of the input signal for that speech section, and extracts its acoustic features as an input speech pattern. It is done because of this. Then, the input speech pattern extracted through this acoustic analysis and the standard pattern of the destination name registered in advance in the recognition dictionary 4 are compared by determining the degree of similarity, etc., and recognition candidates for the input speech are obtained. . The recognition dictionary 4 involved in the recognition process of this input voice performs statistical processing on voice data collected from, for example, a large number of unspecified speakers, and calculates each of the names of a plurality of recipients to be recognized. This is a registered standard pattern.

しかし利用者が限定されている電話機の場合には、該電
話機を使用する特定の話者（単数又は複数）から音声デ
ータを収集し、この収集データから作成された標準パタ
ーンを認識辞書４に登録しておくようにしても良い。However, in the case of a telephone that has a limited number of users, voice data is collected from a specific speaker (single or plural) using the telephone, and a standard pattern created from this collected data is registered in the recognition dictionary 4. You may leave it as is.

しかして上記照合処理は上記入力音声パターンと標準パ
ターンとの間の類似度や距離値を計算する等して行われ
る。そしてその照合結果から認識候補順位の高いものを
、例えば第１位から第３位までを前述の入力音声に対す
る認識候補（相手先名候補）として抽出する（ステップ
Ｃ）。この複数個の相手先名候補の抽出は、単語音声の
認識精度の点から、通常の入力音声に対してはその単語
を全て正確に認識することは困難であるが、認識候補の
上位３位までに着目すればその中に正しい認識結果が含
まれる確率が高いことに立脚している。The matching process is performed by calculating the degree of similarity or distance between the input speech pattern and the standard pattern. Then, from the comparison results, those with high recognition candidate rankings, for example, the first to third ranking candidates are extracted as recognition candidates (destination name candidates) for the input voice described above (step C). In order to extract multiple destination name candidates, it is difficult to accurately recognize all the words in normal input speech from the viewpoint of recognition accuracy of word speech, but This is based on the fact that if you focus on the above, there is a high probability that the correct recognition result will be included.

このようにして入力音声（相手先名を示す単語音声）に
対する認識処理を行った後、前述した制御装置１にて認
識結果が抽出されたか否かを判定する（ステップｄ）。After performing recognition processing on the input speech (word speech indicating the name of the other party) in this manner, the aforementioned control device 1 determines whether or not a recognition result has been extracted (step d).

この判定結果にて入力音声に対するリジェクトが生じた
ことが確認された場合、つまり認識候補の抽出が出来な
かった場合には、制御装置１の制御の下で前記表示部７
を用いて音声認識が不首尾に終わったことを示すメツセ
ージ、例えば「もう−度発声して下さい。」というメツセージを表示して利用者に相手先名の再入力
を促す（ステップｅ）。このメツセージにより再入力さ
れた音声信号に対して同様にして音声認識処理を施す。If it is confirmed that the input voice has been rejected as a result of this determination, that is, if a recognition candidate cannot be extracted, the display unit 7 under the control of the control device 1
is used to display a message indicating that the voice recognition has been unsuccessful, for example, ``Please speak again.'' to prompt the user to re-enter the other party's name (step e). The voice recognition process is similarly performed on the voice signal re-inputted by this message.

尚、繰返し入力された音声に対してその認識処理が全て
不首尾に終わる様な場合には、例えば所定回数の繰返し
時点において、自動ダイヤルが出来ないこと、そして電
話番号簿を調べてボタン入力部５を操作してダイヤルす
ることを促す旨のメツセージを出力したり、発話者を他
の人に代わって貰う旨のメツセージを表示するようにす
ればよい。In addition, if the recognition process for the repeatedly input voice ends in failure, for example, when the voice is repeatedly input a predetermined number of times, automatic dialing cannot be performed, and the telephone number list is checked and the button input section 5 It may be possible to output a message urging the user to dial by operating the , or to display a message requesting that another person take the place of the speaker.

尚、ここでのメツセージ出力を表示部７による表示出力
でなく音声で出力してもよいし、またメツセージの表示
出力と音声出力とを併用してもよい。この音声出力に関
しては、例えば文字コード列で示される所定のメツセー
ジ・データを音声規則合成を用いる等して音声合成して
出力するようにすれば良い。Note that the message output here may be outputted as a sound instead of a display output by the display unit 7, or a message display output and an audio output may be used together. Regarding this voice output, for example, predetermined message data indicated by a character code string may be synthesized into voice using voice rule synthesis, etc., and then output.

さて、入力音声に対する複数の認識候補として相手先名
候補が求められると、制御装置１はこれらの相手先名に
ついて前記情報記憶部６を検索する（ステップｆ）。こ
の情報記憶部６には相手先名を見出しとして電話番号と
所属会社名等の付属情報が蓄積されている。制御装置１
はこのような情報記憶部６から電話番号、付属情報を前
述した相手先名候補に従ってそれぞれ検索抽出する。そ
して検索抽出された相手先名と電話番号を、また必要に
応じてこれに付属する前記表示部７に表示している（ス
テップｇ）。Now, when destination name candidates are obtained as a plurality of recognition candidates for the input voice, the control device 1 searches the information storage unit 6 for these destination names (step f). This information storage section 6 stores attached information such as a telephone number and the name of the company to which the person belongs, with the name of the recipient as a heading. Control device 1
retrieves and extracts telephone numbers and attached information from such information storage unit 6 according to the above-mentioned destination name candidates. Then, the searched and extracted destination name and telephone number are displayed on the display section 7 attached thereto as necessary (step g).

具体的には、例えば「カトウ」と利用者が発声した入力
音声に対して「カトウ」　「サトウ」　「サイトウ」な
る第３位までの認識候補（相手先名候補）が音声認識に
よって求められた場合、上記各相手先名の番号をｒ０３−３６５−５０４９Ｊｒ０４４−６５５−３０２１Ｊｒ０４５−３２１−６０２５Ｊとして前記情報記憶部６から求め、第５図のように表示
する。Specifically, for example, when a user utters the input voice "Kato," the top three recognition candidates (destination name candidates) are determined by voice recognition: "Kato,""Sato," and "Saito." In this case, the numbers of the respective destination names are obtained from the information storage unit 6 as r03-365-5049J r044-655-3021J r045-321-6025J and displayed as shown in FIG.

続いて制御装置１は第１候補の相手先の電話番号に対応
する信号を回線インタフェース８を介して回線に送出す
ることにより自動ダイヤルを行う（ステップｈ）。この
際自動ダイヤル（発信）中を示すために表示部７の左側
に設けられた該当す発信中表示ランプ１２ａを点燈ある
いは点滅させる。尚、このような発信中表示ランプを設
けなくても、相手先名ダイヤル付属情報などの表示自身
を反転表示あるいは点滅表示させる等して発信中である
事を示しても良い。Subsequently, the control device 1 performs automatic dialing by sending a signal corresponding to the telephone number of the first candidate to the line via the line interface 8 (step h). At this time, in order to indicate that automatic dialing (calling) is in progress, the corresponding calling indicator lamp 12a provided on the left side of the display section 7 is turned on or blinks. Incidentally, even if such a call-in-progress indicator lamp is not provided, it is also possible to indicate that a call is in progress by displaying the information attached to the destination name and dial in reverse or blinking.

操作者は、このような発信中を示すランプの点滅等によ
り発信中の相手先が正しいかどうか確認を行う（ステッ
プｉ）。正しければ回線か接続されるのを待つ。しばら
くすると回線が接続され通話可能な状態（ステップｍ）
となる。一方、Ｉ１１手先がまちがっていた場合操作者
は表示部７に表示された残りの相手先名候補のなかから
希望する１１１手先名に対応して設けられている選択ボ
タンの押下を行う。選択ボタンの押下が行われると、そ
のボタンに応じて選択指示情報の入力処理が行われる（
ステップｊ）。この選択指示情報の入力により、まずス
テップｈの自動ダイヤルを中止（ステップｋ）Ｌ、続い
て選択された相手先の電話番号が情報記憶部６から読み
出され、続いて前記回線インターフェース８を介して上
記電話番号が送信されて自動ダイヤルされる（ステップ
ｌ）。The operator checks whether the calling party is correct by checking the blinking of the lamp indicating that the call is in progress (step i). If it is correct, wait for the line to be connected. After a while, the line will be connected and you can talk (step m)
becomes. On the other hand, if the I11 hand name is incorrect, the operator presses the selection button provided corresponding to the desired 111 hand name from among the remaining partner name candidates displayed on the display section 7. When a selection button is pressed, selection instruction information input processing is performed according to the button (
Step j). By inputting this selection instruction information, the automatic dialing in step h is first canceled (step k) L, then the telephone number of the selected destination is read out from the information storage section 6, and then the telephone number is read out from the information storage section 6, and then the telephone number The telephone number is sent and automatically dialed (step l).

これにより本電話機と相手先の電話機が交換機（図示せ
ず）を介して接続され（ステップｍ）送受話器２の入出
力音声は回線インターフェース８から交換機を介して相
手先の電話機とやりとりされることにより通話可能とな
る。As a result, this telephone and the other party's telephone are connected via an exchange (not shown) (step m), and the input/output audio of the handset 2 is exchanged with the other party's telephone from the line interface 8 via the exchange. It becomes possible to make a call.

かくして本実施例に係る電話機によれば、音声認識処理
による第１候補の相手先に対しては即自動ダイヤルされ
るので、いちいち選択ボタンを押すという煩わしさがな
くなる。Thus, according to the telephone set according to the present embodiment, the first candidate destination through voice recognition processing is automatically dialed immediately, eliminating the trouble of pressing the selection button one by one.

また、第１候補がまちがっていた場合でも第２候補、第
３候補の相手先名と電話番号を確認し、希望する相手先
名や電話番号を選択指示することにより自動的にダイヤ
ルすることができるので、音声認識処理を用いた自動ダ
イヤルによって生じる誤ダイヤルを防止することができ
る。Additionally, even if the first choice is incorrect, you can check the second and third candidate names and phone numbers, and then select and instruct the desired recipient name and phone number to automatically dial. Therefore, it is possible to prevent incorrect dialing caused by automatic dialing using voice recognition processing.

なお、本発明は上述した実施例に限定されるものではな
い。Note that the present invention is not limited to the embodiments described above.

例えば第２候補、第３候補を選択する場合、第５図に示
すように、それぞれに相当する選択ボタン１．１．　ａ
、ｌｌｂによる選択を行ったが、これに限らない。例え
ば第３候補までの認識候補として第６図に示す結果が得
られたとする。このとき次候補ボタンを押すことにより
発信の相手先が順次（第１候補、第２候補、第３候補、
第１候補、・・・）選択されても良いし、さらに第４候
補、第５候補、・・・、を表示し選択するようにしても
良い。For example, when selecting the second candidate and the third candidate, as shown in FIG. 5, the corresponding selection buttons 1.1. a
, llb, but the selection is not limited to this. For example, assume that the results shown in FIG. 6 are obtained as recognition candidates up to the third candidate. At this time, by pressing the next candidate button, the destination of the call will be selected in order (1st candidate, 2nd candidate, 3rd candidate,
The first candidate, . . . ) may be selected, or the fourth candidate, the fifth candidate, . . . may be further displayed and selected.

また、相手先名候補の選択については上述した選択ボタ
ンの利用には限定されない。例えば選択ボタンの代わり
に各表示項目に■、■、■の識別表示を行い、前記ボタ
ン入力部５のキーバッドによる該当キーの押下指示によ
ってその選択を行うようにしても良い。また「イチ」　
「二」　ＦサンＪ等の入力音声を認識してその選択情報
の入力を行うようにしても良い。Furthermore, the selection of destination name candidates is not limited to the use of the selection buttons described above. For example, instead of a selection button, each display item may be marked with ``■'', ``■'', and ``■'' for identification, and the selection may be made by pressing the corresponding key on the key pad of the button input section 5. “Ichi” again
``Second'' An input voice such as F San J may be recognized and the selection information may be input.

［発明の効果］以上説明したように本発明によれば、音声が入力される
と、複数の認識候補およびこれら各認識候補に対応する
候補順位が出力され、複数の認識候補が表示され、まず
、候補順位がもっとも高い認識候補に対し発呼が行われ
る。そして、この発呼動作が行われている最中に、他の
認識候補の選択が行われると、前記発呼動作が停止され
、選択された認識候補により発呼が行われる。このため
、音声認識処理を用いた自動ダイヤルによって生じる誤
ダイヤルを防止しつつ、操作性の向上を図ることのでき
る。[Effects of the Invention] As explained above, according to the present invention, when a voice is input, a plurality of recognition candidates and candidate rankings corresponding to these recognition candidates are output, a plurality of recognition candidates are displayed, and first, , a call is made to the recognition candidate with the highest candidate rank. If another recognition candidate is selected while this calling operation is being performed, the calling operation is stopped and a call is made using the selected recognition candidate. Therefore, it is possible to improve operability while preventing erroneous dialing caused by automatic dialing using voice recognition processing.

[Brief explanation of the drawing]

第１図は本発明を電話機に適用した実施例装置の概略構
成を示す機能ブロック図、第２図は実施例に係る電話の
表面パネルの構成例を示す図、第３図は情報記憶装置に
情報入力するカナに対するキーの割当て例を示す図、第
４図は実施例に係る処理手続きの流れを示す図、第５図
及び第６図はそれぞれ音声認識された相手先名候補とそ
の相手先の電話番号の表示例とその相手先の電話番号の
表示例とその選択指示の形態の例を示す図である。１・・・制御装置、２・・・送受話器、３・・・音声認
識部、４・・・認識辞書、５・・・ボタン入力部、６・
・・情報記憶部、７・・・表示部、８・・・回線インタ
フェース、１０・・・音声認識装置、ｌｌａ、１ユｂ・
・・選択ボタン、１２　ａ　ｓ　１２　ｂ　１１２　ｃ
・・・発信中表示ランプ。出願人　　　　　　株式会社　東芝代理人　弁理士　　須　山　佐　− 第４図ボタン光信中オ目′！！−九電話番号第５ボタン発信中、相乎先七繍号第６図FIG. 1 is a functional block diagram showing a schematic configuration of a device according to an embodiment of the present invention applied to a telephone, FIG. 2 is a diagram showing an example of the configuration of a front panel of a telephone according to an embodiment, and FIG. A diagram showing an example of assigning keys to kana for inputting information, FIG. 4 is a diagram showing the flow of processing procedures according to the embodiment, and FIGS. 5 and 6 respectively show speech-recognized destination name candidates and their destinations. FIG. 4 is a diagram illustrating an example of displaying a telephone number of a person, a display example of a telephone number of the other party, and an example of a form of a selection instruction. DESCRIPTION OF SYMBOLS 1... Control device, 2... Handset, 3... Voice recognition part, 4... Recognition dictionary, 5... Button input part, 6...
... Information storage unit, 7... Display unit, 8... Line interface, 10... Voice recognition device, lla, 1b.
...Selection button, 12 a s 12 b 112 c
...Transmission indicator lamp. Applicant Toshiba Corporation Patent Attorney Sasa Suyama - Figure 4 Button Mitsunobu Nakao'! ! - Nine telephone numbers 5th button dialing, Xianxian 7th number 6th figure

Claims

[Claims]

(1) A voice input means into which voice is input, and a voice recognition system that recognizes the voice input by this voice input means and outputs a plurality of recognition candidates corresponding to the voice and a candidate ranking corresponding to each of these recognition candidates. means, a display means for displaying two or more recognition candidates out of the plurality of recognition candidates output from the voice recognition means, and display means for displaying two or more recognition candidates out of the plurality of recognition candidates output from the voice recognition means, based on the ranking candidates output from the voice recognition means. a first calling means that calls the recognition candidate with the highest candidate rank among the recognition candidates that have been selected; and displaying the recognition candidates other than the recognition candidates to which the first calling means has called. selection means for making a selection based on the result displayed on the means; and when the selection by the selection means is made while the first calling means is performing a calling operation;
and a second calling means that stops the calling operation of the calling means and makes a call using the recognition candidate selected by the selection means.