JP2003513341A

JP2003513341A - System and method for increasing recognition rate of voice input command in telecommunications terminal

Info

Publication number: JP2003513341A
Application number: JP2001535162A
Authority: JP
Inventors: フェルトストレム，アルベルトディエゴジメネズ
Original assignee: テレフオンアクチーボラゲットエルエムエリクソン（パブル）
Priority date: 1999-11-04
Filing date: 2000-10-31
Publication date: 2003-04-08
Also published as: CN1387663A; WO2001033553A3; EP1226576A2; CN1191566C; AU1390501A; WO2001033553A2

Abstract

(57)【要約】遠隔通信端末の音声ダイヤリングの精度を高めるための方法と、その方法を用いる端末を開示する。望みの電話番号を示すアナログ音声入力をデジタル信号に変換する。自動音声認識モジュールは数字を認識し、その数字を示す出力信号を生成する。判断モジュールは、電話番号の１以上の数字が変換モジュールにより認識されなかったかどうかを判断するためのテストを行う。電話番号が認識されなかった数字を含む場合、検索モジュールは、ユーザーが入力した電話番号の認識された数字と一致する数字を有する電話番号を付随するメモリモジュールから検索する。一致したメモリからの電話番号を、視覚的に、又は音声によってユーザーに通知するようにしても良い。要望に応じて、遠隔端末はメモリモジュールから選ばれた電話番号に接続するようにしてもよい。 (57) [Summary] A method for improving the accuracy of voice dialing of a telecommunications terminal and a terminal using the method are disclosed. The analog voice input indicating the desired telephone number is converted to a digital signal. The automatic speech recognition module recognizes the number and generates an output signal indicative of the number. The determining module performs a test to determine whether one or more digits of the telephone number were not recognized by the conversion module. If the telephone number contains unrecognized digits, the search module searches the associated memory module for a telephone number having a number that matches the recognized number of the telephone number entered by the user. The user may be notified of the phone number from the matched memory either visually or by voice. If desired, the remote terminal may connect to a telephone number selected from the memory module.

Description

Detailed Description of the Invention

【０００１】背景[0001] background

【０００２】本発明は通信装置における音声入力の認識に関し、更に詳しくは遠隔通信端末
における音声ダイヤリングシステムの精度を高めるためのシステム及び方法に関
する。The present invention relates to recognition of voice input in a communication device, and more particularly to a system and method for increasing the accuracy of a voice dialing system in a telecommunications terminal.

【０００３】例えば移動電話機などの遠隔通信端末は、多くの現代産業国においてユビキタ
スである。遠隔通信端末のほとんどは、入力装置としてキーパッドを用いている
。しかし、キーパッドには欠点がある。まず、キーパッドを使うためには、たと
え短い時間ではあってもユーザーは通信装置に注意を向けなければならない。例
えば運転中など特定の場合には、これは望ましいことではない。また市場の力は
、ハンドセットとも呼ばれるより小さい遠隔電話端末装置を製造するように、間
断無く製造者を駆り立てている。端末装置が小型化するとキーパッドエラーが起
こりやすくなり、入力装置としてのキーパッドの精度が下がる。Telecommunication terminals, such as mobile phones, are ubiquitous in many modern industrial countries. Most telecommunications terminals use a keypad as an input device. However, keypads have drawbacks. First, in order to use the keypad, the user must pay attention to the communication device, even for a short time. In certain cases, such as while driving, this is not desirable. Market forces are also continually driving manufacturers to produce smaller remote telephone terminals, also called handsets. When the terminal device is downsized, a keypad error is likely to occur and the accuracy of the keypad as an input device is lowered.

【０００４】製造業者は、音声入力を受け付け、入力を認識し、その入力に基づいて動作す
る音声による入力装置を実現した。例えば、Kuniyoshiの米国特許第4,959,850号
では、電話の音声ダイヤリングのための音声認識能力を有する無線電話装置を開
示している。同様に、Sakanishiの米国特許第5,042,063号及びGerson等の米国特
許第4,870,686号は、音声ダイヤリングを可能にするために音声認識能力を利用
した電話装置を開示している。音声認識機能は、Willの米国特許第5,917,891号
、Maekawa等の米国特許第5,884,257号、Eting等の米国特許第5,651,056号、Mead
orの米国特許第5,638,425号、Petersonの米国特許第5,509,049号、Jakatdarの米
国特許第5,495,553、そして、Hunt等の米国特許第5,303,299にも開示されている
。Manufacturers have implemented voice input devices that accept voice input, recognize the input, and act on the input. For example, Kuniyoshi U.S. Pat. No. 4,959,850 discloses a wireless telephone device having voice recognition capability for voice dialing of a telephone. Similarly, Sakanishi U.S. Pat. No. 5,042,063 and Gerson et al. U.S. Pat. No. 4,870,686 disclose telephone devices that utilize voice recognition capabilities to enable voice dialing. Speech recognition features are described in Will U.S. Pat.No. 5,917,891, Maekawa et al. U.S. Pat.No. 5,884,257, Eting et al. U.S. Pat.No. 5,651,056, Mead
or US Pat. No. 5,638,425, Peterson US Pat. No. 5,509,049, Jakatdar US Pat. No. 5,495,553, and Hunt et al. US Pat. No. 5,303,299.

【０００５】しかし、音声認識とは難しいもので、特に、自動車の音や雑踏と言った周辺環
境からの雑音が音声信号に混ざると難しい。発音が不十分だったり、周辺の雑音
が邪魔になったりすると、装置がユーザーの音声を認識できないことがある。音
声ダイヤリングに適用した場合には、電話装置が間違った番号をダイヤルしてし
まうことになる。または、電話装置が認識できない数字または数字列全部を繰り
返すようにユーザーに促すこともできる。音声認識システムの精度によってはユ
ーザーはかなりの確率で番号を繰り返さなければならず、音声ダイヤリングがユ
ーザーにとってあまり便利なものではなくなってしまう。However, voice recognition is difficult, especially when noise from the surrounding environment such as automobile sounds and crowds is mixed in the voice signal. The device may not be able to recognize the user's voice if the pronunciation is insufficient or if ambient noise interferes. If applied to voice dialing, the telephone device would dial the wrong number. Alternatively, the user may be prompted to repeat all numbers or sequences of numbers that the telephone device does not recognize. Depending on the accuracy of the voice recognition system, the user must repeat the number with a high probability, making voice dialing less convenient for the user.

【０００６】従って、音声ダイヤルシステム及び方法を向上するための技術が求められてい
る。Therefore, there is a need for techniques to improve voice dialing systems and methods.

【０００７】概略[0007] Outline

【０００８】本発明は、移動電話機を含む遠隔通信端末の音声ダイヤリングを容易にするた
めの装置及び方法を提供することで、上記及びその他の問題を解決する。本発明
によれば、遠隔端末は、音声認識ルーチンの精度を高めるためにメモリに格納さ
れた情報を用いる。好ましくはその情報はその遠隔端末から以前にかけた電話番
号に関する先験的な情報であり、音声認識システムの精度を高めるために、音声
ダイヤリング方法によって入力された電話番号と照合することができる。The present invention solves these and other problems by providing an apparatus and method for facilitating voice dialing of telecommunications terminals, including mobile telephones. According to the invention, the remote terminal uses the information stored in the memory to enhance the accuracy of the speech recognition routine. Preferably the information is a priori information about a phone number previously placed from the remote terminal and can be matched with the phone number entered by the voice dialing method to increase the accuracy of the voice recognition system.

【０００９】一様態によれば、本発明は通信装置の音声ダイヤリングを容易にするためのシ
ステムを提供する。システムは、入力文字列を示す音声入力を受けてその入力文
字列の各文字を示す信号を生成する変換モジュールと、入力文字列が認識されな
かった文字を含むかどうかを判断する判断モジュールと、ネットワークアドレス
に対応する複数の文字列を含むメモリモジュールと、入力文字列の中の認識され
た文字に対応する文字を含む文字列をメモリモジュールから検索する検索モジュ
ールとを有する。使用にあたって、変換モジュールが入力文字列の中の１以上の
文字を変換できない場合に、検索モジュールは、入力文字列中の認識された文字
に一致する文字を有する１以上の文字列をメモリモジュールから検索することが
できる。According to one aspect, the present invention provides a system for facilitating voice dialing of a communication device. The system includes a conversion module that receives a voice input indicating an input character string and generates a signal indicating each character of the input character string, and a determination module that determines whether the input character string includes an unrecognized character. The memory module includes a plurality of character strings corresponding to the network address, and a search module that searches the memory module for a character string including a character corresponding to the recognized character in the input character string. In use, if the conversion module is unable to convert one or more characters in the input string, the retrieval module retrieves from the memory module one or more strings having characters that match the recognized characters in the input string. You can search.

【００１０】別の様態によれば、本発明は通信装置の音声ダイヤリングを容易にする方法を
提供する。その方法は、望みの文字列を示す音声入力を受け、文字列の各文字を
示す信号を生成し、文字列が認識されなかった文字を含むかどうかかを判断し、
含む場合には入力された文字列中の認識された文字に対応する文字を有する、一
致する文字列をメモリモジュール内で検索し、一致する文字列を示す信号を生成
する工程を有する。According to another aspect, the present invention provides a method for facilitating voice dialing of a communication device. The method receives a voice input indicating a desired character string, generates a signal indicating each character of the character string, determines whether or not the character string includes an unrecognized character,
If included, the method includes searching the memory module for a matching character string having a character corresponding to the recognized character in the input character string and generating a signal indicating the matching character string.

【００１１】詳細な説明[0011] Detailed description

【００１２】今日用いられる多くのデジタル無線システムは、タイム・スロット・アクセス
・システムを用いている。ユーザー情報（例えば音声）は分割され、圧縮され、
パケット化されて、予め割り当てられたタイム・スロットで送信される。タイム
・スロットは異なるユーザに割り当てることができ、この手法は一般に時分割多
元接続（ＴＤＭＡ）と呼ばれている。ヨーロッパのＧＳＭ（Global System for
Mobile communications)システムや、北アメリカのＤ−ＡＭＰＳ(Digial-Advanc
ed Mobile Phone System)システム、日本のＰＤＣ(Personal Digital Cellular)
システムなどの時分割多元接続（ＴＤＭＡ）通信システムでは、複数の遠隔装置
が１つの無線周波数チャネルを共有することができるので、通信システムの容量
を増やすことができる。Many digital wireless systems in use today use time slot access systems. User information (eg voice) is split, compressed,
It is packetized and transmitted in pre-assigned time slots. Time slots can be assigned to different users and this approach is commonly referred to as Time Division Multiple Access (TDMA). GSM (Global System for Europe)
Mobile communications) system and North American D-AMPS (Digial-Advanc
ed Mobile Phone System) system, Japanese PDC (Personal Digital Cellular)
In a time division multiple access (TDMA) communication system, such as a system, multiple remote devices can share a single radio frequency channel, thus increasing the capacity of the communication system.

【００１３】以下、時分割多元接続（ＴＤＭＡ）無線通信システムに関して実施の形態を示
す。しかしながら、ＴＤＭＡ方式は単に説明のために記述するものであり、本発
明を周波数分割多元接続（ＦＤＭＡ）、ＴＤＭＡ、符号分割多元接続（ＣＤＭＡ
）、及び／又はこれらを組み合わせたものを含む全てのタイプのアクセス方式に
適用できることは、当業者であれば理解できるであろう。Hereinafter, embodiments will be described with respect to a time division multiple access (TDMA) wireless communication system. However, the TDMA scheme is described for illustrative purposes only, and the present invention is not limited to frequency division multiple access (FDMA), TDMA, code division multiple access (CDMA).
), And / or any combination thereof, as will be appreciated by those skilled in the art.

【００１４】ＧＳＭ規格に対応するセルラー通信システムの動作は、欧州電気通信標準化機
構（ＥＴＳＩ）文書ＥＴＳ３００５７３，ＥＴＳ３００５７４，ＥＴＳ３０
０５７８に記載されており、ここでは引例として挙げておく。従って、ＧＳＭ
システム例の動作は、ここでは簡単な説明に留める。本発明はＧＳＭシステムに
おける一例として記述するが、本発明を他の通信システムに利用できるというこ
とは、当業者であれば理解できるであろう。The operation of a cellular communication system compatible with the GSM standard is described in the European Telecommunications Standards Institute (ETSI) document ETS 300 573, ETS 300 574, ETS 30.
0 578, which is here cited as a reference. Therefore, GSM
The operation of the example system is limited to a brief description here. Although the present invention is described as an example in a GSM system, one of ordinary skill in the art will appreciate that the present invention can be used in other communication systems.

【００１５】図１には、本発明を実現可能な通信システム１０が示されている。システム１
０は通話を管理するための複数のレベルを有する階層ネットワークである。１組
のアップリンクとダウンリンク無線周波数を用いて、システム１０内で動作して
いる遠隔無線端末１２はこれらの周波数においてそれぞれに割り当てられている
タイムスロットを用いた通話を行う。上位の階層レベルでは、移動通信交換局（
ＭＳＣ）１４のグループが通話を発呼側から着信先へルーティングする。特に、
ＭＳＣ１４は呼のセットアップ、制御及び切断を行う。ＭＳＣ１４の１つは一般
的にゲートウェイＭＳＣと称され、公衆交換電話網（ＰＳＴＮ）１８との通信ま
たは、その他の公衆及び私設ネットワークとの通信を取り扱う。FIG. 1 shows a communication system 10 in which the present invention can be implemented. System 1
0 is a hierarchical network with multiple levels for managing calls. Using a set of uplink and downlink radio frequencies, remote wireless terminals 12 operating in system 10 make calls using their assigned time slots at these frequencies. At higher hierarchical levels, mobile switching centers (
A group of MSCs) 14 routes the call from the caller to the destination. In particular,
The MSC 14 is responsible for call setup, control and disconnection. One of the MSCs 14, commonly referred to as a gateway MSC, handles communication with the public switched telephone network (PSTN) 18 or with other public and private networks.

【００１６】各ＭＳＣ１４は、１以上の基地局コントローラ（ＢＳＣ）１６に接続されてい
る。ＧＳＭ規格では、ＢＳＣ１６は、ＣＣＩＴＴＮｏ．７信号方式の移動通信
応用部に基づく、Ａ−インターフェースとして知られる基準インターフェースに
よりＭＳＣ１４と通信する。Each MSC 14 is connected to one or more base station controllers (BSC) 16. According to the GSM standard, the BSC 16 has a CCITT No. It communicates with the MSC 14 through a reference interface known as an A-interface, which is based on the 7 signaling mobile communication application.

【００１７】各ＢＳＣ１６は、１以上の無線基地局装置（ＢＴＳ）２０を制御する。各ＢＴ
Ｓ２０は、１以上の通信セル２１のような特定の地域でサービスを提供をするた
めにアップリンク及びダウンリンク無線周波数（ＲＦチャネル）を使用する、１
以上の送受信機（ＴＲＸ）（不図示）を含む。ＢＴＳ２０は主に、各セル内でデ
ータバーストを遠隔局１２へ送信したり、遠隔局１２から受信するためのＲＦリ
ンクを提供する。一実施の形態では、多数のＢＴＳ２０が無線基地局（ＲＢＳ）
２２に含まれている。ＲＢＳ２２は、例えば、ＲＢＳ−２０００製品系列に応じ
て構成しても良い。それらの製品は、本発明の譲受人であるテレフオンアクチー
ボラゲットＬＭエリクソンにより提供されている。一例である遠隔局１２及びＲ
ＢＳ２２の実施に関する詳細については、Frondigh等による米国特許第5,909,46
9号を参照されたい。Each BSC 16 controls one or more wireless base station devices (BTS) 20. Each BT
S20 uses uplink and downlink radio frequencies (RF channels) to provide service in a particular area, such as one or more communication cells 21, 1
The above transceiver (TRX) (not shown) is included. BTS 20 primarily provides an RF link for transmitting and receiving data bursts to remote station 12 within each cell. In one embodiment, multiple BTSs 20 are radio base stations (RBSs).
22 included. The RBS 22 may be configured according to the RBS-2000 product series, for example. Those products are provided by Telefon Acty Boraget LM Ericsson, the assignee of the present invention. An example remote station 12 and R
For more information regarding the implementation of BS22, see Frondigh et al., US Pat. No. 5,909,46.
See issue 9.

【００１８】図２は本発明において用いられる遠隔端末２００の概略を示す。遠隔端末２０
０は例えばＧＳＭシステム、ＰＤＣシステム、又はＤ−ＡＭＰＳシステムと言っ
たデジタルＴＤＭＡセルラー通信システムで用いられる移動電話機であることが
好ましい。しかし、上述の通り、本発明は全てのタイプのアクセスシステムに適
用可能であり、ＴＤＭＡやＣＤＭＡシステム、又はこれらを組み合わせたものに
容易に応用することができる。遠隔端末は広く知られており、すでに市販されて
いる。従って、本発明に関する遠隔端末２００の様態についてのみ詳細に説明す
る。遠隔端末についての追加情報については、Dent等による米国特許第5,745,52
3号を参照されたい。FIG. 2 schematically shows a remote terminal 200 used in the present invention. Remote terminal 20
0 is preferably a mobile telephone used in a digital TDMA cellular communication system, eg GSM system, PDC system or D-AMPS system. However, as described above, the present invention is applicable to all types of access systems and can be easily applied to TDMA and CDMA systems, or a combination thereof. Remote terminals are widely known and already on the market. Therefore, only the aspect of the remote terminal 200 relating to the present invention will be described in detail. For additional information on remote terminals, see US Pat. No. 5,745,52 by Dent et al.
See issue 3.

【００１９】図２において、遠隔端末２００は、本発明に直接関係する部分として、電話の
ユーザーからの音声入力を受けるためのマイク２１０を有する。マイク２１０は
変換モジュール２２０に接続されている。変換モジュール２２０は、アナログ音
声入力をデジタル信号に変換するためのアナログ・デジタル（Ａ／Ｄ）変換器２
２４を有する。変換モジュール２２０は、更に、ユーザーの音声を認識するため
の自動音声認識（ＡＳＲ）モジュール２２８を含む。また、遠隔端末２００は、
ユーザーが話した文字が望みの精度をもってＡＳＲモジュール２２８により認識
されたかどうかを判断するための判断モジュール２３０を含む。遠隔端末２００
は更に、有効な電話番号を示す文字列を格納するためのメモリモジュール２５０
と、メモリモジュール２５０を検索するための検索モジュールとを含む。遠隔端
末２００はまた、例えば図１に示すようなＧＳＭネットワークなどの通信ネット
ワークとの通信接続を確立するための接続モジュール２６０を含む。更に、遠隔
端末２００は、ユーザー向けに情報を表示する適切な表示器２７０（例えば、Ｌ
ＥＤまたはＬＣＤ表示器）を有する。適切な音声認識モジュールを有する端末の
１つとして、Ｔ２８がエリクソンから発売されている。In FIG. 2, the remote terminal 200 has a microphone 210 for receiving voice input from a telephone user, as a portion directly related to the present invention. The microphone 210 is connected to the conversion module 220. The conversion module 220 is an analog-to-digital (A / D) converter 2 for converting an analog voice input into a digital signal.
With 24. The conversion module 220 further includes an automatic speech recognition (ASR) module 228 for recognizing a user's voice. In addition, the remote terminal 200
A decision module 230 is included to determine whether the characters spoken by the user have been recognized by the ASR module 228 with the desired accuracy. Remote terminal 200
Further includes a memory module 250 for storing a string indicating a valid telephone number.
And a search module for searching the memory module 250. The remote terminal 200 also includes a connection module 260 for establishing a communication connection with a communication network, such as the GSM network as shown in FIG. In addition, the remote terminal 200 may include an appropriate indicator 270 (eg, L
ED or LCD display). The T28 is available from Ericsson as one of the terminals with a suitable voice recognition module.

【００２０】モジュール２２０〜２６０の一部又は全ては、適切な特定用途向け集積回路（
ＡＳＩＣ）や、プログラムされたデジタル信号プロセッサや、複数のＡＳＩＣを
含むチップセットの形態で実現することができる。モジュール２２０〜２６０及
び遠隔端末のその他の構成要素は電気的に接続される。例えば、判断モジュール
２３０と検索モジュール２４０は表示部２７０、スピーカー２８０、及び接続モ
ジュール２６０に電気的に接続されている。Some or all of the modules 220-260 may include any suitable application specific integrated circuit (
ASIC), a programmed digital signal processor, or a chipset containing multiple ASICs. The modules 220-260 and other components of the remote terminal are electrically connected. For example, the determination module 230 and the search module 240 are electrically connected to the display unit 270, the speaker 280, and the connection module 260.

【００２１】加えて、好適な実施の形態では、メモリモジュール２５０と接続モジュール２
６０の電気的接続により、遠隔端末により確立した接続に関する電話番号をメモ
リモジュール２５０に格納することができる。例えば、遠隔端末２００でユーザ
が電話番号を入力する度にその番号がメモリモジュール２５０に格納される。こ
のようにして、後述するようにメモリモジュール２５０は、音声ダイヤリングの
精度を高めるために先験的な情報として用いることのできる、以前にかけた電話
番号のリストを維持する。In addition, in the preferred embodiment, the memory module 250 and the connection module 2
The electrical connection of 60 allows the telephone number associated with the connection established by the remote terminal to be stored in the memory module 250. For example, each time the user inputs a telephone number at the remote terminal 200, the number is stored in the memory module 250. In this way, the memory module 250 maintains a list of previously called telephone numbers that can be used as a priori information to enhance the accuracy of voice dialing, as described below.

【００２２】図３は、本発明の実施の形態における音声ダイヤリングの方法を示す。この方
法の概要としては、図３に示すように、ユーザーが話した文字を受け、その文字
をデジタル信号に変換し、文字列が完成しているかどうかを判断する。文字列が
完成していなければ、システムは追加される文字を繰り返し受け取ってデジタル
信号に変換する。完全な文字列を受け取った後、システムは文字列中に１以上の
認識されなかった文字が含まれるかどうかを判断する。文字列が認識されなかっ
た文字を含まない場合、その文字列を、認識された文字列に対応する番号を電話
がダイヤルできるようにするモジュール（例えば接続モジュール）に送る。文字
列が１以上の認識されなかった文字を含む場合、検索モジュールを呼び出す。検
索モジュールは、文字列中の認識された数字を、付随するメモリ内の文字列の対
応する数字と比較し、メモリ内の文字列が、ユーザーが入力した文字列と一致し
ているようであるかどうかを判断する。一致しているようであると判断された場
合、その文字列は、認識された文字列に対応する番号を電話がダイヤルできるよ
うにするモジュールに送られる。または、文字列を表示したり、音声により電話
のユーザーに知らせるようにしても良く、その場合、ユーザーはその文字列が実
際に望みの文字列に一致しているかどうかを指示することができる。以下、この
処理を詳細に説明する。FIG. 3 shows a method of voice dialing in the embodiment of the present invention. As an outline of this method, as shown in FIG. 3, a character spoken by the user is received, the character is converted into a digital signal, and it is determined whether or not the character string is completed. If the string is not complete, the system repeatedly receives the added characters and converts them into a digital signal. After receiving the complete string, the system determines if the string contains one or more unrecognized characters. If the string does not contain unrecognized characters, it is sent to a module that allows the telephone to dial the number corresponding to the recognized string (eg, a connection module). If the string contains one or more unrecognized characters, call the search module. The search module compares the recognized number in the string with the corresponding number in the associated in-memory string, and the in-memory string appears to match the string entered by the user. Determine if If it is determined that there is a match, the string is sent to the module that allows the phone to dial the number corresponding to the recognized string. Alternatively, the string may be displayed or voiced to inform the user of the phone, in which case the user may indicate whether the string actually matches the desired string. Hereinafter, this process will be described in detail.

【００２３】一実施の形態では、図３に示す処理は、例えば音声ダイヤリングできる移動電
話機などの遠隔通信端末上で実現される。図３のステップ３１０において、音声
ダイヤリング機能が使用可能になり、遠隔端末は文字列の１番目の文字を示す音
声入力を受け取る。米国では、好ましくはその文字が公知の１０桁の電話番号フ
ォーマット（例えば、ＸＸＸ−ＸＸＸ−ＸＸＸＸ）の１つの数字を示す。しかし
、文字列は異なる地域の電話番号システム用のフォーマットであったり、または
データアプリケーションにおいてはデータネットワークのネットワークアドレス
（例えば、ＵＲＬやＩＰアドレス）を示すものであっても良い。または、文字列
は遠隔端末向けのコマンドを示すコマンドであっても、高速ダイヤリングのため
の番号を含むメモリ位置を示すものであっても良い。In one embodiment, the process shown in FIG. 3 is implemented on a telecommunications terminal, such as a mobile telephone capable of voice dialing. In step 310 of Figure 3, the voice dialing feature is enabled and the remote terminal receives a voice input indicating the first character of the string. In the United States, the letters preferably refer to a single digit in the well-known ten digit telephone number format (eg, XXX-XXX-XXXX). However, the character string may be in a format for a telephone number system of a different area, or may indicate a network address (eg, URL or IP address) of a data network in a data application. Alternatively, the character string may be a command indicating a command for a remote terminal or a memory location including a number for high speed dialing.

【００２４】ステップ３２０において、受け取った文字は、ユーザーが話した文字を示すデ
ジタル信号に変換される。変換は、アナログ／デジタル（Ａ／Ｄ）変換器を適切
なＡＳＲモジュールと併せて用いることで行うことができる。多くのＡＳＲモジ
ュールを用いることで、ある文字に対して為された判定の信頼性の計量を報告す
るための、統計的処理を実行することが可能になる。望みの信頼性率はＡＳＲモ
ジュールのロジック内にプログラムしても、ユーザー選択可能にし、パラメータ
としてシステムに入力するようにしても良い。ＡＳＲモジュールは公知の技術で
あり、ＡＳＲモジュールの詳細は本発明を左右するものではない。In step 320, the received characters are converted into a digital signal that represents the characters spoken by the user. The conversion can be performed using an analog / digital (A / D) converter in combination with a suitable ASR module. The use of many ASR modules makes it possible to carry out statistical processes for reporting a measure of the reliability of the decisions made on a character. The desired reliability rate may be programmed into the logic of the ASR module or it may be user selectable and entered into the system as a parameter. The ASR module is a known technique, and details of the ASR module do not influence the present invention.

【００２５】ステップ３３０において、文字列の入力が完了したかどうかを判断するテスト
を行う。例えば、１０文字フォーマットを用いる米国の電話システムでは、１０
個目の文字入力をもって文字列が完成したと判断される。別の例としては、判断
ステップにおいてタイムアウト処理を利用し、特定文字入力後、所定時間の経過
をもって文字列が完成したものと見なす。また別の例としては、指定キーを押下
したり、指定コードを話すことにより、文字列が終了したことをユーザーが積極
的に示すようにしてもよい。当業者であれば、入力文字列の終了を検知する多く
の方法を認めることができるであろう。文字列が完成していない場合、文字列が
完成するか、ユーザーが音声入力処理を中止する旨を指示するまで、ステップ３
１０から３３０を繰り返す。In step 330, a test is performed to determine if the character string input is complete. For example, in the US telephone system using the 10 character format, 10
It is determined that the character string is completed by the input of the character of the number. As another example, a time-out process is used in the determination step, and it is considered that the character string is completed when a predetermined time elapses after the specific character is input. As another example, the user may positively indicate the end of the character string by pressing a designated key or speaking a designated code. One of ordinary skill in the art will recognize many ways to detect the end of an input string. If the character string is not completed, until the character string is completed, or until the user instructs to stop the voice input process, step 3
Repeat from 10 to 330.

【００２６】文字列が完成したと判断されると、ステップ３４０において文字列が１以上の
認識されなかった文字を含むかどうかを判断するテストを行う。ここで言う「認
識されなかった文字」とは、ＡＳＲモジュールにより確認されなかった文字列中
の文字を指す。一実施の形態では、システムは、文字列中の１以上の文字に関す
る信頼性の計量が所定閾値（例えば、９５％又は９０％）より低いかどうかを判
断するテストを行い、低い場合に、その文字列が認識されなかった文字を有する
ものとしてもよい。更にテストを追加して行っても良い。例えば、２文字に関す
る信頼性の計量が所定閾値よりも低い場合に、その文字列は認識されなかった文
字を有するものとしてもよい。Once the string is determined to be complete, a test is performed at step 340 to determine if the string contains one or more unrecognized characters. The "unrecognized character" here refers to a character in a character string that is not confirmed by the ASR module. In one embodiment, the system performs a test to determine if the confidence metric for one or more characters in the string is below a predetermined threshold (eg, 95% or 90%), and if so, The character string may have unrecognized characters. Additional tests may be added. For example, if the reliability metric for two characters is less than a predetermined threshold, then the string may have unrecognized characters.

【００２７】もし、文字列が認識されなかった文字を含まなければ、ステップ３８０におい
てその文字列をダイヤルし、遠隔端末２００はネットワークと接続するように試
みる。If the string does not contain any unrecognized characters, the string is dialed in step 380 and the remote terminal 200 attempts to connect to the network.

【００２８】文字列が認識されなかった文字を含む場合、ステップ３５０において、遠隔端
末に付随するメモリモジュールを検索し、メモリモジュール内の文字列が、ユー
ザーにより入力された文字列の認識された文字と一致するかどうかを判断する。
ステップ３６０において一致する場合、その文字列をメモリから取得し、ステッ
プ３７０でユーザーに示すが、これは必ずしも行わなくても良い。一実施の形態
では、文字列は、ＬＣＤやその他適切な表示器上に表示するなどして、視覚的に
ユーザに提示される。別の形態では、音声合成機を用いて音声によりその文字列
をユーザーに提示する。ユーザーから承認の指示を受け取ると、その文字列はス
テップＳ３８０でダイヤルされる。If the character string includes unrecognized characters, the memory module attached to the remote terminal is searched in step 350, and the character string in the memory module is the recognized character of the character string input by the user. To see if it matches.
If there is a match in step 360, the string is retrieved from memory and presented to the user in step 370, although this need not be the case. In one embodiment, the string is visually presented to the user, such as by being displayed on an LCD or other suitable display. In another form, the character string is presented to the user by voice using a voice synthesizer. Upon receiving the approval instruction from the user, the character string is dialed in step S380.

【００２９】ステップ３１０〜３８０の一部又は全ては、適切なＡＳＩＣ、ＤＳＣ又はチッ
プセット、又は汎用プロセッサ上で動作している論理命令により行うことができ
る。Some or all of steps 310-380 may be performed by logic instructions running on a suitable ASIC, DSC or chipset, or general purpose processor.

【００３０】本発明は、２〜３の実施の形態に基づいて詳細に説明した。しかしながら、当
業者であれば、本発明から離脱することなく、様々な変形が可能であることは明
らかであろう。従って、本発明は添付の請求項によってのみ定義され、その同等
の構成は全て本発明に包含されるものである。The present invention has been described in detail based on a few embodiments. However, it will be apparent to those skilled in the art that various modifications can be made without departing from the present invention. Therefore, the present invention is defined only by the appended claims, and all equivalent constructions are included in the present invention.

[Brief description of drawings]

本発明の目的、特徴及び利点は、上記の詳細な記述を以下の図面と合わせ読む
ことでより明らかになるであろう。The objects, features and advantages of the present invention will become more apparent when the above detailed description is read in conjunction with the following drawings.

【図１】図１は、本発明を実現するのに適したＧＳＭ通信例を示すブロック図である。[Figure 1] FIG. 1 is a block diagram showing an example of GSM communication suitable for implementing the present invention.

【図２】図２は、本発明の実施の形態における通信装置での音声発呼を容易にするため
の方法を示すフローチャートである。FIG. 2 is a flowchart showing a method for facilitating a voice call in a communication device according to an embodiment of the present invention.

【図３】図３は、本発明の実施の形態における遠隔通信端末の概略図である。[Figure 3] FIG. 3 is a schematic diagram of a telecommunications terminal in the embodiment of the present invention.

───────────────────────────────────────────────────── フロントページの続き (81)指定国ＥＰ(ＡＴ，ＢＥ，ＣＨ，ＣＹ，ＤＥ，ＤＫ，ＥＳ，ＦＩ，ＦＲ，ＧＢ，ＧＲ，ＩＥ，ＩＴ，ＬＵ，ＭＣ，ＮＬ，ＰＴ，ＳＥ)，ＯＡ(ＢＦ，ＢＪ，ＣＦ，ＣＧ，ＣＩ，ＣＭ，ＧＡ，ＧＮ，ＧＷ，ＭＬ，ＭＲ，ＮＥ，ＳＮ，ＴＤ，ＴＧ)，ＡＰ(ＧＨ，ＧＭ，ＫＥ，ＬＳ，ＭＷ，ＭＺ，ＳＤ，ＳＬ，ＳＺ，ＴＺ，ＵＧ，ＺＷ)，ＥＡ(ＡＭ，ＡＺ，ＢＹ，ＫＧ，ＫＺ，ＭＤ，ＲＵ，ＴＪ，ＴＭ)，ＡＥ，ＡＧ，ＡＬ，ＡＭ，ＡＴ，ＡＵ，ＡＺ，ＢＡ，ＢＢ，ＢＧ，ＢＲ，ＢＹ，ＢＺ，ＣＡ，ＣＨ，ＣＮ，ＣＲ，ＣＵ，ＣＺ，ＤＥ，ＤＫ，ＤＭ，ＤＺ，ＥＥ，ＥＳ，ＦＩ，ＧＢ，ＧＤ，ＧＥ，ＧＨ，ＧＭ，ＨＲ，ＨＵ，ＩＤ，ＩＬ，ＩＮ，ＩＳ，ＪＰ，ＫＥ，ＫＧ，ＫＰ，ＫＲ，ＫＺ，ＬＣ，ＬＫ，ＬＲ，ＬＳ，ＬＴ，ＬＵ，ＬＶ，ＭＡ，ＭＤ，ＭＧ，ＭＫ，ＭＮ，ＭＷ，ＭＸ，ＭＺ，ＮＯ，ＮＺ，ＰＬ，ＰＴ，ＲＯ，ＲＵ，ＳＤ，ＳＥ，ＳＧ，ＳＩ，ＳＫ，ＳＬ，ＴＪ，ＴＭ，ＴＲ，ＴＴ，ＴＺ，ＵＡ，ＵＧ，ＵＺ，ＶＮ，ＹＵ，ＺＡ，ＺＷ─────────────────────────────────────────────────── ─── Continued front page (81) Designated countries EP (AT, BE, CH, CY, DE, DK, ES, FI, FR, GB, GR, IE, I T, LU, MC, NL, PT, SE), OA (BF, BJ , CF, CG, CI, CM, GA, GN, GW, ML, MR, NE, SN, TD, TG), AP (GH, GM, K E, LS, MW, MZ, SD, SL, SZ, TZ, UG , ZW), EA (AM, AZ, BY, KG, KZ, MD, RU, TJ, TM), AE, AG, AL, AM, AT, AU, AZ, BA, BB, BG, BR, BY, BZ, C A, CH, CN, CR, CU, CZ, DE, DK, DM , DZ, EE, ES, FI, GB, GD, GE, GH, GM, HR, HU, ID, IL, IN, IS, JP, K E, KG, KP, KR, KZ, LC, LK, LR, LS , LT, LU, LV, MA, MD, MG, MK, MN, MW, MX, MZ, NO, NZ, PL, PT, RO, R U, SD, SE, SG, SI, SK, SL, TJ, TM , TR, TT, TZ, UA, UG, UZ, VN, YU, ZA, ZW

Claims

[Claims]

1. A system for facilitating voice dialing of a communication device, comprising: a conversion module that receives voice input indicating an input character string and generates a signal indicating each character of the input character string. A determination module for determining whether the input character string includes unrecognized characters, a memory module including a plurality of character strings corresponding to network addresses, and a memory module corresponding to the recognized characters in the input character string. A search module for searching a character string containing characters from the memory module, wherein the search module recognizes the input character string when the conversion module cannot convert one or more characters in the input character string. A system for retrieving from the memory module one or more character strings having characters that match the retrieved characters.

2. The system of claim 1, wherein the conversion module includes an A / D converter for digitizing the received audio input signal.

3. The system according to claim 1, wherein the conversion module includes a voice recognition module that analyzes a digital signal and generates a signal indicating a character string represented by the digital signal.

4. The conversion module generates a signal indicating a reliability level regarding the accuracy of conversion, and the determination module generates a signal indicating whether the reliability level is higher than a predetermined threshold value. The system of claim 1, wherein:

5. The system of claim 1, wherein the conversion module and the decision module are implemented in a digital signal processor.

6. The system according to claim 1, further comprising an output module for generating a signal indicating a character string in the memory.

7. The system according to claim 6, further comprising a display module for displaying the character string indicated by the signal generated by the output module.

8. The system according to claim 6, further comprising a module for audibly notifying a character string indicated by a signal generated by the output module.

9. The system according to claim 1, further comprising a connection module for making a connection with the character string indicated by the signal generated by the output module.

10. A method for facilitating a voice call in a communication device, wherein a voice input indicating a desired character string is received, a signal indicating each character of the character string is generated, and the character string is recognized. It is determined whether or not the character that does not exist is included, and if it does, a matching character string having a character corresponding to the recognized character in the input character string is searched in the memory module, and the matching character string is searched. A method comprising: producing a signal indicative.

11. The step of generating a signal indicating each character in the character string comprises:
The method of claim 1 including digitizing the received audio input signal.
The method described in 0.

12. The step of generating a signal indicating each character in the character string comprises:
12. The method according to claim 11, comprising the step of analyzing the digital signal and generating a signal indicating a character string represented by the digital signal.

13. The step of generating a signal indicating each character in the character string comprises:
The method of claim 10 including the step of generating a first signal indicative of a confidence level for the accuracy of the transformation.

14. The step of determining whether the character string includes an unrecognized character includes comparing the reliability level with a predetermined threshold value and indicating whether the reliability level is greater than a predetermined threshold value. 14. The method of claim 13 including the step of generating a signal of

15. The method of claim 10, further comprising displaying the character string represented by the signal generated by the output module.

16. The method according to claim 10, further comprising the step of audibly notifying the character string indicated by the signal generated by the output module.