JPH05347664A

JPH05347664A - Voice dial recognition method

Info

Publication number: JPH05347664A
Application number: JP15494092A
Authority: JP
Inventors: Yoshio Kuboyama; 嘉男久保山
Original assignee: Nippon Telegraph and Telephone Corp
Current assignee: Nippon Telegraph and Telephone Corp
Priority date: 1992-06-15
Filing date: 1992-06-15
Publication date: 1993-12-27

Abstract

PURPOSE:To obtain a voice dial recognition method in which a communication a opposite terminal equipment is designated with high accuracy with simple configuration based on voice information of a language used usually. CONSTITUTION:For example, voice information for identifying a communication opposite terminal equipment and a terminal equipment number corresponding one by one to the voice information are stored respectively in a voice information registration memory 5 and a terminal number registration memory 7 respectively in advance in a voice terminal equipment accommodated into an exchange network. Then the voice information entered from a voice input section 1 and the voice information stored in advance are compared and discriminated by a voice information discrimination circuit 10 at dialing and a terminal equipment number corresponding to the registered voice information best coincident is sent to an exchange.

Description

Detailed Description of the Invention

【０００１】[0001]

【産業上の利用分野】本発明は、電話等の音声端末を収
容する交換網で使用される着信相手指定するためのダイ
ヤルを音声情報により入力し、それをもとに通信相手を
判別する音声ダイヤル認識方法に関し、特に簡易な構成
で誤ダイヤルを起こしにくい音声ダイヤル認識方法に関
する。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a voice for inputting a dial for designating a called party, which is used in a switching network accommodating a voice terminal such as a telephone, by voice information, and for discriminating a communication partner based on the dialed information. The present invention relates to a dial recognition method, and more particularly, to a voice dial recognition method with a simple structure that is less likely to cause erroneous dialing.

【０００２】[0002]

【従来の技術】従来の音声ダイヤル方式では、操作者が
通信相手端末番号を数字の音声情報として入力し、これ
を音声端末や交換機で認識し、数字に変換していたた
め、「イチ」や「ニ」といった極めて短い発音の連続の
各々がどの数字であるかを認識する必要があり、高度な
音声認識技術を必要とし、高価になるという欠点があ
り、かつ認識誤りが発生し易かった。また、音声であっ
ても、機械的な相手端末の番号を入力する必要があり、
日常使用されている言葉と異なる点があった。例えば、
「山田部長」に電話をかけるのに、「山田部長の番号は
１２３４番だから」と頭の中で、またはメモを見なが
ら、一度番号に変換してから、「イチ、ニ、サン、ヨ
ン」と発音する必要があった。図２は、従来の音声ダイ
ヤル方式で用いられる単語音声認識部の構成を示す図で
ある。図２において、２１は入力音声のスペクトル分析
部、２２は音素標準パターン部、２３は入力音声と音素
標準パターンとの類似度を計算する音素類似度計算部、
２４は計算結果を音素毎に時系列に蓄積する類似度行列
部、２５は音素列として蓄積されている単語辞書部、２
６は類似度行列と単語辞書の一致度を計算する単語マッ
チング部、２７は入力音声が最終的にどの単語であるか
を判定する単語判定部である。まず、入力された音声は
スペクトル分析部２１で一定の短い時間区間（１５ｍ
秒）毎にスペクトル分析され、母音、子音等の音素毎に
蓄積された標準スペクトル群（約４０種）の各々との類
似度が計算され、その結果が時間区間毎に順次、類似度
行列部２４に蓄積される。そして、入力音声の休止等に
より単語の区切りを判定すると、次に、音素群を時系列
に並べて構成した単語群と類似度行列の一致度を計算す
る。この際、発声者毎の単語発音時間長の違いはダイナ
ミックプログラミング（ＤＰ）マッチング法により時間
の長さを単語辞書の長さに合わせて計算する。さらに、
単語マッチング部２６で単語辞書の各単語との一致度を
計算し、その出力が最もよい一致度を示す単語を入力さ
れた単語であると単語判定部２７で判定する。すなわ
ち、従来方式によれば、入力された音声単語を標準パタ
ーンの音素群と比較して判定するため、発声者による発
音の個人差の吸収が困難で誤判定し易く、また、単語辞
書に記録された範囲の単語しか使用できない。また、単
語数を多くすれば、入力音声と辞書内の全ての単語との
一致計算をするため、膨大なハードウェア量を必要とす
るか、あるいは計算時間が膨大になる。さらに、音素類
似度計算や単語一致度計算は厳密に行なう程、認識率が
向上するため、認識率を１００％近くにするには、高度
な計算回路が必要で膨大なハードウェア量を要する。な
お、この種の方法については、例えば、「ディジタル音
声処理、古井貞煕著、東海大学出版会（１９８５
年）」に記載されている。2. Description of the Related Art In the conventional voice dial system, an operator inputs a communication partner terminal number as numeric voice information, which is recognized by a voice terminal or an exchange and converted into a number. It is necessary to recognize which number each of a series of extremely short pronunciations, such as "d", requires advanced voice recognition technology, has the drawback of being expensive, and is prone to recognition error. Also, even with voice, it is necessary to enter the number of the mechanical partner terminal,
There were differences from the words used in everyday life. For example,
When calling "Manager Yamada", "I have Yamada's number is 1234." In mind or while looking at the memo, convert it to a number and then "Ichi, Ni, San, Yong" I had to pronounce it. FIG. 2 is a diagram showing a configuration of a word voice recognition unit used in a conventional voice dialing method. In FIG. 2, 21 is a spectrum analysis unit of the input voice, 22 is a phoneme standard pattern unit, 23 is a phoneme similarity calculation unit that calculates the similarity between the input voice and the phoneme standard pattern,
Reference numeral 24 is a similarity matrix section that accumulates the calculation results in time series for each phoneme, 25 is a word dictionary section that is accumulated as a phoneme sequence, and 2
Reference numeral 6 is a word matching unit that calculates the degree of coincidence between the similarity matrix and the word dictionary, and 27 is a word determination unit that determines which word the input voice is finally in. First, the input voice is analyzed by the spectrum analysis unit 21 for a certain short time period (15 m).
Every second), the spectrum is analyzed, and the similarity with each of the standard spectrum groups (about 40 kinds) accumulated for each phoneme such as vowels and consonants is calculated, and the result is sequentially calculated for each time interval in the similarity matrix section. Stored in 24. Then, when the word break is determined by the pause of the input voice or the like, next, the degree of coincidence between the word group formed by arranging the phoneme groups in time series and the similarity matrix is calculated. At this time, the difference in word pronunciation time length for each speaker is calculated according to the length of the word dictionary by the dynamic programming (DP) matching method. further,
The word matching unit 26 calculates the degree of matching with each word in the word dictionary, and the word determining unit 27 determines that the word whose output shows the best matching degree is the input word. That is, according to the conventional method, since the input voice word is compared with the phoneme group of the standard pattern for determination, it is difficult to absorb the individual difference in pronunciation by the speaker, and it is easy to make an erroneous determination. Only the words in the specified range can be used. Further, if the number of words is increased, the calculation of matching between the input voice and all the words in the dictionary requires a huge amount of hardware, or the calculation time becomes huge. Further, the more rigorously the phoneme similarity calculation and the word coincidence calculation are, the higher the recognition rate is. Therefore, in order to bring the recognition rate close to 100%, an advanced calculation circuit is required and a huge amount of hardware is required. Note that this type of method is described in, for example, "Digital Speech Processing, Sadahiro Furui, Tokai University Press (1985).
Year)) ”.

【０００３】[0003]

【発明が解決しようとする課題】上記従来技術では、
（１）極めて短い発音の連続の各々がどの数字であるか
を認識する必要があり、認識装置が高価になる、（２）
音声であっても、機械的な相手端末番号を入力する必要
があり、日常使用される言葉と異なる、等の問題があっ
た。本発明の目的は、このような問題点を改善し、簡易
な構成で、日常使用している言葉の音声情報により、通
信相手端末を精度よく指定できる音声ダイヤル認識方法
を提供することにある。In the above prior art,
(1) It is necessary to recognize which number is in each sequence of extremely short pronunciations, which makes the recognition device expensive (2)
Even with voice, there is a problem that it is necessary to input a mechanical partner terminal number, which is different from the words used in daily life. An object of the present invention is to improve such problems and provide a voice dial recognition method with a simple configuration, which can accurately specify a communication partner terminal based on voice information of words used in daily life.

【０００４】[0004]

【課題を解決するための手段】上記目的を達成するた
め、本発明の音声ダイヤル認識方法は、音声端末または
交換機に、予め、通常よく通信する相手の端末番号とと
もに音声情報を登録、蓄積しておき、発呼時の入力音声
情報と比較して、最もよく一致した登録音声情報に対応
する端末番号を通信相手の端末番号とすることに特徴が
ある。In order to achieve the above object, the voice dial recognition method of the present invention is such that voice information is registered and stored in advance in a voice terminal or an exchange together with a terminal number of a party with whom communication is normally performed. Every other time, as compared with the input voice information at the time of making a call, the terminal number corresponding to the registered voice information that best matches is used as the terminal number of the communication partner.

【０００５】[0005]

【作用】本発明においては、音声情報の意味自体を認識
する必要はなく、同一の音声入力部を用いた登録時の音
声情報と発呼時の入力音声情報とを、各々音声情報全体
のまとまりとして比較し、一致の程度をみればよく、か
つ通常良く通信する相手は高々数十人程度であるため、
誤ダイヤルを極めて少なくでき、また、登録音声情報群
の中から入力音声情報との一致を検出する手段は、簡易
かつ安価な回路で容易に実現できる。このため、複雑な
構成の音声認識装置等を必要とせず、かつ日常使用して
いる多様な言葉をそのまま使用し、簡易かつ安価な構成
で誤ダイヤルの少ない音声ダイヤル認識方法を提供でき
る。In the present invention, it is not necessary to recognize the meaning itself of the voice information, and the voice information at the time of registration and the voice information at the time of making a call using the same voice input unit are respectively collected as a whole voice information. Therefore, it is only necessary to check the degree of agreement, and the number of people with whom communication is normally good is at most several dozens.
Erroneous dialing can be extremely reduced, and the means for detecting a match with the input voice information from the registered voice information group can be easily realized by a simple and inexpensive circuit. Therefore, it is possible to provide a voice dial recognition method that does not require a voice recognition device having a complicated configuration, uses a variety of words that are used everyday, and has a simple and inexpensive configuration with less erroneous dialing.

【０００６】[0006]

【実施例】以下、本発明の一実施例を図面により説明す
る。＜第１の実施例＞図１は、本発明の第１の実施例におけ
る音声端末の概略を示す構成図である。図１において、
１はマイクロホン等の音声入力部、２は通常の電話機等
で使用されているダイヤルボタン、３はアナログ音声情
報の特徴を抽出してディジタル化するためのコーダ、４
−１〜４−５は情報の登録、確認、変更等に使用する機
能スイッチ、５は音声情報群を登録、蓄積しておく音声
情報登録メモリ、６は音声情報登録メモリ内の各々のフ
ィールドに音声情報が登録されているか否かを管理する
管理メモリ、７は登録音声情報と１対１に対応する端末
番号を登録、蓄積しておく端末番号登録メモリ、８は音
声情報の登録および音声情報の呼び出しに使用するアド
レスカウンタ、９は発呼時の入力音声の一次的なバッフ
ァ、１０は発呼時の入力音声情報と登録された音声情報
との一致を判定する音声情報判定回路、１１は音声情報
判定回路１０の一致／不一致の出力と管理メモリ６の読
み出し出力からアドレスカウンタ８を制御するアドレス
制御回路、１２はアドレス制御回路１１からの制御によ
りアドレスカウンタ８の出力を端末番号登録メモリ７の
アドレスとして入力させるゲート回路、１３は音声情報
登録メモリ６への書き込みデータバッファ、１４は音声
情報登録メモリ６からの読み出しデータバッファ、１５
は端末番号登録メモリ７への書き込みデータのバッフ
ァ、１６は端末番号登録メモリ７からの読み出しデータ
のバッファ、１７は端末番号登録メモリ７の読み出し出
力を表示する表示回路、３１〜３５は音声情報線、３６
は音声情報登録メモリ６から呼び出された登録音声情報
線、４１〜４３はダイヤルボタン２のダイヤル情報線、
４４は端末番号登録メモリ７から呼び出された端末番号
情報線、４５は通信相手端末番号を送出するための相手
端末番号情報線、５１はアドレスカウンタ８の出力であ
るアドレス線、５２はアドレスカウンタ８の出力で管理
メモリ６のアドレス線、５３は入力音声情報線３５と登
録音声情報線３６の内容が一致した時に端末番号を呼び
出すための端末番号登録メモリ７のアドレス線、６１は
音声情報判定回路１０の出力で一致が検出された時に出
力する信号線、６２はアドレス制御回路１１の出力で、
音声情報判定回路１０で一致が検出された時に出力する
信号線、６３はアドレス制御回路１２の出力でアドレス
カウンタ８を制御するための信号線である。DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS An embodiment of the present invention will be described below with reference to the drawings. <First Embodiment> FIG. 1 is a block diagram showing the outline of a voice terminal according to a first embodiment of the present invention. In FIG.
Reference numeral 1 is a voice input unit such as a microphone, 2 is a dial button used in an ordinary telephone or the like, 3 is a coder for extracting the characteristics of analog voice information and digitizing it, 4
-1 to 4-5 are function switches used for registration, confirmation and change of information, 5 is a voice information registration memory for registering and accumulating voice information groups, and 6 is a field in the voice information registration memory. A management memory for managing whether or not voice information is registered, 7 is a terminal number registration memory for registering and storing registered voice information and a terminal number corresponding to one to one, and 8 is registration of voice information and voice information. An address counter used for calling the call, 9 is a temporary buffer of the input voice at the time of calling, 10 is a voice information determination circuit for determining whether the input voice information at the time of calling matches the registered voice information, and 11 is An address control circuit for controlling the address counter 8 based on the coincidence / non-coincidence output of the voice information judging circuit 10 and the read output of the management memory 6, and 12 is an address counter under the control of the address control circuit 11. Gate circuit for inputting the output of 8 as the address of the terminal number registration memory 7, 13 write data buffer to the audio information registration memory 6, 14 read data buffer from the speech information registration memory 6, 15
Is a buffer for writing data to the terminal number registration memory 7, 16 is a buffer for reading data from the terminal number registration memory 7, 17 is a display circuit for displaying the read output of the terminal number registration memory 7, and 31 to 35 are audio information lines. , 36
Is a registered voice information line called from the voice information registration memory 6, 41 to 43 are dial information lines of the dial buttons 2,
44 is a terminal number information line called from the terminal number registration memory 7, 45 is a partner terminal number information line for transmitting the communication partner terminal number, 51 is an address line output from the address counter 8, 52 is an address counter 8 Is output, the address line 53 of the management memory 6, 53 is the address line of the terminal number registration memory 7 for calling the terminal number when the contents of the input voice information line 35 and the registered voice information line 36 match, and 61 is the voice information determination circuit. A signal line output when a match is detected by the output of 10, and 62 is an output of the address control circuit 11,
Reference numeral 63 is a signal line output when the voice information judging circuit 10 detects a match, and 63 is a signal line for controlling the address counter 8 by the output of the address control circuit 12.

【０００７】次に、本実施例の動作について述べる。ま
ず、登録時においては、操作者は、まず機能スイッチ４
−１を動作させて音声入力部１の出力線３１を音声情報
登録メモリ６の書き込み線３３側へ切り替える。次に、
操作者は、登録すべき音声情報を音声入力部１を用いて
入力する。このとき、音声情報は最初の音量が一定レベ
ルに達した時から一定の時間だけに区切る方法が考えら
れる。また、音声端末からのガイダンス指示によって登
録動作をより確実に行なう方法も可能である。こうして
入力された音声情報の蓄積データの形式はコーダ３で決
められるが、音声情報の特徴が分別できる形式であれ
ば、どのような形式でもよい。例えば、従来技術の例で
示したようなスペクトル分析をして、その結果をそのま
まディジタルデータとして蓄積してもよいし、単に、Ｐ
ＣＭコーダ等でディジタル化して、そのまま蓄積しても
よい。本実施例の音声端末では、機能スイッチ４−１が
動作するとき、アドレスカウンタ８を動作させ、管理メ
モリ６を検索し、その内容が「０」のアドレスを検索し
ておき、入力された音声情報を音声情報メモリ５の当該
アドレスに書き込む。次に、機能スイッチ４−２を動作
させて、ダイヤルボタン２の出力線４１を端末番号登録
メモリ７の書き込み線４３側へ切り替え、登録すべき端
末番号をダイヤルボタン２を用いて入力し、アドレスカ
ウンタ８の出力で示される端末番号登録メモリ７のアド
レスに書き込む。次に、機能スイッチ４−５を「１」側
に動作させ、アドレスカウンタ８の出力で示される管理
メモリ６のアドレスに「１」を書き込む。また、発呼時
においては、機能スイッチ４−３，４−４を動作させ、
音声ダイヤル側に切り替える。これにより、操作者が発
声する音声は音声入力部１を経由し、音声情報線３１，
３２，３４を通って、音声情報バッファ９へ入力され
る。一方、登録しておいた音声情報がメモリ５から呼び
出され、バッファ４を経由して音声情報判定回路１０に
入力され、音声情報バッファ９の出力との一致が判定さ
れる。この判定方法は、ＤＰマッチング法により両者の
時間の長さを合わせて、両者の一致度合を計算し、例え
ば、９０％以上の一致があれば、一致したと判定する。
判定の結果、不一致の場合は、アドレス制御回路１１の
制御によりアドレスカウンタ８が更新され、次の蓄積音
声情報が呼び出されて音声情報判定回路１０へ入力され
る。操作者は既に音声ダイヤルとしてのまとまった音声
情報を発声し終わっているので、次の発声までに計算を
終了させる等の条件はなく、順次検索、計算により多少
時間がかかったとしても問題はなく、計算回路は少なく
て済む。また、一致した場合には、アドレス制御回路１
１からの制御でゲート１２が開けられ、アドレスカウン
タ８の出力が端末番号登録メモリ７のアドレスとして入
力され、一致した音声情報に対応した端末番号が呼び出
され、相手端末番号情報線４５を通じて交換機へ送出さ
れる。このとき、相手端末番号が表示回路１７に表示さ
れる。Next, the operation of this embodiment will be described. First, at the time of registration, the operator first sets the function switch 4
-1 is operated to switch the output line 31 of the voice input unit 1 to the write line 33 side of the voice information registration memory 6. next,
The operator uses the voice input unit 1 to input voice information to be registered. At this time, it is conceivable to divide the voice information into a certain time after the initial volume reaches a certain level. It is also possible to carry out the registration operation more reliably by the guidance instruction from the voice terminal. The format of the accumulated data of the voice information thus input is determined by the coder 3, but any format can be used as long as the features of the voice information can be distinguished. For example, the spectrum analysis as shown in the example of the prior art may be performed and the result may be directly stored as digital data.
It may be digitized by a CM coder or the like and stored as it is. In the voice terminal of the present embodiment, when the function switch 4-1 operates, the address counter 8 is operated, the management memory 6 is searched, and the address whose content is “0” is searched in advance, and the input voice is input. The information is written in the address of the voice information memory 5. Next, the function switch 4-2 is operated to switch the output line 41 of the dial button 2 to the writing line 43 side of the terminal number registration memory 7, and the terminal number to be registered is input using the dial button 2 to enter the address. Write to the address of the terminal number registration memory 7 indicated by the output of the counter 8. Next, the function switch 4-5 is operated to the "1" side, and "1" is written in the address of the management memory 6 indicated by the output of the address counter 8. When making a call, the function switches 4-3 and 4-4 are operated,
Switch to the voice dial side. As a result, the voice uttered by the operator passes through the voice input unit 1 and the voice information line 31,
It is input to the voice information buffer 9 through 32 and 34. On the other hand, the registered voice information is called from the memory 5, is input to the voice information determination circuit 10 via the buffer 4, and it is determined whether or not it matches the output of the voice information buffer 9. In this determination method, the DP matching method is used to match the lengths of time of both parties, and the degree of matching between the two is calculated.
If the result of determination is that they do not match, the address counter 8 is updated under the control of the address control circuit 11, and the next accumulated voice information is called and input to the voice information determination circuit 10. Since the operator has already uttered the complete voice information as a voice dial, there is no condition such as ending the calculation until the next utterance, and there is no problem even if it takes some time for sequential search and calculation. , The number of calculation circuits is small. If they match, the address control circuit 1
The gate 12 is opened by the control from 1, the output of the address counter 8 is input as the address of the terminal number registration memory 7, the terminal number corresponding to the matched voice information is called, and is sent to the exchange through the partner terminal number information line 45. Sent out. At this time, the partner terminal number is displayed on the display circuit 17.

【０００８】また、本実施例では、上記の機能に付随し
て、次に示す動作が可能である。音声情報群登録メモリ
５の登録内容を確認するには、機能スイッチ４−３を動
作させ、登録音声情報と同様の音声を音声入力回路１か
ら入力する。これにより、音声情報登録メモリ５を検索
し、一致した音声情報に対応した端末番号が表示回路１
７に表示される。また、登録内容を変更するには、同様
に、機能スイッチ４−３を動作させ、登録音声情報と同
様の音声を音声入力回路１から入力することにより、音
声情報登録メモリ５を検索し、一致した音声情報に対応
した端末番号を表示回路１７に表示し、さらに、機能ス
イッチ４−２を動作させ、ダイヤルボタン３から新しい
端末番号を入力する。さらに、登録を消去するには、機
能スイッチ４−３を動作させ、登録音声情報と同様の音
声を音声入力回路１から入力すると、音声情報登録メモ
リ５を検索し、一致を検出した音声情報登録メモリアド
レスに該当する管理メモリ６の内容を、機能スイッチ４
−５を「０」側に動作させて、「０」に書き替える。な
お、音声情報登録メモリエリアを全て検索しても、入力
音声情報が登録音声情報の何れとも一致しなかった場合
は、表示回路１７に端末番号が表示されないことで判別
可能であるが、アナウンス等を出力してもよい。また、
機能スイッチ４−１〜４−５は、その機能を満足するも
のであれば、音声端末に取り付けられたダイヤルボタン
２や他の機能スイッチとの共用であってもよく、また、
操作者の押下等によるスイッチの切り替え動作だけでな
く、音声端末内に設けられた何らかの制御回路からの制
御による切り替えであってもよい。さらに、本実施例で
は、登録音声情報メモリ５、管理メモリ６、登録端末番
号メモリ７は別メモリであるが、同一メモリ内にフィー
ルドを分割して持ってもよい。また、これらのメモリ部
分をＩＣメモリカードとし、本実施例の機能スイッチや
周辺回路部分を組み込んだＩＣカード音声端末に挿入し
て、本実施例と同様の効果を得ることも可能である。In addition, in the present embodiment, the following operation is possible in addition to the above functions. To confirm the registered contents of the voice information group registration memory 5, the function switch 4-3 is operated and the voice similar to the registered voice information is input from the voice input circuit 1. As a result, the voice information registration memory 5 is searched, and the terminal number corresponding to the matched voice information is displayed on the display circuit 1.
It is displayed on 7. Further, in order to change the registered contents, similarly, the function switch 4-3 is operated, and a voice similar to the registered voice information is input from the voice input circuit 1 to search the voice information registration memory 5 to find a match. The terminal number corresponding to the voice information is displayed on the display circuit 17, and the function switch 4-2 is operated to input a new terminal number from the dial button 3. Further, in order to delete the registration, the function switch 4-3 is operated, and when the same voice as the registered voice information is input from the voice input circuit 1, the voice information registration memory 5 is searched, and the voice information registration in which a match is detected is registered. The contents of the management memory 6 corresponding to the memory address are stored in the function switch 4
-5 is moved to the "0" side and rewritten to "0". Even if all the voice information registration memory areas are searched, if the input voice information does not match any of the registered voice information, it can be determined that the terminal number is not displayed on the display circuit 17, but an announcement etc. May be output. Also,
The function switches 4-1 to 4-5 may be shared with the dial button 2 attached to the voice terminal or other function switches as long as they satisfy the function.
Not only the switching operation of the switch by the pressing of the operator, but also the switching by the control from some control circuit provided in the voice terminal may be performed. Further, in this embodiment, the registered voice information memory 5, the management memory 6, and the registered terminal number memory 7 are separate memories, but the fields may be divided and held in the same memory. It is also possible to obtain the same effect as this embodiment by inserting these memory parts into an IC memory card and inserting them into an IC card voice terminal incorporating the function switch and peripheral circuit part of this embodiment.

【０００９】＜第２の実施例＞図３は、本発明の第２の
実施例における交換機の概略を示す構成図である。本実
施例では、交換機のダイヤル受信トランクの一種として
音声ダイヤル受信トランクを設けた場合を示す。図３に
おいて、１は音声端末における音声入力回路、２はダイ
ヤルボタン、１０５は音声情報登録メモリ、１０６は管
理メモリ、１０７は端末番号登録メモリ、１２０は交換
機の通話路部、１２１は交換機の制御装置、１２２は音
声ダイヤル受信トランク、１２３は音声ダイヤル受信ト
ランク１２２の内部制御回路、１２４は音声情報とダイ
ヤル情報の流れを切り換えるためのスイッチ、１２５は
音声端末内で音声情報とダイヤル情報の流れを切り換え
るためのスイッチである。本実施例の音声ダイヤル受信
トランクの内部構成は、基本的には図１に示した音声端
末での回路と同様であるが、特に、機能スイッチを切り
替え制御する内部制御回路１２３と音声情報とダイヤル
情報の流れを切り換えるスイッチ１２４を追加し、交換
機からの制御によりスイッチ動作を可能としたことと、
複数の使用者のために、メモリを追加し、使用者ごとの
メモリフィールドが指定できるように制御装置１２１か
らアドレスカウンタ１０８の値を設定できるようにした
ことが、第１の実施例と異なる。このような構成によ
り、音声端末の操作者が、オフフック等によって交換機
へ要求があることを知らせると、交換機はダイヤル音を
返し、ダイヤルの入力を促す。ここで、音声ダイヤルの
発呼、登録、変更、削除等であることを知らせるため
に、ダイヤルボタン２から予め決められた特定の番号を
入力する。交換機では、ダイヤルを通常のダイヤル受信
トランクで受信し、制御装置１２１に報告すると、制御
装置１２１は通話路部１２０の通話路スイッチを音声ダ
イヤル受信トランク１２２へ切り換える。その後は、第
１の実施例と同様にして、音声情報と端末番号の登録や
音声情報による発呼を行なう。なお、上記実施例では、
音声ダイヤルへの用途について述べたが、同様の音声情
報認識方法を、音声より各種機器へ指示を出して動作さ
せる音声コマンドへ適用することも可能である。<Second Embodiment> FIG. 3 is a block diagram showing the outline of an exchange according to a second embodiment of the present invention. In this embodiment, a voice dial receiving trunk is provided as a kind of dial receiving trunk of the exchange. In FIG. 3, 1 is a voice input circuit in a voice terminal, 2 is a dial button, 105 is a voice information registration memory, 106 is a management memory, 107 is a terminal number registration memory, 120 is a communication path section of an exchange, 121 is control of the exchange. A device, 122 is a voice dial receiving trunk, 123 is an internal control circuit of the voice dial receiving trunk 122, 124 is a switch for switching the flow of voice information and dial information, and 125 is a flow of voice information and dial information in the voice terminal. It is a switch for switching. The internal configuration of the voice dial receiving trunk of this embodiment is basically the same as the circuit in the voice terminal shown in FIG. 1, but in particular, the internal control circuit 123 for switching and controlling the function switches, voice information and dialing. A switch 124 for switching the flow of information is added, and the switch operation is enabled by the control of the exchange.
It differs from the first embodiment in that a memory is added for a plurality of users and the value of the address counter 108 can be set from the control device 121 so that a memory field for each user can be designated. With such a configuration, when the operator of the voice terminal informs the exchange of a request by off-hook or the like, the exchange returns a dial tone and prompts for dial input. Here, in order to inform that the voice dialing is a call, registration, change, deletion, etc., a predetermined specific number is input from the dial button 2. In the exchange, when the dial is received by the normal dial receiving trunk and reported to the control device 121, the control device 121 switches the call path switch of the call path unit 120 to the voice dial receiving trunk 122. After that, similarly to the first embodiment, the voice information and the terminal number are registered and the call is made by the voice information. In the above embodiment,
Although the application to the voice dial has been described, a similar voice information recognition method can be applied to a voice command to be operated by issuing an instruction to various devices from voice.

【００１０】[0010]

【発明の効果】本発明によれば、予め登録しておいた音
声情報と発呼時の音声情報を比較・判定するので、音声
情報であれば何でもよく、任意の言葉が使用でき、さら
に、音量の意味自体を認識する必要がないので、高価な
音声認識装置は不要で、経済的に音声ダイヤルが実現で
きる。さらに、同一音声端末から入力された音声情報を
まとまりとして比較するので、一致が行ないやすく誤ダ
イヤルを起こしにくい。また、本発明を交換機で実施す
る場合でも、他の音声端末の使用者が同じ言葉の音声情
報を登録、使用しても発端末番号により判別すれば全く
混同しない等、多くの利点がある。According to the present invention, since the voice information registered in advance and the voice information at the time of making a call are compared and judged, any voice information may be used, and arbitrary words can be used. Since it is not necessary to recognize the meaning of the volume itself, an expensive voice recognition device is unnecessary, and voice dialing can be realized economically. Furthermore, since the voice information input from the same voice terminal is compared as a group, it is easy to make a match and it is difficult to make an erroneous dial. Further, even when the present invention is carried out by the exchange, there are many advantages such that even if a user of another voice terminal registers and uses voice information of the same word, if it is discriminated by the calling terminal number, it will not be confused at all.

【００１１】[0011]

[Brief description of drawings]

【図１】本発明の第１の実施例における音声端末の概略
を示す構成図である。FIG. 1 is a configuration diagram showing an outline of a voice terminal in a first embodiment of the present invention.

【図２】従来の音声ダイヤル方式で用いられる単語音声
認識部の構成を示す図である。FIG. 2 is a diagram showing a configuration of a word voice recognition unit used in a conventional voice dial system.

【図３】本発明の第２の実施例における交換機の概略を
示す構成図である。FIG. 3 is a configuration diagram showing an outline of an exchange in a second embodiment of the present invention.

[Explanation of symbols]

１音声入力部２ダイヤルボタン３コーダ４−１機能スイッチ４−２機能スイッチ４−３機能スイッチ４−４機能スイッチ４−５機能スイッチ５音声情報登録メモリ６管理メモリ７端末番号登録メモリ８アドレスカウンタ９バッファ１０音声情報判定回路１１アドレス制御回路１２ゲート回路１３書き込みデータバッファ１４読み出しデータバッファ１５バッファ１６バッファ１７表示回路２１スペクトル分析部２２音素標準パターン部２３音素類似度計算部２４類似度行列部２５単語辞書部２６単語マッチング部２７単語判定部３１音声情報線３２音声情報線３３音声情報線３４音声情報線３５音声情報線３６登録音声情報線４１ダイヤル情報線４２ダイヤル情報線４３ダイヤル情報線４４端末番号情報線４５相手端末番号情報線５１アドレス線５２アドレス線５３アドレス線６１信号線６２信号線６３信号線１０３コーダ１０４−１機能スイッチ１０４−２機能スイッチ１０４−３機能スイッチ１０４−４機能スイッチ１０４−５機能スイッチ１０５音声情報登録メモリ１０６管理メモリ１０７端末番号登録メモリ１０８アドレスカウンタ１０９バッファ１１０音声情報判定回路１１１アドレス制御回路１１２ゲート回路１１３書き込みデータバッファ１１４読み出しデータバッファ１１５バッファ１１６バッファ１２０通話路部１２１制御装置１２２音声ダイヤル受信トランク１２３内部制御装置１２４スイッチ１２５スイッチ１３１音声情報線１３２音声情報線１３３音声情報線１３４音声情報線１３５音声情報線１３６登録音声情報線１４１ダイヤル情報線１４２ダイヤル情報線１４３ダイヤル情報線１４４端末番号情報線１４５相手端末番号情報線１５１アドレス線１５２アドレス線１５３アドレス線１６１信号線１６２信号線１６３信号線 1 Voice Input Section 2 Dial Button 3 Coder 4-1 Function Switch 4-2 Function Switch 4-3 Function Switch 4-4 Function Switch 4-5 Function Switch 5 Voice Information Registration Memory 6 Management Memory 7 Terminal Number Registration Memory 8 Address Counter 9 buffer 10 voice information determination circuit 11 address control circuit 12 gate circuit 13 write data buffer 14 read data buffer 15 buffer 16 buffer 17 display circuit 21 spectrum analysis section 22 phoneme standard pattern section 23 phoneme similarity calculation section 24 similarity matrix section 25 Word dictionary unit 26 Word matching unit 27 Word determination unit 31 Voice information line 32 Voice information line 33 Voice information line 34 Voice information line 35 Voice information line 36 Registered voice information line 41 Dial information line 42 Dial information line 43 Dial information line 4 terminal number information line 45 partner terminal number information line 51 address line 52 address line 53 address line 61 signal line 62 signal line 63 signal line 103 coder 104-1 function switch 104-2 function switch 104-3 function switch 104-4 function Switch 104-5 Function switch 105 Voice information registration memory 106 Management memory 107 Terminal number registration memory 108 Address counter 109 Buffer 110 Voice information determination circuit 111 Address control circuit 112 Gate circuit 113 Write data buffer 114 Read data buffer 115 Buffer 116 Buffer 120 Call Road part 121 Control device 122 Voice dial receiving trunk 123 Internal control device 124 Switch 125 Switch 131 Voice information line 132 Voice information line 133 Sound Voice information line 134 Voice information line 135 Voice information line 136 Registered voice information line 141 Dial information line 142 Dial information line 143 Dial information line 144 Terminal number information line 145 Partner terminal number information line 151 Address line 152 Address line 153 Address line 161 Signal Line 162 Signal line 163 Signal line

Claims

[Claims]

1. A method of designating a receiving terminal of a switching network accommodating a voice terminal, wherein voice information for identifying a communication partner terminal and a terminal number corresponding to the voice information are registered in advance in the voice terminal. At the time of making a call, the voice terminal compares the input voice information with the voice information registered in advance, determines the terminal number corresponding to the registered voice information that best matches the communication partner terminal number, A voice dial recognition method characterized by transmitting a terminal number to an exchange.

2. A method of designating a called terminal of a switching network accommodating a voice terminal, wherein voice information for identifying a communication partner terminal, which is input from the voice terminal in advance, and a terminal number corresponding to the voice information are registered in the exchange. Incidentally, at the time of making a call from the voice terminal, the exchange selects the registered voice information registered in advance from the call terminal based on the number for identifying the call terminal, and inputs it from the call terminal. The voice dial recognition method is characterized in that the terminal number corresponding to the registered voice information that most closely matches is determined as the communication partner terminal number and the connection operation is performed.