JP2000112493A

JP2000112493A - Speech recognizing device and communication device, and their control method

Info

Publication number: JP2000112493A
Application number: JP10281653A
Authority: JP
Inventors: Hiroshi Shinoda; 弘志信田; Naoki Sugawara; 尚樹菅原; Muneki Nakao; 宗樹中尾; Takeshi Toyama; 猛外山; Susumu Matsuzaki; 進松崎; Yasuhide Ueno; 康秀上野
Original assignee: Canon Inc
Current assignee: Canon Inc
Priority date: 1998-10-02
Filing date: 1998-10-02
Publication date: 2000-04-21

Abstract

PROBLEM TO BE SOLVED: To enable speech data to be surely registered without exerting a burden on a user by announcing the number of registration trial times of speech data at the time of registration of the speech data. SOLUTION: When the speech registration is first judged to be possible by comparing a speech registerable number with the registered number, for example, 3 is set in a counter. Namely, the speech input in speech registration has executed three times (S45, S46). When a user picks up a receiver according to the instruction of a screen, a guidance display and guide tones are generated (S47, S48). The user speaks his or her name to be registered toward the receiver in this state. When the speeches are sensed, the analysis of the inputted speeches is executed (S49, S50). When the speech analysis is normally executed, the value of the counter is decreased by 1 (S51, S52). Announcement is made that the speech analysis is normally executed and the display of the screen to announce the remaining required number of times of the occurrence is executed (S53, S54). When the speech registration of three times is ended, all of the speech data are registered in an unused speech registration region (S55).

Description

DETAILED DESCRIPTION OF THE INVENTION

【０００１】[0001]

【発明の属する技術分野】本発明は、音声認識装置及び
方法及び該音声認識装置を用いた通信装置及びその制御
方法に関するものである。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a speech recognition apparatus and method, a communication apparatus using the speech recognition apparatus, and a control method therefor.

【０００２】[0002]

【従来の技術】従来より、電話機やファクシミリ装置に
おいて、宛先を簡便にダイヤルするために、複数の宛先
の番号をワンタッチキーや短縮キーに登録しておく機能
がある。更には、宛先の略称や番号を登録しておいて、
それらを表示させながらその中から目的の宛先を選択す
る電話帳機能が搭載されて実用化されている。また、さ
らなる利便性向上のために音声認識機能を搭載し、宛先
のダイヤル番号を認識させたりすることが提案されてい
る。2. Description of the Related Art Conventionally, telephones and facsimile machines have a function of registering numbers of a plurality of destinations with one-touch keys or abbreviated keys in order to easily dial a destination. Furthermore, register the abbreviation and number of the destination,
A telephone directory function for selecting a desired destination from these while displaying them has been installed and put to practical use. Further, it has been proposed that a voice recognition function is mounted for further improving convenience so that a dial number of a destination is recognized.

【０００３】[0003]

【発明が解決しようとする課題】原理的に特定話者を前
提とした音声認識アルゴリズムによる音声データ識別で
は、所定の人間が登録しておいた音声データを照合認識
するためには該登録した本人が発声しないと正しく照合
されない場合が多い。また、認識のための音声入力時に
環境ノイズが混入して誤認識の原因になることもある。
さらには、登録した本人であるにも関わらず、体調を崩
すなどして発声すると誤認識されるケースもありうる。
この結果、オペレータが誤認識して選択された宛先ダイ
ヤルに気付かずに間違いＦＡＸや間違い電話をかけてし
まうことになる。これら誤認識を極力防止するために
は、認識時に照合検索する対象となる音声データの登録
が正しく行われている必要がある。音声データの登録の
条件が悪いと、音声認識時に認識精度が低下する。従っ
て音声データの登録時には、同じ音声データを登録する
ために複数回の登録試行を行って最も条件が良いと判断
されたデータを音声メモリに登録するのが一般的であ
る。In principle, in speech data identification by a speech recognition algorithm that presupposes a specific speaker, in order to collate and recognize speech data registered by a predetermined person, the registered person must be registered. Is often not correctly collated unless uttered. Also, when inputting speech for recognition, environmental noise may be mixed in and cause erroneous recognition.
Further, there may be a case where the user is erroneously recognized as uttering due to illness or the like, despite being the registered person.
As a result, the operator may mistakenly recognize the selected destination dial and make a mistaken fax or call. In order to prevent such erroneous recognition as much as possible, it is necessary that voice data to be collated and searched at the time of recognition be correctly registered. If the conditions for registering voice data are poor, the recognition accuracy will be reduced during voice recognition. Therefore, when registering voice data, it is common to perform a plurality of registration trials in order to register the same voice data, and register the data determined to have the best condition in the voice memory.

【０００４】しかしながら、ユーザに同じデータを複数
回発声させて登録する場合、あらかじめ何回登録試行す
れば良いのか知らされていない場合は、必要回数分の試
行を行なわずに登録操作を中断してしまったり、操作に
著しいストレスを感じたりする。However, when a user registers the same data by uttering the same data a plurality of times, if the user does not know in advance how many registration attempts should be made, the registration operation is interrupted without performing the necessary number of trials. The user feels excessive stress in operation.

【０００５】本発明は上記の問題に鑑みてなされたもの
であり、音声登録時に適切なガイダンスを行なうことに
より、ユーザに負担をかけずに、確実に音声データの登
録を行なえるようにすることを目的とする。SUMMARY OF THE INVENTION The present invention has been made in view of the above problems, and provides an appropriate guidance at the time of voice registration so that voice data can be reliably registered without burdening a user. With the goal.

【０００６】[0006]

【課題を解決するための手段】上記の目的を達成するた
めの本発明の一態様による音声認識装置は例えば以下の
構成を備える。すなわち、複数の入力音声に対応する音
声データを登録する音声データ登録手段と、前記音声デ
ータ記憶手段に記憶された音声データと入力音声とを照
合することにより音声認識を行なう音声認識手段と、前
記音声データ登録手段による音声データの登録時に、音
声データの登録試行回数を通知する通知手段とを備え
る。A speech recognition apparatus according to one embodiment of the present invention for achieving the above object has, for example, the following configuration. That is, voice data registration means for registering voice data corresponding to a plurality of input voices, voice recognition means for performing voice recognition by comparing voice data stored in the voice data storage means with input voice, And notifying means for notifying the number of registration attempts of the audio data when the audio data is registered by the audio data registering means.

【０００７】また、本発明の他の態様による音声認識装
置の制御方法は、例えば以下の工程を備える。すなわ
ち、複数の入力音声に対応する音声データを音声データ
メモリに登録する音声データ登録工程と、前記音声デー
タメモリに記憶された音声データと入力音声とを照合す
ることにより音声認識を行なう音声認識工程と、前記音
声データ登録工程における音声データの登録時に、音声
データの登録試行回数を通知する通知工程とを備える。A method for controlling a speech recognition device according to another aspect of the present invention includes, for example, the following steps. That is, a voice data registration step of registering voice data corresponding to a plurality of input voices in a voice data memory, and a voice recognition step of performing voice recognition by comparing voice data stored in the voice data memory with an input voice. And a notifying step of notifying the number of trial registrations of audio data when registering audio data in the audio data registration step.

【０００８】また、本発明の他の態様による通信装置
は、例えば以下の構成を備える。すなわち、回線に接続
して遠隔装置との通信を行なう通信装置であって、宛先
名と回線上の相手先アドレスとを対にして宛先メモリに
登録する宛先登録手段と、宛先名の入力音声に対応する
音声データを音声データメモリに登録する音声データ登
録手段と、前記音声データメモリに登録された音声デー
タと入力音声とを照合することにより音声認識を行なう
音声認識手段と、前記音声認識手段による認識結果に基
づいて宛先名を特定し、前記宛先メモリを検索して対応
する相手先アドレスを得る検索手段と、前記音声データ
登録手段による音声データの登録時に、音声データの登
録試行回数を通知する通知手段とを備える。A communication device according to another aspect of the present invention has, for example, the following configuration. That is, a communication device that communicates with a remote device by connecting to a line, a destination registration unit that registers a destination name and a destination address on the line in a destination memory, and a destination name input voice. Voice data registration means for registering corresponding voice data in a voice data memory; voice recognition means for performing voice recognition by comparing voice data registered in the voice data memory with input voice; Search means for specifying a destination name based on the recognition result, searching the destination memory to obtain a corresponding destination address, and notifying the number of voice data registration attempts when registering voice data by the voice data registration means. Notification means.

【０００９】更に、本発明の他の態様による通信装置の
制御方法は、例えば以下の工程を備える。すなわち回線
に接続して遠隔装置との通信を行なう通信装置の制御方
法であって、宛先名と回線上の相手先アドレスとを対に
して宛先メモリに登録する宛先登録工程と、宛先名の入
力音声に対応する音声データを音声データメモリに登録
する音声データ登録工程と、前記音声データメモリに登
録された音声データと入力音声とを照合することにより
音声認識を行なう音声認識工程と、前記音声認識工程に
よる認識結果に基づいて宛先名を特定し、前記宛先メモ
リを検索して対応する相手先アドレスを得る検索工程
と、前記音声データ登録工程における音声データの登録
時に、音声データの登録試行回数を通知する通知工程と
を備える。Further, a control method of a communication device according to another aspect of the present invention includes, for example, the following steps. That is, a method of controlling a communication device that communicates with a remote device by connecting to a line, a destination registration step of registering a destination name and a destination address on the line in a destination memory, and inputting the destination name A voice data registration step of registering voice data corresponding to voice in a voice data memory; a voice recognition step of performing voice recognition by comparing voice data registered in the voice data memory with an input voice; A search step of specifying a destination name based on the recognition result of the step and searching the destination memory to obtain a corresponding destination address; and, when registering the voice data in the voice data registration step, reducing the number of trial registrations of the voice data. Notification step of notifying.

【００１０】[0010]

【発明の実施の形態】以下、添付の図面を参照して本発
明の好適な一実施形態を説明する。Preferred embodiments of the present invention will be described below with reference to the accompanying drawings.

【００１１】本実施形態では、アナログ小電力方式のコ
ードレス電話機機能を搭載し、ＩＴＵ勧告Ｇ３規格を満
足する機能を有するファクシミリ装置に本発明を適用し
た例を説明する。なお、本実施形態のファクシミリ装置
は、カラー読み取り部とカラー記録部とを具備し、カラ
ーコピー機能やカラー画像の通信機能を有する。更に、
本実施形態のファクシミリ装置は、ファクシミリや通話
の目的で、記憶してある宛先略称を選択することで対応
する宛先番号に自動宛先ダイヤルを行う電話帳機能を有
する。特に、その電話帳機能を利用し、音声認識部によ
り音声を認識して得られた認識結果に基づいて自動宛先
ダイヤルをする機能も有している。In this embodiment, an example will be described in which the present invention is applied to a facsimile apparatus equipped with a cordless telephone function of an analog low power system and having a function satisfying the ITU recommendation G3 standard. Note that the facsimile apparatus of the present embodiment includes a color reading unit and a color recording unit, and has a color copy function and a color image communication function. Furthermore,
The facsimile apparatus according to the present embodiment has a telephone directory function for performing automatic destination dialing to a corresponding destination number by selecting a stored destination abbreviation for the purpose of facsimile or telephone conversation. In particular, it has a function of performing automatic destination dialing based on a recognition result obtained by recognizing a voice by a voice recognition unit using the telephone directory function.

【００１２】またカラー読み取り部は本体から着脱自在
なハンドスキャナユニットで構成されており、シート状
原稿の読み取りはもとより、ブック原稿なども読み取る
ことが可能である。スキャナの読み取りセンサは線順次
でＲＧＢ各色を出力する密着型のカラーセンサであっ
て、読み取り幅はＢ４幅、読み取り解像度は２００ｄｐ
ｉである。The color reading section is constituted by a hand scanner unit detachable from the main body, and is capable of reading not only sheet-shaped documents but also book documents. The reading sensor of the scanner is a contact type color sensor that outputs RGB colors in a line-sequential manner, and has a reading width of B4 and a reading resolution of 200 dp.
i.

【００１３】また、カラー記録部はインク吐出方式によ
って記録を行なうものであり、ＣＭＹＫ各色のインクタ
ンクとインク吐出部とが一体となったカラーカートリッ
ジと、黒インクのみのインクタンクとインク吐出部が一
体となったモノクロカートリッジのいずれか一方を装着
することにより３６０ｄｐｉの記録解像度で記録紙に２
値データを記録する。いずれのカートリッジを装着して
もモノクロ記録時の記録幅は最大Ｂ４幅であるが、カラ
ーカートリッジを装着した場合のカラー記録時には、そ
の記録幅が最大Ａ４幅となる。The color recording section performs recording by an ink discharge method. A color cartridge in which an ink tank of each color of CMYK and an ink discharge section are integrated, an ink tank of only black ink and an ink discharge section are provided. Attach one of the integrated monochrome cartridges to print on recording paper at a recording resolution of 360 dpi.
Record the value data. Regardless of which cartridge is mounted, the recording width for monochrome recording is a maximum B4 width, but for color recording when a color cartridge is mounted, the recording width is a maximum A4 width.

【００１４】通信時にはＧ３モードで最高９６００ｂｐ
ｓのモデム速度を有し、画像伝送の誤り再送機能である
ＥＣＭモードを具備している。以下、本実施形態のファ
クシミリ装置の構成、動作について説明する。At the time of communication, maximum 9600 bp in G3 mode
s modem speed and an ECM mode which is an error retransmission function for image transmission. Hereinafter, the configuration and operation of the facsimile apparatus of the present embodiment will be described.

【００１５】図１は実施形態によるファクシミリ装置の
システムブロック図である。図１において、1-1-1は本
装置の制御部であるＣＰＵ、1-1-21はプログラムや各種
固定データを格納するＲＯＭ、1-1-18は各種プログラム
のワークメモリ、留守録などの音声データおよびモノク
ロ画像・カラー画像データ用の蓄積メモリとして使用さ
れるＤＲＡＭである。ＤＲＡＭ1-1-18の容量は全体で２
ＭＢあり、内０．５ＭＢ分はワークメモリとして使用さ
れ、残りの１．５ＭＢは画像データ（送受信データ）の
蓄積と音声データの蓄積のために使用される。1-1-24は
システムに必要な登録データ（各種ソフトスイッチ、電
話帳データ、音声認識のための音声登録データ、装置の
電話番号、略称などの装置用ＩＤデータ）を登録記憶す
るためのＳＲＡＭであって、電源断によりデータが失わ
れないよう電池でバックアップされている。FIG. 1 is a system block diagram of a facsimile apparatus according to an embodiment. In FIG. 1, 1-1-1 is a CPU which is a control unit of the apparatus, 1-1-21 is a ROM for storing programs and various fixed data, 1-1-18 is a work memory of various programs, an answering machine, etc. Used as a storage memory for audio data and monochrome image / color image data. Total capacity of DRAM1-1-18 is 2
There are MBs, of which 0.5 MB is used as a work memory, and the remaining 1.5 MB is used for storing image data (transmitted / received data) and audio data. 1-1-24 is an SRAM for registering and storing registration data necessary for the system (various software switches, telephone directory data, voice registration data for voice recognition, device ID data such as device telephone number, abbreviation, etc.). The battery is backed up so that data is not lost due to power interruption.

【００１６】1-1-2はＩＴＵ勧告Ｇ３モードに必要な機
能を持つ公知のファクシミリ用モデム（本実施形態で
は、V.29，V.21，V27terのほかにＤＴＭＦ認識機能、Ｄ
ＲＡＭ1-1-18への音声録音・ＤＲＡＭ1-1-18等からの音
声再生のための音声コーデック機能も具備するタイプを
使用）である。また、1-1-3は子機電話1-1-5とアナログ
小電力方式の無線通信を行うベースユニットであって、
ＣＰＵ1-1-1からの指示により無線通信を制御する公知
のユニットである。1-1-4はアナログ信号用接続スイッ
チ（クロスポイントスイッチ）であって、ＣＰＵ1-1-1
からの設定によりハンドセット1-1-6、マイク1-1-7、ス
ピーカ1-1-8、ＮＣＵ1-1-9、ＣＰＵ1-1-1の音声入力端
子、モデム1-1-2およびベースユニット1-1-3との間のア
ナログ信号の接続を自在に切り替える公知の回路であ
る。1-1-2 is a well-known facsimile modem having functions required for the ITU recommendation G3 mode (in this embodiment, in addition to V.29, V.21, V27ter, a DTMF recognition function,
A type that also has a voice codec function for voice recording in the RAM 1-1-18 and voice reproduction from the DRAM 1-1-18) is used. In addition, 1-1-3 is a base unit that performs wireless communication of the slave phone 1-1-5 with the analog low power system,
This is a known unit that controls wireless communication according to instructions from the CPU 1-1-1. 1-1-4 is a connection switch (cross point switch) for analog signals, and is a CPU 1-1-1.
1-1-6, microphone 1-1-7, speaker 1-1-8, NCU 1-1-9, voice input terminal of CPU 1-1-1, modem 1-1-2 and base unit This is a well-known circuit that freely switches connection of analog signals between 1-1 and 1-3.

【００１７】1-1-6は、装置本体の電話機ハンドセッ
ト、1-1-7は音声入力用のマイク、1-1-8はスピーカ、1-
1-9は回線とインタフェースするための公知のＮＣＵで
ある。1-1-6 is a telephone handset of the apparatus body, 1-1-7 is a microphone for voice input, 1-1-8 is a speaker, 1-
1-9 is a known NCU for interfacing with a line.

【００１８】1-1-10はモノクロ２値生画像データからラ
ンレングス符号を生成し、またはランレングス符号を入
力してモノクロ２値生画像データを出力する公知のラン
レングス回路である。1-1-11は公知の時計ＩＣ（ＲＴ
Ｃ）である。、また、1-1-25はハンドスキャナユニット
（ＨＳＵ）であり、画像読み取りのためのカラーコンタ
クトセンサ1-1-12と原稿上を移動した距離を測定するた
めのロータリーエンコーダ1-1-16が内蔵されている。ス
キャナユニット1-1-16は装置本体とカールコードにより
接続されて着脱自在な構成になっている。1-1-13はエン
コーダ検出部であり、エンコーダ1-1-16の出力信号から
移動距離データ（ロータリーエンコーダの回転数情報）
を生成してＣＰＵ1-1-1に通知する。Reference numeral 1-1-10 denotes a known run-length circuit which generates a run-length code from the monochrome binary raw image data or inputs the run-length code and outputs the monochrome binary raw image data. 1-1-11 is a known clock IC (RT
C). Reference numeral 1-1-25 denotes a hand scanner unit (HSU), which is a color contact sensor 1-1-12 for reading an image and a rotary encoder 1-1-16 for measuring a distance moved on a document. Is built-in. The scanner unit 1-1-16 is connected to the apparatus main body by a curl cord, and has a detachable configuration. 1-1-13 is an encoder detection unit, which moves distance data (rotational encoder rotation information) from the output signal of encoder 1-1-16
Is generated and notified to the CPU 1-1-1.

【００１９】1-1-14は画像処理部であって、ＣＰＵ1-1-
1の指示により、カラー生画像データをＤＲＡＭ1-1-18
に蓄積する場合は、カラーコンタクトセンサ1-1-12から
出力されたアナログＲＧＢ信号（線順次で解像度２００
ｄｐｉ）を入力して、ＲＧＢ各８ビット（１画素あたり
では２４ビット）の９０ｄｐｉデジタルデータに変換
し、得られた変換データをＤＭＡコントローラ1-1-17に
供給する。また、カラーコピーをダイレクトで行う場合
であれば、カラーコンタクトセンサ1-1-12から出力され
たアナログＲＧＢ信号（線順次で解像度２００ｄｐｉ）
を入力して３６０ｄｐｉ、２値ＹＭＣＫのデジタルデー
タに変換し、得られた変換データをＤＭＡコントローラ
1-1-17に供給する。Reference numeral 1-1-14 denotes an image processing unit, and the CPU 1-1-
According to the instruction of 1, the color raw image data is
If the analog RGB signals output from the color contact sensors 1-1-12 (line-sequential
dpi) is input, and converted into 90 dpi digital data of 8 bits for each of RGB (24 bits per pixel), and the obtained converted data is supplied to the DMA controller 1-1-17. In the case of performing color copying directly, an analog RGB signal output from the color contact sensor 1-1-12 (line sequential resolution 200 dpi)
And converts the data into 360 dpi, binary YMCK digital data, and converts the obtained converted data into a DMA controller.
Supply to 1-1-17.

【００２０】ＤＭＡコントローラ1-1-17は、カラー生画
像データ（９０ｄｐｉ、ＲＧＢ）をＤＲＡＭ1-1-18に蓄
積する場合にはＤＲＡＭ1-1-18にカラー生画像データを
転送し、カラーコピーをダイレクトで行う場合には記録
制御部1-1-19に３６０ｄｐｉの２値ＹＭＣＫデータを転
送する。When the color raw image data (90 dpi, RGB) is stored in the DRAM 1-1-18, the DMA controller 1-1-17 transfers the color raw image data to the DRAM 1-1-18 and performs color copying. In the case of direct recording, binary YMCK data of 360 dpi is transferred to the recording control unit 1-1-19.

【００２１】モノクロ生画像データをＤＲＡＭ1-1-18に
蓄積する場合には、画像処理部1-1-14は、カラーコンタ
クトセンサ1-1-12から出力されたアナログＲＧＢ信号
（線順次２００ｄｐｉ）を入力して、モノクロ２値の２
００ｄｐｉデジタルデータに変換し、得られた変換デー
タをＤＭＡコントローラ1-1-17に供給する。When the monochrome raw image data is stored in the DRAM 1-1-18, the image processing section 1-1-14 outputs an analog RGB signal (line-sequential 200 dpi) output from the color contact sensor 1-1-12. And enter the monochrome binary 2
The data is converted to 00 dpi digital data, and the obtained converted data is supplied to the DMA controller 1-1-17.

【００２２】また、モノクロコピーをダイレクトで行う
場合は、カラーコンタクトセンサ1-1-12から出力された
アナログＲＧＢ信号（線順次２００ｄｐｉ）を入力し
て、モノクロ２値の３６０ｄｐｉデジタルデータに変換
し、ＤＭＡコントローラ1-1-17に供給する。When monochrome copying is performed directly, an analog RGB signal (line sequential 200 dpi) output from the color contact sensor 1-1-12 is input and converted into monochrome binary 360 dpi digital data. This is supplied to the DMA controller 1-1-17.

【００２３】そして、ＤＭＡコントローラ1-1-17は、モ
ノクロ生画像データをＤＲＡＭに蓄積する場合にはＤＲ
ＡＭ1-1-18にモノクロ２値画像データを転送し、モノク
ロコピーをダイレクトで行う場合には記録制御部1-1-19
にモノクロ２値画像データを転送する。When storing the monochrome raw image data in the DRAM, the DMA controller
When the monochrome binary image data is transferred to AM1-1-18 and monochrome copying is performed directly, the recording control unit 1-1-19
And the monochrome binary image data.

【００２４】更に、上記動作について補足すると、（１）本体にスキャナを装着した状態でのコピーはダイ
レクトモードとなり、読み取りデータをＤＲＡＭ1-1-18
へ1ページ分のデータを蓄積（以下、ページ蓄積）する
ことなく、シート状原稿を読み取りながら記録部に印字
する。ＤＲＡＭ1-1-18へのページ蓄積が不要のために高
解像度でもメモリオーバーフローしない理由から、解像
度は記録部1-1-20に整合させてあり、読み取りデータは
副走査方向360dpiとなる。また、コピーサイズもモノク
ロ時はＢ４幅まで、カラー時は記録部1-1-20の仕様でＡ
４幅までとなる。Further, the above operation is supplemented by the following. (1) Copying with the scanner mounted on the main body is in the direct mode, and the read data is transferred to the DRAM 1-1-18.
The sheet-shaped document is read and printed on the recording unit without storing one page of data (hereinafter, page storage). Because the page does not need to be stored in the DRAM 1-1-18 and the memory does not overflow even at a high resolution, the resolution is matched to the recording unit 1-1-20, and the read data is 360 dpi in the sub-scanning direction. Also, the copy size is up to B4 width in monochrome, and A
Up to 4 widths.

【００２５】（２）本体からスキャナを取り出してハン
ドスキャナとしてコピー使用する場合は、メモリモード
となり、読み取りデータは必ずＤＲＡＭ1-1-18にページ
蓄積される。これは、記録部1-1-20の印字速度が人間が
ストレス無くハンドスキャンを実行する速度に対して遅
いための処置であり、一旦ＤＲＡＭ1-1-18にページ蓄積
することによってハンドスキャンを高速で実行終了でき
るよう構成されている。また、読み取り解像度を記録部
1-1-20に整合させて３６０ｄｐｉとすると１ページのデ
ータ容量が大きすぎてＤＲＡＭ1-1-18を占有してしま
い、装置動作に支障をきたす、もしくはＤＲＡＭ1-1-18
の容量を上げるとコストがかかる等の理由から、読み取
り解像度はモノクロ時では２００ｄｐｉ、カラー時では
９０ｄｐｉにおさえてある。(2) When the scanner is taken out from the main body and used for copying as a hand scanner, the mode is the memory mode, and the read data is always stored in the DRAM 1-1-18 as a page. This is because the printing speed of the recording unit 1-1-20 is slower than the speed at which a human performs a hand scan without stress, and the speed of the hand scan is increased by temporarily storing pages in the DRAM 1-1-18. It is configured to be able to end execution. In addition, the reading resolution
If it is set to 360 dpi by matching with 1-1-20, the data capacity of one page is too large to occupy the DRAM 1-1-18, which hinders the operation of the device or the DRAM 1-1-18.
The read resolution is limited to 200 dpi for monochrome and 90 dpi for color, for example, because the cost increases if the capacity is increased.

【００２６】また、同様の理由から、モノクロ時のコピ
ーサイズはＢ４幅までであるが、カラー時のコピーサイ
ズはＡ６（もしくは官製はがきサイズ）以下に制限して
ある。ちなみにモノクロ２００ｄｐｉでＢ４サイズ１ペ
ージ分のデータ容量は約７００ＫＢ、カラー９０ｄｐｉ
でＡ６サイズの１ページ分のデータ容量は約６００ＫＢ
となり、ＤＲＡＭ1-1-18に蓄積可能である。For the same reason, the copy size in monochrome is up to B4 width, but the copy size in color is limited to A6 (or government postcard size) or less. By the way, the data capacity of one page of B4 size is about 700 KB in monochrome 200 dpi and color 90 dpi in color.
And the data capacity of one page of A6 size is about 600KB
And can be stored in the DRAM 1-1-18.

【００２７】データを圧縮符号化してＤＲＡＭ1-1-18に
蓄積すればサイズ・解像度をもっと上げることは出来る
が、後に述べるように圧縮符号化はソフトウエアで行う
ために時間がかかりハンドスキャンの実行速度が下がっ
てしまうというデメリットがある。また、高速の圧縮符
号化のためにハードウエアを追加することも考えられる
が、大幅なコスト増をまねく。The size and resolution can be further increased by compressing and encoding the data and storing the data in the DRAM 1-1-18. However, as will be described later, since the compression encoding is performed by software, it takes a long time to execute the hand scan. There is a disadvantage that the speed decreases. It is also conceivable to add hardware for high-speed compression encoding, but this will result in a significant increase in cost.

【００２８】（３）本体にスキャナを装着した状態での
原稿送信はダイレクトモードとなり、読み取りデータを
ページ蓄積することなくＤＲＡＭ1-1-18経由でモデムに
転送して、シート状原稿を読み取りながら相手ファクシ
ミリに送信してゆく。(3) The original transmission with the scanner mounted on the main body is in the direct mode, and the read data is transferred to the modem via the DRAM 1-1-18 without accumulating the pages, and the other side is read while reading the sheet original. Send to facsimile.

【００２９】（４）本体からスキャナを取り出してハン
ドスキャナとして使用し、原稿送信する場合は、メモリ
モードとなり、読み取りデータは必ずＤＲＡＭ1-1-18に
ページ蓄積される。これは、モデム1-1-2の通信速度
が、人間がストレス無くハンドスキャンを実行する速度
に対して遅いための処置であり、一旦ＤＲＡＭ1-1-18に
ページ蓄積することによってハンドスキャンを高速で実
行終了できるよう構成されている。読み取り解像度やサ
イズに関しては上記の項目（２）と同じ理由で同様の仕
様としている。(4) When the scanner is taken out from the main body and used as a hand scanner to transmit a document, the mode is the memory mode, and the read data is always stored in the page in the DRAM 1-1-18. This is because the communication speed of the modem 1-1-2 is slower than the speed at which a human performs a hand scan without stress, and the speed of the hand scan is increased by temporarily storing pages in the DRAM 1-1-18. It is configured to be able to end execution. The reading resolution and size have the same specifications for the same reason as the item (2).

【００３０】1-1-19は記録制御部であり、記録部1-1-20
が記録可能なデータ形式に入力データを変換する。1-1-
20は記録部であり、インク吐出方式の公知のカラープリ
ンタで構成され、インクタンクとインク吐出部が一体に
なっているカートリッジを記録紙の主走査方向に移動さ
せながら画像を記録してゆく。カラーカートリッジとモ
ノクロカートリッジのいずれが装着されているかは、記
録部1-1-20とカートリッジの電気的接点の構成によりＣ
ＰＵ1-1-1が判別できるようになっている。Reference numeral 1-1-19 denotes a recording control unit.
Converts the input data into a recordable data format. 1-1-
Reference numeral 20 denotes a recording unit, which is formed by a known color printer of an ink ejection system, and records an image while moving a cartridge in which an ink tank and an ink ejection unit are integrated in a main scanning direction of recording paper. Whether the color cartridge or the monochrome cartridge is mounted is determined by the configuration of the electrical contact between the recording unit 1-1-20 and the cartridge.
PU1-1-1 can be determined.

【００３１】1-1-15は解像度変換部であり、モノクロ２
値画像データを入力して解像度変換を行う公知の回路で
構成される。解像度変換部1-1-15は、モノクロ画像の拡
大・縮小などのために使用されるとともに、記録部1-1-
20の解像度に対して回線から受信したモノクロ画像の解
像度を整合させる用途にも使用される。Reference numeral 1-1-15 denotes a resolution converter, which is a monochrome 2
It is composed of a known circuit that inputs value image data and performs resolution conversion. The resolution conversion section 1-1-15 is used for enlarging / reducing a monochrome image, and the recording section 1-1-
It is also used to match the resolution of a monochrome image received from the line with a resolution of 20.

【００３２】1-1-22は操作パネルであり、本ファクシミ
リ装置の各種操作入力及び表示出力を行なう。操作パネ
ル1-1-22は、図３に示すような各種キーや表示ランプ、
およびＬＣＤディスプレイを具備する。また、機構とし
ては、マイク1-1-7、スピーカ1-1-8が配備されている。Reference numeral 1-1-22 denotes an operation panel for performing various operation inputs and display outputs of the facsimile apparatus. The operation panel 1-1-22 includes various keys and display lamps as shown in FIG.
And an LCD display. In addition, microphones 1-1-7 and speakers 1-1-8 are provided as mechanisms.

【００３３】1-1-26はハンドスキャナ着脱センサであ
り、ハンドスキャナユニット1-1-25と装置本体との着脱
状態を検出する。ハンドスキャナ着脱センサ1-1-26の出
力によって、ＣＰＵ1-1-1はシート状原稿を読み取るダ
イレクトモードであるか、ブック原稿など立体物を読み
取るメモリモードであるかを決定する。Reference numeral 1-1-26 denotes a hand scanner attachment / detachment sensor which detects the attachment / detachment state between the hand scanner unit 1-1-25 and the apparatus main body. Based on the output of the hand scanner attachment / detachment sensor 1-1-26, the CPU 1-1-1 determines whether it is in the direct mode for reading a sheet document or the memory mode for reading a three-dimensional object such as a book document.

【００３４】図３は、本実施形態のファクシミリ装置の
概観を示し、特に操作パネル（図１の1-1-22）の構成を
示す図である。FIG. 3 is a diagram showing an overview of the facsimile apparatus of the present embodiment, and particularly showing a configuration of an operation panel (1-1-22 in FIG. 1).

【００３５】図３において、1-3-1は本体用ハンドセッ
ト（図１の1-1-6と同一）である。1-3-2は各種登録や設
定のための「機能キー」、1-3-3は伝言や通話内容を音
声データとしてＤＲＡＭ1-1-18に録音するとき使用する
「録音キー」、1-3-4はモデム1-1-2の音声コーデックに
よってＣＰＵ1-1-1がＤＲＡＭ1-1-18に録音した音声デ
ータを再生するとき使用する「再生キー」、1-3-5はメ
モリに格納された各種データを消去する際に使用する
「消去キー」である。In FIG. 3, reference numeral 1-3-1 denotes a main body handset (same as 1-1-6 in FIG. 1). 1-3-2 is a "function key" for various registrations and settings, 1-3-3 is a "recording key" used when recording messages and conversation contents as voice data in DRAM 1-1-18, 1- 3-4 is a "play key" used by CPU 1-1-1 to play back audio data recorded in DRAM 1-1-18 by the voice codec of modem 1-1-2, and 1-3-5 is stored in memory This is an "erase key" used when erasing various types of data.

【００３６】1-3-6は１６文字分のキャラクタを２行表
示できるバックライト付ＬＣＤであって、装置の状態や
各種メッセージを出力するために使われる。1-3-7は
「カラーＬＥＤ」で、1-3-8の「カラー/白黒キー」でカ
ラーモードが選択されると点灯する。1-3-9はモノクロ
モードでの画質を選択するための「画質キー」である。1-3-6 is an LCD with a backlight capable of displaying 16 characters of two lines, and is used to output the status of the apparatus and various messages. 1-3-7 is a "color LED", which is turned on when a color mode is selected with the "color / monochrome key" of 1-3-8. 1-3-9 is an "image quality key" for selecting the image quality in the monochrome mode.

【００３７】1-3-10は電話帳を呼び出すための「電話帳
キー」と、登録などの内容を確定させるための「セット
キー」を兼用するキーであって、簡単のために以降「セ
ットキー」と呼ぶことにする。1-3-11は「上カーソルキ
ー」、1-3-12は「下カーソルキー」、1-3-13は「左カー
ソルキー」、1-3-14は「右カーソルキー」である。いず
れも表示制御の操作で使用される。1-3-10 is a key which also serves as a "phone book key" for calling the phone book and a "set key" for confirming the contents of registration and the like. Key. 1-3-11 is an "up cursor key", 1-3-12 is a "down cursor key", 1-3-13 is a "left cursor key", and 1-3-14 is a "right cursor key". Both are used for display control operations.

【００３８】1-3-15はファクシミリ送信のための「送信
キー」、1-3-16はファクシミリ受信および受信画像をプ
リントするための「受信／プリントキー」である。1-3-
17はコピーを実行するための「コピーキー」、1-3-18は
実行中の装置動作を中断するための「ストップキー」で
ある。1-3-19はマイク1-1-7の開口部、1-3-20はテンキ
ー部、1-3-21は回線を接続したままマイク1-1-7をアク
ティブにして回線上に音声出力し、また、回線上の音を
スピーカ1-1-8に出力する状態にするための「スピーカ
ホンキー」である。1-3-15 is a "transmission key" for facsimile transmission, and 1-3-16 is a "reception / print key" for facsimile reception and printing of a received image. 1-3-
Reference numeral 17 denotes a "copy key" for executing copying, and 1-3-18 denotes a "stop key" for interrupting the operation of the apparatus being executed. 1-3-19 is the opening of the microphone 1-1-7, 1-3-20 is the numeric keypad, and 1-3-21 is the microphone on the line activated by activating the microphone 1-1-7 with the line connected. This is a "speaker phone key" for outputting a sound on the line to the speaker 1-1-8.

【００３９】1-3-22は音声認識によって宛先自動ダイヤ
ルを実行するための「音声認識キー」、1-3-23は着信し
た電話の相手通話内容を自動的にＤＲＡＭ1-1-18に録音
するための「留守キー」、1-3-24は本体からコードレス
子機電話を呼び出して内線通話を実行するための「子機
キー」、1-3-25は通話状態を保留したまま相手にメロデ
ィを送出する「保留キー」、1-3-26は前回かけた相手の
電話番号を自動的にダイヤルするための「リダイヤルキ
ー」、1-3-27は外線通話中にキャッチホンがはいったと
き、キャッチホンの呼に通話を切り替えるためと、切り
替えたキャッチホン通話から元の通話に切り替えるため
に使用する「キャッチキー」である。1-3-22 is a "speech recognition key" for executing automatic destination dialing by voice recognition, and 1-3-23 is a function of automatically recording the contents of the other party of the incoming call in the DRAM 1-1-18. "Answer key" to call, 1-3-24 is a cordless handset key to call a cordless handset phone from the main unit and execute an extension call, 1-3-25 is to the other party while holding the call state "Hold key" to send out a melody, 1-3-26 is "Redial key" to automatically dial the telephone number of the other party you called last, and 1-3-27 is when call waiting is activated during an outside call The "catch key" is used to switch the call to the call of the call waiting and to switch the call from the switched call waiting to the original call.

【００４０】次に、本実施形態のファクシミリ装置の制
御について説明する。図２は、本実施形態のファクシミ
リ装置におけるタスク構成を示すブロック図である。本
装置のソフトウエアは、マルチタスクＯＳ1-2-12によっ
て、各タスクが同時に平行して動作できる環境を与えら
れている。以下、各タスクについて説明してゆく。Next, control of the facsimile apparatus of this embodiment will be described. FIG. 2 is a block diagram showing a task configuration in the facsimile apparatus of the present embodiment. The software of this apparatus is provided with an environment in which each task can be operated simultaneously in parallel by the multitask OS1-2-12. Hereinafter, each task will be described.

【００４１】1-2-1は状態監視タスクで、装置内に発生
する各イベントを監視し、装置状態を変化させる必要の
あるイベントの発生を検出すると、必要なタスクにその
情報を通知する機能を持つ。たとえば、操作パネルから
のキー情報を検出すると、オペレートタスク1-2-2や回
線制御タスク1-2-3に必要なキー情報を伝達して、装置
の機能を起動する。1-2-1 is a status monitoring task, which monitors each event occurring in the apparatus and, when detecting the occurrence of an event that needs to change the state of the apparatus, notifies the necessary task of the information. have. For example, when key information from the operation panel is detected, necessary key information is transmitted to the operation task 1-2-2 and the line control task 1-2-3 to activate the function of the apparatus.

【００４２】1-2-2はオペレートタスクで、状態監視タ
スク1-2-1からのキーコードＡ情報を受けて動作モード
を判定し、必要な機能を実行するタスクにスタートコマ
ンドを発行するとともに、キーコードＡ情報に基づい
て、操作パネル1-1-22上の表示機能を制御する。1-2-2 is an operating task, which receives the key code A information from the status monitoring task 1-2-1 to determine an operation mode, issues a start command to a task for executing a necessary function, and , The display function on the operation panel 1-1-22 is controlled based on the key code A information.

【００４３】1-2-3は回線制御タスクであり、ＮＣＵを
制御して、回線からの着呼を受け付けたり、オペレート
タスク1-2-2からのダイヤル要求コマンドに応じてダイ
ヤル信号を送出するために回線を捕捉したり、回線断を
実行して通信を終了させたりするシーケンスを実行す
る。また、回線接続状態における状態監視タスク1-2-1
からのキーコードＢ情報で「送信キー」や「受信/プリ
ントキー」を検出するか、発信元の相手が電話かファク
シミリかを回線上の信号を分析することにより自動判別
して、ファクシミリならば通信タスク1-2-5にファクシ
ミリ通信のための通信スタートコマンドを発行したりす
るものである。A line control task 1-2-3 controls the NCU to accept an incoming call from the line or to transmit a dial signal in response to a dial request command from the operation task 1-2-2. For this purpose, a sequence for capturing a line or executing a line disconnection to terminate communication is executed. Also, the status monitoring task 1-2-1 in the line connection status
Detects "Send key" or "Receive / Print key" with key code B information from or automatically determines whether the caller is a telephone or facsimile by analyzing the signal on the line. The communication task issues a communication start command for facsimile communication to the communication task 1-2-5.

【００４４】1-2-4はダイヤル制御タスクであり、回線
制御タスク1-2-3あるいは通信タスク1-2-5からのダイヤ
ルスタートコマンドに応じて各種ダイヤル信号を交換機
に送出する機能を持つ。1-2-5は通信タスクであり、回
線制御タスク1-2-3からの通信スタートコマンドによっ
て各種ファクシミリ通信（通信手順の実行や画像データ
伝送）を実行する。A dial control task 1-2-4 has a function of transmitting various dial signals to the exchange in response to a dial start command from the line control task 1-2-3 or the communication task 1-2-5. . 1-2-5 is a communication task, and executes various facsimile communication (execution of communication procedure and transmission of image data) by a communication start command from the line control task 1-2-3.

【００４５】1-2-6は読み取りタスクであり、オペレー
トタスク1-2-2からの読み取りスタートコマンドに応じ
て、ハンドスキャナユニット1-1-25と画像処理部1-1-14
を制御し、原稿の読み取りを実行する。1-2-7は符号・
復号タスクであり、通信タスク1-2-5、読み取りタスク1
-2-6、記録タスク1-2-9からの各種符号復号スタートコ
マンドに応じて、画像データの符号化、復号化の処理を
ソフトウエアで実行する。このため本装置は、符号復号
化のためのハードウエアコストを大きく削減している
（ただしハードウエアでの実施に比べて処理時間はかか
る）。なお、モノクロ画像には公知のＭＨ符号を適用
し、カラー画像に関してはＲＧＢ多値ＤＰＣＭ方式（Ｒ
ＧＢの各８ビット値で隣接画素間の差分値を計算する方
式）にハフマン符号を割り当てた公知の符号化方式を適
用する。Reference numeral 1-2-6 denotes a reading task. In response to a reading start command from the operating task 1-2-2, the hand scanner unit 1-1-25 and the image processing unit 1-1-14 are read.
To read the original. 1-2-7 is a sign
It is a decryption task, communication task 1-2-5, read task 1
-2-6, In accordance with various code decoding start commands from the recording task 1-2-9, image data encoding and decoding processes are executed by software. For this reason, the present apparatus greatly reduces hardware costs for code decoding (however, processing time is longer than that of hardware). A known MH code is applied to a monochrome image, and an RGB multilevel DPCM method (R
A known encoding method in which a Huffman code is assigned is applied to (a method of calculating a difference value between adjacent pixels with each 8-bit value of GB).

【００４６】1-2-8は音声認識タスクであり、ハンドセ
ット1-3-1から入力された使用者の発声音声を分析し
て、あらかじめ登録されてある音声データ（複数可）と
比較し一致するものを検出し通知する、公知の音声認識
アルゴリズムを含むソフトウエアである。音声認識タス
ク1-2-8は、オペレートタスク1-2-2からの音声認識スタ
ートコマンドにより起動される。また、音声認識タスク
1-2-8の終了は、音声認識タスク1-2-8自身によるもので
あり、音声トレーニング（音声登録）および音声認識の
結果を音声認識結果コマンドとして、オペレートタスク
1-2-2に返送した後に終了する。A voice recognition task 1-2-8 analyzes a user's voice input from the handset 1-3-1 and compares it with pre-registered voice data (a plurality of voice data). Software that includes a well-known voice recognition algorithm that detects and notifies the user of what to do. The speech recognition task 1-2-8 is started by a speech recognition start command from the operation task 1-2-2. Also, voice recognition tasks
The end of 1-2-8 is performed by the voice recognition task 1-2-8 itself, and the result of voice training (voice registration) and voice recognition is used as a voice recognition result command to operate the task.
It ends after returning to 1-2-2.

【００４７】1-2-9は記録タスクであり、レポートタス
ク1-2-10やプリントタスク1-2-11からの記録スタートコ
マンドに応じて、要求された画像データを記録部1-1-20
で印字させる機能をもつ。1-2-10はレポートタスクであ
り、通信履歴が記録される通信管理レポートやＳＲＡＭ
1-1-24の登録情報などの機能設定リストをキャラクタデ
ータで作成し、それを画像データに展開して記録タスク
1-2-9に記録依頼する、各種レポート作成用のソフトウ
エアである。1-2-11はプリントタスクであり、常時、自
動的に記録する必要のある画像データがＤＲＡＭ1-1-18
に蓄積されていないかをチェックしており、記録する必
要のある画像データを検出すると、記録スタートコマン
ドを記録タスクに発行する監視機能を持つ。Reference numeral 1-2-9 denotes a recording task, which stores requested image data in response to a recording start command from the report task 1-2-10 or the print task 1-2-11. 20
It has a function to print with. 1-2-10 is a report task, such as a communication management report or SRAM that records the communication history.
Create a function setting list such as registration information of 1-1-24 with character data, develop it into image data and record it
This is the software for creating various reports, requesting recording from 1-2-9. 1-2-11 is a print task in which image data that needs to be automatically recorded at all times is stored in the DRAM 1-1-18.
It has a monitoring function of issuing a recording start command to a recording task when image data that needs to be recorded is detected.

【００４８】以上のような構成を備えた、本実施形態の
ファクシミリ装置における、電話帳機能、音声登録機能
及び音声認識機能について説明する。図４は、本実施形
態による電話帳機能、音声登録及び音声認識に関わる制
御の概要を説明する図である。The telephone directory function, voice registration function, and voice recognition function in the facsimile apparatus of the present embodiment having the above configuration will be described. FIG. 4 is a diagram illustrating an outline of control relating to the telephone directory function, voice registration, and voice recognition according to the present embodiment.

【００４９】（１）電話帳機能相手先略称と相手先電話番号を対応させて電話帳データ
（図４の4-2）として登録しておき、所望の相手先略称
を検索し、検索された相手先略称に対応する相手先電話
番号でダイヤルを行なう。本実施形態では、相手先の検
索に際して、電話帳データに登録された相手先略称を順次（たとえ
ば、あいうえお順）ＬＣＤ表示器1-3-6に表示すること
により所望の相手先を選択し、必要な相手先番号を得る
方法と、相手先略称を音声登録（以下の（２）項参照）してお
き、相手先略称を音声によって入力し、これを音声認識
（以下の（３）項参照）することで得られた相手先略称
で電話帳データを検索して、対応する相手先番号を得る
方法とを用いることができる。(1) Telephone Directory Function The destination abbreviation and the destination telephone number are associated with each other and registered as telephone directory data (4-2 in FIG. 4), and a desired destination abbreviation is searched for. Dial with the destination telephone number corresponding to the destination abbreviation. In the present embodiment, when searching for a destination, a desired destination is selected by sequentially displaying the destination abbreviations registered in the telephone directory data on the LCD display 1-3-6 (for example, in order of alphabetical order). The method of obtaining the required destination number and the destination abbreviation are registered by voice (see item (2) below), the destination abbreviation is input by voice, and this is recognized by voice (see item (3) below). ), The telephone directory data is searched with the destination abbreviation obtained, and a method of obtaining the corresponding destination number can be used.

【００５０】（２）音声登録音声認識の為の比較分析対象データを、ＳＲＡＭ1-1-24
内の音声登録メモリに作成する。ハンドセット1-1-6、
マイク1-1-7から入力された音声を、或いはＮＣＵ1-1-9
を通じて回線から、もしくはベースユニット1-1-3を通
じて子機1-1-5から入力された音声をクロスポイントＳ
Ｗ1-1-4を通して音声データとしてＣＰＵ1-1-1に入力す
る。ＣＰＵ1-1-1では、オペレートタスク1-2-2及び音声
認識タスク1-2-8を実行することにより、当該音声デー
タに種々の演算を施し、音声認識の為のデータを作成
し、ＳＲＡＭ1-1-24内の音声登録メモリ4-1に登録す
る。なお、このときの結果ＯＫ／ＮＧを音声認識結果コ
マンドとして、オペレートタスク1-2-2に返送する。(2) Voice Registration The data to be compared and analyzed for voice recognition is stored in the SRAM1-1-24.
It is created in the voice registration memory inside. Handset 1-1-6,
Voice input from microphone 1-1-7 or NCU 1-1-9
Through the line or through the base unit 1-1-3 from the slave unit 1-1-5
The data is input to CPU 1-1-1 as audio data through W1-1-4. The CPU 1-1-1 executes the operation task 1-2-2 and the voice recognition task 1-2-8 to perform various operations on the voice data to create data for voice recognition, and Register in the voice registration memory 4-1 in -1-24. The result OK / NG at this time is returned to the operating task 1-2-2 as a speech recognition result command.

【００５１】ここで、オペレートタスク1-2-2は、音声
認識結果コマンドとして「ＯＫ」を受け取ると、電話帳
データにおける相手先略称と、その音声認識用データと
を対応付ける（詳細は後述する）。Here, when the operation task 1-2-2 receives "OK" as the voice recognition result command, it associates the destination abbreviation in the telephone directory data with the voice recognition data (details will be described later). .

【００５２】（３）音声認識音声認識においては、入力された音声について、音声登
録メモリ4-1を参照して認識を行ない、対応する相手先
電話番号を電話帳データ4-2から抽出する。音声登録時
と同様に、音声は、ハンドセット1-1-6、マイク1-1-7か
ら、或いはＮＣＵ1-1-9を通じて回線から、もしくはベ
ースユニット1-1-3を通じて子機1-1-5から入力される。
入力された音声は、クロスポイントＳＷ1-1-4を介して
音声データとしてＣＰＵ1-1-1に入力される。ＣＰＵ1-1
-1では、オペレートタスク1-2-2及び音声認識タスク1-2
-8を実行することにより、入力された音声データに種々
の演算を施し、得られたデータと音声登録メモリ4-1に
登録されているデータを比較し、最も近いデータを選択
し、その結果を音声認識結果コマンドとして、オペレー
トタスク1-2-2に返送する。(3) Voice Recognition In voice recognition, the input voice is recognized with reference to the voice registration memory 4-1 and the corresponding destination telephone number is extracted from the telephone directory data 4-2. As with the voice registration, the voice is transmitted from the handset 1-1-6, the microphone 1-1-7, the line through the NCU 1-1-9, or the slave unit 1-1- through the base unit 1-1-3. Entered from 5.
The input voice is input to the CPU 1-1-1 as voice data via the cross point SW1-1-4. CPU1-1
In -1, the operation task 1-2-2 and the speech recognition task 1-2
-8, various operations are performed on the input voice data, the obtained data is compared with the data registered in the voice registration memory 4-1 and the closest data is selected. Is returned to the operating task 1-2-2 as a voice recognition result command.

【００５３】なお、図４において、電話帳への音声登録
を行う場合と、音声認識による電話帳検索とダイヤルを
実行する場合、オペレートタスク1-2-2は音声認識タス
ク1-2-8に音声認識スタートコマンドを発行し、音声認
識タスクに音声登録処理や認識処理をスタートさせる。In FIG. 4, the operation task 1-2-2 is replaced with the voice recognition task 1-2-8 when performing voice registration in the telephone directory and when executing a telephone directory search and dialing by voice recognition. A voice recognition start command is issued, and the voice recognition task starts voice registration processing and recognition processing.

【００５４】音声認識タスク1-2-8は、ハンドセット1-1
-6やマイク1-1-7から入力したアナログ音声データをＣ
ＰＵ1-1-1に内蔵されたＡ／Ｄ変換回路によってサンプ
リングし、デジタルデータに変換し、音声入力データと
してタスク（音声認識タスク1-2-8）に取り込み、演算
処理をかける。処理が終了すると、音声認識結果コマン
ドが音声認識タスク1-2-8からオペレートタスク1-2-2に
返送される。The voice recognition task 1-2-8 includes the handset 1-1.
-6 and analog audio data input from microphone 1-1-7 to C
The data is sampled by an A / D conversion circuit built in the PU 1-1-1, converted into digital data, captured as voice input data in a task (voice recognition task 1-2-8), and subjected to arithmetic processing. When the processing is completed, the speech recognition result command is returned from the speech recognition task 1-2-8 to the operating task 1-2-2.

【００５５】以下、上述した各機能（電話帳機能、音声
登録、音声認識）に関して更に詳細な説明を行なう。Hereinafter, each of the above-mentioned functions (phone book function, voice registration, voice recognition) will be described in more detail.

【００５６】図５は本実施形態による電話帳データ及び
音声登録メモリのデータ構成例を説明する図である。図
５の上部点線内は、ＳＲＡＭ1-1-24上でオペレートタス
クが管理する電話帳データ4-2である。電話帳データ4-2
は大きく２種類に別れており、一つは全ユーザに共通の
電話帳データであり、もう一つは、個人別電話帳データ
である。FIG. 5 is a diagram for explaining an example of the data structure of the telephone directory data and the voice registration memory according to the present embodiment. The upper dotted line in FIG. 5 is the telephone directory data 4-2 managed by the operating task on the SRAM 1-1-24. Phonebook data 4-2
Are generally classified into two types, one is telephone directory data common to all users, and the other is personal telephone directory data.

【００５７】共通の電話帳データは、電話帳データテー
ブル5-1を備える。電話帳データテーブル5-1には、相手
先の略称を示す「略称」、対応する相手先の「電話番
号」、音声データが登録されているかを表すフラグであ
るところの「音声登録」、音声データが登録されている
場合にその音声データを特定する「登録番号」で構成さ
れるデータブロックが複数記憶されている。The common telephone directory data has a telephone directory data table 5-1. The phonebook data table 5-1 includes “abbreviated name” indicating the abbreviated name of the other party, “telephone number” of the corresponding other party, “voice registration” which is a flag indicating whether voice data is registered, When data is registered, a plurality of data blocks each including a “registration number” for specifying the audio data are stored.

【００５８】個人別の電話帳データは、個人用ＩＤテー
ブル5-2と個人用電話帳テーブル5-3で構成される。個人
用ＩＤテーブル5-2には、個人用電話帳テーブルを所有
するユーザの略称を示す「略称」、この略称に対応する
音声データが登録されているかどうかを表す「音声登
録」、音声データが登録されている場合にその音声デー
タを特定する「登録番号」、及び対応する個人用電話帳
テーブルを特定するための「個人用電話帳テーブルアド
レス」で構成されるデータブロックが格納される。ま
た、個人用電話帳テーブル5-3は、個人用ＩＤテーブル
で登録されたユーザの数分存在する。各テーブルのデー
タの構造は、電話帳データテーブル5-1と同じである。The personal telephone directory data is composed of a personal ID table 5-2 and a personal telephone table 5-3. In the personal ID table 5-2, “abbreviated name” indicating the abbreviation of the user who owns the personal telephone directory table, “voice registration” indicating whether or not voice data corresponding to the abbreviation is registered, and voice data are stored. When registered, a data block composed of a “registration number” for specifying the voice data and a “personal phonebook table address” for specifying the corresponding personal phonebook table is stored. Also, the personal telephone directory tables 5-3 exist for the number of users registered in the personal ID table. The data structure of each table is the same as the telephone directory data table 5-1.

【００５９】音声登録メモリ4-1は、ＳＲＡＭ1-1-24上
で音声認識タスクが管理するもので、音声登録テーブル
5-4と音声データ登録領域5-5を格納する。音声登録テー
ブル5-4は、「登録番号」、「状態」、「除外フラ
グ」、「登録アドレス」を備える。「登録番号」は、音
声データ登録領域5-5を特定するものである。「登録番
号」は、先の「電話帳データテーブル」「個人用ＩＤテ
ーブル」「個人用電話帳テーブル」に記述される「登録
番号」と１対１に対応して関連している。「状態」は、
対応する音声データ登録領域5-5にデータが格納されて
いるか否かを示す。「除外フラグ」はオペレートタスク
から認識処理の比較対象としない旨の指定を受けたかど
うかを表す。「除外フラグ」は図６で後述のコマンド
０、１、２、３、４のいずれかを受信したとき全てＯＦ
Ｆにリセットされ、同じくコマンド５、６のいずれかを
受信したときコマンドで指定された登録番号の「除外フ
ラグ」がＯＮにセットされる。また、「登録アドレス」
は、デジタル符号化された音声データが記憶されている
ＲＡＭエリアの実体アドレスである。音声データ登録領
域5-5には、音声認識を行なうのに必要な音声データが
登録される。The voice registration memory 4-1 is managed by the voice recognition task on the SRAM 1-1-24, and includes a voice registration table.
5-4 and a voice data registration area 5-5 are stored. The voice registration table 5-4 includes “registration number”, “state”, “exclusion flag”, and “registration address”. The “registration number” specifies the audio data registration area 5-5. The “registration number” has a one-to-one correspondence with the “registration number” described in the “phone book data table”, “personal ID table”, and “personal phone book table”. "State"
Indicates whether data is stored in the corresponding audio data registration area 5-5. The “exclusion flag” indicates whether or not the operation task has received a designation not to be compared with the recognition process. The “exclusion flag” is set to “OFF” when any of commands 0, 1, 2, 3, and 4 described below is received in FIG.
The flag is reset to F, and when any of commands 5 and 6 is received, the "exclusion flag" of the registration number designated by the command is set to ON. "Registered address"
Is the actual address of the RAM area where the digitally encoded audio data is stored. Voice data required for performing voice recognition is registered in the voice data registration area 5-5.

【００６０】以上のようなデータ構成において、電話帳
データと音声データは「登録番号」によってリンクされ
ている。従って、たとえば、入力された音声について、
音声データ登録領域5-5のデータを参照して音声認識を
行ない、認識結果として登録番号「ＮＯ．２」を得た場
合、この登録番号で電話帳データテーブル5-1及び個人
用ＩＤテーブル5-2が検索され、この場合は、電話帳デ
ータテーブル5-1から「マナブ」という略称で登録され
た電話番号が獲得される。In the above data structure, telephone directory data and voice data are linked by a “registration number”. Therefore, for example, for the input voice,
When voice recognition is performed with reference to the data in the voice data registration area 5-5 and a registration number "NO.2" is obtained as a recognition result, the telephone directory data table 5-1 and the personal ID table 5 -2 is searched, and in this case, a telephone number registered with an abbreviation "manab" is obtained from the telephone directory data table 5-1.

【００６１】なお、音声認識の結果として登録番号「Ｎ
Ｏ.７」を得た場合、個人用ＩＤテーブル5-2に一致する
データブロックが存在するので、当該音声入力は個人別
電話帳の指定であったと判断する。そして、次に入力さ
れる音声に対する音声認識は、指定された個人別電話帳
(個人用電話帳テーブル5-3)に登録された範囲内で音声
認識による相手先の特定を行なう。なお、図５では略称
が３文字のものしか示されていないが、３文字に限られ
ないことは言うまでもないであろう。The registration number “N” is obtained as a result of the speech recognition.
When "O.7" is obtained, since there is a data block corresponding to the personal ID table 5-2, it is determined that the voice input is a designation of a personal telephone directory. Then, voice recognition for the next input voice is performed in the specified personal telephone directory.
The destination is identified by voice recognition within the range registered in (Personal Phone Book Table 5-3). In FIG. 5, the abbreviation has only three characters, but it is needless to say that the abbreviation is not limited to three characters.

【００６２】本実施形態では、上述のような個人別の電
話帳データを用いることにより、特定話者認識のごとき
音声認識環境が得られるので、音声認識の認識率を向上
させることができる。なお、共通の電話帳データと、個
人別の電話帳データの利用についての更なる詳細は後述
する。In the present embodiment, a voice recognition environment such as specific speaker recognition can be obtained by using the above-described telephone directory data for each individual, so that the recognition rate of voice recognition can be improved. Further details on the use of the common telephone directory data and the individual telephone directory data will be described later.

【００６３】図６はオペレートタスクから音声認識タス
クへ発行される音声認識スタートコマンドの種類と内容
を説明する図である。図６に示されるように、コマンド
の種類としてコマンド０からコマンド６まで存在する。
コマンド０は指定の音声登録データを音声登録メモリか
ら消去するため使用される。なお、図６の「登録番号」
に○が付されているのは、そのコマンドが登録番号を指
定することを表す。また、ここで指定される登録番号
は、図５で説明した登録番号と対応するものである。FIG. 6 is a diagram for explaining the type and contents of a speech recognition start command issued from the operation task to the speech recognition task. As shown in FIG. 6, there are command types from command 0 to command 6.
Command 0 is used to delete the specified voice registration data from the voice registration memory. The “registration number” in FIG.
Indicates that the command specifies a registration number. The registration number specified here corresponds to the registration number described with reference to FIG.

【００６４】コマンド１および２は、音声認識タスク1-
2-8に音声登録処理を依頼するため使用される。コマン
ド３および４は音声認識タスク1-2-8に音声認識処理を
依頼するため使用される。コマンド５および６も音声認
識処理を依頼するため使用されるが、これらを受取った
音声認識タスクは、当該コマンドで指定された登録番号
の音声登録データを比較対象から削除して認識する（図
５の音声登録テーブル5-4における「除外フラグ」をＯ
Ｎにする）。Commands 1 and 2 correspond to voice recognition task 1-
Used to request the audio registration process from 2-8. Commands 3 and 4 are used to request the voice recognition task 1-2-8 for voice recognition processing. Commands 5 and 6 are also used to request a voice recognition process, and the voice recognition task that receives them deletes the voice registration data of the registration number specified by the command from the comparison target and recognizes it (FIG. 5). "Exclusion flag" in the voice registration table 5-4
N).

【００６５】図７は音声認識タスクからオペレートタス
クへ発行される音声認識結果コマンドの種類と内容を説
明する図である。図７に示されるように、コマンド種別
がコマンド１からコマンド４まで存在する。コマンド１
および２は、音声登録結果をオペレートタスク1-2-2に
通知するため使用される。コマンド３および４は音声認
識結果を通知するために使用される。FIG. 7 is a diagram for explaining the types and contents of voice recognition result commands issued from the voice recognition task to the operating task. As shown in FIG. 7, command types exist from command 1 to command 4. Command 1
And 2 are used to notify the voice registration result to the operating task 1-2-2. Commands 3 and 4 are used to notify the speech recognition result.

【００６６】なお、本装置のＳＲＡＭ1-1-24上には、
「理由コード表示する」のスイッチ項目が設けられてお
り、「機能設定」モードにおいて、オペレータの所望す
る状態（ＯＮ/ＯＦＦ）を設定できる。これがＯＮされ
ているときには、音声登録／認識の各処理でＮＧだった
場合に、ＮＧの理由を示す「理由コード」が結果コマン
ドに反映される。すなわち、「理由コードを表示する」
がＯＮとなっている場合には、音声登録／認識の各処理
において、オペレートタスク1-2-2はそれぞれコマンド
４／コマンド６（ＮＧ発生時にその理由コードを付す旨
の指示を含むコマンド）を用いて音声認識タスク1-2-8
に指示を行なう。そして、このコマンドを受けた音声認
識タスク1-2-8は、音声認識結果コマンド２もしくは４
を用いて返答を行なうことになる。ここで、例えば入力
した音声の所定の区間（長さ）で無音を検知した場合に
は「有効データ無し」であると、また、所定の区間（長
さ）で有音（音声部分）が終了しないことを検知した場
合は「入力時間超過」であると音声認識タスク1-2-8は
判断し、このエラー状態に対応する理由コードを音声認
識結果コマンド２もしくは４に含ませる。The SRAM 1-1-24 of the present device has:
A switch item "display reason code" is provided, and a state (ON / OFF) desired by the operator can be set in the "function setting" mode. When this is turned on, if the voice registration / recognition processing is unsuccessful, a “reason code” indicating the reason of the unsuccessful operation is reflected in the result command. That is, "Display reason code"
Is ON, in each process of voice registration / recognition, the operating task 1-2-2 respectively issues a command 4 / command 6 (a command including an instruction to attach a reason code when an NG occurs). Speech recognition task using 1-2-8
Instruct Then, the voice recognition task 1-2-8 receiving this command executes the voice recognition result command 2 or 4
Will be used to respond. Here, for example, when silence is detected in a predetermined section (length) of the input voice, it is determined that there is no valid data, and a sound (voice portion) ends in the predetermined section (length). If the voice recognition task 1-2-8 detects that it is not, the voice recognition task 1-2-8 determines that the input time has been exceeded, and includes the reason code corresponding to this error state in the voice recognition result command 2 or 4.

【００６７】図８は、本実施形態による理由コードと、
処理種別、理由内容の関係の一例を示す図である。図８
に示されるような内容を取扱説明書等に載せておくこと
により、オペレータは発生したＮＧの理由を知ることが
できる。本実施形態のファクシミリ装置では、「理由コ
ードを表示する」がＯＮとなっていれば、操作パネル上
のＬＣＤ1-3-6に、ＮＧの旨の表示と共に、図示のよう
な理由コードが表示される。このため、製造検査時やユ
ーザー使用時の重要なガイダンスを提供することが出来
る。ＮＧ発生時の表示は、理由コードのみならず、理由
内容をも表示しても良い。そのためには、理由コードに
対応する理由内容を表す文字コード列、或いはＬＣＤ1-
3-6に表示する理由内容を表すパターンデータそのもの
を予め記憶しておき、発生した理由コードに対応するデ
ータを読み出してＬＣＤ1-3-6に表示する。また、理由
内容に替えて、対処方法を表示しても良い。FIG. 8 shows reason codes according to the present embodiment,
FIG. 11 is a diagram illustrating an example of a relationship between a process type and a reason content. FIG.
The operator can know the reason of the NG that has occurred by placing the contents shown in the above in an instruction manual or the like. In the facsimile apparatus of this embodiment, if "display reason code" is ON, a reason code as shown in the figure is displayed on LCD1-3-6 on the operation panel together with a display indicating NG. You. For this reason, it is possible to provide important guidance at the time of manufacturing inspection and at the time of user use. The display at the time of NG occurrence may display not only the reason code but also the content of the reason. To do so, a character code string representing the reason content corresponding to the reason code or the LCD1-
The pattern data itself representing the reason content to be displayed on 3-6 is stored in advance, and the data corresponding to the generated reason code is read out and displayed on the LCD 1-3-6. Further, a coping method may be displayed instead of the reason content.

【００６８】以下、本実施形態のファクシミリ装置にお
ける電話帳機能の登録と操作、および音声認識による宛
先ダイヤルを行う際の詳細な動作について説明してゆ
く。The detailed operation of registering and operating the telephone directory function in the facsimile apparatus of this embodiment and performing the destination dialing by voice recognition will be described below.

【００６９】＜電話帳登録関連＞電話帳登録関連の処理
としては、個人別電話帳の作成、共通の電話帳及び個人
別電話帳のそれぞれに対する「新規登録」、「登録内容
修正」、「登録内容消去」及び「音声データ消去」があ
る。各処理を実行させるためには、メニューから所望の
機能を選択したりするなど、種々の方法が考えられる。
本実施形態では、処理の選択操作について限定はしな
い。<Phone Book Registration Related> The processing related to phone book registration includes creation of an individual phone book, “new registration”, “registration correction”, and “registration” for a common phone book and an individual phone book. There are "delete contents" and "delete audio data". Various methods are conceivable for executing each process, such as selecting a desired function from a menu.
In the present embodiment, the selection operation of the process is not limited.

【００７０】（１）個人別電話帳の作成図９は本実施形態による個人別電話帳の作成処理操作に
おける表示内容を説明する図である。また、図１０は本
実施形態による個人別電話帳の登録処理手順を表すフロ
ーチャートである。(1) Creation of Individual Phone Book FIG. 9 is a view for explaining the display contents in the operation of creating an individual phone book according to the present embodiment. FIG. 10 is a flowchart showing a procedure for registering an individual telephone directory according to the present embodiment.

【００７１】機能メニュー等を操作して、個人別電話帳
の作成処理を選択すると、ステップＳ１１において図９
の（ａ）に示すような初期画面を表示する。この状態
で、セットキー1-3-10を押すと、処理はステップＳ１２
からＳ１３へ進み、図９の（ｂ）に示すような略称入力
画面を表示して、ユーザに略称を入力させる。操作とし
ては、下段から所望の文字を選択すると、上段の■カー
ソルの位置に文字が順次配置され、下段の「オワリ」に
カーソルを移動することで略称入力の完了とする。こう
して略称入力の完了が示されると、処理はステップＳ１
４からステップＳ１５へと進む。When the user operates the function menu or the like to select a personal telephone directory creation process, in step S11 FIG.
An initial screen as shown in FIG. In this state, if the set key 1-3-10 is pressed, the process proceeds to step S12.
Then, the process proceeds to S13, where an abbreviation input screen as shown in FIG. 9B is displayed, and the user inputs an abbreviation. As an operation, when a desired character is selected from the lower part, the character is sequentially arranged at the position of the @ cursor in the upper part, and the cursor is moved to "Owari" in the lower part to complete the abbreviation input. When the completion of the abbreviation input is indicated in this manner, the process proceeds to step S1.
The process proceeds from Step 4 to Step S15.

【００７２】ステップＳ１５では、個人用ＩＤテーブル
5-2に新たなデータブロックを追加し、その「略称」の
欄にステップＳ１３で入力された略称を登録し、新たに
割り当てた個人用電話帳テーブルのアドレスをその個人
用ＩＤテーブル5-2の「個人用電話帳テーブルアドレ
ス」の欄に登録する。そして、ステップＳ１６におい
て、図９の（ｃ）の如く、個人別電話帳の登録完了を示
す表示を行なう。In step S15, the personal ID table
A new data block is added to 5-2, the abbreviation entered in step S13 is registered in the "abbreviation" field, and the address of the newly assigned personal telephone directory table is assigned to the personal ID table 5-2. In the "Personal phonebook table address" column. Then, in step S16, as shown in FIG. 9C, a display indicating completion of registration of the personal telephone directory is performed.

【００７３】引き続き、ステップＳ１７において、図９
の（ｅ）のごとき画面を表示し、音声データの登録を行
なうか否かを問い合わせる。ここで、ユーザが「シナ
イ」を選択すれば、そのまま本処理を終了する。Subsequently, in step S17, FIG.
(E) is displayed and an inquiry is made as to whether or not to register audio data. Here, if the user selects “Sinai”, the process is terminated as it is.

【００７４】一方、音声データの登録を「スル」とした
場合、処理はステップＳ１８からステップＳ１９へ進
み、音声認識用データを生成するための音声登録処理を
実行する。音声登録処理については図１５により後述す
る。この音声登録処理によって得られたデータは、音声
データ登録領域5-5に登録される。そして、ステップＳ
２０において、音声登録テーブル5-4及び個人用ＩＤテ
ーブル5-2の各欄に、略称と音声データとをリンクする
ための適切なデータが書き込まれる。On the other hand, if the registration of the voice data is set to "NO", the process proceeds from step S18 to step S19, and a voice registration process for generating voice recognition data is performed. The voice registration process will be described later with reference to FIG. The data obtained by this voice registration processing is registered in the voice data registration area 5-5. And step S
At 20, the appropriate data for linking the abbreviation with the audio data is written in each column of the audio registration table 5-4 and the personal ID table 5-2.

【００７５】（２）電話帳登録処理電話帳へのデータ登録について説明する。なお、データ
登録に先立って、登録先が個人別電話帳か共通電話帳か
を指定し、個人別電話帳であればどのユーザの電話帳で
あるかが指定される。電話帳の特定は、たとえば次のよ
うに行なわれる。すなわち、機能キー1-3-2を押し、
上下カーソルキー1-3-11、1-3-12を操作することで、
「共通電話帳」か「個人別電話帳」のいずれかがメニュ
ーから指定され、個人別電話帳が指定された場合は、
ＬＣＤ1-3-6上に、個人用ＩＤテーブル5-2に登録されて
いる略称が順次表示される。所望の略称を選択するこ
とにより処理対象とする個人別電話帳が選択される。従
って、この状態で、「データ登録」を開始すれば、選択
された個人別電話帳への新規登録が行なわれることにな
る。この時点で、共通電話帳への登録処理も個人別電話
帳への登録処理も同じものとなる。(2) Telephone Directory Registration Processing Data registration to the telephone directory will be described. Prior to data registration, it is specified whether the registration destination is a personal telephone directory or a common telephone directory, and if it is a personal telephone directory, which user's telephone directory is specified. The telephone directory is specified, for example, as follows. That is, press the function key 1-3-2,
By operating the up and down cursor keys 1-3-11 and 1-3-12,
If either "Common Phonebook" or "Personal Phonebook" is specified from the menu and a Personal Phonebook is specified,
Abbreviations registered in the personal ID table 5-2 are sequentially displayed on the LCD 1-3-6. By selecting a desired abbreviation, a personal telephone directory to be processed is selected. Therefore, if "data registration" is started in this state, new registration in the selected personal telephone directory is performed. At this point, the registration process in the common telephone directory and the registration process in the personal telephone directory are the same.

【００７６】図１１は電話帳へのデータ登録における画
面表示例を示す図である。図１２及び図１３は電話帳へ
の音声データ登録における画面表示例を示す図である。
図１４は電話帳へのデータ登録処理を説明するフローチ
ャートである。また、図１５は電話帳への音声データ登
録処理を説明するフローチャートである。FIG. 11 is a diagram showing an example of a screen display in registering data in the telephone directory. FIG. 12 and FIG. 13 are diagrams showing screen display examples in registering voice data in the telephone directory.
FIG. 14 is a flowchart illustrating a process of registering data in a telephone directory. FIG. 15 is a flowchart for explaining a process of registering voice data in a telephone directory.

【００７７】電話帳を選択し、電話帳へのデータ登録が
メニューから選択されると、ステップＳ２１において、
ＬＣＤ1-3-6上に、図１１の（ａ）に示すごとき表示を
行なう。なお、この表示において「アキ70ケン」は、電話帳
へ登録が可能な残り件数を表す。本実施形態において、
登録可能件数を１００件とすれば、図１１の（ａ）の表
示の時点では３０件のデータがすでに登録されている。When a telephone directory is selected and data registration in the telephone directory is selected from a menu, in step S21,
A display as shown in FIG. 11A is performed on the LCD 1-3-6. In this display, “Aki 70 Ken” indicates the number of remaining items that can be registered in the telephone directory. In this embodiment,
Assuming that the number of registrable cases is 100, at the time of the display of FIG. 11A, 30 data items have already been registered.

【００７８】次に、セットキー1-3-10が押されると、ス
テップＳ３２からＳ３３へ処理が進み、図１１の（ｂ）
に示すような画面を表示して、ユーザに略称（宛名）を
入力させる。Next, when the set key 1-3-10 is pressed, the process proceeds from step S32 to S33, and FIG.
Is displayed, and the user inputs an abbreviation (address).

【００７９】入力が完了すると（図１１（ｂ）において
「オワリ」が選択されると）、ステップＳ３４からステッ
プＳ３５へ処理が進み、電話番号の入力を行なわせる。
電話番号の入力はダイヤルにより行なうものとする。図
１１の（ｃ）では、電話番号の入力を行なった状態が示
されている。この状態で、セットキー1-3-10が押される
と、ステップＳ３６からステップＳ３７へ処理が進み、
電話帳データテーブル5-1（個人別電話帳への登録であ
れば、指定された個人用電話帳テーブル5-3）へ、入力
された略称と電話番号が登録される。When the input is completed ("Owari" is selected in FIG. 11B), the process proceeds from step S34 to step S35, and the telephone number is input.
The telephone number is entered by dialing. FIG. 11C shows a state in which a telephone number has been input. When the set key 1-3-10 is pressed in this state, the process proceeds from step S36 to step S37,
The entered abbreviation and telephone number are registered in the telephone directory data table 5-1 (or in the case of registration in the personal telephone directory, a designated personal telephone table 5-3).

【００８０】その後、ステップＳ３８へ進み、図１１の
（ｅ）に示すような画面表示を行なって、音声データの
登録を行なうか否かをユーザに問い合わせる。図１１
（ｅ）において「シナイ」が選択された場合は、ステップ
Ｓ４１へ進み、続けて当該電話帳へのデータ登録を行な
うかどうかを問い合わせる（図１１の（ｆ））。続けて
登録するのであれば、処理をステップＳ３３へ戻し、そ
うでなければ本処理を終了する。Thereafter, the flow advances to step S38 to display a screen as shown in FIG. 11E and ask the user whether or not to register voice data. FIG.
If "Sinai" is selected in (e), the process proceeds to step S41, and an inquiry is made as to whether or not to register data in the telephone directory ((f) in FIG. 11). If registration is to be continued, the process returns to step S33; otherwise, the process ends.

【００８１】一方、ステップＳ３８において、音声登録
を行なう旨の指示がなされると、処理はステップＳ３９
へ進み、音声登録処理を行なう。そして、ステップＳ４
０において、図１０のステップＳ２０において上述した
ように、音声登録処理によって得られたデータは、音声
データ登録領域5-5に登録される。すなわち、音声登録
テーブル5-4及び電話帳データテーブル5-1（個人別電話
帳への登録であれば、指定された個人用電話帳テーブル
5-3）の各欄に、略称と音声データとをリンクするため
の適切なデータが書き込まれる。On the other hand, when an instruction to perform voice registration is issued in step S38, the process proceeds to step S39.
Then, the voice registration process is performed. Then, step S4
At 0, the data obtained by the voice registration process is registered in the voice data registration area 5-5, as described above in step S20 of FIG. That is, the voice registration table 5-4 and the telephone directory data table 5-1 (if registered in the personal telephone directory, the designated personal telephone directory table
In each column of 5-3), appropriate data for linking the abbreviation and audio data is written.

【００８２】次に音声登録処理を説明する。Next, the voice registration processing will be described.

【００８３】図１５のステップＳ４５において、装置の
音声登録可能件数と、音声の登録件数とを比較し、音声
登録が可能かどうかを判断する。本実施形態では、最大
１５件の音声データを登録できるものとする。音声登録
ができなければ、エラー処理を行なう（ステップＳ５
７）。なお、このエラー処理は、後述の図２２のフロー
チャートにおけるステップＳ７０〜Ｓ７２と同様の処理
行なうもので、音声登録が不可と判断された理由を表す
コードを図８に示される理由コードから選択し、ユーザ
に提示する（ここでは、図８のＲ０４「登録件数フル」
が選択されることになる。In step S45 of FIG. 15, the number of voice registrations of the apparatus and the number of voice registrations are compared to determine whether voice registration is possible. In the present embodiment, it is assumed that up to 15 voice data can be registered. If voice registration is not possible, error processing is performed (step S5).
7). Note that this error processing is the same processing as steps S70 to S72 in the flowchart of FIG. 22 described later, and a code indicating the reason that voice registration is determined to be impossible is selected from the reason codes shown in FIG. Presented to the user (here, R04 “Registration number full” in FIG. 8)
Will be selected.

【００８４】音声登録が可能であれば、ステップＳ４６
において図１２の（ａ）に示すような初期画面を表示す
るとともに、カウンタＮに３をセットする。本実施形態
では、音声登録において、音声入力を３回行なわせるこ
とにより、音声認識処理の精度を向上させるが、このカ
ウンタＮはその音声入力回数をカウントするものであ
る。If voice registration is possible, step S46
In FIG. 12, an initial screen as shown in FIG. 12A is displayed, and 3 is set in the counter N. In the present embodiment, the accuracy of the voice recognition processing is improved by performing voice input three times in voice registration, but this counter N counts the number of voice inputs.

【００８５】画面の指示にしたがってユーザが受話器を
上げると、ステップＳ４７からステップＳ４８へ進み、
ガイダンス表示及びガイドトーンの発生を行なう。ガイ
ダンス表示では、まず図１２の（ｂ）のごとき表示を行
ない、ガイドトーン（ピー音）を発生した後に図１２の
（ｃ）のごとき表示を行なう。When the user picks up the receiver according to the instructions on the screen, the process proceeds from step S47 to step S48,
A guidance display and a guide tone are generated. In the guidance display, first, a display as shown in FIG. 12B is performed, and after a guide tone (peep sound) is generated, a display as shown in FIG. 12C is performed.

【００８６】この状態でユーザが登録する名前を受話器
に向かって話し、音声を感知すると、ステップＳ４９に
おいて図１２の（ｄ）のごとき表示を行なう。そして、
音声の入力を終えると、ステップＳ４９からステップＳ
５０へ進み、図１２の（ｅ）のごとき表示を行なうと共
に、入力された音声の解析を行なう。そして、この解析
が正常に行なわれたと判断されたならばステップＳ５１
からステップＳ５２へ進み、異常であると判断された場
合はステップＳ４７からエラー処理へ移行する（ステッ
プＳ５８）。なお、このエラー処理は、後述の図２２の
フローチャートにおけるステップＳ７０〜Ｓ７２と同様
の処理行なうもので、音声登録が不可であると判断され
た理由をあらわすコードを図８に示される理由コードか
ら選択し、ユーザに提示する。In this state, when the user speaks the registered name to the receiver and senses voice, a display as shown in FIG. 12D is made in step S49. And
When the input of the voice is completed, steps S49 to S
Proceeding to 50, the display as shown in FIG. 12 (e) is made and the input voice is analyzed. If it is determined that this analysis has been performed normally, step S51 is performed.
The process proceeds from step S52 to step S52, and if it is determined that there is an abnormality, the process proceeds from step S47 to error processing (step S58). This error processing is similar to steps S70 to S72 in the flowchart of FIG. 22, which will be described later. A code indicating the reason that voice registration is determined to be impossible is selected from the reason codes shown in FIG. And present it to the user.

【００８７】音声解析が正常に行なわれると、ステップ
Ｓ５２においてカウンタＮの値を１減少させ、ステップ
Ｓ５３において、その結果としてＮ＝０となったかどう
かを判定する。Ｎ＝０でなければ、また音声解析処理を
行なう必要がある（本実施形態では３回行なう）ので、
ステップＳ５４において、例えば図１３の（ａ）に示す
ごとく音声解析が正常に行なわれたことを報知し、残り
発生必要回数を報知する画面の表示を行なう。なお、図
１３の（ａ）において、残り回数の表示（「アト2カイ」）
は、カウンタＮの値を表示すれば良い。When the voice analysis is performed normally, the value of the counter N is decreased by 1 in step S52, and in step S53, it is determined whether or not N = 0 as a result. Unless N = 0, it is necessary to perform a voice analysis process (in this embodiment, three times).
In step S54, for example, as shown in FIG. 13A, the fact that the voice analysis has been performed normally is displayed, and a screen for notifying the remaining necessary number of times is displayed. In addition, in (a) of FIG. 13, the display of the remaining number of times (“At 2 Kai”)
May display the value of the counter N.

【００８８】以上のようにして、３回の音声登録を終了
すると、カウンタＮの値が０となり、処理はステップＳ
５３からステップＳ５５へ進む。ステップＳ５５では、
未使用の音声データ登録領域5-5へ発声された複数回の
音声データ全てを、ステップＳ３３及びＳ３５で入力さ
れたデータに対応させて登録する。ここで登録する音声
データは、全ての音声データではなく、条件の良いもの
のみを選択しても良いし、或いは複数の発声データから
最良のデータを作成したものでも良い。この間、図１３
の（ｂ）のような、音声登録処理中であることを報知す
る画面の表示を行なう。そして、音声データの登録を完
了すると、ステップＳ５６において後処理を行なう。す
なわち、図１３の（ｃ）に示すような表示を行なって、
受話器を元に戻すよう指示し、受話器が戻されたことの
検知に応じて、図１３の（ｄ）のような音声登録処理の
完了を報知する画面の表示を行なって、登録が完了した
ことを示す。そして、図１３の（ｅ）に示すような、別
項目の音声登録を続けて行なうか否かをオペレータが指
示するための画面を表示し、「シナイ」が指示された場
合には、呼び出しもとのルーチン（図１０或いは図１
４）に戻り、「スル」が指示された場合にはステップＳ
３３に戻る。As described above, when three voice registrations have been completed, the value of the counter N becomes 0, and the process proceeds to step S
From 53, the process proceeds to step S55. In step S55,
All of the plurality of voice data uttered to the unused voice data registration area 5-5 are registered in correspondence with the data input in steps S33 and S35. The voice data to be registered here may not be all voice data, but may be selected only with good conditions, or may be the best data created from a plurality of voice data. During this time, FIG.
A screen for notifying that the voice registration process is being performed as shown in FIG. When the registration of the audio data is completed, post-processing is performed in step S56. That is, a display as shown in FIG.
When the receiver is instructed to return to the original state, and in response to the detection that the receiver has been returned, a screen for notifying the completion of the voice registration process as shown in FIG. 13D is displayed, and the registration is completed. Is shown. Then, as shown in FIG. 13 (e), a screen for the operator to instruct whether or not to continue the voice registration of another item is displayed. When "Sinai" is instructed, a call is also made. (FIG. 10 or FIG. 1)
Returning to 4), if “Sur” is instructed, step S
Return to 33.

【００８９】（３）電話帳登録内容修正電話帳に登録された内容を修正する操作を説明する。な
お、本処理の制御手順は、以下に説明する操作の手順と
それにしたがった表示画面の提示により明らかであるの
で、ここでは図１６を参照して操作手順のみを説明す
る。図１６は、本実施形態における電話帳データの登録
内容の修正操作を説明する図である。(3) Correction of Registered Contents in Phone Book The operation for correcting the contents registered in the phone book will be described. Since the control procedure of this process is clear from the procedure of the operation described below and the presentation of the display screen according to the procedure, here, only the operation procedure will be described with reference to FIG. FIG. 16 is a diagram illustrating an operation of correcting the registered contents of the telephone directory data according to the present embodiment.

【００９０】例えば、本装置のスタンバイ状態で「電話
帳」キー（セットアップキー1-3-10）を押下し、上下カ
ーソルキーで登録修正したい略称（あて先）を選択す
る。この時点における画面表示例は、図１６の（ａ）の
ようになる。なお、表示の右端に「Ｖ」が表示されてい
るのは、その登録データに音声認識用のデータが登録さ
れていることを表す。次に、「カナ／英数字」キーを押
下すると、図１６の（ｂ）のような修正用画面の表示と
なる。なお、上下左右カーソルで下段の候補文字選択を
行ない、上段のカーソルは「＊＃」キーで移動を行なう
ものとする。For example, in the standby state of the apparatus, the "phone directory" key (setup key 1-3-10) is pressed, and the abbreviation (destination) to be registered / corrected is selected by the up / down cursor keys. An example of the screen display at this point is as shown in FIG. The display of “V” at the right end of the display indicates that the data for voice recognition is registered in the registered data. Next, when the "kana / alphanumeric" key is pressed, a correction screen as shown in FIG. 16B is displayed. Note that the lower candidate character is selected with the up, down, left, and right cursors, and the upper cursor is moved with the “* #” key.

【００９１】ここで、「キムラ」の「キ」を「タ」に修
正すると、図１６の（ｃ）のようになる。そして、「オワ
リ」を選択すると、略称の修正操作を終えて、番号の修
正操作を行なうべく、図１６の（ｄ）の表示となる。番
号を変更する場合は、「消去」キーによって現在登録さ
れている番号をいったん消去し、その後ダイヤルにより
入力しなおす。図１６の（ｅ）は番号の入れなおしを行
なっている状態を示す。Here, when "K" of "Kimura" is corrected to "TA", the result is as shown in FIG. 16 (c). Then, when "Owari" is selected, the display of FIG. 16D is displayed in order to complete the operation of correcting the abbreviation and to perform the operation of correcting the number. When changing the number, the currently registered number is deleted once by the "delete" key, and then the number is input again by dialing. FIG. 16E shows a state in which the numbers are reset.

【００９２】番号を入れなおしたならば、セットキーを
押下することで修正処理を終了する。一方、番号の変更
がなければ、図１６（ｄ）の状態からセットキーを押下
する。After the numbers have been reset, the correction process is terminated by pressing the set key. On the other hand, if there is no change in the number, the set key is pressed from the state shown in FIG.

【００９３】次に、音声データについて修正を行なうか
どうかをユーザに指示させるために、図１６（ｆ）のご
とき画面表示を行なう。すでに当該データに関して音声
データが登録されている場合、図１６（ｆ）の画面で
「スル」が選択されると、更に図１６の（ｇ）のごとき
画面を表示し、音声データの入れ替えを行うか問い合わ
せる。すなわち、本修正操作によれば、音声データが未
登録の登録データについては図１６（ｆ）の画面で「ス
ル」を選択することにより音声データ登録が開始され
る。また、音声データがすでに登録されているデータに
ついては、図１６（ｆ）の画面で「スル」が選択された
場合に、更に図１６（ｇ）のような画面表示を行なっ
て、音声データの入れ替えを行なっても良いかどうかを
問い合わせる（すなわち、登録済みの音声データが消え
ても良いかどうかを問い合わせている）。図１６（ｇ）
の画面で「シナイ」が選択された場合は、音声データの登
録処理は行なわれないことになる。Next, a screen display as shown in FIG. 16 (f) is made to instruct the user whether or not the voice data should be corrected. If audio data has already been registered for the data, and “Sur” is selected on the screen in FIG. 16F, a screen as shown in FIG. 16G is displayed, and the audio data is replaced. Inquire. In other words, according to this correction operation, voice data registration is started by selecting “NO” on the screen of FIG. 16F for registered data for which voice data has not been registered. For data for which voice data has already been registered, when “Sur” is selected on the screen of FIG. 16F, a screen display as shown in FIG. An inquiry is made as to whether or not the replacement can be performed (that is, whether or not the registered voice data can be erased). FIG. 16 (g)
When "Sinai" is selected on the screen, the registration processing of the audio data is not performed.

【００９４】（４）電話帳登録内容削除電話帳に登録された内容を削除する操作を説明する。な
お、本処理の制御手順は、以下に説明する操作の手順と
それにしたがった表示画面の内容の提示により明らかで
あるので、ここでは図１７を参照して操作手順のみを説
明する。図１７は、本実施形態における電話帳データの
登録内容の削除操作を説明する図である。(4) Deletion of Registered Contents in Phone Book The operation for deleting the contents registered in the phone book will be described. Since the control procedure of this process is clear from the procedure of the operation described below and the presentation of the contents of the display screen according to the procedure, only the operation procedure will be described with reference to FIG. FIG. 17 is a diagram illustrating an operation of deleting registered contents of telephone directory data according to the present embodiment.

【００９５】本装置のスタンバイ状態で「電話帳」キー
（セットアップキー1-3-10）を押下し、上下カーソルキ
ーで登録修正したい略称（あて先）を選択する。この時
点における画面表示例は、図１７の（ａ）のようにな
る。While the apparatus is in the standby state, the user presses the "Phonebook" key (setup key 1-3-10) and selects the abbreviation (destination) to be registered / corrected with the up / down cursor keys. An example of the screen display at this point is as shown in FIG.

【００９６】この状態で、消去キー1-3-5を押下する
と、図１７の（ｂ）のようになる。そして、セットアッ
プキー1-3-10を押下すると、図１７の（ｃ）のように表
示される。この時点で、略称が「キムラタクヤ」として登録
された電話帳データは、電話帳データテーブル5-1（個
人別電話帳への登録であれば、指定された個人用電話帳
テーブル5-3）から削除される。このとき、消去したデ
ータに音声データが登録されていた場合は、当該音声デ
ータもいっしょに消去される。すなわち、電話帳データ
テーブルにおいて、削除対象のデータブロックにおける
「登録番号」で音声登録テーブル5-4を検索し、対応す
るデータブロックの「状態」を空きにする。In this state, when the erase key 1-3-5 is pressed, the state becomes as shown in FIG. Then, when the setup key 1-3-10 is pressed, the display is displayed as shown in FIG. At this point, the telephone directory data registered with the abbreviated name “Kimura Takuya” is the telephone directory data table 5-1 (or the specified personal telephone directory table 5-3 if registered in the personal telephone directory). Removed from. At this time, if audio data is registered in the erased data, the audio data is also erased together. That is, in the telephone directory data table, the voice registration table 5-4 is searched for the "registration number" of the data block to be deleted, and the "state" of the corresponding data block is made empty.

【００９７】以上のようにして消去処理を終えると、図
１７の（ｃ）のように表示され、所定時間経過後に図１
７の（ｄ）のように表示される。When the erasing process is completed as described above, the display is displayed as shown in FIG.
7 (d).

【００９８】（５）音声データの削除次に、登録されている音声データのみを削除する方法説
明する。図１８は音声データの削除手順を説明する図で
ある。音声データの削除に際しては、まず、電話帳機能
により、音声データを削除したい略称（宛先略称であっ
ても個人別電話帳の略称であっても良い）を表示させ
る。そして、「カナ／英字」キー（本実施形態では、リ
ダイヤルキー1-3-26で代用）を押下し、略称と電話番号
をそのまま確定すると、図１８（ａ）に示す如く表示さ
れる。ここで、「シナイ」を選択すると、図１８の（ｂ）
に示す如く表示され、当該音声データを消去するかどう
かを問い合わせる。ここでさらに「スル」が選択される
と、当該登録データの音声データが消去され、図１７の
（ｃ）に示すような表示が行なわれる。(5) Deletion of Audio Data Next, a method of deleting only registered audio data will be described. FIG. 18 is a diagram illustrating a procedure for deleting audio data. When deleting voice data, first, an abbreviation (either a destination abbreviation or an individual phonebook abbreviation) to be deleted is displayed by the telephone directory function. When the "kana / alphabet" key (in this embodiment, the redial key 1-3-26 is substituted) is pressed down and the abbreviation and the telephone number are determined as they are, the display is as shown in FIG. 18 (a). Here, when "Sinai" is selected, (b) in FIG.
Is displayed as shown in (2), and an inquiry is made as to whether or not to delete the audio data. Here, when “Sur” is further selected, the audio data of the registered data is deleted, and a display as shown in FIG. 17C is performed.

【００９９】なお、電話帳で略称を表示して、「カナ／
英字」キーを押下し、名前と電話番号をそのまま確定し
た後、図１８（ａ）の表示において「スル」を選択する
と、図１８の（ｄ）のように表示され、当該登録データ
に関する音声データの入れ替えを行なうことができる。[0099] The abbreviation is displayed in the telephone directory and "Kana /
After pressing the "alphabet" key and confirming the name and telephone number as they are, selecting "Sulu" in the display of FIG. 18 (a) displays as shown in (d) of FIG. Can be replaced.

【０１００】（６）電話帳検索操作次に、電話帳データの検索について説明する。図１９は
電話帳検索時の操作に応じた画面表示例を示す図であ
る。スタンバイ状態、又は回線捕捉後に電話帳キー（セ
ットキー1-3-10）を押下し、上下カーソル移動キー1-3-
11,1-3-12を操作することにより、ＬＣＤ1-3-6上に、電
話帳に登録されている略称が順次（例えばあいうえお順
で）表示される（図１９の（ａ）から（ｃ））。図１９
の（ａ）の状態から下カーソルキーを押下すると図１９
の（ｂ）のようになり、この状態で上カーソルキーを押
下すると図１９（ｃ）のようになる。(6) Phonebook Search Operation Next, search of phonebook data will be described. FIG. 19 is a diagram showing an example of a screen display corresponding to an operation at the time of telephone directory search. Press the phonebook key (set key 1-3-10) in the standby state or after capturing the line, and press the up / down cursor movement key 1-3
By operating the buttons 11,1-3-12, the abbreviations registered in the telephone directory are sequentially displayed (for example, in order of alphabetical order) on the LCD 1-3-6 (from (a) to (c) in FIG. 19). )). FIG.
When the down cursor key is pressed from the state shown in FIG.
(B). When the up cursor key is pressed in this state, the state becomes as shown in FIG. 19 (c).

【０１０１】表示されている略称を選択し、右カーソル
キーを押下すると、図１９の（ｄ）のような詳細表示
（番号表示）となる。When the displayed abbreviation is selected and the right cursor key is pressed, a detailed display (number display) as shown in FIG. 19D is displayed.

【０１０２】この状態で、上下カーソルキーを操作する
と、詳細表示の直前の状態から上下カーソルキーを押下
したのと同じ動作を行なう。また、ストップキー1-3-18
を押下すると、スタンバイ状態へ、或いは回線捕捉中で
あれば「ＴＥＬ＝」の表示へ戻る。In this state, operating the up / down cursor keys performs the same operation as pressing the up / down cursor keys from the state immediately before the detailed display. Also, stop key 1-3-18
Pressing returns to the standby state, or to the display of "TEL =" if the line is being captured.

【０１０３】さて、図１９の（ａ）〜（ｃ）のいずれか
の状態において（本例では図１９の（ｃ）の状態とす
る）、セットキー1-3-10が押下されると、回線捕捉中で
あれば図１９の（ｅ）のように表示して、ダイヤル発呼
を行なう。また、回線捕捉中でなければセットキーの押
下は無視され、図１９の（ｃ）の表示状態を維持する。（７）音声認識による電話帳検索次に、音声認識による電話帳検索について説明する。音
声認識キー1-3-22を押して、宛先の略称を発声し、それ
を音声認識することで、共通の電話帳を使った宛先ダイ
ヤルが実行されることになる。或いは、音声認識キー1-
3-22を押して個人用ＩＤ（図５ではミユキやテツヤなど）を発
声すると、これを音声認識することによって個人用ＩＤ
テーブル5-2から個人用電話帳テーブルが指定される。
個人用電話帳テーブルが選択された場合は、更に宛先の
略称を続けて発声し、それを音声認識することで、当該
個人用の電話帳を使った宛先ダイヤルが実行されること
になる。When the set key 1-3-10 is depressed in any of the states (a) to (c) of FIG. 19 (in this example, the state of FIG. 19 (c)), If the line is being captured, a display is made as shown in FIG. Unless the line is being captured, the pressing of the set key is ignored, and the display state of FIG. 19C is maintained. (7) Phonebook Search by Voice Recognition Next, a phonebook search by voice recognition will be described. By pressing the voice recognition key 1-3-22, the abbreviation of the destination is uttered, and the voice is recognized, whereby the destination dialing using the common telephone directory is executed. Or voice recognition key 1-
Press 3-22 to utter a personal ID (such as Miyuki or Tetsuya in Fig. 5).
A personal telephone book table is specified from Table 5-2.
When the personal telephone directory table is selected, the destination dial using the personal telephone directory is executed by further uttering the abbreviation of the destination and recognizing it.

【０１０４】図２０及び図２１は、音声認識による電話
帳検索の操作手順を説明する図である。また、図２２
は、音声認識による電話帳検索の処理手順を説明する図
である。FIGS. 20 and 21 are diagrams for explaining the operation procedure of telephone directory search by voice recognition. FIG.
FIG. 4 is a diagram for explaining a processing procedure of a telephone directory search by voice recognition.

【０１０５】本機が待機状態で音声認識キー1-3-22が押
されると、オペレートタスク1-2-2は、音声認識による
電話帳検索処理を開始する。まず、ステップＳ６１にお
いて、図２０の（ａ）に示される初期画面の表示を行な
うと共に、カウンタＮの値を０にセットするという初期
化処理を行なう。また、この初期化処理において、オペ
レートタスク1-2-2は音声登録テーブル5-4の「除外フラ
グ」を全てＯＦＦにセットする。When the voice recognition key 1-3-22 is pressed while the apparatus is in a standby state, the operation task 1-2-2 starts a telephone directory search process by voice recognition. First, in step S61, an initial screen shown in FIG. 20A is displayed, and an initialization process of setting the value of the counter N to 0 is performed. In this initialization process, the operating task 1-2-2 sets all "exclusion flags" of the voice registration table 5-4 to OFF.

【０１０６】ステップＳ６２において、ユーザがハンド
セット1-3-1をオフフックしたことを検知すると、ステ
ップＳ６３に進み、図２０の（ｂ）に示すようなガイダ
ンス表示を行なうと共に、ガイドトーン（ピー音）を出
力する。その後、図２０の（ｃ）のようにユーザに発声
開始可能状態を報知する画面の表示を行ない、ユーザが
音声入力するのを待つ。If it is detected in step S62 that the user has taken off-hook of the handset 1-3-1, the flow advances to step S63 to provide a guidance display as shown in FIG. 20B and a guide tone (beep sound). Is output. Thereafter, as shown in FIG. 20 (c), a screen for notifying the user of the state in which the utterance can be started is displayed, and the user waits for a voice input.

【０１０７】ステップＳ６４では、ユーザによる音声の
入力を検知すると、オペレートタスク1-2-2は、図２０
の（ｄ）に示すような、音声の入力を検知し、メモリに
音声データを格納中であること、すなわち音声の装置へ
の取り込み中であること、をユーザに報知する画面の表
示を行なうと共に、音声認識スタートコマンド（ここで
は、図６のコマンド３か４が発行される）を音声認識タ
スク1-2-8に対して発行する。そして、ステップＳ６５
にて、音声認識タスク1-2-8より音声認識結果コマンド
（図７）が入力されるのを待つ。In step S64, when the input of the voice by the user is detected, the operation task 1-2-2 executes the processing shown in FIG.
As shown in (d), a screen for notifying the user that voice data is being stored in the memory, that is, voice data is being taken into the device, is displayed while the input of voice is detected. Issue a speech recognition start command (here, command 3 or 4 in FIG. 6 is issued) to the speech recognition task 1-2-8. Then, step S65
Waits for the input of the voice recognition result command (FIG. 7) from the voice recognition task 1-2-8.

【０１０８】音声認識結果コマンドの入力に応じて、処
理はステップＳ６６へ進み、入力した認識結果コマンド
に基づいて当該認識結果が正常に終了したかどうかを判
定する。もしも、結果コマンドが「ＮＧ」を表すコマン
ドであれば、ステップＳ７０へ進む。ステップＳ７０で
は、ＳＲＡＭ1-1-24上の「理由コード表示する」のスイ
ッチ項目がＯＮされているかどうかを判定し、ＯＮされ
ていればステップＳ７１で理由コードに従ってパネルに
ＮＧ理由を表示し、待機状態へ移行する。In response to the input of the voice recognition result command, the process proceeds to step S66, and determines whether or not the recognition result has been normally completed based on the input recognition result command. If the result command is a command indicating "NG", the process proceeds to step S70. In step S70, it is determined whether or not the switch item of "display reason code" on the SRAM 1-1-24 is turned on. If the switch item is turned on, an NG reason is displayed on the panel according to the reason code in step S71, and the process waits. Move to state.

【０１０９】一方、認識結果がＯＫならば、ステップＳ
６７において、認識結果コマンドに含まれている登録番
号を基に電話帳データ4-2内の各種テーブルを検索し
て、その登録番号を有する該当項目を見つける。ここ
で、該当項目が個人用ＩＤテーブルに登録されたもので
あった場合は、ステップＳ６８からステップＳ６９へ進
み、その略称とともに個人用電話帳が指定された旨をパ
ネルに表示する。そして「個人用電話帳テーブル」の宛
先検索のために再び音声認識処理を行なうべく、ステッ
プＳ６３へ処理を戻す。なお、このとき（ステップＳ６
９において）、個人用電話帳テーブルによる宛先検索へ
の処理の移行を許可する旨をユーザが指示するようにし
てもよい。ここで、ユーザが個人用電話帳テーブルによ
る宛先検索への移行を拒否した場合は、再度電話帳デー
タテーブル5-1と個人用ＩＤテーブル5-2を対象とした宛
先検索が行なわれる。On the other hand, if the recognition result is OK, step S
At 67, various tables in the telephone directory data 4-2 are searched based on the registration number included in the recognition result command, and a corresponding item having the registration number is found. If the corresponding item is registered in the personal ID table, the process proceeds from step S68 to step S69, and the abbreviation is displayed on the panel indicating that the personal telephone directory has been designated. Then, the process returns to step S63 to perform the voice recognition process again for the address search of the “personal telephone directory table”. At this time (step S6)
9), the user may instruct that the transfer of the process to the address search using the personal telephone directory table is permitted. Here, if the user refuses to shift to the destination search using the personal telephone directory table, the destination search is performed again on the telephone directory data table 5-1 and the personal ID table 5-2.

【０１１０】また、該当項目が共通の「電話帳データテ
ーブル」に登録されたものであった場合、或いは個人用
電話帳テーブルが選択された後にその選択された「個人
用電話帳テーブル」に登録されたものであった場合に
は、ステップＳ６８からステップＳ７３へ進み、認識結
果をユーザに示すため、図２０の（ｅ）で示す如く、Ｌ
ＣＤ1-3-6にその項目の「宛先略称」表示を行う。When the corresponding item is registered in the common “phone book data table”, or after the personal phone book table is selected, it is registered in the selected “personal phone book table”. If the recognition has been performed, the process proceeds from step S68 to step S73, and in order to show the recognition result to the user, as shown in FIG.
The "address abbreviation" of the item is displayed on CD1-3-6.

【０１１１】次に、ステップＳ７４において、当該ダイ
ヤル発呼がファクシミリ送信であるどうかを判定する。
ファクシミリ送信であるか否かの判定は、原稿入り口に
原稿がセット済みである場合、或いはファクシミリ送信
のための画質情報がセット済みである場合等により、フ
ァクシミリ送信であると判断する。ファクシミリ送信で
なければ、ステップＳ７５へ進み、「セットキー待ち」
が指定されているかどうかを判定する。ＳＲＡＭ1-1-24
内のスイッチ「音声ダイヤルのセットキー待ち」の項目
が「セットキー待ち」に設定されている場合は、ステッ
プＳ７９でセットキー1-3-10の入力を待ち、セットキー
1-3-10を押すことでダイヤル実行に移行する。ＳＲＡＭ
1-1-24内のスイッチ「音声ダイヤルのセットキー待ち」
の項目が「自動」に設定されている場合は、ステップＳ
７５からステップＳ７６へ進み、セットキー入力を待た
ずにダイヤル発呼の実行に移行する。なお、ファクシミ
リ送信の操作を実施している場合はスイッチに関係なく
セットキー1-3-10の押下が必要である（誤認識による送
信の可能性を避けるため）。そのため、ファクシミリ送
信であれば、ステップＳ７５をスキップし、ステップＳ
７９へ進む。Next, in step S74, it is determined whether or not the dial call is facsimile transmission.
The facsimile transmission is determined to be facsimile transmission when a document has been set at the document entrance or when image quality information for facsimile transmission has been set. If it is not facsimile transmission, the process proceeds to step S75, and “wait for set key”
Determines if is specified. SRAM1-1-24
If the item "Wait for set key of voice dial" in the switch is set to "Wait for set key", wait for input of the set key 1-3-10 in step S79.
Press 1-3-10 to shift to dialing. SRAM
1-1-24 switch "Wait for voice dial set key"
If the item is set to “automatic”, step S
From 75, the process proceeds to step S76, and the process shifts to the execution of dialing without waiting for the set key input. When a facsimile transmission operation is performed, it is necessary to press the set key 1-3-10 regardless of the switch (to avoid the possibility of transmission due to erroneous recognition). Therefore, in the case of facsimile transmission, step S75 is skipped and step S75 is performed.
Go to 79.

【０１１２】なお、自動的にダイヤル実行へ移行する場
合は、宛先表示をユーザが確認する時間が必要なため、
タイマーＴ時間だけ遅延させてダイヤルを実行をスター
トする（ステップＳ７６、Ｓ７８）。ＴはあらかじめＳ
ＲＡＭに設定された所定値であるが、ユーザの操作で変
更することも出来る。また、この状態でストップキーを
押下すると、ステップＳ７７から待機状態へ移行するの
で、処理を中断させることができる。In the case of automatically shifting to dialing, it takes time for the user to confirm the destination display.
Dial execution is started with a delay of the timer T (steps S76, S78). T is S in advance
This is a predetermined value set in the RAM, but can be changed by a user operation. If the stop key is pressed in this state, the process shifts from step S77 to the standby state, so that the processing can be interrupted.

【０１１３】さて、上記ダイヤル実行のためにセットキ
ー1-3-10の押下を待っている状態で、ユーザがパネルに
表示された略称を見て、宛先誤認識を発見した場合はこ
こで音声認識キー1-3-22を押すことにより、そのとき選
択されている宛先を取り消し、新たな宛先の入力の開始
を指示することができる（ステップＳ７９、ステップＳ
８０）。音声認識キー1-3-22が押下された場合は、ステ
ップＳ８１へ進み、カウンタＮの値を１つ増加する。そ
して、ステップＳ８２において、音声データの登録数と
比較し、Ｎが音声データの登録数以上であった場合は、
該当する認識結果はないとして、ステップＳ７２でエラ
ー表示を行なう。When the user looks at the abbreviation displayed on the panel while waiting for the set key 1-3-10 to be pressed to execute the dial, and finds that the address is incorrectly recognized, the voice is input here. By pressing the recognition key 1-3-22, the destination selected at that time can be canceled and an instruction to start inputting a new destination can be given (step S79, step S79).
80). If the voice recognition key 1-3-22 has been pressed, the flow advances to step S81 to increase the value of the counter N by one. Then, in step S82, compared with the number of registered audio data, if N is equal to or greater than the number of registered audio data,
Assuming that there is no corresponding recognition result, an error is displayed in step S72.

【０１１４】一方、Ｎが音声データの登録数より小さけ
れば、オペレートタスク1-2-2は、表示中の宛先を除外
して再び音声認識するように音声認識タスクに依頼す
る。すなわち、ステップＳ８３において、そのとき選択
されている宛先を除外するべく、当該音声データの登録
番号を限定認識の音声認識スタートコマンド（コマンド
５か６）に付加して音声認識タスク1-2-8に発行する
（ステップＳ８４）。そしてステップＳ６５へ戻る。音
声認識タスクは、入力したコマンドに付加されていた登
録番号に対応する除外対象の音声登録データの「除外フ
ラグ」をＯＮにして再び音声認識処理を実行する。On the other hand, if N is smaller than the number of registered voice data, the operating task 1-2-2 requests the voice recognition task to perform voice recognition again excluding the displayed destination. That is, in step S83, in order to exclude the destination selected at that time, the registration number of the voice data is added to the voice recognition start command (command 5 or 6) of the limited recognition to perform the voice recognition task 1-2-8. (Step S84). Then, the process returns to step S65. The voice recognition task turns on the “exclusion flag” of the voice registration data to be excluded corresponding to the registration number added to the input command, and executes the voice recognition process again.

【０１１５】再度音声認識した結果、認識された略称表
示が再び誤認識であったならば、セットキー1-3-10を押
さずに再度音声認識キー1-3-22を押せば良い。こうする
ことにより、前回の認識宛先に加えて今回の認識宛先が
除外されたまま新たな音声認識が実行されることにな
る。すなわち、音声登録テーブル5-4において、該当す
る除外フラグがＯＮとなり、対応する音声データは次回
の音声認識に用いられない。As a result of the voice recognition again, if the recognized abbreviation is incorrectly recognized again, the voice recognition key 1-3-22 may be pressed again without pressing the set key 1-3-10. In this way, new speech recognition is performed with the current recognition destination excluded in addition to the previous recognition destination. That is, in the voice registration table 5-4, the corresponding exclusion flag is turned ON, and the corresponding voice data is not used for the next voice recognition.

【０１１６】なお、図２１に示される操作手順は、ハン
ドセットをオフフックしてから音声認識キーを押して電
話をかける（ファクシミリ送信を行なう）場合を示して
いる。まず、ハンドセットをオフフックすることで、図
２１の（ａ）のごとき電話番号の入力をオペレータに促
す画面の表示が行なわれる。この状態では、通常のダイ
ヤル通話が可能である。ここで、音声認識キーを押す
と、図２１の（ｂ）に示す表示がなされ、以降、図２０
の（ｂ）〜（ｅ）と同じである。また、その処理内容
も、図２２のフローチャートで示したものと同様とな
る。The operation procedure shown in FIG. 21 shows a case where a telephone call is made by pressing the voice recognition key after off-hook of the handset (performing facsimile transmission). First, by off-hooking the handset, a screen for prompting the operator to input a telephone number as shown in FIG. 21A is displayed. In this state, a normal dial call is possible. Here, when the voice recognition key is pressed, the display shown in FIG. 21B is made.
(B) to (e). The processing contents are the same as those shown in the flowchart of FIG.

【０１１７】なお、図２２に示した例では、再試行の回
数は音声データの登録数と一致するが、これに限らな
い。例えば、ステップＳ８２で比較する数を３回等の所
定数としても良い。また、ステップＳ７７でストップキ
ーを検出した場合に、ダイヤルの実行を行なわずに本処
理を終了するが、このタイミングで音声認識キーを押す
ことで音声認識の再試行が行なえるようにしても良い
（すなわち、ステップＳ７７からステップＳ８１へ処理
を移す）。In the example shown in FIG. 22, the number of retries coincides with the number of registered voice data, but is not limited to this. For example, the number to be compared in step S82 may be a predetermined number such as three. When the stop key is detected in step S77, the process ends without executing the dialing. At this timing, the voice recognition key may be pressed again to perform the voice recognition again. (That is, the process moves from step S77 to step S81.)

【０１１８】また、上記の手順では、除外フラグのリセ
ットを音声認識の開始時（ステップＳ６１）に行なうが
これに限らない。例えば、エラーによってステップＳ７
２から待機状態へ移行したとき、ストップキーによりス
テップＳ７７から待機状態へ移行したとき、或いはダイ
ヤルが実行されたときに除外フラグをリセットするよう
にしても良い。In the above procedure, the exclusion flag is reset at the start of the speech recognition (step S61), but is not limited to this. For example, if an error occurs in step S7
Alternatively, the exclusion flag may be reset when shifting from 2 to the standby state, when shifting from step S77 to the standby state by the stop key, or when dialing is performed.

【０１１９】以上説明したように、上記実施形態によれ
ば、音声認識実行時において誤認識（音声を登録した環
境と、認識のために音声を入力した環境が違う等によ
る）が発生した場合、オペレータが装置を停止させて待
機状態に戻し、再び操作をやり直すという必要がなくな
り、誤認識発生時に即時に再試行をさせることが出来
る。そして、この再試行時には、前回までの試行で誤認
識した音声データの候補が除外されるので操作性、信頼
性を大幅に向上させることが可能になる。As described above, according to the above embodiment, when erroneous recognition (due to a difference between the environment in which the voice is registered and the environment in which the voice is input for recognition) occurs during voice recognition, It is not necessary for the operator to stop the apparatus, return to the standby state, and perform the operation again, and it is possible to immediately retry when an erroneous recognition occurs. Then, at the time of this retry, the operability and reliability can be greatly improved because voice data candidates that were erroneously recognized in the previous trials are excluded.

【０１２０】また、この音声認識から除外された音声デ
ータは、次の新たな操作において自動的に復帰されるの
で、オペレータは音声認識の再試行時に音声データの消
去や再登録の必要が無くなり、操作性、信頼性を大幅に
向上させることが可能になる。Further, the voice data excluded from the voice recognition is automatically restored in the next new operation, so that the operator does not need to delete or re-register the voice data when retrying the voice recognition. Operability and reliability can be greatly improved.

【０１２１】すなわち、音声認識実行時に誤認識が発生
した場合に、音声データの削除や再登録を行うことなく
簡単に再試行が可能になり、さらには再試行時の認識確
率を大幅に向上させることが可能になる。ここで、音声
認識の再試行では、再度音声入力から行なうことにな
る。例えば、発声したときに限って他人の声や自動車の
音などが入力されてしまうと、その音声で認識した候補
群（第１位候補や下位候補を含む）は全く信頼性の低い
ものとなる。従って、明らかに誤認識した宛先を除いて
再度音声入力させることは効率的でもあり、信頼性にと
って重要である。That is, when erroneous recognition occurs during speech recognition, retry can be easily performed without deleting or reregistering speech data, and the recognition probability at the time of retry is greatly improved. It becomes possible. Here, the retry of the voice recognition is performed again from the voice input. For example, if the voice of another person or the sound of a car is input only when uttered, the candidate group (including the first candidate and the lower candidate) recognized by the voice becomes completely unreliable. . Therefore, it is also efficient and important for reliability to re-input the voice except for the destination that is clearly misrecognized.

【０１２２】また、上記実施形態によれば、共通電話帳
に加えて、個人別の電話帳データを登録可能とし、これ
に音声データを対応付けることができる。すなわち、個
人別電話帳毎に、特定話者を前提とした音声認識アルゴ
リズムによる音声認識ダイヤルを実現できる。このた
め、複数の使用者が使用したとしても、認識精度の高い
認識ダイヤル機能を提供することが出来、利便性・信頼
性が大幅に向上する。Further, according to the above embodiment, in addition to the common telephone directory, individual telephone directory data can be registered, and voice data can be associated therewith. That is, it is possible to realize a voice recognition dial using a voice recognition algorithm for a specific speaker for each personal telephone directory. For this reason, even if it is used by a plurality of users, a recognition dial function with high recognition accuracy can be provided, and convenience and reliability are greatly improved.

【０１２３】また、上記実施形態によれば、音声認識の
ための音声データを登録するに際して複数回の発声デー
タを登録して、認識精度の向上を図っている。また、こ
のような音声登録処理において、音声データを登録する
ための必要試行回数を表示するようにしたので、必要回
数分の音声入力を行なわずにオペレータが処理を中断し
てしまうことを防ぎ、現在何回目の登録を行なっている
かをいちいち記憶することからユーザを解放し、操作に
著しいストレスを感じたりすることなく、登録操作が行
なえる。Further, according to the above embodiment, when registering voice data for voice recognition, utterance data is registered a plurality of times to improve recognition accuracy. Also, in such a voice registration process, the number of trials required for registering voice data is displayed, so that the operator does not interrupt the process without inputting the required number of voices, The user is freed from remembering how many times the registration is currently performed, and the registration operation can be performed without feeling remarkable stress on the operation.

【０１２４】また、上記実施形態によれば、音声認識実
行時に認識不能（音声を登録した環境と、認識のために
音声を入力した環境が違う等による）であった場合、オ
ペレータに理由情報を通知することが可能である。この
ため、再試行時の認識精度が改善され、さらに操作性、
信頼性を大幅に向上させることが可能になる。例えば、
登録時に前記音声変換手段が音声を変換する際に特徴抽
出に失敗した場合、前記音声変換手段から音声登録に失
敗した理由情報が出力されてオペレータに通知される。
このため、例えば声が大きすぎる場合は小さい声で、声
が小さ過ぎる場合は大きい声で、騒音がひどい場合は静
かな場所で登録をし直せば、音声データが正しく登録さ
れるようになり、認識精度も大幅に向上する。Further, according to the above-described embodiment, if the recognition is not possible (due to the difference between the environment in which the voice is registered and the environment in which the voice is input for recognition) when executing the voice recognition, the operator is notified of the reason information. It is possible to notify. For this reason, recognition accuracy at the time of retry is improved, operability,
The reliability can be greatly improved. For example,
If the voice conversion unit fails to extract the feature when converting the voice at the time of registration, the voice conversion unit outputs information on the reason for the voice registration failure and notifies the operator.
For this reason, for example, if the voice is too loud, the voice is too loud, if the voice is too loud, if it is loud, if it is re-registered in a quiet place, the voice data will be registered correctly, The recognition accuracy is also greatly improved.

【０１２５】なお、上記実施形態では、ファクシミリ装
置に本発明を適用し、登録される宛先として電話番号を
例に挙げて説明した。しかしながら、本発明はこれに限
られるものではなく、例えば、インターネットアドレス
を宛先として音声に対応付けることも出来る。このよう
にすれば、宛先略称を音声入力するだけで、例えばイン
ターネットのＵＲＬを獲得でき、煩雑キー入力を行なわ
ずにすむので、操作性が著しく向上する。すなわち、本
発明はファクシミリ装置（電話機）や、ＬＡＮに接続さ
れた、或いはモデムを介して回線に接続されたコンピュ
ータ等、通信網における相手先のアドレスを指定して通
信を行なう装置（これらを通信装置と称する）にも適用
可能であることは明らかである。In the above embodiment, the present invention is applied to a facsimile apparatus, and a telephone number is described as an example of a destination to be registered. However, the present invention is not limited to this. For example, an Internet address can be associated with a voice as a destination. In this way, for example, the URL of the Internet can be obtained only by inputting the destination abbreviation by voice, and complicated key input is not required, so that the operability is significantly improved. That is, the present invention relates to a device that performs communication by designating a destination address in a communication network, such as a facsimile device (telephone) or a computer connected to a LAN or connected to a line via a modem (these are used for communication) It is clear that the present invention is also applicable.

【０１２６】なお、本発明は、複数の機器（例えばホス
トコンピュータ，インタフェイス機器，リーダ，プリン
タなど）から構成されるシステムに適用しても、一つの
機器からなる装置（例えば、複写機，ファクシミリ装置
など）に適用してもよい。The present invention can be applied to a system including a plurality of devices (for example, a host computer, an interface device, a reader, a printer, etc.), and can be applied to a single device (for example, a copier, a facsimile). Device).

【０１２７】また、本発明の目的は、前述した実施形態
の機能を実現するソフトウェアのプログラムコードを記
録した記憶媒体を、システムあるいは装置に供給し、そ
のシステムあるいは装置のコンピュータ（またはＣＰＵ
やＭＰＵ）が記憶媒体に格納されたプログラムコードを
読出し実行することによっても、達成されることは言う
までもない。Further, an object of the present invention is to provide a storage medium storing a program code of software for realizing the functions of the above-described embodiments to a system or an apparatus, and to provide a computer (or CPU) of the system or apparatus.
And MPU) read and execute the program code stored in the storage medium.

【０１２８】この場合、記憶媒体から読出されたプログ
ラムコード自体が前述した実施形態の機能を実現するこ
とになり、そのプログラムコードを記憶した記憶媒体は
本発明を構成することになる。In this case, the program code itself read from the storage medium realizes the functions of the above-described embodiment, and the storage medium storing the program code constitutes the present invention.

【０１２９】プログラムコードを供給するための記憶媒
体としては、例えば、フロッピディスク，ハードディス
ク，光ディスク，光磁気ディスク，ＣＤ−ＲＯＭ，ＣＤ
−Ｒ，磁気テープ，不揮発性のメモリカード，ＲＯＭな
どを用いることができる。As a storage medium for supplying the program code, for example, a floppy disk, hard disk, optical disk, magneto-optical disk, CD-ROM, CD
-R, a magnetic tape, a nonvolatile memory card, a ROM, or the like can be used.

【０１３０】また、コンピュータが読出したプログラム
コードを実行することにより、前述した実施形態の機能
が実現されるだけでなく、そのプログラムコードの指示
に基づき、コンピュータ上で稼働しているＯＳ（オペレ
ーティングシステム）などが実際の処理の一部または全
部を行い、その処理によって前述した実施形態の機能が
実現される場合も含まれることは言うまでもない。When the computer executes the readout program code, not only the functions of the above-described embodiment are realized, but also the OS (Operating System) running on the computer based on the instruction of the program code. ) May perform some or all of the actual processing, and the processing may realize the functions of the above-described embodiments.

【０１３１】さらに、記憶媒体から読出されたプログラ
ムコードが、コンピュータに挿入された機能拡張ボード
やコンピュータに接続された機能拡張ユニットに備わる
メモリに書込まれた後、そのプログラムコードの指示に
基づき、その機能拡張ボードや機能拡張ユニットに備わ
るＣＰＵなどが実際の処理の一部または全部を行い、そ
の処理によって前述した実施形態の機能が実現される場
合も含まれることは言うまでもない。Further, after the program code read from the storage medium is written into a memory provided in a function expansion board inserted into the computer or a function expansion unit connected to the computer, based on the instruction of the program code, It goes without saying that the CPU included in the function expansion board or the function expansion unit performs part or all of the actual processing, and the processing realizes the functions of the above-described embodiments.

【０１３２】[0132]

【発明の効果】以上説明したように、本発明によれば、
音声登録時に適切なガイダンスを行なうことにより、ユ
ーザに余計な負担をかけずに、確実に所定回数の音声デ
ータの登録が行なえる。As described above, according to the present invention,
By providing appropriate guidance at the time of voice registration, it is possible to reliably register voice data a predetermined number of times without imposing an extra burden on the user.

【０１３３】[0133]

[Brief description of the drawings]

【図１】実施形態によるファクシミリ装置のシステムブ
ロック図である。FIG. 1 is a system block diagram of a facsimile apparatus according to an embodiment.

【図２】本実施形態のファクシミリ装置におけるタスク
構成を示すブロック図である。FIG. 2 is a block diagram showing a task configuration in the facsimile apparatus of the embodiment.

【図３】本実施形態のファクシミリ装置の概観を示し、
特に操作パネル（図１の1-1-22）の構成を示す図であ
る。FIG. 3 shows an overview of the facsimile apparatus of the present embodiment,
FIG. 2 is a diagram particularly showing a configuration of an operation panel (1-1-22 in FIG. 1).

【図４】本実施形態による電話帳機能、音声登録及び音
声認識に関わる制御の概要を説明する図である。FIG. 4 is a diagram illustrating an outline of control relating to a telephone directory function, voice registration, and voice recognition according to the present embodiment.

【図５】本実施形態による電話帳データ及び音声登録メ
モリのデータ構成例を説明する図である。FIG. 5 is a diagram illustrating a data configuration example of telephone directory data and a voice registration memory according to the present embodiment.

【図６】オペレートタスクから音声認識タスクへ発行さ
れる音声認識スタートコマンドの種類と内容を説明する
図である。FIG. 6 is a diagram illustrating types and contents of a speech recognition start command issued from the operation task to the speech recognition task.

【図７】音声認識タスクからオペレートタスクへ発行さ
れる音声認識結果コマンドの種類と内容を説明する図で
ある。FIG. 7 is a diagram illustrating types and contents of a speech recognition result command issued from the speech recognition task to the operating task.

【図８】本実施形態による理由コードの一例を示す図で
ある。FIG. 8 is a diagram showing an example of a reason code according to the embodiment.

【図９】本実施形態による個人別電話帳の作成処理操作
における表示内容を説明する図である。FIG. 9 is a diagram for explaining display contents in an operation for creating an individual telephone directory according to the embodiment;

【図１０】本実施形態による個人別電話帳の登録処理手
順を表すフローチャートである。FIG. 10 is a flowchart showing a procedure for registering an individual telephone directory according to the embodiment.

【図１１】電話帳へのデータ登録における画面表示例を
示す図である。FIG. 11 is a diagram illustrating an example of a screen display in registering data in a telephone directory.

【図１２】電話帳への音声データ登録における画面表示
例を示す図である。FIG. 12 is a diagram showing an example of a screen display in registering voice data in a telephone directory.

【図１３】電話帳への音声データ登録における画面表示
例を示す図である。FIG. 13 is a diagram showing an example of a screen display in registering voice data in a telephone directory.

【図１４】電話帳へのデータ登録処理を説明するフロー
チャートである。FIG. 14 is a flowchart illustrating a process of registering data in a telephone directory.

【図１５】電話帳への音声データ登録処理を説明するフ
ローチャートである。FIG. 15 is a flowchart illustrating a process of registering voice data in a telephone directory.

【図１６】本実施形態における電話帳データの登録内容
の修正操作を説明する図である。FIG. 16 is a diagram illustrating an operation of correcting the registered contents of telephone directory data according to the present embodiment.

【図１７】本実施形態における電話帳データの登録内容
の削除操作を説明する図である。FIG. 17 is a diagram illustrating an operation of deleting registered contents of telephone directory data in the embodiment.

【図１８】音声データの削除手順を説明する図である。FIG. 18 is a diagram illustrating a procedure for deleting audio data.

【図１９】電話帳検索時の操作に応じた画面表示例を示
す図である。FIG. 19 is a diagram showing an example of a screen display according to an operation at the time of a telephone directory search.

【図２０】音声認識による電話帳検索時の画面表示例を
示す図である。FIG. 20 is a diagram showing a screen display example at the time of telephone directory search by voice recognition.

【図２１】音声認識による電話帳検索時の画面表示例を
示す図である。FIG. 21 is a diagram showing an example of a screen display at the time of telephone directory search by voice recognition.

【図２２】音声認識による電話帳検索の処理手順を説明
するフローチャートである。FIG. 22 is a flowchart illustrating a telephone book search processing procedure by voice recognition.

フロントページの続き (72)発明者中尾宗樹東京都大田区下丸子３丁目30番２号キヤノン株式会社内 (72)発明者外山猛東京都大田区下丸子３丁目30番２号キヤノン株式会社内 (72)発明者松崎進東京都大田区下丸子３丁目30番２号キヤノン株式会社内 (72)発明者上野康秀東京都大田区下丸子３丁目30番２号キヤノン株式会社内Ｆターム(参考） 5C075 BA08 CD02 CD07 CD13 5D015 GG00 5K027 BB02 FF22 HH20 HH21 HH23 5K036 AA15 BB01 DD17 DD32 FF06 JJ02 JJ10 JJ13 KK09 KK13Continued on the front page (72) Inventor Muneki Nakao 3- 30-2 Shimomaruko, Ota-ku, Tokyo Inside Canon Inc. (72) Inventor Takeshi Toyama 3-30-2 Shimomaruko, Ota-ku, Tokyo Inside Canon Inc. (72) Inventor Susumu Matsuzaki 3-30-2 Shimomaruko, Ota-ku, Tokyo Canon Inc. (72) Inventor Yasuhide Ueno 3-30-2 Shimomaruko 3-chome, Ota-ku, Tokyo F-term (reference) 5C075 BA08 CD02 CD07 CD13 5D015 GG00 5K027 BB02 FF22 HH20 HH21 HH23 5K036 AA15 BB01 DD17 DD32 FF06 JJ02 JJ10 JJ13 KK09 KK13

Claims

[Claims]

1. A voice data registration unit for registering voice data corresponding to a plurality of input voices, and a voice recognition unit for performing voice recognition by comparing voice data stored in the voice data storage unit with an input voice. And when the voice data is registered by the voice data registration means,
A speech recognition apparatus, comprising: a notification unit configured to notify the number of registration trials of voice data.

2. The speech recognition apparatus according to claim 1, wherein the notifying unit displays the remaining number of trials each time a trial of registration of voice data is completed.

3. A voice data registration step of registering voice data corresponding to a plurality of input voices in a voice data memory, and performing voice recognition by comparing voice data stored in the voice data memory with the input voice. A method for controlling a voice recognition apparatus, comprising: a voice recognition step; and a notification step of notifying the number of registration attempts of voice data at the time of voice data registration in the voice data registration step.

4. The control method for a voice recognition device according to claim 3, wherein the notification step displays the number of remaining trials each time a trial of registration of voice data is completed.

5. A communication device for communicating with a remote device by connecting to a line, comprising: destination registration means for registering a destination name and a destination address on the line as a pair in a destination memory; Voice data registration means for registering voice data corresponding to input voice in a voice data memory; voice recognition means for performing voice recognition by comparing voice data registered in the voice data memory with input voice; A search unit that specifies a destination name based on the recognition result by the recognition unit and searches the destination memory to obtain a corresponding destination address, and when registering voice data by the voice data registration unit,
A communication device, comprising: a notification unit configured to notify the number of registration attempts of voice data.

6. The communication apparatus according to claim 5, wherein said notifying means displays the remaining number of trials each time the trial of registration of voice data is completed.

7. A method for controlling a communication device for communicating with a remote device by connecting to a line, comprising: a destination registration step of registering a destination name and a destination address on the line in a destination memory; A voice data registration step of registering voice data corresponding to the input voice of the destination name in a voice data memory; and a voice recognition step of performing voice recognition by comparing the voice data registered in the voice data memory with an input voice. A search step of specifying a destination name based on the recognition result of the voice recognition step, searching the destination memory to obtain a corresponding destination address, and registering the voice data in the voice data registration step. A notification step of notifying the number of registration attempts.

8. The method according to claim 7, wherein the notifying step displays the number of remaining trials each time a trial of registration of voice data is completed.

9. A storage medium for storing a control program for causing a computer to perform voice recognition, wherein the control program stores voice data corresponding to a plurality of input voices in a voice data memory. And a code of a voice recognition step for performing voice recognition by comparing voice data stored in the voice data memory with an input voice; and registering voice data at the time of voice data registration in the voice data registration step. A notification process code for notifying the number of trials.

10. A storage medium for storing a control program for causing a computer to control a communication device connected to a line and communicating with a remote device, the control program comprising: a destination name and a destination on the line. A code of a destination registration step of registering an address with a destination memory in a destination memory; a code of an audio data registration step of registering audio data corresponding to an input voice of a destination name in an audio data memory; A code of a voice recognition step for performing voice recognition by comparing the input voice with the input voice data, and a destination name is specified based on the recognition result of the voice recognition step, and the destination memory is searched by searching the destination memory. Notifying the code of a search step for obtaining an address and the number of trials of registration of voice data when registering voice data in the voice data registration step Storage medium characterized by comprising a code of knowledge process.