JP3357629B2 - Equipment control system

Info

Publication number: JP3357629B2
Application number: JP11874199A
Authority: JP
Inventors: 秀之山岸; 誠庄境
Original assignee: Asahi Kasei Corp
Current assignee: Asahi Kasei Corp
Priority date: 1999-04-26
Filing date: 1999-04-26
Publication date: 2002-12-16
Anticipated expiration: 2019-04-26
Also published as: JP2000310999A

Description

DETAILED DESCRIPTION OF THE INVENTION

[0001]

BACKGROUND OF THE INVENTION

1. Field of the Invention

The present invention relates to an equipment control system suitable for equipment, in particular residential equipment, and more specifically to an equipment control system in which control instructions are given by voice.

[0002]

2. Description of the Related Art

An equipment control system is known in which a voice recognition device is incorporated into the control system of a house so that a resident can give control instructions by voice, for example turning lights on and off or opening and closing a garage (Japanese Unexamined Patent Application Publication No. H10-276483).

[0003]

[0004]

Problem to Be Solved by the Invention

In a conventional control system of this type, microphones are installed in a plurality of rooms, and the voice input from each microphone is recognized by a single voice recognition device. Consequently, when two speakers in different rooms speak at overlapping times, or when the voice of one person reaches a plurality of microphones at different distances, a plurality of voice signals are input to the voice recognition device as if they were the voice of a single person. The voice recognition device cannot distinguish the plurality of input voice signals and therefore misrecognizes them. The prior art thus has the problem that the control instructed by the speaker cannot be executed.

[0005]

[0006] An object of the present invention is to provide an equipment control system that does not cause misrecognition when voice of the same content is input from different voice input means.

[0007]

Means for Solving the Problem

The present invention is an equipment control system that recognizes an input voice signal by voice recognition means and causes a controlled device to execute an operation corresponding to the voice recognition result, the system comprising: a plurality of voice input means that receive voice from different places and output voice signals; a plurality of voice recognition means that recognize the voice signals output from the plurality of voice input means; information processing means that, when a plurality of voices occur overlapping within a predetermined time, sorts the voice recognition results of the plurality of voice recognition means for those voices and, through the sorting, processes and outputs identical voice recognition results as a single voice recognition result; and control means that cause the controlled device to execute an operation corresponding to the voice recognition result output from the information processing means.

[0008]

[0009]

[0010]

[0011]

[0012]

[0013]

[0014]

[0015]

[0016]

[0017]

[0018]

[0019]

[0020]

[0021]

[0022]

[0023]

[0024]

[0025]

[0026]

Embodiments of the Invention

Embodiments of the present invention will be described below in detail with reference to the drawings.

[0027] (First Embodiment) FIG. 1 shows the system configuration of a first embodiment of the present invention. In FIG. 1, reference numeral 1 denotes a microphone for inputting voice. Reference numeral 10 denotes a control unit, which recognizes the control instruction given by voice and, according to the recognized instruction, turns the air conditioner 31 on or off or turns the power switch of the lighting equipment 32 on or off.

[0028] A microphone 1 is installed in each room of the house and is connected by a signal line to a single control unit 10 installed somewhere in the house. When, for example, a speaker instructs that the air conditioner be turned on or off, the control unit 10 sends a control signal instructing on or off to the remote controller 21 on the basis of the voice recognition result, and the remote controller 21 turns the air conditioner 31 on or off. When the speaker instructs that the lighting equipment be turned on or off, the control unit 10 sends a control signal (in digital form) instructing on or off to the signal conversion circuit 22. The signal conversion circuit 22 converts the received control signal into an analog signal, forwards it to the lighting equipment 32, and thereby turns the power switch of the lighting equipment 32 on or off.

[0029] The internal configuration of the control unit 10 will now be described. The control unit 10 has analog-to-digital conversion circuits (A/D) 11, an arbitration circuit 13, a voice recognition processor 14, and an interface 15. Each A/D 11 converts the analog voice signal input from a microphone 1 into a digital voice signal. As many A/Ds 11 are provided as there are microphones 1 installed. Of the voice signals input from the plurality of A/Ds 11, the arbitration circuit 13 accepts only the one received earliest (that is, it detects the microphone 1 that received voice first) and connects the corresponding signal line 12 to the voice recognition processor 14. As one example, an arbitration circuit that detects the earliest input voice signal by comparing the rising edges of the voice signal pulses can be used.
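The arbitration step can be illustrated with a small software sketch. This is not the circuit from the patent, which is hardware and described only as well known. The sketch assumes each channel is a list of samples and treats the first sample whose magnitude reaches a hypothetical `threshold` as the rising edge; the names `first_onset` and `arbitrate` are made up for this illustration.

```python
from typing import Optional, Sequence


def first_onset(samples: Sequence[float], threshold: float) -> Optional[int]:
    """Return the index of the first sample whose magnitude reaches the threshold."""
    for i, s in enumerate(samples):
        if abs(s) >= threshold:
            return i
    return None


def arbitrate(channels: Sequence[Sequence[float]], threshold: float = 0.5) -> Optional[int]:
    """Pick the channel whose signal rises first; None if every channel stays silent.

    Ties go to the lowest channel index, an arbitrary choice made for this sketch.
    """
    best_channel: Optional[int] = None
    best_onset: Optional[int] = None
    for ch, samples in enumerate(channels):
        onset = first_onset(samples, threshold)
        if onset is not None and (best_onset is None or onset < best_onset):
            best_channel, best_onset = ch, onset
    return best_channel
```

Given three channels where the second one rises first, `arbitrate` returns index 1, and only that channel would be connected to the recognizer.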

[0030] As a result, only the voice of the speaker who spoke first is recognized by the voice recognition processor 14. The voice signal of a voice uttered later by another speaker reaches only as far as the arbitration circuit 13 and is therefore not recognized by the voice recognition processor 14.

[0031] Therefore, unless the utterances start at exactly the same time, only the earliest uttered voice is recognized by the voice recognition processor 14.

[0032] Since the arbitration circuit 13 is a well-known circuit, a description of its internal configuration is omitted. As the voice recognition processor 14, the devices of Japanese Patent Application Nos. H9-51577 and H9-56018, proposed by the present applicant, can be used. These voice recognition devices can additionally be given the unspecified-noise removal function disclosed in PCT Japanese application 00915/1998.

[0033] The voice recognition device described in Japanese Patent Application No. H9-56018 stores in memory the characteristics of distortion of the speaker's voice or of acoustic distortion and, at recognition time, uses the relevant stored distortion characteristics to correct the distortion of the input voice signal with a signal processing circuit.

[0034] The voice processing device described in Japanese Patent Application No. H9-51577 holds, in a FIFO memory, the impulse response supplied to an adaptive filter that cancels noise. While voice input is detected, it uses for noise removal the impulse response from a fixed time earlier that is held in the FIFO memory, and input of impulse responses to the FIFO memory is inhibited during this period. While no voice is being input, it uses the current impulse response for noise removal.
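A minimal sketch of this hold logic follows, assuming impulse responses arrive one per frame and that a boolean voice-activity flag is available. The class name `ImpulseResponseHold` and the parameter `delay_frames` are hypothetical, and the referenced application may organize its FIFO differently.

```python
from collections import deque
from typing import Any, Deque


class ImpulseResponseHold:
    """Chooses which impulse response the noise-cancelling filter should use."""

    def __init__(self, delay_frames: int) -> None:
        # Bounded FIFO: once full, appending drops the oldest entry.
        self.fifo: Deque[Any] = deque(maxlen=delay_frames)

    def select(self, current: Any, voice_detected: bool) -> Any:
        if voice_detected:
            # FIFO input is inhibited while voice is present, so the oldest
            # held response stays fixed. Fall back to the current response
            # if nothing has been stored yet (an assumption of this sketch).
            return self.fifo[0] if self.fifo else current
        # No voice: store the current response and use it directly.
        self.fifo.append(current)
        return current
```

Freezing the FIFO during speech keeps the filter from adapting to the speaker's own voice, which is the apparent purpose of inhibiting FIFO input in the description above.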

[0035] The noise removal function described in PCT Japanese application 00915/1998 is realized by unspecified-noise removal means that estimates, at a fixed period, the spectrum of noise whose source signal cannot be identified (noise of unknown source) and subtracts it from the spectrum of the voice signal, thereby removing the noise of unknown source.
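The subtraction itself can be sketched as follows, assuming each frame is already a list of magnitude values, one per frequency bin. The exponential averaging used to update the noise estimate and the clamping of negative results to zero are assumptions made for this illustration; the source states only that the noise spectrum is estimated at a fixed period and subtracted from the voice spectrum.

```python
from typing import List, Sequence


def update_noise_estimate(
    estimate: Sequence[float], frame: Sequence[float], alpha: float = 0.9
) -> List[float]:
    """Blend a new noise-only frame into the running noise spectrum estimate."""
    return [alpha * e + (1.0 - alpha) * f for e, f in zip(estimate, frame)]


def spectral_subtract(frame: Sequence[float], noise: Sequence[float]) -> List[float]:
    """Subtract the estimated noise magnitude per bin, never going below zero."""
    return [max(f - n, 0.0) for f, n in zip(frame, noise)]
```

In use, `update_noise_estimate` would be called periodically on frames judged to contain no speech, and `spectral_subtract` on every frame passed to the recognizer.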

[0036] The noise removal and distortion correction functions described above were developed for voice recognition inside automobile cabins. Through experiments, however, the present inventors found that, in a room of a house, a voice recognition device having these functions delivers practically usable recognition performance even when the speaker is at least 1 m away from the microphone. By selectively providing the voice recognition processor 14 with such voice signal distortion correction and noise removal functions, the voice of specific speakers, such as the residents of a house, can be recognized accurately. Furthermore, providing the voice recognition processor 14 with the distortion correction function and/or the noise removal function improves recognition accuracy for voice spoken from 1 m or more away from the microphone. The three functions described above may be combined as desired according to product cost and the environment in which the equipment is used.

[0037] When such a voice recognition processor 14 is used, the microphone 1 can, in this embodiment, also be installed on the ceiling as shown in FIG. 2.

[0038] FIG. 2 is an explanatory diagram showing a simplified floor plan of a house. In FIG. 2, the control unit 10 is installed on the back side of the ceiling 41 (the space above the ceiling). The microphones 1 can be installed on a side wall 42 of a room or on the ceiling 41. With the performance of conventional voice recognition devices, recognition accuracy drops sharply once the distance between the microphone and the person exceeds about 1 m. When the voice recognition processor 14 having the distortion correction function and/or noise removal function described above is used, however, recognition accuracy sufficient for practical use is maintained even when the person is 1 m or more from the voice recognition device. The microphone 1 can therefore be installed on the ceiling 41, which increases the freedom in choosing where to install the microphone 1.

[0039] (Second Embodiment) In the first embodiment, the arbitration circuit 13 selects one of the voice signals input from the plurality of microphones 1, and the voice recognition processor 14 recognizes that voice signal. Next, an equipment control system having a plurality of voice recognition devices, each pairing a microphone with a control unit, will be described.

[0040] FIG. 3 shows the system configuration of the second embodiment. In FIG. 3, reference numeral 100 denotes a voice recognition device. The voice recognition device 100 has a microphone 101, an A/D 102, a voice recognition processor 103, a control interface 104, and a communication interface 106. The microphone 101, the A/D 102, and the voice recognition processor 103 use the same circuits as in the first embodiment, except that the voice recognition processor 103 has the communication function described later.

[0041] The control interface 104 is the same as the interface 15 of the first embodiment. The communication interface 106 is, for example, a communication interface for a LAN (local area network) such as Ethernet. A control signal corresponding to the voice recognition result of the voice recognition processor 103 is sent via the control interface 104 to the controlled device 105 (such as the air conditioner of the first embodiment).

[0042] A plurality of such voice recognition devices 100 are connected to one another via a signal line 110.

[0043] This embodiment is characterized in that the voice recognition processor 103 has a function of requesting the other voice recognition devices 100 to stop voice recognition processing and/or voice signal input processing when voice is input to the microphone 101, and a function of stopping its own voice recognition processing and/or voice signal input processing in response to a stop request from another voice recognition device 100.

[0044] FIG. 4 shows an installation example of the voice recognition device 100. The voice recognition device 100 is housed in a housing 201 and installed on a ceiling 202. The microphone 101 is attached to the outside of the housing 201 so that it can pick up sound.

[0045] In this example, the control interface 104 instructs control operations to the controlled device 105 installed in the room by infrared or radio communication. The microphone 101, the voice recognition processor 103, and the interfaces 104 and 106 may be integrated as appropriate for the installation location. In a bathroom, for example, it is preferable to install only the microphone 101 in the bathroom.

[0046] The processing by which, through communication with the other voice recognition devices, only the device control of the voice recognition device that received voice first is made effective will be described with reference to FIG. 5.

[0047] FIG. 5 shows the content of the voice recognition and communication program stored in the voice recognition processor 103. This program is written in a programming language executable by a CPU and is stored in a ROM or the like within the voice recognition processor 103.

[0048] The CPU in the voice recognition processor 103 (hereinafter simply the CPU) keeps executing the processing procedure of FIG. 5 for as long as power is supplied.

[0049] The voice input stop flag used in the following description is explained first. The voice input stop flag is on/off information stored in a RAM in the voice recognition processor 103. When the voice input stop flag is on, the device's own voice input processing and voice recognition processing are stopped. When the voice input stop flag is off, the device's own voice input processing and voice recognition processing are enabled (stop-released state).

[0050] In FIG. 5, the CPU determines whether a voice input stop signal has been received from another voice recognition device by checking what the communication interface 106 has received (step S50). If a voice input stop signal has been received at this point, the CPU sets the voice input stop flag to on (its own voice input and voice recognition processing are prohibited).

[0051] If no voice input stop signal has been received, the CPU determines whether a voice input stop release signal has been sent from another voice recognition device (step S100).

[0052] This determination can be made by examining what the communication interface 106 has received. Next, the CPU determines whether the flag is on or off (step S100).

[0053] If another voice input device has sent a voice input stop release signal, the CPU sets the voice input stop flag to off, placing voice input and voice recognition processing in the stop-released state (step S105).

[0054] If, on the other hand, no voice input stop release signal has been received, the CPU advances from step S100 to step S110. In this step the CPU determines whether the voice input stop flag is on. If the voice input stop flag is on, the procedure returns to step S50. Consequently, the loop of steps S50 to S110 is repeated until a voice input stop release signal is received from another voice recognition device.

[0055] As a result, the CPU does not proceed to the voice input and voice recognition processing from step S120 onward, described later; even if voice is input from its own microphone 101, the voice recognition processor 103 does not accept the voice signal input from the A/D 102.

[0056] When neither this device nor any other voice recognition device is performing voice recognition, the voice input stop flag is off.

[0057] The procedure therefore proceeds through steps S100, S110, and S120. If the speaker is not speaking at this point, that is, if there is silence, the determination in step S120 of whether there is voice input (whether there is an input signal from the A/D 102) is negative. The loop of steps S100 to S120 → S130 → S100 is therefore repeated until a voice signal is input.

[0058] When the speaker utters an operation instruction for a controlled device, the voice input from the microphone 101 is input to the voice recognition processor 103 in the form of a voice signal.

[0059] This input is detected in step S120, and the CPU proceeds from step S120 to step S121. In step S121, a voice input stop signal is sent to the other voice recognition devices via the communication interface 106 and the signal line 110, and voice recognition processing is then performed. The other voice recognition devices turn their voice input stop flags on in response, so their own voice input and voice recognition processing stops.

[0060] In step S121, as described above, the processing disclosed in Japanese Patent Application No. H9-56018, namely distortion correction, and the noise removal processing disclosed in PCT Japanese application 00915/1998 and Japanese Patent Application No. H10-257583 are performed, after which voice recognition processing is performed. A well-known method may be used for the voice recognition processing itself, so a detailed description is omitted.

[0061] The voice recognition result is obtained, for example, in the form of a character code string, so it is converted into a multi-bit control signal on the basis of a table, stored in advance in a ROM in the voice recognition processor, that maps character code strings to control signals. The converted control signal is sent via the control interface 104 to the controlled device 105 in the room (step S122). In this way the voice recognition processor 103 can have the controlled device 105 execute the operation instructed by the speaker.
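The table lookup in step S122 can be pictured with a short sketch. The phrases and bit patterns below are invented for illustration, since the source does not list the table's contents, and normalizing the text by trimming and lowercasing is likewise an assumption.

```python
from typing import Optional

# Hypothetical correspondence table: recognized text -> multi-bit control signal.
CONTROL_TABLE = {
    "air conditioner on": 0b0001,
    "air conditioner off": 0b0010,
    "light on": 0b0101,
    "light off": 0b0110,
}


def to_control_signal(recognized: str) -> Optional[int]:
    """Convert a recognition result to its control signal, or None if unknown."""
    return CONTROL_TABLE.get(recognized.strip().lower())
```

Returning `None` for an unknown phrase lets the caller skip sending anything to the controlled device rather than emit an arbitrary signal.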

[0062] The CPU then advances to step S123 and sends a voice input stop release signal to the other voice recognition processors via the communication interface 106 and the signal line 110.

[0063] On receiving this voice input stop release signal, the other voice recognition processors switch their own voice input stop flags to off, becoming ready for voice input and voice recognition.

[0064] When a plurality of voice recognition devices execute the above processing while all of them are ready for voice input and voice recognition (voice input stop flag off), the voice recognition device that receives voice first issues a voice input stop signal, and the other voice recognition devices that receive this signal stop their own voice input and voice recognition processing by setting their voice input stop flags to on. As a result, only the voice recognition device that received the speaker's voice first can control the controlled device (105).
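The stop and release exchange between devices can be simulated in a few lines. This is a simplified model, not the program of FIG. 5: message delivery is assumed to be instantaneous and lossless, the message names `"STOP"` and `"RELEASE"` and the class `Recognizer` are hypothetical, and recognition itself is reduced to passing a command string through.

```python
from typing import List, Optional


class Recognizer:
    """Toy model of one voice recognition device on the shared signal line."""

    def __init__(self, name: str) -> None:
        self.name = name
        self.stop_flag = False              # voice input stop flag (off = ready)
        self.peers: List["Recognizer"] = []
        self.pending: Optional[str] = None  # command being recognized
        self.executed: List[str] = []       # commands sent to the controlled device

    def receive(self, message: str) -> None:
        if message == "STOP":               # cf. step S50: stop signal received
            self.stop_flag = True
        elif message == "RELEASE":          # cf. steps S100/S105: release received
            self.stop_flag = False

    def broadcast(self, message: str) -> None:
        for peer in self.peers:
            peer.receive(message)

    def begin_voice(self, command: str) -> bool:
        """Voice reaches this device's microphone; return True if it is accepted."""
        if self.stop_flag:                  # cf. step S110: input is ignored
            return False
        self.broadcast("STOP")              # cf. step S121
        self.pending = command
        return True

    def finish(self) -> None:
        """Complete recognition, control the device, then release the peers."""
        if self.pending is None:
            return
        self.executed.append(self.pending)  # cf. step S122
        self.pending = None
        self.broadcast("RELEASE")           # cf. step S123


def connect(devices: List[Recognizer]) -> None:
    """Wire every device to all the others, like the shared signal line."""
    for device in devices:
        device.peers = [d for d in devices if d is not device]
```

If two connected devices hear the same utterance and `begin_voice` is called on one slightly before the other, the second call returns `False`, so only the first device records the command when `finish` is called.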

[0065] When the voice recognition device that received voice first finishes its control processing, it issues a voice input stop release signal, and the other voice recognition devices that receive this signal set their voice input stop flags to off, re-enabling their own voice input and voice recognition processing.

[0066] (Third Embodiment) A third embodiment will now be described in which a dedicated control processor receives the voice recognition results of a plurality of voice recognition processors and coordinates equipment control. FIG. 6 shows the system configuration of the third embodiment. Circuits that are the same as in the second embodiment are given the same reference numerals, and a detailed description of them is omitted.

[0067] In FIG. 6, reference numeral 200 denotes an equipment control device, which has a control processor 201, a communication interface 202, and a control interface 203. The control interface forwards operation instructions from the control processor 201 to a plurality of controlled devices.

[0068] The communication interface 202 is connected to the signal line 110 and receives voice recognition results from the plurality of voice recognition devices 100.

[0069] The control processor 201 converts the voice recognition results it receives into operation instruction signals for control. It also compares the sets of voice recognition results received within a fixed time with one another and, when it finds results whose content matches, merges the identical results into a single voice recognition result.

[0070] The operation of this system will be described using the flowchart of FIG. 7. FIG. 7 shows the content of the processing program executed by the control processor 201. This processing program is stored in advance in a memory in the control processor.

[0071] When voice is input from the microphone 101, the voice recognition processor 103 performs voice recognition in the conventional manner and sends the voice recognition result to the equipment control device 200 via the communication interface 106.

[0072] The control processor 201 normally waits for transmissions from the voice recognition devices 100 in the loop of steps S200 to S210. On detecting in step S200 that data has been sent from a voice recognition device 100, the control processor 201 temporarily stores the received data (the voice recognition result) in internal memory (steps S200 → S205).

[0073] In step S210, the control processor monitors whether an internal timer that measures a fixed time has expired; if it has not, the procedure returns from step S210 to step S200. If the timer period is, for example, 10 seconds, the voice recognition results sent from the plurality of voice recognition devices 100 during those 10 seconds are collected in internal memory. On detecting in step S210 that the internal timer has expired, the control processor 201 takes arbitrary pairs from the one or more voice recognition results stored in internal memory and compares them for a match. In this embodiment, a well-known information processing technique called sorting is used: while the voice recognition results are being reordered, results with identical character strings are merged into one (steps S220 → S230). Even without a sorting program, the results can be merged by deleting one of two voice recognition results from internal memory whenever the two are judged to match.
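Merging identical results by sorting can be sketched as below. The function name `merge_identical` is hypothetical, and the sketch sorts first and then drops adjacent duplicates, whereas the description merges during the reordering itself; the outcome is the same set of distinct results.

```python
from typing import List


def merge_identical(results: List[str]) -> List[str]:
    """Sort recognition results and keep one copy of each identical string."""
    merged: List[str] = []
    for text in sorted(results):
        # After sorting, identical strings are adjacent, so comparing with
        # the last kept entry is enough to detect a duplicate.
        if not merged or merged[-1] != text:
            merged.append(text)
    return merged
```

Because sorting places equal strings next to each other, each result needs only one comparison, instead of being compared against every other stored result.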

[0074] After identical results have been merged, the remaining voice recognition results are converted from character strings into operation instructions for the controlled devices in the conventional manner and sent to the controlled devices 204 via the control interface 203.

[0075] The voice recognition results in internal memory are then erased, and the internal counter is restarted (step S240).

【００７６】以上の手順を繰り返すと一定時間間隔（こ
の形態では１０秒）間隔で音声認識結果が設備制御装置
１００で収集され、収集された音声認識結果の中の同一
のものが統合される。したがって、１０秒以内に複数の
人間から発生された同一内容の音声や複数のマイクロホ
ン１０１から入力され複数の音声認識用プロセッサ１０
２で音声認識される単一話者の音声についても、制御用
プロセッサ２０１側では複数回、制御対象機器に動作指
示を行うことはない。When the above procedure is repeated, speech recognition results are collected by the equipment control device 100 at fixed intervals (10 seconds in this embodiment), and identical results among those collected are merged. Therefore, even for speech of the same content uttered by several people within 10 seconds, or for the speech of a single speaker that enters through several microphones 101 and is recognized by several speech recognition processors 102, the control processor 201 does not issue the operation instruction to the control target device more than once.
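The effect of the repeated collection cycle can be sketched by grouping timestamped recognition results into consecutive windows and merging duplicates within each window. This is a simplified model under stated assumptions: the windows are taken to be back-to-back periods starting at time 0, the event tuples and the function name are hypothetical, and real hardware would use the internal timer rather than recorded timestamps.

```python
WINDOW_SECONDS = 10.0  # the example period given in the text


def instructions_per_window(events, window=WINDOW_SECONDS):
    """Group (arrival_time_seconds, recognized_text) events into fixed windows
    and keep each distinct text once per window."""
    batches = {}
    for arrival_time, text in events:
        index = int(arrival_time // window)       # which window the event falls in
        batches.setdefault(index, set()).add(text)  # a set drops duplicates
    return [sorted(batches[index]) for index in sorted(batches)]


events = [
    (1.0, "light on"),     # speaker heard by microphone A
    (1.2, "light on"),     # same utterance heard by microphone B
    (4.0, "garage open"),  # a different command in the same window
    (12.0, "light on"),    # same command again, but in the next window
]
print(instructions_per_window(events))
# [['garage open', 'light on'], ['light on']]
```

Note that identical commands falling in different windows are each issued once, which follows from the fixed-interval collection described above.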

【００７７】上述の実施形態の他に次の形態を実施でき
る。In addition to the embodiments described above, the following variations can be implemented.

【００７８】１）第１の実施形態において、制御対象機
器と、制御部１０あるいは音声認識装置１００との間の
通信手段は無線（たとえば、赤外線、光を使用した無線
通信方法）、有線の周知の通信手段を使用することがで
きる。さらにマイクロホンと音声認識用プロセッサの間
のいずれかの経路部分を無線の通信手段としてもよいこ
と勿論である。1) In the first embodiment, any well-known wireless (for example, infrared or optical) or wired communication means can be used between the control target device and the control unit 10 or the speech recognition device 100. Naturally, any part of the path between a microphone and the speech recognition processor may also be a wireless link.

【００７９】２）第２の実施形態の音声認識装置１００
の間の通信についても、イーサネットのような通信方法
の他に、有線、無線の通信方法（手段）を使用すること
ができる。2) For communication among the speech recognition devices 100 of the second embodiment as well, wired or wireless communication methods (means) other than Ethernet can be used.

【００８０】３）上述の第１、第２の実施形態では、設
備の一例として、住宅設備を説明したが、他の設備、た
とえば、生産設備など、他の設備にも本実施形態を適用
できる。3) The first and second embodiments above describe residential equipment as an example, but the embodiments can also be applied to other equipment, such as production equipment.

【００８１】４）上述の実施形態において、光電池（光
電変換手段）を太陽光を受光可能な位置に設置し、音声
認識装置１００等や設備制御システム全体の電源とする
こともできる。4) In the above embodiments, a photovoltaic cell (photoelectric conversion means) may be installed where it can receive sunlight and used as the power supply for the speech recognition devices 100 and the like, or for the entire equipment control system.

【００８２】５）上述の第１の実施形態においては、入
力が受け付けられなかった音声の話者、第２の実施形態
においては音声認識が行われなかった音声の話者に音声
が受け付けられなかった旨を案内する複数の案内手段を
設置することもできる。この場合、案内手段としては合
成音声発生器や表示器を使用する。第１の実施形態で
は、入力を受け付けるマイクロホンを調停回路が検知す
るので、調停回路から、入力を受け付けなかったマイク
ロホンに対の案内手段に案内実行のための信号を送信す
る。案内手段では合成音声あるいは表示によりメッセー
ジを案内する。案内としては、その他、ランプの点灯、
ブザーによる報知も可能である。5) A plurality of guidance means may also be installed to inform a speaker that his or her voice was not accepted: in the first embodiment, the speaker whose voice input was not accepted; in the second embodiment, the speaker whose voice was not recognized. A synthesized speech generator or a display is used as the guidance means. In the first embodiment, the arbitration circuit detects which microphone's input is accepted, so it sends a guidance execution signal to the guidance means paired with each microphone whose input was not accepted. The guidance means presents the message by synthesized speech or on the display. Guidance may also be given by lighting a lamp or sounding a buzzer.

【００８３】第２の実施形態では音声認識用プロセッサ
が音声入力フラグをオンに設定した時点で、自己に対応
する案内手段に案内実行の信号を送信する。In the second embodiment, when a speech recognition processor sets the voice input flag to ON, it sends a guidance execution signal to its own corresponding guidance means.

【００８４】６）第２の実施形態では通信用インタフェ
ースを含む音声認識装置１００を１つの筐体の中に収納
したが、筐体の形状としては図４の形状に限らず、人
形、家具、生活用備品等各種の収納可能物体に音声認識
装置１００を収納することができる。6) In the second embodiment, the speech recognition device 100, including the communication interface, is housed in a single housing. The housing is not limited to the shape shown in FIG. 4; the speech recognition device 100 can be housed in any of various objects capable of containing it, such as dolls, furniture, and household items.

【００８５】７）音声を入力するマイクロホンは１部屋
に複数設置してもよいこと勿論である。7) Needless to say, a plurality of microphones for inputting voice may be provided in one room.

【００８６】８）上述の第２実施形態では、音声の入力
および認識処理を双方を停止させる例を説明したが、話
者が異なる部屋におり、離れているような場合は各、音
声認識プロセッサでは入力の音声信号を内部メモリに記
憶し、音声認識処理のみを停止することもできる。8) The second embodiment described above stops both voice input and recognition processing. However, when the speakers are in different rooms and far apart, each speech recognition processor may instead store the input voice signal in its internal memory and stop only the speech recognition processing.

【００８７】９）第２の実施形態では音声認識用プロセ
ッサの間で通信を行うことによりシステムに調停機能
（一番早く音声信号が入力された音声認識用プロセッサ
で音声認識を行うこと）を持たせたが、専用の調停回路
を設け、調停回路により一番早く音声信号を入力した音
声認識用プロセッサを動作可能状態（アクチブ）とする
ことができる。9) In the second embodiment, the system is given its arbitration function (speech recognition is performed by the speech recognition processor that received a voice signal first) through communication among the speech recognition processors. Alternatively, a dedicated arbitration circuit may be provided that places the speech recognition processor that received a voice signal first in the operable (active) state.

【００８８】１０）第１の実施形態の調停回路１３は、
音声が一番早くされた音声入力系統を選択する。選択さ
れた入力系統の解除は、音声認識の終了時としてもよ
い、選択した音声入力系統上の音声信号のレベルが閾値
以下（発生していた音声の停止）としてもよい。10) The arbitration circuit 13 of the first embodiment selects the voice input system on which a voice arrived first. The selected input system may be released when speech recognition ends, or when the level of the voice signal on the selected voice input system falls to or below a threshold (that is, when the voice stops).
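The first-come selection and threshold-based release described in item 10 can be modelled in software as a small state machine. The sketch below is illustrative only: the patent describes a hardware arbitration circuit, and the class name, the per-step level dictionary, and the tie-breaking rule (the first channel in dictionary order wins when several start in the same step) are assumptions.

```python
class FirstComeArbiter:
    """Select the first channel whose level exceeds the threshold and hold it
    until that channel's level falls to or below the threshold."""

    def __init__(self, threshold):
        self.threshold = threshold
        self.selected = None

    def step(self, levels):
        """levels maps channel name -> current signal level.
        Returns the currently selected channel, or None."""
        # Release: the voice on the selected channel has stopped.
        if self.selected is not None and levels.get(self.selected, 0.0) <= self.threshold:
            self.selected = None
        # Select: take the first channel that is currently above the threshold.
        if self.selected is None:
            for channel, level in levels.items():
                if level > self.threshold:
                    self.selected = channel
                    break
        return self.selected


arbiter = FirstComeArbiter(threshold=0.1)
print(arbiter.step({"mic_a": 0.0, "mic_b": 0.5}))   # mic_b
print(arbiter.step({"mic_a": 0.9, "mic_b": 0.5}))   # mic_b (still held)
print(arbiter.step({"mic_a": 0.9, "mic_b": 0.05}))  # mic_a (mic_b released)
```

Releasing at the end of speech recognition, the other option in the text, would replace the threshold test with an explicit release call from the recognizer.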

【００８９】１１）上述の実施形態１〜２では一番早く
入力した音声またはその音声認識結果を選択するように
しているが、複数の音声が重複的に発生した場合に、音
声の質が一番よい音声が入力された音声入力系統を選択
することもできる。音声の質を表すパラメータとして
は、音声信号平均レベル、すなわち、音声信号が発生し
てから一定時間内の音声信号のレベル平均を比較するパ
ラメータとして使用することができる。また、Ｓ／Ｎ
比、音声信号の振幅の最大値を比較のためのパラメータ
とすることもできる。11) Embodiments 1 and 2 above select the voice that was input first, or its speech recognition result. However, when a plurality of voices overlap, the voice input system that received the voice of the best quality may be selected instead. As a parameter representing voice quality, the average voice signal level, that is, the average level of the voice signal over a fixed time from its onset, can be used for comparison. The S/N ratio or the maximum amplitude of the voice signal can also be used as the comparison parameter.

【００９０】したがって、図１の調停回路１３は、一定
時間内の音声信号を保持する回路、上記パラメータの値
を当該保持された音声信号から取得する回路と、取得さ
れたパラメータの値を比較する回路および比較の結果に
応じて、音声入力系統を音声認識用プロセッサ１４に接
続させる信号線切替え回路で構成すればよい。これら個
々の回路自体は周知の回路を使用することができ、当業
者であれば、容易に調停回路を作成することができよ
う。Therefore, the arbitration circuit 13 of FIG. 1 may consist of a circuit that holds the voice signal for a fixed time, a circuit that obtains the value of the above parameter from the held voice signal, a circuit that compares the obtained parameter values, and a signal line switching circuit that connects a voice input system to the speech recognition processor 14 according to the comparison result. Well-known circuits can be used for each of these, and a person skilled in the art could readily build the arbitration circuit.
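The hold, measure, compare, and switch stages of this quality-based arbitration can be sketched in software as follows. This is an assumption-laden illustration of the idea rather than the circuit itself: the function names and sample values are hypothetical, each held buffer is assumed to be a non-empty list of signed samples, and the S/N ratio is omitted because it would additionally need a noise estimate that the text does not specify.

```python
def average_level(samples):
    """Average absolute level over the held interval."""
    return sum(abs(s) for s in samples) / len(samples)


def peak_amplitude(samples):
    """Maximum absolute amplitude over the held interval."""
    return max(abs(s) for s in samples)


def select_channel(held, parameter=average_level):
    """held maps channel name -> samples held for the fixed time.
    Returns the channel whose parameter value is largest."""
    return max(held, key=lambda channel: parameter(held[channel]))


held = {
    "mic_a": [0.0, 0.9, 0.0],  # one loud spike, otherwise silent
    "mic_b": [0.4, 0.4, 0.4],  # steady, moderate level
}
print(select_channel(held))                  # mic_b (higher average level)
print(select_channel(held, peak_amplitude))  # mic_a (higher peak)
```

The example shows that the choice of parameter can change which input is selected, so the parameter should be chosen to suit the installation.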

【００９１】１２）第３の実施形態に第２実施形態の調
停機能を持たしてもよいことは言うまでもない。12) It goes without saying that the third embodiment may have the arbitration function of the second embodiment.

【００９２】１３）第１〜第３の実施形態の音声認識用
プロセッサは、制御対象機器への１つの動作指示が１回
の発話で行われることを想定しているが、１回の発話の
中に複数組の動作指示を含ませてもよい。この場合、単
一の音声認識用プロセッサの中に、同一の複数の音声認
識結果を統合する機能（第３実施形態のソーティング機
能）を持たせるとよい。この機能を持たせることによ
り、発話者が繰り返し発生した動作指示を１つの音声認
識結果として統合することができる。13) The speech recognition processors of the first to third embodiments assume that one utterance carries one operation instruction for the control target device, but a single utterance may contain several operation instructions. In that case, each individual speech recognition processor should preferably have a function for merging identical speech recognition results (the sorting function of the third embodiment). With this function, operation instructions that a speaker utters repeatedly can be merged into a single speech recognition result.

【００９３】[0093]

【発明の効果】本発明によれば、所定時間内に複数の音
声が重複的に発生した場合に、複数の音声認識手段から
の音声認識結果がソーティング処理により並び替えられ
て、同一の音声認識結果が検出される。このため、同一
時刻に発生した異なる複数の音声は、そのまま使用さ
れ、また、同一時刻あるいは所定時間内の異なる時刻で
発生された同一内容の複数の音声に対する音声認識結果
は単一の音声認識結果として取扱われる。これにより、
２人の話者の音声が重複して発生された場合、あるいは
同一人物の音声が距離の異なる複数の音声入力手段に入
力された場合でも正しく音声認識を行って制御対象機器
を制御することができる。[Effect of the Invention] According to the present invention, when a plurality of voices overlap within a predetermined time, the speech recognition results from the plurality of speech recognition means are rearranged by sorting and identical speech recognition results are detected. Consequently, different voices uttered at the same time are each used as they are, while the speech recognition results for voices of the same content, uttered at the same time or at different times within the predetermined time, are treated as a single speech recognition result. As a result, even when the voices of two speakers overlap, or when the voice of the same person enters a plurality of voice input means at different distances, speech recognition is performed correctly and the control target device can be controlled.

[Brief description of the drawings]

【図１】本発明第１の実施形態のシステム構成を示すブ
ロック図である。FIG. 1 is a block diagram showing a system configuration according to a first embodiment of the present invention.

【図２】システムの配置例を示す説明図である。FIG. 2 is an explanatory diagram showing an example of a system arrangement.

【図３】本発明第２の実施形態のシステム構成を示すブ
ロック図である。FIG. 3 is a block diagram showing a system configuration according to a second embodiment of the present invention.

【図４】本発明第２の実施形態の模式的な外観を示す構
成図である。FIG. 4 is a configuration diagram showing a schematic appearance of a second embodiment of the present invention.

【図５】本発明第２の実施形態の処理手順を示すフロー
チャートである。FIG. 5 is a flowchart illustrating a processing procedure according to a second embodiment of the present invention.

【図６】本発明第３の実施形態のシステム構成を示すブ
ロック図である。FIG. 6 is a block diagram showing a system configuration according to a third embodiment of the present invention.

【図７】本発明第３の実施形態の処理手順を示すフロー
チャートである。FIG. 7 is a flowchart illustrating a processing procedure according to a third embodiment of the present invention.

[Explanation of symbols]

１、１０１ マイクロホン　１１、１０２ Ａ／Ｄ　１３ 調停回路　１４、１０３ 音声認識用プロセッサ　２０１ 音声認識（制御）用プロセッサ
1, 101: Microphone
11, 102: A/D
13: Arbitration circuit
14, 103: Speech recognition processor
201: Speech recognition (control) processor

Continuation of the front page

(51) Int.Cl.⁷, identification code, FI: G10L 3/00 571C 571H 571K; 3/02 301A 301D

(56) References: JP-A-4-318900 (JP, A); JP-A-5-83764 (JP, A); JP-A-5-289694 (JP, A); JP-A-10-276483 (JP, A); JP-A-8-328579 (JP, A); JP-A-59-23397 (JP, A); JP-A-10-257583 (JP, A); JP-A-10-254494 (JP, A); JP-A-2-179700 (JP, A); JP-A-8-314489 (JP, A); JP-A-8-186654 (JP, A); JP-A-7-231668 (JP, A); JP-A-7-162989 (JP, A)

(58) Fields searched (Int.Cl.⁷, DB name): G10L 15/00; G10L 15/20; G10L 15/28; G10L 21/02

Claims

(57) [Claims]

1. An equipment control system that recognizes an input voice signal by voice recognition means and causes a control target device to execute an operation corresponding to the voice recognition result, the system comprising: a plurality of voice input means that receive voices from different places and output voice signals; a plurality of voice recognition means that recognize the voice signals output from the plurality of voice input means; information processing means that performs a sorting process on the voice recognition results produced by the plurality of voice recognition means for the plurality of voices and, through the sorting process, outputs a plurality of identical voice recognition results as a single voice recognition result; and control means that causes the control target device to execute an operation corresponding to the voice recognition result output from the information processing means.