JP2002341893A - Speech recognition device - Google Patents

Speech recognition device

Info

Publication number
JP2002341893A
Authority
JP
Japan
Prior art keywords
voice
power
unit
talk switch
time constant
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
JP2001146205A
Other languages
Japanese (ja)
Inventor
Mikio Oda
幹夫 小田
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Panasonic Holdings Corp
Original Assignee
Matsushita Electric Industrial Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Matsushita Electric Industrial Co Ltd
Priority to JP2001146205A
Publication of JP2002341893A
Legal status: Pending


Abstract

PROBLEM TO BE SOLVED: To provide a voice transmitter for speech recognition that transmits speech without clipped word endings, with reduced power consumption and an extended operating time. SOLUTION: Power continues to be supplied to the voice transmission main processing unit while the talk switch is pressed and for several seconds after the talk switch is released, so that a voice signal with no clipped word endings is transmitted wirelessly. This reduces the power drawn from the battery, extends the operating time, and ensures that speech is transmitted reliably.

Description

DETAILED DESCRIPTION OF THE INVENTION

[0001]

Field of the Invention: The present invention relates to a speech recognition device that recognizes input speech from a wirelessly transmitted voice signal, and more particularly to a speech recognition device that reduces the power consumption of a battery-powered voice transmitter, extends the operating time of the battery, and transmits voice reliably.

[0002]

Description of the Related Art: Thanks to advances in digital signal processing and to faster, cheaper processing LSIs, speech recognition is increasingly used for the remote control of consumer equipment, which improves operability. Current remote control transmitters for TVs, VTRs, and the like are battery powered for portability and have a long battery life of about one year or more. In a remote control transmitter that uses speech recognition, however, the battery life becomes extremely short.

[0003] Speech recognition is normally performed by a microcomputer, a DSP, or a similar processor and requires a current of several hundred mA. Battery technology is also being actively developed, and high-capacity AA cells of, for example, 1500 mAh are available.

[0004] However, even with a battery of this kind, a current draw of several hundred mA allows only a few hours of operation. Using an AC power supply would solve this problem, but at the cost of portability, which makes it impractical.
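The "few hours" figure follows from simple arithmetic, sketched below. The 1500 mAh capacity is the value quoted in the text; the 300 mA draw is an assumed value standing in for "several hundred mA", and the function name `runtime_hours` is hypothetical. The model is idealized and ignores discharge-rate effects and cutoff voltage.

```python
def runtime_hours(capacity_mah: float, current_ma: float) -> float:
    """Ideal runtime in hours: capacity divided by a constant current draw."""
    if current_ma <= 0:
        raise ValueError("current_ma must be positive")
    return capacity_mah / current_ma

# 1500 mAh is the AA capacity quoted in the text; 300 mA is an assumed draw.
print(runtime_hours(1500, 300))  # 5.0
```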

[0005] As a speech recognition device that addresses this problem, the configuration shown in Japanese Unexamined Patent Publication No. H5-181493 has been proposed. An example of that speech recognition device is described below with reference to FIGS. 6 and 7.

[0006] FIG. 6 shows the first embodiment of that device. In FIG. 6, reference numeral 20 denotes a microphone that converts uttered speech into an electrical signal, 21 denotes a voice section detection unit that detects the start and end of speech from the microphone output signal, and 22 denotes a wireless transmission unit that wirelessly transmits, to the receiving system, the voice signal in the section between the start and end detected by the voice section detection unit.

[0007] The operation of the speech recognition device of the first example, configured as described above, is as follows. To control a device by speech recognition, the user first speaks a word that the recognizer can process toward the voice transmitter configured as in FIG. 6.

[0008] When speech enters the microphone 20, the voice section detection unit 21 first detects the start of speech and then, when the utterance ends, detects its end. Power is supplied to the wireless transmission unit 22 only during the section between the start and the end, and the wireless transmission unit 22 transmits the input speech on a carrier wave.

[0009] The transmitted speech is received and demodulated, the voice signal is passed to speech recognition, and the device is controlled according to the uttered speech, which completes the series of speech recognition steps.

[0010] FIG. 7 shows the second embodiment of that device. In FIG. 7, reference numeral 20 denotes a microphone that converts uttered speech into an electrical signal, 21 denotes a voice section detection unit that detects the start and end of speech from the microphone output signal, 22 denotes a wireless transmission unit that wirelessly transmits, to the receiving system, the voice signal in the section between the start and end detected by the voice section detection unit, 24 denotes an A/D converter that converts the analog voice signal into a digital signal, 23 denotes a ring memory that stores the voice signal digitized by the A/D converter 24, and 25 denotes a D/A converter that converts the stored digital voice signal back to analog.

[0011] The operation of the speech recognition device of the second example, configured as described above, is as follows.

[0012] To control a device by speech recognition, the user first speaks a word that the recognizer can process toward the voice transmitter configured as in FIG. 7. When speech enters the microphone 20, it is converted into a digital signal by the A/D converter 24 and written sequentially into the ring memory 23.

[0013] The voice section detection unit 21 detects the start of the input speech and then, when the utterance ends, detects its end. Only the section between the start and the end is read sequentially from the ring memory 23 and converted into an analog signal by the D/A converter 25, and the wireless transmission unit 22 transmits the input speech on a carrier wave. The transmitted speech is received and demodulated, the voice signal is passed to speech recognition, and the device is controlled according to the uttered speech, which completes the series of speech recognition steps.
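The behavior of a ring memory like the one in this second prior-art example can be illustrated with a short sketch. This is only an illustration under assumptions: the capacity of 8 samples, the integer samples, and the function name `record` are all hypothetical, and the source does not describe how the ring memory 23 is actually implemented.

```python
from collections import deque

def record(samples, capacity):
    """Write samples into a fixed-size ring memory; the oldest are overwritten."""
    ring = deque(maxlen=capacity)  # capacity is an assumed, illustrative value
    for s in samples:
        ring.append(s)
    return list(ring)

# Twelve samples written into an 8-slot ring: only the newest eight survive.
print(record(range(12), 8))  # [4, 5, 6, 7, 8, 9, 10, 11]
```

Because writing must run continuously for the buffer to hold the start of an utterance, the A/D converter feeding it has to stay powered, which is the drawback the text goes on to raise.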

[0014]

Problems to Be Solved by the Invention: In the conventional configuration, power is supplied to the wireless transmission unit 22 only during the voice section that requires speech recognition processing. This does reduce power consumption, but the configuration is impractical.

[0015] Specifically, in the first example, the voice section detection unit 21 finds the start and end of speech by detecting the input sound level. Everyday noise such as unrelated conversation or a ringing telephone therefore also causes power to be supplied to the wireless transmission unit 22, so the microphone sensitivity requires delicate adjustment and malfunctions become frequent.
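A level-based detector of the kind described can be sketched as follows, to show why it cannot tell speech from noise. The threshold, the sample values, and the function name `detect_section` are assumptions made for illustration; the source states only that the start and end are found from the input sound level.

```python
def detect_section(levels, threshold):
    """Return (start, end) of the first run of samples at or above threshold.

    `end` is exclusive. Returns None if no sample reaches the threshold.
    """
    start = None
    for i, v in enumerate(levels):
        if v >= threshold and start is None:
            start = i                      # level rose: start of section
        elif v < threshold and start is not None:
            return (start, i)              # level fell: end of section
    return (start, len(levels)) if start is not None else None

speech = [0, 0, 5, 7, 6, 1, 0]   # assumed levels for a spoken word
ringing = [0, 0, 6, 6, 6, 0, 0]  # assumed levels for a telephone ringing
print(detect_section(speech, 3))   # (2, 5)
print(detect_section(ringing, 3))  # (2, 5)
```

Both inputs produce a section, so in this sketch the transmitter would be powered for the ringing telephone just as it is for speech.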

[0016] In the second example the voice section detection unit 21 uses digital processing, but like the analog processing of the first example it suffers from the delicate microphone sensitivity setting and from malfunctions. In addition, the A/D converter that performs the digital conversion must be powered at all times, which increases power consumption and complicates the system configuration.

[0017] This approach is more convenient than the common conventional method of deliberately turning on a switch by hand before speaking, but it raises many practical problems, such as setting the level of the voice section detection unit and distinguishing speech from noise. In other words, switching the power supply on and off based on the sound level alone is problematic in practice.

[0018]

Means for Solving the Problems: To solve the above problems, the speech recognition device of the present invention continues to supply power to the voice transmission main processing unit while the talk switch is pressed and for several seconds after the talk switch is released, and wirelessly transmits a voice signal with no clipped word endings. It thereby reduces the power drawn from the battery, extends the operating time, and transmits voice reliably.

[0019] According to the present invention, the user presses the talk switch when speaking for speech recognition, so the voice signal is reliably transmitted wirelessly without being affected by ambient noise. Even if the talk switch is released at the very moment the utterance ends, power continues to be supplied to the voice transmission main processing unit for the time held in a preset time constant holding unit, so that a voice signal with no clipped word endings is transmitted wirelessly. This makes it possible to provide a speech recognition device that reduces the power drawn from the battery, extends the operating time, and transmits voice reliably.

[0020]

Embodiments of the Invention: The speech recognition device according to claim 1 of the present invention continues to supply power to the voice transmission main processing unit while the talk switch is pressed and for several seconds after the talk switch is released, and wirelessly transmits a voice signal with no clipped word endings. It can thereby reduce the power drawn from the battery, extend the operating time, and realize speech recognition with reliable voice transmission.

[0021] The speech recognition device according to claim 2 of the present invention has a voice transmission main processing unit consisting of a microphone that converts input speech into an electrical signal, an amplifier that amplifies the microphone output, and a modulation transmission unit that modulates and transmits the voice signal amplified by the amplifier. It further comprises a talk switch that signals that the user is about to speak, a time constant holding unit that holds a time constant for the time during which the talk switch is pressed, a power control unit that supplies power to the voice transmission main processing unit only for the time width set by the time constant holding unit, and a display unit that receives the output of the power control unit and indicates when power is being supplied. With this configuration, power continues to be supplied to the voice transmission main processing unit while the talk switch is pressed and for several seconds after the talk switch is released, a voice signal with no clipped word endings is transmitted wirelessly, the power drawn from the battery is reduced, the operating time is extended, and speech recognition with reliable voice transmission can be realized.

[0022] The speech recognition device according to claim 3 of the present invention has a voice transmission main processing unit consisting of a microphone that converts input speech into an electrical signal, an amplifier that amplifies the microphone output, a modulation transmission unit that modulates and transmits the voice signal amplified by the amplifier, and a level detector that receives the amplifier output and detects the voice input level. It further comprises a talk switch that signals that the user is about to speak, an OR circuit that ORs the talk switch and the level detector, a time constant holding unit that holds a time constant for the OR circuit output pulse, which is the OR of the time during which the talk switch is pressed and the voice section detected by the level detector, a power control unit that supplies power to the voice transmission main processing unit only for the time width set by the time constant holding unit, and a display unit that receives the output of the power control unit and indicates when power is being supplied. With this configuration, power continues to be supplied to the voice transmission main processing unit while the talk switch is pressed and, even after the talk switch is released, for the several seconds until the utterance ends. A voice signal with no clipped word endings is transmitted wirelessly, the power drawn from the battery is reduced, the operating time is extended, and speech recognition with reliable voice transmission can be realized.

[0023] Embodiments of the present invention are described below with reference to FIGS. 1 to 5.

[0024] (Embodiment 1) An embodiment of the invention described in claims 1 and 2 is described below with reference to FIGS. 1, 2, and 3.

[0025] FIG. 1 is a block diagram of the voice transmitter of a speech recognition device according to one embodiment of the present invention. In FIG. 1, reference numeral 7 denotes a microphone that converts input speech into an electrical signal, 8 denotes an amplifier that amplifies the microphone output, and 9 denotes a modulation transmission unit that modulates and transmits the voice signal amplified by the amplifier.

[0026] Reference numeral 1 denotes a voice transmission main processing unit consisting of the microphone 7, the amplifier 8, and the modulation transmission unit 9; 2 denotes a talk switch that signals that the user is about to speak; 3 denotes a time constant holding unit that holds a time constant for the time during which the talk switch 2 is pressed; 4 denotes a power control unit that supplies power to the voice transmission main processing unit 1 only for the time width set by the time constant holding unit; 6 denotes a display unit that indicates when power is being supplied by the power control unit 4; and 5 denotes a battery unit that supplies power to the entire voice transmitter.

[0027] The operation of the speech recognition device configured as described above is as follows. When a device such as a TV or VTR is controlled by speech recognition, keeping the voice transmitter of FIG. 1 near the user and transmitting only the voice signal consumes less power than building the entire speech recognizer into a single unit.

[0028] The user first holds down the talk switch 2 of the voice transmitter and speaks a word registered for speech recognition. When the talk switch 2 is pressed, the time constant holding unit 3 outputs an H pulse. While the output of the time constant holding unit 3 is H, the power control unit 4 supplies the battery power from the battery unit 5 to the voice transmission main processing unit 1, which consists of the microphone 7, the amplifier 8, and the modulation transmission unit 9.

[0029] Once powered, the voice transmission main processing unit 1 converts the speaker's voice into an electrical signal with the microphone 7, amplifies it with the amplifier 8, and transmits it wirelessly with the modulation transmission unit 9. The transmitted speech is demodulated by a receiver installed near or built into the TV, VTR, or other device, processed by speech recognition, and used to control the device according to the uttered speech, which completes the series of speech recognition steps.

[0030] When the utterance ends, the user releases the talk switch 2. Having confirmed the release, the time constant holding unit 3 switches its output to the power control unit 4 from H to L several seconds later, and the power control unit 4 stops supplying power from the battery unit 5 to the voice transmission main processing unit 1. The display unit 6 indicates when power is being supplied. The modulation transmission unit 9 typically uses FM modulation or the like, and the transmission medium may be infrared light, low-power radio waves, or the like.

[0031] No problem arises if the talk switch 2 is held down until the spoken word has completely ended, but in practice people often release the talk switch 2 at the same moment they finish speaking.

[0032] FIG. 2 is a time chart showing the operating waveforms of each part of FIG. 1. At time t0 the user presses the talk switch 2 in order to speak, utters the example word "イッチャンネル" ("Icchanneru"), and at time t1, taking the utterance to be complete, releases the talk switch 2. Without any countermeasure, the power from the power control unit 4 would be cut immediately and the end of the last spoken word would be clipped. Instead, the time constant holding unit 3 delays its transition to L by several seconds, and the power supplied from the power control unit 4 is extended for that time.
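The timing in FIG. 2 can be summarized as a single condition: the main processing unit is powered from the moment the switch is pressed until a hold time after it is released. The sketch below is a minimal model under assumptions; the function name `power_enabled`, the 2-second hold, and the sample times are hypothetical, since the source says only "several seconds".

```python
def power_enabled(t: float, t_press: float, t_release: float, hold_s: float) -> bool:
    """True while power is supplied: from press until hold_s after release."""
    return t_press <= t < t_release + hold_s

t0, t1 = 0.0, 1.0  # assumed press and release times in seconds
# Half a second after release, with no hold the word ending would be lost.
print(power_enabled(1.5, t0, t1, hold_s=0.0))  # False
# With an assumed 2 s hold, the transmitter is still powered.
print(power_enabled(1.5, t0, t1, hold_s=2.0))  # True
```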

[0033] FIG. 3 shows a concrete circuit example of the time constant holding unit 3 and the power control unit 4 of FIG. 1. When the talk switch 2 is pressed, the input is integrated with the RC time constant and the base voltage of transistor Q1 goes H, so Q1 conducts. The base voltage of transistor Q2 then goes L, so Q2 also conducts and the battery voltage appears at the collector of Q2.

[0034] When the talk switch 2 is released after the utterance, the RC time constant again governs the response: the base voltage of Q1 stays H until the charge stored in C has discharged naturally, and only then goes L. Q1 therefore turns off some time after the release, and Q2 likewise stops supplying power after that delay. Time constant holding and power control can thus be implemented with a simple circuit.
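As a rough guide to how R and C set the hold time, the sketch below uses a first-order discharge model, V(t) = V0 · exp(−t / RC), and solves for the time at which the voltage falls to the level where Q1 turns off. Everything numeric here is an assumption: the source gives no component values, the 3 V starting voltage and 0.6 V turn-off threshold are illustrative, and the model ignores the base current drawn by Q1 and any other discharge path in the actual FIG. 3 circuit.

```python
import math

def hold_time_s(r_ohm: float, c_farad: float, v0: float, v_off: float) -> float:
    """Time for an RC discharge to fall from v0 to v_off: RC * ln(v0 / v_off)."""
    if not (0 < v_off < v0):
        raise ValueError("require 0 < v_off < v0")
    return r_ohm * c_farad * math.log(v0 / v_off)

# Assumed values: 1 MΩ, 1 µF, capacitor charged to 3 V, Q1 off below 0.6 V.
print(round(hold_time_s(1e6, 1e-6, 3.0, 0.6), 2))  # 1.61
```

Under these assumed values the hold lasts about 1.6 s; a larger R or C lengthens it proportionally.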

[0035] In other words, there is no need to power the entire voice transmitter at all times. By continuing to supply power to the voice transmission main processing unit while the talk switch is pressed and for several seconds after it is released, a voice signal with no clipped word endings is transmitted wirelessly, the power drawn from the battery is reduced, the operating time is extended, and speech recognition with reliable voice transmission can be realized.

[0036] (Embodiment 2) An embodiment of the invention described in claim 3 is described next with reference to FIGS. 4 and 5.

[0037] FIG. 4 is a block diagram of the voice transmitter of a speech recognition device according to one embodiment of the present invention. In FIG. 4, reference numeral 7 denotes a microphone that converts input speech into an electrical signal, 8 denotes an amplifier that amplifies the microphone output, 9 denotes a modulation transmission unit that modulates and transmits the voice signal amplified by the amplifier, and 12 denotes a level detection unit that detects the level of the voice signal amplified by the amplifier and outputs an H pulse while a level is detected.

[0038] Reference numeral 13 denotes a voice transmission main processing unit consisting of the microphone 7, the amplifier 8, the modulation transmission unit 9, and the level detection unit 12; 2 denotes a talk switch that signals that the user is about to speak; 3 denotes a time constant holding unit that holds a time constant for the time during which the talk switch 2 is pressed; 4 denotes a power control unit that supplies power to the voice transmission main processing unit 13 only for the time width set by the time constant holding unit; 6 denotes a display unit that indicates when power is being supplied by the power control unit 4; 5 denotes a battery unit that supplies power to the entire voice transmitter; and 10 and 11 denote diodes that form an OR circuit taking the OR of the talk switch 2 and the level detection unit 12.

[0039] The operation of the speech recognition device configured as described above is as follows. The user first holds down the talk switch 2 of the voice transmitter and speaks a word registered for speech recognition.

[0040] When the talk switch 2 is pressed, the time constant holding unit 3 outputs an H pulse. While the output of the time constant holding unit 3 is H, the power control unit 4 supplies the battery power from the battery unit 5 to the voice transmission main processing unit 13, which consists of the microphone 7, the amplifier 8, the modulation transmission unit 9, and the level detection unit 12. Once powered, the voice transmission main processing unit 13 converts the speaker's voice into an electrical signal with the microphone 7, amplifies it with the amplifier 8, and transmits it wirelessly with the modulation transmission unit 9.

[0041] The transmitted speech is demodulated by a receiver installed near or built into the TV, VTR, or other device, processed by speech recognition, and used to control the device according to the uttered speech, which completes the series of speech recognition steps. When the utterance ends, the user releases the talk switch 2. Having confirmed the release, the time constant holding unit 3 switches its output to the power control unit 4 from H to L several seconds later, and the power control unit 4 stops supplying power from the battery unit 5 to the voice transmission main processing unit 13.

[0042] No problem arises if the talk switch 2 is held down until the spoken word has completely ended, but in practice people often release the talk switch 2 at the same moment they finish speaking. FIG. 5 is a time chart showing the operating waveforms of each part of FIG. 4. At time t0 the user presses the talk switch 2 in order to speak, utters the example word "イッチャンネル" ("Icchanneru"), and at time t1, taking the utterance to be complete, releases the talk switch 2.

[0043] If the utterance has not completely ended when the talk switch 2 is released, the level detection unit 12 continues to output the H pulse that indicates a detected voice signal. At time t2, when the utterance has completely ended, the output of the level detection unit 12 goes L.

[0044] On receiving this L level, the time constant holding unit 3 delays its own transition to L by several seconds, and the power supplied from the power control unit 4 is extended by that amount. Without any countermeasure, the power from the power control unit 4 would be cut immediately and the end of the last spoken word would be clipped. Here, power is cut only once the word ending has completely finished.
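The behavior of Embodiment 2 can be sketched as a sampled simulation: the power enable is the OR of the talk switch and the level detector output, stretched by a hold time. This is an illustrative model, not the circuit itself. The function name `power_signal`, the sample sequences, and the hold of two samples are hypothetical. The sketch also assumes, as one reading of FIG. 4, that the level detection unit 12 produces no output while unpowered, because it is part of the main processing unit 13 that the power control unit switches.

```python
def power_signal(switch, level, hold):
    """Per-sample power enable for unit 13.

    switch: talk switch state per sample (truthy while pressed)
    level:  sound present at the microphone per sample
    hold:   samples to keep power on after the OR output goes low
    """
    out = []
    remaining = 0     # hold samples left after the OR output last went low
    powered = False   # power state during the previous sample
    for s, l in zip(switch, level):
        detected = bool(l) and powered  # detector 12 works only when powered
        if s or detected:
            remaining = hold
            powered = True
        elif remaining > 0:
            remaining -= 1
            powered = True
        else:
            powered = False
        out.append(powered)
    return out

switch = [0, 1, 1, 0, 0, 0, 0, 0]  # released at sample 3, mid-word
level  = [0, 0, 1, 1, 1, 0, 0, 0]  # speech continues through sample 4
print(power_signal(switch, level, hold=2))
# [False, True, True, True, True, True, True, False]
```

In this model, sound with no switch press never turns the power on, for example `power_signal([0, 0, 0], [1, 1, 1], hold=2)` stays all `False`, which reflects the noise immunity the talk switch is meant to provide.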

[0045] In other words, there is no need to power the entire voice transmitter at all times. Power is supplied to the voice transmission main processing unit while the talk switch is pressed and, after the talk switch is released, the presence of voice input is detected and power continues to be supplied for the detected section plus the several seconds set in the time constant holding unit 3. As a result, a voice signal with no clipped word endings is transmitted wirelessly, the power drawn from the battery is reduced, the operating time is extended, and speech recognition with reliable voice transmission can be realized.

[0046] In Embodiment 1, the release of the talk switch is detected and power is maintained only for the time set in the time constant holding unit after the release, so the time constant must be set with a considerable margin, which is a drawback. In Embodiment 2, once power supply has started it is maintained until the voice level disappears, which is a more reliable countermeasure against clipped word endings. In either case, there is no need to power the entire voice transmitter at all times. By continuing to supply power to the voice transmission main processing unit while the talk switch is pressed and, after the talk switch is released, for the several seconds until the utterance ends, a voice signal with no clipped word endings is transmitted wirelessly, the power drawn from the battery is reduced, the operating time is extended, and speech recognition with reliable voice transmission can be realized.

[0047]

[Effects of the Invention] As described above, the speech recognition device of the present invention continues to supply power to the voice transmission main processing unit while the talk switch is pressed and for several seconds after the talk switch is released, and wirelessly transmits a voice signal with no clipped word endings. This makes it possible to provide a speech recognition device that reduces the power consumption of the power supply battery, extends the operating time, and transmits voice reliably.
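The effect on operating time can be illustrated with a rough duty-cycle estimate. The patent gives no current or capacity figures, so every number below is a hypothetical placeholder, and the function names are assumptions introduced only for this sketch.

```python
def average_current_ma(duty: float, active_ma: float, standby_ma: float) -> float:
    """Average current when the main processing unit is powered for a
    fraction `duty` of the time and the rest of the time only standby
    circuitry draws current."""
    if not 0.0 <= duty <= 1.0:
        raise ValueError("duty must be between 0 and 1")
    return duty * active_ma + (1.0 - duty) * standby_ma


def battery_life_hours(capacity_mah: float, duty: float,
                       active_ma: float, standby_ma: float) -> float:
    """Idealized battery life: capacity divided by average current."""
    return capacity_mah / average_current_ma(duty, active_ma, standby_ma)


# Hypothetical values, not taken from the patent:
# 1000 mAh battery, 50 mA when transmitting, 0.5 mA in standby.
always_on = battery_life_hours(1000, 1.0, 50, 0.5)  # 20.0 hours
gated = battery_life_hours(1000, 0.1, 50, 0.5)      # about 183 hours
print(always_on, round(gated, 1))
```

Under these assumed figures, powering the main processing unit only 10 % of the time extends the idealized battery life roughly ninefold; real gains depend on the actual currents and on the hold time chosen.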

[Brief description of the drawings]

FIG. 1 is a block diagram of the voice transmitter of the speech recognition device according to the first embodiment of the present invention.

FIG. 2 is a diagram showing operation waveforms of each part of FIG. 1.

FIG. 3 is a diagram showing a circuit example of the time constant holding unit and the power supply control unit of FIG. 1.

FIG. 4 is a block diagram of the voice transmitter of the speech recognition device according to the second embodiment of the present invention.

FIG. 5 is a diagram showing operation waveforms of each part of FIG. 4.

FIG. 6 is a block diagram of the voice transmitter of a speech recognition device according to a first conventional example.

FIG. 7 is a block diagram of the voice transmitter of a speech recognition device according to a second conventional example.

[Explanation of symbols]

1 Voice transmission main processing unit
2 Talk switch
3 Time constant holding unit
4 Power supply control unit
5 Battery unit
6 Display unit
7 Microphone
8 Amplifier
9 Modulation transmission unit
10 Diode
11 Diode
12 Level detection unit
13 Voice transmission main processing unit
20 Microphone
21 Voice section detection unit
22 Wireless transmission unit
23 Ring memory
24 A/D converter
25 D/A converter

Claims (3)

[Claims]
1. A speech recognition device that continues to supply power to a voice transmission main processing unit while a talk switch is pressed and for several seconds after the talk switch is released, and wirelessly transmits a voice signal with no clipped word endings, thereby reducing the power consumption of a power supply battery, extending the operating time, and reliably performing voice transmission.
2. A speech recognition device comprising: a voice transmission main processing unit including a microphone that converts an input voice signal into an electric signal, an amplifier that amplifies the output of the microphone, and a modulation transmission unit that modulates and transmits the voice signal amplified by the amplifier; a talk switch that signals a voice utterance operation; a time constant holding unit that holds a time constant for the time during which the talk switch is pressed; a power supply control unit that supplies power to the voice transmission main processing unit only for the time width set by the time constant holding unit; and a display unit that receives the output of the power supply control unit as input and indicates when power is being supplied.
3. A speech recognition device comprising: a voice transmission main processing unit including a microphone that converts an input voice signal into an electric signal, an amplifier that amplifies the output of the microphone, a modulation transmission unit that modulates and transmits the voice signal amplified by the amplifier, and a level detector that receives the output of the amplifier as input and detects the voice input level; a talk switch that signals a voice utterance operation; an OR circuit that takes the logical OR of the talk switch and the level detector; a time constant holding unit that holds a time constant for the output pulse of the OR circuit, the pulse being the logical OR of the time during which the talk switch is pressed and the voice interval detected by the level detector; a power supply control unit that supplies power to the voice transmission main processing unit only for the time width set by the time constant holding unit; and a display unit that receives the output of the power supply control unit as input and indicates when power is being supplied.
JP2001146205A 2001-05-16 2001-05-16 Speech recognition device Pending JP2002341893A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
JP2001146205A JP2002341893A (en) 2001-05-16 2001-05-16 Speech recognition device

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
JP2001146205A JP2002341893A (en) 2001-05-16 2001-05-16 Speech recognition device

Publications (1)

Publication Number Publication Date
JP2002341893A true JP2002341893A (en) 2002-11-29

Family

ID=18991894

Family Applications (1)

Application Number Title Priority Date Filing Date
JP2001146205A Pending JP2002341893A (en) 2001-05-16 2001-05-16 Speech recognition device

Country Status (1)

Country Link
JP (1) JP2002341893A (en)

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2007286180A (en) * 2006-04-13 2007-11-01 Funai Electric Co Ltd Electronic apparatus with voice recognition function
JP4670716B2 (en) * 2006-04-13 2011-04-13 船井電機株式会社 Electronic device with voice recognition function
JP2014170984A (en) * 2013-03-01 2014-09-18 Casio Comput Co Ltd Communication device and program

Similar Documents

Publication Publication Date Title
US6012029A (en) Voice activated system for locating misplaced items
US8271287B1 (en) Voice command remote control system
KR100818460B1 (en) Sound outputting apparatus and sound outputting method
JP2011118822A (en) Electronic apparatus, speech detecting device, voice recognition operation system, and voice recognition operation method and program
US6560469B1 (en) Microphone/speaker-contained wireless remote control system for internet device and method for controlling operation of remote controller therein
JP2983227B2 (en) Wireless telephone equipment
US7742069B2 (en) Telephone ring activation of a wireless transmitter for remote control of a television
JP5211684B2 (en) Audio output device and noise prevention method
JP2004219728A (en) Speech recognition device
US7020292B1 (en) Apparatuses and methods for recognizing an audio input and muting an audio device
JP2007165940A (en) Cellular phone, and acoustic reproduction operation automatic stopping method therefor
JP2002341893A (en) Speech recognition device
JPH1145474A (en) Transmitter/receiver for acoustic control signal, control system by acoustic control signal and control method
JP2001318689A (en) Remote controller by means of speech recognition
CN220491386U (en) Doorbell system based on wireless transmission
JPH09206329A (en) Audibility support system
WO2001008379A1 (en) Communication device and display control device
JP4210660B2 (en) Hearing aid system for remote control of hearing aids
JPH0479430A (en) Portable telephone system
JP2005536107A (en) Ring-activated mute
KR200335010Y1 (en) Wireless Ear MIC equipped with the voice control function for two way radios
JP3154914B2 (en) Communication device
JPH1070472A (en) Wireless microphone system
JP3687227B2 (en) TV door phone system
JP2003140689A (en) Voice recognition device

Legal Events

Date Code Title Description
A621 Written request for application examination

Free format text: JAPANESE INTERMEDIATE CODE: A621

Effective date: 20080331

RD01 Notification of change of attorney

Free format text: JAPANESE INTERMEDIATE CODE: A7421

Effective date: 20080414

RD01 Notification of change of attorney

Free format text: JAPANESE INTERMEDIATE CODE: A7421

Effective date: 20091119

A131 Notification of reasons for refusal

Free format text: JAPANESE INTERMEDIATE CODE: A131

Effective date: 20100720

A02 Decision of refusal

Free format text: JAPANESE INTERMEDIATE CODE: A02

Effective date: 20101207