JP2017084041A

JP2017084041A - Voice command input device and welding system

Info

Publication number: JP2017084041A
Application number: JP2015210677A
Authority: JP
Inventors: 将一石和久; Masakazu Ishiwaku
Original assignee: Daihen Corp
Current assignee: Daihen Corp
Priority date: 2015-10-27
Filing date: 2015-10-27
Publication date: 2017-05-18
Anticipated expiration: 2035-10-27
Also published as: JP6562815B2

Abstract

PROBLEM TO BE SOLVED: To highly accurately input a voice command without being influenced by noise.SOLUTION: A voice command input device comprises: an acceleration sensor 101 that is fitted around the mouth of a speaker; a coordinate transformation part 102 for transforming a detection value of the acceleration sensor 101 into data of motions of the mouth of the speaker; a voice conversion part 104 for converting the data of the motions of the mouth into a string of phonemes by a first conversion table that is stored in a conversion table storage part 103; a voice recognition part 105 for recognizing a command word from data of the string of phonemes by a language model that is stored in a language model storage part 106; and a command signal conversion part 107 by which the recognized voice command is converted into a command signal and outputted while using a second conversion table that is stored in a command word storage part 108. The voice command of the speaker is recognized from motions or shapes of the mouth, such that a command signal corresponding to the voice command can be highly accurately inputted without being influenced by noise.SELECTED DRAWING: Figure 1

Description

本発明は、加速度センサを用いた音声指令入力装置、及びその音声指令入力装置を用いた溶接システムに関するものである。 The present invention relates to a voice command input device using an acceleration sensor and a welding system using the voice command input device.

従来、音声による指令（以下、「音声指令」という。）が入力可能な溶接システムが提案されている。例えば、特許文献１には、作業者が音声指令を入力するための耳掛け式のマイクを設ける一方、溶接トーチにそのマイクを接続するためのマイク接続部を設け、作業者が溶接作業をする際にマイクを溶接トーチのマイク接続部に接続する構成が開示されている。 Conventionally, a welding system capable of inputting a voice command (hereinafter referred to as a “voice command”) has been proposed. For example, Patent Document 1 provides an ear-hook type microphone for an operator to input a voice command, while providing a microphone connection portion for connecting the microphone to a welding torch, and the operator performs a welding operation. A configuration is disclosed in which a microphone is connected to a microphone connection portion of a welding torch.

特許文献１は、作業者がマイクに入力した音声指令を、マイク接続部と、溶接トーチを溶接電源装置に接続する接続ケーブルとを介して溶接電源装置内の制御回路に送信し、当該制御回路で音声指令の入力処理をする構成である。 Patent Document 1 transmits a voice command input to a microphone by an operator to a control circuit in the welding power supply device via a microphone connection portion and a connection cable that connects the welding torch to the welding power supply device. In this configuration, voice command input processing is performed.

溶接現場は、アークの発生に起因する騒音を初めとして比較的高い騒音が発生する環境である。このため、作業者の発した音声指令（騒音が混入した音声指令）によって溶接システムの溶接電源装置を制御するには、音声指令を電気信号に変換した指令信号に対して騒音に基づくノイズを十分に抑制する必要がある。 The welding site is an environment in which relatively high noise is generated, including noise caused by arc generation. For this reason, in order to control the welding power supply device of the welding system by a voice command (voice command mixed with noise) issued by an operator, noise based on noise is sufficient for the command signal obtained by converting the voice command into an electrical signal. It is necessary to suppress it.

従来、音声認識システムにおいては、環境騒音による誤認識や誤動作を低減する技術が提案されている。例えば、特許文献２には、環境騒音による誤認識や誤動作を低減する有効な方法として、音声用マイクと騒音用マイクを設け、音声用マイクで得られた音声から騒音用マイクで得られた騒音成分（ノイズ成分）を差し引く方法（ノイズキャンセラー）や、音声用マイクに指向性の強いマイクや骨伝導マイクを用いて利用者の音声だけを拾う方法が開示されている。 Conventionally, in a speech recognition system, a technique for reducing misrecognition and malfunction due to environmental noise has been proposed. For example, in Patent Document 2, as an effective method for reducing misrecognition and malfunction due to environmental noise, a voice microphone and a noise microphone are provided, and noise obtained with a noise microphone from voice obtained with the voice microphone is disclosed. A method of subtracting a component (noise component) (noise canceller) and a method of picking up only a user's voice using a microphone having a high directivity or a bone conduction microphone are disclosed.

また、特許文献３には、人の撮影画像の唇の動きを用いてその人が発する音声を認識する技術が示されている。具体的には、撮影装置で撮影者自身を撮影するセルフタイマー撮影において、被写体である撮影者の動画像から唇領域の画像を抽出し、その抽出画像の唇の動きから撮影者がシャッターに関するキーワードを発したことを検出して静止画像を撮影する技術が記載されている。 Patent Document 3 discloses a technique for recognizing a voice produced by a person using the movement of a lip of a person's photographed image. Specifically, in self-timer shooting in which a photographer himself / herself is photographed by a photographing apparatus, a lip region image is extracted from a moving image of the photographer as a subject, and the photographer selects a keyword related to the shutter from the movement of the lips of the extracted image. A technique for detecting the occurrence of an image and capturing a still image is described.

特開２０１３−２４８６６３号公報JP 2013-248663 A 特開平０４−２７６７９９号公報Japanese Patent Laid-Open No. 04-276799 特開平２０１１−１４９８５号公報Japanese Unexamined Patent Publication No. 2011-14985

例えば、パルス溶接では、溶接電流がパルス周波数で小電流と大電流に交互に切り換えられるので、その溶接電流の変化に応じてアークの太さが変化する。このため、アークの周りの空気が振動して非常に大きな騒音が発生する。この騒音の大きさは、場合によっては８０〜１００［ｄＢ］の大騒音になることがあり、パルス溶接では一般的な工場騒音よりも大きな騒音やノイズが発生する。 For example, in pulse welding, since the welding current is alternately switched between a small current and a large current at the pulse frequency, the thickness of the arc changes according to the change in the welding current. For this reason, the air around the arc vibrates and a very large noise is generated. The magnitude of this noise may be as high as 80 to 100 [dB] in some cases, and noise and noise larger than general factory noise are generated in pulse welding.

溶接現場では、非常に大きな騒音が発生するので、マイクで集音した音声信号にフィルタリング処理を施して騒音を十分に除去することは困難である。このため、溶接システムに適用される音声指令入力装置では、音声による操作指令を高い精度で入力することが困難である。 Since very large noise is generated at the welding site, it is difficult to sufficiently remove the noise by performing filtering processing on the audio signal collected by the microphone. For this reason, in the voice command input device applied to the welding system, it is difficult to input a voice operation command with high accuracy.

騒音の影響を受けない方法として、特許文献３に開示されている作業者の顔を撮影した動画像を用いて当該作業者の発する音声を読み取る方法を適用することが考えられる。 As a method not affected by noise, it is conceivable to apply a method disclosed in Patent Document 3 that reads a voice generated by a worker using a moving image obtained by photographing the worker's face.

しかしながら、この方法では、作業者の顔に対してカメラを正対させるように配置しなければならず、カメラの配設が容易でない。特に、溶接作業中は作業者の姿勢が常に変化するので、作業者とは別の位置に固定したカメラでは作業者の顔の部分を撮影できない場合も生じ、作業者の発する音声を読み取れないことが生じる。 However, in this method, it is necessary to arrange the camera so as to face the operator's face, and it is not easy to arrange the camera. In particular, since the posture of the worker always changes during welding work, it may happen that the face part of the worker cannot be photographed with a camera fixed at a position different from the worker, and the voice produced by the worker cannot be read. Occurs.

この不都合を回避するために、作業者の頭部にカメラを取り付ける場合は、カメラを作業者の顔に正対させるように取り付けることになるので、カメラやそのカメラの支持部材が作業者の溶接作業の邪魔になるという問題が生じる。 In order to avoid this inconvenience, when the camera is attached to the operator's head, the camera and the camera support member are welded to the operator because the camera is attached to face the operator's face. The problem of getting in the way of work arises.

本発明は、上記の課題に鑑みてなされたもので、作業現場の周囲の騒音の影響を受けることなく高い精度で音声指令を入力することができる音声指令入力装置及びその音声指令入力装置を用いた溶接システムを提供することを目的とする。 The present invention has been made in view of the above problems, and uses a voice command input device and a voice command input device that can input voice commands with high accuracy without being affected by noise around the work site. The purpose is to provide a welding system.

本発明に係る音声指令入力装置は、音声により指令信号を入力する音声指令入力装置であって、話者が音声指令を発声したときの当該話者の口の動きを検出する検出手段と、検出手段が検出した口の動きを話者が発声した指令語に変換する指令語変換手段と、変換手段が変換した指令語に対応する指令信号を生成する指令信号生成手段と、を備えたことを特徴とする。 A voice command input device according to the present invention is a voice command input device for inputting a command signal by voice, and a detection means for detecting a movement of the speaker's mouth when the speaker utters a voice command; Command word conversion means for converting the mouth movement detected by the means into a command word uttered by the speaker, and command signal generation means for generating a command signal corresponding to the command word converted by the conversion means. Features.

この構成によれば、話者の口の動きから話者が発声した音声指令を認識することにより、周囲の騒音の影響を受けることなく音声指令に対応する指令信号を入力することができる。 According to this configuration, by recognizing the voice command uttered by the speaker from the movement of the speaker's mouth, it is possible to input a command signal corresponding to the voice command without being affected by ambient noise.

上記の音声指令入力装置において、検出手段は、話者の頬若しくは口元の周辺部に取り付けられる加速度センサと、話者が音声指令を発声するのに応じて加速度センサから出力される加速度の検出値を話者の口の動きのデータに変換する第１の変換手段とを含み、指令語変換手段は、話者の発声する音素と当該音素に対応する口の動きとの対応関係を示す第１の変換テーブルを記憶する第１の記憶手段と、予め登録された１以上の指令語を記憶する第２の記憶手段と、第１の変換テーブルを用いて、第１の変換手段から出力される話者の口の動きのデータを音素の列に変換する第２の変換手段と、第２の変換手段で変換された音素の列を第２の記憶手段に登録された指令語と照合して話者の発声した指令語を認識する音声認識手段とを含み、指令信号生成手段は、第２の記憶手段に記憶された１以上の指令語と指令信号との対応関係を示す第２の変換テーブルを記憶する第３の記憶手段と、第２の変換テーブルを用いて、音声認識手段が認識した指令語を当該指令語に対応する指令信号に変換する第３の変換手段とを含むとよい。 In the above voice command input device, the detection means includes an acceleration sensor attached to the speaker's cheek or peripheral portion of the mouth, and a detected acceleration value output from the acceleration sensor in response to the speaker speaking the voice command. The first conversion means for converting the voice into the mouth movement data of the speaker, the command word conversion means is a first indicating a correspondence relationship between the phoneme uttered by the speaker and the mouth movement corresponding to the phoneme. Output from the first conversion means using the first storage means for storing the first conversion table, the second storage means for storing one or more command words registered in advance, and the first conversion table. A second conversion means for converting the speech movement data of the speaker into a phoneme string, and collating the phoneme string converted by the second conversion means with a command word registered in the second storage means. Voice recognition means for recognizing command words spoken by a speaker, The command signal generating means includes a third storage means for storing a second conversion table indicating a correspondence relationship between one or more command words stored in the second storage means and the command signal, and a second conversion table. And a third conversion means for converting the command word recognized by the voice recognition means into a command signal corresponding to the command word.

この構成によれば、加速度センサにより検出した話者の口の動きのデータと第１の変換テーブルとから話者の発声した音声指令を構成する音素を適切に検出することができとともに、検出した音声指令の構成音素と登録した指令語とから当該音声指令の指令語を適切に認識することができる。また、第２の変換テーブルを用いて、認識した指令語を当該指令語に対応する指令信号に変換して入力することができる。 According to this configuration, the phoneme constituting the voice command uttered by the speaker can be appropriately detected and detected from the data of the mouth movement of the speaker detected by the acceleration sensor and the first conversion table. The command word of the voice command can be appropriately recognized from the constituent phonemes of the voice command and the registered command word. Further, the recognized command word can be converted into a command signal corresponding to the command word and input using the second conversion table.

さらに、上記の音声指令入力装置において、音声認識手段は、第２の記憶手段に登録された指令語と第２の変換手段で変換された音素の列との一致度を算出し、所定の閾値以上の一致度を有し、かつ、一致度が最も高い指令語を話者が発声した音声指令と認識するとよい。 Furthermore, in the above voice command input device, the voice recognition means calculates the degree of coincidence between the command word registered in the second storage means and the phoneme string converted by the second conversion means, and has a predetermined threshold value. The command word having the above matching degree and the highest matching degree may be recognized as a voice command uttered by the speaker.

この構成によれば、音声認識により認識した指令語に完全に一致する指令語が登録されてなくても、音声指令を高い精度で当該音声指令に対応する指令語に変換して入力することができる。 According to this configuration, even if a command word that completely matches the command word recognized by voice recognition is not registered, the voice command can be converted into a command word corresponding to the voice command and input with high accuracy. it can.

また、上記の音声指令入力装置において、話者の発声した音素に応じた口の動きのデータは、話者が音声を発声しない表情を基準として音声を発声したときの表情の変化に基づく加速度センサの変位ベクトルのデータであるとよい。 In the voice command input device, the mouth movement data corresponding to the phoneme uttered by the speaker is an acceleration sensor based on a change in facial expression when the speaker utters the speech based on an expression that does not utter the speech. The displacement vector data may be good.

この構成によれば、話者の発声した音声指令の音素の応じた口の動きを高い精度で検出することができる。 According to this configuration, the movement of the mouth according to the phoneme of the voice command uttered by the speaker can be detected with high accuracy.

また、上記の音声指令入力装置において、加速度センサは、互いに直交する３軸方向の加速度を検出する三軸加速度センサであり、話者の両頬もしくは口元の両側にそれぞれ取り付けられているとよい。 In the voice command input device described above, the acceleration sensor is a triaxial acceleration sensor that detects acceleration in three axial directions orthogonal to each other, and may be attached to both sides of the speaker's cheeks or mouth.

この構成によれば、話者の発声した音声指令の音素の応じた微妙な口の動きを高い精度で検出することができる。 According to this configuration, it is possible to detect a delicate movement of the mouth corresponding to the phoneme of the voice command uttered by the speaker with high accuracy.

本発明に係る溶接システムは、溶接トーチと、溶接トーチに溶接ワイヤを送給するワイヤ送給装置と、ワイヤ送給装置を介して溶接トーチと母材との間に溶接用の電力を供給する溶接電源と、溶接電源に各種の指令を入力するための指令入力装置と、を備えた溶接システムであって、指令入力装置は上記の本発明に係る音声指令入力装置で構成されている、ことを特徴とする。 A welding system according to the present invention supplies a welding torch, a wire feeding device that feeds a welding wire to the welding torch, and electric power for welding between the welding torch and the base material via the wire feeding device. A welding system comprising a welding power source and a command input device for inputting various commands to the welding power source, wherein the command input device comprises the voice command input device according to the present invention described above. It is characterized by.

この構成によれば、作業者の口の動きから話者が発声した音声指令を認識することにより、周囲の騒音の影響を受けることなく高い精度で溶接に関する音声指令を当該音声指令に対応する指令信号を変換して溶接電源に入力することができる溶接システムを適用することができる。 According to this configuration, by recognizing the voice command uttered by the speaker from the movement of the worker's mouth, the command corresponding to the voice command can be given to the voice command related to welding with high accuracy without being affected by ambient noise. A welding system that can convert a signal and input it to a welding power source can be applied.

上記の溶接システムにおいて、音声指令入力装置は、ワイヤ送給装置に設けられているとよい。 In the above welding system, the voice command input device may be provided in the wire feeding device.

この構成によれば、作業者の近くに配置されるワイヤ送給装置に音声指令入力装置を設けることにより、溶接システムへの音声指令入力装置の配設が容易かつコンパクトになる。 According to this configuration, the voice command input device is easily and compactly arranged in the welding system by providing the voice command input device in the wire feeding device arranged near the worker.

また、上記の溶接システムにおいて、ワイヤ送給装置に無線通信により接続可能な遠隔操作装置を備え、音声指令入力装置は遠隔操作装置に設けられている、とよい。 The welding system may include a remote operation device that can be connected to the wire feeding device by wireless communication, and the voice command input device may be provided in the remote operation device.

この構成によれば、作業者に保持される遠隔操作装置に音声指令入力装置を設けることにより、溶接システムへの音声指令入力装置の配設が容易かつコンパクトになる。 According to this configuration, the voice command input device is easily and compactly arranged in the welding system by providing the voice command input device in the remote control device held by the operator.

本発明による音声指令入力装置及び溶接システムによれば、作業者が発した音声指令を騒音の影響を受けることなく高い精度で認識し、その認識した音声指令に基づいて信頼性の高い指令信号を生成することができる。 According to the voice command input device and the welding system of the present invention, a voice command issued by an operator is recognized with high accuracy without being affected by noise, and a highly reliable command signal is generated based on the recognized voice command. Can be generated.

本発明に係る音声指令入力装置の構成を示すブロック図The block diagram which shows the structure of the voice command input device which concerns on this invention 同音声指令入力装置に用いられる加速度センサの外観を示す外観図External view showing the external appearance of the acceleration sensor used in the voice command input device 同音声指令入力装置に用いられる加速度センサを話者の顔に取り付けた場合の加速度の検出方向を示す図The figure which shows the detection direction of an acceleration at the time of attaching the acceleration sensor used for the voice command input device to a speaker's face 同音声指令入力装置に用いられる加速度センサの変位ベクトルを示す図The figure which shows the displacement vector of the acceleration sensor used for the voice command input device 話者が母音を発声するときの口の形の一例を示す図Figure showing an example of the shape of the mouth when the speaker utters a vowel 同音声指令入力装置に記憶される音素と口元の動きのデータとの対応関係を示す第１の変換テーブルの一例を示す図The figure which shows an example of the 1st conversion table which shows the correspondence of the phoneme memorize | stored in the same voice command input device and the data of the movement of the mouth 同音声指令入力装置を備える溶接システムの構成を示すブロック図The block diagram which shows the structure of a welding system provided with the voice command input device 同音声指令入力装置に記憶されている指令語に使用される用語や単語のデータの一例を示す図The figure which shows an example of the data of the term used for the command word memorize | stored in the voice command input device, and the word 同音声指令入力装置に登録されている溶接システムの指令語とその内容の一例を示す図The figure which shows an example of the command word of the welding system registered into the voice command input device, and the content 同音声指令入力装置による音声指令の入力制御を示すフローチャートFlowchart showing voice command input control by the voice command input device 同音声指令入力装置を備えた溶接システムの変形例を示す図The figure which shows the modification of the welding system provided with the voice command input device

以下、本発明に係る音声指令入力装置と、その音声指令入力装置を備えた溶接システムの実施の形態について、図面を参照して説明する。 DESCRIPTION OF EMBODIMENTS Hereinafter, embodiments of a voice command input device according to the present invention and a welding system including the voice command input device will be described with reference to the drawings.

まず、音声指令入力装置について説明する。 First, the voice command input device will be described.

図１は、本発明に係る音声指令入力装置の構成を示すブロック図である。図２は、同音声指令入力装置に用いられる加速度センサの外観を示す外観図である。 FIG. 1 is a block diagram showing a configuration of a voice command input device according to the present invention. FIG. 2 is an external view showing an external appearance of an acceleration sensor used in the voice command input device.

音声指令入力装置１は、加速度センサ１０１、座標変換部１０２、変換テーブル記憶部１０３、音声変換部１０４、音声認識部１０５、言語モデル記憶部１０６、指令信号変換部１０７及び指令語記憶部１０８を含む。 The voice command input device 1 includes an acceleration sensor 101, a coordinate conversion unit 102, a conversion table storage unit 103, a voice conversion unit 104, a voice recognition unit 105, a language model storage unit 106, a command signal conversion unit 107, and a command word storage unit 108. Including.

加速度センサ１０１は、音声指令を発する話者（作業者）Ｑの頬や口元の周辺部に取り付けられ、口角を動かす頬筋や口元の表情を作る口輪筋などの動きを検出するためのセンサである。座標変換部１０２、変換テーブル記憶部１０３、音声変換部１０４、音声認識部１０５、言語モデル記憶部１０６、指令信号変換部１０７及び指令語記憶部１０８は、音声指令入力装置１の装置本体１００内に設けられる。 The acceleration sensor 101 is attached to the cheek of the speaker (worker) Q who issues a voice command and the periphery of the mouth, and detects movements of the cheek muscle that moves the corner of the mouth and the mouth muscle that creates the facial expression of the mouth. It is. The coordinate conversion unit 102, the conversion table storage unit 103, the voice conversion unit 104, the voice recognition unit 105, the language model storage unit 106, the command signal conversion unit 107, and the command word storage unit 108 are included in the device main body 100 of the voice command input device 1. Is provided.

加速度センサ１０１及び座標変換部１０２は、本発明に係る「検出手段」に対応し、座標変換部１０２は、本発明に係る「第１の変換手段」に対応している。また、変換テーブル記憶部１０３、音声変換部１０４、音声認識部１０５及び言語モデル記憶部１０６は、本発明に係る「指令語変換手段」に対応し、変換テーブル記憶部１０３と言語モデル記憶部１０６は、本発明に係る「第１の記憶手段」と「第２の記憶手段」とにそれぞれ対応している。また、音声変換部１０４と音声認識部１０５は、本発明に係る「第２の変換手段」と「音声認識手段」とにそれぞれ対応している。 The acceleration sensor 101 and the coordinate conversion unit 102 correspond to a “detection unit” according to the present invention, and the coordinate conversion unit 102 corresponds to a “first conversion unit” according to the present invention. The conversion table storage unit 103, the voice conversion unit 104, the voice recognition unit 105, and the language model storage unit 106 correspond to the “command word conversion unit” according to the present invention, and the conversion table storage unit 103 and the language model storage unit 106. Corresponds to the “first storage means” and the “second storage means” according to the present invention, respectively. The voice conversion unit 104 and the voice recognition unit 105 correspond to the “second conversion unit” and the “voice recognition unit” according to the present invention, respectively.

また、指令信号変換部１０７及び指令語記憶部１０８は、本発明に係る「指令信号生成手段」に対応し、指令信号変換部１０７は、本発明に係る「第３の変換手段」に対応し、指令語記憶部１０８は、本発明に係る「第３の記憶手段」に対応している。 The command signal conversion unit 107 and the command word storage unit 108 correspond to the “command signal generation unit” according to the present invention, and the command signal conversion unit 107 corresponds to the “third conversion unit” according to the present invention. The command word storage unit 108 corresponds to the “third storage unit” according to the present invention.

加速度センサ１０１は、話者Ｑの頭部に装着されるケーブル中継部材２を介して装置本体１００内の座標変換部１０２に接続されている。ケーブル中継部材２は、円弧型に湾曲した弾性を有する頭部装着部２０１と、頭部装着部２０１の両端に設けられた加速度センサ１０１との接続部２０２と、一方端が接続部２０２に接続され、頭部装着部２０１のほぼ中央から引き出された他方端が装置本体１００に接続される信号線２０３と、を有する。 The acceleration sensor 101 is connected to the coordinate conversion unit 102 in the apparatus main body 100 via the cable relay member 2 attached to the head of the speaker Q. The cable relay member 2 includes a head mounting portion 201 having elasticity curved in an arc shape, a connection portion 202 between the acceleration sensor 101 provided at both ends of the head mounting portion 201, and one end connected to the connection portion 202. And a signal line 203 connected to the apparatus main body 100 at the other end drawn out from substantially the center of the head mounting portion 201.

加速度センサ１０１からはフレキシブルなリード線１０１１が引き出され、そのリード線１０１１の先端には接続部２０２に電気的に接続するためのコネクタ１０１２が設けられている。加速度センサ１０１のコネクタ１０１２をケーブル中継部材２の接続部２０２に接続することにより、加速度センサ１０１は、リード線１０１１及び信号線２０３によって装置本体１００内の座標変換部１０２に電気的に接続される。加速度センサ１０１の検出信号は、リード線１０１１及び信号線２０３を介して座標変換部１０２に入力される。 A flexible lead wire 1011 is drawn out from the acceleration sensor 101, and a connector 1012 for electrically connecting to the connection portion 202 is provided at the tip of the lead wire 1011. By connecting the connector 1012 of the acceleration sensor 101 to the connection part 202 of the cable relay member 2, the acceleration sensor 101 is electrically connected to the coordinate conversion part 102 in the apparatus main body 100 by the lead wire 1011 and the signal line 203. . A detection signal of the acceleration sensor 101 is input to the coordinate conversion unit 102 via the lead wire 1011 and the signal line 203.

加速度センサ１０１には、図２に示すように、数ｍｍ径の円盤状を有する小型・軽量の半導体式加速度センサが用いられている。半導体式加速度センサは、例えば、ＭＥＭＳ（ＭｉｃｒｏＥｌｅｃｔｒｏＭｅｃｈａｎｉｃａｌＳｙｓｔｅｍｓ）技術を用いたものが好適である。 As the acceleration sensor 101, as shown in FIG. 2, a small and lightweight semiconductor acceleration sensor having a disk shape with a diameter of several mm is used. The semiconductor acceleration sensor preferably uses, for example, a MEMS (Micro Electro Mechanical Systems) technology.

半導体式加速度センサ１０１は、加速度を検出するセンサチップと、センサチップから出力される検出信号に所定の信号処理を行って外部に出力する信号処理回路を含む。加速度検出方式には静電容量型、ピエゾ抵抗型、ガス温度分布型、動電型、磁気型などの各種のタイプがあるが、いずれのタイプのセンサチップを用いてもよい。 The semiconductor acceleration sensor 101 includes a sensor chip that detects acceleration and a signal processing circuit that performs predetermined signal processing on a detection signal output from the sensor chip and outputs the detected signal to the outside. There are various types of acceleration detection methods such as a capacitance type, a piezoresistive type, a gas temperature distribution type, an electrodynamic type, and a magnetic type, and any type of sensor chip may be used.

加速度センサには、一般に、ＸＹＺ直交座標系（図２参照）の各軸方向（３方向）の加速度を検出する３軸加速度センサ、２方向（ＸＹ方向）の加速度を検出する２軸加速度センサ、１方向（Ｘ方向）の加速度を検出する１軸加速度センサがある。加速度センサ１０１は、好ましくは２軸加速度センサ又は３軸加速度センサを用いるとよい。 The acceleration sensor generally includes a three-axis acceleration sensor that detects acceleration in each axial direction (three directions) of an XYZ orthogonal coordinate system (see FIG. 2), a two-axis acceleration sensor that detects acceleration in two directions (XY direction), There is a one-axis acceleration sensor that detects acceleration in one direction (X direction). As the acceleration sensor 101, a 2-axis acceleration sensor or a 3-axis acceleration sensor is preferably used.

加速度センサ１０１は、話者Ｑの両頬若しくは口元の両側の近傍に１対取り付けるのが好ましいが、２対以上取り付けてもよく、話者Ｑの一方の頬若しくは口元の一方側の近傍に１個取り付けてもよい。 The acceleration sensor 101 is preferably attached in a pair near both cheeks or both sides of the mouth of the speaker Q, but two or more pairs may be attached, and one in the vicinity of one cheek or one side of the mouth of the speaker Q. It may be attached individually.

加速度センサ１０１は、例えば、シート状の接着部材３によって話者Ｑの頬若しくは口元の周辺部に接着されている。接着部材３には、皮膚への密着性や接着強度などを考慮して、例えば、医療用テープや皮膚貼付用テープ等を用いるとよい。図１の例では、加速度センサ１０１に接着部材３を直接接着しているが、加速度センサ１０１と接着部材３との間に、話者Ｑが口を動かしたときの皮膚の振動を吸収するための緩衝部材を介在させるようにしてもよい。 The acceleration sensor 101 is bonded to the cheek of the speaker Q or the peripheral portion of the mouth by, for example, a sheet-like adhesive member 3. For the adhesive member 3, for example, a medical tape or a skin adhesive tape may be used in consideration of adhesion to the skin, adhesive strength, and the like. In the example of FIG. 1, the adhesive member 3 is directly bonded to the acceleration sensor 101, but in order to absorb the vibration of the skin when the speaker Q moves his / her mouth between the acceleration sensor 101 and the adhesive member 3. A buffer member may be interposed.

図１の例では、話者Ｑの口元の両側の近傍にそれぞれ加速度センサ１０１Ｒと加速度センサ１０１Ｌ（Ｒは、話者Ｑの顔の右側にあるセンサを示し、Ｌは、話者Ｑの顔の左側にあるセンサを示す。）が取り付けられている。また、加速度センサ１０１Ｒ，１０１Ｌは、３軸加速度センサである。話者Ｑの口元の両側に一対の３軸加速度センサ１０１Ｒ，１０１Ｌを設けることにより、両加速度センサ１０１Ｒ，１０１Ｌの検出信号Ｓｘ，Ｓｙ，Ｓｚを用いて話者Ｑが音声を発したときの口元の微妙な動きを検出することができる。 In the example of FIG. 1, the acceleration sensor 101R and the acceleration sensor 101L (R is a sensor on the right side of the face of the speaker Q, and L is the position of the face of the speaker Q. The sensor on the left is shown.) Is attached. The acceleration sensors 101R and 101L are triaxial acceleration sensors. By providing a pair of three-axis acceleration sensors 101R, 101L on both sides of the speaker Q's mouth, the mouth when the speaker Q utters speech using the detection signals Sx, Sy, Sz of both acceleration sensors 101R, 101L. Can detect subtle movements.

図３は、話者Ｑの顔に加速度センサ１０１を取り付けた場合の当該加速度センサ１０１の加速度の検出方向を示す図である。 FIG. 3 is a diagram illustrating the acceleration detection direction of the acceleration sensor 101 when the acceleration sensor 101 is attached to the face of the speaker Q.

３軸の加速度センサ１０１には、当該加速度センサ１０１に対して互いに直交するＸＹＺ座標系が仮想設定されており、加速度センサ１０１からはそのＸＹＺ座標系のＸ軸、Ｙ軸、Ｚ軸の各軸方向の加速度を検出した検出信号Ｓｘ，Ｓｙ，Ｓｚが出力される。 An XYZ coordinate system that is orthogonal to the acceleration sensor 101 is virtually set in the three-axis acceleration sensor 101, and each axis of the X, Y, and Z axes of the XYZ coordinate system is set from the acceleration sensor 101. Detection signals Sx, Sy, and Sz that detect the acceleration in the direction are output.

加速度センサ１０１は、当該加速度センサ１０１に仮想設定されるＸＹＺ座標系のＸ軸方向とＹ軸方向を話者Ｑの顔の横方向（左右方向）と縦方向（上下方向）にそれぞれ一致させて話者Ｑの顔の頬若しくは口元の周辺部に取り付けられる。 The acceleration sensor 101 matches the X-axis direction and Y-axis direction of the XYZ coordinate system virtually set in the acceleration sensor 101 with the horizontal direction (left-right direction) and vertical direction (vertical direction) of the face of the speaker Q, respectively. It is attached to the cheek of the face of the speaker Q or the peripheral part of the mouth.

話者Ｑの顔の頬若しくは口元の周辺部に加速度センサ１０１が取り付けられると、図３に示すように、加速度センサ１０１に仮想設定されているＸＹＺ座標系は、話者Ｑの顔の加速度を計測する直交座標系となる。 When the acceleration sensor 101 is attached to the cheek of the face of the speaker Q or the periphery of the mouth, the XYZ coordinate system virtually set in the acceleration sensor 101 indicates the acceleration of the face of the speaker Q as shown in FIG. It becomes an orthogonal coordinate system to measure.

すなわち、話者Ｑの顔の加速度センサ１０１の取り付け位置がＸＹＺ座標系の原点Ｏとなり、話者Ｑの顔の表面（通常、曲面）の原点Ｏにおける横方向の接線がＸ軸方向となり、縦方向の接線がＹ軸方向となる。また、話者Ｑの顔の表面の原点Ｏにおける法線方向がＺ軸方向となる。 That is, the attachment position of the acceleration sensor 101 on the face of the speaker Q is the origin O of the XYZ coordinate system, and the tangent in the horizontal direction at the origin O of the surface (usually a curved surface) of the speaker Q is the X-axis direction. The tangent of the direction is the Y-axis direction. Further, the normal direction at the origin O of the surface of the face of the speaker Q is the Z-axis direction.

話者Ｑが頬筋や口輪筋を動かしていない顔の表情を基準表情とすると、その基準表情におけるＸＹＺ座標系の原点Ｏの位置は話者Ｑの頬や口角の動きを検出する検出位置Ｐｒとなる。話者Ｑが頬筋や口輪筋を動かして顔の表情を変化させると、その変化に応じて加速度センサ１０１が変位し、話者Ｑの顔に取り付けられた加速度センサ１０１から話者Ｑの顔の横方向の加速度を検出した検出信号Ｓｘと、話者Ｑの顔の縦方向の加速度を検出した検出信号Ｓｙと、話者Ｑの顔の法線方向の加速度を検出した検出信号Ｓｚが出力される。 If the facial expression of the speaker Q whose cheek and muzzle muscles are not moved is a reference facial expression, the position of the origin O of the XYZ coordinate system in the reference facial expression is a detection position for detecting the movement of the speaker's cheek and mouth corner. Pr. When the speaker Q moves the cheek muscles or the muzzle muscles to change the facial expression, the acceleration sensor 101 is displaced according to the change, and the acceleration sensor 101 attached to the speaker Q's face moves the speaker Q. A detection signal Sx that detects the lateral acceleration of the face, a detection signal Sy that detects the vertical acceleration of the face of the speaker Q, and a detection signal Sz that detects the acceleration in the normal direction of the face of the speaker Q. Is output.

加速度センサ１０１が変位すると、図４に示すように、話者Ｑの頬や口角の動きを検出する検出位置Ｐｒが基準表情のときの位置（以下、この位置を「基準位置」という。）から変位する。検出位置Ｐｒの基準位置からの変位は変位ベクトルＶによって表され、その変位ベクトルＶのＸ軸、Ｙ軸及びＺ軸方向の各軸方向の成分Ｘｄ、Ｙｄ、Ｚｄは、検出信号Ｓｘ，Ｓｙ，Ｓｚを用いてそれぞれ算出することができる。 When the acceleration sensor 101 is displaced, as shown in FIG. 4, the detection position Pr for detecting the movement of the cheek or mouth corner of the speaker Q is a position when the reference expression is used (hereinafter, this position is referred to as “reference position”). Displace. The displacement of the detection position Pr from the reference position is represented by a displacement vector V. The components Xd, Yd, and Zd of the displacement vector V in the X-axis, Y-axis, and Z-axis directions are detected signals Sx, Sy, Each can be calculated using Sz.

座標変換部１０２は、加速度センサ１０１から出力される検出信号Ｓｘ，Ｓｙ，Ｓｚを用いて、変位ベクトルＶのＸ軸方向、Ｙ軸方向及びＺ軸方向の座標Ｘｄ、Ｙｄ、Ｚｄに変換する。座標変換部１０２は、所定の周期で加速度センサ１０１から検出信号Ｓｘ，Ｓｙ，Ｓｚを取り込み、検出信号毎に積分処理を２回行って各軸方向の距離、すなわち、検出位置Ｐｒが基準位置から変位した位置Ｐｄ（以下、「変位位置Ｐｄ」という。）の座標Ｘｄ、Ｙｄ、Ｚｄを算出する。 The coordinate conversion unit 102 converts the displacement vector V into coordinates Xd, Yd, and Zd in the X-axis direction, the Y-axis direction, and the Z-axis direction using the detection signals Sx, Sy, and Sz output from the acceleration sensor 101. The coordinate conversion unit 102 takes in the detection signals Sx, Sy, Sz from the acceleration sensor 101 at a predetermined period, performs integration processing twice for each detection signal, and detects the distance in each axis direction, that is, the detection position Pr from the reference position. Coordinates Xd, Yd, Zd of the displaced position Pd (hereinafter referred to as “displaced position Pd”) are calculated.

変換テーブル記憶部１０３は、座標変換部１０２が算出した変位位置Ｐｄの座標Ｘｄ、Ｙｄ、Ｚｄを、「あ」、「い」、「う」、…「ん」の音素のデータに変換するための変換テーブルを格納している。音素のデータは、例えば、周知の文字コード表における音素に対応する文字（ひらかな又はカタカナ）のコードデータである。 The conversion table storage unit 103 converts the coordinates Xd, Yd, Zd of the displacement position Pd calculated by the coordinate conversion unit 102 into phoneme data of “A”, “I”, “U”,. The conversion table is stored. The phoneme data is, for example, code data of characters (hirakana or katakana) corresponding to phonemes in a well-known character code table.

話者Ｑが「あ」、「い」、「う」等の音素を発声するとき、話者Ｑの口の形は、発生する音素によって異なる。例えば、「あ」、「い」、「う」の母音を発声する場合、口の形は、通常、図５の（ａ）〜（ｃ）に示すようになる。 When speaker Q utters a phoneme such as “A”, “I”, “U”, etc., the mouth shape of speaker Q varies depending on the phoneme generated. For example, when vowels of “A”, “I”, and “U” are uttered, the mouth shape is usually as shown in FIGS.

すなわち、「あ」を発声するときの口の形は、上唇と下唇を上下に大きく開けた形となり（図５（ａ））、「い」を発声するときの口の形は、上唇と下唇を上下に少し開いた状態で口角を横に引っ張る形となり（図５（ｂ））、「う」を発声するときの口の形は、唇を突き出すように口を窄めた形となる（図５（ｃ））。 That is, the shape of the mouth when uttering “a” is a shape with the upper lip and lower lip greatly opened up and down (FIG. 5A), and the shape of the mouth when uttering “i” is the upper lip and With the lower lip slightly opened up and down, the corner of the mouth is pulled sideways (Fig. 5 (b)), and the shape of the mouth when saying "U" is a shape that narrows the mouth so that the lips protrude. (FIG. 5C).

音素によって話者Ｑの口の形が異なるので、話者Ｑが言葉を発するとき、各音素の発声時の話者Ｑの頬若しくは口元の周辺部に取り付けた加速度センサ１０１の位置（検出位置Ｐｒ)の動き（変位量）が異なる。例えば、加速度センサ１０１が話者Ｑの口角の真横に取り付けられている場合、例えば、「あ」を発声したときの検出位置Ｐｒの変位ベクトルＶは、Ｘ方向とＹ方向の変位量は小さく、Ｚ方向の変位量がＸ方向，Ｙ方向の変位量に対して大きくなる傾向がある。一方、「い」を発声したときの検出位置Ｐｒの変位ベクトルＶは、Ｙ方向の変位量は小さく、Ｙ方向の変位量がＸ方向の変位量に対して大きくなる傾向がある。 Since the shape of the mouth of the speaker Q differs depending on the phoneme, when the speaker Q speaks, the position of the acceleration sensor 101 attached to the cheek of the speaker Q or the periphery of the mouth (detection position Pr) when each phoneme is uttered ) Movement (displacement) is different. For example, when the acceleration sensor 101 is mounted directly beside the mouth angle of the speaker Q, for example, the displacement vector V of the detection position Pr when “a” is uttered has a small amount of displacement in the X direction and the Y direction. The amount of displacement in the Z direction tends to increase with respect to the amount of displacement in the X and Y directions. On the other hand, the displacement vector V at the detection position Pr when “I” is uttered tends to have a small amount of displacement in the Y direction and a large amount of displacement in the Y direction relative to the amount of displacement in the X direction.

このように、話者Ｑが音素を発声したときの検出位置Ｐｒの変位ベクトルＶは、各音素に応じた特徴的な変化をするので、変換テーブル記憶部１０３には、検出位置Ｐｒの変位ベクトルＶ（変位位置Ｐｄの座標）と音素のデータとの対応関係を示すデータが変換テーブルとして記憶されている。この変換テーブルは、本発明に係る「第１の変換テーブル」に対応するものである。 Thus, since the displacement vector V of the detection position Pr when the speaker Q utters a phoneme changes characteristically according to each phoneme, the conversion table storage unit 103 stores the displacement vector of the detection position Pr. Data indicating a correspondence relationship between V (coordinates of the displacement position Pd) and phoneme data is stored as a conversion table. This conversion table corresponds to the “first conversion table” according to the present invention.

変換テーブルの音素に対応する変位ベクトルＶ（Ｘ，Ｙ，Ｚ）には、例えば、予め音声指令入力装置１を使用する話者Ｑに「あ」、「い」、「う」等の音素を発声させて取得した変位ベクトルＶ（Ｘ，Ｙ，Ｚ）の実測値が用いられる。多数の話者Ｑに音素を発声させて変位ベクトルＶ（Ｘ，Ｙ，Ｚ）を計測し、その計測値に基づいて算出した代表的な変位ベクトルＶ（Ｘ，Ｙ，Ｚ）を、変換テーブルの音素に対応する変位ベクトルＶ（Ｘ，Ｙ，Ｚ）に用いてもよい。代表的な変位ベクトルＶ（Ｘ，Ｙ，Ｚ）は、例えば、ｎ人の変位ベクトルの計測値（Ｘ_ｍ，Ｙ_ｍ，Ｚ_ｍ）の平均値（ΣＸ_ｍ／ｎ，ΣＹ_ｍ／ｎ，ΣＺ_ｍ／ｎ）によって定義することができる。 In the displacement vector V (X, Y, Z) corresponding to the phoneme in the conversion table, for example, a phoneme such as “A”, “I”, “U” is given to the speaker Q who uses the voice command input device 1 in advance. The measured value of the displacement vector V (X, Y, Z) acquired by speaking is used. A large number of speakers Q utter phonemes to measure displacement vectors V (X, Y, Z), and a representative displacement vector V (X, Y, Z) calculated based on the measured values is converted into a conversion table. May be used for the displacement vector V (X, Y, Z) corresponding to the phonemes. Representative displacement vector V (X, Y, Z), for example, measured values for the n displacement vector _{_{_{(X m, Y m, Z}}} m) the mean value of _{_{(ΣX m / n, ΣY m}} / n, ΣZ _m / n).

図６は、変換テーブル記憶部１０３に記憶されている変換テーブルの一例を示す図である。 FIG. 6 is a diagram illustrating an example of a conversion table stored in the conversion table storage unit 103.

変換テーブルは、「あ」、「い」、…「ん」の各音素のデータと、予め取得された変位ベクトルＶｉ（Ｘｉ，Ｙｉ，Ｚｉ）（ｉは、音素を区別するための便宜上の符合。ｉ＝１，２，…）との対応関係を示すデータである。図６の表では、音素の欄に、音素に対応する「ひらがな」の文字を記載しているが、変換テーブルでは「ひらかな」又は「カタカナ」の文字コード表におけるコードデータが記載されている。 The conversion table includes the phoneme data “a”, “i”,... “N” and the displacement vector Vi (Xi, Yi, Zi) (i is a code for convenience for distinguishing phonemes. .., I = 1, 2,... In the table of FIG. 6, the “Hiragana” character corresponding to the phoneme is described in the phoneme column, but the code data in the character code table of “Hiragana” or “Katakana” is described in the conversion table. .

図６に示す変換テーブルには、５０音の音素について、変位位置Ｐｄの座標Ｘｄ、Ｙｄ、Ｚｄを音素の文字コードに変換するデータが含まれているが、音声認識の対象とならない音素については、変換データがなくてもよい。例えば、音声指令入力装置１を溶接システムに用いる場合、溶接システムの音声指令に登録される指令の言葉（以下、「指令語」という。）は限られており、その指令語に使用される音素も限られるので、音声の指令語に使用されない音素については、変換データがなくてもよい。 The conversion table shown in FIG. 6 includes data for converting the coordinates Xd, Yd, and Zd of the displacement position Pd into phoneme character codes for 50 phonemes, but for phonemes that are not subject to speech recognition. The conversion data may not be present. For example, when the voice command input device 1 is used in a welding system, command words (hereinafter referred to as “command words”) registered in the voice command of the welding system are limited, and phonemes used for the command words are limited. Therefore, there is no need for conversion data for phonemes that are not used as speech command words.

例えば、溶接システムの音声指令に使用される用語が［かいし］、「ていし」、「でんりゅう」、「でんあつ」、「あげる」、「さげる」などの簡単な言葉で、例えば、「を」、「わ」などの音素が用いられない場合は、「を」、「わ」の音素については変換テーブルに変換データが含まれていなくてもよい。 For example, the terminology used for welding system voice commands is simple words such as [Kaishi], “Tashishi”, “Denryu”, “Denatsu”, “Give”, “Sageru”, etc. , “Wa”, “Wa” and other phonemes are not used, the conversion table does not have to include conversion data for the phonemes of “Wa” and “Wa”.

音声変換部１０４は、座標変換部１０２から出力される変位位置Ｐｄの座標Ｘｄ、Ｙｄ、Ｚｄを変換テーブル記憶部１０３に記憶された変換テーブルを参照して話者Ｑが発声した音声のデータ（音素の列を文字コードの列に変換したデータ）に変換する。 The voice conversion unit 104 refers to the conversion table stored in the conversion table storage unit 103 for the coordinates Xd, Yd, and Zd of the displacement position Pd output from the coordinate conversion unit 102, and the voice data ( Phoneme string to character code string).

例えば、座標変換部１０２から出力される変位位置Ｐｄの座標Ｘｄ、Ｙｄ、Ｚｄが（Ｘ１，Ｙ１，Ｚ１）を中心とした所定の誤差範囲（±ΔＸ，±ΔＹ，±ΔＺ）内であれば、音声変換部１０４は、「あ」の音素（文字コード）に変換する。同様に、変位位置Ｐｄの座標Ｘｄ、Ｙｄ、Ｚｄが（Ｘ６，Ｙ６，Ｚ６）であれば、音声変換部１０４は、「か」の音素（文字コード）に変換する。従って、座標変換部１０２から（Ｘ１，Ｙ１，Ｚ１）と（Ｘ６，Ｙ６，Ｚ６）の座標データが順番に出力されると、音声変換部１０４は、「あ」と「か」の音素（文字コード）に順番に変換して出力する。 For example, if the coordinates Xd, Yd, Zd of the displacement position Pd output from the coordinate conversion unit 102 are within a predetermined error range (± ΔX, ± ΔY, ± ΔZ) centered on (X1, Y1, Z1). The voice conversion unit 104 converts the phoneme (character code) to “A”. Similarly, if the coordinates Xd, Yd, Zd of the displacement position Pd are (X6, Y6, Z6), the speech conversion unit 104 converts the phoneme (character code) to “ka”. Therefore, when the coordinate data of (X1, Y1, Z1) and (X6, Y6, Z6) is output in order from the coordinate conversion unit 102, the voice conversion unit 104 will generate phonemes (characters) of “a” and “ka”. Code) in order and output.

音声認識部１０５は、音声変換部１０４から出力される文字列を言語モデル記憶部１０６に記憶されている言語モデルを用いて認識する処理（話者Ｑが発声した音声の認識処理）を行う。言語モデル記憶部１０６は、音声認識処理における周知の言語モデルを記憶する。言語モデルとは、日本語言語における文の品詞、統語構造、単語同士の関係性などを定式化したものである。 The speech recognition unit 105 performs processing for recognizing the character string output from the speech conversion unit 104 using the language model stored in the language model storage unit 106 (recognition processing of speech uttered by the speaker Q). The language model storage unit 106 stores a well-known language model in the speech recognition process. A language model is a formulation of part of speech, syntactic structure, relationship between words, etc. in a Japanese language.

音声認識部１０５は、例えば、音声変換部１０４から入力される文字コードの列が「よ」、「う」、「せ」、「つ」、「か」、「い」、「し」の場合、音声認識部１０５は、「ようせつ」＋「かいし」＝「溶接開始」と認識する処理を行う。 For example, when the character code string input from the voice conversion unit 104 is “yo”, “u”, “se”, “tsu”, “ka”, “i”, “shi”, the voice recognition unit 105 The voice recognition unit 105 performs a process of recognizing “Yosetsu” + “Kaishi” = “Start welding”.

指令信号変換部１０７は、音声認識部１０５から出力される音声認識の結果を指令語記憶部１０８に記憶されている指令語を用いて指令信号に変換する処理を行う。指令語記憶部１０８は、音声指令として登録された指令語と指令信号との対応関係を示す変換テーブルを記憶する。この変換テーブルは、本発明に係る「第２の変換テーブル」に対応するものである。 The command signal conversion unit 107 performs processing for converting the result of speech recognition output from the speech recognition unit 105 into a command signal using a command word stored in the command word storage unit 108. The command word storage unit 108 stores a conversion table indicating a correspondence relationship between command words registered as voice commands and command signals. This conversion table corresponds to the “second conversion table” according to the present invention.

指令信号は、例えば、制御部にアクチュエータの動作の制御を行わせるために当該制御部に入力する電気信号である。例えば、操作部に「動作開始」の操作ボタンが設けられ、その操作ボタンが操作されたときに制御部に「動作開始」に対応する所定の指令信号Ｓｃが入力される構成の場合、「動作開始」の指令語には「指令信号Ｓｃ」の指令信号が対応付けられる。 The command signal is, for example, an electrical signal that is input to the control unit in order to cause the control unit to control the operation of the actuator. For example, in the case where an operation button “start operation” is provided in the operation unit and a predetermined command signal Sc corresponding to “start operation” is input to the control unit when the operation button is operated, The command signal “command signal Sc” is associated with the command word “start”.

従って、指令信号変換部１０７は、音声認識部１０５から出力される音声認識の結果が「動作開始」の指令語の場合、指令語記憶部１０８に記憶された指令語を指令信号に変換するための変換テーブルを用いてその指令語に対応する指令信号Ｓｃを生成して出力する。 Therefore, the command signal conversion unit 107 converts the command word stored in the command word storage unit 108 into a command signal when the voice recognition result output from the voice recognition unit 105 is a command word “operation start”. Is used to generate and output a command signal Sc corresponding to the command word.

上記のように、本実施の形態に係る音声指令入力装置１によれば、話者（作業者）Ｑの頬若しくは口元の周辺部に加速度センサ１０１を取り付け、話者Ｑが音声指令を発声したときの加速度センサ１０１の検出信号Ｓｘ，Ｓｙ，Ｓｚを用いて話者Ｑの口の動きを検出し、その動きから話者Ｑが発声した音声指令を認識するようにしているので、例えば、話者Ｑの発声した音声指令をマイクで集音し、その集音した音声信号を用いて音声を認識する場合に比べて話者Ｑの周囲で発生する雑音や騒音の影響を殆ど受けることがなく、高い精度で音声指令を認識して当該音声指令に対応する指令信号を出力させることができる。 As described above, according to the voice command input device 1 according to the present embodiment, the acceleration sensor 101 is attached to the cheek of the speaker (worker) Q or the periphery of the mouth, and the speaker Q utters a voice command. Since the movement of the mouth of the speaker Q is detected using the detection signals Sx, Sy, Sz of the acceleration sensor 101 and the voice command uttered by the speaker Q is recognized from the movement, for example, The voice command uttered by the speaker Q is collected by the microphone, and is hardly affected by noise or noise generated around the speaker Q, compared to the case where the voice is recognized using the collected voice signal. It is possible to recognize a voice command with high accuracy and output a command signal corresponding to the voice command.

次に、本実施の形態に係る音声指令入力装置１の応用例として、音声による操作指令が入力可能な溶接システムについて説明する。 Next, as an application example of the voice command input device 1 according to the present embodiment, a welding system capable of inputting a voice operation command will be described.

図７は、音声指令入力装置１を備える溶接システムの構成を示すブロック図である。 FIG. 7 is a block diagram illustrating a configuration of a welding system including the voice command input device 1.

図７に示す溶接システム４は、音声による操作指令が入力可能な消耗電極式の半自動アーク溶接システムである。溶接システム４は、溶接電源装置５、ワイヤ送給装置６、溶接トーチ７、ガス供給装置８及びワイヤリール９を含み、ワイヤ送給装置６には音声により操作指令を入力するための上述した音声指令入力装置１が設けられている。 A welding system 4 shown in FIG. 7 is a consumable electrode type semi-automatic arc welding system capable of inputting a voice operation command. The welding system 4 includes a welding power source device 5, a wire feeding device 6, a welding torch 7, a gas supply device 8, and a wire reel 9, and the above-described voice for inputting an operation command by voice to the wire feeding device 6. A command input device 1 is provided.

溶接電源装置５は、溶接ワイヤＷと母材Ｂとの間に供給する電力（溶接用電力）と、ワイヤ送給装置６のワイヤ送給機構６０１に供給する電力（ワイヤ送給用電力）を生成し、ワイヤ送給装置６を介して溶接用電力を溶接トーチ７に供給するとともに、ワイヤ送給用電力をワイヤ送給装置６のワイヤ送給機構６０１に供給する機能を有する。 The welding power source device 5 supplies power (welding power) supplied between the welding wire W and the base material B and power (wire feeding power) supplied to the wire feeding mechanism 601 of the wire feeding device 6. The welding power is generated and supplied to the welding torch 7 via the wire feeding device 6, and the wire feeding power is supplied to the wire feeding mechanism 601 of the wire feeding device 6.

溶接電源装置５は、機能ブロックとして、電源部５０１、制御部５０２、操作部５０３及びは表示部５０４を含む。電源部５０１、操作部５０３及び表示部５０４は、制御部５０２に接続されている。 The welding power supply device 5 includes a power supply unit 501, a control unit 502, an operation unit 503, and a display unit 504 as functional blocks. The power supply unit 501, the operation unit 503, and the display unit 504 are connected to the control unit 502.

電源部５０１は、溶接電源装置５の駆動電力を生成するとともに、ワイヤ送給装置６の駆動電力と溶接用電力を生成する。ワイヤ送給装置６の駆動電力は、主として制御部６０３や表示部６０５などを駆動するための電力と溶接作業中に溶接ワイヤＷを溶接トーチ７に送給するための駆動電力である。溶接用電力は、主として溶接ワイヤＷと母材Ｂとの間にアークＡを発生させるための電力とアーク発生後の溶接作業用の電力である。 The power supply unit 501 generates drive power for the welding power supply device 5 and also generates drive power for the wire feeding device 6 and welding power. The driving power of the wire feeding device 6 is mainly driving power for driving the control unit 603 and the display unit 605 and driving power for feeding the welding wire W to the welding torch 7 during welding work. The electric power for welding is mainly electric power for generating the arc A between the welding wire W and the base material B and electric power for welding work after the arc is generated.

電源部５０１の電力発生動作は、制御部５０２によって制御される。ワイヤ送給装置６は、パワーケーブル（図示省略）と制御ケーブル（図示省略）とによって溶接電源装置５に接続される。溶接電源装置５の電源部５０１で生成されたワイヤ送給装置６の駆動電力と溶接トーチ７への溶接用電力は、パワーケーブルでワイヤ送給装置６に供給され、溶接用電力は、更にワイヤ送給装置６を介して溶接ワイヤＷと母材Ｂとの間に供給される。溶接電源装置５の制御部５０２とワイヤ送給装置６の制御部６０３との間のデータ通信は、制御ケーブルを介して行われる。 The power generation operation of the power supply unit 501 is controlled by the control unit 502. The wire feeding device 6 is connected to the welding power source device 5 by a power cable (not shown) and a control cable (not shown). The driving power of the wire feeding device 6 generated by the power supply unit 501 of the welding power source 5 and the power for welding to the welding torch 7 are supplied to the wire feeding device 6 by a power cable. It is supplied between the welding wire W and the base material B via the feeding device 6. Data communication between the control unit 502 of the welding power source device 5 and the control unit 603 of the wire feeding device 6 is performed via a control cable.

電源部５０１は、溶接用電力を生成する溶接用電源５０１１と溶接ワイヤ送給用の電力を生成するワイヤ送給用電源５０１２とを有する。溶接用電源５０１１は、三相交流の商用電源Ｐから制御部５０２の制御指令に基づく所定の溶接用直流電力又は溶接用交流電力を生成する。溶接用電力は、溶接中の作業者の操作指令や予め設定されたプログラムに基づいて変化する。 The power supply unit 501 includes a welding power source 5011 that generates welding power and a wire feeding power source 5012 that generates welding wire feeding power. A welding power source 5011 generates predetermined welding DC power or welding AC power based on a control command of the control unit 502 from a three-phase AC commercial power source P. The welding power changes based on an operation command of a worker during welding or a preset program.

ワイヤ送給用電源５０１２は、例えば、ワイヤ送給装置６のワイヤ送給機構６０１が直流モータで溶接ワイヤＷを送給する場合、商用電源Ｐから直流電力を生成する周知の直流電源で構成されている。ワイヤ送給用電源５０１２は、商用電源Ｐから直流電力を生成し、ワイヤ送給機構６０１に規定されている駆動電圧でパワーケーブルを介してワイヤ送給装置６に出力する。 For example, when the wire feeding mechanism 601 of the wire feeding device 6 feeds the welding wire W with a DC motor, the wire feeding power source 5012 is configured by a known DC power source that generates DC power from the commercial power source P. ing. The wire feeding power source 5012 generates DC power from the commercial power source P, and outputs it to the wire feeding device 6 via the power cable at a driving voltage defined in the wire feeding mechanism 601.

電源部５０１は、電力の発生動作に同期してガス供給装置８からシールドガスを溶接トーチ７に供給する制御を行う。電源部５０１は、制御ケーブルによってワイヤ送給装置６内のガス電磁弁６０２に接続されている。電源部５０１は、電力の生成の開始に同期してガス電磁弁６０２を「開」状態に制御し、電力の生成の停止に同期してガス電磁弁６０２を「閉」状態に制御する。 The power supply unit 501 performs control to supply the shield gas from the gas supply device 8 to the welding torch 7 in synchronization with the power generation operation. The power supply unit 501 is connected to the gas electromagnetic valve 602 in the wire feeder 6 by a control cable. The power supply unit 501 controls the gas electromagnetic valve 602 to the “open” state in synchronization with the start of power generation, and controls the gas electromagnetic valve 602 to the “closed” state in synchronization with the stop of power generation.

制御部５０２は、操作部５０３による情報の入力、表示部５０４よる情報の出力、電源部５０１によるワイヤ送給装置６への電源供給などの各処理を統括的に制御する。 The control unit 502 comprehensively controls processes such as information input by the operation unit 503, information output by the display unit 504, and power supply to the wire feeding device 6 by the power supply unit 501.

制御部５０２は、例えば、ＲＯＭ（ＲｅａｄＯｎｌｙＭｅｍｏｒｙ）、ＲＡＭ（ＲａｎｄｏｍＡｃｃｅｓｓＭｅｍｏｒｙ）及びＭＰＵ（Ｍｉｃｒｏ−ＰｒｏｃｅｓｓｉｎｇＵｎｉｔ）を有するマイクロコンピュータで実現されている。なお、制御部５０２は、ＦＰＧＡ（Ｆｉｅｌｄ−ＰｒｏｇｒａｍｍａｂｌｅＧａｔｅＡｒｒａｙ）などのＰＬＤ（ＰｒｏｇｒａｍｍａｂｌｅＬｏｇｉｃＤｅｖｉｃｅ）で実現してもよい。 The control unit 502 is realized by, for example, a microcomputer having a ROM (Read Only Memory), a RAM (Random Access Memory), and an MPU (Micro-Processing Unit). Note that the control unit 502 may be realized by a programmable logic device (PLD) such as a field-programmable gate array (FPGA).

操作部５０３は、作業者がアーク溶接に関する各種の情報を制御部５０２に入力する操作を行うブロックである。操作部５０３は、例えば、溶接電源装置５の筐体面に配設された操作パネルに設けられている。作業者が操作する操作部材は、例えば、操作パネルに所定の項目毎に配設された１又は２以上の操作ボタンや電源スイッチなどで構成される。なお、操作部５０３は、タッチパネルやキーボードなどの他の種類の入力装置で構成してもよい。 The operation unit 503 is a block that allows an operator to input various information related to arc welding to the control unit 502. The operation unit 503 is provided on, for example, an operation panel disposed on the casing surface of the welding power source device 5. The operation member operated by the worker includes, for example, one or more operation buttons or a power switch arranged on the operation panel for each predetermined item. Note that the operation unit 503 may be configured by other types of input devices such as a touch panel and a keyboard.

作業者が操作部５０３の操作ボタンを操作すると、その操作ボタンに対応する所定の指令信号が制御部５０２に入力される。 When the operator operates the operation button of the operation unit 503, a predetermined command signal corresponding to the operation button is input to the control unit 502.

作業者が操作部５０３から入力するアーク溶接に関する情報は、例えば、スタート方式や交流／直流の溶接方式などの方式に関する情報や、初期電流、溶接電圧、溶接電流、交流波形（パルス波形や正弦波波形などの波形）、交流溶接における周波数などの溶接条件に関する情報などである。操作部５０３から制御部５０２に入力されるアーク溶接に関する情報は、一旦メモリ（ＲＡＭ）に保存され、表示部５０４に転送される。 Information on arc welding input by the operator from the operation unit 503 includes, for example, information on a start method, an AC / DC welding method, an initial current, a welding voltage, a welding current, an AC waveform (a pulse waveform or a sine wave). Waveform), information on welding conditions such as frequency in AC welding, and the like. Information relating to arc welding input from the operation unit 503 to the control unit 502 is temporarily stored in a memory (RAM) and transferred to the display unit 504.

表示部５０４は、アーク溶接に関する各種の情報を作業者が視認できるように表示する。表示部５０４は、例えば、ＬＥＤや７セグメントなどの表示機で構成される。表示部５０４は、液晶ディスプレイなどの表示装置で構成してもよい。表示部５０４は、制御部５０２から転送される情報を所定の表示態様で表示する。 The display unit 504 displays various information related to arc welding so that an operator can visually recognize the information. The display unit 504 is configured by a display device such as an LED or 7 segments, for example. The display unit 504 may be configured by a display device such as a liquid crystal display. The display unit 504 displays information transferred from the control unit 502 in a predetermined display mode.

ワイヤ送給装置６は、機能ブロックとして、ワイヤ送給機構６０１、ガス電磁弁６０２、制御部６０３、操作部６０４、表示部６０５及び電圧／電流検出部６０６を含む。ワイヤ送給装置６には、上述した音声指令入力装置１が設けられている。操作部６０４、表示部６０５、電圧／電流検出部６０６及び音声指令入力装置１は、制御部６０３に接続されている。 The wire feeding device 6 includes a wire feeding mechanism 601, a gas electromagnetic valve 602, a control unit 603, an operation unit 604, a display unit 605, and a voltage / current detection unit 606 as functional blocks. The wire feeder 6 is provided with the voice command input device 1 described above. The operation unit 604, the display unit 605, the voltage / current detection unit 606, and the voice command input device 1 are connected to the control unit 603.

ワイヤ送給機構６０１は、溶接作業中に溶接ワイヤＷが巻き付けられたワイヤリール９から当該溶接ワイヤＷを繰り出して溶接トーチ７に連続的に供給するための機構である。ワイヤ送給機構６０１は、ワイヤリール９から溶接トーチ７まで溶接ワイヤＷをガイドするガイド機構と溶接ワイヤＷを繰り出す駆動源であるワイヤ送給モータとを有する。ワイヤ送給モータは、溶接電源装置５から供給されるワイヤ送給用電力によって駆動される。 The wire feeding mechanism 601 is a mechanism for feeding the welding wire W from the wire reel 9 around which the welding wire W is wound during the welding operation and continuously supplying the welding wire W to the welding torch 7. The wire feeding mechanism 601 includes a guide mechanism that guides the welding wire W from the wire reel 9 to the welding torch 7 and a wire feeding motor that is a drive source for feeding the welding wire W. The wire feed motor is driven by the wire feed power supplied from the welding power source device 5.

ガス電磁弁６０２は、ガス供給装置８から溶接トーチ７へのシールドガスの供給を制御するための弁である。シールドガスは、溶接面を空気から遮断するためのアルゴン（Ａｒ）、ヘリウム（Ｈｅ）といった不活性ガスである。シールドガスは、ガス供給装置８に封入されており、作業者が溶接作業を開始する際にガス供給装置８の栓を開くことによって、所定の圧力で溶接トーチ７への供給が可能になる。 The gas electromagnetic valve 602 is a valve for controlling the supply of shield gas from the gas supply device 8 to the welding torch 7. The shield gas is an inert gas such as argon (Ar) or helium (He) for shielding the welding surface from air. The shield gas is sealed in the gas supply device 8 and can be supplied to the welding torch 7 at a predetermined pressure by opening the plug of the gas supply device 8 when the operator starts the welding operation.

ガス電磁弁６０２は、溶接開始から溶接終了までの期間に流路を開いてガス供給装置８から溶接トーチ７にシールドガスを供給する。ガス電磁弁６０２の開閉は、上述したように、溶接電源装置５の電源部５０１によって制御される。 The gas solenoid valve 602 opens the flow path during the period from the start of welding to the end of welding and supplies the shield gas from the gas supply device 8 to the welding torch 7. The opening and closing of the gas electromagnetic valve 602 is controlled by the power supply unit 501 of the welding power supply device 5 as described above.

制御部６０３は、操作部６０４による情報の入力、表示部６０５による情報の出力、ワイヤ送給機構６０１による溶接ワイヤＷの送給などの各処理を統括的に制御する。制御部６０３は、ＲＯＭ、ＲＡＭ及びＭＰＵを有するマイクロコンピュータで実現されている。制御部６０３もＦＰＧＡなどのＰＬＤで実現してもよい。 The control unit 603 comprehensively controls each process such as input of information by the operation unit 604, output of information by the display unit 605, and feeding of the welding wire W by the wire feeding mechanism 601. The control unit 603 is realized by a microcomputer having a ROM, a RAM, and an MPU. The control unit 603 may also be realized by a PLD such as an FPGA.

操作部６０４は、ワイヤ送給装置６の筐体面に配設された操作パネルに設けられている。作業者が操作する操作部材は、例えば、操作パネルに所定の項目毎に配設された１又は２以上の操作ボタンや電源スイッチなどで構成される。なお、操作部６０４もタッチパネルなどの他の種類の入力装置で構成してもよい。 The operation unit 604 is provided on an operation panel disposed on the housing surface of the wire feeding device 6. The operation member operated by the worker includes, for example, one or more operation buttons or a power switch arranged on the operation panel for each predetermined item. Note that the operation unit 604 may also be configured by another type of input device such as a touch panel.

作業者が操作部６０４の操作ボタンを操作すると、その操作ボタンに対応する所定の指令信号が制御部６０３に入力される。 When the operator operates the operation button of the operation unit 604, a predetermined command signal corresponding to the operation button is input to the control unit 603.

作業者が操作部６０４によって入力するアーク溶接に関する情報は、基本的に、作業者が操作部５０３によって溶接電源装置５の制御部５０２に入力する情報と同じである。操作部６０４から制御部６０３に入力されるアーク溶接に関する情報は、一旦メモリ（ＲＡＭ）に保存され、表示部６０５に転送される。 Information regarding arc welding input by the operator through the operation unit 604 is basically the same as information input by the operator through the operation unit 503 to the control unit 502 of the welding power source device 5. Information regarding arc welding input from the operation unit 604 to the control unit 603 is temporarily stored in a memory (RAM) and transferred to the display unit 605.

溶接トーチ７は、ワイヤ送給機構６０１で送給される溶接ワイヤＷを保持するとともに、ガス供給装置８からガス電磁弁６０２を介して供給されるシールドガスを当該シールドガスが溶接ワイヤＷと母材Ｂとの間に発生するアークＡと溶接金属を覆うように出力する機能を果たす機器である。溶接トーチ７には、作業者が手元で溶接電流を調整するためのトーチスイッチ７０１が設けられている。 The welding torch 7 holds the welding wire W fed by the wire feeding mechanism 601, and the shielding gas supplied from the gas supply device 8 via the gas electromagnetic valve 602 is used as the shielding gas by the welding wire W and the mother wire. It is an apparatus that fulfills the function of outputting so as to cover the arc A generated between the material B and the weld metal. The welding torch 7 is provided with a torch switch 701 for an operator to adjust the welding current at hand.

トーチスイッチ７０１は、制御ケーブル（図示省略）でワイヤ送給装置６の制御部６０３に接続されている。トーチスイッチ７０１は、例えば、押しボタン式のモーメンタリー・スイッチで構成される。制御部６０３には、例えば、トーチスイッチ７０１が「ＯＮ」になると、ハイレベルの信号が入力され、トーチスイッチ７０１が「ＯＦＦ」になると、ローレベルの信号が入力される。 The torch switch 701 is connected to the control unit 603 of the wire feeding device 6 by a control cable (not shown). The torch switch 701 is constituted by, for example, a push button type momentary switch. For example, a high level signal is input to the control unit 603 when the torch switch 701 is “ON”, and a low level signal is input when the torch switch 701 is “OFF”.

作業者は、トーチスイッチ７０１をＯＮにした後、所定の時間ｔが経過してＯＦＦにする操作をすることにより、制御部６０３にパルス幅ｔのパルス信号を入力することができる。制御部６０３には、トーチスイッチ７０１から入力されるパルス信号の波形を特定の操作指令に変換するテーブルが設けられている。 The operator can input a pulse signal having a pulse width t to the control unit 603 by turning the torch switch 701 ON after the predetermined time t has elapsed. The control unit 603 is provided with a table for converting the waveform of the pulse signal input from the torch switch 701 into a specific operation command.

例えば、制御部６０３には、パルス幅ｔが所定の時間よりも大きいパルス信号となるクリック操作を溶接電流の通電又は遮断を行う操作指令に割り当て、パルス幅ｔが所定の時間以内のパルス信号となるシングルクリック操作を所定のステップで溶接電流を減少させる指令信号に割り当て、同パルス信号が所定の時間間隔内で連続して２回発生させるダブルクリック操作を所定のステップで溶接電流を増加させる指令信号に割り当てたテーブルが設けられている。 For example, the control unit 603 assigns a click operation in which a pulse width t is a pulse signal larger than a predetermined time to an operation command for energizing or interrupting a welding current, and a pulse signal having a pulse width t within a predetermined time. A command to increase the welding current in a predetermined step by assigning the single click operation to a command signal for decreasing the welding current in a predetermined step and generating a double-click operation in which the same pulse signal is generated twice continuously within a predetermined time interval. A table assigned to the signals is provided.

作業者は、トーチスイッチ７０１を所定の操作パターンで操作することにより、溶接作業の開始、溶接中の溶接電流の変更、溶接作業の終了などの指令を制御部６０３に入力することができる。制御部６０３は、操作部６０４から指令信号が入力されると、その指令信号を溶接電源装置５の制御部５０２に送信する。 By operating the torch switch 701 with a predetermined operation pattern, the operator can input commands such as the start of the welding operation, the change of the welding current during welding, and the end of the welding operation to the control unit 603. When a command signal is input from operation unit 604, control unit 603 transmits the command signal to control unit 502 of welding power supply device 5.

本実施の形態では、トーチスイッチ７０１をワイヤ送給装置６の制御部６０３に接続しているが、トーチスイッチ７０１を溶接電源装置５の制御部５０２に接続し、トーチスイッチ７０１の操作信号を制御部５０２に直接入力するようにしてもよい。 In the present embodiment, the torch switch 701 is connected to the control unit 603 of the wire feeding device 6, but the torch switch 701 is connected to the control unit 502 of the welding power source device 5 to control the operation signal of the torch switch 701. The data may be input directly to the unit 502.

電圧／電流検出部６０６は、溶接ワイヤＷと母材Ｂとの間の電圧（溶接電圧）と溶接ワイヤＷと母材Ｂとの間に流れる電流（溶接電流）を検出する。電圧／電流検出部６０６は、電圧を検出する電圧センサと、その電圧センサから出力される電圧検出値をデジタル信号に変換して制御部６０３に出力する。また、電圧／電流検出部６０６は、電流を検出する電流センサと、その電流センサから出力される電流検出値をデジタル信号に変換して制御部６０３に出力する。 The voltage / current detection unit 606 detects a voltage (welding voltage) between the welding wire W and the base material B and a current (welding current) flowing between the welding wire W and the base material B. The voltage / current detection unit 606 converts a voltage sensor that detects a voltage and a voltage detection value output from the voltage sensor into a digital signal and outputs the digital signal to the control unit 603. In addition, the voltage / current detection unit 606 converts a current sensor that detects current and a current detection value output from the current sensor into a digital signal and outputs the digital signal to the control unit 603.

制御部６０３には、操作部６０４若しくはトーチスイッチ７０１から作業者によって入力されたアーク溶接に関する情報と、電圧／電流検出部６０６から溶接作業中の溶接電圧検出値と溶接電流検出値に関する情報が入力される。制御部６０３に入力されるこれらの情報は、制御ケーブルを介してデータ通信によりワイヤ送給装置６から溶接電源装置５に送信される。 Information on arc welding input by the operator from the operation unit 604 or the torch switch 701 and information on a welding voltage detection value and a welding current detection value during welding work are input from the voltage / current detection unit 606 to the control unit 603. Is done. These pieces of information input to the control unit 603 are transmitted from the wire feeding device 6 to the welding power source device 5 by data communication via the control cable.

表示部６０５は、例えば、ＬＥＤや７セグメントなどの表示機で構成される。表示部６０５は、液晶ディスプレイなどの表示装置で構成されてもよい。表示部６０５は、制御部６０３から転送される情報に基づいて、所定の表示をする。 The display unit 605 is configured by a display device such as an LED or a 7 segment. The display unit 605 may be configured by a display device such as a liquid crystal display. The display unit 605 performs a predetermined display based on information transferred from the control unit 603.

音声指令入力装置１は、図１に示した音声指令入力装置１の装置本体１００内に設けられる座標変換部１０２、変換テーブル記憶部１０３、音声変換部１０４、音声認識部１０５、言語モデル記憶部１０６、指令信号変換部１０７及び指令語記憶部１０８がワイヤ送給装置６内に設けられ、音声指令入力装置１の加速度センサ１０１は、作業者Ｑの頬若しくは口元の周辺部に取り付けられている。図７では、音声指令入力装置１の装置本体１００内に設けられる部分を音声指令入力部１０９で示している。 The voice command input device 1 includes a coordinate conversion unit 102, a conversion table storage unit 103, a voice conversion unit 104, a voice recognition unit 105, and a language model storage unit provided in the device main body 100 of the voice command input device 1 shown in FIG. 106, a command signal conversion unit 107 and a command word storage unit 108 are provided in the wire feeding device 6, and the acceleration sensor 101 of the voice command input device 1 is attached to the cheek of the worker Q or the peripheral part of the mouth. . In FIG. 7, a voice command input unit 109 indicates a portion provided in the device main body 100 of the voice command input device 1.

ワイヤ送給装置６では、作業者は、操作指令を操作部６０４と音声指令入力装置１のいずれかで入力することができる。このため、ワイヤ送給装置６の操作パネルには、操作指令を入力する装置を選択するための選択ボタンが設けられている。この選択ボタンは、例えば、モーメンタリー・スイッチで構成され、作業者がモーメンタリー・スイッチをオン状態にしている期間に音声指令入力装置１を用いて音声により操作指令を制御部６０３に入力することができる。 In the wire feeding device 6, the operator can input an operation command with either the operation unit 604 or the voice command input device 1. For this reason, the operation panel of the wire feeding device 6 is provided with a selection button for selecting a device for inputting an operation command. This selection button is composed of, for example, a momentary switch, and an operation command can be input to the control unit 603 by voice using the voice command input device 1 during a period in which the worker is turning on the momentary switch. .

音声指令入力装置１における加速度センサ１０１及び座標変換部１０２については、上述した通りであるから、以下では、音声変換部１０４〜指令語記憶部１０８について説明する。 Since the acceleration sensor 101 and the coordinate conversion unit 102 in the voice command input device 1 are as described above, the voice conversion unit 104 to the command word storage unit 108 will be described below.

溶接システム４で音声指令に使用される単語は、５０音の中の一部であるから、音声指令で使用される音素について、予め変換テーブルが作成され、変換テーブル記憶部１０３に記憶されている。例えば、音声指令に登録されている用語が「でんりゅう」、「でんあつ」、「あげる」、「さげる」、「ようせつ」、「かいし」、「ていし」などの場合、「て／で」、「ん」、「りゅ」、「う」、「つ」、「あ」、「け／げ」、「る」、「か」、「さ」、「よ」、「せ」、「い」、「し」などの音素に対する変換テーブルが変換テーブル記憶部１０３に記憶されている。 Since the word used for the voice command in the welding system 4 is a part of the 50 sounds, a conversion table is created in advance and stored in the conversion table storage unit 103 for the phonemes used in the voice command. . For example, if the term registered in the voice command is “Denryu”, “Denatsu”, “Guru”, “Sasaru”, “Yosetsu”, “Kaishi”, “Toshishi”, etc. / De "," n "," ryu "," u "," tsu "," a "," ke / ge "," ru "," ka "," sa "," yo "," se " , “I”, “Shi”, and other phoneme conversion tables are stored in the conversion table storage unit 103.

音声変換部１０４は、座標変換部１０２から入力される変位位置Ｐｄの座標（Ｘｄ，Ｙｄ，Ｚｄ）の列を変換テーブル記憶部１０３に記憶された変換テーブルを参照して作業者Ｑが発声した音声のデータ（音素の列を文字コードの列に変換したデータ）に変換する。 The voice conversion unit 104 utters the operator Q with reference to the conversion table stored in the conversion table storage unit 103 for the column of coordinates (Xd, Yd, Zd) of the displacement position Pd input from the coordinate conversion unit 102. It is converted into speech data (data obtained by converting a phoneme string into a character code string).

音声変換部１０４は、例えば、座標（Ｘｄ，Ｙｄ，Ｚｄ）が（Ｘ１９±ΔＸ，Ｙ１９±ΔＹ，Ｚ１９±ΔＺ）（図６参照）であれば、「て」の音素（文字コード）に変換する。しかし、座標（Ｘｄ，Ｙｄ，Ｚｄ）が変換テーブル記憶部１０３に記憶されていない場合は、音声変換部１０４は、例えば、「？」などの文字でない記号のコードに変換する。 For example, when the coordinates (Xd, Yd, Zd) are (X19 ± ΔX, Y19 ± ΔY, Z19 ± ΔZ) (see FIG. 6), the voice conversion unit 104 converts the phoneme (character code) to “te”. To do. However, when the coordinates (Xd, Yd, Zd) are not stored in the conversion table storage unit 103, the voice conversion unit 104 converts the code into a code of a symbol that is not a character such as “?”.

例えば、座標変換部１０２から音声変換部１０４に、（Ｘ１９±ΔＸ，Ｙ１９±ΔＹ，Ｚ１９±ΔＺ）、（Ｘ２±ΔＸ，Ｙ２±ΔＹ，Ｚ２±ΔＺ）、（Ｘ１２±ΔＸ，Ｙ１２±ΔＹ，Ｚ１２±ΔＺ）（図６参照）の座標のデータが順番に入力された場合、これらの座標を音素に変換する変換データは存在するので、音声変換部１０４は、「て」、「い」、「し」の音素に順番に変換して出力する。 For example, from the coordinate conversion unit 102 to the voice conversion unit 104, (X19 ± ΔX, Y19 ± ΔY, Z19 ± ΔZ), (X2 ± ΔX, Y2 ± ΔY, Z2 ± ΔZ), (X12 ± ΔX, Y12 ± ΔY, Z12 ± ΔZ) (refer to FIG. 6), when the coordinate data is input in order, there is conversion data for converting these coordinates into phonemes. It is converted into “shi” phonemes in order and output.

しかし、音声変換部１０４に２番目に入力された座標のデータが変換テーブル記憶部１０３に記憶されていない音素、例えば、「い」の口の形に似ている「に」の音素の場合、音声変換部１０４は、「て」、「？」、「し」の音素に順番に変換して出力する。 However, in the case where the second coordinate data input to the voice conversion unit 104 is a phoneme that is not stored in the conversion table storage unit 103, for example, a “ni” phoneme that resembles the mouth shape of “i”, The voice conversion unit 104 converts the phonemes into “te”, “?”, And “shi” phonemes in order and outputs them.

音声認識部１０５は、音声変換部１０４から入力される文字コードの列を言語モデル記憶部１０６に記憶されている言語モデルを用いて音声を認識する処理を行う。言語モデル記憶部１０６には、溶接システムにおける指令語で使用される用語や単語のデータが記憶されている。 The speech recognition unit 105 performs a process of recognizing speech using the language model stored in the language model storage unit 106 from the character code string input from the speech conversion unit 104. The language model storage unit 106 stores terminology and word data used in command words in the welding system.

図８は、言語モデル記憶部１０６に記憶されている指令語に使用される用語や単語のデータの一例を示す図である。 FIG. 8 is a diagram illustrating an example of term and word data used for the command word stored in the language model storage unit 106.

図８に示すように、言語モデル記憶部１０６には、「溶接」、「電圧」、「電流」、「上げる」、「下げる」、「開始」、「停止」、「高い」、「低い」、「有り」、「無し」などの溶接作業での指令に使用される用語や単語が登録されている。 As shown in FIG. 8, the language model storage unit 106 stores “welding”, “voltage”, “current”, “raising”, “lowering”, “starting”, “stopping”, “high”, “low”. , “Yes”, “No”, etc., terms and words used for instructions in welding work are registered.

なお、言語モデル記憶部１０６には、ひらがな又はカタカナで２文字以上の溶接用の用語や単語が登録されるが、１文字の用語や単語が登録されていてもよい。以下の説明では、説明の便宜上、言語モデル記憶部１０６には２文字以上の溶接用の用語や単語が登録されるものとする。 In the language model storage unit 106, welding terms and words having two or more characters are registered in hiragana or katakana, but one-letter terms and words may be registered. In the following description, for convenience of explanation, it is assumed that two or more characters or words for welding are registered in the language model storage unit 106.

また、指令に使用される用語や単語に同音異義語の用語や単語が含まれる場合、文字列が異なるように、一方の用語を同意語や類義語に変更して言語モデル記憶部１０６に登録されているものとする。例えば、指令に使用される用語に「加工（かこう）」と「下降（かこう）」の用語が含まれる場合、「下降（かこう）」の用語は「降下（こうか）」の用語に変更して言語モデル記憶部１０６に登録される。これにより、言語モデル記憶部１０６には、指令に使用される用語や単語が、可能な限り同音異義語の用語が含まれないように登録されている。 Also, if the term or word used in the command includes a homonym term or word, one term is changed to a synonym or synonym and registered in the language model storage unit 106 so that the character string is different. It shall be. For example, if the terms used in the command include the terms “machining” and “descent”, the term “descent” is changed to the term “descent”. It is registered in the language model storage unit 106. As a result, the language model storage unit 106 registers terms and words used in the command so as not to include homonym terms as much as possible.

音声認識部１０５は、音声変換部１０４から入力される文字列（文字コードの列）と言語モデル記憶部１０６に記憶されている用語（以下、「登録用語」という。）とを照合して、文字列が完全に一致する用語を話者Ｑが発声した用語と認識する。例えば、音声変換部１０４から入力される文字列が「で」、「ん」、「あ」、「つ」で、言語モデル記憶部１０６に「電圧（でんあつ）」の用語が登録されている場合、音声認識部１０５は、「でんあつ」＝「電圧」と認識する処理を行う。 The voice recognition unit 105 collates a character string (character code string) input from the voice conversion unit 104 with a term (hereinafter referred to as “registered term”) stored in the language model storage unit 106. A term whose character string completely matches is recognized as a term uttered by the speaker Q. For example, the character string input from the voice conversion unit 104 is “de”, “n”, “a”, “tsu”, and the term “voltage” is registered in the language model storage unit 106. If so, the voice recognition unit 105 performs a process of recognizing “denatsu” = “voltage”.

また、音声認識部１０５は、音声変換部１０４から入力される文字列が言語モデル記憶部１０６に記憶されている登録用語と完全に一致しない場合、文字列と同一文字数の用語について、文字列の一致度Ｅを算出し、その一致度Ｅを用いて音声認識処理を行う。 In addition, when the character string input from the voice conversion unit 104 does not completely match the registered term stored in the language model storage unit 106, the voice recognizing unit 105 determines whether the character string has the same number of characters as the character string. The degree of coincidence E is calculated, and voice recognition processing is performed using the degree of coincidence E.

本実施の形態では、例えば、音声変換部１０４から入力される文字列の文字数をｎ（整数）とし、その文字列の文字のうち登録用語の文字に一致している文字の数をｒ（１≦ｒ≦ｎ−１の整数）とすると、文字列の一致度Ｅは、Ｅ＝ｒ／ｎ（＜１）又はＥ＝ｒ×１００／ｎ（＜１００）［％］で定義している。例えば、音声変換部１０４から入力される文字列の文字数が４個で、４文字からなる登録用語と一致している文字の数が３個の場合、一致度Ｅは３／４＝０．７５または７５［％］となる。 In the present embodiment, for example, the number of characters of the character string input from the speech conversion unit 104 is n (integer), and the number of characters that match the registered term character among the characters of the character string is r (1 ≦ r ≦ n−1), the character string matching degree E is defined as E = r / n (<1) or E = r × 100 / n (<100) [%]. For example, when the number of characters in the character string input from the voice conversion unit 104 is four and the number of characters that match a registered term consisting of four characters is three, the degree of match E is 3/4 = 0.75. Or 75 [%].

音声認識部１０５は、同一文字数の登録用語が２以上ある場合、各登録用語について一致度Ｅを算出し、予め設定された閾値Ｅ_ＴＨ以上の一致度Ｅを有する登録用語の中で最も一致度Ｅが高い登録用語を認識対象の用語に選出する。 When there are two or more registered terms having the same number of characters, the speech recognition unit 105 calculates the degree of coincidence E for each registered term, and the degree of coincidence is the highest among registered terms having a coincidence E equal to or higher than a preset threshold E _TH A registered term having a high E is selected as a recognition target term.

閾値Ｅ_ＴＨは、０．５＜Ｅ_ＴＨ＜１．０の範囲で任意に設定することができるが、本実施の形態では、例えば、文字数ｎが奇数の登録用語に対しては閾値Ｅ_ＴＨ１＝２/３≒０．６７を設定し、文字数ｎが偶数の登録用語に対しては閾値Ｅ_ＴＨ２＝３/４＝０．７５を設定している。 The threshold E _TH can be arbitrarily set within the range of 0.5 <E _TH <1.0. However, in the present embodiment, for example, for a registered term having an odd number of characters n, the threshold E _TH1 = 2 / 3≈0.67 is set, and a threshold value E _TH2 = 3/4 = 0.75 is set for a registered term having an even number of characters n.

文字数ｎが奇数の登録用語に対する閾値Ｅ_ＴＨ１＝２／３は、３文字の登録用語で過半数の文字（２文字）が一致する場合の一致度Ｅに相当している。３文字の登録用語の場合は、２／３以上の一致度Ｅを有する場合は２文字が一致する場合しかないので、音声変換部１０４から入力される３文字の文字列のうち２文字が一致した登録用語が認識対象の用語に選出されることになる。また、５文字の登録用語では、２／３以上の一致度Ｅを有する場合は４文字が一致する場合しかないので、音声変換部１０４から入力される５文字の文字列のうち４文字が一致した登録用語が認識対象の用語に選出されることになる。 The threshold value E _TH1 = 2/3 for a registered term with an odd number of characters n corresponds to the degree of coincidence E when a majority of characters (2 characters) match in a registered character of 3 characters. In the case of a registered term of 3 characters, if there is a match E of 2/3 or more, there is only a case where 2 characters match, so 2 characters out of 3 character strings input from the speech converter 104 match. The registered term is selected as the recognition target term. In addition, in the registered term of 5 characters, if the matching degree E is 2/3 or more, there is only a case where 4 characters are matched, so 4 characters out of 5 character strings input from the voice conversion unit 104 are matched. The registered term is selected as the recognition target term.

従って、音声認識部１０５は、音声変換部１０４から入力される文字列が３文字の場合は、文字列のうち２文字が一致した登録用語を話者Ｑが発声した用語と認識する。また、音声認識部１０５は、音声変換部１０４から入力される文字列が５文字の場合は、文字列のうち４文字が一致した登録用語を話者Ｑが発声した用語と認識する。 Therefore, when the character string input from the voice conversion unit 104 is three characters, the voice recognition unit 105 recognizes a registered term that matches two characters in the character string as a term uttered by the speaker Q. In addition, when the character string input from the voice conversion unit 104 is five characters, the voice recognition unit 105 recognizes a registered term that matches four characters in the character string as a term uttered by the speaker Q.

７文字の登録用語では、２／３以上の一致度Ｅを有する登録用語は５文字又は６文字が一致する登録用語が認識対象の用語となるが、最も一致度Ｅの高いのは６文字が一致する登録用語である。従って、音声認識部１０５は、音声変換部１０４から入力される７文字の文字列のうち６文字が一致した登録用語があれば、その登録用語を話者Ｑが発声した用語と認識し、５文字が一致した登録用語しかない場合は、その登録用語を話者Ｑが発声した用語と認識する。 For 7-character registered terms, registered terms that have a match E of 2/3 or more are registered terms that match 5 or 6 characters. However, the 6-character has the highest match E. Matching registered terms. Therefore, the speech recognition unit 105 recognizes the registered term as a term uttered by the speaker Q if there is a registered term that matches 6 characters in the 7-character string input from the speech conversion unit 104. If there is only a registered term that matches the character, the registered term is recognized as a term uttered by the speaker Q.

音声認識部１０５は、９文字以上の登録用語の場合も、７文字の登録用語の場合と同様にして２／３以上の一致度Ｅを有する登録用語の中で最も一致度Ｅが高い登録用語を話者Ｑが発声した用語と認識する。 In the case of a registered term of 9 characters or more, the speech recognition unit 105 registers the registered term having the highest matching degree E among the registered terms having a matching degree E of 2/3 or more, similarly to the case of the registered term of 7 characters. Is recognized as a term uttered by speaker Q.

例えば、音声認識部１０５に「で」、「ん」、「あ」、「つ」、「し」の文字列が入力された場合、言語モデル記憶部１０６には５文字の用語で２／３以上の一致度Ｅを有する登録用語が「でんあつち」＝「電圧値」しか登録されていなければ、音声認識部１０５は、「でんあつし」の文字列を「電圧値」と認識する。すなわち、音声認識部１０５は、「でんあつし」の文字列の「し」は、「ち」の音素の誤変換とみなして「でんあつち（電圧値）」と認識する。 For example, when a character string “de”, “n”, “a”, “tsu”, “shi” is input to the speech recognition unit 105, the language model storage unit 106 uses 2/3 characters in terms of 5 characters. If only the registered term “denatsuchi” = “voltage value” is registered, the speech recognition unit 105 recognizes the character string “denatsushi” as “voltage value”. . In other words, the voice recognition unit 105 recognizes that “shi” in the character string “denatsushi” is “denatsuchi (voltage value)” by regarding it as an erroneous conversion of the phoneme of “chi”.

また、認識対象の用語が複数ある場合は、音声認識部１０５は、その用語の前後の用語との関係を加味していずれか１の用語を話者Ｑが発声した用語と認識する。例えば、音声認識部１０５に「で」、「ん」、「りゅ」、「う」、「え」、「い」、「し」の文字列（「え」は誤変換された文字）が入力され、音声認識部１０５が「で」、「ん」、「りゅ」、「う」の文字列は「でんりゅう」＝「電流」と認識し、「え」、「い」、「し」の文字列の認識対象として「かいし」＝「開始」と「ていし」＝「停止」を選出したとする。 If there are a plurality of recognition target terms, the speech recognition unit 105 recognizes any one term as a term uttered by the speaker Q in consideration of the relationship with terms before and after the term. For example, a character string “de”, “n”, “ryu”, “u”, “e”, “i”, “shi” (“e” is an erroneously converted character) is stored in the voice recognition unit 105. When the voice recognition unit 105 is input, the character strings “de”, “n”, “ryu”, and “u” are recognized as “denryu” = “current”, and “e”, “i”, “ It is assumed that “kaishi” = “start” and “deashi” = “stop” are selected as recognition targets of the character string “shi”.

「開始」は、「電流」に対して意味が通じない指令語「電流開始」となる関係であるのに対し、「停止」は、「電流」に対して意味が通じる指令語「電流停止」となる関係であるので、音声認識部１０５は、「え」、「い」、「し」の文字列を「ていし」＝「停止」と認識する。 “Start” is a command word “current start” that has no meaning for “current”, whereas “stop” is a command word “current stop” that has a meaning for “current”. Therefore, the voice recognition unit 105 recognizes the character strings “e”, “i”, and “shi” as “deshi” = “stop”.

文字数ｎが偶数の登録用語に対する閾値Ｅ_ＴＨ２＝３／４は、４文字の登録用語で過半数の文字（３文字）が一致する場合の一致度Ｅに相当している。 The threshold E _TH2 = 3/4 for a registered term having an even number of characters n corresponds to the degree of coincidence E when a majority of characters (3 characters) match in a registered character of 4 characters.

２文字の登録用語は、過半数の文字が一致する場合は完全に一致する場合であり、この場合は音声認識が可能であるから、音声認識部１０５は、通常の音声認識処理を行う。一方、音声変換部１０４から入力される２文字の文字列のいずれか一方の文字が登録用語の文字に一致する場合又は２文字の文字列のいずれの文字も登録用語に一致しない場合は、一致度Ｅが閾値Ｅ_ＴＨ２＝３／４以上の条件を満たさないので、正しい音声認識をすることはできない。従って、これらの場合は、音声認識部１０５は、認識不可の処理を行う。 The two-character registered term is a case where the majority of the characters match, and in this case, the voice recognition is possible. Therefore, the voice recognition unit 105 performs a normal voice recognition process. On the other hand, if any one of the two character strings input from the voice conversion unit 104 matches the registered term character, or if any of the two character strings does not match the registered term, they match. Since the degree E does not satisfy the condition of the threshold value E _TH2 = 3/4 or more, correct speech recognition cannot be performed. Therefore, in these cases, the speech recognition unit 105 performs a process that cannot be recognized.

４文字の登録用語の場合は、３／４以上の一致度Ｅを有する場合は３文字が一致する場合しかないので、音声変換部１０４から入力される４文字の文字列のうち３文字が一致した登録用語が認識対象の用語に選出されることになる。また、６文字の登録用語では、３／４以上の一致度Ｅを有する場合は５文字が一致する場合しかないので、音声変換部１０４から入力される６文字の文字列のうち５文字が一致した登録用語が認識対象の用語に選出されることになる。 In the case of a registered term of 4 characters, if there is a match E of 3/4 or more, there is only a case where 3 characters match, so 3 characters in the 4 character string input from the voice conversion unit 104 match. The registered term is selected as the recognition target term. In addition, in the registered term of 6 characters, if the matching degree E is 3/4 or more, there is only a case where 5 characters are matched, so 5 characters in the 6-character string input from the voice conversion unit 104 are matched. The registered term is selected as the recognition target term.

従って、音声認識部１０５は、音声変換部１０４から入力される文字列が４文字の場合は、文字列のうち３文字が一致した登録用語を話者Ｑが発声した用語と認識する。また、音声認識部１０５は、音声変換部１０４から入力される文字列が６文字の場合は、文字列のうち５文字が一致した登録用語を話者Ｑが発声した用語と認識する。 Therefore, when the character string input from the voice conversion unit 104 is four characters, the voice recognition unit 105 recognizes a registered term that matches three characters in the character string as a term uttered by the speaker Q. When the character string input from the voice conversion unit 104 is six characters, the voice recognition unit 105 recognizes a registered term that matches five characters in the character string as a term uttered by the speaker Q.

８文字の登録用語では、３／４以上の一致度Ｅを有する登録用語は６文字又は７文字が一致する登録用語が認識対象の用語となるが、最も一致度Ｅの高いのは７文字が一致する登録用語である。従って、音声認識部１０５は、音声変換部１０４から入力される８文字の文字列のうち７文字が一致した登録用語があれば、その登録用語を話者Ｑが発声した用語と認識し、６文字が一致した登録用語しかない場合は、その登録用語を話者Ｑが発声した用語と認識する。 For registered terms with 8 characters, registered terms with a matching degree E of 3/4 or higher are registered terms that match 6 or 7 characters, but 7 characters have the highest degree of matching E. Matching registered terms. Therefore, the speech recognition unit 105 recognizes the registered term as a term uttered by the speaker Q if there is a registered term that matches 7 characters in the 8-character string input from the speech conversion unit 104. If there is only a registered term that matches the character, the registered term is recognized as a term uttered by the speaker Q.

音声認識部１０５は、音声変換部１０４から入力される文字列が偶数の文字数の場合も上述した文字列が奇数の文字数の場合と同様の認識処理を行う。また、音声認識部１０５は、音声変換部１０４から入力される文字列に対して閾値Ｅ_ＴＨ以上の一致度Ｅを有する登録用語が無い場合は、認識不可の処理を行う。認識不可の処理は、例えば、音声認識部１０５から指令信号変換部１０７に認識結果を出力しない処理や指令信号に変換できない所定の認識結果のデータを指令信号変換部１０７に出力する処理である。 The voice recognition unit 105 performs the same recognition process as in the case where the character string described above has an odd number of characters even when the character string input from the voice conversion unit 104 has an even number of characters. In addition, when there is no registered term having a matching degree E equal to or higher than the threshold value E _{TH with} respect to the character string input from the voice conversion unit 104, the voice recognition unit 105 performs a process that cannot be recognized. The unrecognizable process is, for example, a process of not outputting a recognition result from the voice recognition unit 105 to the command signal conversion unit 107 or a process of outputting data of a predetermined recognition result that cannot be converted into a command signal to the command signal conversion unit 107.

例えば、認識不可の処理として音声認識部１０５から指令信号変換部１０７に認識結果を出力しない処理をした場合は、指令信号変換部１０７には認識した指令語のデータが入力されないので、指令信号変換部１０７は指令信号への変換処理をしない。また、認識不可の処理として音声認識部１０５から所定の認識結果のデータを指令信号変換部１０７に出力した場合は、第２の変換テーブルに所定の認識結果のデータに対応する指令信号のデータがないので、この場合も指令信号変換部１０７は実質的に指令信号への変換処理をしない。 For example, when processing that does not output the recognition result from the speech recognition unit 105 to the command signal conversion unit 107 as unrecognizable processing, the command signal conversion unit 107 does not receive recognized command word data. The unit 107 does not perform conversion processing into a command signal. Further, when data of a predetermined recognition result is output from the voice recognition unit 105 to the command signal conversion unit 107 as a process that cannot be recognized, the data of the command signal corresponding to the data of the predetermined recognition result is stored in the second conversion table. In this case as well, the command signal conversion unit 107 does not substantially convert the command signal.

上記の例では、文字数が奇数の登録用語に対する閾値Ｅ_ＴＨ１と文字数が偶数の登録用語に対する閾値Ｅ_ＴＨ２を文字数に関係なくそれぞれ１個に固定しているが、登録用語の文字数に応じて閾値_ＴＨを異ならせるようにしてもよい。 In the above example, the threshold value E _TH1 for a registered term having an odd number of characters and the threshold value E _TH2 for a registered term having an even number of characters are fixed to one regardless of the number of characters, but the threshold value _TH depends on the number of characters of the registered term. May be different.

また、上記の例では、文認識対象の登録用語が複数ある場合、その登録用語の前後の用語との関係を加味していずれか１の登録用語を話者Ｑが発声した用語と認識するようにしているが、各登録用語の一致しない文字の誤変換を推定して話者Ｑが発声した用語を認識するようにしてもよい。 Further, in the above example, when there are a plurality of registered terms that are sentence recognition targets, any one registered term is recognized as a term uttered by the speaker Q in consideration of the relationship with terms before and after the registered term. However, it is also possible to recognize a term uttered by the speaker Q by estimating erroneous conversion of characters that do not match each registered term.

例えば、音声認識部１０５が「え」、「い」、「し」の３文字の文字列の認識対象として「かいし」＝「開始」と「ていし」＝「停止」を選出した上記の例では、音声認識部１０５は、「えいし」の文字列と「かいし」、「ていし」の登録用語との一致しない文字「え」について誤変換を推定する処理をする。音声認識部１０５は、例えば、「え」、「か」、「て」の各文字の母音を抽出し、「か」、「て」の文字のうち、「え」の口の形に近い母音を有する文字「て」を選出する。 For example, the voice recognition unit 105 selects “Kaishi” = “Start” and “Toshi” = “Stop” as recognition targets for the three character strings “E”, “I”, and “Shi”. In the example, the voice recognition unit 105 performs a process of estimating an erroneous conversion for a character “E” that does not match the character string “EISHI” and the registered terms “KAIISHI” and “TAISEI”. For example, the voice recognition unit 105 extracts the vowels of the characters “e”, “ka”, and “te”, and among the characters “ka” and “te”, the vowel close to the mouth shape of “e” The character “te” having “is selected.

音声認識部１０５は、「え」の文字が「て」の文字の誤変換と推定し、認識対象として選出した登録用語のうち、「ていし」＝「停止」の登録用語を話者Ｑが発声した用語と認識する。 The speech recognition unit 105 estimates that the character “e” is an erroneous conversion of the character “te”, and among the registered terms selected as recognition targets, the speaker Q selects the registered term “deshi” = “stop”. Recognize a spoken term.

指令信号変換部１０７は、音声認識部１０５から出力される音声認識の結果を指令信号に変換する処理を行う。指令語記憶部１０８は、溶接システムにおける音声指令として登録された指令語と指令信号との対応関係を示す変換テーブルを記憶する。指令信号変換部１０７は、指令語記憶部１０８に記憶されている変換テーブルを用いて、音声認識部１０５から出力される音声認識の結果を指令信号に変換して制御部６０３に入力する。 The command signal conversion unit 107 performs processing for converting the result of speech recognition output from the speech recognition unit 105 into a command signal. The command word storage unit 108 stores a conversion table indicating the correspondence between command words registered as voice commands in the welding system and command signals. The command signal conversion unit 107 converts the speech recognition result output from the speech recognition unit 105 into a command signal using the conversion table stored in the command word storage unit 108 and inputs the result to the control unit 603.

音声指令は、操作部６０４の操作ボタン又はトーチスイッチ７０１によって制御部６０３に入力される複数の操作指令の一部に対応する指令である。音声指令は、特に、作業者が溶接作業中に溶接電源装置５に入力する必要性の高い操作指令が選定されている。また、溶接用の用語は、操作指令を構成する単語である。 The voice command is a command corresponding to a part of a plurality of operation commands input to the control unit 603 by the operation button of the operation unit 604 or the torch switch 701. As the voice command, an operation command that is highly necessary to be input to the welding power source device 5 by the operator during the welding work is selected. Moreover, the term for welding is a word which comprises an operation command.

図９は、指令語記憶部１０８に登録されている溶接システムの指令語とその内容の一例を示す図である。 FIG. 9 is a diagram showing an example of a command word of the welding system registered in the command word storage unit 108 and its contents.

図９に示すように、指令語記憶部１０８には、「溶接開始」、「溶接停止」、「電圧上げる」、「電圧下げる」、「電流上げる」、「電流上げる」…などの溶接作業での指令語が登録されている。例えば、「溶接開始」又は「溶接停止」の指令語は、溶接トーチ７と母材Ｂへの溶接電力（アーク発生用の電力と溶接用の電力）の供給開始又は供給停止を指示する指令語である。 As shown in FIG. 9, the command word storage unit 108 has welding operations such as “welding start”, “welding stop”, “voltage increase”, “voltage decrease”, “current increase”, “current increase”, etc. The command word is registered. For example, the command word “start welding” or “stop welding” is a command word for instructing to start or stop supplying welding power (arc generation power and welding power) to the welding torch 7 and the base material B. It is.

また、「電圧上げる」又は「電圧下げる」の指令語は、現在の溶接電圧値に対して予め設定された単位電圧値ΔＶ（例えば、０．１［Ｖ］）で溶接電圧を増加させる又は減少させる指令である。また、「電流上げる」又は「電流下げる」の指令語は、現在の溶接電流値に対して予め設定された単位電流値ΔＩ（例えば、１．０［Ａ］）で溶接電流を増加させる又は減少させる指令である。 Moreover, the command word “voltage increase” or “voltage decrease” increases or decreases the welding voltage by a unit voltage value ΔV (for example, 0.1 [V]) preset with respect to the current welding voltage value. It is a command to make. Further, the command word “current increase” or “current decrease” increases or decreases the welding current by a unit current value ΔI (for example, 1.0 [A]) set in advance with respect to the current welding current value. It is a command to be made.

指令信号変換部１０７は、例えば、音声認識部１０５から「溶接開始」の認識結果が入力されると、「溶接開始」の指令に対応する指令信号を生成し、制御部６０３に入力する。また、指令信号変換部１０７は、例えば、音声認識部１０５から「電圧上げる」又は「電圧下げる」の認識結果が入力されると、「溶接電圧増加」又は「溶接電圧減少」の指令に対応する指令信号を生成し、制御部６０３に入力する。 For example, when the recognition result of “welding start” is input from the voice recognition unit 105, the command signal conversion unit 107 generates a command signal corresponding to the “welding start” command and inputs the command signal to the control unit 603. Further, for example, when the recognition result of “voltage increase” or “voltage decrease” is input from the voice recognition unit 105, the command signal conversion unit 107 responds to a command of “increase welding voltage” or “decrease welding voltage”. A command signal is generated and input to the control unit 603.

一方、指令信号変換部１０７は、音声認識部１０５から入力される認識結果に対応する指令語が指令語記憶部１０８の変換テーブルに含まれていなければ、認識結果に対応する指令信号がないので、指令信号への変化処理をしない。また、指令信号変換部１０７は、音声認識部１０５から認識不可の処理として認識結果が入力されない場合も指令信号への変化処理をしない。従って、これらの場合は、制御部６０３には指令信号は入力されない。 On the other hand, if the command word corresponding to the recognition result input from the speech recognition unit 105 is not included in the conversion table of the command word storage unit 108, the command signal conversion unit 107 does not have a command signal corresponding to the recognition result. , Do not process change to command signal. In addition, the command signal conversion unit 107 does not perform the process of changing to a command signal even when a recognition result is not input from the voice recognition unit 105 as a process that cannot be recognized. Therefore, in these cases, no command signal is input to the control unit 603.

作業者Ｑが発声した音声指令の認識ができなかった場合は、制御部６０３に指令信号が入力されないので、作業者Ｑは、制御部６０３に指令信号が入力されるまで、音声指令の発声動作を繰り返すことになる。 When the voice command uttered by the worker Q cannot be recognized, the command signal is not input to the control unit 603. Therefore, the worker Q utters the voice command until the command signal is input to the control unit 603. Will be repeated.

制御部６０３は、指令信号変換部１０７から指令信号が入力されると、制御ケーブルを介してその指令信号を溶接電源装置５の制御部５０２に送信する。制御部５０２は、ワイヤ送給装置６の制御部６０３から指令信号が入力されると、その指令信号に対応した所定の処理を実行する。 When the command signal is input from the command signal conversion unit 107, the control unit 603 transmits the command signal to the control unit 502 of the welding power source device 5 through the control cable. When a command signal is input from the control unit 603 of the wire feeder 6, the control unit 502 executes a predetermined process corresponding to the command signal.

例えば、制御部５０２は、制御部６０３は、「溶接電流増加」の指令信号が入力されると、電源部５０１の溶接用電源５０１１が出力している溶接電流を所定の電流値（例えば、１［Ａ］）だけ増加させる制御を行う。 For example, when the control unit 603 receives a command signal “increased welding current”, the control unit 603 outputs a welding current output from the welding power source 5011 of the power source unit 501 to a predetermined current value (for example, 1 Control to increase only by [A]) is performed.

次に、音声指令入力装置１による音声指令の入力制御について、図１０のフローチャートを用いて説明する。 Next, voice command input control by the voice command input device 1 will be described with reference to the flowchart of FIG.

以下の説明では、溶接作業中に作業者が操作部６０４の選択ボタンを操作してワイヤ送給装置６を音声指令の入力許可状態にし、音声指令入力装置１を用いて音声指令を制御部６０３に入力する場合について、説明する。また、溶接作業中に作業者が入力する音声指令として、溶接電流を所定の電流値だけ増加させる「溶接電流増加」の音声指令を例に説明する。 In the following description, the operator operates the selection button of the operation unit 604 during the welding operation to place the wire feeding device 6 in the voice command input permission state, and uses the voice command input device 1 to send the voice command to the control unit 603. The case of inputting to will be described. In addition, a voice command “increase welding current” for increasing the welding current by a predetermined current value will be described as an example of the voice command input by the operator during the welding operation.

制御部６０３は、操作部６０４から音声指令の入力許可が入力されたか否かを監視している（Ｓ１００のループ処理）。制御部６０３は、音声指令の入力許可が入力されていなければ（Ｓ１００：Ｎ）、操作部６０４若しくはトーチスイッチ７０１により操作指令を入力する通常の入力状態を維持し、音声指令の入力許可が入力されると（Ｓ１００：Ｙ）、通常の入力状態を音声指令の入力許可状態に切り換え、ステップＳ１０１に移行して音声指令の入力処理を行う。 The control unit 603 monitors whether or not a voice command input permission has been input from the operation unit 604 (loop processing of S100). If the voice command input permission is not input (S100: N), the control unit 603 maintains the normal input state in which the operation command is input by the operation unit 604 or the torch switch 701, and the voice command input permission is input. If so (S100: Y), the normal input state is switched to the voice command input permission state, and the process proceeds to step S101 to perform voice command input processing.

ステップＳ１０１に移行すると、座標変換部１０２は、加速度センサ１０１から出力されるＸ軸、Ｙ軸、Ｚ軸の各軸方向の加速度の検出信号Ｓｘ，Ｓｙ，Ｓｚを受け付け、その検出信号Ｓｘ，Ｓｙ，Ｓｚを検出位置Ｐｒの変位した変位位置Ｐｄの座標（Ｘｄ，Ｙｄ，Ｚｄ）に変換する。 In step S101, the coordinate conversion unit 102 receives the acceleration detection signals Sx, Sy, Sz in the X-axis, Y-axis, and Z-axis directions output from the acceleration sensor 101, and the detection signals Sx, Sy. , Sz are converted into coordinates (Xd, Yd, Zd) of the displacement position Pd displaced from the detection position Pr.

続いて、音声変換部１０４が座標変換部１０２から出力される座標（Ｘｄ，Ｙｄ，Ｚｄ）を変換テーブル記憶部１０３に記憶されている変換テーブルを参照して音素のデータ（文字コードのデータ）に変換する（Ｓ１０２）。 Subsequently, the phonetic data (character code data) with reference to the conversion table stored in the conversion table storage unit 103 is the coordinates (Xd, Yd, Zd) output from the coordinate conversion unit 102 by the voice conversion unit 104. (S102).

続いて、音声認識部１０５が音声変換部１０４から出力される文字列のデータ（文字コードの列のデータ）に対して、言語モデル記憶部１０６に記憶されている言語モデルを用いて音声の認識処理を行い（Ｓ１０３）、その音声認識処理で指令語が認識できたか否かを判断する（Ｓ１０４）。 Subsequently, the speech recognition unit 105 recognizes speech using the language model stored in the language model storage unit 106 for the character string data (character code string data) output from the speech conversion unit 104. Processing is performed (S103), and it is determined whether or not the command word has been recognized by the voice recognition processing (S104).

音声認識部１０５は、音声変換部１０４から入力される文字列と言語モデル記憶部１０６に記憶されている登録用語とを照合して、文字列が完全に一致する登録用語があれば、その登録用語若しくは登録用語の組合せを話者Ｑが発声した指令語と認識し、その認識結果（指令語のデータ）を指令信号変換部１０７に出力して（Ｓ１０４：Ｙ）、ステップＳ１０８に移行する。 The speech recognition unit 105 collates the character string input from the speech conversion unit 104 with the registered term stored in the language model storage unit 106, and if there is a registered term whose character string completely matches, the registration is performed. The term or combination of registered terms is recognized as a command word uttered by the speaker Q, and the recognition result (command word data) is output to the command signal conversion unit 107 (S104: Y), and the process proceeds to step S108.

例えば、音声変換部１０４から出力される文字列が「で」、「ん」、「りゅ」、「う」、「あ」、「げ」、「る」の場合、言語モデル記憶部１０６に「でんりゅう」と「あげる」の登録用語が登録されていれば、音声認識部１０５は、その文字列を「でんりゅう」＋「あげる」＝「電流上げる」の指令語と認識し、「電流上げる」のデータを指令信号変換部１０７に出力して（Ｓ１０４：Ｙ）、ステップＳ１０８に移行する。 For example, if the character string output from the voice conversion unit 104 is “de”, “n”, “ryu”, “u”, “a”, “ge”, “ru”, the language model storage unit 106 If the registered terms "denryu" and "raise" are registered, the speech recognition unit 105 recognizes the character string as "denryu" + "raise" = "current increase" command word, The “current increase” data is output to the command signal converter 107 (S104: Y), and the process proceeds to step S108.

一方、音声変換部１０４から入力される文字列と完全に一致する登録用語が言語モデル記憶部１０６に記憶されていない場合（Ｓ１０４：Ｎ）、音声認識部１０５は、言語モデル記憶部１０６に記憶されている登録用語のうち、音声変換部１０４から入力される文字列と同一の文字数の登録用語について、一致度Ｅを算出する（Ｓ１０５）。 On the other hand, when the registered term that completely matches the character string input from the speech conversion unit 104 is not stored in the language model storage unit 106 (S104: N), the speech recognition unit 105 stores it in the language model storage unit 106. Among the registered terms, the degree of coincidence E is calculated for the registered terms having the same number of characters as the character string input from the speech conversion unit 104 (S105).

そして、音声認識部１０５は、一致度Ｅを用いた音声認識処理を行い（Ｓ１０６）、その音声認識処理で指令語が認識できたか否かを判断する（Ｓ１０７）。 Then, the voice recognition unit 105 performs voice recognition processing using the degree of coincidence E (S106), and determines whether or not the command word can be recognized by the voice recognition processing (S107).

すなわち、音声認識部１０５は、一致度Ｅを算出した各登録用語について、予め設定された閾値Ｅ_ＴＨ以上の一致度Ｅを有する登録用語の中で最も一致度Ｅが高い登録用語を話者Ｑが発声した指令語と認識し、その認識結果を指令信号変換部１０７に出力して（Ｓ１０７：Ｙ）、ステップＳ１０８に移行する。 That is, for each registered term for which the matching degree E is calculated, the voice recognition unit 105 _{selects the} registered term having the highest matching degree E among the registered terms having a matching degree E equal to or higher than a preset threshold E _TH. Is recognized and output to the command signal converter 107 (S107: Y), and the process proceeds to step S108.

また、音声認識部１０５は、閾値Ｅ_ＴＨ以上の一致度Ｅを有する登録用語の中で最も一致度Ｅが高い登録用語が２以上ある場合は、例えば、各登録用語の前後の認識された登録用語との関係を加味して１の登録用語を決定し、その登録用語と前後の登録用語の組合せを話者Ｑが発声した指令語と認識する。 In addition, when there are two or more registered terms having the highest matching degree E among the registered terms having the matching degree E equal to or higher than the threshold E _TH , the voice recognition unit 105, for example, recognizes registered registrations before and after each registered term. One registered term is determined in consideration of the relationship with the term, and a combination of the registered term and the preceding and following registered terms is recognized as a command word uttered by the speaker Q.

例えば、音声認識部１０５は、選出した登録用語が前後の認識された登録用語との関係で全体として指令の意味をなさない指令語となる場合は、その選出した登録用語は認識対象から除外し、指令の意味をなす指令語となる場合にその選出した登録用語と前後の登録用語の組合せを話者Ｑが発声した指令語と認識する。 For example, the speech recognition unit 105 excludes the selected registered term from the recognition target when the selected registered term becomes a command word that does not make the meaning of the command as a whole in relation to the previous and subsequent recognized registered terms. When the command word makes sense, the combination of the selected registered term and the previous and subsequent registered terms is recognized as the command word uttered by the speaker Q.

例えば、音声認識部１０５が音声変換部１０４から入力された「でんりゅうえいし」の文字列に対して、「でんりゅう」＝「電流」と認識し、「えいし」に対して「停止」と「開始」の登録用語を選出したとする。音声認識部１０５は、「電流開始」は指令語として意味をなさないが、「電流停止」は指令語として意味をなすので、「でんりゅうえいし」を「電流停止」と認識する。 For example, the voice recognition unit 105 recognizes “denryu” = “current” for the character string “denrye” inputted from the voice conversion unit 104, and “ Suppose you select registered terms for “stop” and “start”. The voice recognizing unit 105 recognizes “deniyuusei” as “current stop” because “current start” does not make sense as a command word, but “current stop” makes sense as a command word.

そして、音声認識部１０５は、認識した指令語のデータを指令信号変換部１０７に出力して（Ｓ１０７：Ｙ）、ステップＳ１０８に移行する。 Then, the voice recognition unit 105 outputs the recognized command word data to the command signal conversion unit 107 (S107: Y), and proceeds to step S108.

一方、音声認識部１０５は、音声変換部１０４から入力される文字列に対して閾値Ｅ_ＴＨ以上の一致度Ｅを有する登録用語が無い場合は、認識不可の処理結果を指令信号変換部１０７に出力して（Ｓ１０７：Ｎ）、ステップＳ１００に戻る。 On the other hand, if there is no registered term having a matching degree E equal to or higher than the threshold E _{TH with} respect to the character string input from the voice conversion unit 104, the voice recognition unit 105 sends an unrecognized processing result to the command signal conversion unit 107. Output (S107: N) and return to step S100.

ステップＳ１０８に移行すると、指令信号変換部１０７は音声認識部１０５から出力される音声認識の結果に対して、指令語記憶部１０８に記憶されている指令語を用いて指令信号に変換する処理を行う。 In step S108, the command signal conversion unit 107 performs a process of converting the voice recognition result output from the voice recognition unit 105 into a command signal using the command word stored in the command word storage unit 108. Do.

指令信号変換部１０７は、音声認識結果（指令語のデータ）が指令語記憶部１０８の変換テーブルに含まれている場合、音声認識結果に対応する指令信号を生成して制御部６０３に出力した後（Ｓ１０８）、ステップＳ１００に戻る。例えば、音声認識部１０５から出力される音声認識結果が「電流上げる」の場合、指令信号変換部１０７は、［電流上げる］（溶接電流を単位電流値ΔＩ［Ａ］だけ増加させる）の指令語に対応する指令信号を生成して制御部６０３に入力した後、ステップＳ１００に戻る。 When the voice recognition result (command word data) is included in the conversion table of the command word storage unit 108, the command signal conversion unit 107 generates a command signal corresponding to the voice recognition result and outputs the command signal to the control unit 603. After (S108), the process returns to step S100. For example, when the voice recognition result output from the voice recognition unit 105 is “current increase”, the command signal conversion unit 107 uses the command word “current increase” (the welding current is increased by the unit current value ΔI [A]). Is generated and input to the control unit 603, and then the process returns to step S100.

また、指令信号変換部１０７は、音声認識結果が指令語記憶部１０８の変換テーブルに含まれてない場合や音声認識部１０５から認識不可により音声認識結果が入力されていない場合、指令信号を生成することなくステップＳ１００に戻る。 The command signal conversion unit 107 generates a command signal when the voice recognition result is not included in the conversion table of the command word storage unit 108 or when the voice recognition result is not input from the voice recognition unit 105 due to the recognition failure. It returns to step S100, without doing.

以上、説明したように、本実施の形態に係る溶接システム４によれば、音声指令入力装置１を用いているので、溶接作業中に作業者が音声により指令を入力した場合でも作業現場に生じる非常に高い騒音の影響を殆ど受けることなく高い精度で安定して音声指令に対応する指令信号を入力することができる。 As described above, according to the welding system 4 according to the present embodiment, since the voice command input device 1 is used, even when an operator inputs a command by voice during the welding work, it occurs at the work site. The command signal corresponding to the voice command can be input stably with high accuracy without being affected by very high noise.

また、作業者Ｑの頬若しくは口元の周辺部に直接加速度センサ１０１を取り付け、その加速度センサ１０１の検出信号Ｓｘ，Ｓｙ，Ｓｚを用いて作業者Ｑが発声した音声を検出するので、作業者Ｑの口に近接させてマイクを設ける構成よりもセンサが邪魔にならず、作業者Ｑの溶接作業への悪影響を低減することができる。 Further, since the acceleration sensor 101 is directly attached to the cheek or the peripheral part of the mouth of the worker Q and the voice uttered by the worker Q is detected using the detection signals Sx, Sy, Sz of the acceleration sensor 101, the worker Q As compared with the configuration in which the microphone is provided close to the mouth, the sensor does not get in the way, and the adverse effect on the welding work of the worker Q can be reduced.

また、３軸の加速度センサ１０１を作業者Ｑの口元の両側に一対設けて当該口の動きを検出するようにしているので、作業者Ｑが発声する音声に応じて微妙に変化する口の動きを精度良く検出することができる。 In addition, since a pair of three-axis acceleration sensors 101 are provided on both sides of the mouth of the worker Q so as to detect the movement of the mouth, the mouth movement that slightly changes according to the voice uttered by the worker Q. Can be detected with high accuracy.

また、加速度センサ１０１は、作業者Ｑの顔に直接貼着し、ケーブル中継部材２を介して加速度センサ１０１と装置本体１００を電気的に接続するので、加速度センサ１０１が作業者Ｑの溶接作業を邪魔して作業性を低下するということがない。 Further, since the acceleration sensor 101 is directly attached to the face of the worker Q and the acceleration sensor 101 and the apparatus main body 100 are electrically connected via the cable relay member 2, the acceleration sensor 101 is welded by the worker Q. The workability will not be deteriorated.

また、音声認識部１０５に入力された音声指令の文字列が言語モデル記憶部１０６に記憶されている登録用語に完全に一致しない場合でも、例えば、文字列と登録用語の文字の一致度Ｅを算出し、その一致度Ｅを用いて音声指令の文字列に一致する確率の高い指令語を作業者Ｑの発声した音声指令と認識するようにしているので、加速度センサ１０１による作業者Ｑの口の動きの検出値を用いて音声指令を好適に認識することができる。 Further, even when the character string of the voice command input to the voice recognition unit 105 does not completely match the registered term stored in the language model storage unit 106, for example, the matching degree E between the character string and the registered term character is determined. The command word having a high probability of matching with the character string of the voice command is calculated using the matching degree E, and is recognized as the voice command uttered by the worker Q. The voice command can be suitably recognized using the detected value of the movement.

また、音声変換部１０４が音声指令の音素を変換した文字に誤変換が含まれている場合でも音声認識部１０５で登録用語との一致度Ｅを算出し、その一致度Ｅを用いて音声指令による指令語を認識するようにしているので、高い精度で作業者Ｑが発声した音声指令に対応する指令信号をワイヤ送給装置６の制御部６０３に入力することができる。 Further, even when a character obtained by converting the phoneme of the voice command by the voice conversion unit 104 includes erroneous conversion, the voice recognition unit 105 calculates the degree of coincidence E with the registered term, and uses the degree of match E to generate the voice command. Therefore, the command signal corresponding to the voice command uttered by the operator Q can be input to the control unit 603 of the wire feeder 6 with high accuracy.

上記の実施の形態では、操作部６０４に選択スイッチを設け、作業者が選択スイッチを操作して音声による操作指令を入力可能にしていたが、選択スイッチに代えてモード切換スイッチを操作部６０４に設けてもよい。 In the above embodiment, the operation unit 604 is provided with a selection switch so that an operator can input a voice operation command by operating the selection switch. However, instead of the selection switch, a mode changeover switch is provided in the operation unit 604. It may be provided.

モード切換スイッチは、操作指令の入力モードを音声による入力モード（音声入力モード）と操作ボタン又はトーチスイッチ７０１による入力モード（通常入力モード）とに切り換えるスイッチである。 The mode switch is a switch for switching the operation command input mode between a voice input mode (voice input mode) and an operation button or torch switch 701 input mode (normal input mode).

作業者がモード切換スイッチで音声入力モードを設定すると、制御部６０３は、操作部６０４の操作ボタン又はトーチスイッチ７０１による操作指令信号の受け付け制御を行うとともに、音声指令入力装置１からの操作指令信号の受け付け制御を行う。逆に、作業者がモード切換スイッチで通常入力モードを設定すると、制御部６０３は、操作部６０４操作ボタン又はトーチスイッチ７０１による操作指令信号を受け付けるが、音声指令入力装置１からの操作指令信号を受け付けない制御を行う。 When the operator sets the voice input mode with the mode change switch, the control unit 603 performs reception control of the operation command signal by the operation button of the operation unit 604 or the torch switch 701, and the operation command signal from the voice command input device 1. The admission control is performed. Conversely, when the operator sets the normal input mode with the mode change switch, the control unit 603 accepts an operation command signal from the operation unit 604 operation button or the torch switch 701, but receives the operation command signal from the voice command input device 1. Control not accepted.

図７に示す溶接システム４では、遠隔操作装置を備えていてないが、遠隔操作装置を備えていてもよい。 The welding system 4 shown in FIG. 7 does not include a remote operation device, but may include a remote operation device.

図１１は、遠隔操作装置を備えた溶接システム野構成の一例を示す図である。 FIG. 11 is a diagram illustrating an example of a welding system field configuration including a remote control device.

図１１に示す溶接システム４Ａは、ワイヤ送給装置６Ａに無線通信によって接続可能な遠隔操作装置１０を備えたもので、その遠隔操作装置１０に音声指令入力装置１を設けたものである。 A welding system 4A shown in FIG. 11 includes a remote operation device 10 that can be connected to a wire feeding device 6A by wireless communication. The remote operation device 10 includes a voice command input device 1.

図７に示す溶接システム４では、音声指令入力装置１を作業者Ｑの近くに配置されるワイヤ送給装置６に設けていたが、図１１に示す溶接システム４Ａでは作業者Ｑが保持する遠隔操作装置１０に設けているので、溶接システム４Ａへの音声指令入力装置１の配設をより容易且つコンパクトにすることができる。 In the welding system 4 shown in FIG. 7, the voice command input device 1 is provided in the wire feeding device 6 arranged near the worker Q. However, in the welding system 4A shown in FIG. Since the operation device 10 is provided, the arrangement of the voice command input device 1 to the welding system 4A can be made easier and more compact.

特に、ワイヤ送給装置６に音声指令入力装置１を設ける場合は、作業者Ｑのワイヤ送給装置６からの移動を考慮して加速度センサ１０１から音声指令入力装置１の装置本体１００までの配線に余裕を必要とするが、遠隔操作装置１０に音声指令入力装置１を設ける場合は、作業者Ｑと遠隔操作装置１０との距離が短く、その距離の変化も少ないので、加速度センサ１０１から音声指令入力装置１の装置本体１００までの配線をよりコンパクトにすることができる。 In particular, when the voice command input device 1 is provided in the wire feeding device 6, wiring from the acceleration sensor 101 to the device main body 100 of the voice command input device 1 in consideration of the movement of the worker Q from the wire feeding device 6. However, when the voice command input device 1 is provided in the remote control device 10, the distance between the worker Q and the remote control device 10 is short and the change in the distance is small. The wiring to the apparatus main body 100 of the command input device 1 can be made more compact.

溶接システム４Ａの図７に示す溶接システム４との構成上の相違点について、簡単に説明すると、以下の通りである。 The difference in configuration of the welding system 4A from the welding system 4 shown in FIG. 7 will be briefly described as follows.

溶接システム４Ａでは、ワイヤ送給装置６Ａは、無線通信部６０７を備えている。無線通信部６０７は、例えば、無線ＬＡＮにより遠隔操作装置１０との間で無線通信を行う。無線通信部６０７は、無線ＬＡＮアクセスポイント（図示省略）を介して遠隔操作装置１０と無線接続を行い、無線通信により所定の情報の送受信を行う。無線通信部６０７は、所定の変調方式でキャリア信号を送信すべき情報で変調して無線通信信号を生成し、その無線通信信号をアンテナ６０７Ａから放射する。 In the welding system 4A, the wire feeder 6A includes a wireless communication unit 607. The wireless communication unit 607 performs wireless communication with the remote control device 10 by, for example, a wireless LAN. The wireless communication unit 607 wirelessly connects to the remote control device 10 via a wireless LAN access point (not shown), and transmits / receives predetermined information by wireless communication. The wireless communication unit 607 modulates the carrier signal with information to be transmitted using a predetermined modulation method to generate a wireless communication signal, and radiates the wireless communication signal from the antenna 607A.

例えば、無線通信部６０７は、制御部６０３から溶接電圧と溶接電流の情報が転送されると、その情報を用いて無線通信信号を生成し、その無線通信信号を遠隔操作装置１０に送信する。また、無線通信部６０７は、制御部６０３から溶接電圧検出値と溶接電流検出値の情報が転送されると、その情報を用いて無線通信信号を生成し、その無線通信信号を遠隔操作装置１０に送信する。 For example, when information on the welding voltage and welding current is transferred from the control unit 603, the wireless communication unit 607 generates a wireless communication signal using the information and transmits the wireless communication signal to the remote control device 10. In addition, when the information on the welding voltage detection value and the welding current detection value is transferred from the control unit 603, the wireless communication unit 607 generates a wireless communication signal using the information, and the wireless communication signal is transmitted to the remote control device 10. Send to.

また、無線通信部６０７は、遠隔操作装置１０から無線通信信号を受信すると、送信された情報を復調し、制御部６０３に転送する。また、無線通信部６０７は、受信レベルを検出し、その検出値を制御部６０３に入力する。制御部６０３は、無線通信の通信状態を表示させるために、その検出値に基づいて、受信レベルに関する情報を生成し、その情報を表示部６０５に転送する。 In addition, when receiving a wireless communication signal from the remote control device 10, the wireless communication unit 607 demodulates the transmitted information and transfers it to the control unit 603. Further, the wireless communication unit 607 detects the reception level and inputs the detected value to the control unit 603. In order to display the communication state of the wireless communication, the control unit 603 generates information related to the reception level based on the detected value, and transfers the information to the display unit 605.

遠隔操作装置１０は、リモコンと呼ばれるものである。遠隔操作装置１０は、機能ブロックとして、電池１００１、制御部１００２、無線通信部１００３、操作部１００４及び表示部１００５と、上述した音声指令入力装置１の音声指令入力部１０９を含む。 The remote control device 10 is called a remote controller. The remote operation device 10 includes, as functional blocks, a battery 1001, a control unit 1002, a wireless communication unit 1003, an operation unit 1004, a display unit 1005, and the voice command input unit 109 of the voice command input device 1 described above.

電池１００１は、遠隔操作装置１０の駆動電源である。電池１００１、無線通信部１００３、操作部１００４、表示部１００５及び音声指令入力部１０９は制御部１００２に接続されている。 The battery 1001 is a drive power source for the remote operation device 10. The battery 1001, the wireless communication unit 1003, the operation unit 1004, the display unit 1005, and the voice command input unit 109 are connected to the control unit 1002.

制御部１００２は、操作部１００４による情報の入力、表示部１００５による情報の出力、音声指令入力部１０９による音声指令の入力、無線通信部１００３によるワイヤ送給装置６Ａとのデータ通信などの各処理を統括的に制御する。制御部１００２は、ＲＯＭ、ＲＡＭ及びＭＰＵを有するマイクロコンピュータで実現されている。制御部１００２もＦＰＧＡなどのＰＬＤで実現してもよい。 The control unit 1002 performs various processes such as information input by the operation unit 1004, information output by the display unit 1005, voice command input by the voice command input unit 109, and data communication with the wire feeder 6A by the wireless communication unit 1003. Overall control. The control unit 1002 is realized by a microcomputer having a ROM, a RAM, and an MPU. The control unit 1002 may also be realized by a PLD such as an FPGA.

操作部１００４は、作業者がアーク溶接に関する各種の情報を制御部１００２に入力する操作を行うブロックである。操作部１００４は、遠隔操作装置１０の筐体面に配設された操作パネルに設けられており、ワイヤ送給装置６Ａの操作部６０４と同様の機能を果たす。 The operation unit 1004 is a block for performing an operation in which an operator inputs various types of information related to arc welding to the control unit 1002. The operation unit 1004 is provided on an operation panel disposed on the housing surface of the remote operation device 10, and performs the same function as the operation unit 604 of the wire feeding device 6A.

作業者が操作部１００４の起動、溶接電圧の設定・変更、溶接電流の設定・変更など操作ボタンを操作すると、その操作ボタンに対応する所定の操作指令信号が制御部１００２に入力される。 When the operator operates an operation button such as starting the operation unit 1004, setting / changing the welding voltage, setting / changing the welding current, etc., a predetermined operation command signal corresponding to the operation button is input to the control unit 1002.

遠隔操作装置１０では、作業者は、操作指令を操作部１００４と音声指令入力装置１のいずれかで入力することができる。このため、遠隔操作装置１０の操作パネルには、操作指令を入力する装置を選択するための選択ボタンが設けられている。この選択ボタンは、例えば、モーメンタリー・スイッチで構成され、作業者がモーメンタリー・スイッチをオン状態にしている期間に音声指令入力装置８を用いて音声により操作指令を制御部１００２に入力することができる。 In the remote operation device 10, the operator can input an operation command with either the operation unit 1004 or the voice command input device 1. For this reason, the operation panel of the remote operation device 10 is provided with a selection button for selecting a device for inputting an operation command. This selection button is composed of, for example, a momentary switch, and an operation command can be input to the control unit 1002 by voice using the voice command input device 8 during a period in which the worker is turning on the momentary switch. .

表示部１００５は、アーク溶接に関する各種の情報や電池１００１の残容量に関する情報やワイヤ送給装置６Ａとの無線通信状態などを表示する。表示部１００５は、例えば、液晶ディスプレイなどの表示装置で構成される。表示部１００５は、ＬＥＤや７セグメントなどの表示機で構成されてもよい。 The display unit 1005 displays various information related to arc welding, information related to the remaining capacity of the battery 1001, the state of wireless communication with the wire feeding device 6A, and the like. The display unit 1005 is configured by a display device such as a liquid crystal display, for example. The display unit 1005 may be configured by a display device such as an LED or 7 segments.

表示部１００５は、制御部１００２から転送される情報に基づいて、所定の表示をする。例えば、表示部１００５は、制御部１００２から入力される電池１００１の電圧に関する情報に基づいて、電池１００１の残容量に関する情報を絵文字や記号や文字メッセージなどで表示する。また、表示部１００５は、制御部１００２から入力される受信レベルに関する情報に基づいて、無線通信の通信状態に関する情報を絵文字や記号や文字メッセージなどで表示する。 The display unit 1005 performs a predetermined display based on information transferred from the control unit 1002. For example, the display unit 1005 displays information on the remaining capacity of the battery 1001 with pictographs, symbols, character messages, and the like based on information on the voltage of the battery 1001 input from the control unit 1002. Further, the display unit 1005 displays information on the communication state of wireless communication as pictographs, symbols, text messages, and the like based on information on the reception level input from the control unit 1002.

また、表示部１００５は、制御部１００２から転送される入力情報（操作部１００４又は音声指令入力部１０９から入力された情報や無線通信部１００３を介してワイヤ送給装置６Ａから入力された情報）を表示する。 The display unit 1005 also receives input information transferred from the control unit 1002 (information input from the operation unit 1004 or the voice command input unit 109 or information input from the wire feeder 6A via the wireless communication unit 1003). Is displayed.

無線通信部１００３は、ワイヤ送給装置６Ａとの間で無線通信を行う。無線通信部１００３は、無線通信部３４と同様に、無線ＬＡＮによりワイヤ送給装置６Ａとの間で無線通信を行う。 The wireless communication unit 1003 performs wireless communication with the wire feeding device 6A. Similarly to the wireless communication unit 34, the wireless communication unit 1003 performs wireless communication with the wire feeding device 6A through a wireless LAN.

無線通信部１００３は、無線ＬＡＮアクセスポイント（図示省略）を介してワイヤ送給装置６Ａと無線接続を行い、無線通信により所定の情報の送受信を行う。無線通信部１００３は、無線通信部６０７と同じ変調方式でキャリア信号を送信すべき情報で変調して無線通信信号を生成し、その無線通信信号をアンテナ１００３Ａから放射する。 The wireless communication unit 1003 wirelessly connects to the wire feeding device 6A via a wireless LAN access point (not shown), and transmits / receives predetermined information by wireless communication. The wireless communication unit 1003 generates a wireless communication signal by modulating a carrier signal with information to be transmitted using the same modulation method as the wireless communication unit 607, and radiates the wireless communication signal from the antenna 1003A.

例えば、無線通信部１００３は、制御部１００２から溶接電圧と溶接電流の情報が転送されると、その情報を用いて無線通信信号を生成し、その無線通信信号をワイヤ送給装置６Ａに送信する。 For example, when information on the welding voltage and welding current is transferred from the control unit 1002, the wireless communication unit 1003 generates a wireless communication signal using the information and transmits the wireless communication signal to the wire feeding device 6A. .

また、無線通信部１００３は、ワイヤ送給装置６Ａから無線通信信号を受信すると、送信された情報を復調し、制御部１００２に転送する。また、無線通信部１００３は、受信レベルを検出し、その検出値を制御部１００２に入力する。制御部１００２は、無線通信の通信状態を表示させるために、その検出値に基づいて、受信レベルに関する情報を生成し、その情報を表示部１００５に転送する。 Further, when receiving the wireless communication signal from the wire feeding device 6 </ b> A, the wireless communication unit 1003 demodulates the transmitted information and transfers it to the control unit 1002. Radio communication unit 1003 detects the reception level and inputs the detected value to control unit 1002. In order to display the communication state of the wireless communication, the control unit 1002 generates information on the reception level based on the detected value, and transfers the information to the display unit 1005.

溶接システム４Ａでは、音声指令入力装置１によって作業者Ｑの発声した音声指令が指令信号に変換されて遠隔操作装置１０の制御部１００２に入力されると、制御部１００２がその指令信号に対応する操作指令の情報を無線通信によってワイヤ送給装置６Ａの制御部６０３に送信し、さらに制御部６０３からその操作指令の情報が溶接電源装置５の制御部５０２に転送される。 In the welding system 4A, when the voice command uttered by the worker Q is converted into a command signal by the voice command input device 1 and input to the control unit 1002 of the remote operation device 10, the control unit 1002 corresponds to the command signal. Information on the operation command is transmitted to the control unit 603 of the wire feeding device 6A by wireless communication, and the information on the operation command is further transferred from the control unit 603 to the control unit 502 of the welding power source device 5.

これにより、溶接電源装置５ではワイヤ送給装置６Ａから入力された操作指令の情報に対応する所定の動作が行われる。例えば、操作指令の内容が「溶接電流の増加」であれば、溶接電源装置５では、溶接用電源５０１１が溶接トーチ７と母材Ｂに供給している溶接電流をΔＩだけ増加する制御が行われる。 As a result, the welding power supply device 5 performs a predetermined operation corresponding to the information of the operation command input from the wire feeding device 6A. For example, if the content of the operation command is “increase in welding current”, the welding power source device 5 performs control to increase the welding current supplied to the welding torch 7 and the base material B by the welding power source 5011 by ΔI. Is called.

なお、図１１の構成では、遠隔操作装置１０がワイヤ送給装置６Ａとの間で無線通信をする構成であるが、遠隔操作装置１０は、溶接電源装置５との間で無線通信をする構成であってもよく、溶接電源装置５とワイヤ送給装置６Ａの両方に対して無線通信をする構成であってもよい。 In the configuration of FIG. 11, the remote operation device 10 performs wireless communication with the wire feeding device 6 </ b> A, but the remote operation device 10 performs wireless communication with the welding power source device 5. The wireless power communication may be performed with respect to both the welding power supply device 5 and the wire feeding device 6A.

以上、説明したように本実施の形態に係る溶接システム４，４Ａによれば、加速度センサ１０１によって作業者Ｑが音声指令を発したときの頬若しくは口元の動きを検出し、その検出結果に基づいて音声指令の内容を認識し、その音声指令に対応する指令信号を溶接電源装置５の制御部５０２に入力するようにしているので、溶接作業現場の騒音が高い環境でもその騒音の影響を受けることなく音声指令の内容を認識することができる。 As described above, according to the welding systems 4 and 4A according to the present embodiment, the movement of the cheek or the mouth when the worker Q issues a voice command is detected by the acceleration sensor 101, and based on the detection result. Since the content of the voice command is recognized and the command signal corresponding to the voice command is input to the control unit 502 of the welding power source device 5, the noise is affected even in a noisy environment at the welding work site. The contents of the voice command can be recognized without any problem.

特に、溶接システム４，４Ａでは、操作ボタンによる溶接条件の変更等の操作指令が比較的簡単な内容であり、音声による指令語も簡単且つ少ない音素で構成されるので、加速度センサ１０１による作業者Ｑの頬若しくは口元の動きの検出値を用いた音声指令の認識処理でも誤動作や誤認識を低減することができる。 In particular, in the welding systems 4 and 4A, the operation command such as changing the welding condition by the operation button has relatively simple contents, and the voice command word is composed of simple and few phonemes. Malfunctions and misrecognition can also be reduced by voice command recognition processing using the detected value of the movement of the cheek or mouth of Q.

また、加速度センサ１０１による作業者Ｑの頬若しくは口元の動きの検出値から求めた音声指令の音素の一部に誤検出があっても、例えば、文字列と登録用語の文字の一致度Ｅを算出し、その一致度Ｅを用いて音声指令の文字列に一致する確率の高い登録用語又は登録用語の組合せを作業者Ｑの発声した音声指令と認識するようにしているので、音声指令を高い精度で当該音声指令に対応した指令信号に変換して入力することができる。 Even if a part of the phoneme of the voice command obtained from the detected value of the movement of the cheek or mouth of the worker Q by the acceleration sensor 101 is erroneously detected, for example, the matching degree E between the character string and the character of the registered term is determined. Since the coincidence E is calculated and a registered term or a combination of registered terms having a high probability of matching the character string of the voice command is recognized as the voice command uttered by the worker Q, the voice command is high. It can be converted into a command signal corresponding to the voice command and input with accuracy.

従って、音声指令により高い精度で安定して溶接に関する捜査指令の指令信号を溶接電源装置５に入力することができる。これにより、作業者Ｑは、溶接作業中における溶接条件の変更等のための手入力操作が低減され、溶接作業が容易になる。 Therefore, the command signal of the investigation command regarding welding can be input to the welding power source device 5 stably with high accuracy by the voice command. Thereby, the operator Q can reduce the manual input operation for changing the welding conditions during the welding operation, and the welding operation is facilitated.

１音声指令入力装置
１００装置本体
１０１，１０１Ｒ，１０１Ｌ加速度センサ
１０１１リード線
１０１２コネクタ
１０２座標変換部（第１の変換手段）
１０３音声変換部（第２の変換手段）
１０４変換テーブル記憶部（第１の記憶手段）
１０５音声認識部（音声認識手段）
１０６言語モデル記憶部（第２の記憶手段）
１０７指令信号変換部（第３の変換手段）
１０８指令語記憶部（第３の記憶手段）
１０９音声指令入力部
２ケーブル中継部材
２０１頭部装着部
２０２接続部
２０３信号線
３接着部材
４，４Ａ溶接システム
５溶接電源装置
５０１電源部
５０１１溶接用電源
５０１２ワイヤ送給用電源
５０２制御部
５０３操作部
５０４表示部
６，６Ａワイヤ送給装置
６０１ワイヤ送給機構
６０２ガス電磁弁
６０３制御部
６０４操作部
６０５表示部
６０６電圧／電流検出部
６０７無線通信部
６０７Ａアンテナ
７溶接トーチ
７０１トーチスイッチ
８ガス供給装置
９ワイヤリール
１０遠隔操作装置
１００１電池
１００２制御部
１００３無線通信部
１００３Ａアンテナ
１００４操作部
１００５表示部
Ａアーク
Ｂ母材
Ｗ溶接ワイヤ DESCRIPTION OF SYMBOLS 1 Voice command input device 100 Apparatus main body 101,101R, 101L Acceleration sensor 1011 Lead wire 1012 Connector 102 Coordinate conversion part (1st conversion means)
103 Voice conversion unit (second conversion means)
104 Conversion table storage unit (first storage unit)
105 Voice recognition unit (voice recognition means)
106 Language model storage unit (second storage means)
107 Command signal converter (third converter)
108 Command word storage unit (third storage means)
DESCRIPTION OF SYMBOLS 109 Voice command input part 2 Cable relay member 201 Head mounting part 202 Connection part 203 Signal line 3 Adhesive member 4,4A Welding system 5 Welding power supply device 501 Power supply part 5011 Welding power supply 5012 Wire feeding power supply 502 Control part 503 Operation Unit 504 display unit 6,6A wire feeding device 601 wire feeding mechanism 602 gas solenoid valve 603 control unit 604 operation unit 605 display unit 606 voltage / current detection unit 607 wireless communication unit 607A antenna 7 welding torch 701 torch switch 8 gas supply Device 9 Wire reel 10 Remote operation device 1001 Battery 1002 Control unit 1003 Wireless communication unit 1003A Antenna 1004 Operation unit 1005 Display unit A Arc B Base material W Welding wire

Claims

A voice command input device for inputting a command signal by voice,
Detecting means for detecting movement of the speaker's mouth when the speaker utters a voice command;
Command word conversion means for converting the movement of the mouth detected by the detection means into a command word uttered by the speaker;
Command signal generating means for generating the command signal corresponding to the command word converted by the converting means;
A voice command input device characterized by comprising:

The detection means includes
An acceleration sensor attached to the periphery of the speaker's cheek or mouth;
First conversion means for converting a detection value of acceleration output from the acceleration sensor into data of movement of the mouth of the speaker in response to the speaker speaking a voice command;
Including
The command word conversion means includes
First storage means for storing a first conversion table indicating a correspondence between a phoneme uttered by the speaker and a mouth movement corresponding to the phoneme;
Second storage means for storing one or more command words registered in advance;
Second conversion means for converting the mouth movement data of the speaker output from the first conversion means into a phoneme string using the first conversion table;
Speech recognition means for recognizing a command word uttered by the speaker by collating a sequence of phonemes converted by the second conversion means with a command word registered in the second storage means;
Including
The command signal generating means includes
Third storage means for storing a second conversion table showing a correspondence relationship between one or more command words and the command signal stored in the second storage means;
Third conversion means for converting a command word recognized by the voice recognition means into a command signal corresponding to the command word using the second conversion table;
The voice command input device according to claim 1, comprising:

The speech recognition means calculates the degree of coincidence between the command word registered in the second storage means and the phoneme string converted by the second conversion means, and has a degree of coincidence equal to or greater than a predetermined threshold. The voice command input device according to claim 2, wherein a command word having the highest degree of coincidence is recognized as a voice command uttered by the speaker.

Mouth movement data according to the phoneme uttered by the speaker is displacement vector data of the acceleration sensor based on a change in facial expression when the speaker utters speech based on a facial expression that does not utter speech. The voice command input device according to claim 2 or 3.

5. The acceleration sensor according to claim 1, wherein the acceleration sensor is a triaxial acceleration sensor that detects accelerations in three axial directions orthogonal to each other, and is mounted on both cheeks or both sides of the mouth of the speaker. Voice command input device.

Welding torch,
A wire feeding device for feeding a welding wire to the welding torch;
A welding power source for supplying welding power between the welding torch and the base material via the wire feeding device;
A command input device for inputting various commands to the welding power source;
A welding system comprising:
The said command input device is comprised by the audio | voice command input device in any one of Claims 1 thru | or 5, The welding system characterized by the above-mentioned.

The welding system according to claim 6, wherein the voice command input device is provided in the wire feeding device.

A remote control device that can be connected to the wire feeding device by wireless communication,
The welding system according to claim 7, wherein the voice command input device is provided in the remote control device.