JP2011075989A

JP2011075989A - Method and device for creating operation command signal

Info

Publication number: JP2011075989A
Application number: JP2009229636A
Authority: JP
Inventors: Eiji Uchino; 英治内野; Noriaki Suetake; 規哲末竹; Takaaki Koga; 崇了古賀
Original assignee: Yamaguchi University NUC
Current assignee: Yamaguchi University NUC
Priority date: 2009-10-01
Filing date: 2009-10-01
Publication date: 2011-04-14

Abstract

<P>PROBLEM TO BE SOLVED: To further diversify information which can be input when an information processing apparatus or an electronic device is operated by those who cannot freely use fingers, or that used under a noise environment, without making input operation complicated, and to make a device configuration simple. <P>SOLUTION: Bone conduction sound which is detected by a bone conduction microphone attached in one auricle is amplified and sampled, and it is stored in a memory. The bone conduction sound signal stored in the memory is taken out, and variable threshold processing is performed. Multi-class classification by partial space pattern classification is performed for occlusal sound candidates which are detected by the variable threshold processing. A signal according to an occlusal data of the occlusal sound candidates on which multi-class classification is performed is created as an operation command signal. <P>COPYRIGHT: (C)2011,JPO&INPIT

Description

本発明は操作指令信号生成方法及び操作指令信号生成装置に関し、より詳細には、骨伝導マイクロフォン等の加速度センサにより検知した信号からコンピュータや機器への操作指令信号を生成するようにした操作指令信号生成方法及び操作指令信号生成装置に関する。 The present invention relates to an operation command signal generation method and an operation command signal generation device, and more specifically, an operation command signal for generating an operation command signal to a computer or a device from a signal detected by an acceleration sensor such as a bone conduction microphone. The present invention relates to a generation method and an operation command signal generation device.

コンピュータ等の情報処理装置や指令操作に基づいて動作する電子機器において、通常の形態としては、キーボード、マウス、タッチパネル、コントローラ等の入力装置を用いて情報、操作指令の内容を入力する。このような入力装置ではキー、ボタン等を押したり、パネル上でタッチすることにより入力を行う。また、他に音声入力装置があり、これは音声認識装置を備えており、発声による指令を認識して入力を行うものである。 In an information processing device such as a computer or an electronic device that operates based on a command operation, as a normal form, information and contents of an operation command are input using an input device such as a keyboard, a mouse, a touch panel, or a controller. In such an input device, input is performed by pressing a key, a button, or the like, or touching on a panel. In addition, there is another voice input device, which is equipped with a voice recognition device, which recognizes and inputs a command by utterance.

ところで、病気や障害により手指が自由に使えない者はキーやボタンを操作して入力を行うことは困難であるため、キーボードやマウス等の操作に適合しないが、音声入力装置を用いることにより、情報処理装置や電子機器を操作することは可能になる。音声入力装置は発話能力があれば利用することができるが、発話障害のある場合には音声入力装置を使用することはできない。また、音声入力装置は周囲に他の者がいる場合に入力情報が知られるため、要秘性の事項に関して入力を行うには適切ではなく、また、周囲が騒音の大きい環境にあれば、音声認識が不正確または不可能になるという欠点がある。 By the way, those who cannot use their fingers freely due to illness or disability are difficult to operate by operating keys and buttons, so they are not suitable for keyboard and mouse operations, but by using a voice input device, It becomes possible to operate the information processing apparatus and the electronic device. The voice input device can be used as long as it has a speech ability, but the voice input device cannot be used when there is a speech failure. Also, since the input information is known when there is another person in the surroundings, the voice input device is not suitable for inputting confidential matters, and if the surroundings are in a noisy environment, the voice input device The disadvantage is that recognition becomes inaccurate or impossible.

手指を自由に使えない者にも操作入力を行うことができ、あるいは周囲の者に知覚されない形で操作入力信号を生成するための手段として、骨伝導マイクロフォンを用いたものがあり、これについて提案、開発がなされている。骨伝導マイクロフォンは、発声時に生じる頭蓋骨の振動を検知する音声信号を検出するイヤホン型の加速度センサであり、これを耳介に取り付けて検出を行う。骨伝導マイクロフォンは空中を伝播する周囲の雑音を検知しないので、これを用いた操作入力装置は雑音環境下においても利用可能性がある。 There is a method using a bone conduction microphone as a means to generate an operation input signal in a form that can be input even to those who can not use fingers freely or is not perceived by surrounding people. Development has been done. The bone conduction microphone is an earphone-type acceleration sensor that detects an audio signal for detecting a vibration of the skull that is generated at the time of utterance, and is detected by attaching it to the auricle. Since the bone conduction microphone does not detect ambient noise propagating in the air, the operation input device using this can be used even in a noisy environment.

骨伝導マイクロフォンあるいは音声入力装置を用いた操作入力装置に関して、以下のような文献に開示されている。特許文献１には、耳介の後方下部に骨伝導マイクロフォンを取り付け、つぶやき音、歯咬音、舌打ち音などを採取し、採取された信号に対しＡＤ変換、量子化、音響モデルを用いた音声認識等の処理を行うことについて記載されており、また、特許文献２には、骨伝導マイクロフォンで口腔内の物音を検知し、予め処理内容ごとに登録されたデータと照合することにより処理内容を判定する入力方法、処理システムについて記載されている。 An operation input device using a bone conduction microphone or a voice input device is disclosed in the following documents. In Patent Document 1, a bone conduction microphone is attached to the lower rear part of the auricle, and muttering sounds, tooth biting sounds, tongue hitting sounds, and the like are collected, and AD conversion, quantization, and sound using an acoustic model are collected on the collected signals. It describes that processing such as recognition is performed, and Patent Document 2 detects the sound in the oral cavity with a bone conduction microphone and compares the processing content with data registered in advance for each processing content. An input method and a processing system for determination are described.

特許文献３には、操作者の顎の開閉動作により生じる超低周波音をイヤホンにより検出し、超低周波数を波形成形して顎の開閉動作回数に対応する数のパルス信号を生成し、このパルス数に対応して機器を制御することについて記載されており、また、特許文献４には、骨伝導マイク等を用いた音声入力装置により音声をピックアップし、それが操作データテーブルに登録されたものと判別された場合に、音声に対応づけられたコマンドを実行するようにした電子装置について記載されている。 In Patent Document 3, an ultra-low frequency sound generated by an operator's jaw opening / closing operation is detected by an earphone, and an ultra-low frequency waveform is formed to generate a pulse signal having a number corresponding to the number of jaw opening / closing operations. It describes that the device is controlled according to the number of pulses, and Patent Document 4 picks up a voice by a voice input device using a bone conduction microphone or the like and registers it in the operation data table. An electronic device that executes a command associated with a voice when it is determined to be a device is described.

特許文献５には、口腔内で発生する音を骨伝導センサにより検出し、呼び出し信号としての特徴量を抽出し一定の閾値を超えた場合に呼び出し信号を子機に送出するようにした骨伝導呼出装置について記載され、また、特許文献６には、左右の耳にマイクロフォンを取り付けて歯の噛み合わせ音及び発生位置を検出し、発生位置に応じたコマンドを発生させるようにした操作指令入力装置について記載されている。 In Patent Document 5, the bone conduction sensor detects a sound generated in the oral cavity with a bone conduction sensor, extracts a feature value as a call signal, and sends a call signal to a child device when a certain threshold is exceeded. Patent Document 6 describes an operation command input device in which microphones are attached to left and right ears to detect a tooth engagement sound and a generation position, and a command corresponding to the generation position is generated. Is described.

特許文献７には、骨伝導マイクロフォンによって口腔内音を検出し、その検出信号によりスイッチを操作させるようにして、病気やけが等で手が使えない者でも操作できる操作スイッチについて記載されており、また、特許文献８には、顎の骨または皮膚表面を伝わる加速度を骨動マイクにより検出し、検出信号が歯を噛み合わせた時に発生する特定の周波数成分を含む場合にトリガ信号を発生し、その時系列のトリガ信号に基づいて電話の送受信を行うようにした常装着型電話装置について記載されている。 Patent Document 7 describes an operation switch that can be operated even by a person who cannot use a hand due to illness, injury, etc., by detecting sound in the oral cavity with a bone conduction microphone and operating the switch by the detection signal. Further, in Patent Document 8, the acceleration transmitted through the jaw bone or the skin surface is detected by a bone motion microphone, and a trigger signal is generated when the detection signal includes a specific frequency component generated when the teeth mesh with each other. There is a description of a permanent wearable telephone device that transmits and receives a telephone call based on the time-series trigger signal.

これらのうち、特許文献１〜５、７、８によるものでは、骨伝導マイクロフォンを用いて顎の開閉動作等を検知しそれに基づいて入力、機器の操作を行うようにしているが、１回の顎の１回の動作はそれ自体として認識されるのであり、動作の種類は識別されないので、オンオフ動作によるスイッチ機能による操作装置として用いるには妥当性があるが、入力装置として用いる場合には、多種の入力内容に応じて多くの動作を必要とすることになり、それだけ操作が煩雑とならざるを得ない。 Among these, in Patent Documents 1 to 5, 7, and 8, the bone conduction microphone is used to detect the opening and closing operation of the jaw, and based on this, input and operation of the device are performed. One movement of the jaw is recognized as itself, and since the type of movement is not identified, it is appropriate to be used as an operation device with a switch function by on / off operation, but when used as an input device, Many operations are required according to various types of input contents, and the operation must be complicated accordingly.

特許文献６によるものでは、耳孔挿入式のヘッドホンに左側マイク及び右側マイクを備え、左右のマイクによって検出された音を認識処理することにより口内音とその検出位置とを検出し、発生位置に応じたコマンドを発生するようにしている。この音認識処理によっては右側の歯の噛み合わせと左側の歯の噛み合わせとを識別するということが可能である程度であり、入力情報の内容としてそれ以上の多様性は可能性が少なく、用途としても携帯音響装置の操作のような単純化されたものにとどまるとともに、センサは左右両側のものを備えることが前提になり、機能の割には装置構成がそれだけ煩雑なものになる。 According to Patent Document 6, an ear canal insertion type headphone is provided with a left microphone and a right microphone, and the sound detected by the left and right microphones is recognized to detect the mouth sound and its detection position, and the sound is detected according to the generation position. Command is generated. Depending on the sound recognition process, it is possible to distinguish between the meshing of the right tooth and the meshing of the left tooth, and there is little possibility of further diversity as the contents of the input information. However, the sensor is assumed to be simplified as in the operation of the portable audio device, and the sensor is assumed to be provided on both the left and right sides, and the device configuration is complicated for the function.

特許第３７６０１７３号公報Japanese Patent No. 3760173 特開２００２−１６２９９０号公報JP 2002-162990 A 特開２００７−１１６２７号公報JP 2007-11627 A 特開２００４−１９８４８８号公報JP 2004-198488 A 特開２００６−１８０９９４号公報JP 2006-180994 A 特開２００６−３４３９６５号公報JP 2006-343965 A 特開２００５−１３０４２７号公報JP 2005-130427 A 特許第３５２５６６５号公報Japanese Patent No. 3525665

手指を自由に使えない者でも情報処理機器や電子機器に関し入力操作、指令操作を行うようにし、また、騒音の環境下において、あるいは周囲にいる者に知られないような形でこれらの機器への入力、操作を行えるようにするための入力信号生成について、従来の手法によるものでは、入力すべき情報内容にそれほど多様性が可能でなく、そのため歯の噛み合わせのような動作をそれだけ多く反復する必要があるというように、操作が煩雑になることを免れないものであった。また、左右の音発生の位置を検出することを可能とするというものについても、それ以上の音認識の多様性はなされず、装置構成としてやはり煩雑性を伴うものになっていた。 Even those who cannot use their fingers are allowed to perform input and command operations on information processing equipment and electronic devices, and to these devices in a noisy environment or in a form that is unknown to those around them. For the input signal generation to enable the input and operation of the input, the conventional method does not allow a great variety of information contents to be input, and therefore, the operation such as tooth meshing is repeated many times. Therefore, it is inevitable that the operation is complicated. In addition, regarding the ability to detect the positions of left and right sound generation, there is no further diversity in sound recognition, and the apparatus configuration is also complicated.

このような状況において、情報処理装置や電子機器への入力可能な情報をより多様なものにし、入力操作を煩雑にせず、さらに装置構成をより簡素なものにして操作入力を行えるようにすることが求められていた。 Under such circumstances, the information that can be input to the information processing apparatus and the electronic device is made more diverse, the input operation is not complicated, and the apparatus configuration is further simplified so that the operation input can be performed. Was demanded.

本発明は前述した課題を解決すべくなしたものであり、本発明による操作指令信号生成方法は、一方の耳介に取り付けられた骨伝導マイクロフォンにより骨伝導音を検知し骨伝導音信号を生成することと、該骨伝導音信号を増幅しサンプリングしてメモリに格納することと、前記格納された骨伝導音信号を取り出し可変閾値処理を行って歯咬音候補を検出することと、前記検出された歯咬音候補に対して部分空間パターン識別による多クラス識別を行うことと、前記多クラス識別された歯咬音候補の歯咬音データに応じた信号を操作指令信号として生成することと、からなるものである。 The present invention has been made to solve the above-described problems, and an operation command signal generation method according to the present invention generates a bone conduction sound signal by detecting a bone conduction sound with a bone conduction microphone attached to one auricle. Amplifying and sampling the bone conduction sound signal, storing it in a memory, taking out the stored bone conduction sound signal, performing variable threshold processing, and detecting a tooth biting sound candidate; and Performing multi-class identification by partial space pattern identification on the identified dental bite sound candidate, and generating a signal corresponding to the dental bite sound data of the multi-class identified dental bite sound candidate as an operation command signal; , Is made up of.

骨伝導音の発生位置から前記一方の耳介に取り付けられた骨伝導マイクロフォンまでの骨伝導音の伝達経路が異なり骨伝導音に含まれる特徴が変化することにより異なる位置での骨伝導音の識別がなされるものとしてもよい。 Differentiating the bone conduction sound transmission path from the bone conduction sound generation position to the bone conduction microphone attached to the one auricle, and distinguishing the bone conduction sound at different positions by changing the characteristics included in the bone conduction sound It is good also as what is made.

前記骨伝導音マイクロフォンにより採取された骨伝導音に関し、少なくとも前歯、左歯、右歯の噛み合わせ動作を識別して認識するものとしてもよい。頭部の動作を検知するセンサにより採取された頭部動作に応じたデータを取得し、該頭部動作に応じた信号と前記歯咬音データに応じた信号とを組み合わせて操作指令信号を生成するようにしてもよい。 Regarding the bone conduction sound collected by the bone conduction sound microphone, at least the meshing operation of the front teeth, left teeth, and right teeth may be identified and recognized. Acquires data corresponding to the head movement collected by the sensor that detects the head movement, and generates an operation command signal by combining the signal corresponding to the head movement and the signal corresponding to the tooth biting sound data You may make it do.

本発明による操作指令信号生成装置は、骨伝導マイクロフォンと、該骨伝導マイクロフォンにより生成された骨伝導音信号を増幅する増幅部と、増幅された前記骨伝導音信号をサンプリングし骨伝導音データとして格納するデータロッガと、該データロッガのメモリに格納された骨伝導音データに対し歯咬音候補データを切り出す可変閾値処理部と、前記切り出された歯咬音候補データに対し多クラス識別を行う部分空間パターン識別処理部と、多クラス識別がなされた歯咬音候補データに応じた操作指令信号を生成する操作指令信号生成部とを備えるものである。 An operation command signal generation apparatus according to the present invention includes a bone conduction microphone, an amplification unit that amplifies a bone conduction sound signal generated by the bone conduction microphone, and samples the amplified bone conduction sound signal as bone conduction sound data. A data logger to be stored, a variable threshold processing unit for cutting out tooth biting sound candidate data from bone conduction sound data stored in the memory of the data logger, and a partial space for performing multi-class identification on the cut out tooth biting sound candidate data A pattern identification processing unit and an operation command signal generation unit that generates an operation command signal corresponding to the tooth and bite sound candidate data subjected to multi-class identification are provided.

骨伝導音の発生位置から前記一方の耳介に取り付けられた骨伝導マイクロフォンまでの骨伝導音の伝達経路が異なり骨伝導音に含まれる特徴が変化することにより異なる位置での骨伝導音の識別がなされようにしてもよい。 Differentiating the bone conduction sound transmission path from the bone conduction sound generation position to the bone conduction microphone attached to the one auricle, and distinguishing the bone conduction sound at different positions by changing the characteristics included in the bone conduction sound May be made.

前記部分空間パターン識別処理部は前記骨伝導音マイクロフォンにより採取された骨伝導音に関し、少なくとも前歯、左歯、右歯の噛み合わせ動作を識別して認識するようにしてもよい。頭部の動作を検知するセンサと、該センサにより採取された頭部動作に応じたデータを取得し、該頭部動作に応じた信号を生成する頭部動作認識部とをさらに備え、前記操作指令信号生成部においては前記歯咬音データに応じた信号と前記頭部動作認識部により生成された信号とを組み合わせて操作指令信号を生成するようにしてもよい。 The partial space pattern identification processing unit may identify and recognize at least the engagement operation of the front teeth, left teeth, and right teeth with respect to the bone conduction sounds collected by the bone conduction sound microphone. A sensor for detecting the movement of the head; and a head movement recognition unit that acquires data corresponding to the head movement collected by the sensor and generates a signal corresponding to the head movement. The command signal generation unit may generate an operation command signal by combining a signal corresponding to the tooth biting sound data and a signal generated by the head movement recognition unit.

本発明においては、一方の耳介に取り付けられた骨伝導マイクロフォンにより骨伝導音を検知し、骨伝導音の発生位置から骨伝導マイクロフォンまでの伝達経路が異なることに応じて異なる位置で発生した骨伝導音を空間パターン識別によって精度よく多クラス識別することができ、前歯、右歯、左歯による噛み合わせ動作等を識別することにより、情報処理装置や電子機器を手指が自由に使えない者が操作する場合や騒音環境下で使用する場合にも入力可能な情報をより多様なものにし、入力操作を煩雑にせず、さらに装置構成をより簡素なものにすることができる。また、カメラや赤外線センサ等と組み合わせた形で構成することによりさらに入力情報を多様化した操作指令入力信号を生成することができる。 In the present invention, bone conduction sound is detected by a bone conduction microphone attached to one auricle, and bones generated at different positions according to different transmission paths from the bone conduction sound generation position to the bone conduction microphone. Conducted sound can be identified in multiple classes with high accuracy by spatial pattern identification, and by identifying the meshing movements of front teeth, right teeth, left teeth, etc., those who cannot use information processing devices and electronic devices freely Information that can be input even when operated or used in a noisy environment can be diversified, the input operation is not complicated, and the apparatus configuration can be further simplified. In addition, an operation command input signal in which input information is further diversified can be generated by being configured in combination with a camera, an infrared sensor, or the like.

骨伝導マイクロフォンで得られる信号を示す図である。It is a figure which shows the signal obtained with a bone conduction microphone. 歯咬音の波形と持続時間を示す図である。It is a figure which shows the waveform and duration of a tooth biting sound. 可変閾値処理を用いた歯咬音検出処理を説明する図である。It is a figure explaining the tooth biting sound detection process using a variable threshold process. 可変閾値処理による歯咬音検出結果を示す図である。It is a figure which shows the tooth biting sound detection result by a variable threshold value process. ２段階の閾値処理による歯咬音検出処理を説明する図である。It is a figure explaining the tooth biting sound detection process by the threshold value process of two steps. ２段階の閾値処理による歯咬音の検出結果を示す図である。It is a figure which shows the detection result of the tooth biting sound by a threshold value process of two steps. 歯咬音の識別についての評価試験結果を示す図である。It is a figure which shows the evaluation test result about identification of a tooth biting sound. 歯咬音識別の手順を示す図である。It is a figure which shows the procedure of tooth biting sound identification. 操作指令信号生成装置の構成を示す図である。It is a figure which shows the structure of the operation command signal generation apparatus.

本発明による操作指令信号生成方法及び操作指令信号生成装置においては、例えば骨伝導マイクロフォンを左右いずれか一方の耳穴に挿入した状態で歯の噛み合わせとその位置、舌打ち、飲み込みのような動作等を検出し識別し、それに基づいて操作指令信号を生成する。 In the operation command signal generation method and the operation command signal generation device according to the present invention, for example, with a bone conduction microphone inserted into either the left or right ear hole, tooth engagement and its position, tongue-like operation, swallowing operation, etc. An operation command signal is generated based on the detection and identification.

骨伝動マイクロフォンは耳介に取り付けた場合に発声等の動作により生じる振動を頭蓋骨を介して検知する加速度検出型センサである。耳介に取り付けた場合に頭蓋骨を介して振動を検知する。騒音環境下でも空中を伝播する周囲の雑音等は検知せずに、身体の動作に応じた振動を検知することができる。ただし、振動としては、歯の噛み合わせの動作を意図した場合、それによる振動のほかに、舌打ち、首振り等他の意図しない身体動作による振動をも検知する。そのため、操作指令入力信号生成のためには、骨伝導信号の種類を識別する必要がある。 The bone transmission microphone is an acceleration detection type sensor that detects vibration generated by an operation such as utterance through the skull when attached to the auricle. When attached to the pinna, vibration is detected through the skull. Even in a noisy environment, it is possible to detect vibrations according to body movements without detecting ambient noise or the like propagating in the air. However, as the vibration, when the operation of meshing teeth is intended, in addition to the vibration caused by this, vibration due to other unintended body movement such as tongue hitting and swinging is also detected. Therefore, in order to generate an operation command input signal, it is necessary to identify the type of bone conduction signal.

本発明では、一方の耳介に取り付けた骨伝導マイクロフォンにより得られた歯咬音の信号を解析処理して前歯、右歯、左歯のいずれであるかを識別するというように入力信号のパターン識別を行うという手法を用いて操作指令信号の生成を行う。そこで、歯咬音のパターン識別について説明する。 In the present invention, the pattern of the input signal is such that the tooth bite sound signal obtained by the bone conduction microphone attached to one pinna is analyzed to identify the front tooth, right tooth, or left tooth. An operation command signal is generated using a technique of identifying. Thus, pattern identification of tooth biting sound will be described.

［歯咬音のパターン識別］
骨伝導マイクロフォンを右耳に装着し、前歯、左歯、右歯のみが接するように意識して歯を噛み合わせる場合について説明する。噛み合わせの際には、
・何回咬んでも疲れない強さで行う
・歯の接触位置は毎回同じにする
・楽な姿勢で、大きく体を動かさない
というような条件を課す。信号採取に際しては、骨伝導マイクロフォンを使用中に発声する、唾を飲み込む、首を動かすというような、使用者の意図にかかわらず発声する歯咬音以外の骨伝導音も採取する。 [Pattern identification of tooth bite]
A case will be described in which a bone conduction microphone is attached to the right ear and the teeth are meshed with the consciousness that only the front, left, and right teeth are in contact. When meshing,
・ Do not get tired no matter how many times you bite. ・ Make the contact position of the teeth the same every time. ・ Place a comfortable posture and do not move your body. When collecting a signal, bone conduction sounds other than the tooth biting sounds that are uttered regardless of the user's intention, such as speaking while swallowing the bone conduction microphone, swallowing saliva, and moving the neck are also collected.

（１）歯咬音の検出
音声や身体動作によって発生した振動を骨伝導マイクロフォンにより骨伝導音として検知する。検知された骨伝導音の信号は数ｍＶ程度で微弱であるため、増幅器により３５ｄＢ増幅した後、サンプリング周波数２０ｋＨｚで採取し、メモリに格納する。 (1) Detection sound of tooth biting sound and vibration generated by body movement are detected as bone conduction sound by a bone conduction microphone. Since the detected bone conduction sound signal is weak at about several mV, it is amplified by 35 dB with an amplifier, and then collected at a sampling frequency of 20 kHz and stored in a memory.

骨伝導マイクロフォンで得られる信号は図１に示すようなものであり、これから歯咬音の識別候補となる区間を自動的に検出するために、図２に示すように、歯咬音の持続時間とその特徴を解析する。その結果、歯咬音がピーク値をとる時点を基準として、その前５ｍｓｅｃ、後１０ｍｓｅｃの区間に歯咬音が収まることが確認される。したがって、このピーク値の前後１５ｍｓｅｃの区間に含まれる信号を歯咬音の識別候補として自動検出する。 The signal obtained by the bone conduction microphone is as shown in FIG. 1, and in order to automatically detect a section that is a candidate for identification of tooth bite sound from now on, as shown in FIG. And its characteristics are analyzed. As a result, it is confirmed that the tooth biting sound falls within a section of 5 msec before and 10 msec after the time when the tooth biting sound takes a peak value. Therefore, a signal included in a 15 msec section before and after the peak value is automatically detected as a tooth and sound identification candidate.

ピーク値の検出方法については、２５６ｍｓｅｃの幅（Ｗ）の検出窓に含まれる信号の絶対値の平均値ηを用いて可変閾値Ｖ_ｔｈを次のように設定する（図３）。
Ｖ_ｔｈ＝η×２^４（１）
検出窓中の信号の絶対値の最大値が可変閾値よりも大きい時、その検出窓内に歯咬音候補となる信号が含まれているとし、ピーク値前後１５ｍｓｅｃの区間を切り出す。 Regarding the detection method of the peak value, the variable threshold value V _th is set as follows using the average value η of the absolute values of the signals included in the detection window having a width (W) of 256 msec (FIG. 3).
V _th = η × 2 ⁴ (1)
When the maximum value of the absolute value of the signal in the detection window is larger than the variable threshold, it is assumed that the detection window includes a signal that is a candidate for a dental bite sound, and a section of 15 msec before and after the peak value is cut out.

この方法を１回行うのみでは、図４に示すように、雑音を歯咬音として検出する可能性がある。そのため、可変閾値処理によって検出した信号に対して、もう一度図５のように閾値（歯咬音の最小振幅値）を用いた処理を行うことにより、雑音を除去する。この２段階の閾値処理により、図６に示すような歯咬音候補を検出し、パターン識別を行う。 If this method is performed only once, there is a possibility that noise is detected as a dental bite as shown in FIG. Therefore, noise is removed by performing a process using a threshold value (minimum amplitude value of tooth-biting sound) once again on the signal detected by the variable threshold process as shown in FIG. By this two-stage threshold processing, tooth biting sound candidates as shown in FIG. 6 are detected and pattern identification is performed.

（２）歯咬音のパターン識別
検出された歯咬音となる信号に対して、代表的な部分空間パターン識別法であるクラフィック（ＣＬＡＦＩＣ）法を用いてパターン識別を行う。識別のための部分空間を作成する特徴量として、１５ｍｓｅｃ間の時系列信号をそのまま用いる。 (2) Pattern identification of tooth biting sound Pattern identification is performed on a signal that becomes a detected tooth biting sound by using a CLAFIC method, which is a typical subspace pattern identification method. A time series signal for 15 msec is used as it is as a feature value for creating a partial space for identification.

識別のための部分空間を構成するために、予めクラス情報が既知の信号を用い、クラスごとに部分空間を作成する。あるクラスｉに属するパターンｘ（パターン数ｎ_ｉ）のクラス自己相関行列

の固有値を求め、大きい順に並び替える。さらに、この時の固有値λ_ｉｊに対応する固有ベクトルｕ_ｉｊ（ｊ＝１，……，ｄ）によって張られる空間をクラスｉの部分空間とする。ここで、部分空間の次元ｄ_ｉは、累積寄与率

が０．９以上になるように決定する。 In order to construct a partial space for identification, a signal whose class information is already known is used to create a partial space for each class. Class autocorrelation matrix of pattern x (number of patterns n _i ) belonging to a certain class i

The eigenvalues of are obtained and rearranged in descending order. Further, a space spanned by the eigenvector u _ij (j = 1,..., D) corresponding to the eigenvalue λ _ij at this time is defined as a subspace of class i. Here, the dimension d _i of the subspace is the cumulative contribution rate

Is determined to be 0.9 or more.

実際の識別段階では、このようにして得られた各クラスの部分空間とクラス情報が未知な信号との類似度

を求める。 In the actual identification stage, the similarity between the subspace of each class obtained in this way and the signal with unknown class information

Ask for.

また、未知信号と各クラスの部分空間の類似度のうち、最大類似度が閾値以上であれば該当するクラスの歯咬音（前歯、右歯、左歯）とし、閾値以下であれば雑音（唾の飲み込み音、首振り等の身体動作により生じる音）として信号をリジェクトする。 In addition, among the similarities between the unknown signal and the subspace of each class, if the maximum similarity is equal to or greater than a threshold, the tooth biting sound (front tooth, right tooth, left tooth) of the corresponding class is used. The signal is rejected as a swallowing sound of the saliva or a sound generated by physical movements such as a head swing.

［パターン識別の有効性］
前述のパターン識別法により、検出された歯咬音の識別がなされることについて、実際に被検者から採取した信号に関してパターン識別を行い確認する。実際例として、１５人の被検者の各々について歯咬音をそれぞれ３０サンプル、ノイズを５０サンプルずつ用い、３種類の歯咬音（前歯、右歯、左歯）とノイズの混合した信号からパターン識別法により多クラスに識別した結果に関しクロスバリデーションにより識別性能の評価を行っており、その結果を図７に示す。 [Effectiveness of pattern identification]
The fact that the detected tooth bite sound is identified by the pattern identification method described above is confirmed by performing pattern identification on the signal actually collected from the subject. As an actual example, for each of 15 subjects, 30 samples of tooth biting sound and 50 samples of noise were used, respectively, and signals from a mixture of three types of tooth biting sounds (anterior teeth, right teeth, left teeth) and noise. With respect to the results of multi-class discrimination by the pattern discrimination method, the discrimination performance is evaluated by cross validation, and the results are shown in FIG.

クロスバリデーションに際しては、データ全体を５グループに分け、１つのグループに対し他のグループのデータを用いた推定評価を行うことを５回反復し平均値を算出する。図７において、黒棒は３種類の歯咬音とノイズの混合した信号から歯咬音を正しいクラスに識別できた正解率の平均値を識別率（感度％）として被検者ごとに示し、白棒は異なる部位の歯咬音と誤識別した割合（偽陽性率％）を示している。前歯、右歯、左歯の各歯咬音についての１５人の被検者についての感度及び偽陽性率とその平均値は表１のようであった。 At the time of cross-validation, the entire data is divided into five groups, and estimation evaluation using data of another group is repeated five times for one group, and an average value is calculated. In FIG. 7, the black bar shows the average value of the correct answer rate that can identify the tooth bite sound in the correct class from the mixed signal of the three kinds of tooth bite sound and noise as the discrimination rate (sensitivity%) for each subject. The white bar indicates the rate of false identification (false positive rate%) as a tooth biting sound of a different part. Table 1 shows the sensitivity, false-positive rate, and average value of 15 subjects for the front teeth, right teeth, and left teeth.

このように、採取された歯咬音データに対するパターン識別により、前歯、右歯、左歯の歯咬音を９０％程度の精度で識別できるのであり、このようなパターン識別を用いて歯咬音の識別に基づいた操作指令信号の生成を行うことができる。 As described above, the pattern identification with respect to the collected tooth biting sound data can identify the tooth biting sounds of the front teeth, the right teeth, and the left teeth with an accuracy of about 90%. The operation command signal can be generated based on the identification.

前述の例では、部分空間パターン識別法としてクラフィック法を用いた例について示したが、多重線形判別分析法（ＭＤＡ）あるいは他の非線形部分空間識別法を用いることによっても歯咬音の識別がなされ、これらのいずれかのパターン識別法を用いて多クラス識別器を構成することができる。また、特徴量についても、識別手段の前段に特徴抽出手段を導入し、骨伝導音のスペクトラムを特徴量としてもよい。 In the above example, the example using the graphic method as the subspace pattern identification method has been shown. However, the multiple bite discriminant analysis method (MDA) or other nonlinear subspace identification method can also be used to identify the occlusal sound. Thus, a multi-class classifier can be configured using any one of these pattern identification methods. As for the feature amount, feature extraction means may be introduced before the identification means, and the spectrum of bone conduction sound may be used as the feature amount.

前述においては、信号を採取するセンサとして骨伝導マイクロフォンを一方の耳介に取り付けて用いている。骨伝導マイクロフォンは、加速度検出型のセンサであって、耳穴に装着した場合頭蓋骨を介して伝わる振動を検知し、前歯、右歯、左歯の歯咬音が異なる形の振動としてセンサに伝わることにより識別が可能になる。そのことから、１つのセンサで複数の振動発生位置を検出することができるものである。 In the foregoing, a bone conduction microphone is attached to one auricle as a sensor for collecting signals. A bone conduction microphone is an acceleration detection type sensor that detects vibrations transmitted through the skull when worn in the ear canal, and the biting sounds of the front, right, and left teeth are transmitted to the sensor as different forms of vibration. Can be identified. Therefore, a plurality of vibration generation positions can be detected by one sensor.

本発明においては、一方の耳介に取り付けられた骨伝導マイクロフォンにより骨伝導音を検知し、骨伝導音の発生位置から骨伝導マイクロフォンまでの伝達経路が異なることに応じて異なる位置で発生した骨伝導音を空間パターン識別によって精度よく多クラス識別することができる。 In the present invention, bone conduction sound is detected by a bone conduction microphone attached to one auricle, and bones generated at different positions according to different transmission paths from the bone conduction sound generation position to the bone conduction microphone. Conducted sounds can be identified in multiple classes with high accuracy by spatial pattern identification.

検出対象としては、前歯、右歯、左歯の場合について説明したが、他にハミング、飲み込み、首振り等の動作に伴う振動についても検知、識別が可能である。これらは前歯、右歯、左歯の歯咬音に着目した場合はノイズとなるが、それら他の振動についても着目し、識別されれば検知対象信号として扱える。頭部以外にも、肩部、腹部、脚部等での振動発生についても骨伝導マイクロフォンで検知可能であり、それらを識別し信号として扱うことも可能である。 Although the case of the front tooth, the right tooth, and the left tooth has been described as the detection target, vibrations associated with operations such as humming, swallowing, and swinging can also be detected and identified. These become noise when focusing on the tooth biting sound of the front teeth, right teeth, and left teeth, but also pay attention to these other vibrations and can be treated as detection target signals if they are identified. In addition to the head, vibrations in the shoulder, abdomen, legs, etc. can also be detected by the bone conduction microphone, and these can be identified and handled as signals.

［歯咬音識別における処理の手順］
歯咬音識別を概括してその処理の手順を示すと図８のようになる。はじめに、（ａ）音声や身体動作によって発生した振動を骨伝導マイクロフォンにより骨伝導音として検知する。 [Procedures for tooth biting sound identification]
FIG. 8 shows a general procedure for the identification of tooth biting sound and shows the processing procedure. First, (a) vibration generated by voice or body movement is detected as bone conduction sound by a bone conduction microphone.

（ｂ）検知された骨伝導音の信号は数ｍＶ程度で微弱であるため、増幅器により３５ｄＢ程度増幅し、サンプリング周波数２０ｋＨｚで採取しメモリに格納する。 (B) Since the detected bone conduction sound signal is weak at about several mV, it is amplified by an amplifier by about 35 dB, collected at a sampling frequency of 20 kHz, and stored in a memory.

（ｃ）現時刻より一定時間前の信号の平均値を時間による可変閾値として保持しておき、メモリ格納された信号を逐次スキャンし、可変閾値処理を行って、信号の振幅値が可変閾値を上回る信号を検出した場合に歯咬音候補を部分的に切り出し、メモリに格納するというようにして、歯咬音候補検出を行う。信号の切り出しを終えた時点で、それ以前の信号をメモリから消去する。この操作により、歯咬音以外の音声信号等はほぼ除去される。 (C) An average value of a signal a predetermined time before the current time is held as a variable threshold value according to time, a signal stored in the memory is sequentially scanned, variable threshold processing is performed, and the amplitude value of the signal is set to a variable threshold value. Tooth bite sound candidate detection is performed by partially cutting out the tooth bite sound candidate when the signal exceeding the signal is detected and storing it in the memory. When the signal is cut out, the previous signal is erased from the memory. By this operation, audio signals other than the tooth biting sound are almost removed.

（ｄ）（ｃ）で切り出された信号についてノルムによる正規化処理を行って部分空間パターン識別法による多クラス識別を行う。それにより当初の歯咬音データに対して、前歯、右歯、左歯の歯咬音であるというような識別がなされる。 (D) The signal cut out in (c) is subjected to normalization processing using a norm to perform multi-class identification by the subspace pattern identification method. As a result, the initial tooth biting sound data is identified as being the biting sound of the front teeth, right teeth, and left teeth.

上述の（ｂ）〜（ｄ）において、メモリに格納された複数の歯咬音信号に対し逐次スキャンして多クラス識別を行っているが、各歯咬音信号について逐次多クラス識別を行うようにしてもよい。この場合、メモリは採取された骨伝導音信号を一時的に保持するバッファとなる。 In the above (b) to (d), multi-class identification is performed by sequentially scanning a plurality of tooth-biting sound signals stored in the memory. It may be. In this case, the memory is a buffer that temporarily holds the collected bone conduction sound signal.

［操作指令信号生成装置］
骨伝導マイクロフォンを用いて骨伝導音を採取し、それに対して部分空間パターン識別による多クラス識別を行って歯咬音を識別することにより、意図された歯咬音（前歯、右歯、左歯）に応じた信号を生成し、操作指令入力とすることができる。このような操作指令信号生成のための装置は、前述の（ａ）〜（ｄ）の手順を実行するものとして構成され、図９に示されるようになる。 [Operation command signal generator]
By collecting bone conduction sound using a bone conduction microphone and performing multi-class discrimination by subspace pattern identification to identify the tooth bite sound, the intended tooth biting sound (anterior teeth, right teeth, left teeth) ) To generate an operation command input. Such an apparatus for generating an operation command signal is configured to execute the above-described procedures (a) to (d) and is as shown in FIG.

図９の操作指令信号生成装置は大略的に骨伝導マイクロフォン１０、増幅器２０、データロッガ３０，歯咬音信号処理部４０、操作指令入力信号生成部５０を備えており、また、６０は生成された操作指令信号を受ける機器（情報機器、電子機器等）である。増幅器２０は骨伝導マイクロフォン１０により検知された骨伝導音信号を３５ｄＢ程度のレベルに増幅する。データロッガ３０は増幅された信号をＡＤ変換し、周波数２０ｋＨｚでサンプリングするサンプリング部３１とサンプリングされた信号を歯咬音データとして格納するためのメモリ３２とからなる。 The operation command signal generation device of FIG. 9 generally includes a bone conduction microphone 10, an amplifier 20, a data logger 30, a tooth biting signal processing unit 40, and an operation command input signal generation unit 50, and 60 is generated. A device (information device, electronic device, etc.) that receives an operation command signal. The amplifier 20 amplifies the bone conduction sound signal detected by the bone conduction microphone 10 to a level of about 35 dB. The data logger 30 includes an A / D conversion of the amplified signal and a sampling unit 31 for sampling at a frequency of 20 kHz and a memory 32 for storing the sampled signal as tooth biting sound data.

歯咬音信号処理部４０はメモリ３２に格納された歯咬音データを逐次呼び出し、可変閾値処理により歯咬音候補を検出する可変閾値処理部４１と、検出された歯咬音候補に対して空間パターン識別を用いたパターン識別処理により歯咬音の多クラス識別を行うパターン識別処理部４２とからなる。操作指令信号生成部５０では多クラス識別処理がなされた歯咬音信号に応じた操作指令信号を生成し、機器６０に送出する。 The tooth biting sound signal processing unit 40 sequentially calls the tooth biting sound data stored in the memory 32, detects the tooth biting sound candidate by the variable threshold processing, and the detected tooth biting sound candidate. The pattern identification processing unit 42 performs multi-class identification of tooth biting sound by pattern identification processing using spatial pattern identification. The operation command signal generation unit 50 generates an operation command signal corresponding to the tooth biting sound signal that has been subjected to the multi-class identification process, and sends it to the device 60.

図９の操作指令信号生成装置は採取された歯咬音信号を多クラスに識別し、操作者が意図した噛み合わせの動作に応じた信号から機器への入力信号を生成する。噛み合わせの動作としては、特に前歯、右歯、左歯の噛み合わせに関して説明しているが、飲み込み動作、首振り動作等についても識別すれば、さらに入力信号は多様化される。 The operation command signal generation device of FIG. 9 identifies the collected tooth biting sound signals in multiple classes, and generates an input signal to the device from a signal corresponding to the meshing operation intended by the operator. As the meshing operation, the description has been made especially with respect to the meshing of the front teeth, the right teeth, and the left teeth. However, if the swallowing operation, the swinging operation, and the like are identified, the input signals are further diversified.

また、図９の操作指令信号生成装置は、センサとして骨伝導マイクロフォンを用いた形態のものであるが、さらに他の既存のセンサ、例えばカメラ、赤外線センサ等を併せて備える形態として、入力信号を多様化するような入力信号生成装置として構成することができる。 In addition, the operation command signal generation device of FIG. 9 has a configuration using a bone conduction microphone as a sensor, but further includes other existing sensors such as a camera, an infrared sensor, etc. It can be configured as a diversified input signal generation device.

例えば、カメラにより頭部を撮影した動画からジェスチャ認識処理によって頭部の動作を認識してマウスポインタの機能を与え、前歯、右歯、左歯の各噛み合わせ動作によりマウスのクリック機能を与えるというように、骨伝導マイクロフォンとカメラとを組み合わせた操作指令入力信号生成装置として構成することにより、多様な入力操作を実行することができる。このように骨伝導マイクロフォンと他のセンサとを備えた操作指令入力信号生成装置においては、歯咬音信号について他クラス識別を行う歯咬音信号処理部のほかに、他のセンサにより採取された信号の識別処理部を備え、それらの処理部により生成された信号を併せて操作指令入力信号生成部において入力信号を形成する。 For example, the motion of the head is recognized by a gesture recognition process from a moving image of the head taken by a camera and the function of a mouse pointer is given, and the click function of the mouse is given by the engagement operation of the front teeth, right teeth, and left teeth. Thus, various input operations can be executed by configuring as an operation command input signal generation device combining a bone conduction microphone and a camera. As described above, in the operation command input signal generation device provided with the bone conduction microphone and the other sensor, in addition to the tooth biting signal processing unit that performs other class identification on the tooth biting sound signal, the signal is collected by another sensor. A signal identification processing unit is provided, and an input signal is formed in the operation command input signal generation unit by combining the signals generated by these processing units.

本発明は口内音や身体動作によるトリガ信号生成装置、多値スイッチ、パソコン等の情報処理機器、携帯電話機、携帯音響機器等の操作指令入力生成装置として利用することができ、また、重度障害や全身麻酔等で手指が自由に使えない者による意志伝達手段、他者に気づかれないようにすることが必要な通信・機器操作を行う手段として利用するというように、多様な形態で利用可能である。 INDUSTRIAL APPLICABILITY The present invention can be used as an operation command input generation device for a trigger signal generation device by mouth sound or body movement, an information processing device such as a multi-value switch, a personal computer, a mobile phone, a portable audio device, etc. It can be used in various forms, such as a means of communication by those who cannot use their fingers freely due to general anesthesia, etc., and a means of communication and equipment operations that need to be made unnoticeable by others. is there.

１０骨伝導マイクロフォン
２０増幅器
３０データロッガ
３１サンプリング部
３２メモリ
４０歯咬音信号処理部
４１可変閾値処理部
４２パターン識別処理部
５０操作指令信号生成部
６０機器

DESCRIPTION OF SYMBOLS 10 Bone conduction microphone 20 Amplifier 30 Data logger 31 Sampling part 32 Memory 40 Tooth biting sound signal processing part 41 Variable threshold value processing part 42 Pattern identification processing part 50 Operation command signal generation part
60 equipment

Claims

Detecting a bone conduction sound with a bone conduction microphone attached to one pinna and generating a bone conduction sound signal;
Amplifying and sampling the bone conduction sound signal and storing it in memory;
Extracting the stored bone conduction sound signal and performing a variable threshold process to detect a tooth bite sound candidate;
Performing multi-class identification by partial space pattern identification on the detected tooth biting sound candidates;
Generating a signal corresponding to the tooth biting sound data of the multi-class identified tooth biting sound candidate as an operation command signal;
An operation command signal generation method comprising:

Differentiating the bone conduction sound transmission path from the bone conduction sound generation position to the bone conduction microphone attached to the one auricle, and distinguishing the bone conduction sound at different positions by changing the characteristics included in the bone conduction sound The operation command signal generation method according to claim 1, wherein:

The bone conduction sound collected by the bone conduction sound microphone identifies and recognizes at least the engagement operation of the front tooth, the left tooth, and the right tooth according to any one of claims 1 and 2. Operation command signal generation method.

Acquires data corresponding to the head movement collected by the sensor that detects the head movement, and generates an operation command signal by combining the signal corresponding to the head movement and the signal corresponding to the tooth biting sound data The operation command signal generation method according to claim 1, wherein the operation command signal generation method is performed.

A bone conduction microphone, an amplifying unit for amplifying a bone conduction sound signal generated by the bone conduction microphone, a data logger for sampling the amplified bone conduction sound signal and storing it as bone conduction sound data, and a memory of the data logger A variable threshold value processing unit that cuts out tooth-biting sound candidate data from stored bone conduction sound data, a subspace pattern identification processing unit that performs multi-class identification from the cut-out tooth biting sound candidate data, and multi-class identification An operation command signal generation device comprising: an operation command signal generation unit configured to generate an operation command signal according to the made tooth and bite sound candidate data.

Differentiating the bone conduction sound transmission path from the bone conduction sound generation position to the bone conduction microphone attached to the one auricle, and distinguishing the bone conduction sound at different positions by changing the characteristics included in the bone conduction sound The operation command signal generation device according to claim 5, wherein:

The partial space pattern identification processing unit is configured to identify and recognize at least an engagement operation of an anterior tooth, a left tooth, and a right tooth with respect to a bone conduction sound collected by the bone conduction sound microphone. The operation command signal generation device according to any one of 5 and 6.

A sensor for detecting the movement of the head; and a head movement recognition unit that acquires data corresponding to the head movement collected by the sensor and generates a signal corresponding to the head movement. The command signal generation unit generates an operation command signal by combining a signal corresponding to the tooth biting sound data and a signal generated by the head movement recognition unit. The operation command signal generation device according to any one of the above.