JPH08275279A

JPH08275279A - Voice pickup system

Info

Publication number: JPH08275279A
Application number: JP7075875A
Authority: JP
Inventors: Takuro Yamaguchi; 卓郎山口
Original assignee: Foster Electric Co Ltd
Current assignee: Foster Electric Co Ltd
Priority date: 1995-03-31
Filing date: 1995-03-31
Publication date: 1996-10-18
Anticipated expiration: 2020-05-11
Also published as: JP3647499B2

Abstract

PURPOSE: To realize the voice pickup system in which a voice is sent with a high articulation without picking up a surrounding noise. CONSTITUTION: The voice pickup system detecting a bone conduction sound or an air conduction sound by a pickup 1 is provided with a voice recognition means 2 capable of recognizing a sound detected by the pickup 1 and a spectrum generating means 3a using the sound data recognized by the voice recognition means 2 to generate a spectrum with a frequency component when the sound data recognized with a conventional microcophone, and also with a comparison means 3b comparing the sound spectrum detected by the pickup 1 with the generated spectrum to obtain a missing frequency component, a missing spectrum generating means 3c generating a sound of the frequency component missing in the sound detected by the pickup 1 as a supplement sound according to the result of comparison by the comparator means 3b, and a synthesis means 4 synthesizing the sound detected by the pickup 1 with the supplement sound generated by the missing spectrum generating means 3c and providing the synthesized output.

Description

Detailed Description of the Invention

【０００１】[0001]

【産業上の利用分野】本発明は音声ピックアップシステ
ムに関し、更に詳しくは、骨伝導音や気道音をピックア
ップで検出する音声ピックアップシステムに関する。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a voice pickup system, and more particularly to a voice pickup system for detecting bone conduction sounds and airway sounds by a pickup.

【０００２】[0002]

【従来の技術】騒音が存在する環境で使用するマイクロ
ホンとして、骨伝導音を検出する骨伝導音ピックアップ
が知られている。2. Description of the Related Art A bone conduction sound pickup for detecting bone conduction sound is known as a microphone used in a noisy environment.

【０００３】この骨伝導音ピックアップはユーザの顔面
や頭部に密着させておき、ユーザの発声に伴う顔面や頭
部の振動を検出するものである。また、似たようなマイ
クロホンとして、イヤホンのような形状のピックアップ
を外耳に挿入して、外耳道の気道音を検出するものも存
在している。This bone conduction sound pickup is made to be in close contact with the user's face or head and detects the vibration of the face or head accompanying the user's utterance. In addition, as a similar microphone, there is a microphone that inserts a pickup having a shape like an earphone into the outer ear to detect airway sound of the ear canal.

【０００４】以上のような骨伝導音や気道音を検出する
ピックアップは周囲の騒音を比較的拾わずに、目的とす
るユーザの音声を検出し易いという利点を有する。The pickup for detecting bone conduction sound and airway sound as described above has an advantage that it is easy to detect the voice of the intended user without relatively picking up ambient noise.

【０００５】[0005]

【発明が解決しようとする課題】しかし、通常の音声の
スペクトルが３００Ｈｚ〜３ｋＨｚであるとした場合
に、図５特性Ｂに示すように、１ｋＨｚ以上の周波数領
域で検出レベルが低下する問題を有している。However, when the spectrum of normal voice is 300 Hz to 3 kHz, there is a problem that the detection level is lowered in the frequency region of 1 kHz or more as shown in the characteristic B of FIG. are doing.

【０００６】すなわち、骨伝導音や気道音といった間接
的に音声を検出する形式のピックアップの検出音は、通
常のマイクロホンで検出した音声信号（図５特性Ａ）と
比較して、音声の低域成分が強調されて高域成分が徐々
に低下する感じになり（図５特性Ｂ）、明瞭度が低下す
る問題を有している。That is, a detection sound of a pickup that indirectly detects a sound such as a bone conduction sound or an airway sound is lower than a sound signal (characteristic A in FIG. 5) detected by an ordinary microphone. The component is emphasized and the high frequency component gradually decreases (characteristic B in FIG. 5), which causes a problem that the clarity decreases.

【０００７】図６は男性の声を実際に骨伝導音ピックア
ップで検出した場合の周波数特性を示す特性図である。
この特性図からも高域成分の低下の様子が読み取れる。
実際には、骨伝導音ピックアップを顔面若しくは頭部に
押さえつける際の圧力や、男性／女性の別などによって
高域成分の低下の度合は若干異なるが、高域成分が低下
することには変わりがない。FIG. 6 is a characteristic diagram showing frequency characteristics when a male voice is actually detected by a bone conduction sound pickup.
From this characteristic diagram, it is possible to read how the high frequency components are reduced.
Actually, the degree of lowering the high-frequency component is slightly different depending on the pressure when the bone conduction sound pickup is pressed against the face or head, and whether male / female or not, but the high-frequency component decreases. Absent.

【０００８】本発明は上記の問題点に鑑みてなされたも
ので、その目的は、周囲の騒音を拾うことなく、かつ、
明瞭度の高い状態で音声を伝達することが可能な音声ピ
ックアップシステムを提供することにある。The present invention has been made in view of the above problems, and an object of the present invention is to pick up ambient noise, and
An object of the present invention is to provide a voice pickup system capable of transmitting voice in a state of high intelligibility.

【０００９】[0009]

【課題を解決するための手段】本件出願の発明者は、従
来の音声ピックアップシステムにおいて予想される明瞭
度等の不具合を改良すべく鋭意研究を行った結果、従来
は周波数特性の点で明瞭度に問題を有していた骨伝導音
や気道音を検出するピックアップにおいても明瞭度の高
い音声を伝達できる構成を見出し、本発明を完成させた
ものである。Means for Solving the Problems The inventor of the present application has conducted earnest research to improve inconveniences and the like expected in a conventional voice pickup system, and as a result, conventionally, the intelligibility in terms of frequency characteristics has been improved. The present invention has been completed by finding out a configuration capable of transmitting a voice with high intelligibility even in a pickup for detecting bone conduction sound or airway sound, which had a problem in the above.

【００１０】従って、課題を解決する手段である本発明
は以下に説明するように構成されたものである。（１）すなわち、上記の課題を解決する第１の手段は、
骨伝導音若しくは気道音をピックアップで検出する音声
ピックアップシステムにおいて、前記ピックアップで検
出した音を認識可能な音声認識手段と、前記音声認識手
段で認識された音のデータを用いて、この認識された音
のデータを通常のマイクロホンで検出した場合の周波数
成分のスペクトルを生成するスペクトル発生手段と、こ
のスペクトル発生手段が生成したスペクトルと前記ピッ
クアップで検出した音のスペクトルとを比較して欠落し
ている周波数成分を求める比較手段と、前記比較手段で
の比較の結果から前記ピックアップで検出した音に欠落
している周波数成分の音を補完音として生成する欠落ス
ペクトル発生手段と、前記ピックアップで検出した音と
前記欠落スペクトル発生手段で生成した補完音とを合成
して出力する合成手段と、を有することを特徴とする音
声ピックアップシステムである。Therefore, the present invention, which is a means for solving the problems, is configured as described below. (1) That is, the first means for solving the above problems is
In a voice pickup system that detects a bone conduction sound or an airway sound with a pickup, this recognition is performed by using a voice recognition unit capable of recognizing the sound detected by the pickup and the sound data recognized by the voice recognition unit. A spectrum generation unit that generates a spectrum of frequency components when sound data is detected by a normal microphone is compared with the spectrum generated by this spectrum generation unit and the spectrum of the sound detected by the pickup and is missing. Comparison means for obtaining a frequency component, missing spectrum generating means for generating a sound of a frequency component missing from the sound detected by the pickup as a complementary sound from the result of comparison by the comparison means, and sound detected by the pickup And a complementary sound generated by the missing spectrum generating means are combined and output. An audio pickup system comprising: the stage, a.

【００１１】尚、このような音声ピックアップシステム
において、欠落スペクトル発生手段において生成する補
完音としては、予めシステムに音声合成用の学習をさせ
ておいて、使用者の音声に似せた音声を発生するために
必要な欠落スペクトルとすることも可能である。In such a voice pickup system, as the complementary sound generated by the missing spectrum generating means, the system is preliminarily trained for voice synthesis and a voice similar to the voice of the user is generated. It is also possible to set a missing spectrum necessary for this.

【００１２】（２）また、上記の課題を解決する第２の
手段は、骨伝導音若しくは気道音をピックアップで検出
する音声ピックアップシステムにおいて、前記ピックア
ップで検出した音を認識可能な音声認識手段と、前記音
声認識手段で認識された音のデータに対応する音を人工
音として生成する人工音生成手段と、を有することを特
徴とする音声ピックアップシステムである。(2) A second means for solving the above-mentioned problems is a voice recognition means capable of recognizing the sound detected by the pickup in a voice pickup system for detecting bone conduction sound or airway sound by the pickup. And an artificial sound generation unit that generates a sound corresponding to the sound data recognized by the voice recognition unit as an artificial sound.

【００１３】尚、このような音声ピックアップシステム
において、人工音生成手段において生成する人工音とし
ては、予めシステムに音声合成用の学習をさせておいて
使用者の音声に似せた音声を発生することも、別の一般
的な音声を発生することも可能である。In such a voice pickup system, as the artificial sound generated by the artificial sound generating means, the system should be trained for voice synthesis in advance to generate a voice similar to the voice of the user. It is also possible to generate another common voice.

【００１４】[0014]

【作用】課題を解決する第１の手段である音声ピックア
ップシステムにおいて、骨伝導音若しくは気道音をピッ
クアップで検出し、ピックアップで検出した音を音声認
識手段で認識し、音声認識手段で認識された音のデータ
を用いて、この認識された音のデータを通常のマイクロ
ホンで検出した場合の周波数成分のスペクトルを生成
し、このように生成したスペクトルとピックアップで実
際に検出した音のスペクトルとを比較手段で比較し欠落
している周波数成分を求めて、比較手段での比較の結果
からピックアップで検出した音に欠落している周波数成
分の音を欠落スペクトル発生手段で補完音として生成
し、ピックアップで検出した音と欠落スペクトル発生手
段で生成した補完音とを合成手段において合成して出力
する。In the voice pickup system which is the first means for solving the problem, the bone conduction sound or the airway sound is detected by the pickup, the sound detected by the pickup is recognized by the voice recognition means, and the sound is recognized by the voice recognition means. The sound data is used to generate a spectrum of frequency components when this recognized sound data is detected by a normal microphone, and the spectrum thus generated is compared with the sound spectrum actually detected by the pickup. Means to find the missing frequency component, and from the result of the comparison in the comparing means, the sound of the missing frequency component in the sound detected by the pickup is generated as a complementary sound by the missing spectrum generating means, and is picked up by the pickup. The detected sound and the complementary sound generated by the missing spectrum generating means are combined by the combining means and output.

【００１５】以上のような音声ピックアップシステムに
よれば、骨伝導音や気道音を検出した後に音声認識して
欠落スペクトルを補完することで、本人の音声の特徴を
損なうことなく通常の音声に近い明瞭な音声信号を生成
することができるようになる。また、周囲の騒音の影響
を受けることもない。According to the voice pickup system as described above, by detecting bone conduction sound or airway sound and then recognizing the voice and complementing the missing spectrum, the voice is close to normal voice without spoiling the characteristics of the voice of the person. It becomes possible to generate a clear audio signal. In addition, it is not affected by ambient noise.

【００１６】課題を解決する第２の手段である音声ピッ
クアップシステムにおいて、骨伝導音若しくは気道音を
ピックアップで検出し、ピックアップで検出した音を音
声認識手段で認識し、音声認識手段で認識された音に対
応する音を人工音生成手段で人工音として生成する。In the voice pickup system as the second means for solving the problem, bone conduction sound or airway sound is detected by the pickup, the sound detected by the pickup is recognized by the voice recognition means, and is recognized by the voice recognition means. A sound corresponding to the sound is generated as an artificial sound by the artificial sound generating means.

【００１７】以上のような音声ピックアップシステムに
よれば、骨伝導音や気道音を検出し、音声認識されたデ
ータにより対応する人工音声を発生することで、通常の
音声に近い明瞭な音声信号を生成することができるよう
になる。また、周囲の騒音の影響を受けることもない。According to the voice pickup system as described above, a bone conduction sound or an airway sound is detected, and an artificial voice corresponding to the voice-recognized data is generated to generate a clear voice signal close to a normal voice. Will be able to generate. In addition, it is not affected by ambient noise.

【００１８】[0018]

【実施例】図面を用いて本発明の一実施例について詳細
に説明する。＜音声ピックアップシステムの構成（１）＞まず、本発
明の一実施例である音声ピックアップシステムの構成に
ついて図１を用いて説明を行なう。An embodiment of the present invention will be described in detail with reference to the drawings. <Structure (1) of Voice Pickup System> First, the structure of a voice pickup system according to an embodiment of the present invention will be described with reference to FIG.

【００１９】ピックアップ１は骨伝導音若しくは気道音
などを検出する検出手段であり、骨伝導音を検出するも
のとしては骨伝導マイクロホン（骨伝導音ピックアッ
プ）、気道音を検出するものとしては気道音マイクロホ
ンが該当する。The pickup 1 is a detecting means for detecting bone conduction sound or airway sound. A bone conduction microphone (bone conduction sound pickup) is used for detecting bone conduction sound, and an airway sound is used for detecting airway sound. A microphone is applicable.

【００２０】音声認識回路２は周知の音声若しくは音節
を認識する回路であり、ユーザの個々の特徴部分を学習
するものであっても、また、このような学習を行わない
ものであっても構わない。The voice recognition circuit 2 is a well-known circuit for recognizing voices or syllables, and may or may not learn individual characteristic parts of the user. Absent.

【００２１】補完音生成回路３は前記音声認識回路２で
認識された音（音声，音節）のデータを用いて、ピック
アップ１で検出された音に欠落している周波数成分の音
を補完音として生成する一種の人工音発生回路である。
また、この補完音生成回路３は、スペクトル発生回路３
ａと、比較回路３ｂと、欠落スペクトル発生回路３ｃと
から構成されている。The complementary sound generation circuit 3 uses the data of the sound (voice, syllable) recognized by the voice recognition circuit 2 as a complementary sound for the frequency component missing in the sound detected by the pickup 1. It is a kind of artificial sound generation circuit that generates.
In addition, the complementary sound generation circuit 3 includes a spectrum generation circuit 3
a, a comparison circuit 3b, and a missing spectrum generation circuit 3c.

【００２２】スペクトル発生回路３ａは前記音声認識回
路２で認識された音のデータを用いて、認識された音が
通常のマイクロホンで検出された場合のスペクトルを発
生する。比較回路３ｂは前記スペクトル発生回路３ａが
発生したスペクトルと、前記ピックアップで検出した音
のスペクトルとを比較する。欠落スペクトル発生回路３
ｃは、比較回路３ｂの比較結果に応じて、スペクトルの
差分に応じた部分のスペクトル（欠落スペクトル）を補
完音として発生する。The spectrum generation circuit 3a uses the sound data recognized by the voice recognition circuit 2 to generate a spectrum when the recognized sound is detected by a normal microphone. The comparison circuit 3b compares the spectrum generated by the spectrum generation circuit 3a with the spectrum of the sound detected by the pickup. Missing spectrum generation circuit 3
In accordance with the comparison result of the comparison circuit 3b, c generates a spectrum (missing spectrum) of a portion corresponding to the difference in spectrum as a complementary sound.

【００２３】合成回路４はピックアップ１で検出された
音（骨伝導音，気道音）と補完音生成回路３で生成され
た補完音とを合成して出力する出力手段である。＜音声ピックアップシステムの動作（１）＞本発明の一
実施例である音声ピックアップシステムの動作は、大き
く分けて以下に示したような，，，，，の
各ステップにより構成されている。このステップを順を
追って説明する。The synthesizing circuit 4 is an output means for synthesizing the sound (bone conduction sound, airway sound) detected by the pickup 1 and the complementary sound generated by the complementary sound generating circuit 3 and outputting the synthesized sound. <Operation of Voice Pickup System (1)> The operation of the voice pickup system according to the embodiment of the present invention is roughly divided into the following steps ,. This step will be described step by step.

【００２４】音（骨伝導音，気道音）の検出：ピック
アップ１を用いて骨伝導音若しくは気道音を検出する。音（骨伝導音，気道音）の認識：ピックアップ１の検
出音を音声認識回路２で認識する。この場合、音声認識
回路２の認識方法により、単音での認識か音節での認識
かが異なるが、いずれであっても構わない。また、ユー
ザの音声を学習して認識するものであっても、また、ユ
ーザを特定した学習を行わないで認識するものであって
も構わない。Detection of sound (bone conduction sound, airway sound): Bone conduction sound or airway sound is detected using the pickup 1. Recognition of sound (bone conduction sound, airway sound): The sound detected by the pickup 1 is recognized by the voice recognition circuit 2. In this case, depending on the recognition method of the voice recognition circuit 2, the recognition by a single sound or the recognition by a syllable differs, but either one may be used. Further, the user's voice may be learned and recognized, or the user's voice may be recognized without learning.

【００２５】認識音のスペクトル発生：認識音のデー
タを用いて、スペクトル発生回路３ａが通常のマイクロ
ホンで検出した場合に得られるであろうスペクトル（以
下、これを標準音のスペクトルと言う）を発生する。こ
のために、スペクトル発生回路３ａは認識音（単音，音
節）に従ったスペクトルを有しているものとし、認識音
に従って対応するスペクトルが呼び出されるようになっ
ている。Generation of spectrum of recognized sound: Using the data of the recognized sound, a spectrum that will be obtained when the spectrum generation circuit 3a detects it with a normal microphone (hereinafter referred to as a spectrum of a standard sound) is generated. To do. For this reason, the spectrum generation circuit 3a is assumed to have a spectrum according to the recognized sound (single note, syllable), and the corresponding spectrum is called according to the recognized sound.

【００２６】この場合のスペクトルとしては、ユーザ毎
のスペクトルを有しても良いし、標準的なスペクトルを
有しても良い。また、標準的なスペクトルを有するとし
た場合には、成人男性，成人女性，子供等のように幾つ
かのスペクトルを有するようにしても構わない。The spectrum in this case may be a spectrum for each user or a standard spectrum. Moreover, when it has a standard spectrum, it may have several spectra such as an adult male, an adult female, and a child.

【００２７】標準音と検出音とのスペクトル比較：比
較回路３ｂにおいて、標準音のスペクトルと検出音との
スペクトルとを比較する。Spectrum comparison between standard sound and detected sound: The comparison circuit 3b compares the spectrum of the standard sound with the spectrum of the detected sound.

【００２８】例えば、図５を用いて説明すると、認識さ
れた検出音毎に、標準音のスペクトルＡと検出音のスペ
クトルＢとを比較して、検出音の欠落スペクトルＣ（＝
Ａ−Ｂ）を算出する。For example, referring to FIG. 5, the spectrum A of the standard sound and the spectrum B of the detected sound are compared for each recognized detected sound, and the missing spectrum C (=
Calculate AB).

【００２９】補完音（欠落スペクトル）発生：比較回
路３ｂで得られた欠落スペクトルのデータに応じて欠落
スペクトル発生回路３ｃが欠落スペクトルの信号を発生
する。この場合も、スペクトル発生回路３ａと同じ様
に、発生する欠落スペクトルとして、ユーザ毎のスペク
トルを有しても良いし、標準的なスペクトルを有しても
良い。また、標準的なスペクトルを有するとした場合に
は、成人男性，成人女性，子供等のように幾つかのスペ
クトルを有するようにしても構わない。Generation of complementary sound (missing spectrum): The missing spectrum generating circuit 3c generates a missing spectrum signal in accordance with the missing spectrum data obtained by the comparison circuit 3b. In this case as well, similar to the spectrum generating circuit 3a, the missing spectrum to be generated may have a spectrum for each user or may have a standard spectrum. Moreover, when it has a standard spectrum, it may have several spectra such as an adult male, an adult female, and a child.

【００３０】検出音と補完音との合成：合成回路４に
おいて、欠落スペクトルと検出音のスペクトルとを合成
する。この合成処理により、検出音の欠落スペクトルが
補完音として加算され、標準スペクトルと同等なスペク
トルの合成音が得られる。従って、標準的なマイクロホ
ンで集音したものと同等な音声信号が得られる。Synthesis of detected sound and complementary sound: The synthesis circuit 4 synthesizes the missing spectrum and the spectrum of the detected sound. By this synthesizing process, the missing spectrum of the detected sound is added as a complementary sound, and a synthetic sound having a spectrum equivalent to the standard spectrum is obtained. Therefore, an audio signal equivalent to that picked up by a standard microphone can be obtained.

【００３１】尚、補完音がユーザ本人のものであれば合
成された結果得られる合成音も本人のものとなるが、補
完音が標準的なデータに基づくものであったとしても、
補完音の部分は高域の部分のみであるので違和感は極め
て少ない。If the complementary sound is that of the user himself, the synthesized sound obtained as a result of synthesis is also that of the user himself. Even if the complementary sound is based on standard data,
Since the portion of the complementary sound is only the high frequency portion, there is very little discomfort.

【００３２】尚、ユーザの声に応じた欠落スペクトルを
発生したい場合には、図２に示すように、個人データメ
モリ３ｄを備えておいて、ユーザの音声を予め収録（サ
ンプリング）しておいて特徴部分のデータを格納してお
くことが可能である。When it is desired to generate a missing spectrum corresponding to the voice of the user, as shown in FIG. 2, the personal data memory 3d is provided and the voice of the user is recorded (sampled) in advance. It is possible to store the data of the characteristic part.

【００３３】また、ユーザの声の質を判定して、欠落ス
ペクトル発生用に複数備えた標準的なスペクトルの中か
ら近いものを自動的に選択するようなことも可能であ
る。＜構成（１）により得られる効果＞以上のような音声ピ
ックアップシステムによれば、骨伝導音や気道音を検出
した後に音声認識して欠落スペクトルを補完すること
で、本人の音声の特徴を損なうことなく通常の音声に近
い明瞭な音声信号を生成することができるようになる。
また、周囲の騒音の影響を受けることもない。It is also possible to judge the quality of the user's voice and automatically select a close one from a plurality of standard spectra provided for generating a missing spectrum. <Effects Obtained by Configuration (1)> According to the voice pickup system as described above, the feature of the voice of the person is impaired by performing voice recognition and complementing the missing spectrum after detecting bone conduction sound or airway sound. It becomes possible to generate a clear voice signal close to a normal voice without the need.
In addition, it is not affected by ambient noise.

【００３４】また、歯噛音などのようにピックアップで
検出されるものの無意味な音については、音声認識の処
理で意味をなさないので補完音が生成されない。従っ
て、ピックアップで検出された低域成分のみが出力され
るため、悪影響は少ない。Further, regarding a meaningless sound such as a tooth-cluttering sound which is detected by the pickup, a complementary sound is not generated because it does not make sense in the voice recognition process. Therefore, since only the low frequency component detected by the pickup is output, the adverse effect is small.

【００３５】＜音声ピックアップシステムの構成（２）
＞まず、本発明の第二の実施例である音声ピックアップ
システムの構成について図３を用いて説明を行なう。<Structure of voice pickup system (2)
First, the configuration of the voice pickup system according to the second embodiment of the present invention will be described with reference to FIG.

【００３６】ピックアップ１は骨伝導音若しくは気道音
などを検出する検出手段であり、骨伝導音を検出するも
のとしては骨伝導マイクロホン、気道音を検出するもの
としては気道音マイクロホンが該当する。The pickup 1 is a detecting means for detecting bone conduction sound or airway sound, and a bone conduction microphone is used for detecting bone conduction sound, and an airway sound microphone is used for detecting airway sound.

【００３７】音声認識回路２は周知の音声若しくは音節
を認識する回路であり、ユーザの個々の特徴部分を学習
するものであっても、また、このような学習を行わない
ものであっても構わない。The voice recognition circuit 2 is a known circuit for recognizing voices or syllables, and may or may not learn individual characteristic parts of the user. Absent.

【００３８】人工音発生回路５は前記音声認識回路２で
認識された音（音声，音節）のデータを用いて、ピック
アップ１で検出された音に対応した人工音を生成するも
のである。The artificial sound generating circuit 5 uses the data of the sound (voice, syllable) recognized by the voice recognition circuit 2 to generate an artificial sound corresponding to the sound detected by the pickup 1.

【００３９】＜音声ピックアップシステムの動作（２）
＞本発明の一実施例である音声ピックアップシステムの
動作は、大きく分けて以下に示したような，，の
各ステップにより構成されている。このステップを順を
追って説明する。<Operation of the voice pickup system (2)
> The operation of the voice pickup system according to the embodiment of the present invention is roughly divided into the following steps. This step will be described step by step.

【００４０】音（骨伝導音，気道音）の検出：ピック
アップ１を用いて骨伝導音若しくは気道音を検出する。音（骨伝導音，気道音）の認識：ピックアップ１の検
出音を音声認識回路２で認識する。この場合、音声認識
回路２の認識方法により、単音での認識か音節での認識
かが異なるが、いずれであっても構わない。また、ユー
ザの音声を学習して認識するものであっても、また、ユ
ーザを特定した学習を行わないで認識するものであって
も構わない。Detection of sound (bone conduction sound, airway sound): The pickup 1 is used to detect bone conduction sound or airway sound. Recognition of sound (bone conduction sound, airway sound): The sound detected by the pickup 1 is recognized by the voice recognition circuit 2. In this case, depending on the recognition method of the voice recognition circuit 2, the recognition by a single sound or the recognition by a syllable differs, but either one may be used. Further, the user's voice may be learned and recognized, or the user's voice may be recognized without learning.

【００４１】認識音のスペクトル発生：認識音のデー
タを用いて、人工音発生回路５が通常のマイクロホンで
検出した場合に得られるであろう標準スペクトルを発生
する。このために、人工音発生回路５は認識音（単音，
音節）に従ったスペクトルを有しているものとし、認識
音に従って対応するスペクトルが呼び出されるようにな
っている。Generation of spectrum of recognized sound: Data of the recognized sound is used to generate a standard spectrum that would be obtained when the artificial sound generation circuit 5 detects the sound with a normal microphone. For this reason, the artificial sound generation circuit 5 causes the recognition sound (single sound,
It has a spectrum according to a syllable), and a corresponding spectrum is called according to a recognized sound.

【００４２】この場合のスペクトルとしては、ユーザ毎
のスペクトルを有しても良いし、標準的なスペクトルを
有しても良い。また、標準的なスペクトルを有するとし
た場合には、成人男性，成人女性，子供等のように幾つ
かのスペクトルを有するようにして切り替えて使用する
構成でも構わない。従って、標準的なマイクロホンで集
音したものと同等な音声信号が得られる。The spectrum in this case may be a spectrum for each user or may be a standard spectrum. Further, in the case of having a standard spectrum, it may be configured to have several spectra such as an adult male, an adult female, a child, etc., and switch and use them. Therefore, an audio signal equivalent to that picked up by a standard microphone can be obtained.

【００４３】尚、ユーザの声に応じた欠落スペクトルを
発生したい場合には、図４に示すように、個人データメ
モリ６を備えておいて、ユーザの音声を予め収録（サン
プリング）しておいて特徴部分のデータを格納しておく
ことが可能である。When it is desired to generate a missing spectrum corresponding to the voice of the user, the personal data memory 6 is provided as shown in FIG. 4, and the voice of the user is recorded (sampled) in advance. It is possible to store the data of the characteristic part.

【００４４】また、ユーザの声の質を判定して、欠落ス
ペクトル発生用に複数備えた標準的なスペクトルの中か
ら近いものを自動的に選択するようなことも可能であ
る。＜構成（２）により得られる効果＞以上のような音声ピ
ックアップシステムによれば、骨伝導音や気道音を検出
し、音声認識されたデータにより対応する人工音声を発
生することで、通常の音声に近い明瞭な音声信号を生成
することができるようになる。また、周囲の騒音の影響
を受けることもない。It is also possible to judge the quality of the user's voice and automatically select a close one from a plurality of standard spectra provided for generating a missing spectrum. <Effects Obtained by Configuration (2)> According to the voice pickup system as described above, a normal voice is generated by detecting bone conduction sound and airway sound and generating an artificial voice corresponding to the voice-recognized data. It becomes possible to generate a clear audio signal close to. In addition, it is not affected by ambient noise.

【００４５】また、歯噛音などのようにピックアップで
検出されるものの無意味な音については、音声認識の処
理で意味をなさないので人工音が生成されない。従っ
て、出力されないため悪影響は少ない。Further, as for a meaningless sound such as a tooth biting sound which is detected by the pickup, since it does not make sense in the voice recognition process, an artificial sound is not generated. Therefore, since it is not output, the adverse effect is small.

【００４６】＜その他の好ましい例＞以上のような音声
ピックアップシステムは各種の応用が可能であるが、騒
音環境下で音声を伝達する各種システムに組み込んで使
用することが可能である。例えば、携帯電話等の機器に
組み込むことで明瞭な送話が可能になる。そして、周囲
の音を相手に聞かれることが無いという利点も有してい
る。<Other Preferred Examples> Although the voice pickup system as described above can be applied in various ways, it can be used by incorporating it into various systems for transmitting voice in a noisy environment. For example, by incorporating it in a device such as a mobile phone, clear transmission becomes possible. It also has the advantage that the surrounding sound is not heard by the other party.

【００４７】[0047]

【発明の効果】以上のような音声ピックアップシステム
によれば、骨伝導音や気道音を検出した後に音声認識し
て欠落スペクトルを補完することで、本人の音声の特徴
を損なうことなく通常の音声に近い明瞭な音声信号を生
成することができるようになる。また、周囲の騒音の影
響を受けることもない。従って、周囲の騒音を拾うこと
なく、かつ、明瞭度の高い状態で音声を伝達することが
可能な音声ピックアップシステムを実現できるようにな
る。As described above, according to the voice pickup system as described above, the voice recognition is performed after the bone conduction sound or the airway sound is detected and the missing spectrum is complemented, so that the normal voice can be obtained without spoiling the characteristics of the voice of the person. It becomes possible to generate a clear audio signal close to. In addition, it is not affected by ambient noise. Therefore, it becomes possible to realize a voice pickup system capable of transmitting voice in a state of high intelligibility without picking up ambient noise.

【００４８】また、骨伝導音や気道音を検出し、音声認
識されたデータにより対応する人工音声を発生すること
で、通常の音声に近い明瞭な音声信号を生成することが
できるようになる。また、周囲の騒音の影響を受けるこ
ともない。従って、周囲の騒音を拾うことなく、かつ、
明瞭度の高い状態で音声を伝達することが可能な音声ピ
ックアップシステムを実現できるようになる。Further, by detecting bone conduction sound or airway sound and generating a corresponding artificial voice from the voice-recognized data, a clear voice signal close to a normal voice can be generated. In addition, it is not affected by ambient noise. Therefore, without picking up ambient noise, and
It becomes possible to realize a voice pickup system capable of transmitting voice in a state of high clarity.

[Brief description of drawings]

【図１】本発明の一実施例の音声ピックアップシステム
の構成を示す構成図である。FIG. 1 is a configuration diagram showing a configuration of an audio pickup system according to an embodiment of the present invention.

【図２】本発明の一実施例の音声ピックアップシステム
の変形例の構成を示す構成図である。FIG. 2 is a configuration diagram showing a configuration of a modified example of the voice pickup system according to the exemplary embodiment of the present invention.

【図３】本発明の第二の実施例の音声ピックアップシス
テムの構成を示す構成図である。FIG. 3 is a configuration diagram showing a configuration of a voice pickup system according to a second embodiment of the present invention.

【図４】本発明の第二の実施例の音声ピックアップシス
テムの変形例の構成を示す構成図である。FIG. 4 is a configuration diagram showing a configuration of a modified example of the audio pickup system according to the second embodiment of the present invention.

【図５】音声のスペクトルを模式的に示す特性図であ
る。FIG. 5 is a characteristic diagram schematically showing a spectrum of voice.

【図６】音声のスペクトルの実測結果を示す特性図であ
る。FIG. 6 is a characteristic diagram showing an actual measurement result of a voice spectrum.

[Explanation of symbols]

１ピックアップ２音声認識回路３補完音生成回路３ａスペクトル発生回路３ｂ比較回路３ｃ欠落スペクトル発生回路４合成回路 1 Pickup 2 Voice recognition circuit 3 Complementary sound generation circuit 3a Spectrum generation circuit 3b Comparison circuit 3c Missing spectrum generation circuit 4 Synthesis circuit

Claims

[Claims]

1. A voice pickup system for detecting a bone conduction sound or an airway sound by a pickup, using voice recognition means capable of recognizing a sound detected by the pickup, and sound data recognized by the voice recognition means. , Comparing the spectrum generated by this spectrum generating means with the spectrum of the sound detected by the pickup, and a spectrum generating means for generating a spectrum of frequency components when the recognized sound data is detected by an ordinary microphone. Comparing means for obtaining a missing frequency component, a missing spectrum generating means for producing a sound of a frequency component missing from the sound detected by the pickup as a complementary sound from the result of the comparison by the comparing means, and Synthesizes the sound detected by the pickup and the complementary sound generated by the missing spectrum generating means. Sound pickup system characterized by having a synthesizing means for outputting Te.

2. A voice pickup system for detecting a bone conduction sound or an airway sound by a pickup, which corresponds to voice recognition means capable of recognizing the sound detected by the pickup and sound data recognized by the voice recognition means. An artificial sound generating means for generating a sound as an artificial sound.