JP2004165775A

JP2004165775A - Sound input output apparatus

Info

Publication number: JP2004165775A
Application number: JP2002326511A
Authority: JP
Inventors: Seiichi Furuta; 誠一古田; Ichiro Akahori; 一郎赤堀
Original assignee: Denso Corp
Current assignee: Denso Corp
Priority date: 2002-11-11
Filing date: 2002-11-11
Publication date: 2004-06-10
Anticipated expiration: 2022-11-11
Also published as: JP4161685B2

Abstract

<P>PROBLEM TO BE SOLVED: To provide a sound input output apparatus in a simple configuration, capable of making a call with high quality. <P>SOLUTION: Microphones A1, B2 pick up voice of a talker and respectively generate a voice signal. The microphones A1, B2 are placed at an equal distance from a speaker 3 and transduce the voice outputted from the speaker 3 into the voice signal of the same level. A signal subtractor 4 algebraically subtracts the level of the voice signal picked up by the microphone B2 from the level of the voice signal picked up by the microphone A1 to generate a transmission speech signal. Thus, the voice outputted from the speaker 3 and intruded in the microphones A1, B2 can be eliminated from the transmission signal. Thus, production of echo and howling can be prevented and a call with high quality can be attained in a simple configuration. <P>COPYRIGHT: (C)2004,JPO

Description

【０００１】
【発明の属する技術分野】
本発明は、音声入出力装置に関する。
【０００２】
【従来の技術】
近年、通信技術の発達により、様々な通話形態によって通話を行う装置が知られている。
【０００３】
例えば、特許文献１では、三人の話者が同時に通話することが可能な携帯端末機が示されている。この携帯端末機は、ＰＤＣ方式とＰＨＳ方式の２つの通信方式によって通信が可能であり、各通信方式毎にエコーキャンセラが設けられている。三人が同時に通話を行う場合、話者Ａは話者Ｂと話者Ｃに対して、異なる通信方式を利用して回線を接続し、通話を行う。話者Ｂと話者Ｃとの間の通話は話者Ａの携帯端末機を介して行われるが、異なる通信方式を利用して通話を行っているため、各通信方式毎に設けられたエコーキャンセラによって話者Ｂと話者Ｃの通話音声が遮断されることなく通話が可能である。
【０００４】
【特許文献１】
特開２０００−１９６７３６号公報
【０００５】
【発明が解決しようとする課題】
一方、このような通話形態の多様化にともない、通話装置の構成も複雑化してきている。特に、通話装置の一機能を担う音声入出力装置はその代表例であり、ハウリング防止機能やエコーキャンセル機能等の搭載により、複雑化の程度も著しい。しかしながら、通話装置の使用に際して課される制約条件によっては、複雑な構成の音声入出力装置は使用できない場合がある。かといって、音声入出力装置の有する機能を省くことによって簡略化を行うと、通話品質が劣化して通話自体が困難となる。
【０００６】
本発明は、前記の問題点を鑑み、簡易な構成で高品質な通話が可能な音声入出力装置の提供を目的とする。
【０００７】
【課題を解決するための手段】
上記目的を達成するために、請求項１に記載の音声入出力装置は、話者の音声を入力して音声信号を生成する複数の音声入力手段の各々が、通話相手方の音声を出力する音声出力手段から等距離に配置され、複数の音声入力手段の各々が生成した各音声信号のレベルの偏差を算出し、これに基づいて送話信号を生成することで、話者の音声とともに音声入力手段に混入した、音声出力手段から出力された音声を、送話信号から除去する処理手段と、処理手段が生成した送話信号を通話相手方へ送信し、通話相手方から送信された受話信号を受信して音声出力手段へ出力する通信手段とを備えることを特徴とする。
【０００８】
このように、請求項１の音声入出力装置では、話者の音声は通話相手方の音声を出力する音声出力手段から等距離に配置された、複数の音声入力手段から入力される。このとき、複数の音声入力手段は、音声出力手段から等距離に配置されているので、音声出力手段からの音声が各音声入力手段に入力されたとき、等しいレベルの音声信号となる。このため、処理手段が、各音声入力手段が生成した各音声信号のレベルの偏差を算出することで、音声出力手段からの音声に基づく音声信号成分が除去され、話者の音声に基づく送話信号が生成される。このようにして生成された送話信号には、エコーやハウリングの原因となる音声出力手段の出力した信号成分が含まれていないため、送話信号が通信手段によって通話相手方へ送信されたとき、通話相手方はクリアな音声にて話者の音声を聞くことができる。
【０００９】
上述したように、請求項１の音声入出力装置によれば、エコーやハウリングの発生を防止することができ、簡易な構成で高品質な通話が可能となるのである。
【００１０】
請求項２に記載のように、音声入力手段は２つ設けられ、音声入力手段の各々および音声出力手段は一直線上に配置されることが望ましい。これにより、エコーやハウリングの発生を防止しつつ、本音声入出力装置を小型化することができ、設置が容易となる。
【００１１】
また、請求項３に記載の音声入出力装置は、話者の音声を入力して音声信号を生成する複数の音声入力手段と、通話相手方の音声を出力する音声出力手段と、音声出力手段から複数の音声入力手段に入力される音声による音声信号のレベルが等しくなるように、複数の音声入力手段が生成した音声信号の少なくとも一方を加工する加工手段と、加工手段による加工後に各音声信号のレベルの偏差を算出し、これに基づいて送話信号を生成することで、話者の音声とともに音声入力手段に混入した、音声出力手段から出力された音声を、送話信号から除去する処理手段と、処理手段が生成した送話信号を通話相手方へ送信し、通話相手方から送信された受話信号を受信して音声出力手段へ出力する通信手段とを備えることを特徴とする。
【００１２】
このように、請求項３の音声入出力装置では、話者の音声は複数の音声入力手段から入力されるが、各音声入力手段が音声出力手段から等距離に配置されていない場合には、各音声入力手段は音声出力手段からの音声が入力されたとき、等しいレベルの音声信号を生成することができない。すなわち、各音声入力手段が生成した音声信号をそのまま利用するだけでは、処理手段は音声出力手段からの音声に基づく音声信号成分を除去することができない。また、各音声入力手段が音声出力手段から等距離に配置されていても、各音声入力手段の有するレベル特性や位相特性のばらつきが大きい場合には、各音声入力手段は音声出力手段からの音声が入力されたとき、等しいレベルの音声信号を生成することができず、処理手段は音声出力手段からの音声に基づく音声信号成分を十分に除去することはできない。さらには、例えば車室内の音響特性によっても、処理手段は音声出力手段からの音声に基づく音声信号成分を除去することができない場合がある。
【００１３】
そこで、請求項３の音声入出力装置では、音声出力手段から各音声入力手段に入力される音声による音声信号のレベルが等しくなるように、各音声入力手段が生成した音声信号の少なくとも一方を加工手段によって加工し、加工後の各音声信号のレベルの偏差を算出して送話信号を生成する。これにより、各音声入力手段の配置、各音声入力手段の有するレベル特性や位相特性のばらつき、および車室内の音響特性によらず、音声出力手段からの音声に基づく音声信号成分を除去して、話者の音声に基づく送話信号を生成することが可能となる。
【００１４】
請求項４に記載のように、加工手段は、音声信号のレベルを調整する機能を有することが望ましい。各音声入力手段の生成した音声信号のレベルを増幅して調整することにより、各音声入力手段の配置や、各音声入力手段の有するレベル特性のばらつき、および車室内の音響特性による音声信号のレベルの変動を容易に補正することが可能となる。
【００１５】
請求項５に記載のように、加工手段は、音声信号の位相を調整する機能を有することが望ましい。これにより、各音声入力手段の配置、各音声入力手段の有する位相特性のばらつき、および車室内の音響特性による音声信号の位相の変動まで考慮した、高精度な補正が可能となる。
【００１６】
請求項６に記載のように、処理手段が生成した送話信号を検出する検出手段と、検出手段の検出結果に基づいて、加工手段の動作を制御する制御手段とを備えることが望ましい。生成された送話信号のレベルを検出手段によって検出し、その検出結果に基づいて制御手段が加工手段の動作を制御することで、より高精度な音声信号の補正が可能となる。
【００１７】
請求項７に記載のように、音声入出力装置は車両に搭載されて動作するものであり、音声入出力装置は、車両における車室の幅を車両の進行方向と平行に２等分する２等分線上に配置されることが望ましい。
【００１８】
本音声入出力装置は、複数の音声入力手段が生成した各音声信号のレベルの偏差を算出することで、音声出力手段からの音声に基づく音声信号成分を除去する。しかしながら、話者が各音声入力手段から等距離に位置する場所から音声を発すると、話者の音声から各音声入力手段が生成する音声信号の各々のレベルが全て等しくなり、処理手段が各音声信号のレベルの偏差を算出する際に、話者の音声に基づく音声信号成分が除去されてしまう。本音声入出力装置を車両に搭載して動作させる際には、車両における車室の幅を車両の進行方向と平行に２等分する２等分線上に配置することで、話者が車内のどのシートに座って発話しても、話者から各音声入力手段までの距離が等しくならず、話者の音声に基づく音声信号成分が除去されることなく通話が可能となるのである。
【００１９】
請求項８に記載のように、音声入出力装置は、車室内のルームライト、ルームミラー、マップランプ、もしくはオーバーヘッドモジュールのいずれかの車室の天井に取り付けられた機器に配置されることが望ましい。これにより、話者の音声に基づく音声信号成分が除去されることなく通話が可能であり、なおかつ、車室内の空間を有効に活用することができるのである。
【００２０】
【発明の実施の形態】
（第１実施形態）
図１は、本発明の第１実施形態である音声入出力装置の全体構成を示すブロック図である。
【００２１】
音声入力手段であるマイクＡ１およびマイクＢ２は、例えば小型のマイクロホンであり、話者の音声を入力して音声信号を生成する。音声出力手段であるスピーカ３は、例えば小型のスピーカであり、通話相手方の音声を出力する。なお、本実施形態におけるマイクＡ１とマイクＢ２とは、いずれもスピーカ３から距離ｄだけ離れて配置され、かつ、マイクＡ１とマイクＢ２とが、スピーカ３を間に挟んで一直線となるように配置されている。これにより、エコーやハウリングの発生を防止しつつ、本音声入出力装置を小型化することができ、設置が容易となるためである。
【００２２】
なお、本音声入出力装置は、車両のルームライトの上方に設置される。これにより、話者がどのシートに座っていても、話者と各マイクとの距離が等しくならず、話者の音声に基づく音声信号成分が除去されることなく通話が可能となるのである（詳細については後述する）。さらには、車室内の空間を有効に活用することもできる。本音声入出力装置の設置場所としては、話者と各マイクとの間の距離が等しくならないような場所、すなわち、車室の幅を車両の進行方向と平行に２等分する２等分線上に設置するのであれば、何処に設置してもよい。
【００２３】
処理手段である信号減算器４は、例えばオペアンプから構成され、マイクＡ１が生成した音声信号のレベルから、マイクＢ２が生成した音声信号のレベルを代数的に減算し、２つの音声信号におけるレベルの偏差を算出して送話信号を生成する。送話信号の生成に関しては、マイクＢ２が生成した音声信号のレベルからマイクＡ１が生成した音声信号のレベルを代数的に減算し、２つの音声信号におけるレベルの偏差を算出して送話信号を生成してもよい。また、音声信号のレベルの減算に関しては、例えばＡ／Ｄ変換器によってマイクＡ１およびマイクＢ２が生成した音声信号のレベルを一旦ディジタル化し、ディジタル減算回路によって減算を行うこととしても良い。また、信号減算器４によって、送信すべき話者の音声信号も減算され、その偏差のレベルが小さくなるため、信号減算器４の前段もしくは後段に増幅器を接続することが好ましい。
【００２４】
通信手段である通信装置５は、例えば車載用無線機であり、信号減算器４が生成した送話信号を通話相手方へ送信する。また、通話相手方から送信された受話信号を受信してスピーカ３へ出力する。通話相手方との通信に関しては、例えばナビゲーション装置に搭載された通信機能を利用することとしても良いし、携帯電話の通信機能を利用することとしても良い。
【００２５】
次に、第１実施形態の音声入出力装置が、送話信号を生成する処理を順に説明する。
【００２６】
話者が発した音声は、マイクＡ１およびマイクＢ２から入力され、それぞれ音声信号が生成される。次に、信号減算器４は、マイクＡ１が生成した音声信号のレベルから、マイクＢ２が生成した音声信号のレベルを代数的に減算して、２つの音声信号のレベルの偏差を算出し、これを送話信号として生成する。本音声入出力装置は、車両のルームライトの上方に設置されていることから、話者が車室内のどのシートに座っていても、話者と各マイクとの間の距離は等しくならない。そのため、各マイクが生成する音声信号の話者の音声に基づく音声信号成分のレベルも等しくならず、信号減算器４が２つの音声信号のレベルを代数的に減算しても、話者の音声に基づく音声信号成分は除去されない。一方、スピーカ３と各マイクとは等距離に配置されているため、各マイクが生成する音声信号のスピーカ３から出力された音声に基づく音声信号成分のレベルは概ね等しくなり、信号減算器４が２つの音声信号のレベルを代数的に減算することによって、スピーカ３から出力された音声に基づく音声信号成分が除去されるのである。
【００２７】
このように、本実施形態の音声入出力装置では、スピーカ３から等距離に配置されたマイクＡ１およびマイクＢ２が生成した各音声信号のレベルを代数的に減算し、２つの音声信号のレベルの偏差を送話信号として生成することにより、マイクＡ１およびマイクＢ２に混入した、スピーカ３から出力された音声を、送話信号から除去することができる。これにより、エコーやハウリングの発生を防止することができ、簡易な構成で高品質な通話が可能となるのである。
（第２実施形態）
図２は、本発明の第２実施形態である音声入出力装置の全体構成を示すブロック図である。図２に示すように、本実施形態では、上述の第１実施形態における音声入出力装置において、増幅器Ａ６および増幅器Ｂ７、レベル検出器８、リモコン９、スイッチング回路１０、ＥＣＵ１１を新たに設けた点が第１実施形態と異なる。
【００２８】
加工手段である増幅器Ａ６および増幅器Ｂ７は、増幅率を変更することが可能な可変増幅器であり、マイクＡ１およびマイクＢ２が生成した各音声信号のレベルを、設定された増幅率に従ってそれぞれ増幅する。各増幅器の増幅率は、ＥＣＵ１１によって変更される。音声信号のレベルの増幅に関しては、例えばＡ／Ｄ変換器によって一旦ディジタル信号に変換した後に増幅することとしても良い。
【００２９】
検出手段であるレベル検出器８は、例えば信号計測器であり、信号減算器４が生成した送話信号のレベルを検出してＥＣＵ１１へ出力する。送話信号のレベルの検出に関しては、例えばＡ／Ｄ変換器によってディジタル信号に変換して検出することとしても良い。
【００３０】
リモコン９は、例えばテンキーと機能キーとを備えた多機能リモコンであり、ＥＣＵ１１に対して増幅器Ａ６と増幅器Ｂ７の増幅率の調整を行うよう指示する。増幅率の調整の指示に関しては、コンピュータから通信ネットワークを介して指示することとしてもよい。また、各増幅器に調整スイッチを設け、ユーザーもしくはサービスマンが手動で調整することとしてもよい。
【００３１】
スイッチング回路１０は、例えばフリップフロップ等から構成され、ＥＣＵ１１の指示に従ってスピーカ３の接続先を通信装置５側もしくはＥＣＵ１１側へ切り換える。スピーカ３の接続先の切り換えに関しては、例えば無接点リレーを用いて行うこととしてもよい。
【００３２】
ＥＣＵ１１は、リモコン９から増幅器Ａ６と増幅器Ｂ７の増幅率の調整を行うよう指示されると、スイッチング回路１０に対してスピーカ３の接続先をＥＣＵ１１側へ切り換えるよう指示する。そして、テスト信号をスピーカ３へ出力し、レベル検出器８が検出した送話信号のレベルが所定値以下となるように、増幅器Ａ６と増幅器Ｂ７の増幅率を変更する。これらの動作は、車両に搭載されたカーナビゲーション装置や通信装置等のＣＰＵに行わせることとしてもよい。
【００３３】
なお、本実施形態のマイクＡ１はスピーカ３から距離ｄ１だけ離れて配置されており、マイクＢ２はスピーカ３から距離ｄ２だけ離れて配置されている。
【００３４】
その他の構成・動作については、前述の第１実施形態と同様なので、説明を省略する。
【００３５】
図３は、第２実施形態の音声入出力装置が、増幅器Ａ６と増幅器Ｂ７の増幅率を調整する処理のフローチャートである。本フローチャートの処理は、話者がリモコン９から各増幅器の増幅率の調整を行うよう指示するたびに実行される。
【００３６】
ステップ３０１では、スイッチング回路１０のスイッチをＥＣＵ１１側へ切り換える。ステップ３０２では、テスト信号の出力を開始する。テスト信号はＴＳＰ信号、正弦波、ホワイトノイズ、インパルスのいずれかでよく、スピーカ３から出力される。
【００３７】
ステップ３０３では、所定時間だけウェイトし、テスト信号から送話信号が生成されるのを待つ。ステップ３０４では、ステップ３０２で出力したテスト信号に応じて、生成された送話信号のレベルを検出する。ステップ３０５では、ステップ３０４で検出した送話信号のレベルの絶対値が所定値Ｋ以下であるか否かを判定する。送話信号のレベルの絶対値が所定値Ｋ以下であれば、ステップ３０７へ進み、テスト信号の出力を終了する。その後、ステップ３０８にてスイッチング回路１０のスイッチを通信装置５側へ切り換え、処理を終了する。一方、ステップ３０５で検出した送話信号のレベルの絶対値が所定値Ｋ以下でない場合は、ステップ３０６へ進み、増幅器Ａ６もしくは増幅器Ｂ７の増幅率を、送話信号のレベルの絶対値がゼロに近づくように変更する。その後、ステップ３０３へ戻り、送話信号のレベルの絶対値が所定値Ｋ以下となるまで、上述の手順を繰り返す。
【００３８】
このように、マイクＡ１およびマイクＢ２がスピーカ３から等距離に配置されていない場合、各マイクが生成する音声信号のうち、スピーカ３からの音声に基づく音声信号成分は等しいレベルとならず、信号減算器４はスピーカ３からの音声に基づく音声信号成分を送話信号から除去することができない。また、マイクＡ１およびマイクＢ２がスピーカ３から等距離に配置されていても、各マイクの有するレベル特性のばらつきが大きい場合には、スピーカ３からの音声に基づく音声信号成分は等しいレベルとならず、信号減算器４はスピーカ３からの音声に基づく音声信号成分を送話信号から十分に除去することはできない。さらには、例えば車室内の音響特性によっても、信号減算器４はスピーカ３からの音声に基づく音声信号成分を送話信号から除去することができない場合がある。そのため、スピーカ３からの音声に基づく音声信号成分のレベルが等しくなるように各増幅器の増幅率を調整し、増幅された各音声信号のレベルの偏差に基づいて送話信号を生成することで、各マイクの配置や、各マイクの有するレベル特性のばらつき、および車室内の音響特性によらず、スピーカ３からの音声に基づく音声信号成分を送話信号から除去することが可能となるのである。また、生成された送話信号をレベル検出器８によって検出し、検出結果に基づいてＥＣＵ１１が増幅器Ａ６および増幅器Ｂ７の増幅率を変更することで、マイクＡ１およびマイクＢ２の配置や、各マイクのレベル特性のばらつき、および車室内の音響特性による音声信号のレベルの変動を高精度に補正することが可能となるのである。
【００３９】
なお、本フローチャートの処理は、本音声入出力装置が使用されていない際に、一定時間毎に自動的に行うこととしても良い。また、本音声入出力装置を搭載した車両のディーラーや工場等で定期的に行うこととしてもよい。
【００４０】
次に、第２実施形態の音声入出力装置が、送話信号を生成する処理を順に説明する。
【００４１】
話者が発した音声は、マイクＡ１およびマイクＢ２から入力され、それぞれ音声信号が生成される。次に、増幅器Ａ６および増幅器Ｂ７は、マイクＡ１およびマイクＢ２によって生成された音声信号を、図３のフローチャートの処理を実行することによって設定された増幅率に従って増幅する。信号減算器４は、増幅器Ａ６によって増幅された音声信号のレベルから、増幅器Ｂ７によって増幅された音声信号のレベルを代数的に減算して、２つの音声信号のレベルの偏差を算出し、スピーカ３から出力された音声の音声信号成分を除去した送話信号を生成する。
【００４２】
このように、本実施形態の音声入出力装置では、マイクＡ１およびマイクＢ２が生成した音声信号のレベルを増幅器Ａ６および増幅器Ｂ７によってそれぞれ増幅し、増幅された各音声信号のレベルの偏差を信号減算器４によって算出して、送話信号を生成する。これにより、各マイクの配置や、各マイクのレベル特性のばらつき、および車室内の音響特性によらず、スピーカ３から出力された音声を送話信号から除去することが可能となるのである。なお、各マイクとスピーカとの距離は等距離であっても良い。また、増幅器は１つだけ設けることとし、どちから一方の音声信号を増幅することとしても良い。
（第３実施形態）
図４は、本発明の第３実施形態である音声入出力装置の全体構成を示すブロック図である。図４に示すように、本実施形態では、上述の第２実施形態における音声入出力装置において、増幅器Ａ６および増幅器Ｂ７に代えて波形成形器Ａ１２および波形成形器Ｂ１３をそれぞれ設け、さらに、レベル検出器８に代えて波形検出器１４を設けた点が第２実施形態と異なる。
【００４３】
波形成形器Ａ１２および波形成形器Ｂ１３は、マイクＡ１およびマイクＢ２が生成した各音声信号の波形を、設定された成形パラメータに従って成形し、各音声信号のレベルおよび位相を成形パラメータに従って変更する。各波形成形器の成形パラメータは可変であり、ＥＣＵ１１によって変更される。音声信号のレベルや位相の変更に関しては、例えばＡ／Ｄ変換器によって音声信号をディジタル信号に変換し、ｚ変換、ｗａｖｅｌｅｔ変換等により行うこととしても良い。
【００４４】
波形検出器１４は、例えば信号計測器であり、信号減算器４が生成した送話信号の波形を検出する。送話信号の波形の検出に関しては、例えばＡ／Ｄ変換器によってディジタル信号に変換して検出することとしてもよい。
【００４５】
本実施形態のリモコン９は、ＥＣＵ１１に対して波形成形器Ａ１２と波形成形器Ｂ１３の成形パラメータの調整を行うよう指示する。
【００４６】
本実施形態のＥＣＵ１１は、リモコン９から波形成形器Ａ１２と波形成形器Ｂ１３の成形パラメータの調整を行うよう指示されると、スイッチング回路１０に対してスピーカ３の接続先をＥＣＵ１１側へ切り換えるよう指示する。そして、テスト信号をスピーカ３へ出力し、波形検出器１４が検出した送話信号の波形と、テスト信号の波形との偏差が所定値よりも小さくなるように、波形成形器Ａ１２と波形成形器Ｂ１３の成形パラメータを変更する。
【００４７】
その他の構成・動作については、前述の第２実施形態と同様なので、説明を省略する。
【００４８】
図５は、第３実施形態の音声入出力装置が、波形成形器Ａ１２と波形成形器Ｂ１３の成形パラメータを調整する処理のフローチャートである。図５に示すように、本実施形態のフローチャートでは、前述の第２実施形態における図３のフローチャートにおいて、生成された送話信号のレベルを検出するステップに代わり、生成された送話信号の波形を検出するステップを設け、送話信号のレベルの絶対値が所定値Ｋ以下であるか否かを判定するステップに代わり、送話信号の波形とテスト信号の波形との偏差が所定値Ｌよりも小さくなるか否かを判定するステップを設ける。また、増幅器Ａ６もしくは増幅器Ｂ７の増幅率を変更するステップに代わり、波形成形器Ａ１２もしくは波形成形器Ｂ１３の成形パラメータを変更するステップを設ける。換言すれば、ステップ５０４、５０５、５０６以外の全ての処理は、前述の第２実施形態における図３のフローチャートの処理と同様であり、その説明は省略する。なお、本フローチャートの処理は、話者がリモコン９から各波形成形器の成形パラメータを調整するよう指示するたびに実行される。
【００４９】
ステップ５０４では、生成された送話信号の波形を検出する。ステップ５０５では、ステップ５０４で検出した送話信号の波形と、ステップ５０２で出力したテスト信号の波形との偏差を算出し、これが所定値Ｌ以下であるか否かを判定する。送話信号の波形とテスト信号の波形との偏差が所定値Ｌ以下でない場合は、ステップ５０６へ進み、波形成形器Ａ１２および波形成形器Ｂ１３の成形パラメータを、送話信号の波形とテスト信号の波形との偏差が小さくなるように変更する。その後、ステップ５０３へ戻り、送話信号の波形とテスト信号の波形との偏差が所定値Ｌ以下となるまで、上述の手順を繰り返す。
【００５０】
このように、生成された送話信号を波形検出器１４によって検出し、検出結果に基づいてＥＣＵ１１が波形成形器Ａ１２および波形成形器Ｂ１３の成形パラメータを変更することで、各マイクの配置や、各マイクのレベル特性および位相特性のばらつき、および車室内の音響特性による音声信号のレベルおよび位相の変動を、より高精度に補正することが可能となるのである。
【００５１】
なお、本フローチャートの処理は、本音声入出力装置が使用されていない際に、一定時間毎に自動的に行うこととしても良い。また、本音声入出力装置を搭載した車両のディーラーや工場等で定期的に行うこととしてもよい。
【００５２】
次に、第３実施形態の音声入出力装置が、送話信号を生成する処理を順に説明する。
【００５３】
話者が発した音声は、マイクＡ１およびマイクＢ２から入力され、それぞれ音声信号が生成される。次に、波形成形器Ａ１２および波形成形器Ｂ１３は、マイクＡ１およびマイクＢ２によって生成された音声信号の波形を、図５のフローチャートの処理を実行することによって設定された成形パラメータに従って、成形する。信号減算器４は、波形成形器Ａ１２によって成形された音声信号のレベルから、波形成形器Ｂ１３によって成形された音声信号のレベルを減算して、２つの音声信号のレベルの偏差を算出し、スピーカ３から出力された音声の音声信号成分を除去した送話信号を生成する。
【００５４】
このように、本実施形態の音声入出力装置では、各マイクが生成した音声信号の波形を波形成形器Ａ１２および波形成形器Ｂ１３によってそれぞれ成形し、信号減算器４は成形された各音声信号のレベルの偏差を算出して送話信号を生成する。これにより、各マイクの配置や、各マイクのレベル特性のばらつき、および車室内の音響特性によらず、スピーカ３から出力された音声を、より高い精度で送話信号から除去することが可能となるのである。また、波形成形器は１つだけ設けることとし、どちから一方の音声信号の波形を成形することとしても良い。
【００５５】
なお、上述した各実施形態において、信号減算器４の後段にエコーキャンセラやハウリング防止回路等を設けてもよい。これにより、本音声入出力装置の構成は複雑化するが、より高品質な通話を行うことが可能となる。また、マイクは３つ以上あっても良いし、スピーカも複数あってもよい。
【００５６】
さらに、上述した各実施形態は、それぞれ単独で用いても良いし、複合して用いても良いことは言うまでもない。
【図面の簡単な説明】
【図１】本発明の第１実施形態である音声入出力装置の全体構成を示すブロック図である。
【図２】本発明の第２実施形態である音声入出力装置の全体構成を示すブロック図である。
【図３】第２実施形態の音声入出力装置が、増幅器Ａと増幅器Ｂの増幅率を調整する処理のフローチャートである。
【図４】本発明の第３実施形態である音声入出力装置の全体構成を示すブロック図である。
【図５】第３実施形態の音声入出力装置が、波形成形器Ａと波形成形器Ｂの成形パラメータを調整する処理のフローチャートである
【符号の説明】
１…マイクＡ、２…マイクＢ、３…スピーカ、４…信号減算器、５…通信装置[0001]
TECHNICAL FIELD OF THE INVENTION
The present invention relates to a voice input / output device.
[0002]
[Prior art]
2. Description of the Related Art In recent years, with the development of communication technology, devices that perform communication in various communication modes are known.
[0003]
For example, Patent Literature 1 discloses a portable terminal that allows three speakers to talk at the same time. This portable terminal can communicate using two communication systems, a PDC system and a PHS system, and an echo canceller is provided for each communication system. When three people talk at the same time, speaker A connects a line to speaker B and speaker C using different communication methods, and talks. The call between the speaker B and the speaker C is performed via the portable terminal of the speaker A. However, since the call is performed using a different communication method, an echo provided for each communication method is provided. A call can be made without interrupting the call voices of the speakers B and C by the canceller.
[0004]
[Patent Document 1]
JP 2000-196736 A
[0005]
[Problems to be solved by the invention]
On the other hand, with the diversification of such communication modes, the configuration of the communication apparatus has become complicated. In particular, a voice input / output device that performs one function of a communication device is a typical example, and the degree of complexity is remarkable due to the installation of a howling prevention function, an echo cancellation function, and the like. However, a voice input / output device having a complicated configuration may not be used depending on restrictions imposed when using the communication device. On the other hand, if the simplification is performed by omitting the functions of the voice input / output device, the call quality deteriorates and the call itself becomes difficult.
[0006]
The present invention has been made in view of the above problems, and has as its object to provide a voice input / output device capable of performing high-quality communication with a simple configuration.
[0007]
[Means for Solving the Problems]
To achieve the above object, the voice input / output device according to claim 1, wherein each of the plurality of voice input means for inputting a voice of a speaker and generating a voice signal outputs voice of a communication partner. Calculates a deviation of the level of each audio signal generated by each of the plurality of audio input means and is arranged at an equal distance from the output means, and generates a transmission signal based on the calculated deviation. Processing means for removing the voice output from the voice output means mixed in the means from the transmission signal, transmitting the transmission signal generated by the processing means to the other party, and receiving the reception signal transmitted from the other party And communication means for outputting to the audio output means.
[0008]
As described above, in the voice input / output device according to the first aspect, the voice of the speaker is input from the plurality of voice input units that are arranged equidistant from the voice output unit that outputs the voice of the other party. At this time, since the plurality of voice input units are arranged at the same distance from the voice output unit, when the voice from the voice output unit is input to each voice input unit, the voice signal has the same level. For this reason, the processing means calculates the deviation of the level of each audio signal generated by each audio input means, so that the audio signal component based on the audio from the audio output means is removed, and the transmission based on the voice of the speaker is performed. A signal is generated. Since the transmission signal generated in this manner does not include a signal component output from the audio output unit that causes echo or howling, when the transmission signal is transmitted to the other party by the communication unit, The other party can hear the speaker's voice in clear voice.
[0009]
As described above, according to the voice input / output device of the first aspect, it is possible to prevent the occurrence of echo and howling, and to achieve a high-quality call with a simple configuration.
[0010]
As described in claim 2, it is desirable that two voice input means are provided, and each of the voice input means and the voice output means are arranged on a straight line. This makes it possible to reduce the size of the audio input / output device while preventing the occurrence of echoes and howling, thereby facilitating installation.
[0011]
Further, the voice input / output device according to claim 3 includes a plurality of voice input means for inputting a voice of a speaker to generate a voice signal, a voice output means for outputting a voice of a communication partner, and a voice output means. Processing means for processing at least one of the audio signals generated by the plurality of audio input means so that the levels of the audio signals by the voices input to the plurality of audio input means are equal; and Processing means for calculating a level deviation and generating a transmission signal based on the calculated level deviation, thereby removing, from the transmission signal, the voice output from the voice output means, which has been mixed into the voice input means together with the speaker's voice; And communication means for transmitting the transmission signal generated by the processing means to the other party, receiving the reception signal transmitted from the other party, and outputting the received signal to the voice output means.
[0012]
As described above, in the voice input / output device according to the third aspect, the voice of the speaker is input from the plurality of voice input units, but when each voice input unit is not arranged at the same distance from the voice output unit, Each audio input means cannot generate an equal level audio signal when audio is input from the audio output means. That is, the processing means cannot remove the sound signal component based on the sound from the sound output means only by using the sound signal generated by each sound input means as it is. Further, even if each voice input means is arranged at the same distance from the voice output means, if the level characteristics and phase characteristics of each voice input means have large variations, each voice input means outputs the voice from the voice output means. Is input, an audio signal of the same level cannot be generated, and the processing means cannot sufficiently remove the audio signal component based on the audio from the audio output means. Furthermore, the processing means may not be able to remove the audio signal component based on the sound from the sound output means, for example, due to the acoustic characteristics of the vehicle interior.
[0013]
Therefore, in the voice input / output device according to the third aspect, at least one of the voice signals generated by the voice input means is processed so that the level of the voice signal by voice input from the voice output means to each voice input means is equal. The voice signal is processed by the means, and the deviation of the level of each processed voice signal is calculated to generate a transmission signal. Thereby, regardless of the arrangement of each voice input unit, the variation in the level characteristics and phase characteristics of each voice input unit, and the acoustic characteristics in the cabin, the voice signal component based on the voice from the voice output unit is removed, It is possible to generate a transmission signal based on the voice of the speaker.
[0014]
Preferably, the processing means has a function of adjusting the level of the audio signal. By amplifying and adjusting the level of the audio signal generated by each audio input unit, the arrangement of each audio input unit, the variation in the level characteristics of each audio input unit, and the level of the audio signal due to the acoustic characteristics of the vehicle interior Can be easily corrected.
[0015]
It is preferable that the processing means has a function of adjusting the phase of the audio signal. As a result, highly accurate correction can be performed in consideration of the arrangement of the voice input units, the variation in the phase characteristics of the voice input units, and the variation in the phase of the voice signal due to the acoustic characteristics of the vehicle interior.
[0016]
As described in claim 6, it is desirable to include a detecting means for detecting the transmission signal generated by the processing means, and a control means for controlling the operation of the processing means based on the detection result of the detecting means. The level of the generated transmission signal is detected by the detecting unit, and the control unit controls the operation of the processing unit based on the detection result, so that the voice signal can be corrected with higher accuracy.
[0017]
According to a seventh aspect of the present invention, the voice input / output device operates while being mounted on a vehicle. The voice input / output device divides a width of a cabin of the vehicle into two equal parts in parallel with a traveling direction of the vehicle. Desirably, they are arranged on an equal line.
[0018]
The voice input / output device removes a voice signal component based on voice from the voice output means by calculating a deviation of the level of each voice signal generated by the plurality of voice input means. However, when the speaker emits a sound from a place located at an equal distance from each voice input means, the level of each of the voice signals generated by each voice input means from the speaker's voice becomes equal, and the processing means When calculating the signal level deviation, a voice signal component based on the voice of the speaker is removed. When the present voice input / output device is mounted on a vehicle and operated, the width of the cabin of the vehicle is arranged on a bisector that is bisected in parallel with the traveling direction of the vehicle, so that the speaker can move inside the vehicle. No matter which seat you speak, the distance from the speaker to each voice input means will not be equal, and it will be possible to talk without removing the voice signal component based on the voice of the speaker.
[0019]
As described in claim 8, the voice input / output device is desirably disposed on a device mounted on the ceiling of the vehicle compartment, such as a room light, a room mirror, a map lamp, or an overhead module. . As a result, a call can be made without removing a voice signal component based on the voice of the speaker, and the space in the vehicle compartment can be effectively utilized.
[0020]
BEST MODE FOR CARRYING OUT THE INVENTION
(1st Embodiment)
FIG. 1 is a block diagram showing an overall configuration of a voice input / output device according to a first embodiment of the present invention.
[0021]
The microphones A1 and B2, which are voice input means, are, for example, small microphones, and generate voice signals by inputting the voice of the speaker. The speaker 3, which is an audio output unit, is a small speaker, for example, and outputs the voice of the other party. Note that both the microphone A1 and the microphone B2 in the present embodiment are arranged at a distance d from the speaker 3, and the microphone A1 and the microphone B2 are arranged in a straight line with the speaker 3 interposed therebetween. Have been. This is because it is possible to reduce the size of the audio input / output device while preventing the occurrence of echo and howling, and to facilitate installation.
[0022]
The voice input / output device is installed above the room light of the vehicle. As a result, no matter which seat the speaker sits on, the distance between the speaker and each microphone is not equal, and a voice call can be made without removing a voice signal component based on the voice of the speaker ( Details will be described later). Furthermore, the space in the vehicle compartment can be effectively utilized. The location of the voice input / output device is such that the distance between the speaker and each microphone is not equal, that is, on a bisector that divides the width of the cabin into two in parallel with the traveling direction of the vehicle. If it is installed in any location, it may be installed anywhere.
[0023]
The signal subtractor 4 as a processing means is composed of, for example, an operational amplifier, and algebraically subtracts the level of the audio signal generated by the microphone B2 from the level of the audio signal generated by the microphone A1, and calculates the level of the two audio signals. The transmission signal is generated by calculating the deviation. Regarding the generation of the transmission signal, the level of the audio signal generated by the microphone A1 is algebraically subtracted from the level of the audio signal generated by the microphone B2, and the level difference between the two audio signals is calculated to calculate the transmission signal. May be generated. As for the subtraction of the level of the audio signal, for example, the level of the audio signal generated by the microphones A1 and B2 may be digitized once by the A / D converter, and the subtraction may be performed by the digital subtraction circuit. Since the signal of the speaker to be transmitted is also subtracted by the signal subtractor 4 and the level of the deviation is reduced, it is preferable to connect an amplifier before or after the signal subtractor 4.
[0024]
The communication device 5 as a communication means is, for example, an in-vehicle wireless device, and transmits the transmission signal generated by the signal subtractor 4 to the other party of the call. Also, it receives the reception signal transmitted from the other party and outputs it to speaker 3. As for communication with the other party, for example, a communication function mounted on the navigation device may be used, or a communication function of a mobile phone may be used.
[0025]
Next, a process in which the voice input / output device of the first embodiment generates a transmission signal will be described in order.
[0026]
The voice uttered by the speaker is input from the microphone A1 and the microphone B2, and a voice signal is generated. Next, the signal subtractor 4 algebraically subtracts the level of the audio signal generated by the microphone B2 from the level of the audio signal generated by the microphone A1, and calculates a deviation between the levels of the two audio signals. Is generated as a transmission signal. Since the present voice input / output device is installed above the room light of the vehicle, the distance between the speaker and each microphone is not equal even if the speaker is sitting on any seat in the vehicle interior. Therefore, the level of the audio signal component based on the speaker's voice of the audio signal generated by each microphone is not equal, and even if the signal subtractor 4 algebraically subtracts the levels of the two audio signals, the speaker's voice is Is not removed. On the other hand, since the speaker 3 and each microphone are arranged at the same distance, the level of the audio signal component of the audio signal generated by each microphone based on the audio output from the speaker 3 becomes substantially equal, and the signal subtracter 4 By algebraically subtracting the levels of the two audio signals, an audio signal component based on the audio output from the speaker 3 is removed.
[0027]
As described above, in the audio input / output device of the present embodiment, the levels of the audio signals generated by the microphones A1 and B2 arranged at the same distance from the speaker 3 are algebraically subtracted, and the levels of the two audio signals are reduced. By generating the deviation as the transmission signal, it is possible to remove from the transmission signal the sound output from the speaker 3 and mixed into the microphones A1 and B2. As a result, occurrence of echo and howling can be prevented, and high-quality communication can be performed with a simple configuration.
(2nd Embodiment)
FIG. 2 is a block diagram showing an overall configuration of a voice input / output device according to a second embodiment of the present invention. As shown in FIG. 2, the present embodiment is different from the first embodiment in that an amplifier A6 and an amplifier B7, a level detector 8, a remote controller 9, a switching circuit 10, and an ECU 11 are newly provided. Is different from the first embodiment.
[0028]
The amplifiers A6 and B7, which are processing means, are variable amplifiers capable of changing the amplification factor, and amplify the levels of the respective audio signals generated by the microphones A1 and B2 according to the set amplification factors. The amplification factor of each amplifier is changed by the ECU 11. As for the amplification of the level of the audio signal, the signal may be once converted into a digital signal by an A / D converter and then amplified.
[0029]
The level detector 8 serving as a detecting unit is, for example, a signal measuring device, and detects the level of the transmission signal generated by the signal subtractor 4 and outputs the level to the ECU 11. As for the detection of the level of the transmission signal, the signal may be converted into a digital signal by an A / D converter and detected.
[0030]
The remote controller 9 is, for example, a multi-function remote controller having ten keys and function keys, and instructs the ECU 11 to adjust the amplification factors of the amplifiers A6 and B7. The instruction for adjusting the amplification factor may be issued from a computer via a communication network. Further, an adjustment switch may be provided for each amplifier, and a user or a service person may manually adjust.
[0031]
The switching circuit 10 is configured by, for example, a flip-flop or the like, and switches the connection destination of the speaker 3 to the communication device 5 or the ECU 11 according to an instruction from the ECU 11. The switching of the connection destination of the speaker 3 may be performed using, for example, a non-contact relay.
[0032]
When instructed by the remote controller 9 to adjust the amplification factors of the amplifiers A6 and B7, the ECU 11 instructs the switching circuit 10 to switch the connection destination of the speaker 3 to the ECU 11 side. Then, the test signal is output to the speaker 3, and the amplification factors of the amplifiers A6 and B7 are changed so that the level of the transmission signal detected by the level detector 8 becomes equal to or less than a predetermined value. These operations may be performed by a CPU such as a car navigation device or a communication device mounted on the vehicle.
[0033]
Note that the microphone A1 of this embodiment is arranged at a distance d1 from the speaker 3, and the microphone B2 is arranged at a distance d2 from the speaker 3.
[0034]
Other configurations and operations are the same as those of the above-described first embodiment, and a description thereof will not be repeated.
[0035]
FIG. 3 is a flowchart of a process in which the audio input / output device of the second embodiment adjusts the amplification factors of the amplifiers A6 and B7. The process of this flowchart is executed every time the speaker gives an instruction from the remote controller 9 to adjust the amplification factor of each amplifier.
[0036]
In step 301, the switch of the switching circuit 10 is switched to the ECU 11 side. In step 302, the output of the test signal is started. The test signal may be any of a TSP signal, a sine wave, white noise, and an impulse, and is output from the speaker 3.
[0037]
In step 303, the system waits for a predetermined time and waits for a transmission signal to be generated from the test signal. In step 304, the level of the generated transmission signal is detected according to the test signal output in step 302. In step 305, it is determined whether or not the absolute value of the level of the transmission signal detected in step 304 is equal to or less than a predetermined value K. If the absolute value of the level of the transmission signal is equal to or smaller than the predetermined value K, the process proceeds to step 307, and the output of the test signal ends. Then, in step 308, the switch of the switching circuit 10 is switched to the communication device 5 side, and the process ends. On the other hand, if the absolute value of the level of the transmission signal detected in step 305 is not equal to or smaller than the predetermined value K, the process proceeds to step 306, and the amplification factor of the amplifier A6 or B7 is reduced to zero. Change to get closer. Thereafter, the process returns to step 303, and the above-described procedure is repeated until the absolute value of the level of the transmission signal becomes equal to or less than the predetermined value K.
[0038]
As described above, when the microphone A1 and the microphone B2 are not arranged at the same distance from the speaker 3, among the audio signals generated by the microphones, the audio signal component based on the audio from the speaker 3 does not have the same level, and The subtracter 4 cannot remove the audio signal component based on the audio from the speaker 3 from the transmission signal. Further, even if the microphones A1 and B2 are arranged at the same distance from the speaker 3, if the level characteristics of the microphones vary greatly, the sound signal components based on the sound from the speaker 3 do not have the same level. The signal subtracter 4 cannot sufficiently remove the audio signal component based on the audio from the speaker 3 from the transmission signal. Furthermore, the signal subtractor 4 may not be able to remove the audio signal component based on the audio from the speaker 3 from the transmission signal, for example, due to the acoustic characteristics of the vehicle interior. Therefore, by adjusting the amplification factor of each amplifier so that the level of the audio signal component based on the audio from the speaker 3 becomes equal, and generating the transmission signal based on the deviation of the level of each amplified audio signal, It is possible to remove the audio signal component based on the sound from the speaker 3 from the transmission signal irrespective of the arrangement of the microphones, the variation in the level characteristics of the microphones, and the acoustic characteristics in the cabin. Further, the generated transmission signal is detected by the level detector 8, and based on the detection result, the ECU 11 changes the amplification factors of the amplifiers A6 and B7, thereby disposing the microphones A1 and B2 and the microphones. This makes it possible to accurately correct variations in level characteristics and fluctuations in the level of audio signals due to acoustic characteristics in the vehicle cabin.
[0039]
Note that the processing of this flowchart may be automatically performed at regular intervals when the audio input / output device is not used. Further, it may be performed periodically at a dealer or a factory of a vehicle equipped with the voice input / output device.
[0040]
Next, a process in which the voice input / output device of the second embodiment generates a transmission signal will be described in order.
[0041]
The voice uttered by the speaker is input from the microphone A1 and the microphone B2, and a voice signal is generated. Next, the amplifiers A6 and B7 amplify the audio signals generated by the microphones A1 and B2 according to the amplification factor set by executing the processing of the flowchart of FIG. The signal subtractor 4 algebraically subtracts the level of the audio signal amplified by the amplifier B7 from the level of the audio signal amplified by the amplifier A6, calculates a deviation between the levels of the two audio signals, and To generate a transmission signal from which the audio signal component of the audio output from is removed.
[0042]
As described above, in the audio input / output device of the present embodiment, the levels of the audio signals generated by the microphones A1 and B2 are amplified by the amplifiers A6 and B7, respectively, and the difference between the levels of the amplified audio signals is subtracted. The transmission signal is calculated by the calculator 4. This makes it possible to remove the sound output from the speaker 3 from the transmission signal regardless of the arrangement of the microphones, the variation in the level characteristics of the microphones, and the acoustic characteristics in the vehicle interior. Note that the distance between each microphone and the speaker may be equal. Further, only one amplifier may be provided, and one of the audio signals may be amplified.
(Third embodiment)
FIG. 4 is a block diagram showing an overall configuration of a voice input / output device according to a third embodiment of the present invention. As shown in FIG. 4, in the present embodiment, in the audio input / output device according to the above-described second embodiment, a waveform shaper A12 and a waveform shaper B13 are provided instead of the amplifier A6 and the amplifier B7, respectively. The difference from the second embodiment is that a waveform detector 14 is provided instead of the detector 8.
[0043]
The waveform shaper A12 and the waveform shaper B13 shape the waveform of each audio signal generated by the microphones A1 and B2 according to the set shaping parameters, and change the level and phase of each audio signal according to the shaping parameters. The molding parameters of each waveform shaper are variable, and are changed by the ECU 11. The level and phase of the audio signal may be changed by, for example, converting the audio signal into a digital signal using an A / D converter and performing z-conversion, wavelet conversion, or the like.
[0044]
The waveform detector 14 is, for example, a signal measuring device, and detects the waveform of the transmission signal generated by the signal subtractor 4. As for the detection of the waveform of the transmission signal, the signal may be converted into a digital signal by an A / D converter and detected.
[0045]
The remote controller 9 of the present embodiment instructs the ECU 11 to adjust the molding parameters of the waveform shapers A12 and B13.
[0046]
When instructed by the remote controller 9 to adjust the shaping parameters of the waveform shapers A12 and B13, the ECU 11 of the present embodiment instructs the switching circuit 10 to switch the connection destination of the speaker 3 to the ECU 11 side. I do. The test signal is output to the speaker 3, and the waveform shaper A12 and the waveform shaper A12 are arranged such that the deviation between the waveform of the transmission signal detected by the waveform detector 14 and the waveform of the test signal is smaller than a predetermined value. Change the molding parameters of B13.
[0047]
Other configurations and operations are the same as those in the above-described second embodiment, and thus description thereof is omitted.
[0048]
FIG. 5 is a flowchart of a process in which the voice input / output device of the third embodiment adjusts the shaping parameters of the waveform shapers A12 and B13. As shown in FIG. 5, in the flowchart of this embodiment, the waveform of the generated transmission signal is replaced with the step of detecting the level of the generated transmission signal in the flowchart of FIG. 3 in the second embodiment. And a step of determining whether or not the absolute value of the level of the transmission signal is equal to or less than a predetermined value K is performed, and the deviation between the waveform of the transmission signal and the waveform of the test signal is smaller than the predetermined value L. And a step of determining whether or not the size is also reduced. Further, instead of changing the amplification factor of the amplifier A6 or B7, a step of changing the shaping parameters of the waveform shaper A12 or B13 is provided. In other words, all the processes other than steps 504, 505, and 506 are the same as the processes in the flowchart of FIG. 3 in the above-described second embodiment, and a description thereof will be omitted. The process of this flowchart is executed every time the speaker gives an instruction from the remote controller 9 to adjust the shaping parameters of each waveform shaper.
[0049]
In step 504, the waveform of the generated transmission signal is detected. In step 505, the deviation between the waveform of the transmission signal detected in step 504 and the waveform of the test signal output in step 502 is calculated, and it is determined whether or not this is less than or equal to a predetermined value L. If the difference between the waveform of the transmission signal and the waveform of the test signal is not less than or equal to the predetermined value L, the process proceeds to step 506, where the shaping parameters of the waveform shaper A12 and the waveform shaper B13 are changed. Change so that the deviation from the waveform is small. Thereafter, the process returns to step 503, and the above-described procedure is repeated until the deviation between the waveform of the transmission signal and the waveform of the test signal becomes equal to or smaller than the predetermined value L.
[0050]
In this way, the generated transmission signal is detected by the waveform detector 14, and the ECU 11 changes the shaping parameters of the waveform shaper A12 and the waveform shaper B13 based on the detection result. Variations in the level characteristics and phase characteristics of each microphone and fluctuations in the level and phase of the audio signal due to the acoustic characteristics in the vehicle cabin can be corrected with higher accuracy.
[0051]
Note that the processing of this flowchart may be automatically performed at regular intervals when the audio input / output device is not used. Further, it may be performed periodically at a dealer or a factory of a vehicle equipped with the voice input / output device.
[0052]
Next, a process in which the voice input / output device of the third embodiment generates a transmission signal will be described in order.
[0053]
The voice uttered by the speaker is input from the microphone A1 and the microphone B2, and a voice signal is generated. Next, the waveform shaper A12 and the waveform shaper B13 shape the waveform of the audio signal generated by the microphones A1 and B2 according to the shaping parameters set by executing the processing of the flowchart of FIG. The signal subtracter 4 subtracts the level of the audio signal shaped by the waveform shaper B13 from the level of the audio signal shaped by the waveform shaper A12 to calculate the deviation between the levels of the two audio signals, and A speech signal from which the speech signal component of the speech output from the speech signal 3 is removed is generated.
[0054]
As described above, in the audio input / output device of the present embodiment, the waveform of the audio signal generated by each microphone is shaped by the waveform shaper A12 and the waveform shaper B13, and the signal subtractor 4 outputs the waveform of each shaped audio signal. The transmission signal is generated by calculating the level deviation. This makes it possible to remove the voice output from the speaker 3 from the transmission signal with higher accuracy regardless of the arrangement of the microphones, the variation in the level characteristics of the microphones, and the acoustic characteristics in the vehicle compartment. It becomes. Further, only one waveform shaper may be provided, and the waveform of one of the audio signals may be shaped.
[0055]
In each of the above-described embodiments, an echo canceller, a howling prevention circuit, or the like may be provided after the signal subtractor 4. As a result, the configuration of the voice input / output device is complicated, but it is possible to perform higher quality communication. Further, three or more microphones may be provided, and a plurality of speakers may be provided.
[0056]
Furthermore, it goes without saying that each of the above-described embodiments may be used alone or in combination.
[Brief description of the drawings]
FIG. 1 is a block diagram illustrating an overall configuration of a voice input / output device according to a first embodiment of the present invention.
FIG. 2 is a block diagram illustrating an overall configuration of a voice input / output device according to a second embodiment of the present invention.
FIG. 3 is a flowchart of a process in which the audio input / output device of the second embodiment adjusts the amplification factors of an amplifier A and an amplifier B.
FIG. 4 is a block diagram showing an overall configuration of a voice input / output device according to a third embodiment of the present invention.
FIG. 5 is a flowchart of a process in which the voice input / output device of the third embodiment adjusts shaping parameters of the waveform shapers A and B.
[Explanation of symbols]
DESCRIPTION OF SYMBOLS 1 ... Microphone A, 2 ... Microphone B, 3 ... Speaker, 4 ... Signal subtractor, 5 ... Communication device

Claims

Each of a plurality of voice input means for inputting a speaker's voice and generating a voice signal is disposed equidistant from a voice output means for outputting a voice of the other party,
Calculating the deviation of the level of each audio signal generated by each of the plurality of audio input means, by generating a transmission signal based on this, mixed with the audio of the speaker into the audio input means, Processing means for removing the sound output from the sound output means from the transmission signal;
Communication means for transmitting the transmission signal generated by the processing means to the other party, receiving the reception signal transmitted from the other party, and outputting the received signal to the voice output means. .

The voice input / output device according to claim 1, wherein two voice input units are provided, and each of the voice input units and the voice output unit are arranged on a straight line.

A plurality of voice input means for inputting a speaker's voice to generate a voice signal;
Voice output means for outputting the voice of the other party;
Processing means for processing at least one of the audio signals generated by the plurality of audio input means, so that the level of the audio signal by the audio input to the plurality of audio input means from the audio output means is equal,
After processing by the processing means, the level deviation of each audio signal is calculated, and a transmission signal is generated based on the calculated deviation, so that the output from the voice output means mixed with the voice of the speaker into the voice input means is output. Processing means for removing the speech from the transmission signal;
Communication means for transmitting the transmission signal generated by the processing means to the other party, receiving the reception signal transmitted from the other party, and outputting the received signal to the voice output means. .

The audio input / output device according to claim 3, wherein the processing means has a function of adjusting a level of the audio signal.

The audio input / output device according to claim 3, wherein the processing unit has a function of adjusting a phase of the audio signal.

Detecting means for detecting the transmission signal generated by the processing means;
4. The voice input / output device according to claim 3, further comprising control means for controlling an operation of said processing means based on a detection result of said detection means.

The voice input / output device is mounted and operated on a vehicle, and the voice input / output device is arranged on a bisector that bisects a width of a cabin of the vehicle in parallel with a traveling direction of the vehicle. The audio input / output device according to any one of claims 1 to 6, wherein:

The said sound input / output device is arrange | positioned at the apparatus attached to the ceiling of the vehicle interior of any of a room light in a vehicle interior, a room mirror, a map lamp, or an overhead module, The claim 7 characterized by the above-mentioned. Voice input / output device.