JP2010263401A

JP2010263401A - Handsfree speech communication device, and voice correcting method of the device

Info

Publication number: JP2010263401A
Application number: JP2009112538A
Authority: JP
Inventors: Masaya Yamaguchi; 正也山口
Original assignee: Alps Electric Co Ltd
Current assignee: Alps Alpine Co Ltd
Priority date: 2009-05-07
Filing date: 2009-05-07
Publication date: 2010-11-18

Abstract

PROBLEM TO BE SOLVED: To provide a hands-free speech communication device that identifies a speaker by acquiring the voice features of the speaker, and to provide a voice correcting method for the device.
SOLUTION: The hands-free speech communication device includes a sampling section (44) that extracts in advance a voice volume feature and a voice quality feature of the speaker from the speaker's voice, and a correcting section (48) that sets in advance a correction condition for the speaker from the extracted voice volume feature and voice quality feature, and that also identifies a caller by comparing the extracted features with the caller's voice and outputs a corrected voice of the caller on the basis of the correction condition.
COPYRIGHT: (C)2011, JPO&INPIT

Description

The present invention relates to a hands-free communication device particularly suitable for use in a vehicle, and to a voice correction method for the device.

This type of hands-free communication device is electrically connected to the mobile phone of a speaker, for example the driver, so that the driver can talk with an outside party without holding the phone in hand. This is needed because the use of mobile phones while driving is regulated by law.
Such devices come in two structures: one that hooks onto, for example, the speaker's ear, and one that is installed at a suitable position inside the vehicle.

In the latter structure, the communication device is located away from the speaker and has difficulty picking up the voice. One conceivable approach is therefore to use a technique in which the hands-free communication device is provided with a plurality of amplifier circuits (see, for example, Patent Document 1).

Patent Document 1: JP 2002-51142 A

A speaker's voice has a waveform unique to that speaker. If the communication device could capture the features of this waveform, it should be able to identify the caller while the vehicle is travelling and convey that voice more clearly to the outside party.
The conventional technique described above, however, makes it difficult to obtain the features of the speaker's voice. Simply amplifying the voice raises every part of the waveform, large and small alike, to the maximum value, so the intonation is lost.

Thus the conventional technique still leaves unresolved the problem of obtaining the features of the speaker's voice.
An object of the present invention is therefore to solve this problem and to provide a hands-free communication device that obtains the features of a speaker's voice and can thereby identify the speaker, together with a voice correction method for the device.

To achieve this object, a first invention comprises: a sampling unit that extracts in advance a volume feature and a sound quality feature of a speaker from the speaker's voice; and a correction unit that sets in advance a correction condition for that speaker from the extracted volume feature and sound quality feature, and that also identifies a caller by comparing the extracted volume feature and sound quality feature with the caller's voice and outputs a corrected voice of the caller based on the correction condition.

According to the first invention, the hands-free communication device includes a sampling unit and a correction unit. The sampling unit can extract the volume feature and the sound quality feature from the speaker's voice in advance, so it can obtain voice features that retain intonation, like those of a voice in a quiet environment free of vehicle-related noise. Such features cannot be obtained by simply amplifying the voice as in the prior art.

The correction unit also sets in advance a correction condition for the speaker from the extracted volume feature and sound quality feature, so this correction condition is likewise derived from voice features that retain intonation.
During an actual call, the correction unit compares the extracted volume feature and sound quality feature with the caller's voice, so the caller can be identified reliably. The correction unit then outputs the caller's corrected voice based on the preset correction condition, so the outside, that is, the other party to the call, receives a voice that brings out the caller's characteristics. As a result, the reliability of the hands-free communication device is greatly improved.

A second invention is characterized in that, in the configuration of the first invention, the correction condition for the sound quality feature is set according to the main band frequency of the speaker's voice.
According to the second invention, in addition to the effects of the first invention, classifying the correction condition for the sound quality feature by whether the main band frequency is high or low, for example male versus female, allows noise suppression suited to the speaker.

A third invention is characterized in that, in the configuration of the second invention, the correction unit has a filter that cuts frequencies outside the main band.
According to the third invention, in addition to the effects of the second invention, cutting everything outside the main band with the filter leaves only the main band, so the other party to the call receives a voice that brings out the caller's sound quality feature even further.

A fourth invention is characterized in that, in the configuration of any of the first to third inventions, the device further comprises a storage unit that stores a finite number of speakers' voices.
According to the fourth invention, in addition to the effects of the first to third inventions, storing the speakers' voices in the storage unit makes it easy to confirm a speaker. Because the number stored is finite, the correction unit only has to compare a finite set of extracted volume features and sound quality features with the caller's voice, which helps improve the accuracy of identifying the caller.

A fifth invention is characterized in that, in the configuration of any of the first to third inventions, the device further comprises a storage unit that stores a finite number of extracted volume features and sound quality features.
According to the fifth invention, in addition to the effects of the first to third inventions, storing a finite number of extracted volume features and sound quality features in the storage unit means the correction unit only has to compare that finite set with the caller's voice, which likewise helps improve the accuracy of identifying the caller.

A sixth invention is a voice correction method for a hands-free communication device. The method comprises the steps of: extracting a volume feature and a sound quality feature of a speaker from the speaker's voice in an environment free of vehicle-related noise; setting a correction condition for the speaker from the extracted volume feature and sound quality feature; identifying a caller during a call by comparing the extracted volume feature and sound quality feature with the caller's voice inside the vehicle; and outputting a corrected voice of the caller based on the correction condition.

According to the sixth invention, the speaker's volume feature and sound quality feature are extracted in an environment free of vehicle-related noise, so voice features that retain intonation can be obtained. The correction condition is likewise derived from voice features that retain intonation.
During an actual call, the extracted volume feature and sound quality feature are compared with the caller's voice, so the caller can be identified reliably. Because the caller's corrected voice is then output based on the preset correction condition, a voice that brings out the caller's characteristics can be conveyed to the other party to the call.

According to the present invention, voice features with intonation equivalent to that obtained in a quiet, noise-free environment are acquired. It is therefore possible to provide a hands-free communication device that reliably identifies the speaker and conveys a voice that brings out the speaker's characteristics to the other party to the call, together with a voice correction method for the device.

FIG. 1 is an explanatory view of the interior of a vehicle equipped with the hands-free communication device of this embodiment.
FIG. 2 is a diagram explaining the relationship between the hands-free communication device of FIG. 1 and the outside.
FIG. 3 is an internal block diagram of the hands-free communication device of FIG. 1.
FIG. 4 is an operation flowchart of the hands-free communication device of FIG. 1.
FIG. 5 is an explanatory diagram of the sound quality correction conditions used by the correction unit of FIG. 2.

A preferred embodiment of the present invention is described below with reference to the drawings.
FIG. 1 shows the passenger compartment of a vehicle 1, with the driver's seat 4 shown at the near right of the figure. A passenger seat is provided to the left of the driver's seat. An instrument panel 2 is arranged in front of the seats as viewed from the seat 4, and a windshield 6 is provided in front of the panel 2, likewise as viewed from the seat 4, so that the direction of travel of the vehicle 1 can be seen from each seat.

At the driver's seat, instruments 8 are arranged on the front face of the panel 2 and display the speed, mileage, shift position, and the like of the vehicle 1. A steering wheel 10 is rotatably attached to the tip of a shaft 12 that extends from below the panel 2 toward the seat 4. A shift lever 14 extends from the peripheral wall of the shaft 12 toward the passenger seat side and can be moved up and down.

Air outlets 16 of the air conditioning system are arranged on the front face of the panel 2, spanning from the driver's seat to the passenger seat, and deliver air into the passenger compartment when the system is started.
A monitor 18 is installed on the front face of the panel 2 between the driver's seat and the passenger seat. When the navigation system is started, the monitor 18 can display maps, operation menus, and the like.

A center console 20 projects from below the monitor 18 toward the seat 4. A slot 22 of the audio system is arranged near the monitor 18 and accepts CDs, MDs, and the like.
In this embodiment, a hands-free communication device 24 is provided in the console 20 at a suitable position closer to the seat 4 than the slot 22.

The communication device 24 is embedded in the console 20 with only the area around its operation panel, microphone, and the like left exposed, and can detect the speaker's voice. Via the speaker's own mobile phone 26 shown in FIG. 2, the communication device 24 can carry a call with another mobile phone 60.
Specifically, the communication device 24 of this embodiment is configured to connect to the speaker's mobile phone 26 over a short-range wireless link. A cable connection may be used in place of the wireless link.

The mobile phone 26, in turn, is connected to the other party's mobile phone 60 over a public network such as a mobile phone network. The speaker's voice thus travels from the microphone 40 through the communication device 24, the mobile phone 26, and the mobile phone 60 to the other party, and the other party's voice travels through the mobile phone 60, the mobile phone 26, and the communication device 24 and reaches the speaker from the loudspeaker 38.

More specifically, as shown in FIG. 3, the communication device 24 of this embodiment has an antenna 30 connected to a communication unit 32. The communication unit 32 exchanges wireless call signals with the mobile phone 26. A signal received via the antenna 30 is demodulated and output from the communication unit 32 to a signal processing unit 34.

The signal processing unit 34 performs the signal processing required for communication. When it outputs the result to a loudspeaker D/A converter 36, the loudspeaker 38 sounds and the other party's voice is output into the passenger compartment.
Conversely, the voice of the speaker in the passenger compartment is picked up by the microphone 40 and input to the signal processing unit 34 through a microphone A/D converter 42. The signal processing unit 34 also suppresses ambient noise in the passenger compartment and removes echo components that leak from the loudspeaker 38 into the microphone 40. When it outputs the result to the communication unit 32, the modulated signal is transmitted from the antenna 30.

The communication device 24 of this embodiment further includes a control unit (sampling unit) 44, a storage unit 46, and a correction unit 48.
The control unit 44 is configured to extract in advance the voice features of a speaker in the passenger compartment from that speaker's voice. In response to a mode signal from a host CPU (not shown), it extracts the speaker's volume feature and sound quality feature from the speaker's voice input to the signal processing unit 34, for example from its voiceprint.

The speaker's voice itself and the extracted volume feature and sound quality feature are output to the storage unit 46 and stored within the limits of its capacity. In other words, only a finite number of speakers' voices and extracted volume and sound quality features are stored.
The correction unit 48 of this embodiment is arranged downstream of the signal processing unit 34 with respect to the direction in which the speaker's voice is transmitted, and is configured to set in advance correction conditions according to the extracted volume feature and sound quality feature. The storage unit 46 can also store these correction conditions.
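The finite storage described above can be pictured with a small sketch. The Python below is illustrative only: the names `VoiceProfile` and `ProfileStore`, the field layout, and the choice to reject new speakers once the store is full are assumptions, since the text states only that storage is bounded by capacity.

```python
from dataclasses import dataclass


@dataclass
class VoiceProfile:
    """One registered speaker: extracted features plus correction conditions."""
    name: str                # hypothetical label for the speaker
    volume_feature: float    # e.g. an RMS level (assumed representation)
    quality_feature: float   # e.g. a main band frequency in Hz (assumed)
    gain: float = 1.0        # volume correction condition
    cutoff_hz: float = 0.0   # sound quality correction condition


class ProfileStore:
    """Holds at most `capacity` profiles (the 'finite number' in the text)."""

    def __init__(self, capacity):
        if capacity <= 0:
            raise ValueError("capacity must be positive")
        self.capacity = capacity
        self._profiles = {}

    def register(self, profile):
        """Store a profile; return False when the store is already full."""
        is_new = profile.name not in self._profiles
        if is_new and len(self._profiles) >= self.capacity:
            return False  # assumption: a full store rejects new speakers
        self._profiles[profile.name] = profile
        return True

    def all(self):
        """Return every stored profile, for comparison against a caller."""
        return list(self._profiles.values())
```

Keeping the set bounded means any later comparison against a caller's voice runs over a known, small number of candidates.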

The correction unit 48 can also identify the subsequent caller, that is, the owner of the mobile phone 26 who is the driver of the vehicle 1.
Specifically, in this embodiment the correction unit 48 identifies the caller by comparing the speaker's volume feature and sound quality feature extracted in advance with the voice of the caller who is actually talking with the other party.

The correction unit 48 then outputs the caller's corrected voice to the communication unit 32 based on the correction condition set for each speaker.
Alternatively, the control unit 44 may extract the voice features of the subsequent caller as well as those of the speaker, and the correction unit 48 may identify the caller by comparing the speaker's volume feature and sound quality feature extracted in advance with the caller's voice features.

FIG. 4 shows the operation flow of the communication device 24 up to the point where the speaker's voice is output to the other party. The operation of the hands-free communication device 24 configured as described above is explained below.
In step S401 of FIG. 4, a speaker carrying the mobile phone 26, for example in a bag, gets into the vehicle 1, and the communication device 24 starts up before the engine of the vehicle 1 is started, for example when the ignition key of the vehicle 1 is turned on.

Next, in step S402, the microphone 40 captures the voice of the speaker who has entered the vehicle. For this, the speaker is asked to say a short phrase into the microphone 40, for example "Hello," "Getting in," or "Yes, this is Sales Section 1," in a quiet environment free of vehicle-related noise such as road noise and wind noise, for example while the vehicle 1 is stopped.

In step S403, the host CPU determines whether the speaker is registered. If the speaker is not yet registered in the vehicle 1, that is, if the determination is YES, the flow proceeds to step S404.
In step S404, the host CPU outputs a recording mode signal to the control unit 44.

On receiving this mode signal, the control unit 44 extracts the speaker's volume feature and sound quality feature from the short phrase captured by the microphone 40. This also reveals how loud the speaker's natural voice is and the speaker's gender.
At the same time, the correction unit 48 sets a volume correction condition and a sound quality correction condition for the speaker from the extracted volume feature and sound quality feature.
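As a rough illustration of what extracting the two features from a short phrase could involve, the sketch below computes an RMS level and a pitch estimate. The text does not specify the extraction method, so using RMS as the volume feature and an autocorrelation peak as a stand-in for the main band frequency are assumptions, as are the function names, the 60 to 400 Hz search range, and the synthetic test tone.

```python
import math


def volume_feature(samples):
    """Root-mean-square level of the phrase (assumed volume feature)."""
    return math.sqrt(sum(s * s for s in samples) / len(samples))


def main_band_frequency(samples, sample_rate, fmin=60.0, fmax=400.0):
    """Estimate the dominant pitch in Hz from the autocorrelation peak."""
    min_lag = int(sample_rate / fmax)
    max_lag = int(sample_rate / fmin)
    best_lag, best_score = min_lag, float("-inf")
    for lag in range(min_lag, max_lag + 1):
        # Correlate the signal with a copy of itself shifted by `lag` samples.
        score = sum(samples[n] * samples[n + lag]
                    for n in range(len(samples) - lag))
        if score > best_score:
            best_lag, best_score = lag, score
    return sample_rate / best_lag


# Synthetic "phrase": a 120 Hz tone, 0.1 s at 8 kHz (not real speech).
rate = 8000
tone = [0.5 * math.sin(2 * math.pi * 120 * n / rate) for n in range(800)]
level = volume_feature(tone)
pitch = main_band_frequency(tone, rate)
```

Because lags are whole samples, the pitch estimate is quantized and lands near, not exactly on, 120 Hz. Real speech would need framing and voiced/unvoiced handling that this sketch omits.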

Specifically, the volume correction condition adjusts the volume to suit the speaker. For example, the amount of amplification is increased for a quiet speaker and reduced for a loud speaker so that the voice is easier to hear.
The sound quality correction condition provides noise suppression suited to the speaker. In general, as indicated by the hatching in FIG. 5, noise lies in a relatively low frequency range.

The main band frequency is said to be about 120 Hz for men and about 220 Hz for women, so a male voice (FIG. 5(a)) overlaps the noise over a larger region than a female voice (FIG. 5(b)).
The correction unit 48 of this embodiment therefore sets the sound quality correction condition according to the main band frequency of the speaker's voice, using a filter that keeps the main band frequency and cuts everything outside that band.

That is, when the speaker is male, for example, the amount of noise cut is reduced. This avoids the situation in which simply cutting the noise portion of FIG. 5(a) would also make the man's characteristics hard to bring out.
When the speaker is female, on the other hand, the amount of noise cut is increased relative to the male case. As FIG. 5(b) shows, when the main band frequency is high, the woman's characteristics can still come through even if the noise portion is simply cut.
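One way to picture a sound quality correction condition that depends on the main band frequency is a low-cut filter whose cutoff tracks the speaker's voice. The sketch below is a simplification under stated assumptions: it uses a first-order high-pass filter only, whereas the text describes cutting everything outside the main band, and the 0.75 ratio and the function names are hypothetical.

```python
import math


def noise_cutoff_hz(main_band_hz, ratio=0.75):
    """Place the low-cut frequency just below the speaker's main band."""
    return ratio * main_band_hz


def high_pass(samples, sample_rate, cutoff_hz):
    """First-order RC high-pass filter; attenuates content below cutoff_hz."""
    rc = 1.0 / (2.0 * math.pi * cutoff_hz)
    alpha = rc / (rc + 1.0 / sample_rate)
    out, prev_x, prev_y = [], 0.0, 0.0
    for x in samples:
        prev_y = alpha * (prev_y + x - prev_x)
        prev_x = x
        out.append(prev_y)
    return out


male_cutoff = noise_cutoff_hz(120.0)    # lower cutoff: less of the low band is cut
female_cutoff = noise_cutoff_hz(220.0)  # higher cutoff: more low-band noise is cut
rumble = [1.0] * 2000                   # constant offset as an extreme low-frequency input
filtered = high_pass(rumble, 8000, male_cutoff)
```

With these assumed values the male cutoff sits at 90 Hz and the female cutoff at 165 Hz, which mirrors the idea that a higher-pitched voice tolerates a larger noise cut.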

Next, in step S405, the storage unit 46 stores the speaker's voice, its voice features, and the correction conditions, and the flow proceeds to step S406.
In step S406, it is determined whether the mobile phone 26 has entered a call with a party outside the vehicle 1 over the public network. The communication device 24 remains on standby until a call is established.

When the mobile phone 26 then enters a call with a party outside the vehicle 1, that is, when the determination is YES, the flow returns to step S402 and the microphone 40 captures the voice of the caller in the passenger compartment.
In step S403, it is then determined whether this caller is registered. If the caller is already registered in the vehicle 1, the host CPU outputs a normal mode signal to the control unit 44 and the flow proceeds to step S407.

In step S407, the correction unit 48 receives the caller's voice captured by the microphone 40. To identify the caller more reliably, the control unit 44 may retrieve the voice data of all registered speakers from the storage unit 46 and output it to the correction unit 48.
The correction unit 48 then identifies the caller by comparing the speakers' volume features and sound quality features extracted in advance with the voice of the subsequent caller, and the flow proceeds to step S408.
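The comparison in step S407 is not specified in detail. A minimal sketch, assuming the caller's voice has already been reduced to a level and a main band frequency, is a nearest-match search over the stored profiles. The function name, the weighting between the two features, and the example values are all hypothetical.

```python
def identify_caller(profiles, level, main_band_hz, level_weight=100.0):
    """Return the name of the stored profile closest to the measured features.

    profiles maps a (hypothetical) speaker name to (level, main_band_hz).
    level_weight rescales the level so both features contribute comparably.
    """
    def distance(item):
        stored_level, stored_hz = item[1]
        d_level = level_weight * (stored_level - level)
        d_hz = stored_hz - main_band_hz
        return d_level ** 2 + d_hz ** 2

    name, _ = min(profiles.items(), key=distance)
    return name


registered = {"driver_a": (0.10, 118.0), "driver_b": (0.30, 225.0)}
caller = identify_caller(registered, level=0.12, main_band_hz=124.0)
```

A practical device would also need a rejection threshold for voices that match no registered speaker. That case is left out here because the text does not describe it.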

In step S408, the correction unit 48 outputs the caller's corrected voice to the communication unit 32 based on the correction conditions that were set.
More specifically, the caller's volume is corrected according to the volume correction condition described above: the amount of amplification is increased if the caller is a quiet speaker, and reduced so that the voice is easier to hear if the caller is a loud speaker.
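The volume correction just described can be sketched as a per-speaker gain derived from the level measured at registration. The target level, the gain limits, and the function names below are assumptions chosen for illustration and do not come from the text.

```python
def volume_gain(measured_level, target_level=0.25, min_gain=0.5, max_gain=8.0):
    """Gain that moves a speaker's typical level toward the target level."""
    if measured_level <= 0:
        return max_gain  # silent sample: fall back to the largest gain
    gain = target_level / measured_level
    return max(min_gain, min(max_gain, gain))


def apply_gain(samples, gain):
    """Scale samples linearly, so relative dynamics (intonation) are kept."""
    return [s * gain for s in samples]


quiet_gain = volume_gain(0.05)  # quiet speaker: amplification is increased
loud_gain = volume_gain(0.80)   # loud speaker: amplification is reduced
```

Because one fixed gain is applied to the whole utterance, loud and soft passages keep their ratio, in contrast to amplification that pushes every part of the waveform to the maximum.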

The caller's sound quality is likewise corrected according to the sound quality correction condition described above: the amount of noise cut is reduced if the caller is a male speaker and increased if the caller is a female speaker.
Then, in step S409, the communication unit 32 modulates the signal from the correction unit 48 and outputs it to the antenna 30, and the routine ends.
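The branching in steps S403 and S406 can be condensed into a small decision function. This is a sketch of the control flow as described, not the device's firmware: the function name and the string labels are hypothetical, and how the host CPU decides whether a speaker is registered is not stated in the text.

```python
def next_action(registered, in_call):
    """Map the two decisions in the flow (S403, S406) to the device's action."""
    if not registered:
        return "record"   # S404 to S405: extract features, set and store conditions
    if not in_call:
        return "standby"  # S406: wait until the phone enters a call
    return "correct"      # S407 to S409: identify caller, correct, transmit


actions = [next_action(False, False), next_action(True, False), next_action(True, True)]
```

The three calls walk through a typical session: registration on entering the vehicle, waiting for a call, then correction once the call is established.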

As described above, the hands-free communication device 24 of this embodiment includes the control unit 44 and the correction unit 48. The control unit 44 can extract the volume feature and the sound quality feature from the speaker's voice in advance, so it can obtain voice features that retain intonation, like those of a voice in a quiet environment free of noise related to the vehicle 1. Such features cannot be obtained by simply amplifying the voice as in the prior art.

The correction unit 48 also sets in advance a correction condition for the speaker from the extracted volume feature and sound quality feature, so this correction condition is likewise derived from voice features that retain intonation.
During an actual call, the correction unit 48 compares the extracted volume feature and sound quality feature with the voice of the caller in the vehicle 1, so the caller can be identified reliably. The correction unit 48 then outputs the caller's corrected voice to the communication unit 32 based on the preset correction condition, so the outside, that is, the other party to the call, receives a voice that brings out the caller's characteristics.

As a result, once a speaker is registered in the vehicle 1, the speaker only needs to say a specific short phrase into the microphone 40 beforehand in a quiet environment, for example while the vehicle 1 is stopped. Without any further operation of the hands-free communication device 24, the device then outputs a voice that brings out the subsequent caller's characteristics. This makes the communication device 24 more convenient and greatly improves its reliability.

Classifying the sound quality correction condition by whether the main band frequency is high or low, for example male versus female, allows noise suppression suited to the speaker.
Cutting everything outside the main band with the filter leaves only the main band, so the other party to the call receives a voice that brings out the caller's sound quality feature even further.
The volume correction condition is set according to the volume of the voice. For example, increasing the amount of amplification for a quiet voice and reducing it for a loud voice so that it is easier to hear adjusts the volume to suit the speaker.

In addition, if the storage unit 46 stores the speaker's voice, the speaker can be confirmed easily. Moreover, because the number of stored voices is finite, the correction unit 48 need only compare a finite number of extracted volume features and sound quality features with the subsequent caller's voice, which contributes to improved accuracy in identifying the caller.
Further, if the storage unit 46 stores a finite number of extracted volume features and sound quality features, the correction unit 48 likewise need only compare that finite set with the subsequent caller's voice, which also contributes to improved identification accuracy.
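The comparison against a finite set of stored features might look like the sketch below. The specification does not state how the comparison is computed, so the relative-difference distance, the `max_distance` threshold, and the names `SpeakerProfile` and `identify_caller` are all assumptions introduced for illustration.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class SpeakerProfile:
    name: str            # label of the registered speaker
    main_band_hz: float  # stored sound quality feature (assumed form)
    rms_level: float     # stored volume feature (assumed form)


def identify_caller(profiles: List[SpeakerProfile], main_band_hz: float,
                    rms_level: float,
                    max_distance: float = 0.5) -> Optional[SpeakerProfile]:
    """Return the closest stored profile, or None if none is close enough."""
    best = None
    best_distance = max_distance
    for p in profiles:
        # Assumed metric: sum of relative differences of the two features.
        d = (abs(main_band_hz - p.main_band_hz) / p.main_band_hz
             + abs(rms_level - p.rms_level) / p.rms_level)
        if d < best_distance:
            best, best_distance = p, d
    return best
```

Because the list of profiles is finite and short, one pass over it is enough, and a caller whose features match no stored profile is simply left unidentified.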

The present invention is not limited to the above embodiment, and various modifications can be made without departing from the scope of the claims.
For example, in the above embodiment the control unit 44 and the correction unit 48 are provided separately, but the invention is not necessarily limited to this form; the control unit and the correction unit may be provided in the same circuit.

The hands-free call device of the present invention may also be installed on the instrument panel 2 or on a front pillar instead of the center console 20.
In any of these cases, as above, the effect is obtained that the characteristics of the speaker's voice are acquired and the speaker can be identified.

1 Vehicle
24 Hands-free call device
44 Control unit (sampling unit)
46 Storage unit
48 Correction unit

Claims

A hands-free call device comprising:
a sampling unit that extracts in advance a volume feature and a sound quality feature of a speaker from the speaker's voice; and
a correction unit that sets in advance a correction condition corresponding to the speaker from the extracted volume feature and sound quality feature, and that identifies a caller by comparing the extracted volume feature and sound quality feature with the caller's voice and outputs a corrected voice of the caller based on the correction condition.

The hands-free call device according to claim 1, wherein the correction condition for the sound quality feature is set according to a main band frequency of the speaker's voice.

The hands-free call device according to claim 2, wherein the correction unit has a filter that cuts components outside the main band.

The hands-free call device according to any one of claims 1 to 3, further comprising a storage unit that stores a finite number of voices of the speaker.

The hands-free call device according to any one of claims 1 to 3, further comprising a storage unit that stores a finite number of the extracted volume features and sound quality features.

A voice correction method for a hands-free call device, comprising:
extracting a volume feature and a sound quality feature of a speaker from the speaker's voice in an environment free from noise related to the vehicle;
setting a correction condition corresponding to the speaker from the extracted volume feature and sound quality feature;
comparing the extracted volume feature and sound quality feature with the voice of a caller in the vehicle during a call to identify the caller; and
outputting a corrected voice of the caller based on the correction condition.