JP2009017331A - Voice processor, voice processing method, voice processing program, and its recording medium

Info

Publication number: JP2009017331A
Application number: JP2007178036A
Authority: JP
Inventors: Iwao Ikeda; 巌池田; Toru Yamashita; 徹山下; Kazuhiko Takahashi; 和彦高橋; Tadashi Nozawa; 忠史野澤; Hiroki Tomita; 裕樹富田
Original assignee: Pioneer Electronic Corp
Current assignee: Pioneer Corp
Priority date: 2007-07-06
Filing date: 2007-07-06
Publication date: 2009-01-22

Abstract

PROBLEM TO BE SOLVED: To achieve comfortable call communication between callers.

SOLUTION: A test signal generation part 220 successively generates test voice signals, with the voice output set according to each of a plurality of candidate received-voice reproduction conditions. Each candidate is a combination of the selected speaker chosen for reproduction output of the received voice and the volume to be output from that selected speaker. The test voice is collected by a sound collection unit, and the collection result is sent to an echo detection part 250. The echo detection part 250 detects the echo amount of the test voice and sends it to an optimal condition determination part 261. From among the candidates, the optimal condition determination part 261 determines the optimal condition that minimizes the wraparound echo of the received voice from the speakers to the microphone.

COPYRIGHT: (C)2009,JPO&INPIT

Description

The present invention relates to a voice processing device, a voice processing method, a voice processing program, and a recording medium on which the voice processing program is recorded.

In recent years, calls made with mobile communication terminals such as mobile phones have become indispensable to daily life. For this reason, attention is being paid to so-called hands-free calling, which allows a call on a mobile communication terminal to be made while driving a vehicle without hindering the driving operation.

Various technologies have been proposed for in-vehicle devices capable of such hands-free calls, including one that can switch between a hands-free call and a handset call (see Patent Document 1; hereinafter referred to as "Conventional Example 1"). A device capable of hands-free calling includes a microphone that collects the transmission voice and a speaker that outputs the received voice. Compared with an ordinary telephone, such a device is prone to the echo phenomenon, in which the sound output from the speaker is picked up by the microphone.

For this reason, many in-vehicle hands-free communication devices are equipped with an echo canceller. As one such echo canceller, a technique has been proposed in which noise outside the transmission voice band is extracted from the transmission voice signal by a band elimination filter, and the echo is estimated based on a signal obtained by adding the extraction result to the received voice signal (see Patent Document 2; hereinafter referred to as "Conventional Example 2").

Patent Document 1: JP 2003-78607 A
Patent Document 2: Japanese Patent No. 2919422

In recent years, many vehicles have been equipped with an audio system that has a plurality of speakers for stereo or multi-channel audio output. When hands-free call communication is realized in such a vehicle, it is conceivable to use the plurality of speakers of that audio system.

In such a case, the simplest method is to output the received voice from all of the speakers at the same volume during a call. With this method, however, it has been difficult to sufficiently remove the wraparound echo from the plurality of speakers to the microphone using only an echo canceller such as that of Conventional Example 2 while still ensuring audibility for the user.

For this reason, there has been a strong demand for a technique that can reduce the wraparound echo of the received voice from the speakers to the microphone when a plurality of speakers are provided. Meeting this demand is one of the problems to be solved by the present invention.

The present invention has been made in view of the above circumstances, and its object is to provide a new voice processing device and voice processing method that enable comfortable call communication between callers.

The invention according to claim 1 is a voice processing device having a call communication function, comprising: sound collection means for collecting a voice uttered for a call with the outside and converting it into a transmission voice signal; a plurality of speakers that reproduce and output a received voice based on a received voice signal from the outside; storage means storing a plurality of received-voice reproduction condition candidates, each consisting of a combination of at least one selected speaker chosen from among the plurality of speakers for reproduction output of the received voice and the volume of the voice output from the selected speaker; test voice output control means for causing an internally generated test voice to be reproduced and output from the selected speaker sequentially, in accordance with each of the plurality of received-voice reproduction condition candidates; echo detection means for detecting the echo amount of the sound collected by the sound collection means as a result of the reproduction output of the test voice; optimum condition determination means for determining an optimum condition from among the plurality of received-voice reproduction condition candidates, based on the detection result of the echo detection means for each of the candidates; and condition setting means for setting the optimum condition as the received-voice reproduction condition during call communication.

The invention according to claim 6 is a voice processing method used in a voice processing device that comprises: sound collection means for collecting a voice uttered for a call with the outside and converting it into a transmission voice signal; a plurality of speakers that reproduce and output a received voice based on a received voice signal from the outside; and storage means storing a plurality of received-voice reproduction condition candidates, each consisting of a combination of at least one selected speaker chosen from among the plurality of speakers for reproduction output of the received voice and the volume of the voice output from the selected speaker. The method comprises: a test voice output step of reproducing and outputting an internally generated test voice from the selected speaker sequentially, in accordance with each of the plurality of received-voice reproduction condition candidates; an echo detection step of detecting the echo amount of the sound collected by the sound collection means as a result of the reproduction output of the test voice; an optimum condition determination step of determining an optimum condition from among the plurality of received-voice reproduction condition candidates, based on the detection result of the echo detection step for each of the candidates; and a condition setting step of setting the optimum condition as the received-voice reproduction condition during call communication.
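The method steps above can be sketched in Python as follows. This is only an illustrative outline, not the claimed implementation: the `Candidate` class, the `determine_optimum` function and the `measure_echo` callback are hypothetical names, and choosing the candidate with the smallest echo amount follows the abstract's stated aim of minimizing wraparound echo.

```python
from dataclasses import dataclass
from typing import Callable, Sequence, Tuple


@dataclass(frozen=True)
class Candidate:
    """One received-voice reproduction condition candidate (hypothetical type)."""
    speakers: Tuple[str, ...]  # selected speakers, e.g. ("L", "R")
    volume: int                # volume output from the selected speakers


def determine_optimum(
    candidates: Sequence[Candidate],
    measure_echo: Callable[[Candidate], float],
) -> Candidate:
    """Try each candidate in turn and return the one with the least echo.

    measure_echo is a stand-in for the test voice output step followed by
    the echo detection step: it outputs the test voice under the given
    candidate and returns the detected echo amount.
    """
    best = None
    best_echo = float("inf")
    for cand in candidates:
        echo = measure_echo(cand)
        if echo < best_echo:  # strict comparison: the earlier candidate wins ties
            best, best_echo = cand, echo
    if best is None:
        raise ValueError("no usable candidate")
    return best
```

The condition setting step would then apply the returned candidate for the duration of the call.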

The invention according to claim 7 is a voice processing program that causes calculation means to execute the voice processing method according to claim 6.

The invention according to claim 8 is a recording medium on which the voice processing program according to claim 7 is recorded so as to be readable by calculation means.

Hereinafter, an embodiment of the present invention will be described with reference to FIGS. 1 to 10. In the following description and drawings, the same or equivalent elements are denoted by the same reference numerals, and redundant description is omitted.

[Configuration]
FIG. 1 is a block diagram showing the schematic configuration of a voice processing device 100 according to an embodiment. In the following description, the voice processing device 100 is assumed to be mounted on a vehicle CR (see FIG. 2). The voice processing device 100 communicates wirelessly with a mobile phone device 900, and can carry out hands-free call communication with a call partner via the mobile phone device 900 and a mobile communication network.

As shown in FIG. 1, the voice processing device 100 includes a control unit 110 and a drive unit 120.

The voice processing device 100 also includes a sound output unit 130_L, a sound output unit 130_R, a sound output unit 130_SL, and a sound output unit 130_SR.

Here, the sound output unit 130_L has a left speaker 131_L (hereinafter also referred to as the "L speaker"), and the sound output unit 130_R has a right speaker 131_R (hereinafter also referred to as the "R speaker"). The sound output unit 130_SL has a surround left speaker 131_SL (hereinafter also referred to as the "SL speaker"), and the sound output unit 130_SR has a surround right speaker 131_SR (hereinafter also referred to as the "SR speaker").

Furthermore, the voice processing device 100 includes a sound collection unit 140 as sound collection means, a display unit 150, an operation input unit 160, a storage device 170 such as a hard disk device as storage means, and an antenna 180.

The elements 120 to 180 other than the control unit 110 are connected to the control unit 110.

The control unit 110 performs overall control of the voice processing device 100. Details of the control unit 110 will be described later.

When a compact disc CD on which audio content is recorded is inserted, the drive unit 120 reports this to the control unit 110. When the drive unit 120 receives an audio content reproduction command DVC from the control unit 110 while the compact disc CD is inserted, it reads the audio designated for reproduction from the compact disc CD. The result of reading the audio content is sent to the control unit 110 as content data CTD, which is an audio signal.

In addition to the speakers 131_L to 131_SR described above, each of the sound output units 130_L to 130_SR includes an amplifier that amplifies the corresponding one of the audio output signals AOS_L to AOS_SR received from the control unit 110. Under the control of the control unit 110, the sound output units 130_L to 130_SR reproduce and output the voice corresponding to the received voice signal that arrives from the call partner via the mobile communication network and then the mobile phone device 900, the voice corresponding to the test voice signal generated in the control unit 110, music, and the like.

In the present embodiment, as shown in FIG. 2, the left speaker 131_L of the sound output unit 130_L is disposed in the front door housing on the passenger seat side, and is oriented to face the passenger seat.

The right speaker 131_R of the sound output unit 130_R is disposed in the front door housing on the driver's seat side, and is oriented to face the driver's seat.

The surround left speaker 131_SL of the sound output unit 130_SL is disposed in the housing at the rear on the passenger seat side, and is oriented to face the rear seat on the passenger seat side.

The surround right speaker 131_SR of the sound output unit 130_SR is disposed in the housing at the rear on the driver's seat side, and is oriented to face the rear seat on the driver's seat side.

Returning to FIG. 1, the sound collection unit 140 includes (i) a microphone 141 that collects ambient sound and converts it into an electrical analog audio signal, (ii) an amplifier that amplifies the analog audio signal output from the microphone, and (iii) an AD converter (Analog to Digital Converter) that converts the amplified analog audio signal into a digital audio signal. As shown in FIG. 2, the microphone 141 is disposed in front of the driver's seat, and its directivity spreads radially toward the rear seats. The sound collection result of the sound collection unit 140 is reported to the control unit 110 as sound collection result data AAD.

The display unit 150 includes (i) a display device 151 such as a liquid crystal panel, an organic EL (Electro Luminescence) panel, or a PDP (Plasma Display Panel), (ii) a display controller, such as a graphic renderer, that controls the entire display unit 150 based on display control data sent from the control unit 110, and (iii) a display image memory that stores display image data. Under the control of the control unit 110, the display unit 150 displays operation guidance for audio reproduction using the drive unit 120, the telephone number of the call partner, the name of the call partner when a name is registered for that telephone number, the call duration, and the like.

The operation input unit 160 consists of a key unit provided on the main body of the voice processing device 100, a remote input device having a key unit, or the like. A touch panel provided on the display device 151 of the display unit 150 can be used as the key unit on the main body. A configuration that accepts voice input can also be adopted in place of a configuration with a key unit.

The user sets the operation of the voice processing device 100 by operating the operation input unit 160. For example, the user uses the operation input unit 160 to set the received-voice reproduction condition information 171 described later and to issue audio content reproduction commands. These inputs are sent from the operation input unit 160 to the control unit 110 as operation input data IPD.

The storage device 170 consists of a nonvolatile storage device such as a hard disk device. Various kinds of information, including the received-voice reproduction condition information 171, are stored in the storage device 170.

The received-voice reproduction condition information 171 is a set of candidate received-voice reproduction conditions (hereinafter also referred to as "presets"). Each preset is a combination of the selected speakers chosen from among the speakers 131_L to 131_SR for reproduction output of the received voice and the volume of the voice output from those selected speakers. An example of the received-voice reproduction condition information 171 is shown in FIG. 3. In FIG. 3, for example, preset (P = 1) selects the L speaker and R speaker at the front of the vehicle CR with a volume level of 6, and preset (P = 2) selects the SL speaker and SR speaker at the rear of the vehicle CR with a volume level of 6. In the present embodiment, the volume level ranges from 0 to 10.
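As a rough illustration of how such a preset table might be held in memory, the following Python sketch stores only the two presets named in the text (P = 1 and P = 2) and checks them against the stated 0 to 10 volume range. The dictionary layout and the `validate_preset` helper are assumptions made for this example; the actual layout of FIG. 3 is not reproduced here.

```python
# Speaker labels used in the text for speakers 131_L, 131_R, 131_SL, 131_SR.
SPEAKERS = ("L", "R", "SL", "SR")
MIN_LEVEL, MAX_LEVEL = 0, 10  # volume level range stated for this embodiment

# Only the two presets described in the text; further rows are not invented.
PRESETS = {
    1: {"speakers": ("L", "R"), "level": 6},    # front speakers
    2: {"speakers": ("SL", "SR"), "level": 6},  # rear speakers
}


def validate_preset(preset):
    """Raise ValueError unless the preset names known speakers and a valid level."""
    speakers, level = preset["speakers"], preset["level"]
    if not speakers:
        raise ValueError("a preset needs at least one selected speaker")
    unknown = set(speakers) - set(SPEAKERS)
    if unknown:
        raise ValueError(f"unknown speakers: {sorted(unknown)}")
    if not MIN_LEVEL <= level <= MAX_LEVEL:
        raise ValueError(f"volume level {level} is outside 0 to 10")
    return True
```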

Returning to FIG. 1, the antenna 180 receives radio signals from the mobile phone device 900 and transmits radio signals to the mobile phone device 900. The reception result of the antenna 180 is output to the control unit 110 as a received voice signal RES. The antenna 180 also receives a transmission voice signal TRS from the control unit 110 and transmits the corresponding radio signal to the mobile phone device 900.

As described above, the control unit 110 performs overall control of the voice processing device 100. As shown in FIG. 4, the control unit 110 includes a control processing unit 111, an audio processing unit 112, and a call voice processing unit 113. The control unit 110 also includes an analog conversion unit 114, an output signal selection unit 115, and a volume adjustment unit 116.

The control processing unit 111 controls the audio processing unit 112, the call voice processing unit 113, the output signal selection unit 115, and the volume adjustment unit 116 based on commands and other inputs made on the operation input unit 160. The control processing unit 111 also controls the drive unit 120 and the display unit 150. The control processing unit 111 will be described later.

The audio processing unit 112 operates in two cases: when the operation input unit 160 reports that content to be reproduced, including audio content, has been designated, and when automatic reproduction on insertion of a compact disc CD into the drive unit 120 has been enabled. In either case, following the audio content processing control command APC from the control processing unit 111, the audio processing unit 112 reads the content data CTD for the content to be reproduced from the drive unit 120, expands it, and generates a digital sound data signal. The audio processing unit 112 then analyzes the generated digital sound data signal and, according to the channel designation information it contains, separates it into the signals to be supplied to the speakers 131_L, 131_R, 131_SL, and 131_SR. The separated signals are output to the analog conversion unit 114 as channel processing signals PCD_L, PCD_R, PCD_SL, and PCD_SR.

The call voice processing unit 113 processes the call voice exchanged with the outside via the mobile phone device 900 and the mobile communication network. As shown in FIG. 5, the call voice processing unit 113 includes a communication interface unit 210, a test signal generation unit 220, and a signal switching unit 230. The call voice processing unit 113 also includes an echo cancellation unit 240 as echo cancellation means and an echo detection unit 250 as echo detection means.

The communication interface unit 210 is used for exchanging signals with the mobile phone device 900. It consists of a reception unit 211 and a transmission unit 212.

The reception unit 211 receives the received voice signal RES from the mobile phone device 900, converts it into a reception signal RED for internal processing, and outputs it to the signal switching unit 230. The reception unit 211 also detects incoming calls and call disconnection based on the received voice signal RES, and outputs the detection result to the control processing unit 111 as an incoming call indication REI.

The transmission unit 212 receives the transmission signal TRD from the echo cancellation unit 240, converts it into the transmission voice signal TRS for transmission to the mobile phone device 900, and outputs it to the mobile phone device 900.

When the test signal generation unit 220 receives from the control processing unit 111 a test signal generation command SGC instructing it to generate a test voice signal, it generates a test voice signal SGD. The generated test voice signal SGD is sent to the signal switching unit 230.

The signal switching unit 230 includes a switch element. This switch element has an A terminal and a B terminal as input terminals and a C terminal as an output terminal. The A terminal is connected to the reception unit 211 and receives the reception signal RED; the B terminal is connected to the test signal generation unit 220 and receives the test voice signal SGD. In accordance with a command RSC from the control processing unit 111, the switch element connects the A terminal to the C terminal, connects the B terminal to the C terminal, or connects neither the A terminal nor the B terminal to the C terminal. The selected signal is sent from the C terminal, as a signal RSD, to both the analog conversion unit 114 and the echo cancellation unit 240.

The echo cancellation unit 240 removes the echo sound component contained in the sound collection result data AAD during a call. As shown in FIG. 6, the echo cancellation unit 240 includes a switch element 241, an adaptive filter 242 as filter means, and a subtractor 243 as subtraction means.

The switch element 241 has an input terminal A and an output terminal B. It is provided on the signal path over which the transmission signal TRD output from the subtractor 243 is sent to the adaptive filter 242 for calculating the filter coefficients of the adaptive filter 242. The switch element 241 is turned on and off in accordance with a command ECC from the control processing unit 111.

The adaptive filter 242 consists of a coefficient update unit and a filter unit (neither is shown). Using a learning identification method such as the LMS (Least Mean Square) algorithm, the coefficient update unit repeatedly calculates the filter coefficients of the adaptive filter 242 so that the power of the transmission signal TRD output from the subtractor 243 is minimized, and sets those coefficients in the filter unit. The filter unit is, for example, an FIR filter whose impulse response is determined by the coefficients set by the coefficient update unit. The filter unit generates a pseudo echo signal ECD from the signal RSD, and the generated pseudo echo signal ECD is sent to the subtractor 243.

The subtractor 243 subtracts the pseudo echo signal ECD from the sound collection result data AAD supplied by the sound collection unit 140. The subtraction result is sent, as the transmission signal TRD, to both the transmission unit 212 and the echo detection unit 250.
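The interaction of the adaptive filter 242 and the subtractor 243 can be illustrated with a small Python sketch. It uses a normalized LMS update as one possible learning identification method; the tap count, the step size `mu`, the function name `nlms_echo_cancel`, and the 3-tap toy echo path are all assumptions chosen for this example and do not come from the source. The `adapt` flag loosely mirrors the role of the switch element 241: when it is False, the coefficients are not updated.

```python
import random


def nlms_echo_cancel(rsd, aad, taps=8, mu=0.5, eps=1e-8, adapt=True):
    """Return (TRD samples, final FIR coefficients).

    rsd: reference samples sent toward the speakers (signal RSD)
    aad: microphone samples (sound collection result data AAD)
    """
    w = [0.0] * taps  # FIR coefficients, i.e. the estimated echo path
    x = [0.0] * taps  # most recent reference samples, newest first
    trd = []
    for r, d in zip(rsd, aad):
        x = [r] + x[:-1]
        ecd = sum(wi * xi for wi, xi in zip(w, x))  # pseudo echo signal ECD
        e = d - ecd                                  # subtractor output TRD
        if adapt:
            # Normalized LMS: step scaled by the reference signal energy.
            norm = sum(xi * xi for xi in x) + eps
            w = [wi + mu * e * xi / norm for wi, xi in zip(w, x)]
        trd.append(e)
    return trd, w


# Toy run: the "cabin" is a hypothetical 3-tap echo path with no near-end talker.
random.seed(0)
path = [0.6, -0.3, 0.1]
rsd = [random.uniform(-1.0, 1.0) for _ in range(2000)]
aad = [
    sum(path[k] * rsd[n - k] for k in range(len(path)) if n - k >= 0)
    for n in range(len(rsd))
]
trd, w = nlms_echo_cancel(rsd, aad)
```

In this noise-free toy case the first three coefficients approach the simulated echo path and the residual TRD decays toward zero; a real cabin with near-end speech and noise would behave less ideally.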

Returning to FIG. 5, the echo detection unit 250 detects the echo amount of the transmission signal TRD sent from the echo cancellation unit 240. The detection result is sent from the echo detection unit 250 to the control processing unit 111 as an echo amount value EVD.
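The text does not state how the echo amount is computed. One plausible measure, shown here purely as an assumption, is the mean power of the residual signal TRD expressed in decibels; the function name `echo_amount_db` and the `floor` parameter are hypothetical.

```python
import math


def echo_amount_db(trd, floor=1e-12):
    """Mean power of the TRD samples in dB (an assumed echo metric).

    floor keeps the logarithm finite when the signal is all zeros.
    """
    if not trd:
        raise ValueError("empty signal")
    power = sum(s * s for s in trd) / len(trd)
    return 10.0 * math.log10(max(power, floor))
```

With such a measure, a lower value of the echo amount value EVD would indicate less residual echo while only the test voice is being reproduced.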

Returning to FIG. 4, the analog conversion unit 114 receives the channel processing signals PCD_L to PCD_SR, which are digital signals from the audio processing unit 112, and the signal RSD, which is a digital signal from the call voice processing unit 113. The analog conversion unit 114 converts the channel processing signals PCD_L to PCD_SR into analog signals PCS_L to PCS_SR, and converts the signal RSD into an analog signal RSS. For this purpose, the analog conversion unit 114 includes five identically configured DA (Digital to Analogue) converters, one for each of the five digital signals. The resulting analog signals PCS_L to PCS_SR and RSS are sent to the output signal selection unit 115.

The output signal selection unit 115 receives the analog signals PCS_L to PCS_SR and RSS from the analog conversion unit 114. In accordance with an output signal selection command ODS from the control processing unit 111, the output signal selection unit 115 selects whether to supply the analog signals PCS_L to PCS_SR, the analog signal RSS, or no signal to the volume adjustment unit 116. As shown in FIG. 7, the output signal selection unit 115 includes four switch elements 115_L to 115_SR for this purpose.

Each of the switch elements 115_L to 115_SR has an A terminal and a B terminal as input terminals and a C terminal as an output terminal. The A and B terminals are connected to the analog conversion unit 114, and the C terminal is connected to the volume adjustment unit 116. Each switch element receives the corresponding analog signal PCS_L to PCS_SR at its A terminal and the analog signal RSS at its B terminal. In accordance with the individual output selection commands ODS_L to ODS_SR contained in the output signal selection command ODS from the control processing unit 111, each switch element connects the A terminal to the C terminal, connects the B terminal to the C terminal, or connects the C terminal to neither the A terminal nor the B terminal. The selected signals (including no signal) are sent from the C terminals of the switch elements 115_L to 115_SR to the volume adjustment unit 116 as sound output selection signals PBS_L to PBS_SR.
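
The three-way selection made by each switch element can be illustrated with a small sketch. This is only a model for explanation, not the circuit of FIG. 7: the names `Conduct` and `select_outputs` are hypothetical, signals are represented by single numbers, and "no signal" is represented by `None`.

```python
from enum import Enum
from typing import Dict, Optional

CHANNELS = ("L", "R", "SL", "SR")  # one switch element 115_x per channel


class Conduct(Enum):
    """Hypothetical encoding of one individual output selection command ODS_x."""
    A_TO_C = "A"   # pass the content signal PCS_x
    B_TO_C = "B"   # pass the call/test signal RSS
    NONE = "open"  # C terminal connected to neither input


def select_outputs(pcs: Dict[str, float], rss: float,
                   ods: Dict[str, Conduct]) -> Dict[str, Optional[float]]:
    """Return the sound output selection signals PBS_x for every channel."""
    pbs: Dict[str, Optional[float]] = {}
    for ch in CHANNELS:
        if ods[ch] is Conduct.A_TO_C:
            pbs[ch] = pcs[ch]   # A terminal conducts to C
        elif ods[ch] is Conduct.B_TO_C:
            pbs[ch] = rss       # B terminal conducts to C
        else:
            pbs[ch] = None      # no signal is supplied
    return pbs
```

With `B_TO_C` for the L and R channels and `NONE` for SL and SR, only the front pair would carry the signal RSS, which corresponds to the FIG. 3 example used later for the first preset.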

Returning to FIG. 4, the volume adjustment unit 116 receives the sound output selection signals PBS_L to PBS_SR from the output signal selection unit 115. Using a so-called electronic volume or the like, the volume adjustment unit 116 can adjust the volume of each of the sound output selection signals PBS_L to PBS_SR independently of the others.

The volume adjustment unit 116 adjusts the volume of each of the sound output selection signals PBS_L to PBS_SR in accordance with the volume adjustment commands VLC_L to VLC_SR from the control processing unit 111. The adjusted signals are output to the sound output units 130_L to 130_SR as audio output signals AOS_L to AOS_SR.

The control processing unit 111 controls the other components described above so that the voice processing device 100 performs its functions. As shown in FIG. 8, the control processing unit 111 includes an optimum condition determination unit 261 serving as optimum condition determining means, and a control unit 262 serving as test voice output control means, condition setting means, and echo cancellation control means.

For the output of the received voice, the optimum condition determination unit 261 determines the optimum condition from among the received-voice reproduction condition candidates set in the received-voice reproduction condition information 171. The optimum condition determination unit 261 operates only in the “optimum condition determination mode” described later.

On receiving the processing command DMC from the control unit 262, the optimum condition determination unit 261 acquires the echo amount value EVD sent from the echo detection unit 250 and stores it in a memory (not shown). Echo amount values EVD arrive sequentially from the echo detection unit 250, one for each received-voice reproduction condition candidate (that is, one per preset P). From these values, the optimum condition determination unit 261 determines the value of the preset P for which the echo amount value EVD is smallest. The preset value determined in this way is sent to the control unit 262 as the value PBD.
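
Viewed on its own, this selection is an arg-min over the stored echo amount values. The following minimal sketch assumes the values have already been collected into a list ordered by preset number; the function name `best_preset` is hypothetical, and a tie is resolved in favour of the lower preset number, which the source does not specify.

```python
from typing import Sequence


def best_preset(evd_values: Sequence[float]) -> int:
    """Return the preset number P (1-based) whose echo amount value EVD is smallest.

    evd_values[0] is the value measured for preset P = 1, and so on.
    """
    if not evd_values:
        raise ValueError("at least one echo amount value is required")
    # min() over the indices keeps the first index on ties (assumed behaviour).
    best_index = min(range(len(evd_values)), key=lambda i: evd_values[i])
    return best_index + 1  # this would be reported as the value PBD
```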

The control unit 262 controls operation in the two modes of the voice processing device 100, the “content reproduction mode” and the “call mode”. The “content reproduction mode” is a mode in which audio content is read from a compact disc CD and the audio signal is reproduced. The “call mode” is a mode in which a call is made with an outside party via the mobile phone device 900 and the mobile communication network. The “call mode” has two variants: the “normal call mode”, in which an ordinary call is made, and the “optimum condition determination mode”, in which the optimum condition is determined from among the received-voice reproduction condition candidates.

The control unit 262 normally performs operation control for the “content reproduction mode”. It starts operation control for the “normal call mode” when it receives a call origination command from the operation input unit 160 or an incoming call instruction REI from the receiving unit 211. Conversely, while in the “normal call mode”, the control unit 262 returns to operation control for the “content reproduction mode” when it receives a call disconnect command from the operation input unit 160 or when the incoming call instruction REI from the receiving unit 211 ceases. Operation control for the “optimum condition determination mode” is performed only when an optimum condition determination command is received from the operation input unit 160.

Switching between the “content reproduction mode” and the “call mode” is performed by a switch operation in the output signal selection unit 115. Switching between the “normal call mode” and the “optimum condition determination mode” is performed by a switch operation in the signal switching unit 230.
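
The mode changes described above can be summarized as a small transition table. The sketch below is illustrative only: the `Mode` and `Event` names are hypothetical, and it assumes that the optimum condition determination command is accepted while the device is in the content reproduction mode, which the source does not state explicitly. Events not listed for the current mode leave the mode unchanged.

```python
from enum import Enum, auto


class Mode(Enum):
    CONTENT_REPRODUCTION = auto()
    NORMAL_CALL = auto()
    OPTIMUM_CONDITION_DETERMINATION = auto()


class Event(Enum):
    CALL_ORIGINATION = auto()      # command from operation input unit 160
    INCOMING_CALL = auto()         # incoming call instruction REI received
    CALL_DISCONNECT = auto()       # command from operation input unit 160
    INCOMING_CALL_CEASED = auto()  # REI no longer received
    DETERMINE_COMMAND = auto()     # optimum condition determination command
    DETERMINATION_DONE = auto()    # value PBD has been stored


TRANSITIONS = {
    (Mode.CONTENT_REPRODUCTION, Event.CALL_ORIGINATION): Mode.NORMAL_CALL,
    (Mode.CONTENT_REPRODUCTION, Event.INCOMING_CALL): Mode.NORMAL_CALL,
    (Mode.NORMAL_CALL, Event.CALL_DISCONNECT): Mode.CONTENT_REPRODUCTION,
    (Mode.NORMAL_CALL, Event.INCOMING_CALL_CEASED): Mode.CONTENT_REPRODUCTION,
    # Assumption: the determination command is issued from content reproduction.
    (Mode.CONTENT_REPRODUCTION, Event.DETERMINE_COMMAND):
        Mode.OPTIMUM_CONDITION_DETERMINATION,
    (Mode.OPTIMUM_CONDITION_DETERMINATION, Event.DETERMINATION_DONE):
        Mode.CONTENT_REPRODUCTION,
}


def next_mode(mode: Mode, event: Event) -> Mode:
    """Return the mode after the event; unlisted pairs keep the current mode."""
    return TRANSITIONS.get((mode, event), mode)
```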

For operation control in the “optimum condition determination mode”, the control unit 262 first acquires the received-voice reproduction conditions from the received-voice reproduction condition information 171. There are a plurality of candidate conditions (see FIG. 3), and the control unit 262 configures the voice processing device 100 for the first candidate (preset P = 1).

For this setting, the control unit 262 sends the output signal selection unit 115 a command to select the signal RSS. More specifically, it sends an output signal selection command ODS specifying that, in the switch elements corresponding to the speakers to be measured first, the B terminal is connected to the C terminal, and that, in the other switch elements, the C terminal is connected to neither the A terminal nor the B terminal. Taking FIG. 3 as an example, the control unit 262 sends an output signal selection command ODS specifying that the B and C terminals of the switch elements 115_L and 115_R, which correspond to the L and R speakers, are connected, and that the C terminals of the switch elements 115_SL and 115_SR, which correspond to the SL and SR speakers, are connected to neither the A terminal nor the B terminal.

Also for this setting, the control unit 262 sends the signal switching unit 230 a command to select the test voice signal SGD from the test signal generation unit 220, that is, a command RSC specifying that terminal B and terminal C of its switch element be connected (see FIG. 5).

Further, the control unit 262 sends the switch 241 a command not to pass the transmission signal TRD from the subtractor 243 to the adaptive filter 242, that is, a command ECC to turn the switch 241 off (see FIG. 6).

Further, the control unit 262 sends the volume adjustment unit 116 volume adjustment commands VLC_L and VLC_R to adjust the volume output from the L and R speakers to level 6.

When the voice processing device 100 has been configured for the preset (P = 1) candidate in this way, the control unit 262 sends the test signal generation unit 220 a test signal generation command SGC to generate the test voice signal SGD. At the same time as the test signal generation command SGC, the control unit 262 sends the processing command DMC to the optimum condition determination unit 261 (see FIG. 8).
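
The settings made for one preset before a measurement (the commands ODS, RSC, ECC, and VLC) can be gathered into one place as a sketch. The `Preset` type, the dictionary layout, and the string values are hypothetical encodings chosen for illustration; only the command names, the terminal letters, and the FIG. 3 example (L and R speakers at level 6) come from the description.

```python
from dataclasses import dataclass
from typing import Dict, FrozenSet

CHANNELS = ("L", "R", "SL", "SR")


@dataclass(frozen=True)
class Preset:
    """One received-voice reproduction condition candidate (hypothetical type)."""
    speakers: FrozenSet[str]  # selected speakers, e.g. {"L", "R"}
    level: int                # volume level for the selected speakers


def measurement_commands(preset: Preset) -> Dict[str, object]:
    """Build the commands issued before measuring one preset."""
    # ODS: B-C for selected speakers, C left open for all others.
    ods = {ch: ("B" if ch in preset.speakers else "open") for ch in CHANNELS}
    # VLC: one volume adjustment command per selected speaker.
    vlc = {ch: preset.level for ch in CHANNELS if ch in preset.speakers}
    return {
        "ODS": ods,
        "RSC": "B",    # signal switching unit 230 selects the test voice signal SGD
        "ECC": "off",  # switch 241 off: TRD is not fed to the adaptive filter 242
        "VLC": vlc,
    }
```

For the normal call mode the same structure would apply with `RSC` selecting terminal A (the received signal RED) and `ECC` set to on.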

On receiving the echo amount value EVD from the echo detection unit 250, the control unit 262 judges that processing for the preset (P = 1) candidate has finished, and configures the voice processing device 100 for the next candidate, preset (P = 2).

After the echo amount values EVD have been detected for all received-voice reproduction condition candidates, the control unit 262 receives from the optimum condition determination unit 261 the value PBD, which identifies the preset of the optimum condition. The value PBD is stored in a storage unit (not shown) in the control unit 262.

When the optimum condition has been determined from among the received-voice reproduction condition candidates in this way, the control unit 262 ends operation control for the “optimum condition determination mode”.

As described above, the voice processing device 100 normally performs operation control for the “content reproduction mode”, but starts operation control for the “normal call mode” when it receives a call origination command from the operation input unit 160 or an incoming call instruction REI from the receiving unit 211.

For operation control in the “normal call mode”, the control unit 262 first refers to the value PBD, the preset of the optimum condition received from the optimum condition determination unit 261, and acquires from the received-voice reproduction condition information 171 the received-voice reproduction condition that is optimal (hereinafter simply the “optimum condition”). It then configures the voice processing device 100 for this optimum condition.

For this setting, the control unit 262 sends the output signal selection unit 115 an output signal selection command ODS specifying the switch operation that allows the signal RSS to be output from the selected speakers of the optimum condition.

Also for this setting, the control unit 262 sends the signal switching unit 230 a command to select the received signal RED from the receiving unit 211, that is, a command RSC specifying that terminal A and terminal C of its switch element be connected (see FIG. 5).

Further, the control unit 262 sends the switch 241 a command to pass the transmission signal TRD from the subtractor 243 to the adaptive filter 242, that is, a command ECC to turn the switch 241 on (see FIG. 6).

Further, the control unit 262 sends the volume adjustment unit 116 volume adjustment commands VLC_L to VLC_SR to adjust the volume output from the selected speakers to the level of the optimum condition.

For operation control in the “content reproduction mode”, the control unit 262 sends the output signal selection unit 115 an output signal selection command ODS specifying that the A terminal be connected to the C terminal in all of the switch elements 115_L to 115_SR. As a result, the analog signals PCS_L to PCS_SR from the analog conversion unit 114 are supplied through the output signal selection unit 115 to the volume adjustment unit 116 as the sound output selection signals PBS_L to PBS_SR.

In the “content reproduction mode”, the control unit 262 also causes the display unit 150 to display a guidance screen that helps the user specify the audio content to be reproduced. When a reproduction command specifying audio content is input from the operation input unit 160, the control unit 262 controls the drive unit 120 to read out the data of the content to be reproduced.

In the “content reproduction mode”, the control unit 262 also controls the volume adjustment unit 116 to adjust the output volume to the speakers 131_L to 131_SR of the sound output units 130_L to 130_SR. To control the output volume, the control unit 262 generates the volume adjustment commands VLC_L to VLC_SR based on the volume specified at the operation input unit 160 and sends them to the volume adjustment unit 116.

[Operation]
Next, the operation of the voice processing device 100 configured as described above is explained, focusing mainly on the operation in the “optimum condition determination mode”.

The “optimum condition determination mode” of the voice processing device 100 starts when the user uses the operation input unit 160 to enter a command to determine the optimum condition from among the received-voice reproduction condition candidates. When this operation starts, the “optimum condition determination mode” is set up in step S11 of FIG. 9.

In step S11, the control unit 262 sends the signal switching unit 230 in the call voice processing unit 113 a command RSC to select the test voice signal SGD from the test signal generation unit 220. In accordance with this command RSC, the signal switching unit 230 selects the test voice signal SGD, which is then sent to the analog conversion unit 114 as the signal RSD.

The control unit 262 also sends the switch 241 in the echo cancellation unit 240 of the call voice processing unit 113 a command ECC not to pass the transmission signal TRD from the subtractor 243 to the adaptive filter 242. This command ECC turns the switch 241 off, so that the echo cancellation unit 240 performs no echo cancellation. When these settings are complete, the process proceeds to step S12.

In step S12, the echo amount value EVD of the test voice under the default setting is detected. In this embodiment, the default setting selects all of the speakers 131_L to 131_SR for output of the test voice and sets their volume level to 6.

In step S12, the control unit 262 first applies the default setting described above. It sends the output signal selection unit 115 an output signal selection command ODS specifying that the B terminal be connected to the C terminal in all of the switch elements 115_L to 115_SR. Next, it sends the volume adjustment unit 116 volume adjustment commands VLC_L to VLC_SR to adjust the volume output from the speakers 131_L to 131_SR to level 6. After this setting, the control unit 262 sends the test signal generation unit 220 a test signal generation command SGC to generate the test voice signal SGD.

On receiving the test signal generation command SGC, the test signal generation unit 220 generates the test voice signal SGD and sends it to the signal switching unit 230. The signal then passes through the signal switching unit 230, the analog conversion unit 114, the output signal selection unit 115, and the volume adjustment unit 116, and the test voice is output from the sound output units 130_L to 130_SR.

The test voice is picked up by the sound collection unit 140 and sent to the subtractor 243 as sound collection result data AAD (see FIG. 6). As described above, the switch 241 is off in the “optimum condition determination mode”, so the transmission signal TRD sent from the subtractor 243 to the echo detection unit 250 has not been subjected to echo cancellation. The echo detection unit 250 receives the transmission signal TRD from the subtractor 243 and detects its echo amount. The detection result is sent to the optimum condition determination unit 261 as the echo amount value EVD, and the optimum condition determination unit 261 stores this value in its internal storage unit as echo amount (A).
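
The description does not say how the echo detection unit 250 quantifies the echo amount. Purely as an assumption for illustration, the sketch below takes the mean power of a block of TRD samples, expressed in decibels, as the echo amount value EVD; the function name `echo_amount_db` and the `floor` parameter are hypothetical.

```python
import math
from typing import Sequence


def echo_amount_db(trd: Sequence[float], floor: float = 1e-12) -> float:
    """Return the mean power of the TRD samples in dB (assumed EVD measure).

    `floor` avoids log10(0) for an all-zero block.
    """
    if not trd:
        raise ValueError("at least one sample is required")
    mean_power = sum(x * x for x in trd) / len(trd)
    return 10.0 * math.log10(max(mean_power, floor))
```

Any monotonic level measure would serve the comparison in the later steps equally well, because only the ordering of the values across presets matters.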

Next, in step S13, the optimum condition for reproducing the received voice is determined. In this process, as shown in FIG. 10, the control unit 262 first acquires, in step S21, the received-voice reproduction conditions from the received-voice reproduction condition information 171 and applies the preset (P = 1). The selected speakers for the preset are set by the output signal selection command ODS sent to the output signal selection unit 115, and their volume is set by the volume adjustment commands VLC_L to VLC_SR sent to the volume adjustment unit 116. When this setting is complete, the process proceeds to step S22.

In step S22, the echo amount value EVD of the test voice under the current preset is detected. The control unit 262 sends the test signal generation unit 220 a test signal generation command SGC to generate the test voice signal SGD. The test signal generation unit 220 then generates the test voice signal SGD, which passes through the signal switching unit 230, the analog conversion unit 114, the output signal selection unit 115, and the volume adjustment unit 116 and is output as the test voice from the sound output units 130_L to 130_SR. Thereafter, as in step S12, the test voice is picked up by the sound collection unit 140 and sent to the subtractor 243 as sound collection result data AAD. The subtractor 243 sends the transmission signal TRD, which has not been subjected to echo cancellation, to the echo detection unit 250. The echo detection unit 250 receives the transmission signal TRD and detects its echo amount. The detection result is sent to the optimum condition determination unit 261 as the echo amount value EVD, and the optimum condition determination unit 261 stores this value in its internal storage unit as echo amount (B). The process then proceeds to step S23.

In step S23, the optimum condition determination unit 261 compares the numerical values of echo amount (A) and echo amount (B) and determines whether echo amount (A) is larger than echo amount (B). If the result is affirmative (step S23: Y), the process proceeds to step S24.

In step S24, the optimum condition determination unit 261 stores echo amount (B) as the new echo amount (A), so that echo amount (A) always holds the smallest value measured so far, and sets the value PBD to the value of the current preset P. When these settings are complete, the process proceeds to step S25.

If, on the other hand, the result of the determination in step S23 is negative (step S23: N), the process proceeds directly to step S25.

In step S25, the control unit 262 determines whether the test voice measurement has been completed for all presets. If the result is negative (step S25: N), the process proceeds to step S26.

In step S26, the voice processing device 100 is configured for the next preset to be measured. As in step S21, the control unit 262 makes the settings for that preset based on the received-voice reproduction conditions in the received-voice reproduction condition information 171.

When step S26 is complete, the process returns to step S22. Steps S22 to S26 are then repeated until the result of the determination in step S25 becomes affirmative.

When the measurement has been completed for all presets and the result of the determination in step S25 becomes affirmative (step S25: Y), the optimum condition determination unit 261 reports the value PBD, the preset of the optimum condition, to the control unit 262. When the control unit 262 has stored the value PBD in its storage unit, the “optimum condition determination mode” operation ends.
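
Steps S12 and S21 to S26 amount to a running-minimum search, which can be sketched as follows. The function `determine_optimum` and its `measure` callback (standing in for configuring the device, emitting the test voice, and reading back EVD) are hypothetical. The sketch returns `None` as PBD when no preset produces less echo than the default setting; the description does not say what happens in that case.

```python
from typing import Callable, Optional, Sequence, Tuple, TypeVar

Setting = TypeVar("Setting")


def determine_optimum(
    measure: Callable[[Setting], float],
    presets: Sequence[Setting],
    default: Setting,
) -> Tuple[Optional[int], float]:
    """Return (PBD, smallest echo amount) following steps S12 and S21 to S26."""
    echo_a = measure(default)             # S12: echo amount (A) under the default
    pbd: Optional[int] = None
    for p, preset in enumerate(presets, start=1):  # S21, then S26 for each next preset
        echo_b = measure(preset)          # S22: echo amount (B) for this preset
        if echo_a > echo_b:               # S23: is (A) larger than (B)?
            echo_a = echo_b               # S24: keep the smaller value as (A)
            pbd = p                       # S24: remember this preset as PBD
    return pbd, echo_a                    # S25: all presets measured
```

For example, with measured values of 0.9 for the default and 0.7, 0.4, and 0.6 for presets 1 to 3, the search settles on preset 2.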

When the “optimum condition determination mode” operation ends, the voice processing device 100 enters the “content reproduction mode”. As described above, the voice processing device 100 normally operates in the “content reproduction mode”, but enters the “normal call mode” when the control unit 262 receives a call origination command from the operation input unit 160 or an incoming call instruction REI from the receiving unit 211.

When the voice processing device 100 enters the “normal call mode”, the control unit 262 first refers to the value PBD, the preset of the optimum condition received from the optimum condition determination unit 261, and acquires the optimum received-voice reproduction condition from the received-voice reproduction condition information 171. It then sends the output signal selection unit 115 an output signal selection command ODS specifying the switch operation that allows the signal RSS to be output from the selected speakers of the optimum condition.

The control unit 262 further sends the signal switching unit 230 a command to select the received signal RED from the receiving unit 211, and sends the switch 241 a command to pass the transmission signal TRD from the subtractor 243 to the adaptive filter 242. With the switch 241 set in this way, the echo cancellation unit 240 performs echo cancellation. The control unit 262 also sends the volume adjustment unit 116 volume adjustment commands VLC_L to VLC_SR to adjust the volume output from the selected speakers to the level of the optimum condition.

In the “normal call mode”, the voice processing device 100 resumes operation in the “content reproduction mode” when it receives a call disconnect command from the operation input unit 160 or when the incoming call instruction REI from the receiving unit 211 ceases.

In the “content reproduction mode”, the control unit 262 causes the display unit 150 to display a guidance screen that helps the user specify the audio content to be reproduced. When a reproduction command specifying audio content is input to the operation input unit 160, the control unit 262 controls the drive unit 120 to read out the data of the audio content.

In the “content reproduction mode”, the control unit 262 also controls the audio processing unit 112 so that the content data CTD from the drive unit 120 is separated into the four channel processing signals PCD_L to PCD_SR.

In the “content reproduction mode”, the control unit 262 also controls the volume adjustment unit 116 to adjust the output volume from the speakers 131_L to 131_SR of the sound output units 130_L to 130_SR.

As described above, in this embodiment, four speakers 131_L to 131_SR are used to reproduce the received voice during a hands-free call in the vehicle. The user sets a plurality of received-voice reproduction condition candidates, each a combination of the speakers selected from these four for reproducing the received voice and the volume output from the selected speakers. A test voice is then generated under each candidate condition in turn, and the echo component under that condition is measured. Based on the measurement results, the optimum condition, which minimizes the echo leaking from the speaker output of the received voice into the microphone, is determined automatically. As a result, when a plurality of speakers are provided, the echo leaking from the speaker output of the received voice into the microphone can be reduced.

Therefore, according to this embodiment, the parties to a hands-free call made while the vehicle is being driven can communicate comfortably.

[Modification of Embodiment]
The present invention is not limited to the above-described embodiment, and various modifications are possible.

For example, in the above embodiment, communication between the voice processing device 100 and the mobile phone device 900 is realized by wireless communication. Alternatively, the voice processing device 100 and the mobile phone device 900 may be connected by a cable so that communication between them is wired.

In the above embodiment, the voice processing device 100 performs wireless communication with the mobile phone device 900. However, the counterpart is not limited to a mobile phone device; the voice processing device 100 may perform wireless communication with another mobile communication terminal device such as a PHS, a PDA, or a car phone.

In the above embodiment, four speakers are provided for reproducing the received voice. However, the received voice may instead be reproduced and output from two, three, or five or more speakers.

In the above embodiment, the present invention is applied to a voice processing device mounted on a vehicle. However, the present invention can also be applied to a voice processing device mounted on a moving body other than a vehicle, or to a voice processing device used, for example, in a home.

Note that part or all of the control unit 110 in the above embodiment may be configured as a computer serving as arithmetic means and including a central processing unit (CPU), a digital signal processor (DSP), a read only memory (ROM), a random access memory (RAM), and the like, and part or all of the processing in the above embodiment may be carried out by executing a program prepared in advance on that computer. This program is recorded on a computer-readable recording medium such as a hard disk, CD-ROM, or DVD, and is read from the recording medium and executed by the computer. The program may be acquired in a form recorded on a portable recording medium such as a CD-ROM or DVD, or may be acquired by delivery over a network such as the Internet.

FIG. 1 is a block diagram schematically showing the configuration of a voice processing device according to an embodiment of the present invention.
FIG. 2 is a diagram for explaining the arrangement positions of the four speakers and the microphone of FIG. 1.
FIG. 3 is a block diagram for explaining the reproduced voice condition information of FIG. 1.
FIG. 4 is a block diagram for explaining the configuration of the control unit of FIG. 1.
FIG. 5 is a block diagram for explaining the configuration of the call voice processing unit of FIG. 4.
FIG. 6 is a block diagram for explaining the configuration of the echo cancellation unit of FIG. 5.
FIG. 7 is a block diagram for explaining the configuration of the output signal selection unit of FIG. 4.
FIG. 8 is a block diagram for explaining the configuration of the control processing unit of FIG. 4.
FIG. 9 is a flowchart for explaining the process by which the device of FIG. 1 determines the optimum condition for the received voice.
FIG. 10 is a flowchart for explaining the optimum condition determination process in FIG. 9.

Explanation of symbols

100 ... Voice processing device
130 ... Storage device (storage means)
131_L to 131_SR ... Speakers
140 ... Sound collecting unit (sound collecting means)
240 ... Echo cancellation unit (echo canceling means)
242 ... Adaptive filter (filter means)
243 ... Subtractor (subtracting means)
250 ... Echo detection unit (echo detecting means)
261 ... Optimum condition determination unit (optimum condition determining means)
262 ... Control unit (test voice output control means, condition setting means, echo cancellation control means)

Claims

A voice processing device having a call communication function, comprising:
sound collecting means for collecting a voice uttered for a call with the outside and converting it into a transmission voice signal;
a plurality of speakers that reproduce and output a received voice based on a received voice signal from the outside;
storage means in which are stored a plurality of received voice reproduction condition candidates, each comprising a combination of at least one selected speaker, selected from the plurality of speakers for reproduction output when the received voice is reproduced, and the volume of the sound output from the selected speaker;
test voice output control means for sequentially reproducing and outputting an internally generated test voice from the selected speaker in accordance with each of the plurality of received voice reproduction condition candidates;
echo detecting means for detecting an echo amount of the sound collected by the sound collecting means due to the reproduction output of the test voice;
optimum condition determining means for determining an optimum condition from among the plurality of received voice reproduction condition candidates based on the detection result of the echo detecting means corresponding to each of the plurality of received voice reproduction condition candidates; and
condition setting means for setting the optimum condition as the received voice reproduction condition during call communication.

The voice processing device according to claim 1, further comprising echo canceling means for reducing an echo component in the transmission voice signal during the call communication.

The voice processing device according to claim 2, wherein the echo canceling means comprises:
filter means for generating a pseudo echo signal that simulates the echo component based on the received voice signal; and
subtracting means for generating the transmission voice signal by subtracting the pseudo echo signal from the sound collection result of the sound collecting means.

The voice processing device according to claim 2, further comprising echo cancellation control means for setting the echo canceling means so that it does not reduce the echo component while the test voice is reproduced and output.

The voice processing device according to claim 1, wherein the voice processing device is mounted on a moving body.

A voice processing method used in a voice processing device comprising: sound collecting means for collecting a voice uttered for a call with the outside and converting it into a transmission voice signal; a plurality of speakers that reproduce and output a received voice based on a received voice signal from the outside; and storage means in which are stored a plurality of received voice reproduction condition candidates, each being a combination of at least one selected speaker, selected from the plurality of speakers for reproduction output when the received voice is reproduced, and the volume of the sound output from the selected speaker, the method comprising:
a test voice output step of sequentially reproducing and outputting an internally generated test voice from the selected speaker in accordance with each of the plurality of received voice reproduction condition candidates;
an echo detection step of detecting an echo amount of the sound collected by the sound collecting means due to the reproduction output of the test voice;
an optimum condition determining step of determining an optimum condition from among the plurality of received voice reproduction condition candidates based on the detection result of the echo detection step corresponding to each of the plurality of received voice reproduction condition candidates; and
a condition setting step of setting the optimum condition as the received voice reproduction condition during call communication.

A voice processing program for causing arithmetic means to execute the voice processing method according to claim 6.

A recording medium on which the voice processing program according to claim 7 is recorded so as to be readable by arithmetic means.