JP4604836B2

JP4604836B2 - Voice processing device, communication device, and program

Info

Publication number: JP4604836B2
Application number: JP2005148631A
Authority: JP
Inventors: 智浩伊藤
Original assignee: Brother Industries Ltd
Current assignee: Brother Industries Ltd
Priority date: 2005-05-20
Filing date: 2005-05-20
Publication date: 2011-01-05
Anticipated expiration: 2025-05-20
Also published as: JP2006323308A

Abstract

<P>PROBLEM TO BE SOLVED: To provide a speech processor capable of performing signal processing to a voice signal by following the change of the voice, even if the voice whose volume level often changes. <P>SOLUTION: The speech processor equipped in a multi-function product measures the signal level Lc of the voice signal which is transferred between the multi-function product and communication destination equipment (S145), and updates a sound parameter group (S165) when the absolute value (¾Lp-Lc¾) of difference between a peak value Lp stored in an RAM and the signal level Lc measured in the processing of S145 is larger than a predetermined threshold Lh (S150:yes) and also the signal level Lc measured in the processing of S145 is larger than -30dBm (S155; yes). As a result, the optimizing of the sound volume and the cutting of noise component corresponding to the volume and the canceling of echo, etc., are applied to the voice signal according to the sound volume, and the voice signal which is transferred between the product and the communication destination equipment is corrected to the voice easy to hear. <P>COPYRIGHT: (C)2007,JPO&INPIT

Description

本発明は、通信先機器との間で伝送される音声信号に対して、音量や音質などを変更するための信号処理を施す音声処理装置と、この音声処理装置を備えた通話装置、および、コンピュータを音声処理装置として機能させるためのプログラムに関する。 The present invention relates to a voice processing device that performs signal processing for changing a volume, a sound quality, and the like on a voice signal transmitted to a communication destination device, a communication device including the voice processing device, and The present invention relates to a program for causing a computer to function as an audio processing device.

従来、通信先機器との間で伝送される音声を対象にして、音量の変更を実施する装置としては、例えば、下記特許文献１に記載のものが知られている。
この特許文献１に記載の装置は、音声データの回帰性の高さに基づいて音声であるか否かを判断し、音声である場合には、数百ミリ秒程度の期間内に含まれる複数フレームを対象にしてフレームエネルギーの積分を行って、この積分値に基づいて音量レベルの判定を行っていた。
特開平７−１７７０８５号公報 2. Description of the Related Art Conventionally, for example, a device described in Patent Document 1 below is known as a device for changing a volume for audio transmitted to and from a communication destination device.
The device described in Patent Document 1 determines whether or not the sound is based on the high regressibility of the sound data, and in the case of the sound, a plurality of devices included in a period of about several hundred milliseconds are included. The frame energy is integrated for the frame, and the volume level is determined based on the integrated value.
JP-A-7-177085

しかし、上記特許文献１に記載の技術では、数百ミリ秒程度の期間が経過した後でないと音量レベルの変化を検出することができず、実際に音量レベルが変化するタイミングと、その変化を検出して音量を変更するタイミングとの間には、少なくとも数百ミリ秒程度のずれが生じていた。そのため、例えば、抑揚のある音声など、音量がしばしば変化するような場合には、実際の音量の変化に音量の変更制御がうまく追従できず、必ずしも聞き取りやすい音量にならないことがあった。 However, in the technique described in Patent Document 1, a change in volume level cannot be detected unless a period of about several hundred milliseconds has elapsed, and the timing at which the volume level actually changes and the change are detected. There was a difference of at least several hundred milliseconds between the detection and the timing of changing the volume. For this reason, for example, when the volume often changes, such as an inflection sound, the volume change control cannot follow the actual volume change well, and the volume may not always be easy to hear.

また、上記特許文献１に記載の技術の場合、きわめて音量が小さい音声であっても、音声であると判断した場合には音量の変更制御を行ってしまうため、例えば、通話相手の周囲から拾った音声など、通常であれば聞こえなくても構わないようなごく僅かな音量の音声まで音量変更の対象となってしまうことがあった。 Further, in the case of the technique described in Patent Document 1, even if the sound is extremely low, if the sound is determined to be sound, control of changing the sound volume is performed. In some cases, the sound volume may be changed even to a very low volume sound that may not normally be heard.

本発明は、上記諸問題を解決するためになされたものであり、その目的は、ある程度以上の音量を持つ音声を対象にして、その音量の変化に追従して音声信号に対する信号処理を施すことができる音声処理装置を提供することにある。さらに、上記のような音声処理装置を備えた通話装置、および、コンピュータを上記のような音声処理装置として機能させるためのプログラムを提供することにある。
The present invention has been made to solve the above problems, its object is directed to a voice with a certain level of volume, performing signal processing on the audio signal following the change in the volume An object of the present invention is to provide a voice processing apparatus capable of It is another object of the present invention to provide a communication device including the above-described voice processing device and a program for causing a computer to function as the above-described voice processing device.

以下、本発明において採用した特徴的構成について説明する。
本発明の音声処理装置は、通信先機器から送信されてくる音声信号の音量レベルを測定する測定手段と、前記測定手段によって測定された前記音量レベルのピーク値を記憶可能なピーク値記憶手段と、前記測定手段によって測定された前記音量レベルのピーク値が、あらかじめ定められた下限値より大きい場合に、前記ピーク値記憶手段に記憶された前記ピーク値の更新を行うと判定する第１判定手段と、前記ピーク値記憶手段に記憶された前記ピーク値と、前記測定手段によって測定された前記音量レベルのピーク値に、あらかじめ定められた大きさ以上の差がある場合に、前記ピーク値記憶手段に記憶された前記ピーク値の更新を行うと判定する第２判定手段と、前記第１判定手段および前記第２判定手段によって前記ピーク値の更新を行うと判定された場合に、前記ピーク値記憶手段に記憶された前記ピーク値を、前記測定手段によって測定された前記ピーク値で更新する更新手段と、前記更新手段により更新された前記ピーク値記憶手段に記憶された前記音量レベルのピーク値に基づいて、前記通信先機器との間で伝送される音声信号に対し、前記ピーク値に応じて決まる信号処理として、音量レベルを前記ピーク値に応じて決まるレベルに変更する処理、および、エコーキャンセルのレベルを前記ピーク値に応じて決まるレベルに変更する処理のうち、少なくとも１つの処理を施す信号処理手段とを備えたことを特徴とする。
The characteristic configuration employed in the present invention will be described below.
The audio processing apparatus of the present invention includes a measuring unit that measures a volume level of an audio signal transmitted from a communication destination device, and a peak value storage unit that can store a peak value of the volume level measured by the measuring unit. First determination means for determining to update the peak value stored in the peak value storage means when the peak value of the volume level measured by the measurement means is greater than a predetermined lower limit value The peak value stored in the peak value storage means and the peak value of the volume level measured by the measurement means when there is a difference of a predetermined magnitude or more, the peak value storage means a second judging means judges that updating of said stored peak value, updating said peak value by the first determination unit and the second judging means An update means for updating the peak value stored in the peak value storage means with the peak value measured by the measurement means, and the peak value storage means updated by the update means, As a signal processing determined according to the peak value for the audio signal transmitted to the communication destination device based on the peak value of the volume level stored in the volume level, the volume level is determined according to the peak value. Signal processing means for performing at least one of a process for changing to a level determined and a process for changing the level of echo cancellation to a level determined in accordance with the peak value is provided.

この音声処理装置において、信号処理手段は、通信先機器との間で伝送される音声信号に対して信号処理を施す際に、測定手段によって測定された音量レベルのピーク値に基づいて、ピーク値に応じて決まる信号処理を施す。このような音量レベルのピーク値は、所定の期間にわたる積分値等とは異なり、ほぼ瞬時に測定可能である。 In this audio processing device, the signal processing means performs peak processing based on the peak value of the volume level measured by the measurement means when performing signal processing on the audio signal transmitted to the communication destination device. The signal processing determined according to is performed. Such a peak value of the sound volume level can be measured almost instantaneously, unlike an integrated value over a predetermined period.

したがって、この音声処理装置であれば、例えば、抑揚のある音声など、音量がしばしば変化するような場合にも、実際の音量の変化に遅れることなく信号処理を施すことができ、より聞き取りやすい音量や音質に補正することができる。
また、第１判定手段は、測定手段によって測定された前記音量レベルのピーク値が、あらかじめ定められた下限値より大きい場合に、ピーク値記憶手段に記憶された前記ピーク値の更新を行うと判定する。
そのため、このように構成された音声処理装置によれば、測定手段によって測定された音量レベルのピーク値に変化があったとしても、第１判定手段がピーク値記憶手段に記憶されたピーク値の更新を行うと判定した場合しか、更新手段がピーク値記憶手段に記憶されたピーク値を更新せず、特に、測定手段によって測定された音量レベルのピーク値が、あらかじめ定められた下限値以下であれば、ピーク値記憶手段に記憶されたピーク値は更新されないことになる。
したがって、ピーク値が極端に小さくなったときには、信号処理手段による信号処理の内容は変化しないので、瞬間的な無音状態に合わせて音量の増大が図られるようなことがなく、利用者が聴覚上の不自然さを感じるのを防ぐことができる。
Therefore, with this audio processing device, for example, even when the volume changes frequently, such as inflected audio, signal processing can be performed without delaying the actual change in volume, making the volume easier to hear. And sound quality can be corrected.
Further, the first determination unit determines to update the peak value stored in the peak value storage unit when the peak value of the volume level measured by the measurement unit is larger than a predetermined lower limit value. To do.
Therefore, according to the sound processing device configured as described above, even if there is a change in the peak value of the volume level measured by the measuring unit, the first determination unit stores the peak value stored in the peak value storage unit. Only when it is determined to update, the updating means does not update the peak value stored in the peak value storing means, and in particular, the peak value of the volume level measured by the measuring means is below a predetermined lower limit value. If there is, the peak value stored in the peak value storage means is not updated.
Therefore, when the peak value becomes extremely small, the content of the signal processing by the signal processing means does not change, so that the volume is not increased in accordance with the instantaneous silent state, and the user can not hear it. You can prevent the feeling of unnaturalness.

また、本発明の音声処理装置において、前記信号処理手段は、前記ピーク値に応じて決まる信号処理として、音量レベルを前記ピーク値に応じて決まるレベルに変更する処理、および、エコーキャンセルのレベルを前記ピーク値に応じて決まるレベルに変更する処理のうち、少なくとも１つの処理を、前記通信先機器との間で伝送される音声信号に対して施すように構成されている。
In the audio processing device according to the present invention, the signal processing means may change a volume level to a level determined according to the peak value, and an echo cancellation level as signal processing determined according to the peak value. of the process of changing a level determined in accordance with the peak value, that is configured to at least one treatment, applied to the audio signals transmitted with the communication destination device.

音量レベルを前記ピーク値に応じて決まるレベルに変更する処理を行えば、例えば、ピーク値が大きい場合ほど音量を低減したり、ピーク値が小さい場合ほど音量を増大させたりすることで、音声を聞き取りやすくすることができる。 If the process of changing the volume level to a level determined according to the peak value is performed, for example, the volume is reduced as the peak value is larger, or the volume is increased as the peak value is smaller. It can make it easy to hear.

また、エコーキャンセル処理については、ピーク値が大きい場合ほどエコー（自分の声のはね返り音）も共に増大する傾向があるので、ピーク値が大きい場合にはエコーキャンセルがより強く効く一方、ピーク値が小さい場合にはエコーキャンセルがより弱く効くように、エコーキャンセルのレベル（設定）を変更することで、音声を聞き取りやすくすることができる。ちなみに、エコーが過剰に大きい場合は、通話相手の音声が聞き取りにくくなる傾向があるものの、エコーを完全に消失させても違和感があり、必ずしも聞きやすく話しやすい通話環境にはならない傾向がある。つまり、エコーキャンセルの効き具合は弱すぎても問題であるが、エコーキャンセルの効き具合が強すぎることが理想的な訳ではない。この点、上記のようにピーク値に応じてエコーキャンセルのレベルを変更すれば、音量が大きいときにエコーまで過剰に大きくなることが無く、音量が小さいときにエコーが過剰に小さくなることも無い。 As for the echo cancellation processing, the echo (the bounce sound of your voice) tends to increase as the peak value increases, so if the peak value is large, the echo cancellation works more strongly, while the peak value is By changing the echo cancellation level (setting) so that the echo cancellation works more weakly when it is small, it is possible to make it easier to hear the voice. By the way, if the echo is excessively large, the voice of the other party tends to be difficult to hear, but even if the echo is completely lost, there is a sense of incongruity, and there is a tendency that it is not always easy to hear and speak. That is, it is a problem if the echo cancellation is too weak, but it is not ideal that the echo cancellation is too strong. In this regard, if the echo cancellation level is changed according to the peak value as described above, the echo will not be excessively increased when the volume is high, and the echo will not be excessively decreased when the volume is low. .

これらの処理は一方を採用するだけでも相応の効果を得られるが、両方とも採用すればさらに効果的である。
These treatments can achieve a reasonable effect by adopting only one, but are more effective if both are adopted .

また、本発明の音声処理装置において、第２判定手段は、前記ピーク値記憶手段に記憶された前記ピーク値と、前記測定手段によって測定された前記音量レベルのピーク値に、あらかじめ定められた大きさ以上の差がある場合に、前記ピーク値記憶手段に記憶された前記ピーク値の更新を行うと判定し、前記更新手段は、前記第２判定手段によって前記ピーク値の更新を行うと判定された場合に、前記ピーク値記憶手段に記憶された前記ピーク値を、前記測定手段によって測定された前記ピーク値で更新する。
In the audio processing device of the present invention , the second determination means has a predetermined magnitude between the peak value stored in the peak value storage means and the peak value of the volume level measured by the measurement means. If there is a difference greater than or equal to this, it is determined to update the peak value stored in the peak value storage means, and the update means is determined to update the peak value by the second determination means. when the, the peak value stored in the peak value storage unit, to update with the peak value measured by said measuring means.

したがって、ピーク値が僅かに変化した程度では、信号処理手段による信号処理の内容は変化しないので、利用者が聴覚上の不自然さを感じるのを防ぐことができる。 Therefore , since the content of the signal processing by the signal processing means does not change when the peak value is slightly changed, it is possible to prevent the user from feeling unnatural on hearing.

なお、以上説明した本発明の音声処理装置は、電話機やパーソナルコンピュータ等を利用して構成されるＩＰ電話装置などの通話装置に適用すると好適である。
また、電話機に内蔵されるコンピュータやパーソナルコンピュータを、上述した音声処理装置が備える各手段として機能させるためのプログラムがあれば、それらのコンピュータを利用して本発明の音声処理装置を構成することができる。 The above-described voice processing apparatus of the present invention is preferably applied to a call device such as an IP phone device configured using a telephone, a personal computer, or the like.
Further, if there is a program for causing a computer or personal computer incorporated in the telephone to function as each means included in the above-described voice processing apparatus, the voice processing apparatus of the present invention can be configured using those computers. it can.

次に、本発明の実施形態について一例を挙げて説明する。
以下に説明する実施形態は、本発明に係る音声処理装置を、ファクシミリ機能、電話機能、プリンタ機能、スキャナ機能、およびコピー機能等を兼ね備えた複合機（一般にＭＦＰ（Multi-Function Product）等と呼ばれる装置）に適用したものである。本発明に係る音声処理装置は、複合機の電話機能を利用したときに通信先機器との間で伝送される音声信号を対象にして、後述する信号処理を行うことになる。 Next, an embodiment of the present invention will be described with an example.
In the embodiments described below, the voice processing apparatus according to the present invention is called a multi-function product (generally MFP (Multi-Function Product)) having a facsimile function, a telephone function, a printer function, a scanner function, and a copy function. Device). The voice processing apparatus according to the present invention performs signal processing, which will be described later, on a voice signal transmitted to a communication destination device when the telephone function of the multifunction machine is used.

［複合機の構成］
まず、複合機１の構成について説明する。
複合機１には、図１に示すように、ＣＰＵ１１、ＲＯＭ１２、ＥＥＰＲＯＭ１３、ＲＡＭ１４、画像メモリ１５、回線Ｉ／Ｆ部１９、モデム２０、バッファ２１、スキャナ２２、符号化部２３、復号化部２４、プリンタ２５、操作パネル４、ＬＣＤ（液晶表示パネル）５、アンプ２７、通話用のハンドセット４７などが設けられており、これらがバスライン３０を介して互いに接続されている。 [Configuration of MFP]
First, the configuration of the multifunction machine 1 will be described.
As shown in FIG. 1, the multifunction device 1 includes a CPU 11, a ROM 12, an EEPROM 13, a RAM 14, an image memory 15, a line I / F unit 19, a modem 20, a buffer 21, a scanner 22, an encoding unit 23, and a decoding unit 24. , A printer 25, an operation panel 4, an LCD (liquid crystal display panel) 5, an amplifier 27, a telephone handset 47, and the like, which are connected to each other via a bus line 30.

ＣＰＵ１１は、バスライン３０により接続された各部を制御する装置であり、例えば、回線Ｉ／Ｆ部１９を介して音声や画像データの送受信を行うための処理等を実行する。
ＲＯＭ１２は、この複合機１で実行される制御プログラム等を格納した書き換え不能な記憶装置であり、後述する処理をＣＰＵ１１に実行させるためのプログラムは、このＲＯＭ１２に格納されている。また、ＲＯＭ１２には、ハンドセット音響パラメータテーブル１２ａ（詳細は後述）も格納されている。 The CPU 11 is a device that controls each unit connected by the bus line 30 and executes, for example, processing for transmitting and receiving voice and image data via the line I / F unit 19.
The ROM 12 is a non-rewritable storage device that stores a control program executed by the multifunction device 1. A program for causing the CPU 11 to execute processing to be described later is stored in the ROM 12. The ROM 12 also stores a handset acoustic parameter table 12a (details will be described later).

ＥＥＰＲＯＭ１３は、複合機１の電源が遮断された後も書き込まれたデータを保持することができる書き換え可能な不揮発性の記憶装置である。
ＲＡＭ１４は、複合機１の各動作の実行時に各種のデータや着信履歴等を記憶するための読み込みおよび書き換えが可能な記憶装置であり、後述する処理の中で書き込みおよび更新がなされるハンドセット音響パラメータ１４ａ、およびピーク音声レベル１４ｂなどは、このＲＡＭ１４に格納されている。 The EEPROM 13 is a rewritable nonvolatile storage device that can retain written data even after the power of the multifunction device 1 is shut off.
The RAM 14 is a storage device that can be read and rewritten to store various data, incoming call history, and the like when each operation of the multifunction device 1 is executed, and is a handset acoustic parameter that is written and updated during processing to be described later. 14a, the peak audio level 14b, and the like are stored in the RAM 14.

画像メモリ１５は、画像データ及び印刷のためのビットイメージを記憶するための記憶装置であり、安価な大容量メモリであるダイナミックＲＡＭ（ＤＲＡＭ）により構成されている。そして、受信された画像データは、一旦画像メモリ１５に記憶され、プリンタ２５によって記録紙に印刷された後に、この画像メモリ１５から消去される。また、スキャナ２２によって読み取られた画像データも、この画像メモリ１５に記憶される。 The image memory 15 is a storage device for storing image data and a bit image for printing, and includes a dynamic RAM (DRAM) that is an inexpensive large-capacity memory. The received image data is temporarily stored in the image memory 15, printed on recording paper by the printer 25, and then erased from the image memory 15. The image data read by the scanner 22 is also stored in the image memory 15.

回線Ｉ／Ｆ部１９は、回線制御を行うためのものであり、電話回線側（例えば、交換機あるいはＩＰ電話アダプタ）から送られてくる呼出信号（リング信号）や相手側装置の電話番号等の発信元識別情報（以下、発信元識別情報をＣａｌｌｅｒＩＤという。）を示す信号等の各種信号を受信するとともに、操作パネル４上のキーの操作に応じた発信時のダイヤル信号を電話回線側へ送信する。 The line I / F unit 19 is used for line control, such as a call signal (ring signal) sent from a telephone line side (for example, an exchange or an IP telephone adapter), a telephone number of a counterpart device, etc. Various signals such as a signal indicating the sender identification information (hereinafter referred to as “CallerID”) are received and a dial signal at the time of outgoing call according to the operation of the key on the operation panel 4 is transmitted to the telephone line side. To do.

モデム２０は、画情報及び通信データを変調及び復調して伝送するとともに、伝送制御用の各種手順信号を送受信するためのものである。
バッファ２１は、相手側装置との間で送受信される符号化された画情報を含むデータを一時的に記憶するためのものである。 The modem 20 is used to modulate and demodulate and transmit image information and communication data, and to transmit and receive various procedure signals for transmission control.
The buffer 21 is for temporarily storing data including encoded image information transmitted / received to / from the counterpart device.

スキャナ２２は、原稿挿入口に挿入された原稿を画像データとして読み取るためのものであり、原稿搬送用モータを備えている。
符号化部２３は、スキャナ２２により読み取られた画像データの符号化を行うものである。 The scanner 22 is for reading a document inserted into the document insertion slot as image data, and includes a document transport motor.
The encoding unit 23 encodes image data read by the scanner 22.

復号化部２４は、バッファ２１又は画像メモリ１５に記憶された画像データを読み出して、これを復号化するものであり、復号化されたデータは、プリンタ２５により記録紙に印刷される。 The decoding unit 24 reads image data stored in the buffer 21 or the image memory 15 and decodes it. The decoded data is printed on a recording sheet by the printer 25.

プリンタ２５は、記録紙を搬送する記録紙用搬送モータ、印字ヘッドを搭載したキャリッジを移動させるキャリッジモータ、及び記録紙へインクを吐出する印字ヘッド等を備えた、周知のインクジェット方式のプリンタで構成されている。 The printer 25 is a well-known inkjet printer including a recording paper transport motor that transports recording paper, a carriage motor that moves a carriage on which the print head is mounted, and a print head that ejects ink onto the recording paper. Has been.

ＬＣＤ５は、ＣＰＵ１１から指令信号に基づいて、様々な情報を文字又は画像を通じてユーザに向けて発するものであり、その情報としては、例えば、ＦＡＸデータを送受信中である旨、又は発信元の電話番号もしくはファクシミリ番号や発信元の名前もしくは名称等の発信元の情報等が挙げられる。 The LCD 5 emits various information to the user through characters or images based on a command signal from the CPU 11. The information includes, for example, the fact that FAX data is being transmitted / received or the telephone number of the caller Or the information of the sender such as a facsimile number and the name or name of the sender can be used.

アンプ２７は、そのアンプ２７に接続されたスピーカ２８を鳴動して、呼出音や音声を出力するためのものである。
ハンドセット４７は、通信先機器から送信されてくる音声信号を再生するスピーカと通話者が発する音声を入力するマイクロフォン等とが一体化された通話用の送受話器である。 The amplifier 27 is used to ring a speaker 28 connected to the amplifier 27 and output a ringing tone or voice.
The handset 47 is a handset for a call in which a speaker that reproduces a sound signal transmitted from a communication destination device and a microphone that inputs sound emitted from a caller are integrated.

フックスイッチ４８は、利用者がハンドセット４７を持ち上げる操作（フックアップ）を行ったときにオンになり、ハンドセット４７を元の位置に戻す操作（フックダウン）を行ったときにオフになるスイッチである。フックスイッチ４８のオン／オフはＣＰＵ１１によって監視され、フックスイッチ４８がオンになるとＣＰＵ１１は回線を閉結し、フックスイッチ４８がオフになるとＣＰＵ１１は回線を開放する。また、このフックスイッチ４８は、操作パネル４での操作によってもオン／オフを切り替え可能になっており、操作パネル４を利用すれば、ハンドセット４７を持ち上げる操作（フックアップ）を実際に行わなくても、複合機１の状態をフックアップ／フックダウンが行われた場合と同等な状態に切り替えることができる。 The hook switch 48 is turned on when the user performs an operation (hook up) for lifting the handset 47, and is turned off when an operation for returning the handset 47 to the original position (hook down) is performed. . On / off of the hook switch 48 is monitored by the CPU 11, and when the hook switch 48 is turned on, the CPU 11 closes the line, and when the hook switch 48 is turned off, the CPU 11 opens the line. Further, the hook switch 48 can be switched on / off by an operation on the operation panel 4. If the operation panel 4 is used, an operation (hook up) for lifting the handset 47 is not actually performed. In addition, the state of the multifunction device 1 can be switched to a state equivalent to the case where hookup / hookdown is performed.

［ハンドセット音響パラメータテーブル］
次に、ＲＯＭ１２に格納されたハンドセット音響パラメータテーブル１２ａについて説明する。 [Handset acoustic parameter table]
Next, the handset acoustic parameter table 12a stored in the ROM 12 will be described.

ハンドセット音響パラメータテーブル１２ａは、本実施形態においては、図２に示すようなデータ構造のテーブルとなっている。すなわち、本実施形態において、ハンドセット音響パラメータテーブル１２ａには、３組分の音響パラメータ群が格納されており、これら３組の音響パラメータ群が、３つの範囲（−２０ｄＢｍ〜−３０ｄＢｍ、−１０ｄＢｍ〜−２０ｄＢｍ、−１０ｄＢｍ以上）に区分された音声信号の信号レベル（＝音量レベル）に対して、一対一の関係で対応づけられている。 In the present embodiment, the handset acoustic parameter table 12a has a data structure as shown in FIG. In other words, in the present embodiment, three sets of acoustic parameter groups are stored in the handset acoustic parameter table 12a, and these three sets of acoustic parameter groups have three ranges (−20 dBm to −30 dBm, −10 dBm to Corresponding in a one-to-one relationship with the signal level (= volume level) of the audio signal divided into −20 dBm and −10 dBm or more).

１組分の音響パラメータ群には、各組とも、送話レベル、受話レベル、フィルター特性値、およびＬＥＣ（ラインエコーキャンセラー）設定値、以上４種の音響パラメータが含まれている。これら４種の音響パラメータのうち、送話レベルおよび受話レベルの２種については、操作パネル４で設定可能な３通りの音量設定（大、中、小）それぞれに対応する３対の音響パラメータが格納されている。 Each set of acoustic parameter groups includes a transmission level, a reception level, a filter characteristic value, an LEC (line echo canceller) setting value, and the above four types of acoustic parameters. Among these four types of acoustic parameters, for two types of transmission level and reception level, there are three pairs of acoustic parameters corresponding to each of the three volume settings (large, medium, and small) that can be set on the operation panel 4. Stored.

３組の音響パラメータ群のうち、−２０ｄＢｍ〜−３０ｄＢｍに対応する音響パラメータ群は、比較的小さい音声を対象にして信号処理を施すためのパラメータであり、この音響パラメータ群に基づいて信号処理を行うことで、音量の増大、ノイズ成分のカット、エコーキャンセル等が施されて、比較的小さい音声がより聞き取りやすい音量および音質に変換される。 Among the three acoustic parameter groups, the acoustic parameter group corresponding to −20 dBm to −30 dBm is a parameter for performing signal processing on a relatively small sound, and the signal processing is performed based on the acoustic parameter group. As a result, volume is increased, noise components are cut, echo cancellation, and the like are performed, and a relatively small sound is converted into a volume and sound quality that are easier to hear.

また、−１０ｄＢｍ〜−２０ｄＢｍに対応する音響パラメータ群は、標準的な音量の音声を対象にして信号処理を施すためのパラメータであり、この音響パラメータ群に基づいて信号処理を行うことで、音量は殆ど変更せず、ノイズ成分のカット、エコーキャンセル等が施されて、標準的な音量の音声がより聞き取りやすい音質に変換される。 Also, the acoustic parameter group corresponding to −10 dBm to −20 dBm is a parameter for performing signal processing on a sound with a standard volume, and by performing signal processing based on this acoustic parameter group, No change is made, noise components are cut, echo cancellation, etc. are performed, and the sound of standard volume is converted to a sound quality that is easier to hear.

さらに、−１０ｄＢｍ以上に対応する音響パラメータ群は、比較的大きい音声を対象にして信号処理を施すためのパラメータであり、この音響パラメータ群に基づいて信号処理を行うことで、音量の低減、ノイズ成分のカット、エコーキャンセル等が施されて、比較的大きい音声が聞き取りやすい音量および音質に変換される。 Furthermore, the acoustic parameter group corresponding to -10 dBm or more is a parameter for performing signal processing on a relatively loud sound. By performing signal processing based on this acoustic parameter group, sound volume reduction, noise The components are cut, echo cancelled, etc., and a relatively loud sound is converted into a volume and sound quality that are easy to hear.

これら３組の音響パラメータ群は、後述する処理の中で、いずれか１組が択一的に利用されて、通信先機器との間で伝送される音声信号に対する信号処理が施されることになる。このとき、送話レベルおよび受話レベルの２種については、音量設定（大、中、小）に対応する３対が用意された音響パラメータの中から、操作パネル４で設定された音量設定（大、中、小）に応じて、いずれか１対の音響パラメータが利用される。 Of these three acoustic parameter groups, any one of the groups will be used alternatively in the processing to be described later, and signal processing is performed on the audio signal transmitted to the communication destination device. Become. At this time, for the two types of transmission level and reception level, the volume setting (high volume) set on the operation panel 4 is selected from the acoustic parameters for which three pairs corresponding to the volume setting (high, medium, low) are prepared. , Medium, and small), any one pair of acoustic parameters is used.

後述する処理の中では、ＣＰＵ１１が通信先機器から送信されてくる音声信号の信号レベルのピーク値を測定し、所定の更新条件（詳細は後述）を満たした場合に、測定されたピーク値がピーク音声レベル１４ｂに格納される。また、ピーク音声レベル１４ｂに格納されたピーク値が更新された場合には、そのピーク値に対応する１組の音響パラメータ群の中から、操作パネル４での音量設定（大、中、小）も考慮して４種の音響パラメータが読み出され、読み出された４種の音響パラメータがハンドセット音響パラメータ１４ａに格納される。 In the processing to be described later, when the CPU 11 measures the peak value of the signal level of the audio signal transmitted from the communication destination device and satisfies a predetermined update condition (described later in detail), the measured peak value is Stored in the peak audio level 14b. When the peak value stored in the peak audio level 14b is updated, the volume setting (large, medium, small) on the operation panel 4 is selected from the set of acoustic parameters corresponding to the peak value. In consideration, the four types of acoustic parameters are read out, and the read out four types of acoustic parameters are stored in the handset acoustic parameter 14a.

ＣＰＵ１１は、通信先機器との間で伝送される音声信号に対し、常にハンドセット音響パラメータ１４ａを参照して信号処理を行うように構成されており、ピーク音声レベル１４ｂに格納されたピーク値が更新された際には、その更新に伴ってハンドセット音響パラメータ１４ａが更新されるので、その結果、ＣＰＵ１１は、通信先機器との間で伝送される音声信号に対し、更新後のピーク値に応じて決まる信号処理を施すことになる。 The CPU 11 is configured to always perform signal processing with reference to the handset acoustic parameter 14a for the audio signal transmitted to the communication destination device, and the peak value stored in the peak audio level 14b is updated. When this is done, the handset acoustic parameter 14a is updated along with the update. As a result, the CPU 11 responds to the updated peak value with respect to the audio signal transmitted to the communication destination device. The determined signal processing is performed.

［音声信号に対する信号処理］
次に、音声信号に対する信号処理について、図３に示すフローチャートに基づいて説明する。図３に示すフローチャートは、複合機１においてＣＰＵ１１によって常時実行される処理の中から、本発明の要部に関連する処理ステップのみを抜粋して示したものである。ＣＰＵ１１は、実際には図３に表れない処理をも実行しているが、本発明の要部に関連しない処理については図示を省略してある。 [Signal processing for audio signals]
Next, signal processing for an audio signal will be described based on the flowchart shown in FIG. The flowchart shown in FIG. 3 shows only the processing steps related to the main part of the present invention extracted from the processes that are always executed by the CPU 11 in the multifunction machine 1. The CPU 11 actually executes processing that does not appear in FIG. 3, but illustration of processing that is not related to the main part of the present invention is omitted.

ＣＰＵ１１によってこの処理が実行されると、まず、フックスイッチ４８が監視されて（Ｓ１０５）、利用者がハンドセットをフックアップしたか否かが判断される（Ｓ１１０）。ここで、利用者がハンドセットをフックアップしていない場合は（Ｓ１１０：いいえ）、Ｓ１０５の処理へと戻ってＳ１０５〜Ｓ１１０の処理が繰り返され、フックスイッチ４８の監視が継続される。なお、Ｓ１１０の処理では、利用者が実際にハンドセット４７をフックアップした場合はもちろんのこと、操作パネル４での操作によって複合機１の状態をフックアップした場合と同等な状態に切り替えた場合でも、ハンドセットをフックアップしたと判断される。 When this process is executed by the CPU 11, first, the hook switch 48 is monitored (S105), and it is determined whether or not the user has hooked up the handset (S110). Here, when the user has not hooked up the handset (S110: No), the process returns to S105, the processes of S105 to S110 are repeated, and the monitoring of the hook switch 48 is continued. In the process of S110, not only when the user actually hooks up the handset 47, but also when the state of the multifunction device 1 is switched to the same state as when the state of the multifunction device 1 is hooked up by the operation on the operation panel 4. It is determined that the handset is hooked up.

そして、Ｓ１０５〜Ｓ１１０の処理が繰り返される間に、利用者がハンドセットをフックアップしたら（Ｓ１１０：はい）、回線が閉結され（Ｓ１１５）、ハンドセット４７による通話開始処理が実行され（Ｓ１２０）、利用者は通信先機器を利用する通話相手との通話を行うことができる状態となる。 Then, if the user hooks up the handset while the processes of S105 to S110 are repeated (S110: Yes), the line is closed (S115), and the call start process by the handset 47 is executed (S120). The person can enter a call with the other party using the communication destination device.

続いて、ピーク値（Ｌｐ）の初期化が行われる（Ｓ１２５）。このピーク値（Ｌｐ）は、ＲＡＭ１４のピーク音声レベル１４ｂに格納される値であり、後述する処理の中で、通信先機器との間で伝送される音声信号の信号レベル（音量レベル）を測定した後は、測定された信号レベルのピーク値が格納されることになるが、Ｓ１２５の処理の段階では、通信先機器との間で伝送される音声信号の信号レベルがまだ実測されていないので、Ｓ１２５の処理では、ＲＡＭ１４のピーク音声レベル１４ｂに初期値として０ｄＢｍが格納される。 Subsequently, the peak value (Lp) is initialized (S125). This peak value (Lp) is a value stored in the peak audio level 14b of the RAM 14, and the signal level (volume level) of the audio signal transmitted to the communication destination device is measured in the process described later. After that, the peak value of the measured signal level is stored, but since the signal level of the audio signal transmitted to the communication destination device has not been actually measured at the stage of processing of S125. In the process of S125, 0 dBm is stored in the peak audio level 14b of the RAM 14 as an initial value.

続いて、ハンドセット音響パラメータの初期化が行われる（Ｓ１３０）。このＳ１３０の処理では、ピーク値（Ｌｐ）の初期値（０ｄＢｍ）に対応する音響パラメータ群が、ＲＯＭ１２のハンドセット音響パラメータテーブル１２ａから読み出される。このとき、操作パネル４で設定された音量設定（大、中、小）も参照され、この音量設定に応じた２種の音響パラメータ（送話レベルおよび受話レベル）と、他の２種の音響パラメータ（フィルター特性値およびＬＥＣ設定値）が読み出される。そして、読み出された４種の音響パラメータが、ＲＡＭ１４のハンドセット音響パラメータ１４ａに格納される。既に説明した通り、ＣＰＵ１１は、通信先機器との間で伝送される音声信号に対し、常にハンドセット音響パラメータ１４ａを参照して信号処理を行っているため、Ｓ１３０の処理を実行した時点では、通信先機器との間で伝送される音声信号の信号レベルのピーク値が０ｄＢｍであるとの想定で信号処理が行われることになる。 Subsequently, initialization of handset acoustic parameters is performed (S130). In the process of S130, the acoustic parameter group corresponding to the initial value (0 dBm) of the peak value (Lp) is read from the handset acoustic parameter table 12a of the ROM 12. At this time, the volume setting (high, medium, small) set on the operation panel 4 is also referred to, and two types of acoustic parameters (sending level and receiving level) according to the volume setting and the other two types of sound are set. Parameters (filter characteristic value and LEC set value) are read. Then, the read four kinds of acoustic parameters are stored in the handset acoustic parameter 14 a of the RAM 14. As already described, since the CPU 11 always performs signal processing on the audio signal transmitted to the communication destination device with reference to the handset acoustic parameter 14a, the communication is performed when the processing of S130 is performed. Signal processing is performed on the assumption that the peak value of the signal level of the audio signal transmitted to the destination device is 0 dBm.

なお、上述の通り、音声信号の信号レベルのピーク値が０ｄＢｍであるとの想定で信号処理を行う理由は、実際に通信先機器との間で伝送される音声信号の信号レベルが比較的過大であった場合には適切に音量を低減することができ、しかも、万一、信号レベルが比較的過小であった場合でもさらに音量が低減されるだけで、少なくともハンドセット４７から予期しない大音量の音声を発してしまうことは防止できるからである。仮に、通信先機器との間で伝送される音声信号の信号レベルのピーク値が−３０ｄＢｍといった想定で信号処理が行われたとすれば、実際に通信先機器との間で伝送される音声信号の信号レベルが比較的過小であった場合には適切に音量の増大を図ることができるかもしれないものの、万一、信号レベルが比較的過大であった場合にはさらに音量が増大してしまい、ハンドセット４７から予期しない大音量の音声を発してしまうおそれがある。 As described above, the reason for performing signal processing on the assumption that the peak value of the signal level of the audio signal is 0 dBm is that the signal level of the audio signal actually transmitted to the communication destination device is relatively excessive. If it is, the volume can be reduced appropriately, and even if the signal level is relatively low, the volume is only further reduced. This is because it is possible to prevent the voice from being emitted. If signal processing is performed on the assumption that the peak value of the signal level of the audio signal transmitted to the communication destination device is −30 dBm, the audio signal actually transmitted to the communication destination device If the signal level is relatively low, the volume may be increased appropriately, but if the signal level is relatively high, the volume will increase further. There is a risk of unexpected loud sound from the handset 47.

さて、以上の処理を終えたら、フックスイッチ４８が監視されて（Ｓ１３５）、ハンドセットをフックダウンしたか否かが判断される（Ｓ１４０）。ここで、ハンドセットをフックダウンしていないと判断された場合は（Ｓ１４０：いいえ）、通信先機器との間で伝送される音声信号の信号レベル（Ｌｃ）が測定される（Ｓ１４５）。 When the above processing is completed, the hook switch 48 is monitored (S135), and it is determined whether or not the handset is hooked down (S140). If it is determined that the handset is not hooked down (S140: No), the signal level (Lc) of the audio signal transmitted to the communication destination device is measured (S145).

そして、ＲＡＭ１４のピーク音声レベル１４ｂに格納されたピーク値（Ｌｐ）とＳ１４５の処理で測定された信号レベル（Ｌｃ）との差の絶対値（｜Ｌｐ−Ｌｃ｜）が、あらかじめ定められた閾値Ｌｈ（本実施形態では５ｄＢｍ）より大きいか否かが判断され（Ｓ１５０）、｜Ｌｐ−Ｌｃ｜＞Ｌｈであれば（Ｓ１５０：はい）、Ｓ１４５の処理で測定された信号レベル（Ｌｃ）が−３０ｄＢｍより大きいか否かが判断される（Ｓ１５５）。 The absolute value (| Lp−Lc |) of the difference between the peak value (Lp) stored in the peak audio level 14b of the RAM 14 and the signal level (Lc) measured in the process of S145 is a predetermined threshold value. It is determined whether or not it is greater than Lh (5 dBm in this embodiment) (S150). If | Lp-Lc |> Lh (S150: Yes), the signal level (Lc) measured in the process of S145 is −. It is determined whether it is larger than 30 dBm (S155).

ここで、Ｌｃ＞−３０ｄＢｍであれば（Ｓ１５５：はい）、ピーク音声レベルが更新される（Ｓ１６０）。具体的には、Ｓ１４５の処理で測定された信号レベル（Ｌｃ）をＲＡＭ１４のピーク音声レベル１４ｂに格納することによりピーク値（Ｌｐ）を更新する。そして、この新たなピーク値（Ｌｐ）に応じたハンドセット音響パラメータが設定される（Ｓ１６５）。このＳ１６５の処理では、新たなピーク値（Ｌｐ）に対応する４種の音響パラメータ（送話レベル、受話レベル、フィルター特性値、およびＬＥＣ設定値）が、ＲＯＭ１２のハンドセット音響パラメータテーブル１２ａから読み出され、読み出された音響パラメータ群が、ＲＡＭ１４のハンドセット音響パラメータ１４ａに格納される。以後、ＣＰＵ１１は、通信先機器との間で伝送される音声信号に対し、ハンドセット音響パラメータ１４ａを参照して信号処理を行うので、新たな音響パラメータ群に基づく信号処理が行われることになる。Ｓ１６５の処理を終えたら、Ｓ１３５の処理へと戻る。 Here, if Lc> −30 dBm (S155: Yes), the peak audio level is updated (S160). Specifically, the peak value (Lp) is updated by storing the signal level (Lc) measured in the process of S145 in the peak audio level 14b of the RAM 14. And the handset acoustic parameter according to this new peak value (Lp) is set (S165). In the process of S165, four types of acoustic parameters (transmission level, reception level, filter characteristic value, and LEC setting value) corresponding to the new peak value (Lp) are read from the handset acoustic parameter table 12a of the ROM 12. Then, the read acoustic parameter group is stored in the handset acoustic parameter 14 a of the RAM 14. Thereafter, since the CPU 11 performs signal processing on the audio signal transmitted to the communication destination device with reference to the handset acoustic parameter 14a, signal processing based on a new acoustic parameter group is performed. When the process of S165 is completed, the process returns to S135.

一方、Ｓ１５０の処理において、｜Ｌｐ−Ｌｃ｜≦Ｌｈであった場合は（Ｓ１５０：いいえ）、Ｓ１６０〜Ｓ１６５の処理を実行することなく、Ｓ１３５の処理へと戻る。このような処理を行うことにより、Ｓ１４５の処理で測定された信号レベル（Ｌｃ）が、３組の音響パラメータ群に対応づけられた３つの信号レベルの範囲の境界値（本実施形態の場合は−１０ｄＢｍ、−２０ｄＢｍ）付近で変動したときに、音響パラメータ群の更新が過剰に頻繁に行われてしまうのを防止することができる。具体例を挙げれば、例えば、Ｓ１４５の処理で測定された信号レベル（Ｌｃ）が、境界値−１０ｄＢｍを挟んで−９ｄＭｍ〜−１１ｄＢｍ間で変動しているような場合に、Ｓ１５０の処理を実施していないと、音響パラメータ群の更新が過剰に頻繁に行われ、その結果、音量の低減／低減の停止が頻繁に繰り返されるおそれがあるが、Ｓ１５０の処理を実施すれば、音響パラメータ群の更新は行われないので、音量の低減／低減の停止が頻繁に繰り返されることはないのである。 On the other hand, if | Lp−Lc | ≦ Lh in the process of S150 (S150: No), the process returns to S135 without executing the processes of S160 to S165. By performing such a process, the signal level (Lc) measured in the process of S145 is the boundary value of the range of the three signal levels associated with the three acoustic parameter groups (in the case of the present embodiment). When the frequency fluctuates in the vicinity of −10 dBm, −20 dBm), it is possible to prevent the acoustic parameter group from being updated excessively frequently. To give a specific example, for example, when the signal level (Lc) measured in the process of S145 varies between −9 dBm to −11 dBm across the boundary value of −10 dBm, the process of S150 is performed. Otherwise, the update of the acoustic parameter group is performed excessively frequently, and as a result, there is a possibility that the volume reduction / reduction stop is frequently repeated. However, if the process of S150 is performed, the acoustic parameter group is updated. Since the update is not performed, the volume reduction / reduction stop is not frequently repeated.

また、Ｓ１５５の処理において、Ｌｃ≦−３０ｄＢｍであった場合も（Ｓ１５５：いいえ）、Ｓ１６０〜Ｓ１６５の処理を実行することなく、Ｓ１３５の処理へと戻る。このような処理を行うことにより、Ｓ１４５の処理で測定された信号レベル（Ｌｃ）が、瞬間的に無音あるいは無音に近い状態となっただけで音響パラメータ群が更新されて無用な音量の増大が図られるのを防止することができる。 Also, in the process of S155, if Lc ≦ −30 dBm (S155: No), the process returns to S135 without executing the processes of S160 to S165. By performing such processing, the acoustic parameter group is updated only when the signal level (Lc) measured in the processing of S145 instantaneously becomes silent or nearly silent, and an unnecessary increase in volume is achieved. It can be prevented from being planned.

このように、Ｓ１５０の処理で否定判断されるか、Ｓ１５５の処理で否定判断されるか、あるいは、Ｓ１６５の処理を終えると、いずれの場合ともＳ１３５の処理へと戻ることになり、以降、Ｓ１４０の処理でハンドセットをフックダウンしていないと判断されている限り（Ｓ１４０：いいえ）、Ｓ１３５〜Ｓ１６５の処理が繰り返される。すなわち、通話中は、Ｓ１３５〜Ｓ１６５の処理が繰り返されることになる。そして、このＳ１３５〜Ｓ１６５の繰り返し処理の中で、ＲＡＭ１４のピーク音声レベル１４ｂに格納されたピーク値（Ｌｐ）とＳ１４５の処理で測定された信号レベル（Ｌｃ）との差の絶対値（｜Ｌｐ−Ｌｃ｜）が、あらかじめ定められた閾値Ｌｈより大きく（Ｓ１５０：はい）、且つ、Ｓ１４５の処理で測定された信号レベル（Ｌｃ）が−３０ｄＢｍより大きい場合には（Ｓ１５５：はい）、音響パラメータ群が更新されて、その結果、音量の最適化、音量に応じたノイズ成分のカット、音量に応じたエコーキャンセルなどが施され、通信先機器との間で伝送される音声信号が聞き取りやすい音声に補正される。 As described above, when a negative determination is made in the process of S150, a negative determination is made in the process of S155, or when the process of S165 is completed, the process returns to the process of S135 in any case. As long as it is determined that the handset is not hooked down in the process (S140: No), the processes in S135 to S165 are repeated. That is, during the call, the processes of S135 to S165 are repeated. Then, in the repetition processing of S135 to S165, the absolute value (| Lp) of the difference between the peak value (Lp) stored in the peak audio level 14b of the RAM 14 and the signal level (Lc) measured in the processing of S145. -Lc |) is larger than a predetermined threshold Lh (S150: Yes) and the signal level (Lc) measured in the process of S145 is larger than -30 dBm (S155: Yes), the acoustic parameter The group is updated, and as a result, the audio signal transmitted to and from the communication destination device is easy to hear because the volume is optimized, the noise component is cut according to the volume, the echo is canceled according to the volume, etc. It is corrected to.

なお、以上のようなＳ１３５〜Ｓ１６５の繰り返し処理中に、Ｓ１４０の処理でハンドセットをフックダウンしたと判断された場合は（Ｓ１４０：はい）、回線が開放され（Ｓ１７０）、Ｓ１０５の処理へと戻る。以降は、再びハンドセットをフックアップしたと判断されるまでは、Ｓ１０５〜Ｓ１１０の処理が繰り返されることになる。 If it is determined that the handset is hooked down in the process of S140 during the repeat process of S135 to S165 as described above (S140: Yes), the line is released (S170), and the process returns to S105. . Thereafter, the processing of S105 to S110 is repeated until it is determined that the handset is hooked up again.

以上説明した本発明の実施形態において、上記Ｓ１４５の処理を実行するＣＰＵ１１は、本発明でいう測定手段に相当し、上記Ｓ１６５の処理によってＲＡＭ１４のハンドセット音響パラメータ１４ａに格納される音響パラメータ群を参照して、通信先機器との間で伝送される音声信号に対する信号処理を行うＣＰＵ１１は、本発明でいう信号処理手段に相当する。また、Ｓ１６０の処理によって信号レベルが格納されることになるＲＡＭ１４（ピーク音声レベル１４ｂ）は、本発明でいうピーク値記憶手段に相当し、Ｓ１５５の処理を実行するＣＰＵ１１は、本発明でいう第１判定手段に相当し、Ｓ１５０の処理を実行するＣＰＵ１１は、本発明でいう第２判定手段に相当し、Ｓ１６０の処理を実行するＣＰＵ１１は、本発明でいう更新手段に相当する。 In the embodiment of the present invention described above, the CPU 11 that executes the process of S145 corresponds to the measurement means referred to in the present invention, and refers to the acoustic parameter group stored in the handset acoustic parameter 14a of the RAM 14 by the process of S165. And CPU11 which performs signal processing with respect to the audio | voice signal transmitted between communication destination apparatuses is equivalent to the signal processing means said by this invention. The RAM 14 (peak audio level 14b) in which the signal level is stored by the process of S160 corresponds to the peak value storage means in the present invention, and the CPU 11 that executes the process of S155 is referred to in the present invention. The CPU 11 that corresponds to the first determination means and executes the process of S150 corresponds to the second determination means in the present invention, and the CPU 11 that executes the process of S160 corresponds to the update means in the present invention.

以上、本発明の実施形態について説明したが、本発明は上記の具体的な一実施形態に限定されず、この他にも種々の形態で実施することができる。
例えば、上記実施形態では、本発明の音声処理装置を複合機１に適用する例を示したが、複合機１以外の通話装置に適用してもよいのはもちろんである。具体的には、ファクシミリ機能等を持たない単機能の電話機において本発明の構成を採用してもよい。また、電話機としては、ＰＳＴＮ網に接続される一般的な電話機の他、ＩＰ網に接続される電話機（いわゆるＩＰ電話機）や携帯電話機などもあるが、これらの電話機のどれにでも本発明の構成を採用することができる。さらに、パーソナルコンピュータにマイクロフォンやヘッドフォンを接続するとともに、そのパーソナルコンピュータにＩＰ電話ソフトウェアをインストールすれば、パーソナルコンピュータを利用したＩＰ通話が可能となるが、そのようなパーソナルコンピュータを利用して構成されたＩＰ通話装置においても、本発明の構成を採用することができる。 As mentioned above, although embodiment of this invention was described, this invention is not limited to said specific one Embodiment, In addition, it can implement with a various form.
For example, in the above-described embodiment, an example in which the voice processing device of the present invention is applied to the multi-function device 1 has been described. Specifically, the configuration of the present invention may be adopted in a single function telephone having no facsimile function or the like. In addition to general telephones connected to the PSTN network, telephones include telephones connected to the IP network (so-called IP telephones) and mobile phones. The configuration of the present invention is applicable to any of these telephones. Can be adopted. Furthermore, if a microphone and headphones are connected to a personal computer and IP telephone software is installed in the personal computer, an IP call using the personal computer can be performed. The personal computer is configured using such a personal computer. The configuration of the present invention can also be adopted in an IP telephone device.

また、上記実施形態では、音声信号の信号レベルを３つの範囲に区分して、各範囲に対応する３組の音響パラメータ群をハンドセット音響パラメータテーブル１２ａに格納していたが、音声信号の信号レベルは必ずしも３つの範囲に区分しなくてもよく、例えば、２つの範囲あるいは４つ以上の範囲に区分してもよい。また、各範囲の境界値についても−１０ｄＢｍ、−２０ｄＢｍに限定されるものではなく、任意に変更可能である。さらに、１組の音響パラメータ群を４種の音響パラメータで構成していたが、この音響パラメータの数も任意である。 In the above embodiment, the signal level of the audio signal is divided into three ranges, and three sets of acoustic parameter groups corresponding to each range are stored in the handset acoustic parameter table 12a. May not necessarily be divided into three ranges, for example, it may be divided into two ranges or four or more ranges. Further, the boundary value of each range is not limited to −10 dBm and −20 dBm, and can be arbitrarily changed. Furthermore, although one set of acoustic parameter groups is composed of four types of acoustic parameters, the number of acoustic parameters is also arbitrary.

また、上記実施形態では、あらかじめ用意されたハンドセット音響パラメータテーブル１２ａから、音響パラメータ群を読み取るように構成してあったが、音響パラメータ群を得るための方法は、テーブル参照方式に限られるものではなく、例えば、音声信号の信号レベルｘを変数とする所定の数式ｆ（ｘ）を用いて、所定の音響パラメータｙ＝ｆ（ｘ）を算出するように構成してもよい。 In the above embodiment, the acoustic parameter group is read from the handset acoustic parameter table 12a prepared in advance. However, the method for obtaining the acoustic parameter group is not limited to the table reference method. Instead, for example, a predetermined acoustic parameter y = f (x) may be calculated using a predetermined mathematical formula f (x) having the signal level x of the audio signal as a variable.

また、上記実施形態では、Ｓ１５０の処理において、閾値Ｌｈを５ｄＢｍに設定していたが、この閾値Ｌｈについては５ｄＢｍ以外の値を採用してもよく、また、Ｓ１５５の処理においては、信号レベル（Ｌｃ）が−３０ｄＢｍより大きいか否かを判断していたが、この値についても、−３０ｄＢｍ以外の値を採用してもよい。 In the above embodiment, the threshold value Lh is set to 5 dBm in the process of S150. However, a value other than 5 dBm may be adopted for the threshold value Lh. In the process of S155, the signal level ( Although it has been determined whether or not Lc) is greater than −30 dBm, a value other than −30 dBm may be employed for this value.

さらに、通信先機器との間で伝送される音声信号としては、通信先機器へ伝送される音声信号と、通信先機器から伝送されてくる音声信号の２つがあり、これら２つの音声信号の両方を本発明の音声処理装置による処理対象とすれば最も効果的であるが、いずれか一方の音声信号のみを本発明の音声処理装置による処理対象としても相応の効果は得られる。すなわち、本発明の構成は、送話側および受話側の双方に適用してもよいし、送話側のみに適用してもよいし、受話側のみに適用してもよい。送話側に適用した場合は、この通話装置の利用者が発する音声が過小または過大でも適切な音量や音質に補正してから通信先機器へと伝送することができ、受話側に適用した場合は、通信先機器から伝送されてくる音声が過小または過大でも適切な音量や音質に補正してから、補正後の音声をこの通話装置の利用者に聞かせることができる。 Furthermore, there are two audio signals transmitted to and from the communication destination device: an audio signal transmitted to the communication destination device and an audio signal transmitted from the communication destination device, both of these two audio signals. Is the most effective processing target by the speech processing apparatus of the present invention, but the corresponding effect can be obtained even if only one of the speech signals is processed by the speech processing apparatus of the present invention. That is, the configuration of the present invention may be applied to both the transmission side and the reception side, may be applied only to the transmission side, or may be applied only to the reception side. When applied to the transmitting side, even if the voice emitted by the user of this communication device is too low or too high, it can be transmitted to the destination device after being corrected to an appropriate volume and sound quality. When applied to the receiving side Can correct the sound and the sound quality to be appropriate even if the sound transmitted from the communication destination device is too small or too large, and then can hear the sound after the correction to the user of the call device.

本発明の実施形態として説明した通話装置の概略構成を示すブロック図。The block diagram which shows schematic structure of the telephone apparatus demonstrated as embodiment of this invention. ハンドセット音響パラメータテーブルの一例を示す説明図。Explanatory drawing which shows an example of a handset acoustic parameter table. 音声信号に対する信号処理を示すフローチャート。The flowchart which shows the signal processing with respect to an audio | voice signal.

Explanation of symbols

１・・・複合機、４・・・操作パネル、５・・・ＬＣＤ、１１・・・ＣＰＵ、１２・・・ＲＯＭ、１２ａ・・・ハンドセット音響パラメータテーブル、１３・・・ＥＥＰＲＯＭ、１４・・・ＲＡＭ、１４ａ・・・ハンドセット音響パラメータ、１４ｂ・・・ピーク音声レベル、１５・・・画像メモリ、１９・・・回線Ｉ／Ｆ部、２０・・・モデム、２１・・・バッファ、２２・・・スキャナ、２３・・・符号化部、２４・・・復号化部、２５・・・プリンタ、２７・・・アンプ、２８・・・スピーカ、３０・・・バスライン、４７・・・ハンドセット、４８・・・フックスイッチ。
DESCRIPTION OF SYMBOLS 1 ... MFP, 4 ... Operation panel, 5 ... LCD, 11 ... CPU, 12 ... ROM, 12a ... Handset acoustic parameter table, 13 ... EEPROM, 14 ... RAM, 14a: handset acoustic parameters, 14b: peak sound level, 15: image memory, 19: line I / F unit, 20: modem, 21: buffer, 22 ..Scanner, 23 ... encoding unit, 24 ... decoding unit, 25 ... printer, 27 ... amplifier, 28 ... speaker, 30 ... bus line, 47 ... handset 48 ... Hook switch.

Claims

Measuring means for measuring the volume level of the audio signal transmitted from the communication destination device;
Peak value storage means capable of storing a peak value of the volume level measured by the measurement means;
A first determination unit that determines to update the peak value stored in the peak value storage unit when a peak value of the volume level measured by the measurement unit is greater than a predetermined lower limit value; ,
When the peak value stored in the peak value storage means and the peak value of the volume level measured by the measurement means have a difference greater than a predetermined magnitude, the peak value storage means stores the difference. Second determination means for determining to update the peak value,
When the peak value stored in the peak value storage unit is determined to be updated by the first determination unit and the second determination unit , the peak value measured by the measurement unit is used. Update means to update with,
Based on the peak value of the volume level stored in the peak value storage unit updated by the updating unit, a signal determined according to the peak value with respect to an audio signal transmitted to the communication destination device Signal processing means for performing at least one of processing for changing the volume level to a level determined according to the peak value and processing for changing the level of echo cancellation to a level determined according to the peak value as processing A voice processing apparatus comprising:

A communication device comprising the voice processing device according to claim 1 .

The program for functioning a computer as each means with which the audio | voice processing apparatus of Claim 1 is provided.