JP4942755B2

JP4942755B2 - Audio signal processing apparatus and method

Info

Publication number: JP4942755B2
Application number: JP2008535814A
Authority: JP
Inventors: 晃永瓶子; 俊明久保; 聡山中; 雅之石田; 貴久青柳
Original assignee: Mitsubishi Electric Corp
Current assignee: Mitsubishi Electric Corp
Priority date: 2007-11-16
Filing date: 2008-07-28
Publication date: 2012-05-30
Anticipated expiration: 2028-07-28
Also published as: JPWO2009063662A1; WO2009063662A1

Description

本発明は、音声信号処理装置及び方法に関する。本発明は特に、デジタル音声信号の量子化する際のビット不足による歪みを改善する技術に関する。より詳しく述べれば、本発明は、デジタル音声信号の量子化ビット数を拡張したデータを生成する技術に係わり、特に、デジタル音声信号歪みの発生の抑制に関するものである。 The present invention relates to an audio signal processing apparatus and method. In particular, the present invention relates to a technique for improving distortion caused by insufficient bits when quantizing a digital audio signal. More specifically, the present invention relates to a technique for generating data in which the number of quantization bits of a digital audio signal is expanded, and more particularly to suppression of occurrence of digital audio signal distortion.

近年音声機器（オーディオ機器）、車載用音声機器（カーオーディオ機器）、ＤＶＤレコーダーにおける音声信号処理のデジタル化は一般的となっている。例えばＣＤなどは１６ビット量子化、サンプリング周波数４４．１ｋＨｚ（デジタル・オーディオ・テープの規格はサンプリング周波数４８ｋＨｚ）が規格として定められている。
一方、音声信号のソースはもともとアナログ的に生成されるような自然音が一般的であり、その入力レベルのダイナミックレンジは非常に広い。よって最小ビット付近の微小レベルで変化する入力信号は量子化ビット数不足により波形歪みが発生する。In recent years, digitalization of audio signal processing in audio equipment (audio equipment), in-vehicle audio equipment (car audio equipment), and DVD recorders has become common. For example, 16-bit quantization for a CD or the like and a sampling frequency of 44.1 kHz (a digital audio tape standard is a sampling frequency of 48 kHz) are defined as standards.
On the other hand, a natural sound that is generated in an analog manner is generally used as the source of the audio signal, and the dynamic range of the input level is very wide. Therefore, waveform distortion occurs in an input signal that changes at a minute level near the minimum bit due to an insufficient number of quantization bits.

下記の特許文献１では、このような量子化ビット数不足による信号歪みに対応するために
（ａ）アナログ信号形態の音声信号を量子化して得た順次の標本値を２のＮ乗分の１の分解能でデジタル信号に変換してなるＮビットのデジタル音声信号の符号情報を得ること、及び
（ｂ）このＮビットの符号情報の最下位桁（ＬＳＢ）に（Ｍ−Ｎ）ビットの付加符号情報を連続させて、Ｍ＞Ｎの関係にあるＭビットの符号情報にビット数変換を行うこと
が開示されている。In the following Patent Document 1, in order to cope with such signal distortion due to the insufficient number of quantization bits, (a) a sequential sample value obtained by quantizing an audio signal in the form of an analog signal is a 1 / Nth power of 2. Obtaining code information of an N-bit digital audio signal converted into a digital signal with a resolution of (b), and (b) an additional code of (MN) bits in the least significant digit (LSB) of the N-bit code information It is disclosed that the number of bits is converted into M-bit code information having a relationship of M> N by making information continuous.

下記の特許文献２では、このような量子化のビット数不足による信号歪みに対応するために
（ａ）入力されたデジタルデータのサンプルデータ毎のデータ変化のうち、変化しない区間をはさんだＬＳＢの変化の時間長を判断基準として、異なるカットオフ周波数を有する低域通過フィルタのうち少なくとも一つを選定して、信号出力を得ること、及び
（ｂ）ＬＳＢ変化の時間間隔からレベルの変化率を判別し、その判別された期間で波形変化をなめらかにするＬＳＢよりも下位のデータ列を生成後、ＬＳＢ変化点の前後にわたってＬＳＢ以下のデータを生成してビット拡張することが記載されている。In the following Patent Document 2, in order to cope with such signal distortion due to the insufficient number of bits of quantization, (a) among the data changes for each sample data of the input digital data, Using at least one of the low-pass filters having different cut-off frequencies as a criterion for the change time length to obtain a signal output, and (b) the level change rate from the time interval of the LSB change It is described that after generating a data string lower than the LSB that discriminates and smoothes the waveform change in the discriminated period, data below the LSB is generated before and after the LSB change point to extend the bit.

特許第３３３６８２３号明細書Japanese Patent No. 3336823 特公平７−７３１８６号公報Japanese Patent Publication No. 7-73186 雛元孝夫監修、棟安実治、夫田口亮著、「非線形ディジタル信号処理」朝倉書店、１９９９年３月２０日、ｐ．７２−７４、ｐ．１０６−１０８Supervised by Takao Hinamoto, Mitsuji Muneyasu, Ryo Otaguchi, “Nonlinear Digital Signal Processing”, Asakura Shoten, March 20, 1999, p. 72-74, p. 106-108

なお、上記の非特許文献１については、後に言及する。 The above Non-Patent Document 1 will be mentioned later.

例えば特許文献１の装置による方法は、最下位桁に付加符号情報を連続させるという処理を行っている。この方法には、最下位桁のみが変化する波形部分にしか付加符号情報が追加されない構成である。即ち、原データをｎビット、ビット数変換後をＭビットのデータ幅とすると、補正用に付加する符号情報は（Ｍ−ｎ）ビット幅しかない。そのため、波形歪み改善効果に乏しい。即ち、入力信号最下位桁のみが変化するサンプリング点間の変化しか高量子化ビット数で補間されない。 For example, the method of the apparatus of Patent Document 1 performs a process of making additional code information continuous at the least significant digit. In this method, the additional code information is added only to the waveform portion where only the least significant digit changes. That is, if the original data is n bits and the bit width is M bits, the code information added for correction has only (M−n) bits. Therefore, the waveform distortion improvement effect is poor. That is, only the change between sampling points where only the least significant digit of the input signal changes is interpolated with a high quantization bit number.

特許文献２による方法は、ＬＳＢ変化の時間長のみでカットオフ周波数を選択している。つまりＬＳＢ変化がある度にカットオフ周波数が変化する。また、ＬＳＢ変化の時間長のみで正確にカットオフ周波数を選択するは困難である。例えば一定周波数の正弦波でもＬＳＢ変化の時間長は一定ではない。そのため、長い時間範囲でみると処理のつなぎめがなめらかに変化せず、歪みが十分補正されない。 In the method according to Patent Document 2, the cutoff frequency is selected only by the time length of the LSB change. That is, the cut-off frequency changes every time there is an LSB change. In addition, it is difficult to accurately select the cut-off frequency only by the time length of the LSB change. For example, even with a sine wave having a constant frequency, the time length of the LSB change is not constant. For this reason, the process stitching does not change smoothly over a long time range, and the distortion is not sufficiently corrected.

本発明は、信号振幅が急峻に大きく変化する領域を有する音声信号入力波形の再現性を損なわないことを実現しながら、従来よりも細かく自由度の大きな波形歪み補正用の補間データを生成し、音声信号入力波形に近づけるような音声信号処理装置を提供することを目的とする。 The present invention generates interpolated data for waveform distortion correction that is finer and has a higher degree of freedom than in the past while realizing that the reproducibility of an audio signal input waveform having a region where the signal amplitude changes sharply and greatly is not impaired, It is an object of the present invention to provide an audio signal processing device that approximates an audio signal input waveform.

この発明の音声信号処理装置は、
ｎビット（ｎは整数）の音声信号データの列の各データをαビット分（αは整数）ビット拡張してｎ＋αビットの音声信号データの列を生成して出力する原データビット拡張部と、
前記ビット拡張によって生成されたｎ＋αビットの音声信号データの列を平滑化処理するエッジ保存型平滑化フィルタ部と、
前記ｎビットの音声信号の周波数と振幅を推定する周波数振幅推定部と、
前記周波数と前記振幅に基づいて低域通過フィルタ係数を生成するフィルタ係数生成部とを備え、
前記エッジ保存型平滑化フィルタ部は、前記フィルタ係数生成部で生成されたフィルタ係数を用いて平滑化を行い、
前記フィルタ係数生成部は、
推定された音声信号の周波数の高調波成分を除去する低域通過フィルタのフィルタ係数を生成し、推定された振幅が大きいほど前記低域通過フィルタの次数が小さくなる
ことを特徴とする。 The audio signal processing device of the present invention is
an original data bit extension unit for generating and outputting a sequence of n + α-bit audio signal data by extending each bit of the n-bit (n is an integer) audio signal data by α bits (α is an integer) bit;
An edge-preserving smoothing filter that smoothes a sequence of n + α-bit audio signal data generated by the bit extension ;
A frequency amplitude estimator for estimating the frequency and amplitude of the n-bit audio signal;
A filter coefficient generation unit that generates a low-pass filter coefficient based on the frequency and the amplitude,
The edge preserving smoothing filter unit performs smoothing using the filter coefficient generated by the filter coefficient generation unit,
The filter coefficient generation unit
A filter coefficient of a low-pass filter that removes harmonic components of the frequency of the estimated speech signal is generated, and the order of the low-pass filter decreases as the estimated amplitude increases. To do.

この発明によれば、波形歪み補正用に付加する情報のデータ幅はビット数変換（拡張）後のデータ幅で歪み補正データを生成するように構成している。従って、補正対象となるデータ幅が広い点で、上記特許文献１の先行技術とは異なる。さらに歪み補正用データ生成手段としてエッジ保存型平滑化フィルタ（例えば、一次元ｍ次εフィルタ）を適用している。これらの構成によって、従来方法よりも細かく自由度の大きな歪み補正用の補間データを生成し、原音波形に近づけることができる。
またエッジ保存型平滑化フィルタ（例えば、一次元ｍ次εフィルタ）を適用することにより、信号振幅が急峻に大きく変化する領域を有する音声信号の再現性は損なわれない。これは、フィルタ動作の閾値設定が可能であることによる。According to the present invention, the data width of information added for waveform distortion correction is configured to generate distortion correction data with a data width after bit number conversion (expansion). Therefore, it differs from the prior art of Patent Document 1 in that the data width to be corrected is wide. Further, an edge-preserving smoothing filter (for example, a one-dimensional mth-order ε filter) is applied as distortion correction data generation means. With these configurations, it is possible to generate interpolation data for distortion correction that is finer and has a higher degree of freedom than the conventional method, and to approximate the original sound waveform.
Further, by applying an edge preserving smoothing filter (for example, a one-dimensional m-order ε filter), the reproducibility of an audio signal having a region where the signal amplitude changes sharply and greatly is not impaired. This is because the threshold value for the filter operation can be set.

本発明の実施の形態１に係る音声信号出力装置の構成を示すブロック図である。It is a block diagram which shows the structure of the audio | voice signal output device which concerns on Embodiment 1 of this invention. 式（１）の関数ｆ（ｕ）を示す図である。It is a figure which shows the function f (u) of Formula (1). （ａ）、（ｂ）は、εフィルタの動作を説明するための図である。(A), (b) is a figure for demonstrating operation | movement of an epsilon filter. （ａ）、（ｂ）は、εフィルタの動作を説明するための図である。(A), (b) is a figure for demonstrating operation | movement of an epsilon filter. 実施の形態１の他の構成例による音声信号出力装置を示すブロック図である。5 is a block diagram showing an audio signal output device according to another configuration example of Embodiment 1. FIG. 実施の形態１のさらに他の構成例による音声信号出力装置を示すブロック図である。5 is a block diagram showing an audio signal output device according to still another configuration example of Embodiment 1. FIG. 実施の形態１のより具体的な構成例を示すブロック図である。3 is a block diagram illustrating a more specific configuration example of the first embodiment. FIG. （ａ）及び（ｂ）は、実施の形態１で用いられる入力処理部２の動作を説明するための図である。(A) And (b) is a figure for demonstrating operation | movement of the input process part 2 used in Embodiment 1. FIG. （ａ）及び（ｂ）は、実施の形態１で用いられる原データビット拡張部５の動作を説明するための図である。(A) And (b) is a figure for demonstrating operation | movement of the original data bit expansion part 5 used in Embodiment 1. FIG. 実施の形態１で用いられるデータ格納部７の構成例を示す図である。4 is a diagram illustrating a configuration example of a data storage unit 7 used in Embodiment 1. FIG. 実施の形態１で用いられるデータ格納部７の出力を示す図である。6 is a diagram illustrating an output of a data storage unit 7 used in the first embodiment. FIG. 実施の形態１で用いられる第１の差分算出部８−１〜第４の差分算出部８−４、第１のε判定部９−１〜第４のε判定部９−４、及び判定付き加重平均部１０で生成されるデータをまとめた表である。First difference calculation unit 8-1 to fourth difference calculation unit 8-4 used in the first embodiment, first ε determination unit 9-1 to fourth ε determination unit 9-4, and determination 3 is a table summarizing data generated by the weighted average unit 10. （ａ）及び（ｂ）は、実施の形態１で用いられる係る第１のε判定部９−１〜第４のε判定部９−４の動作、及び判定付き加重平均部１０の動作を説明するための図である。(A) And (b) demonstrates operation | movement of the 1st (epsilon) determination part 9-1-the 4th (epsilon) determination part 9-4 which are used in Embodiment 1, and operation | movement of the weighted average part 10 with determination. It is a figure for doing. 音声信号データＤＭに対応する加重平均値ＥＭを示す図である。It is a figure which shows the weighted average value EM corresponding to the audio | voice signal data DM. （ａ）〜（ｃ）は、実施の形態１で用いられるデータ加算部１１の動作を説明するための図である。(A)-(c) is a figure for demonstrating operation | movement of the data addition part 11 used in Embodiment 1. FIG. （ａ）〜（ｅ）は、実施の形態１に係る音声信号出力装置に信号振幅が急峻で大きく（閾値よりも大きく）変化する信号が入力された場合の動作を説明するための図である。(A)-(e) is a figure for demonstrating operation | movement when the signal which the signal amplitude is steep and is large (it is larger than a threshold value) is input into the audio | voice signal output device which concerns on Embodiment 1. FIG. . 実施の形態１で用いられる第１の差分算出部８−１〜第４の差分算出部８−４、第１のε判定部９−１〜第４のε判定部９−４、及び判定付き加重平均部１０で生成されるデータを示す表である。First difference calculation unit 8-1 to fourth difference calculation unit 8-4 used in the first embodiment, first ε determination unit 9-1 to fourth ε determination unit 9-4, and determination 4 is a table showing data generated by the weighted average unit 10. 本発明の実施の形態１の音声信号出力装置を示すブロック図である。It is a block diagram which shows the audio | voice signal output device of Embodiment 1 of this invention. 実施の形態１に係る音声信号出力装置の処理工程を示すフローチャートである。4 is a flowchart illustrating processing steps of the audio signal output device according to the first embodiment. 本発明の実施の形態１に係る音声信号出力装置を示すブロック図である。It is a block diagram which shows the audio | voice signal output device which concerns on Embodiment 1 of this invention. 本発明の実施の形態２の音声信号出力装置の別の構成例を示すブロック図である。It is a block diagram which shows another structural example of the audio | voice signal output device of Embodiment 2 of this invention. 本発明の実施の形態３に係る音声信号処理装置の構成を示すブロック図である。It is a block diagram which shows the structure of the audio | voice signal processing apparatus which concerns on Embodiment 3 of this invention. （ａ）及び（ｂ）は、実施の形態３のｎビットの音声信号の一例を示す図である。(A) And (b) is a figure which shows an example of the n-bit audio | voice signal of Embodiment 3. FIG. （ａ）及び（ｂ）は、εフィルタの動作を説明するための図である。(A) And (b) is a figure for demonstrating the operation | movement of an epsilon filter. （ａ）及び（ｂ）は、原データビット拡張部２４の動作を示す図である。(A) And (b) is a figure which shows the operation | movement of the original data bit expansion part 24. FIG. （ａ）及び（ｂ）は、エッジ保存型平滑化フィルタ部２５の動作を示す図である。(A) And (b) is a figure which shows operation | movement of the edge preservation | save type | mold smoothing filter part 25. FIG. 周波数振幅推定部２２の一例を示すブロック図である。3 is a block diagram illustrating an example of a frequency amplitude estimation unit 22. FIG. （ａ）〜（ｄ）は、図２７の周波数振幅推定部２２の動作を示す図である。(A)-(d) is a figure which shows operation | movement of the frequency amplitude estimation part 22 of FIG. （ａ）〜（ｄ）は、図２７の周波数振幅推定部２２の動作を示す図である。(A)-(d) is a figure which shows operation | movement of the frequency amplitude estimation part 22 of FIG. （ａ）〜（ｃ）は、異なる周波数成分が重畳された音声信号が入力されたときの周波数振幅推定部２２の動作を説明するための図である。(A)-(c) is a figure for demonstrating operation | movement of the frequency amplitude estimation part 22 when the audio | voice signal on which the different frequency component was superimposed is input. （ａ）〜（ｄ）は、異なる周波数成分が重畳された音声信号が入力されたときの周波数振幅推定部２２の動作を説明するための図である。(A)-(d) is a figure for demonstrating operation | movement of the frequency amplitude estimation part 22 when the audio | voice signal on which the different frequency component was superimposed is input. （ａ）〜（ｄ）は、周波数振幅推定部２２の異なる例の動作を示す図である。(A)-(d) is a figure which shows operation | movement of the example from which the frequency amplitude estimation part 22 differs. フィルタ係数生成部２３内に記憶された係数テーブルの一例を示す図である。6 is a diagram illustrating an example of a coefficient table stored in a filter coefficient generation unit 23. FIG. （ａ）及び（ｂ）は、カットオフ周波数と次数が異なる複数の低域通過フィルタ係数を格納するフィルタ係数テーブルを説明するための図である。(A) And (b) is a figure for demonstrating the filter coefficient table which stores the some low-pass filter coefficient from which a cut-off frequency and an order differ. 実施の形態３に係る音声信号処理装置の処理工程を示すフローチャートである。10 is a flowchart illustrating processing steps of the audio signal processing device according to the third embodiment.

Explanation of symbols

１入力端子、２入力処理部、３拡張データ生成処理部、４出力処理部、５原データビット拡張部、６一次元ｍ次εフィルタ部、７データ格納部、８―１〜８−４差分算出部、９―１〜９−４ ε判定部、１０判定付き加重平均部、１１データ加算部、１２係数プログラマブル判定付き加重平均部、１３次数可変判定付き加重平均部、１５出力処理部、１５ａプログラマブルアンプ、１６比較部、１７ビット拡張及び判定付き加重平均部、１８ビット拡張付きデータ加算部、１８ａビット拡張部、１８ｂデータ加算部、１９ビット拡張付き一次元ｍ次εフィルタ部、２１入力端子、２２周波数振幅推定部、２３フィルタ係数生成部、２４原データビット拡張部、２５エッジ保存型平滑化フィルタ部、２７変曲点検出部、２８周波数推定部、２９振幅推定部、３１一次微分算出部、３２符号変化点検出部。 1 input terminal, 2 input processing unit, 3 extended data generation processing unit, 4 output processing unit, 5 original data bit expansion unit, 6 one-dimensional m-order ε filter unit, 7 data storage unit, 8-1 to 8-4 difference Calculation unit, 9-1 to 9-4 ε determination unit, 10 weighted average unit with determination, 11 data addition unit, 12 weighted average unit with coefficient programmable determination, 13 weighted average unit with variable degree determination, 15 output processing unit, 15a Programmable amplifier, 16 comparison unit, 17-bit extension and weighted average unit with determination, 18-bit extension data addition unit, 18a-bit extension unit, 18b data addition unit, one-dimensional m-order ε filter unit with 19-bit extension, 21 input terminal , 22 Frequency amplitude estimation unit, 23 Filter coefficient generation unit, 24 Original data bit extension unit, 25 Edge Presence smoothing filter section, 27 the inflection point detecting unit, 28 a frequency estimation unit, 29 an amplitude estimating unit, 31 a primary differential calculating section, 32 sign change point detecting unit.

実施の形態１．
図１は、本発明の実施の形態１に係る音声信号出力装置の構成を示すブロック図である。実施の形態１に係る音声信号出力装置は、入力端子１、入力処理部２、拡張データ生成処理部３、及び出力処理部４を備える。これらのうち、入力処理部２、及び拡張データ生成処理部３により、音声信号処理装置が構成されている。Embodiment 1 FIG.
FIG. 1 is a block diagram showing a configuration of an audio signal output apparatus according to Embodiment 1 of the present invention. The audio signal output device according to Embodiment 1 includes an input terminal 1, an input processing unit 2, an extended data generation processing unit 3, and an output processing unit 4. Among these, the input signal processing unit 2 and the extended data generation processing unit 3 constitute an audio signal processing device.

本実施の形態では、入力処理部２をアナログの音声信号からデジタルの音声信号データに変換するＡ／Ｄ変換器として説明を行うが、入力処理部２にアナログ処理回路を備え、入力処理部２の内部でイコライジングなどのアナログ処理した後にデジタルの音声信号データに変換しても良いし、入力処理部２をデジタルインターフェースとして入力端子１よりデジタルのデータを受信して、ｎビット音声信号データＤＩを出力しても良い。 In the present embodiment, the input processing unit 2 is described as an A / D converter that converts analog audio signals into digital audio signal data. However, the input processing unit 2 includes an analog processing circuit, and the input processing unit 2 May be converted into digital audio signal data after analog processing such as equalizing, or digital data is received from the input terminal 1 using the input processing unit 2 as a digital interface, and the n-bit audio signal data DI is It may be output.

アナログの音声信号ＳＡが入力端子１より入力処理部２に入力され、入力処理部２は、アナログの音声信号ＳＡを所定のサンプリング周期でサンプリングして、ｎビットの音声信号データＤＩの列に変換して拡張データ生成処理部３に出力する。拡張データ生成処理部３は、原データビット拡張部５と、一次元ｍ次εフィルタ部６を備え、ｎビットの音声信号データＤＩの列の各データをｎ＋αビットに変換して出力処理部４に出力する。原データビット拡張部５は、ｎビットの音声信号データＤＩをαビット分ビット拡張（「ビットシフト」とも言う）したｎ＋αビットの音声信号データＤＳの列を一次元ｍ次εフィルタ部６に出力する。 An analog audio signal SA is input from the input terminal 1 to the input processing unit 2, and the input processing unit 2 samples the analog audio signal SA at a predetermined sampling period and converts it into a sequence of n-bit audio signal data DI. And output to the extended data generation processing unit 3. The extension data generation processing unit 3 includes an original data bit extension unit 5 and a one-dimensional mth-order ε filter unit 6, converts each data in a sequence of n-bit audio signal data DI into n + α bits, and outputs an output processing unit 4. Output to. The original data bit extension unit 5 outputs to the one-dimensional m-order ε filter unit 6 a sequence of n + α-bit audio signal data DS obtained by bit-extending (also referred to as “bit shift”) n-bit audio signal data DI by α bits. To do.

音声信号データＤＳは（他の音声信号データＤＩなども同様であるが）、時系列でサンプリングされた、サンプリング点（サンプリングタイミング）ごとのデータ値（サンプル値）を順に表すデータの列から成り、時間軸上の座標値ｉは、１サンプリング周期毎に１ずつ大きくなる。従って、入力処理部２、原データビット拡張部５など、ディジタル信号を出力する回路から出力されるデータなどはデータの列を構成するものであるが、以下の説明では、単にデータと呼ぶことがある。 The audio signal data DS (although the other audio signal data DI and the like are the same) is composed of a sequence of data sequentially representing data values (sample values) for each sampling point (sampling timing) sampled in time series. The coordinate value i on the time axis increases by one for each sampling period. Therefore, data output from a circuit that outputs a digital signal, such as the input processing unit 2 and the original data bit expansion unit 5, constitutes a data string, but in the following description, it is simply referred to as data. is there.

入力処理部２における音声信号ＳＡから音声信号データＤＩへの変換や、入力処理部２から原データビット拡張部５への音声信号データＤＩの供給、原データビット拡張部５における、ビット拡張、原データビット拡張部５からデータ格納部７への音声信号データＤＳの供給などは（以下に説明する拡張データ生成処理部３の他の部分の動作も同様であるが）、図示しない制御部から供給されるサンプリング周波数クロックに同期して（サンプリング周波数と同じ周波数、又はサンプリング周波数の整数倍又は整数分の１の周波数で）で行われる。 Conversion from the audio signal SA to the audio signal data DI in the input processing unit 2, supply of the audio signal data DI from the input processing unit 2 to the original data bit expansion unit 5, bit extension in the original data bit expansion unit 5, The supply of the audio signal data DS from the data bit extension unit 5 to the data storage unit 7 (the operation of other parts of the extension data generation processing unit 3 described below is the same) is supplied from a control unit (not shown). (Same frequency as the sampling frequency, or an integral multiple of the sampling frequency or a fraction of an integer).

一次元ｍ次εフィルタ部６は、原データビット拡張部５から出力される、ｎ＋αビットの音声信号データの列を受け、注目サンプリング点（以下、「注目サンプリング点」を単に「注目点」と言うことがある）の音声信号データと、注目点の前後の他のサンプリング点の音声信号のデータ（即ち時間軸方向に整列したサンプリング点のデータ）に基づいて、急峻で大きな変化が存在する部分を保存しながら、小振幅成分を雑音として扱って平滑化を行うエッジ保存型平滑化フィルタとして用いられているものであり、図示の一次元ｍ次εフィルタ部６は、データ格納部７、第１の差分算出部８−１〜第ｍの差分算出部８−ｍ、第１のε判定部９−１〜第ｍのε判定部９−ｍ、判定付き加重平均部１０、及びデータ加算部１１を備える。原データビット拡張部５で増やしたデータ幅を使って平滑化することで量子化ビット数不足による歪みをなくし、実効的な量子化ビット数を増やす役割を持つ。 The one-dimensional m-th order ε filter unit 6 receives a sequence of n + α-bit audio signal data output from the original data bit extension unit 5 and simply selects a target sampling point (hereinafter, “target sampling point” as “target point”). A portion where there is a steep and large change based on the audio signal data of the other sampling points before and after the point of interest (that is, sampling point data aligned in the time axis direction). Is used as an edge-preserving smoothing filter that performs smoothing by treating a small amplitude component as noise, and a one-dimensional m-th order ε filter unit 6 shown in FIG. 1 difference calculation unit 8-1 to m-th difference calculation unit 8-m, first ε determination unit 9-1 to m-th ε determination unit 9-m, weighted average unit with determination 10, and data addition unit 11 is provided. By performing smoothing using the data width increased by the original data bit extension unit 5, the distortion due to the insufficient number of quantization bits is eliminated, and the effective number of quantization bits is increased.

一次元ｍ次εフィルタ部６により、実効的な量子化ビット数を増やした音声信号データＤＯの列は、出力処理部４に出力され、出力処理部４は、ｎ＋αビットの音声信号データＤＯの列のデータを例えばデジタル−アナログ変換し、アナログ的なゲインコントロール、イコライジングなどを施して出力する。
出力処理部４は他に、電気的信号から光信号への変換、デジタルデータのシリアル−パラレル、パラレル−シリアル変換などの処理を実施することが想定される。The sequence of the audio signal data DO with the effective number of quantization bits increased by the one-dimensional m-order ε filter unit 6 is output to the output processing unit 4, and the output processing unit 4 outputs the n + α-bit audio signal data DO. For example, the data of the column is converted from digital to analog, and subjected to analog gain control, equalizing, and the like.
In addition, the output processing unit 4 is assumed to perform other processes such as conversion from an electrical signal to an optical signal, serial-parallel and parallel-serial conversion of digital data.

ここで、音声信号に適用するεフィルタについて説明する。εフィルタは例えば上記の非特許文献１に説明されている。 Here, the ε filter applied to the audio signal will be described. The ε filter is described in Non-Patent Document 1 described above, for example.

εフィルタによる時間軸方向の一次元処理は、例えば式（１）で表わされる。

ここで、ｘ（ｉ）はサンプリングタイミングｉの入力データ値、ｙ（ｉ）は入力データｘ（ｉ）が入力されたときの出力データ値、ｍは低域通過フィルタの次数、ａ_ｋはフィルタ係数、ｋはサンプリングタイミングｉの前後のサンプリングタイミングの、サンプリングタイミングｉからの時間差（相対的な位置）をサンプリング周期数で表す値、εは閾値である。
式（１）は、サンプリング点ｉを注目点としたときのフィルタリング処理を表すものである。One-dimensional processing in the time axis direction by the ε filter is expressed by, for example, Expression (1).

Here, x (i) is the input data value at the sampling timing i, y (i) is the output data value when the input data x (i) is input, m is the order of the low-pass filter, and _ak is the filter A coefficient, k is a value representing the time difference (relative position) from the sampling timing i of the sampling timing before and after the sampling timing i, and ε is a threshold value.
Equation (1) represents the filtering process when the sampling point i is the point of interest.

上記の式（１）はｍが偶数である場合のものであり、注目点のデータ、注目点の（ｍ／２）サンプリング周期前からのサンプリングデータ、注目点の｛（ｍ／２）−１｝サンプリング周期後までのサンプリングデータの計ｍ個のデータを用いてフィルタ処理する式として記載している。もしも注目点の前後の等しいサンプリングデータを用いてεフィルタ処理するのであれば、ｍを奇数として規定し、注目点を基準として（ｍ−１）／２サンプリング周期から注目点（ｍ−１）／２サンプリング周期後までのサンプリングデータを計算するように式（１）を表せばよい。もちろん、注目点より後のサンプリングデータのみを使ってεフィルタ処理するのであれば、ｍは奇数にも偶数にも規定されず、注目点のデータから注目点よりｍサンプリング周期後までのサンプリングデータを使ってεフィルタ処理すればよい。即ち、どの範囲のサンプリングデータを使ってεフィルタ処理するかは、適用する信号に応じて、最適なように設定すればよい。 The above equation (1) is for the case where m is an even number, and the data of the point of interest, the sampling data of the point of interest before the (m / 2) sampling cycle, the point of interest {(m / 2) −1 } It is described as an expression for filtering using a total of m pieces of sampling data up to after the sampling period. If ε filter processing is performed using equal sampling data before and after the attention point, m is defined as an odd number, and the attention point (m−1) / Expression (1) may be expressed so as to calculate sampling data up to two sampling periods later. Of course, if ε filter processing is performed using only the sampling data after the attention point, m is not defined as an odd number or an even number, and sampling data from the attention point data up to m sampling cycles after the attention point is obtained. Ε filter processing may be used. That is, what range of sampling data should be used for the ε filter processing may be set optimally according to the applied signal.

図２は、式（１）のｆ（ｕ）を示す図である。ｕの絶対値が閾値ε以下の場合ｆ（ｕ）＝ｕとなり、ｕの絶対値が閾値εより大きい場合ｆ（ｕ）＝０となる。この結果、フィルタ入力とレベルとフィルタ出力レベルは同じになって、入力前後の閾値εより大きな信号変化はそのまま残る。
以下は式（１）に基づいて説明を実施する。上記の例では、ｕとしてｘ（ｉ−ｋ）−ｘ（ｉ）が供給される。FIG. 2 is a diagram illustrating f (u) in the equation (1). When the absolute value of u is less than or equal to the threshold ε, f (u) = u, and when the absolute value of u is greater than the threshold ε, f (u) = 0. As a result, the filter input, the level, and the filter output level become the same, and the signal change larger than the threshold value ε before and after the input remains as it is.
The following description is based on the formula (1). In the above example, x (ik) -x (i) is supplied as u.

注目点ｉの音声信号データｘ（ｉ）と、注目点ｉからｋサンプリング周期だけ離れた点（ｉ−ｋ）における音声信号データｘ（ｉ−ｋ）との、差分ｘ（ｉ−ｋ）−ｘ（ｉ）が小さい（差分が閾値ε以下である）場合、ｆ（ｕ）＝ｆ｛ｘ（ｉ−ｋ）−ｘ（ｉ）｝はほぼ線形であり、係数ａ_ｋの総和を１と仮定すると、式（１）は式（２）と書き換えられ、これは重み付け平均値フィルタと等しくなる。Difference x (ik) − between the speech signal data x (i) at the point of interest i and the speech signal data x (i−k) at the point (i−k) separated from the point of interest i by the k sampling period. When x (i) is small (difference is equal to or smaller than the threshold ε), f (u) = f {x (ik) −x (i)} is almost linear and the sum of the coefficients a _k is 1. Assuming equation (1) is rewritten as equation (2), which is equal to the weighted average filter.

また、式（１）は、式（３）のように書き換えることができる。

Also, equation (1) can be rewritten as equation (3).

式（３）のｆ（ｕ）に図２の非線形関数を適用した場合に新たに定義されるｘ’（ｉ−ｋ）を用いると、式（３）は、式（４）と書換えることができる。式（４）において、ｘ’（ｉ−ｋ）は、差分ｘ（ｉ−ｋ）−ｘ（ｉ）が閾値ε以下の場合はｘ（ｉ−ｋ）となり、差分ｘ（ｉ−ｋ）−ｘ（ｉ）が閾値εより大きい場合はｘ（ｉ）となる。 When x ′ (ik) newly defined when the nonlinear function of FIG. 2 is applied to f (u) of Expression (3), Expression (3) is rewritten as Expression (4). Can do. In Expression (4), x ′ (ik) becomes x (ik) when the difference x (ik) −x (i) is less than or equal to the threshold ε, and the difference x (ik) − When x (i) is larger than the threshold ε, x (i) is obtained.

図３及び図４は、εフィルタにステップ状に変化する信号が入力されたときの動作を示す図である。図３（ａ）及び（ｂ）はステップ状の変化の幅が閾値ε以下である場合を示し、図４（ａ）及び（ｂ）は、ステップ状の変化の幅が閾値εよりも大きい場合を示す。図３（ａ）及び図４（ａ）は、εフィルタの入力信号ｘ（ｉ）の波形を示し、図３（ｂ）及び図４（ｂ）は、それぞれ図３（ａ）及び図４（ａ）の信号が入力されたときのεフィルタの、サンプリング点ａ、及びｂの出力信号ｙ（ａ）、ｙ（ｂ）を、入力信号ｘ（ｉ）とともに示す。 3 and 4 are diagrams illustrating an operation when a signal that changes in a stepwise manner is input to the ε filter. 3A and 3B show the case where the step-like change width is equal to or smaller than the threshold ε, and FIGS. 4A and 4B show the case where the step-like change width is larger than the threshold ε. Indicates. 3A and 4A show the waveform of the input signal x (i) of the ε filter, and FIGS. 3B and 4B show FIGS. 3A and 4B, respectively. The output signals y (a) and y (b) at the sampling points a and b of the ε filter when the signal a) is input are shown together with the input signal x (i).

これらの図において、横軸は時間軸でありサンプリング点（サンプリングタイミング）
ｉ（ｉ＝…（ａ−３）、（ａ−２）、（ａ−１）、ａ、ｂ、（ｂ＋１）、（ｂ＋２）…）、縦軸は信号ｘ（ｉ）、ｙ（ｉ）のデータ値レベルを示している。ｘ（ｂ）はｘ（ａ）のサンプリング点ａの次のサンプリング点ｂの入力信号であり、サンプリング点ａとサンプリング点ｂの間でステップ状の変化（上昇）が起きた場合を想定している。また、ｘ（ｂ＋１）、ｘ（ｂ＋２）…は、それぞれサンプリング点ｂよりも１サンプリング周期、２サンプリング周期後のサンプリング点ｉ（ｂ＋１）、ｉ（ｂ＋２）…の信号、ｘ（ａ−１）、ｘ（ａ−２）、ｘ（ａ−３）…は、サンプリング点ａよりも１サンプリング周期、２サンプリング周期、３サンプリング周期前のサンプリング点（ａ−１）、（ａ−２）、（ａ−３）…の信号を表す。In these figures, the horizontal axis is the time axis and sampling points (sampling timing)
i (i =... (a-3), (a-2), (a-1), a, b, (b + 1), (b + 2)...), and the vertical axis represents signals x (i) and y (i). Indicates the data value level. x (b) is an input signal of the sampling point b next to the sampling point a of x (a), and a case where a step-like change (rise) occurs between the sampling point a and the sampling point b is assumed. Yes. Further, x (b + 1), x (b + 2)... Are signals of sampling points i (b + 1), i (b + 2). , X (a-2), x (a-3)... Are sampling points (a-1), (a-2), (a) one sampling period, two sampling periods, three sampling periods before the sampling point a. a-3)...

以下、ここでは（図３（ａ）、図３（ｂ）、図４（ａ）、図４（ｂ）を参照した説明では）、ｍ＝６であるものとして説明する。
図３（ａ）に示すような入力信号波形を処理する場合、図２の関数ｆを用いたεフィルタによれば、サンプリング点ａ、ｂ信号のｘ（ａ）と信号ｘ（ｂ）のレベルの差が閾値ε以下のため、サンプリング点ａのフィルタリング後の信号ｙ（ａ）は、範囲Ｗａ内のサンプリング値ｘ（ａ−３）、ｘ（ａ−２）、ｘ（ａ−１）、ｘ（ａ）、ｘ（ｂ）、ｘ（ｂ＋１）の加重平均であり、範囲Ｗａ内には、信号ｘ（ｂ）のレベルのデータｘ（ｂ）、ｘ（ｂ＋１）が含まれ、これらのデータも加重平均に用いられる。この結果、フィルタリング後の信号ｙ（ａ）は、フィルタリング前の信号ｘ（ａ）よりも大きい値のものとなる。In the following description, it is assumed that m = 6 (in the description with reference to FIGS. 3A, 3B, 4A, and 4B).
When an input signal waveform as shown in FIG. 3A is processed, according to the ε filter using the function f in FIG. 2, the levels of x (a) and x (b) of the sampling points a and b of the signal Is equal to or smaller than the threshold ε, the filtered signal y (a) at the sampling point a is a sampling value x (a−3), x (a−2), x (a−1) in the range Wa, x (a), x (b), x (b + 1) is a weighted average, and the range Wa includes data x (b), x (b + 1) at the level of the signal x (b). Data is also used for the weighted average. As a result, the signal y (a) after filtering has a larger value than the signal x (a) before filtering.

またサンプリング点ｂのフィルタリング後の信号ｙ（ｂ）は、範囲Ｗｂ内のサンプリング値ｘ（ａ−２）、ｘ（ａ−１）、ｘ（ａ）、ｘ（ｂ）、ｘ（ｂ＋１）、ｘ（ｂ＋２）の加重平均であり、範囲Ｗｂ内には、信号ｘ（ａ）のレベルのデータｘ（ａ−２）、ｘ（ａ−１）、ｘ（ａ）が含まれ、これらのデータも加重平均に用いられる。この結果、フィルタリング後の信号ｙ（ｂ）は、フィルタリング前の信号ｘ（ｂ）よりも小さい値のものとなる。 Further, the filtered signal y (b) at the sampling point b is a sampling value x (a−2), x (a−1), x (a), x (b), x (b + 1) within the range Wb, x (b + 2) is a weighted average, and within the range Wb, data x (a-2), x (a-1), and x (a) at the level of the signal x (a) are included. Is also used for the weighted average. As a result, the signal y (b) after filtering has a smaller value than the signal x (b) before filtering.

例えばここでビット拡張前のサンプリング点ａとサンプリング点ｂのデータ値レベル差が１６ビット幅データの１ＬＳＢ差である場合、それぞれの出力レベルは１６ビット幅データ処理では表現し得ないデータ値レベル差になってしまう。しかし本実施の形態の音声信号出力装置ではεフィルタリングの前にビット拡大処理しているため、図３（ｂ）の出力波形のように入力波形のデータ値レベル差を平滑化した波形にフィルタ処理することができる。 For example, when the data value level difference between the sampling point a and the sampling point b before bit extension is 1 LSB difference of 16-bit width data, each output level is a data value level difference that cannot be expressed by 16-bit width data processing. Become. However, since the audio signal output apparatus of this embodiment performs bit expansion processing before ε filtering, it performs filtering to a waveform in which the data value level difference of the input waveform is smoothed as in the output waveform of FIG. can do.

次に図４（ａ）に示すような入力信号波形を処理する場合、図２の関数ｆを用いたεフィルタによれば、サンプリング点ａの信号ｘ（ａ）とサンプリング点（ｂ）の信号ｘ（ｂ）のレベルの差が閾値εよりも大きいため、式（１）のｆ｛（ｘ（ｉ−ｋ）−ｘ（ｉ）｝＝０となり、サンプリング点ａ、ｂのフィルタリング後の出力ｙ（ａ）及びｙ（ｂ）は図４（ｂ）のように入力レベルｘ（ａ）、ｘ（ｂ）と同じとなる。これは、
サンプリング点ａのフィルタリング出力ｙ（ａ）を求めるときにはサンプリング点ｂの信号ｘ（ｂ）のレベルのデータｘ（ｂ）、ｘ（ｂ＋１）をすべてサンプリング点ａのデータ値ｘ（ａ）に置き換えて加重平均を求め、サンプリング点ｂのフィルタリング出力ｙ（ｂ）を求めるときにはサンプリング点ａの信号ｘ（ａ）のレベルのデータｘ（ａ−２）、ｘ（ａ−１）、ｘ（ａ）をすべてサンプリング点ｂのデータ値ｘ（ｂ）に置き換えて加重平均を求めることによる。
このような処理を行う結果、信号レベルの急峻な変化を劣化させることがない。
つまり、このεフィルタは入力信号の大振幅で急峻な変化を保持しながら、小振幅のデータ値変化をビット拡大処理との組み合わせによって平滑化したデータを生成することができる。Next, when an input signal waveform as shown in FIG. 4A is processed, the signal x (a) at the sampling point a and the signal at the sampling point (b) are obtained according to the ε filter using the function f in FIG. Since the difference in the level of x (b) is larger than the threshold value ε, f {(x (ik) −x (i)} = 0 in equation (1), and the output after filtering of the sampling points a and b y (a) and y (b) are the same as the input levels x (a) and x (b) as shown in FIG.
When obtaining the filtering output y (a) at the sampling point a, the data x (b) and x (b + 1) at the level of the signal x (b) at the sampling point b are all replaced with the data value x (a) at the sampling point a. When the weighted average is obtained and the filtering output y (b) at the sampling point b is obtained, the data x (a-2), x (a-1), x (a) at the level of the signal x (a) at the sampling point a is obtained. By replacing all the data values x (b) at the sampling point b and obtaining a weighted average.
As a result of such processing, a sharp change in signal level is not deteriorated.
That is, the ε filter can generate data obtained by smoothing a small amplitude data value change in combination with a bit expansion process while maintaining a large amplitude and steep change in the input signal.

ここで式（１）の係数ａ_ｋを制御すると、図３（ａ）の信号波形が入力されたとき、図３（ｂ）におけるサンプリング点ａ、ｂにおける出力レベルｙ（ａ）、ｙ（ｂ）の入力レベルｘ（ａ）、ｘ（ｂ）からの移動量即ち変化量｛ｙ（ａ）−ｘ（ａ）｝、｛ｙ（ｂ）−ｘ（ｂ）｝が変わる。
注目点サンプリング点と差分をとったサンプリング点の信号に対する係数値（加重値）が大きいほど、出力レベルの入力レベルからの変化量が大きくなる。
その変化量が大きければ入力データ値レベル変化はより平滑化されて平滑化された出力波形になる。
逆に変化量が小さければ閾値ε以下の入力データ値レベル変化に対しても入力信号の急峻な変化波形に近づいてくる。
変化量を小さくすると低域通過フィルタのカットオフ周波数を高くしたのと同様に、急峻な波形変化を残すようになってくる。Here, when the coefficient _ak of the equation (1) is controlled, when the signal waveform of FIG. 3A is input, the output levels y (a) and y (b) at the sampling points a and b in FIG. ) From the input levels x (a) and x (b), that is, changes {y (a) −x (a)} and {y (b) −x (b)}.
The larger the coefficient value (weighted value) for the signal at the sampling point that is different from the sampling point of interest, the greater the amount of change in the output level from the input level.
If the amount of change is large, the input data value level change is smoothed to a smoothed output waveform.
On the other hand, if the amount of change is small, the input signal has a steep change waveform even when the input data value level changes below the threshold ε.
When the amount of change is reduced, a steep waveform change remains as in the case where the cutoff frequency of the low-pass filter is increased.

エッジ保存型平滑化フィルタ部（一次元ｍ次εフィルタ）の特性（平滑化処理特性（低域通過フィルタ特性））を調整できるように実施の形態１（図１）内の判定付き加重平均部１０の代わりに、図５に示されるように係数プログラマブル判定付き加重平均部１２を用いることとしても良く、そうすることによって、係数(加重値)ａ_ｋを自由に設定できる機能を持った音声信号出力装置（図５）を構成することができる。
このような係数プログラマブル判定付き加重平均部１２は、例えば、第１乃至第ｍの判定部９−１〜９−ｍの出力ＥＥ（１）〜ＥＥ（ｍ）に対して式（１）或いは式（５）に準ずる計算を行なう際に用いられる係数ａ_ｋの値を外部からの設定に応じて変えることができるものである。Weighted average unit with determination in Embodiment 1 (FIG. 1) so that the characteristics (smoothing process characteristic (low-pass filter characteristic)) of the edge preserving type smoothing filter part (one-dimensional m-order ε filter) can be adjusted. In place of 10, a weighted average unit 12 with coefficient programmable determination may be used as shown in FIG. 5, and by doing so, an audio signal having a function capable of freely setting the coefficient (weighted value) _ak An output device (FIG. 5) can be configured.
Such a weighted average unit with coefficient programmable determination 12 is, for example, an expression (1) or an expression for the outputs EE (1) to EE (m) of the first to mth determination units 9-1 to 9-m. The value of the coefficient _ak used when the calculation according to (5) is performed can be changed according to the setting from the outside.

なお、図５に示される係数プログラマブル判定付き加重平均部１２は、図１の判定付き加重平均部１０に対して、係数が可変である点を除き、同様のものである。図５に示される他の構成要素も図１に同じ符号で示されるものと同様のものである。 The weighted average unit with coefficient programmable determination 12 shown in FIG. 5 is the same as the weighted average unit with determination 10 in FIG. 1 except that the coefficient is variable. The other components shown in FIG. 5 are the same as those shown in FIG.

式（１）のεフィルタ処理を実施すると、図３（ａ）、図４（ａ）の信号波形が入力されたとき、図３（ｂ）、図４（ｂ）における出力レベルｙ（ａ）、ｙ（ｂ）を決定するために、注目点の前後のサンプリングデータ（注目点を基準として（ｍ／２）サンプリング周期前のデータから｛（ｍ／２）−１｝サンプリング周期後のデータまでの加重平均を求めることになる。次数ｍが大きいほど注目点に対して広い範囲のデータを平滑化することになり、高周波成分が抑制されることになる。つまり低域通過フィルタのカットオフ周波数を低く設定する形になる。例えば実施の形態１（図１）内の判定付き加重平均部１０の代わりに、図６に示される次数可変判定付き加重平均部１３を用いることとしても良く、そうすることによって、一次元ｍ次εフィルタの次数ｍを可変して平滑化処理特性（低域通過フィルタ特性）を変化させることができる機能を持った音声信号出力装置を構成することができる。 When the ε filter processing of Expression (1) is performed, when the signal waveforms of FIG. 3A and FIG. 4A are input, the output level y (a) in FIG. 3B and FIG. , Y (b) to determine the sampling data before and after the target point (from the data before the sampling period (m / 2) to the data after the {(m / 2) -1} sampling period with reference to the target point) The higher the order m, the smoother the data in a wider range with respect to the point of interest, and the higher frequency components are suppressed, that is, the cut-off frequency of the low-pass filter. For example, instead of the weighted average unit with determination 10 in the first embodiment (FIG. 1), the weighted average unit 13 with order variable determination shown in FIG. By one dimension It is possible to configure the audio signal output device having the following ε variable to smoothing processing characteristics the degree m of a filter function which can be changed (the low-pass filter characteristic).

このような次数可変判定付き加重平均部１３は、例えば、第１乃至第ｍの判定部９−１〜９−ｍの出力ＥＥ（１）〜ＥＥ（ｍ）のうちの選択したもののみを用いて、式（１）或いは式（５）に準ずる計算を行なうことができるものであり、出力ＥＥ（１）〜ＥＥ（ｍ）のいずれを選択するか、及びこれらに対して掛けられる係数ａ_ｋの値を外部からの設定に応じて変えることができるものである。For example, the weighted average unit 13 with variable order of determination uses only one selected from the outputs EE (1) to EE (m) of the first to mth determination units 9-1 to 9-m. Thus, the calculation according to the formula (1) or the formula (5) can be performed, and any one of the outputs EE (1) to EE (m) is selected and the coefficient a _k to be multiplied by these is selected. The value of can be changed according to the setting from the outside.

なお、図６に示される次数可変判定付き加重平均部１３は、図１の判定付き加重平均部１０に対して、次数が可変である点を除き、同様のものである。図６に示される他の構成要素も図１に同じ符号で示されるものと同様のものである。 6 is the same as the weighted average unit with determination variable 10 shown in FIG. 1 except that the order is variable. The other components shown in FIG. 6 are the same as those shown in FIG.

本実施の形態ではビット拡張によって、ｎ＋αビットに増えたデータ幅を使って、小振幅変化部が滑らかに変化するようにエッジ保存型平滑化フィルタ処理を行う。これに伴い、小振幅変化雑音もフィルタ処理により、軽減される。つまり、ｎビットからｎ＋αビットにビット拡張する場合、ビット拡張後のデータに対するεフィルタの閾値εを２^αとする。（ビット拡張前の最小データ値レベル変化を、αビット拡張後のデータ値レベル変化として換算すると２^αになる。）こうすることで、εフィルタは急な大音量入力など音声信号の変化を保持しながら、実効的な量子化ビット数を増やすことができる。In this embodiment, edge preserving type smoothing filter processing is performed using the data width increased to n + α bits by bit expansion so that the small amplitude change portion changes smoothly. Along with this, small amplitude variation noise is also reduced by the filter processing. That is, when the bit extension from n bits to n + alpha bits, the threshold value ε of ε filter to the data after the bit extension and 2 ^alpha. (The minimum data value level change before the bit extension, becomes to the 2 ^alpha calculated as data value level change after alpha-bit extension.) In this way, epsilon filter holds the change of sudden loud input voice signal However, the effective number of quantization bits can be increased.

以下、例としてα＝２の場合、即ちｎビットの音声信号をｎ＋２ビットで量子化ビット不足改善処理する場合について説明する。なお、ＣＤなどのデジタル音声信号は１６ビット量子化で符号化されていることが規格となっており、８ビット増やして２４ビット符合に量子化ビット数拡大される場合がある。しかし以下では説明を簡単にするため、２ビット増やす場合について説明する。 Hereinafter, as an example, a case where α = 2, that is, a case where an n-bit audio signal is subjected to a quantization bit shortage improvement process with n + 2 bits will be described. Note that the standard is that digital audio signals such as CDs are encoded by 16-bit quantization, and there are cases where the number of quantization bits is increased to 24 bits by increasing 8 bits. However, hereinafter, in order to simplify the description, a case where 2 bits are increased will be described.

図７は、ｍ＝４の場合の実施の形態１の音声信号処理装置の構成を示す図である。閾値εは２のα乗（２乗）＝４として説明する。 FIG. 7 is a diagram illustrating the configuration of the audio signal processing device according to the first embodiment when m = 4. The description will be made assuming that the threshold ε is 2 to the power of α (square) = 4.

図１では、差分算出部８−１〜８−ｍに入力されるデータの符号としてＤＭ（１）〜ＤＭ（ｍ）が用いられているが、図７では、差分算出部８−１〜８−４に入力されるデータの符号として、ＤＭ（ｌ２）、ＤＭ（ｌ１）、ＤＭ（ｃ）、ＤＭ（ｒ１）を用いている。符号ＤＭ（ｌ２）、ＤＭ（ｌ１）、ＤＭ（ｃ）、ＤＭ（ｒ１）で表されるデータは、ｍ＝４の場合に符号ＤＭ（１）、ＤＭ（２）、ＤＭ（３）、ＤＭ（４）で表されるデータと同じである。差分算出部８−１〜８−４の出力ＥＤ（ｌ２）〜（ｒ１）、ε判定部９−１〜９−４の出力ＥＥ（ｌ２）〜ＥＥ（ｒ１）についても同様である。 In FIG. 1, DM (1) to DM (m) are used as codes of data input to the difference calculation units 8-1 to 8-m, but in FIG. 7, the difference calculation units 8-1 to 8-8 are used. DM (l2), DM (l1), DM (c), and DM (r1) are used as codes of data input to -4. Data represented by codes DM (l2), DM (l1), DM (c), DM (r1) is represented by codes DM (1), DM (2), DM (3), DM when m = 4. It is the same as the data represented by (4). The same applies to the outputs ED (l2) to (r1) of the difference calculation units 8-1 to 8-4 and the outputs EE (l2) to EE (r1) of the ε determination units 9-1 to 9-4.

図８（ａ）及び（ｂ）は、入力処理部２の動作を説明するための図である。横軸は時間軸ｉを示している。図８（ａ）はアナログの音声入力信号ＳＡの信号変化を示し、図８（ｂ）はデジタルの音声信号データＤＩのデータ値レベル変化を示している。 FIGS. 8A and 8B are diagrams for explaining the operation of the input processing unit 2. The horizontal axis indicates the time axis i. FIG. 8A shows the signal change of the analog audio input signal SA, and FIG. 8B shows the data value level change of the digital audio signal data DI.

図８（ａ）に示すようなアナログの音声入力信号ＳＡが、入力端子１より入力処理部２に入力される。入力処理部２は、アナログの音声入力信号ＳＡを図８（ｂ）に示すようなｎビットの音声信号データＤＩに変換して、拡張データ生成処理部３に出力する。拡張データ生成処理部３では、ｎビットの音声信号データＤＩは原データビット拡張部５に入力される。 An analog audio input signal SA as shown in FIG. 8A is input from the input terminal 1 to the input processing unit 2. The input processing unit 2 converts the analog audio input signal SA into n-bit audio signal data DI as shown in FIG. 8B and outputs it to the extended data generation processing unit 3. In the extension data generation processing unit 3, the n-bit audio signal data DI is input to the original data bit extension unit 5.

図８（ａ）に示した音声入力信号ＳＡは、信号レベルが緩やかに変化しているが、信号レベルの変化に対する量子化の分解能が低い（ビット数が少ない）ため、図８（ｂ）に示した音声信号データＤＩでは、２値のデータ値（ＹとＹ＋１）に変換されている。 The audio input signal SA shown in FIG. 8 (a) has a gentle change in signal level, but the quantization resolution with respect to the change in signal level is low (the number of bits is small), so FIG. 8 (b). The audio signal data DI shown is converted into binary data values (Y and Y + 1).

図９（ａ）及び（ｂ）は、原データビット拡張部５の動作を説明するための図である。
横軸は時間軸ｉを示している。図９（ａ）はｎビットの音声信号データＤＩのデータ値レベル変化を示し、図９（ｂ）はｎ＋２ビットの音声信号データＤＳのデータ値レベル変化を示している。原データビット拡張部５は、図９（ａ）に示すようなｎビットの音声信号データＤＩを２ビットだけビット拡張し（このビット拡張は、データＤＩの最下位ビットの下位側に、値が「００」の２桁のビットを付加することで行われる）、図９（ｂ）に示したようなｎ＋２ビットの音声信号データＤＳを一次元４次εフィルタ部１４に出力する。FIGS. 9A and 9B are diagrams for explaining the operation of the original data bit extension unit 5.
The horizontal axis indicates the time axis i. FIG. 9A shows the change in the data value level of the n-bit audio signal data DI, and FIG. 9B shows the change in the data value level of the n + 2 bit audio signal data DS. The original data bit extension unit 5 bit-extends n bits of audio signal data DI as shown in FIG. 9A by 2 bits (this bit extension has a value on the lower side of the least significant bit of the data DI. This is done by adding two-digit bits of “00”), and outputs n + 2 bits of audio signal data DS as shown in FIG. 9B to the one-dimensional fourth-order ε filter unit 14.

図９（ｂ）に示すようなｎ＋２ビットの音声信号データＤＳは、２ビット分ビット拡張されたため、図９（ａ）の音声信号データＤＩのデータ値レベルＹが４Ｙに、Ｙ＋１が４（Ｙ＋１）に変換されている。 Since the n + 2 bit audio signal data DS as shown in FIG. 9B has been expanded by 2 bits, the data value level Y of the audio signal data DI in FIG. 9A is 4Y, and Y + 1 is 4 (Y + 1). ).

なお、上記のように、デジタル音声信号の多くは１６ビットで規格化されている。この１６ビットの音声信号データは８ビット拡張される場合があり、そのときは音声信号データＤＩのデータ値レベルＹが２５６Ｙに、Ｙ＋１が２５６（Ｙ＋１）に変換されることになる。 As described above, most digital audio signals are standardized by 16 bits. The 16-bit audio signal data may be expanded by 8 bits. At this time, the data value level Y of the audio signal data DI is converted to 256Y, and Y + 1 is converted to 256 (Y + 1).

図７のデータ格納部７は時間軸方向に連続したｎ＋αビットの音声信号データの列のデータを保持して同時に出力するものであり、例えば図１０に示されるように構成されている。図示のデータ格納部７は、カスケード接続された４個のフリップフロップＦＦ_１〜ＦＦ_４を備え、入力される音声信号データＤＳを、サンプリングクロックに同期して、４個のフリップフロップＦＦ_１〜ＦＦ_４で転送する。即ち、入力される音声信号データＤＳを、フリップフロップＦＦ_１に、フリップフロップＦＦ_１の出力をフリップフロップＦＦ_２の入力に、フリップフロップＦＦ_２の出力をフリップフロップＦＦ_３の入力に、フリップフロップＦＦ_３の出力をフリップフロップＦＦ_４の入力に供給し、それぞれ保持させている。 The data storage unit 7 in FIG. 7 holds data in a sequence of n + α-bit audio signal data continuous in the time axis direction and outputs the data simultaneously, and is configured as shown in FIG. 10, for example. The illustrated data storage unit 7 includes four flip-flops FF_1 to FF_4 connected in cascade, and the input audio signal data DS is transferred by the four flip-flops FF_1 to FF_4 in synchronization with the sampling clock. . That is, the input audio signal data DS is input to the flip flop FF_1, the output of the flip flop FF_1 is input to the flip flop FF_2, the output of the flip flop FF_2 is input to the flip flop FF_3, and the output of the flip flop FF_3 is flip flop. FF_4 is supplied to the input and held.

この結果、入力される連続した４つのサンプリング点の音声信号データを４個のフリップフロップＦＦ_１〜ＦＦ_４で同時に保持し、同時に出力することができる。
即ち、フリップフロップＦＦ２に保持されているデータは、注目点（タイミング）ｃのデータ値レベルＤＭ（ｃ）を表すデータとして出力され、フリップフロップＦＦ_３に保持されているデータは、注目点の１サンプリング周期前の音声信号のデータ値レベルＤＭ（ｌ１）を表すデータとして出力され、フリップフロップＦＦ_４に保持されているデータは、注目点の２サンプリング周期前の音声信号のデータ値レベルＤＭ（ｌ２）を表すデータとして出力され、フリップフロップＦＦ_１に保持されているデータは、注目点の１サンプリング周期後の音声信号のデータ値レベルＤＭ（ｒ１）を表すデータとして出力される。As a result, the input audio signal data at four consecutive sampling points can be simultaneously held and output simultaneously by the four flip-flops FF_1 to FF_4.
That is, the data held in the flip-flop FF2 is output as data representing the data value level DM (c) of the attention point (timing) c, and the data held in the flip-flop FF_3 is one sampling of the attention point. The data that is output as data representing the data value level DM (l1) of the audio signal before the cycle and held in the flip-flop FF_4 is the data value level DM (l2) of the audio signal two samples before the target point. The data that is output as data that is held and held in the flip-flop FF_1 is output as data that represents the data value level DM (r1) of the audio signal after one sampling period of the target point.

図１１は、サンプリング点ｃを注目点とした場合のデータ格納部７の出力を示す。横軸は時間ｉを示し、縦軸はデータ格納部７に入力される音声信号データＤＳ及びデータ格納部から出力される音声信号データＤＭのデータ値レベルを示す。図示の例では、データ格納部７は、ＤＭ（ｌ２）＝４Ｙ、ＤＭ（ｌ１）＝４Ｙ、ＤＭ（ｃ）＝４Ｙ、ＤＭ（ｒ１）＝４Ｙ＋４を出力し、これらは第１の差分算出部８−１〜第４の差分算出部８−４に供給される。このように、データ格納部７は、注目点ｃの音声信号データＤＭ（ｃ）と、注目点に連続（前後）した、該注目点の周辺（前後）のサンプリング点の音声信号データＤＭ（ｌ２）、ＤＭ（ｌ１）、ＤＭ（ｒ１）とを同時に出力する。 FIG. 11 shows the output of the data storage unit 7 when the sampling point c is the point of interest. The horizontal axis represents time i, and the vertical axis represents the audio signal data DS input to the data storage unit 7 and the data value level of the audio signal data DM output from the data storage unit. In the illustrated example, the data storage unit 7 outputs DM (l2) = 4Y, DM (l1) = 4Y, DM (c) = 4Y, DM (r1) = 4Y + 4, which are the first difference calculation unit. 8-1 to the fourth difference calculation unit 8-4. As described above, the data storage unit 7 stores the audio signal data DM (c) at the point of interest c and the audio signal data DM (l2) at the sampling points around (at the front and rear of) the point of interest that is continuous (before and after) the point of interest. ), DM (l1), and DM (r1) are output simultaneously.

図７の差分算出部８−１〜８−４は、データ格納部７から出力されるデータのうちの、注目サンプリング点の音声信号データＤＭ（ｃ）と、該注目サンプリング点の前後のサンプリング点の音声信号データＤＭ（ｌ２）、ＤＭ（ｌ１）、ＤＭ（ｒ１）各々との差分を表す差分データを生成する。 The difference calculation units 8-1 to 8-4 in FIG. 7 include the audio signal data DM (c) at the target sampling point and the sampling points before and after the target sampling point among the data output from the data storage unit 7. Difference data representing differences from each of the audio signal data DM (l2), DM (l1), and DM (r1).

具体的には、第１の差分算出部８−１はサンプリング点ｌ２のデータ値ＤＭ（ｌ２）と注目点ｃ（サンプリング点ｃ）のデータ値ＤＭ（ｃ）の差分データＥＤ（ｌ２）を生成し、第１のε判定部９−１及び判定付き加重平均部１０に出力する。第２の差分算出部８−２〜第４の差分算出部８−４も同様にそれぞれ注目点前後のデータ値ＤＭ（ｌ１）〜ＤＭ（ｒ１）に基づいて差分データＥＤ（ｌ１）〜ＥＤ（ｒ１）を生成し、それぞれ第２のε判定部９−２〜第４のε判定部９−４及び判定付き加重平均部１０に出力する。このように、差分算出部８−１乃至８−４は、注目点の音声信号データＤＭ（ｃ）と、注目点に連続した、該注目点の周辺（前後）のサンプリング点の音声信号データＤＭ（ｌ２）、ＤＭ（ｌ１）、ＤＭ（ｒ１）の各々との差分を算出し、差分を表す差分データを出力する。 Specifically, the first difference calculation unit 8-1 generates difference data ED (l2) between the data value DM (l2) at the sampling point l2 and the data value DM (c) at the point of interest c (sampling point c). And output to the first ε determination unit 9-1 and the weighted average unit 10 with determination. Similarly, the second difference calculation unit 8-2 to the fourth difference calculation unit 8-4 also use the difference data ED (l1) to ED (based on the data values DM (l1) to DM (r1) before and after the attention point, respectively. r1) are generated and output to the second ε determination unit 9-2 to the fourth ε determination unit 9-4 and the weighted average unit with determination 10, respectively. As described above, the difference calculation units 8-1 to 8-4 each receive the audio signal data DM (c) at the point of interest and the audio signal data DM at the sampling points around the point of interest (before and after) of the point of interest. The difference between each of (l2), DM (l1), and DM (r1) is calculated, and difference data representing the difference is output.

図１２は、図１１のサンプリング点ｃを注目点としたときに、データ格納部７が出力するデータＤＭ（ｋ）、及びこれに基づいて第１の差分算出部８−１〜第４の差分算出部８−４、第１のε判定部９−１〜第４のε判定部９−４、及び判定付き加重平均部１０で生成されるデータＥＤ（ｋ）、ＥＥ（ｋ）、及びＥＭを示す表であり、判定付き加重平均部１０における演算の過程で得られるｆ｛ＥＤ（ｋ）｝の値も併せて示されている。 FIG. 12 shows the data DM (k) output from the data storage unit 7 when the sampling point c in FIG. 11 is the point of interest, and the first difference calculation unit 8-1 to the fourth difference based on the data DM (k). Data ED (k), EE (k), and EM generated by the calculation unit 8-4, the first ε determination unit 9-1 to the fourth ε determination unit 9-4, and the weighted average unit with determination 10 The value of f {ED (k)} obtained in the process of calculation in the weighted average unit with determination 10 is also shown.

第１の差分算出部８−１は、
ＥＤ（ｌ２）＝ＤＭ（ｌ２）−ＤＭ（ｃ）＝４Ｙ−４Ｙ＝０、
を出力し、第２の差分算出部８−２は、
ＥＤ（ｌ１）＝ＤＭ（ｌ１）−ＤＭ（ｃ）＝４Ｙ−４Ｙ＝０、
を出力し、第３の差分算出部８−３は、
ＥＤ（ｃ）＝ＤＭ（ｃ）−ＤＭ（ｃ）＝４Ｙ−４Ｙ＝０、
を出力し、第４の差分算出部８−４は、
ＥＤ（ｒ１）＝ＤＭ（ｒ１）−ＤＭ（ｃ）＝（４Ｙ＋４）−４Ｙ＝４、
を出力する。The first difference calculation unit 8-1
ED (l2) = DM (l2) -DM (c) = 4Y-4Y = 0,
The second difference calculation unit 8-2
ED (l1) = DM (l1) -DM (c) = 4Y-4Y = 0,
The third difference calculation unit 8-3
ED (c) = DM (c) -DM (c) = 4Y-4Y = 0,
The fourth difference calculation unit 8-4
ED (r1) = DM (r1) -DM (c) = (4Y + 4) -4Y = 4,
Is output.

ε判定部９−１〜９−４は、注目サンプリング点の前後のサンプリング点の各々について、差分算出部８−１〜８−４が出力する差分データＥＤ（ｌ２）〜ＥＤ（ｒ１）が閾値εより大きいか否かを示す判定データを生成する。
具体的には、第１のε判定部９−１〜第４のε判定部９−４は、差分値ＥＤ（ｋ）を受け、ＥＤ（ｋ）がεより大きいか否かの判定を行い、該判定結果を図１３（ａ）に示される判定データＥＥ（ｋ）として判定付き加重平均部１０に出力する。図示の例では、ＥＤ（ｋ）がεより大きい場合は「０」を、ε以下の場合は「１」をＥＥ（ｋ）として出力する。For each of the sampling points before and after the target sampling point, the ε determination units 9-1 to 9-4 use the difference data ED (l2) to ED (r1) output from the difference calculation units 8-1 to 8-4 as threshold values. Determination data indicating whether or not is larger than ε is generated.
Specifically, the first ε determination unit 9-1 to the fourth ε determination unit 9-4 receive the difference value ED (k) and determine whether ED (k) is greater than ε. The determination result is output to the weighted average unit 10 with determination as determination data EE (k) shown in FIG. In the illustrated example, “0” is output as EE (k) when ED (k) is greater than ε, and “1” is output as EE (k) when ED or less.

即ち、第１のε判定部９−１は、差分データＥＤ（ｌ２）がεよりも大きい場合は「０」を生成し、ε以下の場合は「１」を生成する。
第２のε判定部９−２〜第４のε判定部９−４も同様に差分データＥＤ（ｌ１）〜ＥＤ（ｒ１）に基づいて判定データＥＥ（ｌ１）〜ＥＥ（ｒ１）を生成する。
生成された判定データは、判定付き加重平均部１０に供給される。
ここでα＝２なので閾値ε＝４と設定されている。That is, the first ε determination unit 9-1 generates “0” when the difference data ED (l2) is larger than ε, and generates “1” when it is equal to or less than ε.
Similarly, the second ε determination unit 9-2 to the fourth ε determination unit 9-4 also generate determination data EE (l1) to EE (r1) based on the difference data ED (l1) to ED (r1). .
The generated determination data is supplied to the weighted average unit 10 with determination.
Since α = 2 here, the threshold ε = 4 is set.

図１２に示される例では、第１、第２、及び第３のε判定部９−１、９−２及び９−３には差分データＥＤ（ｌ２）、ＥＤ（ｌ１）、ＥＤ（ｃ）として「０」が入力され、第４のε判定部９−４には差分データＥＤ（ｒ１）として「４」が入力され、これらの差分データＥＤ（ｌ２）〜ＥＤ（ｒ１）は全てε以下であるため判定データＥＥ（ｌ２）〜ＥＥ（ｒ１）は全て「１」となる。 In the example shown in FIG. 12, the first, second, and third ε determination units 9-1, 9-2, and 9-3 include difference data ED (l2), ED (l1), and ED (c). “0” is input as “4”, and “4” is input as the difference data ED (r1) to the fourth ε determination unit 9-4, and these difference data ED (l2) to ED (r1) are all equal to or less than ε. Therefore, the determination data EE (l2) to EE (r1) are all “1”.

判定付き加重平均部１０は、注目サンプリング点の前後のサンプリング点の各々についてε判定部９−１〜９−４が出力する判定データＥＥ（ｌ２）〜ＥＥ（ｒ１）と差分算出部８−１〜８−４が出力する差分データＥＤ（ｌ２）〜ＥＤ（ｒ１）に基づいて加重平均値を生成する。具体的には、判定付き加重平均部１０は、判定データＥＥ（ｌ２）〜ＥＥ（ｒ１）に基づいて差分データＥＤ（ｌ２）〜ＥＤ（ｒ１）を加重平均して、その結果として得られる加重平均値を表す平均データＥＭを出力する。判定データが「１」の場合は、係数ａ_ｋに差分データを乗算して加算し、「０」の場合は加算しない。言い換えると、判定データの値に応じて、差分値ＥＤ（ｋ）に対して図１３（ｂ）に示す関係を有する関数ｆ｛ＥＤ（ｋ）｝の加重平均を求める演算を行っている。The weighted average unit with determination 10 includes determination data EE (l2) to EE (r1) output from the ε determination units 9-1 to 9-4 and the difference calculation unit 8-1 for each of the sampling points before and after the sampling point of interest. A weighted average value is generated based on the difference data ED (l2) to ED (r1) output by ˜8-4. Specifically, the weighted average unit 10 with determination performs weighted averaging of the difference data ED (l2) to ED (r1) based on the determination data EE (l2) to EE (r1), and the weight obtained as a result thereof Average data EM representing the average value is output. When the determination data is “1”, the coefficient _ak is multiplied by the difference data and added. When the determination data is “0”, no addition is performed. In other words, in accordance with the value of the determination data, an operation for obtaining a weighted average of the function f {ED (k)} having the relationship shown in FIG. 13B with respect to the difference value ED (k) is performed.

係数ａ_ｋ（＝ａ_ｌ２、ａ_ｌ１、ａ_ｃ、ａ_ｒ１）を全て０．２５とした場合（説明を簡単にするため、総和１の１／４にしている。）、判定データＥＥ（ｌ２）〜ＥＥ（ｒ１）が上記の例では全て「１」であるので、判定付き加重平均部１０の出力ＥＭは、以下の式（５）で与えられる。
ＥＭ
＝ａ_ｌ２×ＥＤ（ｌ２）＋ａ_ｌ１×ＥＤ（ｌ１）＋ａ_ｃ×ＥＤ（ｃ）＋ａ_ｒ１×ＥＤ（ｒ１）
＝０．２５×０＋０．２５×０＋０．２５×０＋０．２５×４
＝１
…（５）When the coefficients a _k (= a _l2 , a _l1 , a _c , a _r1 ) are all set to 0.25 (for the sake of simplicity, the sum is set to ¼ of the sum 1), the determination data EE (l2 ) To EE (r1) are all “1” in the above example, the output EM of the weighted average unit with determination 10 is given by the following equation (5).
EM
= A _l2 × ED (l2) + a _l1 × ED (l1) + a _c × ED (c) + a _r1 × ED (r1)
= 0.25 × 0 + 0.25 × 0 + 0.25 × 0 + 0.25 × 4
= 1
... (5)

原データビット拡張部５から供給される音声信号データＤＳに対し、各サンプリング点を順に注目点として上記の処理が行われ、式（５）と同様の演算が行われる。
図１４は、図１１の音声信号データＤＭに対応する加重平均値ＥＭ、即ち各サンプリング点を注目点としたときの加重平均値ＥＭを示す。図１４の横軸が注目点、縦軸が加重平均値ＥＭのデータ値を示している。判定付き加重平均部１０は、加重平均値ＥＭをデータ加算部１１に出力する。The above processing is performed on the audio signal data DS supplied from the original data bit expansion unit 5 with each sampling point as an attention point in order, and the same calculation as in Expression (5) is performed.
FIG. 14 shows the weighted average value EM corresponding to the audio signal data DM of FIG. 11, that is, the weighted average value EM when each sampling point is a point of interest. In FIG. 14, the horizontal axis indicates the attention point, and the vertical axis indicates the data value of the weighted average value EM. The weighted average unit 10 with determination outputs the weighted average value EM to the data adding unit 11.

図１５（ａ）〜（ｃ）は、データ加算部１１の動作を説明するための図である。横軸は時間軸でありサンプリングタイミングｉを示している。
図１５（ａ）は音声信号データＤＳのデータ値レベル変化を示し、図１５（ｂ）は加重平均値ＥＭのデータ値レベル変化を示し、図１５（ｃ）は音声信号データＤＯのデータ値レベル変化を示す。
データ加算部１１は、図１５（ａ）に示した音声信号データＤＳと、図１５（ｂ）に示した加重平均値ＥＭを加算して、図１５（ｃ）に示す音声信号データＤＯを出力する。例えば、図１５（ａ）〜（ｃ）におけるサンプリングタイミングｄでは（サンプリングタイミングｄを注目点として処理を行うときは）、ＤＳ（ｄ）＝４Ｙ、ＥＭ（ｄ）＝１なので
ＤＯ（ｄ）＝ＤＳ（ｄ）＋ＥＭ（ｄ）＝４Ｙ＋１
となる。FIGS. 15A to 15C are diagrams for explaining the operation of the data adding unit 11. The horizontal axis is the time axis and indicates the sampling timing i.
FIG. 15A shows the data value level change of the audio signal data DS, FIG. 15B shows the data value level change of the weighted average value EM, and FIG. 15C shows the data value level of the audio signal data DO. Showing change.
The data adder 11 adds the audio signal data DS shown in FIG. 15A and the weighted average value EM shown in FIG. 15B, and outputs the audio signal data DO shown in FIG. To do. For example, at the sampling timing d in FIGS. 15A to 15C (when processing is performed with the sampling timing d as a point of interest), since DS (d) = 4Y and EM (d) = 1, DO (d) = DS (d) + EM (d) = 4Y + 1
It becomes.

以上のように、本実施の形態は、データ値レベル変化が緩やかな領域において実効的な量子化ビット数を増やすことができる。ｎビットの音声信号をｎ＋２ビットに変換する場合、アナログ入力音声信号ＳＡが緩やかに変化し、これに対応してビット拡張された音声信号データＤＳのデータ値レベル変化が図１５（ａ）に示されるように、４Ｙから４Ｙ＋４へ（４＝２^２）だけジャンプする場合に、図１５（ｃ）に示すように、４Ｙ＋１、４Ｙ＋２、４Ｙ＋３を使って補間し、平滑化することができる。つまりビット拡張前では表現できなかった信号変化を得ることができ、原音（アナログ入力音声信号ＳＡ）波形にフィルタ処理前よりも近づけることができる。As described above, the present embodiment can increase the effective number of quantization bits in a region where the data value level changes gradually. When an n-bit audio signal is converted into n + 2 bits, the analog input audio signal SA changes gently, and the data value level change of the audio signal data DS bit-extended corresponding to this changes is shown in FIG. As shown in FIG. 15C, when jumping from 4Y to 4Y + 4 (4 = 2 ² ), interpolation can be performed using 4Y + 1, 4Y + 2, 4Y + 3, and smoothing can be performed. That is, a signal change that cannot be expressed before bit expansion can be obtained, and the waveform of the original sound (analog input sound signal SA) can be made closer than before the filter processing.

次に、εフィルタに信号振幅が急峻に大きく変化する（閾値εよりも大きい幅でステップ状に変化する）信号が入力された場合の動作を説明する。図１６（ａ）〜（ｅ）は、上記の音声信号処理装置に信号振幅が急峻に大きく変化する信号が入力された場合の動作を説明するための図である。横軸は時間軸でサンプリングタイミングｉを示している。
図１６（ａ）はアナログの音声入力信号ＳＡの振幅変化波形を示し、図１６（ｂ）はｎビットの音声信号データＤＩのデータ値レベル変化を示し、図１６（ｃ）はｎ＋αビットの音声信号データＤＳ、及びＤＭのデータ値レベル変化を示し、図１６（ｄ）は加重平均値ＥＭのデータ値レベル変化を示し、図１６（ｅ）はｎ＋αビットの音声信号データＤＯのデータ値レベル変化を示している。Next, an operation when a signal whose signal amplitude changes suddenly and greatly (changes stepwise with a width larger than the threshold ε) is input to the ε filter will be described. FIGS. 16A to 16E are diagrams for explaining the operation in the case where a signal whose signal amplitude changes suddenly and greatly is input to the audio signal processing apparatus. The horizontal axis represents the sampling timing i on the time axis.
FIG. 16A shows the amplitude change waveform of the analog audio input signal SA, FIG. 16B shows the data value level change of the n-bit audio signal data DI, and FIG. 16C shows the n + α-bit audio. FIG. 16D shows a data value level change of the weighted average value EM, and FIG. 16E shows a data value level change of the n + α-bit audio signal data DO. Is shown.

図１６（ａ）に示すようなアナログの音声信号ＳＡが、入力端子１より入力処理部２に入力される。入力処理部２は、アナログの音声信号ＳＡを図１６（ｂ）に示すようなデジタルの音声信号データＤＩ（ｎビットでは２値のデータ値レベル（ＹとＹ＋４））に変換して、原データビット拡張部５に出力する。 An analog audio signal SA as shown in FIG. 16A is input from the input terminal 1 to the input processing unit 2. The input processing unit 2 converts the analog audio signal SA into digital audio signal data DI (binary data value levels (Y and Y + 4 for n bits)) as shown in FIG. Output to the bit extension unit 5.

原データビット拡張部５では、図１６（ｂ）の音声信号データＤＩのデータ値レベルＹが４Ｙに、Ｙ＋４が４（Ｙ＋４）に変換され、図１６（ｃ）に示したような音声信号データＤＳを一次元４次εフィルタ部６に出力する。 In the original data bit expansion unit 5, the data value level Y of the audio signal data DI of FIG. 16B is converted to 4Y and Y + 4 is converted to 4 (Y + 4), and the audio signal data as shown in FIG. The DS is output to the one-dimensional fourth-order ε filter unit 6.

図１７は、図１６（ｃ）のサンプリング点ｃを注目点としたときに、データ格納部７が出力するデータＤＭ（ｋ）、及びこれに基づいて第１の差分算出部８−１〜第４の差分算出部８−４、第１のε判定部９−１〜第４のε判定部９−４、及び判定付き加重平均部１０で生成されるデータＥＤ（ｋ）、ＥＥ（ｋ）、ＥＭを示す表であり、判定付き加重平均部１０における演算の過程で得られるｆ｛ＥＤ（ｋ）｝の値も併せて示されている。 FIG. 17 shows the data DM (k) output from the data storage unit 7 when the sampling point c in FIG. 16C is the point of interest, and the first difference calculation units 8-1 to 8-1 based on the data DM (k). Data ED (k), EE (k) generated by the difference calculation unit 8-4, the first ε determination unit 9-1 to the fourth ε determination unit 9-4, and the weighted average unit 10 with determination , EM, and the value of f {ED (k)} obtained in the process of calculation in the weighted average unit 10 with determination is also shown.

データ格納部７はＤＭ（ｌ２）＝４Ｙ、ＤＭ（ｌ１）＝４Ｙ、ＤＭ（ｃ）＝４Ｙ、ＤＭ（ｒ１）＝４Ｙ＋１６を第１の差分算出部８−１〜第４の差分算出部８−４に出力する。 The data storage unit 7 sets DM (l2) = 4Y, DM (l1) = 4Y, DM (c) = 4Y, DM (r1) = 4Y + 16 to the first difference calculation unit 8-1 to the fourth difference calculation unit 8. Output to -4.

第１の差分算出部８−１は、
ＥＤ（ｌ２）＝ＤＭ（ｌ２）−ＤＭ（ｃ）＝４Ｙ−４Ｙ＝０、
を出力し、第２の差分算出部８−２は、
ＥＤ（ｌ１）＝ＤＭ（ｌ１）−ＤＭ（ｃ）＝４Ｙ−４Ｙ＝０、
を出力し、第３の差分算出部８−３は、
ＥＤ（ｃ）＝ＤＭ（ｃ）−ＤＭ（ｃ）＝４Ｙ−４Ｙ＝０、
を出力し、第４の差分算出部８−４は、
ＥＤ（ｒ１）＝ＤＭ（ｒ１）−ＤＭ（ｃ）＝（４Ｙ＋１６）−４Ｙ＝１６、
を出力する。The first difference calculation unit 8-1
ED (l2) = DM (l2) -DM (c) = 4Y-4Y = 0,
The second difference calculation unit 8-2
ED (l1) = DM (l1) -DM (c) = 4Y-4Y = 0,
The third difference calculation unit 8-3
ED (c) = DM (c) -DM (c) = 4Y-4Y = 0,
The fourth difference calculation unit 8-4
ED (r1) = DM (r1) -DM (c) = (4Y + 16) -4Y = 16,
Is output.

従って、第１のε判定部９−１〜第３のε判定部９−３には「０」が入力され、第４のε判定部９−４には「１６」が入力される。閾値εを「４」と設定した場合、ＥＤ（ｌ２）〜ＥＤ（ｃ）は閾値以下であるので判定データＥＥ（ｌ２）〜ＥＥ（ｃ）は「１」となり、ＥＤ（ｒ１）は閾値εより大きいのでＥＥ（ｒ１）は「０」となる。 Accordingly, “0” is input to the first ε determination unit 9-1 to the third ε determination unit 9-3, and “16” is input to the fourth ε determination unit 9-4. When the threshold value ε is set to “4”, the determination data EE (l2) to EE (c) are “1” because ED (l2) to ED (c) are equal to or less than the threshold value, and ED (r1) is equal to the threshold value ε. Since it is larger, EE (r1) becomes “0”.

判定付き加重平均部１０は、判定データが「１」の場合は係数ａ_ｋに差分データを乗算して加算し、「０」の場合は加算しない。係数ａ_ｋ（＝ａ_ｌ２、ａ_ｌ１、ａ_ｃ）を全て０．２５とした場合、判定付き加重平均部１０の出力ＥＭは、以下の式（６）で与えられる。
ＥＭ＝ａ_ｌ２×ＥＤ（ｌ２）＋ａ_ｌ１×ＥＤ（ｌ１）＋ａ_ｃ×ＥＤ（ｃ）
＝０．２５×０＋０．２５×０＋０．２５×０
＝０
…（６）The weighted average unit 10 with determination multiplies the coefficient _ak by the difference data when the determination data is “1”, and does not add it when it is “0”. When the coefficients a _k (= a _l2 , a _l1 , a _c ) are all 0.25, the output EM of the weighted average unit with determination 10 is given by the following equation (6).
EM = a _l2 × ED (l2) + a _l1 × ED (l1) + a _c × ED (c)
= 0.25 × 0 + 0.25 × 0 + 0.25 × 0
= 0
(6)

原データビット拡張部５から供給される音声信号データＤＳに対し、各サンプリングタイミングを順に注目点として上記の処理が行われ、式（６）と同様の演算が行われる。
図１６（ｄ）は、図１６（ｃ）の音声信号データＤＭに対応する加重平均値ＥＭ、即ち各サンプリングタイミングｉを注目点としたときの加重平均値ＥＭのデータ値レベル変化を示す。
図１６（ｄ）の横軸が注目点のサンプリングタイミングｉ、縦軸が加重平均値ＥＭのデータ値を示している。
判定付き加重平均部１０は、加重平均値ＥＭをデータ加算部１１に出力する。The above processing is performed on the audio signal data DS supplied from the original data bit expansion unit 5 with each sampling timing as an attention point in order, and the same calculation as in Expression (6) is performed.
FIG. 16D shows the data level change of the weighted average value EM corresponding to the audio signal data DM of FIG. 16C, that is, the weighted average value EM when each sampling timing i is the point of interest.
In FIG. 16D, the horizontal axis represents the sampling timing i of the target point, and the vertical axis represents the data value of the weighted average value EM.
The weighted average unit 10 with determination outputs the weighted average value EM to the data adding unit 11.

データ加算部１１は、図１６（ｃ）に示した音声信号データＤＳと、図１６（ｄ）に示した加重平均値ＥＭを加算して、図１６（ｅ）に示す音声信号データＤＯを出力する。 The data adder 11 adds the audio signal data DS shown in FIG. 16C and the weighted average value EM shown in FIG. 16D, and outputs the audio signal data DO shown in FIG. To do.

図１６（ｅ）に示されるように、音声信号データＤＯは、入力ＳＡ或いはＤＩと同様に急峻な変化を有し、εフィルタは信号振幅の変化が大きい場合、急峻に大きく変化する領域の鮮鋭度を保存することができる。 As shown in FIG. 16E, the audio signal data DO has a steep change similarly to the input SA or DI, and the ε filter sharpens a sharply changing region when the signal amplitude changes greatly. The degree can be saved.

以上のように、ｎビットの音声信号データをｎ＋αビットに増加させる量子化ビット不足改善処理を行なう場合、ビット拡張前の最小データ値レベル変化をさらに細かく補間できるように、閾値εを２のα乗と設定したとすると、入力音声信号と出力音声信号のビット分解能の差による量子化ビット数不足による信号波形歪みが低減される。すなわち、実効的な量子化ビット数を増やすことができる。また緩やかに信号レベルが変化する領域の量子化ビット数を増やす一方で、音声信号のダイナミックな振幅変化特性を劣化させないようにすることができる。
なお、閾値を２^αとするのは、２ビット拡張前の最小データ値レベル変化を、αビット拡張後のデータ値レベル変化として換算すると２^αになることを考慮したものであるが、実使用状態に応じて最適化を図るため閾値を２α以外の値に設定しても良い。As described above, when the quantization bit shortage improvement process for increasing the n-bit audio signal data to n + α bits is performed, the threshold ε is set to 2 α so that the minimum data value level change before bit expansion can be further finely interpolated. If the power is set, the signal waveform distortion due to the insufficient number of quantization bits due to the difference in bit resolution between the input audio signal and the output audio signal is reduced. That is, the effective number of quantization bits can be increased. Further, it is possible to increase the number of quantization bits in a region where the signal level changes gradually, while preventing the dynamic amplitude change characteristics of the audio signal from being deteriorated.
Incidentally, to the threshold value 2 ^alpha is the minimum data value level change 2 bits before expansion, but is taken into consideration that when converted becomes 2 ^alpha as a data value level change after alpha bit extension, actual use The threshold may be set to a value other than 2α for optimization according to the state.

図１８は、図１に示される音声信号出力装置に改変を加えた音声信号出力装置を示すブロック図である。
図１８の音声信号出力装置は、概して図１に示される音声信号出力装置と同じであるが、図１の出力処理部４の代わりに、デジタル式、あるいはアナログ式の振幅制御プログラマブルアンプ１５ａ内蔵の出力処理部１５が設けられている。
このプログラマブルアンプ１５ａは後述の比較部１６からの利得制御信号ＧＣに応じて利得（増幅率）を変更することができるものである。
比較部１６は、データ格納部７から出力される注目点のデータＤＭ（ｃ）とデータ加算部１１の出力ＤＯが入力され、これらの比に応じた利得制御信号ＧＣを出力する。この利得制御信号ＧＣは、出力処理部１５の振幅制御プログラマブルアンプ１５ａに供給される。FIG. 18 is a block diagram showing an audio signal output device obtained by modifying the audio signal output device shown in FIG.
The audio signal output device of FIG. 18 is generally the same as the audio signal output device shown in FIG. 1, but instead of the output processing unit 4 of FIG. 1, a digital or analog type amplitude control programmable amplifier 15a is incorporated. An output processing unit 15 is provided.
The programmable amplifier 15a can change the gain (amplification factor) in accordance with a gain control signal GC from the comparison unit 16 described later.
The comparison unit 16 receives the data DM (c) of the target point output from the data storage unit 7 and the output DO of the data addition unit 11 and outputs a gain control signal GC corresponding to the ratio between them. The gain control signal GC is supplied to the amplitude control programmable amplifier 15a of the output processing unit 15.

図１８に示される音声信号出力装置の動作を説明する。入力処理部２、原データビット拡張部５と一次元ｍ次εフィルタ部６は図１の例について説明したのと同様に動作する。
比較部１６ではデータ格納部７から出力される注目点のデータＤＭ（ｃ）とデータ加算部出力ＤＯが入力され、これらのデータを比較している。たとえばこれらのデータの比が一定以上になった場合、比較部１６の出力データで振幅制御プログラマブルアンプ１５ａの利得、従って出力処理部１５の出力の振幅を制御する。
例えば比較部１６内でそれぞれの入力の比（ＤＯ／ＤＭ（ｃ））を求め、この逆数を制御に用いる。即ち入力の比ＤＯ／ＤＭ（ｃ）が小さいほど、振幅制御プログラマブルアンプ１５ａの利得を大きくする。The operation of the audio signal output apparatus shown in FIG. 18 will be described. The input processing unit 2, the original data bit extension unit 5, and the one-dimensional mth-order ε filter unit 6 operate in the same manner as described in the example of FIG.
The comparison unit 16 receives the data DM (c) of the attention point output from the data storage unit 7 and the data addition unit output DO, and compares these data. For example, when the ratio of these data exceeds a certain level, the gain of the amplitude control programmable amplifier 15a, and hence the output amplitude of the output processing unit 15, is controlled by the output data of the comparison unit 16.
For example, the ratio (DO / DM (c)) of each input is obtained in the comparison unit 16, and this reciprocal is used for control. That is, the smaller the input ratio DO / DM (c), the larger the gain of the amplitude control programmable amplifier 15a.

このように構成することにより、一次元ｍ次εフィルタ部６の平滑化動作時に伴う低域通過フィルタの性能の影響を補正することができる。たとえば低域通過フィルタの性能によって信号データの高周波振幅が減少した場合、比較部１６に入力される注目点のデータＤＭ（ｃ）の振幅よりもデータ加算部出力ＤＯの振幅が小さくなることが判別できるため、これらのデータの比（ＤＯ／ＤＭ（ｃ））に基づいて定めた制御信号ＧＣを出力し、出力処理部１５を、平滑化波形を残しながら、振幅を大きくするように動作させる。このようにして周波数補正を行なうことができる。 With this configuration, it is possible to correct the influence of the performance of the low-pass filter that accompanies the smoothing operation of the one-dimensional mth-order ε filter unit 6. For example, when the high-frequency amplitude of the signal data is decreased due to the performance of the low-pass filter, it is determined that the amplitude of the data addition unit output DO is smaller than the amplitude of the data DM (c) of the attention point input to the comparison unit 16. Therefore, the control signal GC determined based on the ratio (DO / DM (c)) of these data is output, and the output processing unit 15 is operated to increase the amplitude while leaving the smoothed waveform. In this way, frequency correction can be performed.

なお、比較部１６の出力で、出力処理部１５を制御する代わりに、出力処理部１５より後段に配置された、別の振幅制御手段を制御する（それにより、データ加算部１１の出力を補正する）こととしても良く、その場合にも上記と同様な振幅補正効果を得ることができる。 In addition, instead of controlling the output processing unit 15 with the output of the comparison unit 16, another amplitude control unit disposed downstream from the output processing unit 15 is controlled (thereby correcting the output of the data adding unit 11). In this case, the same amplitude correction effect as described above can be obtained.

図１９は、以上に説明した本実施の形態に係る音声信号処理装置の処理工程を示すフローチャートである。以下の説明では、εフィルタ部の次数がｍであるものとし、データの符号として、図１で用いたのと同じもの、及びこれに準じたものが用いられる。 FIG. 19 is a flowchart showing the processing steps of the audio signal processing device according to the present embodiment described above. In the following description, it is assumed that the order of the ε filter section is m, and the same data code as that used in FIG.

まず、アナログ音声入力信号ＳＡが入力端子１に入力され、入力処理部２は、アナログ音声入力信号ＳＡを受信してｎビットの音声信号データＤＩを出力する（ＳＴ１）。
入力処理部２が出力する音声信号データＤＩは、拡張データ生成処理部３の原データビット拡張部５に入力され、原データビット拡張部５では音声信号データＤＩをαビットだけビット拡張してｎ＋αビットの音声信号データＤＳを出力する（ＳＴ２）。First, an analog audio input signal SA is input to the input terminal 1, and the input processing unit 2 receives the analog audio input signal SA and outputs n-bit audio signal data DI (ST1).
The audio signal data DI output from the input processing unit 2 is input to the original data bit extension unit 5 of the extension data generation processing unit 3, and the original data bit extension unit 5 bit-extends the audio signal data DI by α bits to n + α. Bit audio signal data DS is output (ST2).

データ格納部７には、ｎ＋αビットの音声信号データＤＳが入力され、時間軸上のｍ個（ｍ≧２^α）の連続したデータを保持して、ＤＭ（１）〜ＤＭ（ｍ）として出力する（ＳＴ３）。
第１の差分算出部８−１〜第ｍの差分算出部８−ｍには、連続するｍ個のデータＤＭ（１）〜ＤＭ（ｍ）が入力され、注目点ｃのデータと、その前後のサンプリングタイミングのデータの各々との差分をとり、差分データＥＤ（１）〜ＥＤ（ｍ）を出力する（ＳＴ４）。The data storage unit 7 receives n + α-bit audio signal data DS, holds m (m ≧ 2 ^α ) continuous data on the time axis, and outputs them as DM (1) to DM (m). (ST3).
The first difference calculation unit 8-1 to the m-th difference calculation unit 8-m receive m pieces of continuous data DM (1) to DM (m), and the data of the point of interest c and its front and back And difference data ED (1) to ED (m) are output (ST4).

第１のε判定部９−１〜第ｍのε判定部９−ｍには、差分データＥＤ（１）〜ＥＤ（ｍ）が入力され、加算を行うかどうかを示す判定データＥＥ（１）〜ＥＥ（ｍ）を出力する（ＳＴ５）。
判定付き加重平均部１０には、差分データＥＤ（１）〜ＥＤ（ｍ）と判定データＥＥ（１）〜ＥＥ（ｍ）が入力され、判定付き加重平均部１０は、判定データに基づいて差分データを加重平均し、加重平均値ＥＭを出力する（ＳＴ６）。Difference data ED (1) to ED (m) are input to the first ε determination unit 9-1 to the m-th ε determination unit 9-m, and determination data EE (1) indicating whether to perform addition. To EE (m) are output (ST5).
Difference data ED (1) to ED (m) and determination data EE (1) to EE (m) are input to the weighted average unit 10 with determination, and the weighted average unit 10 with determination uses the difference based on the determination data. Data is weighted and a weighted average value EM is output (ST6).

データ加算部１１には、加重平均値ＥＭと注目点ｃの原データＤＭ（ｃ）が入力され、データ加算部１１は、これらを加算して音声信号データＤＯを出力する（ＳＴ７）。音声信号データＤＯは出力処理部４に入力され、出力処理部４は音声信号データＤＯに基づいて、例えばＤ／Ａ変換処理して音声信号を出力する（ＳＴ８）。 The data adding unit 11 receives the weighted average value EM and the original data DM (c) of the point of interest c, and the data adding unit 11 adds these and outputs the audio signal data DO (ST7). The audio signal data DO is input to the output processing unit 4, and the output processing unit 4 performs, for example, D / A conversion processing based on the audio signal data DO and outputs an audio signal (ST8).

実施の形態２．
図２０は、本発明の実施の形態２の音声信号出力装置を示す図である。図２０の音声信号出力装置は、概して図１の音声信号出力装置と同じであるが、図１の原データビット拡張部５が設けられておらず、図１の判定付き加重平均部１０及びデータ加算部１１の代わりに、ビット拡張及び判定付き加重平均部１７及びビット拡張付きデータ加算部１８が設けられている。Embodiment 2. FIG.
FIG. 20 is a diagram showing an audio signal output apparatus according to Embodiment 2 of the present invention. The audio signal output device of FIG. 20 is generally the same as the audio signal output device of FIG. 1, but the original data bit extension unit 5 of FIG. 1 is not provided, and the weighted average unit 10 with determination and data of FIG. Instead of the addition unit 11, a weighted average unit 17 with bit extension and determination and a data addition unit 18 with bit extension are provided.

差分算出部８−１〜８−ｍ、及びε判定部９−１〜９−ｍはｎビットのデータに対する処理を行なう。ε判定部９−１〜９−ｍにおける閾値は「１」と設定される。これは、εフィルタ部の前にビット拡張部がないためである。 The difference calculation units 8-1 to 8-m and the ε determination units 9-1 to 9-m perform processing on n-bit data. The threshold value in the ε determination units 9-1 to 9-m is set to “1”. This is because there is no bit extension part before the ε filter part.

ビット機能拡張及び判定付き加重平均部１７は、加重平均演算及びビット拡張を行なって、αビットだけビット拡張された加重平均値ＥＭを出力する。これは例えば、図７などの例について説明したのと同様の判定付き加重平均の演算を行なったときに現れる小数点以下の桁の数値をも表すデータを平均データＥＭとして生成することで、このビット拡張を行なうことができる。
ビット拡張付きデータ加算部１８は、ビット拡張付きデータ加算部１８は、ビット拡張部１８ａと、データ加算部１８ｂとを有し、ビット拡張部１８ａで入力ＤＭ（ｃ）をｎ＋αビットにビット拡張してビット拡張されたデータＤＭａ（ｃ）を出力する。
データ加算部１８ｂは、ビット拡張されたデータＤＭａ（ｃ）と、ビット拡張及び判定付き加重平均部１７の出力ＥＭとを加算して、加算結果を表すデータＤＯを出力する。The weighted average unit 17 with bit function extension and determination performs a weighted average operation and bit extension, and outputs a weighted average value EM that is bit-extended by α bits. For example, this bit is generated by generating as the average data EM data representing the numerical values of the decimal places that appear when the weighted average with determination similar to that described in the example of FIG. 7 is performed. Can be extended.
The data adder with bit extension 18 has a bit adder 18a and a data adder 18b. The bit extender 18a bit-extends the input DM (c) to n + α bits. The data DMa (c) expanded in bit is output.
The data adding unit 18b adds the bit expanded data DMa (c) and the output EM of the weighted average unit 17 with bit extension and determination, and outputs data DO representing the addition result.

このように、図２０の音声信号出力装置では、ビット拡張をεフィルタ部の前で行なわず、εフィルタの演算中に行なう。以上のように構成することで、図１に示した例と同等の効果が得られる。 As described above, in the audio signal output device of FIG. 20, the bit expansion is not performed before the ε filter unit, but is performed during the calculation of the ε filter. By configuring as described above, the same effect as the example shown in FIG. 1 can be obtained.

図２１は、図２０に示される音声信号出力装置に改変を加えた音声信号出力装置を示すブロック図である。
図２１の音声信号出力装置は、概して図２０に示される音声信号出力装置と同じであるが、図２０の出力処理部４の代わりに、デジタル式、あるいはアナログ式の振幅制御プログラマブルアンプ１５ａ内蔵の出力処理部１５が設けられている。
このプログラマブルアンプ１５ａは、図１８に示すものと同様に、後述の比較部１６からの利得制御信号ＧＣに応じて利得（増幅率）を変更することができるものである。FIG. 21 is a block diagram showing an audio signal output device obtained by modifying the audio signal output device shown in FIG.
The audio signal output device of FIG. 21 is generally the same as the audio signal output device shown in FIG. 20, but instead of the output processing unit 4 of FIG. 20, a digital or analog amplitude control programmable amplifier 15a is incorporated. An output processing unit 15 is provided.
This programmable amplifier 15a can change the gain (amplification factor) in accordance with a gain control signal GC from the comparison unit 16, which will be described later, as shown in FIG.

ビット拡張付きデータ加算部１８は、図２０のビット拡張付きデータ加算部１８と同様に、ビット拡張部１８ａと、データ加算部１８ｂとを有し、ビット拡張部１８ａで入力ＤＭ（ｃ）をｎ＋αビットにビット拡張してビット拡張されたデータＤＭａ（ｃ）を出力する。
データ加算部１８ｂでは、ビット拡張されたデータＤＭａ（ｃ）と、平均データＥＭを加算して、加算結果を表すデータＤＯを出力する。
ビット拡張部１８ａの出力は、ビット拡張付きデータ加算部１８の外部にも出力され、比較部１６に供給される。Similarly to the data addition unit 18 with bit extension of FIG. 20, the data addition unit 18 with bit extension has a bit extension unit 18a and a data addition unit 18b. The bit extension unit 18a receives the input DM (c) as n + α. Bit extended data DMa (c) is output.
The data adder 18b adds the bit expanded data DMa (c) and the average data EM, and outputs data DO representing the addition result.
The output of the bit expansion unit 18 a is also output to the outside of the data addition unit 18 with bit expansion and is supplied to the comparison unit 16.

比較部１６は、ビット拡張された注目点のデータＤＭａ（ｃ）とデータ加算部１８ｂにおける加算結果出力ＤＯが入力され、これらの比に応じた利得制御信号ＧＣを出力する。
この利得制御信号ＧＣは、出力処理部１５の振幅制御プログラマブルアンプ１５ａに供給される。The comparison unit 16 receives the bit-expanded point-of-interest data DMa (c) and the addition result output DO from the data addition unit 18b, and outputs a gain control signal GC corresponding to the ratio.
The gain control signal GC is supplied to the amplitude control programmable amplifier 15a of the output processing unit 15.

図２１に示される音声信号出力装置の動作を説明する。入力処理部２、及びビット拡張付き一次元ｍ次εフィルタ１９は、図２０の例について説明したのと同様に動作する。
比較部１６ではビット拡張部１８ａから出力されるαビット拡張された注目点のデータＤＭａ（ｃ）と、データ加算部１８ｂの加算結果出力ＤＯが入力され、これらのデータを比較している。たとえばこれらのデータの比が一定以上になった場合、比較部１６の出力データで振幅制御プログラマブルアンプ１５ａの利得、従って出力処理部１５の出力の振幅を制御する。
例えば比較部１６内でそれぞれの入力の比（ＤＯ／ＤＭ（ｃ））を求め、この逆数を制御に用いる。即ち入力の比ＤＯ／ＤＭ（ｃ）が小さいほど、振幅制御プログラマブルアンプ１５ａの利得を大きくする。The operation of the audio signal output device shown in FIG. 21 will be described. The input processing unit 2 and the one-dimensional m-order ε filter 19 with bit extension operate in the same manner as described in the example of FIG.
The comparison unit 16 receives the α-bit expanded point-of-interest data DMa (c) output from the bit expansion unit 18a and the addition result output DO of the data addition unit 18b, and compares these data. For example, when the ratio of these data exceeds a certain level, the gain of the amplitude control programmable amplifier 15a, and hence the output amplitude of the output processing unit 15, is controlled by the output data of the comparison unit 16.
For example, the ratio (DO / DM (c)) of each input is obtained in the comparison unit 16, and this reciprocal is used for control. That is, the smaller the input ratio DO / DM (c), the larger the gain of the amplitude control programmable amplifier 15a.

このように構成することにより、ビット拡張付き一次元ｍ次εフィルタ部１９の平滑化動作時に伴う低域通過フィルタ性能の影響を補正することができる。たとえば低域通過フィルタの性能によって信号データの高周波振幅が減少した場合、比較部１６に入力される注目点のデータＤＭａ（ｃ）の振幅よりもデータ加算部１８ｂの加算結果出力ＤＯの振幅が小さくなることが判別できるため、これらのデータの比（ＤＯ／ＤＭ（ｃ））に基づいて定めた制御信号ＧＣを出力し、出力処理部１５を、平滑化波形を残しながら、振幅を大きくするように動作させる。 With this configuration, it is possible to correct the influence of the low-pass filter performance that accompanies the smoothing operation of the one-dimensional m-order ε filter unit 19 with bit extension. For example, when the high-frequency amplitude of the signal data is reduced due to the performance of the low-pass filter, the amplitude of the addition result output DO of the data addition unit 18b is smaller than the amplitude of the data DMa (c) of the target point input to the comparison unit 16. Since the control signal GC determined based on the ratio of these data (DO / DM (c)) is output, the output processing unit 15 increases the amplitude while leaving the smoothed waveform. To work.

なお、図１８の例について述べたのと同様、比較部１６の出力で、出力処理部１５を制御する代わりに、出力処理部１５より後段に配置された、別の振幅制御手段を制御する（それにより、データ加算部１１の出力を補正する）こととしても良く、その場合にも上記と同様な振幅補正効果を得ることができる。 As described in the example of FIG. 18, instead of controlling the output processing unit 15 with the output of the comparison unit 16, another amplitude control unit arranged downstream from the output processing unit 15 is controlled ( In this case, the same amplitude correction effect as described above can be obtained.

図２０、図２１を参照して説明した実施の形態に対しても、図５，図６を参照して説明したのと同様の変形(係数プログラマブル判定付き加重平均部１２の使用、或いは次数可変判定付き加重平均部１３の使用)を加えることが可能である。 20 and FIG. 21, the same modification as that described with reference to FIG. 5 and FIG. 6 (use of weighted average unit 12 with coefficient programmable determination, or variable order) Use of the weighted average unit 13 with determination) can be added.

実施の形態３．
図２２は、本発明の実施の形態３に係る音声信号処理装置の構成を示す図である。実施の形態３に係る音声信号処理装置は、入力端子２１、周波数振幅推定部２２、フィルタ係数生成部２３、原データビット拡張部２４、及びエッジ保存型平滑化フィルタ部２５を備える。Embodiment 3 FIG.
FIG. 22 is a diagram showing a configuration of an audio signal processing device according to Embodiment 3 of the present invention. The audio signal processing apparatus according to Embodiment 3 includes an input terminal 21, a frequency amplitude estimation unit 22, a filter coefficient generation unit 23, an original data bit extension unit 24, and an edge preserving smoothing filter unit 25.

入力端子２１には、ｎビット（ｎは正の整数）の音声信号Ｘが供給される。音声信号Ｘは、図１の音声信号データＤＩに相当するものである。図２３は、実施の形態３のｎビットの音声信号の一例を示す図である。図２３（ａ）の横軸は時間ｉ、縦軸は信号レベルを示している。図２３（ａ）に示すように、この音声信号は、相連続するサンプリング点毎の、デジタル信号の列であり、時間（を表す数値）ｉは、サンプリング点毎に１ずつ増加する。
図２３（ｂ）の横軸は周波数ｆ、縦軸はパワーおよびゲインを示している。図２３（ａ）はｎビットで量子化された周波数ｆ１の正弦波の音声信号を示し、図２３（ｂ）の実線は図２３（ａ）の音声信号の周波数スペクトル、点線は（後述の）低域通過フィルタの周波数特性を示している。図２３（ｂ）の周波数スペクトルが示すように周波数ｆ１の正弦波であるにもかかわらず量子化によりｆ２やｆ３のような高調波が存在する。An n-bit (n is a positive integer) audio signal X is supplied to the input terminal 21. The audio signal X corresponds to the audio signal data DI in FIG. FIG. 23 is a diagram illustrating an example of an n-bit audio signal according to the third embodiment. In FIG. 23A, the horizontal axis represents time i, and the vertical axis represents signal level. As shown in FIG. 23A, this audio signal is a sequence of digital signals for each successive sampling point, and the time (representing numerical value) i increases by one for each sampling point.
In FIG. 23B, the horizontal axis indicates frequency f, and the vertical axis indicates power and gain. FIG. 23 (a) shows a sine wave audio signal of frequency f1 quantized with n bits, the solid line in FIG. 23 (b) is the frequency spectrum of the audio signal in FIG. 23 (a), and the dotted line is (described later). The frequency characteristic of a low-pass filter is shown. As shown in the frequency spectrum of FIG. 23 (b), harmonics such as f2 and f3 exist due to quantization even though it is a sine wave of frequency f1.

本実施の形態ではｎビットの音声信号の高調波成分を除去し、量子化による音声信号の波形歪みを補正する。 In this embodiment, harmonic components of an n-bit audio signal are removed, and waveform distortion of the audio signal due to quantization is corrected.

具体的には、まず、音声信号から周波数を推定し、推定した周波数の高調波を除去する周波数特性を有する低域通過フィルタのフィルタ係数を生成する。また、周波数だけではなく振幅も推定し、低域通過フィルタのフィルタ係数の次数を決定する。次数の設定についてはエッジ保存型平滑化フィルタと共に説明する。 Specifically, first, a frequency is estimated from the audio signal, and a filter coefficient of a low-pass filter having frequency characteristics for removing harmonics of the estimated frequency is generated. Further, not only the frequency but also the amplitude is estimated, and the order of the filter coefficient of the low-pass filter is determined. The setting of the order will be described together with the edge preserving smoothing filter.

例えば、図２３（ｂ）の実線が示すような周波数ｆ１と推定した場合、図２３（ｂ）の点線のようなカットオフ周波数特性の低域通過フィルタ係数を生成する。周波数ｆ１が推定できればｆ２は自明であるため、ｆ１とｆ２の間にｆｃ１を設定すればよい。 For example, when the frequency f1 is estimated as indicated by the solid line in FIG. 23B, a low-pass filter coefficient having a cutoff frequency characteristic as indicated by the dotted line in FIG. 23B is generated. Since f2 is obvious if the frequency f1 can be estimated, fc1 may be set between f1 and f2.

次に、生成した低域通過フィルタ係数を用いてビット拡張した音声信号に対して平滑化を行うことで、高調波成分を除去して量子化による波形歪みを補正する。平滑化にはエッジ保存型平滑化フィルタを用いる。 Next, smoothing is performed on the audio signal that has been bit-extended using the generated low-pass filter coefficient, thereby removing harmonic components and correcting waveform distortion due to quantization. An edge preserving smoothing filter is used for smoothing.

エッジ保存型平滑化フィルタは、実施の形態１に関連して説明したように、急峻で大きな変化が存在する部分の先鋭度を保存しながら、小さな変化のみを平滑化するフィルタであり、εフィルタ、トリムド平均値フィルタ（ＤＷ−ＭＴＭフィルタ）、バイラテラルフィルタなどがある。εフィルタ及びトリムド平均値フィルタ（ＤＷ−ＭＴＭフィルタ）は上記の非特許文献１に説明されている。以下、例としてエッジ保存型平滑化フィルタをεフィルタとして説明する。 As described in connection with the first embodiment, the edge-preserving smoothing filter is a filter that smoothes only small changes while preserving the sharpness of a portion where there is a steep and large change, and an ε filter. , Trimmed average filter (DW-MTM filter), bilateral filter, and the like. The ε filter and the trimmed average value filter (DW-MTM filter) are described in Non-Patent Document 1 described above. Hereinafter, an edge preserving smoothing filter will be described as an ε filter as an example.

図２４は、εフィルタの動作を説明するための図である。図２４（ａ）はεフィルタの入力信号ｘ（ｉ）を示し、図２４（ｂ）は図２４（ａ）の位置ｉ１を注目点とした時の
ｆ｛ｘ（ｉ１−ｋ）−ｘ（ｉ１）｝
を示している。FIG. 24 is a diagram for explaining the operation of the ε filter. FIG. 24A shows an input signal x (i) of the ε filter, and FIG. 24B shows f {x (i1-k) −x (when the position i1 in FIG. i1)}
Is shown.

図２４（ａ）のような緩やかな傾斜を持つ信号がεフィルタの入力信号の場合、次数が大きすぎると注目点と周辺の点の差分が閾値を超えてしまい、図２４（ｂ）のようにその領域が０とされて重み付平均を取るため平滑化の効果が弱まる。 When the signal having a gentle slope as shown in FIG. 24A is the input signal of the ε filter, if the order is too large, the difference between the attention point and the surrounding points exceeds the threshold value, as shown in FIG. Since the area is set to 0 and the weighted average is taken, the smoothing effect is weakened.

εフィルタなどのエッジ保存型平滑化フィルタの多くは低域通過フィルタに比べて、急峻で大きな変化を保存できる利点がある。しかし、信号が緩やかな傾斜を持つ場合、平滑化の性能が低くなる。そこで、実施の形態３では音声信号の周波数と振幅を推定することにより信号の傾斜を考慮している。周波数が同じで振幅が大きい場合、傾斜は急になるのでフィルタの次数を小さくし、εフィルタでの平滑化の性能を上げている。 Many edge-preserving smoothing filters such as the ε filter have the advantage of being able to preserve steep and large changes compared to low-pass filters. However, when the signal has a gentle slope, the smoothing performance is low. In the third embodiment, the inclination of the signal is taken into account by estimating the frequency and amplitude of the audio signal. When the frequency is the same and the amplitude is large, the slope becomes steep, so the order of the filter is reduced and the smoothing performance of the ε filter is improved.

実施の形態３の構成では、まずｎビットの音声信号Ｘが入力端子２１から周波数振幅推定部２２及び原データビット拡張部２４に入力される。周波数振幅推定部２２は、入力された音声信号Ｘの周波数と振幅を推定してフィルタ係数生成部２３に周波数Ｆと振幅Ａを出力する。フィルタ係数生成部２３は、周波数Ｆと振幅Ａに基づいて低域通過フィルタ係数Ｃを生成し、エッジ保存型平滑化フィルタ部２５に出力する。原データビット拡張部２４は、図１などの原データビット拡張部５と同様のものであり、ｎビットの音声信号Ｘをαビット分ビット拡張したｎ＋αビットの音声信号Ｘ’をエッジ保存型平滑化フィルタ部２５に出力する。エッジ保存型平滑化フィルタ部２５は、ｎ＋αビットの音声信号Ｘ’に対して低域通過フィルタ係数Ｃを用いてエッジ保存型平滑化フィルタを使って平滑化処理し、ｎ＋αビットの音声信号Ｙを出力する。エッジ保存型平滑化フィルタ部２５としては、図５に示される係数プログラマブル判定付き加重平均部１２を備えた一次元ｍ次εフィルタ部６を用いることができる。 In the configuration of the third embodiment, an n-bit audio signal X is first input from the input terminal 21 to the frequency amplitude estimation unit 22 and the original data bit extension unit 24. The frequency amplitude estimation unit 22 estimates the frequency and amplitude of the input audio signal X and outputs the frequency F and the amplitude A to the filter coefficient generation unit 23. The filter coefficient generation unit 23 generates a low-pass filter coefficient C based on the frequency F and the amplitude A, and outputs the low-pass filter coefficient C to the edge preserving smoothing filter unit 25. The original data bit extension unit 24 is the same as the original data bit extension unit 5 shown in FIG. 1 and the like, and an edge-preserving smoothing is performed on an n + α bit audio signal X ′ obtained by extending the n bit audio signal X by α bits. To the filter 25. The edge-preserving smoothing filter unit 25 smoothes the n + α-bit speech signal X ′ using the low-pass filter coefficient C using the edge-preserving smoothing filter, and converts the n + α-bit speech signal Y into the n + α-bit speech signal Y. Output. As the edge preserving smoothing filter unit 25, the one-dimensional m-order ε filter unit 6 including the weighted average unit 12 with coefficient programmable determination shown in FIG. 5 can be used.

図２３（ａ）のようなｎビットで量子化された周波数ｆ１の正弦波の音声信号Ｘが入力された場合の実施の形態３の動作を説明する。音声信号Ｘは周波数振幅推定部２２と原データビット拡張部２４に入力される。 The operation of the third embodiment when a sinusoidal audio signal X having a frequency f1 quantized with n bits as shown in FIG. 23A is input will be described. The audio signal X is input to the frequency amplitude estimation unit 22 and the original data bit extension unit 24.

周波数振幅推定部２２では、音声信号Ｘから周波数Ｆ＝ｆ１と振幅Ａ＝２を推定し、フィルタ係数生成部２３に周波数Ｆ＝ｆ１と振幅Ａ＝２を出力する。 The frequency amplitude estimation unit 22 estimates the frequency F = f1 and the amplitude A = 2 from the audio signal X, and outputs the frequency F = f1 and the amplitude A = 2 to the filter coefficient generation unit 23.

フィルタ係数生成部２３では、周波数ｆ１の高調波成分を除去するようなカットオフ周波数ｆｃ１と、周波数Ｆ＝ｆ１と振幅Ａ＝２に基づいた次数を有するフィルタ係数を生成し、エッジ保存型平滑化フィルタ部２５に出力する。 The filter coefficient generation unit 23 generates a filter coefficient having a cut-off frequency fc1 that removes the harmonic component of the frequency f1, and an order based on the frequency F = f1 and the amplitude A = 2, and performs edge-preserving smoothing Output to the filter unit 25.

図２５は、原データビット拡張部２４の動作を説明するための図である。横軸は時間ｉを示し、縦軸は信号レベルを示している。図２５（ａ）はｎビットの音声信号Ｘを示し、図２５（ｂ）はｎ＋αビットの音声信号Ｘ’を示している。原データビット拡張部２４は、図２５（ａ）に示すようなｎビットの音声信号をαビットだけビット拡張し、図２５（ｂ）に示したようなｎ＋αビットの音声信号をエッジ保存型平滑化フィルタ部２５に出力する。 FIG. 25 is a diagram for explaining the operation of the original data bit extension unit 24. The horizontal axis indicates time i, and the vertical axis indicates the signal level. FIG. 25A shows an n-bit audio signal X, and FIG. 25B shows an n + α-bit audio signal X ′. The original data bit extension unit 24 bit-extends the n-bit audio signal as shown in FIG. 25A by α bits, and converts the n + α-bit audio signal as shown in FIG. To the filter 25.

エッジ保存型平滑化フィルタ部２５ではεフィルタを使って平滑化処理を行う。εフィルタはビット拡張により増えたビット数を小振幅成分として扱う。つまり、ｎビットからｎ＋αビットにビット拡張する場合、εフィルタの閾値εは２のα乗とする。これにより、ｎビットにおける１ＬＳＢ以下の段差を平滑化する。 The edge preserving type smoothing filter unit 25 performs a smoothing process using an ε filter. The ε filter treats the number of bits increased by bit expansion as a small amplitude component. That is, when bit expansion is performed from n bits to n + α bits, the threshold ε of the ε filter is set to 2 to the power of α. Thereby, a step of 1 LSB or less in n bits is smoothed.

図２６は、エッジ保存型平滑化フィルタ部２５の動作を説明するための図である。横軸は時間ｉを示し、縦軸は信号レベルを示している。図２６（ａ）はｎ＋αビットの音声信号Ｘ’（ｉ）を示し、図２６（ｂ）はεフィルタ処理されたｎ＋αビットの音声信号Ｙ（ｉ）を示している。エッジ保存型平滑化フィルタ部２５は、図２６（ａ）に示すようなｎ＋αビットの音声信号から高調波除去することで波形歪みを補正し、図２６（ｂ）に示したようなｎ＋αビットの音声信号を出力する。 FIG. 26 is a diagram for explaining the operation of the edge preserving smoothing filter unit 25. The horizontal axis indicates time i, and the vertical axis indicates the signal level. 26A shows an n + α-bit audio signal X ′ (i), and FIG. 26B shows an n + α-bit audio signal Y (i) subjected to the ε filter process. The edge preserving smoothing filter unit 25 corrects the waveform distortion by removing harmonics from the n + α-bit audio signal as shown in FIG. 26 (a), and the n + α-bit as shown in FIG. 26 (b). Output audio signals.

このように、実施の形態３はｎビットの音声信号から高調波除去することで波形歪みを補正したｎ＋αビットの音声信号を出力することができる。 As described above, the third embodiment can output an n + α-bit audio signal in which waveform distortion is corrected by removing harmonics from the n-bit audio signal.

図２７は、周波数振幅推定部２２の詳細な構成を示す図である。周波数振幅推定部２２は、図２７に示すように、変曲点検出部２７、周波数推定部２８、及び振幅推定部２９を備えている。 FIG. 27 is a diagram illustrating a detailed configuration of the frequency amplitude estimation unit 22. As shown in FIG. 27, the frequency amplitude estimation unit 22 includes an inflection point detection unit 27, a frequency estimation unit 28, and an amplitude estimation unit 29.

変曲点検出部２７は、ｎビットの音声信号が一連の単調増加を開始する点又は終了する点及び一連の単調減少を開始する点又は終了する点を変曲点として検出する。ここで、「一連の単調増加」とは、途中で減少が生じることのない（同じ値が続くことはあっても良い）増加の連続を意味する。この一連の単調増加の区間内では、
Ｘ（ｉ）≦Ｘ（ｉ＋１）
の関係が連続して（すべてのサンプリング点ｉにおいて）満たされる。
同様に、「一連の単調減少」とは、途中で増加が生じることのない（同じ値が続くことはあっても良い）減少の連続を意味する。この一連の単調減少の区間内では、
Ｘ（ｉ）≧Ｘ（ｉ＋１）
の関係が連続して（全てのサンプリング点ｉにおいて）満たされる。The inflection point detection unit 27 detects a point at which the n-bit audio signal starts or ends a series of monotone increases and a point at which a series of monotone decreases starts or ends as inflection points. Here, “a series of monotonous increases” means a continuous increase in which no decrease occurs in the middle (the same value may continue). Within this series of monotonically increasing intervals,
X (i) ≦ X (i + 1)
Are continuously satisfied (at all sampling points i).
Similarly, “a series of monotonous decreases” means a series of decreases in which no increase occurs in the middle (the same value may continue). Within this series of monotonically decreasing intervals,
X (i) ≧ X (i + 1)
Are continuously satisfied (at all sampling points i).

図示の変曲点検出部２７は、一次微分算出部３１及び符号変化点検出部３２を備える。
一次微分算出部３１は、入力されたｎビットの音声信号から一次微分データＤを算出して出力する。例えば、一次微分算出部３１は、
各サンプリング点における前記ｎビットの音声信号をＸ（ｉ）で表し、
次のサンプリング点における前記ｎビットの音声信号をＸ（ｉ＋１）で表すとき、
Ｄ（ｉ）＝Ｘ（ｉ＋１）−Ｘ（ｉ）
で得られるＤ（ｉ）を、一次微分データとして出力する。The illustrated inflection point detection unit 27 includes a first derivative calculation unit 31 and a sign change point detection unit 32.
The primary differential calculation unit 31 calculates and outputs primary differential data D from the input n-bit audio signal. For example, the first derivative calculation unit 31
The n-bit audio signal at each sampling point is represented by X (i),
When the n-bit audio signal at the next sampling point is represented by X (i + 1),
D (i) = X (i + 1) −X (i)
D (i) obtained in step 1 is output as first-order differential data.

符号変化点検出部３２は、一次微分データＤの符号が正に変化した点及び負に変化した点を、変曲点として検出する。より具体的には、符号変化点検出部３２は、一次微分データＤが、符号が負である状態又はゼロである状態から符号が正である状態に変わった点（以下、「正への変化点」と言う）、及び符号が正である状態又はゼロである状態から符号が負である状態に変わった点（以下「負への変化点」と言う）を、上記の「符号が変化した点」として検出する。符号変化点検出部３２の出力は２値データであり、「正への変化点」から、「負への変化点」までは、第１の値（例えば、高レベルの値）を取り、「負への変化点」から、「正への変化点」までは、第２の値（例えば、低レベルの値）を取る。
符号変化点検出部３２のこのような動作は、一次微分データＤから符号のみの２値データへの変換し（ただし、０の場合は前データの符号とする）と言うこともできる。
上記の一次微分算出部３１と符号変化点検出部３２とで構成される変曲点検出部２７は、ｎビットの音声信号が一連の単調増加を開始する点及び一連の単調減少を開始する点を変曲点として検出する。The sign change point detection unit 32 detects a point where the sign of the primary differential data D has changed positively and a point where the sign has changed negatively as an inflection point. More specifically, the sign change point detection unit 32 detects that the primary differential data D has changed from a state where the sign is negative or a state where the sign is negative to a state where the sign is positive (hereinafter referred to as “change to positive”). Point)), and the point where the sign changes from a positive or zero state to a negative sign (hereinafter referred to as “change point to negative”) Detect as “point”. The output of the sign change point detection unit 32 is binary data. From the “change point to positive” to the “change point to negative”, the first value (for example, a high level value) is taken. From the “change point to negative” to the “change point to positive”, a second value (for example, a low level value) is taken.
Such an operation of the sign change point detection unit 32 can be said to convert the primary differential data D into binary data with only a sign (however, in the case of 0, the sign of the previous data is used).
The inflection point detection unit 27 configured by the first-order differential calculation unit 31 and the sign change point detection unit 32 has a point where an n-bit audio signal starts a series of monotone increases and a point where a series of monotone decreases starts. Is detected as an inflection point.

周波数推定部２８は音声信号の変曲点と前の（直前の）変曲点の区間長を算出して周波数Ｆとして出力する。
振幅推定部２９は音声信号の変曲点と前の（直前の）変曲点の音声信号のレベル差を振幅Ａとして出力する。The frequency estimation unit 28 calculates the section length of the inflection point of the audio signal and the previous (immediately preceding) inflection point and outputs it as the frequency F.
The amplitude estimation unit 29 outputs the difference in level between the audio signal inflection point and the previous (immediate) inflection point audio signal as an amplitude A.

図２８は、周波数振幅推定部２２の動作を説明するための図である。横軸は時間ｉを示している。図２８（ａ）はｎビットの音声信号Ｘ（ｉ）を示し、図２８（ｂ）は音声信号の一次微分データＤ（ｉ）を示し、図２８（ｃ）は２値データを示し、図２８（ｄ）は音声信号の変曲点位置Ｓを示している。一次微分算出部３１は、図２８（ａ）はｎビットの音声信号が入力された場合、図２８（ｂ）のような一次微分データを算出する。符号変化点検出部３２は、一次微分データＤから図２８（ｃ）のような符号のみの２値データに変換して、図２８（ｄ）のようなその２値データが変化する位置Ｓ＝ｉ１、ｉ２、ｉ３（その時間軸上の位置が縦方向に延びた点線で示されている）を検出する。よって、周波数推定部２８及び振幅推定部２９は、ｉ１〜ｉ２区間では
周波数としてＦ＝１／（ｉ２−ｉ１）、振幅としてＡ＝｜Ｘ（ｉ２）−Ｘ（ｉ１）｜をそれぞれ求め、
ｉ２〜ｉ３区間では、周波数としてＦ＝１／（ｉ３−ｉ２）、振幅としてＡ＝｜Ｘ（ｉ３）−Ｘ（ｉ２）｜をそれぞれ求める。
求められた周波数Ｆ及び振幅Ａはフィルタ係数生成部２３に供給される。FIG. 28 is a diagram for explaining the operation of the frequency amplitude estimator 22. The horizontal axis indicates time i. 28A shows an n-bit audio signal X (i), FIG. 28B shows primary differential data D (i) of the audio signal, FIG. 28C shows binary data, 28 (d) indicates the inflection point position S of the audio signal. When the n-bit audio signal is input in FIG. 28A, the primary differential calculation unit 31 calculates primary differential data as shown in FIG. The sign change point detection unit 32 converts the primary differential data D into binary data having only a sign as shown in FIG. 28C, and a position S where the binary data as shown in FIG. 28D changes. i1, i2, and i3 (the positions on the time axis are indicated by dotted lines extending in the vertical direction) are detected. Therefore, the frequency estimation unit 28 and the amplitude estimation unit 29 obtain F = 1 / (i2−i1) as the frequency and A = | X (i2) −X (i1) |
In the interval from i2 to i3, F = 1 / (i3−i2) is obtained as the frequency and A = | X (i3) −X (i2) | is obtained as the amplitude.
The obtained frequency F and amplitude A are supplied to the filter coefficient generation unit 23.

図２９は、図２８とは別の音声信号が入力された場合の周波数振幅推定部２２の動作を説明するための図である。横軸は時間ｉを示している。図２９（ａ）はｎビットの音声信号Ｘ（ｉ）を示し、図２９（ｂ）は音声信号の一次微分データＤ（ｉ）を示し、図２９（ｃ）は２値データを示し、図２９（ｄ）は音声信号の変曲点位置Ｓを示している。一次微分算出部３１は、図２９（ａ）はｎビットの音声信号が入力された場合、図２９（ｂ）のような一次微分データを算出する。符号変化点検出部３２は、一次微分データから図２９（ｃ）のような符号のみの２値データに変換して、図２９（ｄ）のようなその２値データが変化する位置Ｓ＝ｉ４、ｉ５、ｉ６（その時間軸上の位置が縦方向に延びた点線で示されている）を検出する。よって、周波数推定部２８及び振幅推定部２９は、ｉ４〜ｉ５区間では、周波数としてＦ＝１／（ｉ５−ｉ４）、振幅としてＡ＝｜Ｘ（ｉ５）−Ｘ（ｉ４）｜をそれぞれ求め、ｉ５〜ｉ６区間では、周波数としてＦ＝１／（ｉ６−ｉ５）、振幅としてＡ＝｜Ｘ（ｉ６）−Ｘ（ｉ５）｜をそれぞれ求める。
求められた周波数Ｆ及び振幅Ａはフィルタ係数生成部２３に供給される。FIG. 29 is a diagram for explaining the operation of the frequency amplitude estimation unit 22 when an audio signal different from that in FIG. 28 is input. The horizontal axis indicates time i. 29A shows an n-bit audio signal X (i), FIG. 29B shows primary differential data D (i) of the audio signal, FIG. 29C shows binary data, 29 (d) shows the inflection point position S of the audio signal. The primary differential calculation unit 31 calculates primary differential data as shown in FIG. 29B when an n-bit audio signal is input in FIG. The sign change point detector 32 converts the primary differential data into binary data having only a sign as shown in FIG. 29C, and a position S = i4 where the binary data as shown in FIG. 29D changes. , I5, i6 (the positions on the time axis are indicated by dotted lines extending in the vertical direction). Therefore, the frequency estimation unit 28 and the amplitude estimation unit 29 obtain F = 1 / (i5-i4) as the frequency and A = | X (i5) −X (i4) | In the interval i5 to i6, F = 1 / (i6-i5) is obtained as the frequency, and A = | X (i6) −X (i5) |
The obtained frequency F and amplitude A are supplied to the filter coefficient generation unit 23.

図２８及び図２９を参照して説明したように、
図２７に示す周波数振幅推定部２２で算出される音声信号の一次微分データの符号の変化する位置（「正への変化点」及び「負への変化点」）は音声信号の変曲点として扱われ、その相前後する変曲点の時間間隔の逆数が、音声信号の周波数であると推定することができる。また、変曲点は、一連の単調増加の開始点又は一連の単調減少の開始点であるので、相前後する変曲点の信号値の差が振幅であると推定することができる。As described with reference to FIGS. 28 and 29,
The position where the sign of the primary differential data of the audio signal calculated by the frequency amplitude estimation unit 22 shown in FIG. 27 changes (“change point to positive” and “change point to negative”) is the inflection point of the audio signal. It can be estimated that the reciprocal of the time interval between the inflection points that are treated and the phase is the frequency of the audio signal. In addition, since the inflection point is the start point of a series of monotone increases or the start point of a series of monotone decreases, it can be estimated that the difference between the signal values of successive inflection points is the amplitude.

図２８及び図２９の例では、音声信号が単一の周波数成分からなる。
次に、音声信号が複数の周波数成分からなる場合について、図３０及び図３１を参照して説明する。
図３０（ａ）は周波数ｆ１１、振幅ａ１１の信号成分を示し、図３０（ｂ）は周波数ｆ１２（＞ｆ１１）、振幅ａ１２の信号成分を示し、図３０（ｃ）は、図３０（ａ）の信号成分と図３０（ｂ）の信号成分が重畳した信号を示す。図の縦方向に延びた点線は変曲点の時間的位置を示す。相前後する変曲点相互の時間間隔（区間長さ）が信号成分の半周期として、周波数の推定に用いられる。
この時、図３０（ｃ）のようにほとんどの領域で、高いほうの周波数であるｆ１２が周波数として推定され、ａ１２が振幅として推定され、このように、ｆ１２、ａ１２が推定された区間では、周波数ｆ１２、振幅ａ１２に基づいてフィルタ係数が選択され、波形歪みの補正が正確に行われる。In the example of FIGS. 28 and 29, the audio signal is composed of a single frequency component.
Next, the case where the audio signal is composed of a plurality of frequency components will be described with reference to FIGS. 30 and 31. FIG.
30A shows the signal component of frequency f11 and amplitude a11, FIG. 30B shows the signal component of frequency f12 (> f11) and amplitude a12, and FIG. 30C shows FIG. 30A. The signal component of FIG. 30 and the signal component of FIG. The dotted line extending in the vertical direction in the figure indicates the temporal position of the inflection point. The time interval (section length) between the inflection points that follow each other is used as a half cycle of the signal component for frequency estimation.
At this time, in most regions as shown in FIG. 30 (c), the higher frequency f12 is estimated as the frequency, and a12 is estimated as the amplitude. Thus, in the section where f12 and a12 are estimated, A filter coefficient is selected based on the frequency f12 and the amplitude a12, and the waveform distortion is accurately corrected.

サンプリング点ｉ１１〜ｉ１２のように、周波数がｆ１２と推定されない（推定結果がｆ１２よりも大きな値となる）区間では、重複された信号を一つの正弦波と近似して周波数が推定される。しかし、ｉ１１〜ｉ１２区間は非常に短く、ｆ１２より高い周波数が推定されるので、これにより生じる問題は大きくはない。 In a section where the frequency is not estimated to be f12 (sampling result is a value larger than f12) like the sampling points i11 to i12, the frequency is estimated by approximating the overlapped signal with one sine wave. However, since the sections i11 to i12 are very short and a frequency higher than f12 is estimated, the problem caused by this is not significant.

図３１に、図３０とは異なる例を示す。
図３１（ａ）は周波数ｆ１３、振幅ａ１３の信号成分を示し、図３１（ｂ）は周波数ｆ１４（＞ｆ１３）、振幅ａ１４の信号成分を示し、図３１（ｃ）は、図３１（ａ）に示す信号成分と図３１（ｂ）に示す信号成分とが重畳した信号を示す。図の縦方向に延びた点線は変曲点の時間的位置を示す。相前後する変曲点の時間間隔（区間長）が信号成分の半周期として、周波数の推定に用いられる。
図３１（ｄ）はｎビット信号（図３１（ｃ））を実施の形態３で処理したｎ＋αビットの出力を示す。図３１（ｃ）のようにほとんどの領域で高いほうの周波数であるｆ１３が推定され、振幅ａ１３が推定される。FIG. 31 shows an example different from FIG.
FIG. 31A shows signal components of frequency f13 and amplitude a13, FIG. 31B shows signal components of frequency f14 (> f13) and amplitude a14, and FIG. 31C shows FIG. 31A. The signal component shown in FIG. 31 and the signal component shown in FIG. The dotted line extending in the vertical direction in the figure indicates the temporal position of the inflection point. The time interval (section length) between the inflection points that follow each other is used as a half cycle of the signal component for frequency estimation.
FIG. 31D shows an output of n + α bits obtained by processing the n-bit signal (FIG. 31C) in the third embodiment. As shown in FIG. 31 (c), f13, which is the higher frequency, is estimated in most regions, and the amplitude a13 is estimated.

しかし、サンプリング点ｉ１３〜ｉ１４の区間のようにｆ３が推定されない（推定結果がｆ３よりも低い値となる）区間では、重複された信号を一つの正弦波と近似して周波数が推定されてしまう。
しかし、ｉ１３〜ｉ１４の区間における補正量はビット拡張前の信号（ｎビットの信号）のＬＳＢの１／２以上にはならず、仮にビット拡張後の信号（ｎ＋αビットの信号）をｎビットに量子化した場合には、補正量が０になる。つまり、ビット拡張後のｎ＋αビットの信号を、仮に量子化してｎビットの信号に戻した場合、ｉ１３〜ｉ１４の区間でも入力信号を再現することができる。
また、影響があるのはｉ１３〜ｉ１４区間のみであり、全体からみるとごく一部の区間に限られる。However, in a section where f3 is not estimated as in the section of sampling points i13 to i14 (the estimation result is a value lower than f3), the frequency is estimated by approximating the overlapped signal with one sine wave. .
However, the correction amount in the interval from i13 to i14 is not more than 1/2 of the LSB of the signal before bit extension (n-bit signal), and the signal after bit extension (n + α-bit signal) is assumed to be n bits. When quantized, the correction amount becomes zero. That is, if an n + α-bit signal after bit expansion is quantized and returned to an n-bit signal, the input signal can be reproduced even in the period from i13 to i14.
Further, only the sections i13 to i14 have an influence, and are limited to only a part of the section as a whole.

図３０及び図３１を参照して説明したように、図２７に示す周波数振幅推定部２２で算出される音声信号の一次微分データの符号の変化する位置（「正への変化点」及び「負への変化点」）は音声信号のうちの最も高い周波数の信号成分の変曲点として扱われ、その相前後する変曲点の時間間隔の逆数が、音声信号の最も高い周波数の成分の周波数であると推定することができる。また、変曲点は、一連の単調増加の開始点又は一連の単調減少の開始点であるので、相前後する変曲点の信号値の差が、音声信号に含まれる最も高い周波数の成分の振幅であると推定することができる。 As described with reference to FIGS. 30 and 31, the positions (“change points to positive” and “negative” of the sign of the first derivative data of the audio signal calculated by the frequency amplitude estimation unit 22 shown in FIG. ")" Is treated as the inflection point of the highest frequency signal component of the audio signal, and the reciprocal of the time interval between the inflection points of the audio signal is the frequency of the highest frequency component of the audio signal. It can be estimated that. In addition, since the inflection point is the start point of a series of monotone increases or the start point of a series of monotone decreases, the difference between the signal values of successive inflection points is the highest frequency component contained in the audio signal. It can be estimated that it is an amplitude.

図２７に示す周波数振幅推定部２２では、音声信号に含まれる最も高い周波数の成分の半周期を推定することで周波数を算出することができ、同時に変曲点における信号値の差から振幅を算出することができる。 In the frequency amplitude estimation unit 22 shown in FIG. 27, the frequency can be calculated by estimating the half cycle of the highest frequency component included in the audio signal, and at the same time, the amplitude is calculated from the difference in signal value at the inflection point. can do.

そして、このようにして推定された周波数及び振幅に基づいて決定されたフィルタ係数が、それぞれの区間（相前後する変曲点相互の区間）の信号値にフィルタリングに用いられる。たとえば、図２８の相前後する変曲点ｉ１、ｉ２により求められた周波数、振幅により決定されたフィルタ係数を用いて、変曲点ｉ１の次のサンプル点から、変曲点ｉ２までのデータに対するフィルタリングが行われる。 The filter coefficients determined based on the frequency and the amplitude estimated in this way are used for filtering in the signal values of the respective sections (intersections between successive inflection points). For example, for the data from the sample point next to the inflection point i1 to the inflection point i2, using the filter coefficient determined by the frequency and amplitude obtained by the inflection points i1 and i2 in FIG. Filtering is performed.

なお、上記の変曲点検出部２７は、ｎビットの音声信号が一連の単調増加を開始する点及び一連の単調減少を開始する点を変曲点として検出するものであるが、代わりにｎビットの音声信号が一連の単調増加を終了する点及び一連の単調減少を終了する点を変曲点として検出するように変曲点検出部を構成しても良い。その場合には、例えば、図３２（ａ）で示される信号を入力とする一次微分算出部として、図３２（ｂ）に示すように、
Ｄ（ｉ）＝Ｘ（ｉ）−Ｘ（ｉ−１）
で得られるＤ（ｉ）を、一次微分データとして出力するものを用い、符号変化点検出部として、図３２（ｃ）、（ｄ）に示すように、一次微分データＤの符号が負に変化した点より前で、それに最も近い正であった点、及び一次微分データＤの符号が正に変化した点より前で、それに最も近い負であった点を検出するものを用いれば良い。
図３２には、検出された変曲点の時間的位置が符号ｉ２１、ｉ２２、ｉ２３の付された点線で示されている。The inflection point detection unit 27 detects a point at which an n-bit audio signal starts a series of monotone increases and a point at which a series of monotone decreases starts as an inflection point. The inflection point detection unit may be configured to detect, as an inflection point, a point at which a bit sound signal ends a series of monotone increases and a point at which a series of monotone decreases ends. In that case, for example, as shown in FIG. 32 (b), as a first-order differential calculation unit that receives the signal shown in FIG. 32 (a),
D (i) = X (i) -X (i-1)
As shown in FIGS. 32 (c) and 32 (d), the sign of the primary differential data D changes to negative as the sign change point detection unit using the output of D (i) obtained in step 1 as the primary differential data. What is necessary is just to use the point that is the closest positive point before the point and the point that was the closest negative point before the point where the sign of the primary differential data D changed to positive.
In FIG. 32, the temporal positions of the detected inflection points are indicated by dotted lines with symbols i21, i22, i23.

図３３は、フィルタ係数生成部２３の動作を説明するための図である。フィルタ係数生成部２３は、図３３のようなカットオフ周波数と次数が異なる複数の低域通過フィルタ係数を格納するフィルタ係数テーブルを備え、フィルタ係数テーブルから周波数Ｆと振幅Ａに基づいて低域通過フィルタ係数Ｃを選択してエッジ保存型平滑化フィルタ部２５に出力する。 FIG. 33 is a diagram for explaining the operation of the filter coefficient generation unit 23. The filter coefficient generation unit 23 includes a filter coefficient table for storing a plurality of low-pass filter coefficients having different orders from the cutoff frequency as shown in FIG. 33, and the low-pass filter is generated based on the frequency F and the amplitude A from the filter coefficient table. The filter coefficient C is selected and output to the edge preserving smoothing filter unit 25.

図３４は、カットオフ周波数と次数が異なる複数の低域通過フィルタ係数を格納するフィルタ係数テーブルを説明するための図である。このテーブルが保持している低域通過フィルタ係数は、音声信号の周波数に基づいて低域通過フィルタのカットオフ周波数特性と、周波数と振幅に基づいて低域通過フィルタの次数を決定される。例えば図３４（ａ）の実線のような周波数特性を持つ音声信号の場合、図３４（ａ）の点線のようなカットオフ周波数特性の低域通過フィルタ係数を生成する。（周波数ｆ１が推定できればｆ２は自明であるため、ｆ１とｆ２の間にｆｃ１を設定する）。図３４（ｂ）の実線のような周波数特性を持つ音声信号の場合には、図３４（ｂ）の点線のようなカットオフ周波数特性の低域通過フィルタ係数を生成する。（周波数ｆ４が推定できればｆ５は自明であるため、ｆ４とｆ５の間にｆｃ４を設定する）。このように周波数に応じた低域通過フィルタのカットオフ周波数特性を持つようにする。 FIG. 34 is a diagram for explaining a filter coefficient table that stores a plurality of low-pass filter coefficients having orders different from the cutoff frequency. The low-pass filter coefficient held in this table is determined based on the cut-off frequency characteristic of the low-pass filter based on the frequency of the audio signal, and the order of the low-pass filter based on the frequency and amplitude. For example, in the case of an audio signal having a frequency characteristic such as a solid line in FIG. 34A, a low-pass filter coefficient having a cutoff frequency characteristic such as a dotted line in FIG. (If the frequency f1 can be estimated, f2 is self-explanatory, so fc1 is set between f1 and f2.) In the case of an audio signal having a frequency characteristic as shown by the solid line in FIG. 34B, a low-pass filter coefficient having a cutoff frequency characteristic as shown by the dotted line in FIG. 34B is generated. (If frequency f4 can be estimated, f5 is self-explanatory, so fc4 is set between f4 and f5). In this way, the low-pass filter has a cut-off frequency characteristic corresponding to the frequency.

また、周波数と振幅から傾きを求めて次数を決定する。周波数が同じで振幅が大きい場合、傾きは大きくなるのでフィルタの次数は小さくする必要がある。 Also, the order is determined by obtaining the slope from the frequency and amplitude. When the frequency is the same and the amplitude is large, the gradient becomes large, so the order of the filter needs to be small.

以上のようにフィルタ係数生成部２３は、推定された周波数Ｆの高調波成分をカットし、推定された振幅Ａが大きいほど低域通過フィルタの次数が小さいフィルタ係数を生成することができる。 As described above, the filter coefficient generation unit 23 can generate a filter coefficient with a lower order of the low-pass filter as the estimated amplitude A is larger by cutting the estimated harmonic component of the frequency F.

実施の形態３では音声信号の周波数と振幅を推定して、その周波数と振幅に基づいたカットオフ周波数特性と次数を有する低域通過フィルタ係数を生成して平滑化するため、音声信号の量子化による波形歪みを的確に補正することができる。一方で平滑化の際にエッジ保存型平滑化フィルタを適用しているため、信号振幅が急峻に大きく変化する領域を有する音声信号入力波形の再現性を損なわない。 By estimating the frequency and amplitude of the third the speech signals embodiment, the frequency and for smoothing and generates a low-pass filter coefficients with a cutoff frequency characteristics and the order based on the amplitude quantization of the audio signal The waveform distortion due to can be accurately corrected. On the other hand, since the edge-preserving smoothing filter is applied at the time of smoothing, the reproducibility of the audio signal input waveform having a region where the signal amplitude changes sharply and greatly is not impaired.

これまではエッジ保存型平滑化フィルタ部２５はεフィルタである構成で説明したが、実施の形態３はこの構成に限ったものではない。バイラテラルフィルタ、トリムド平均値フィルタ（ＤＷ−ＭＴＭフィルタ）などの他のエッジ保存型フィルタでも同様の効果が期待できる。 So far, the edge preserving smoothing filter unit 25 has been described as being an ε filter, but the third embodiment is not limited to this configuration. Similar effects can be expected with other edge-preserving filters such as a bilateral filter and a trimmed average value filter (DW-MTM filter).

図３５は、以上に説明した実施の形態３に係る音声信号処理装置の処理工程を示すフローチャートである。 FIG. 35 is a flowchart showing processing steps of the audio signal processing device according to Embodiment 3 described above.

まず、入力端子２１よりｎビットの音声信号が周波数振幅推定部２２に入力される。周波数振幅推定部２２はｎビットの音声信号から周波数と振幅を推定してフィルタ係数生成部２３に出力する（ＳＴ１１）。フィルタ係数生成部２３は周波数と振幅に基づいて低域通過フィルタ係数をエッジ保存型平滑化フィルタ部２５に出力する（ＳＴ１２）。原データビット拡張部２４は、ｎビットの音声信号をαビット分ビット拡張したｎ＋αビットの音声信号をエッジ保存型平滑化フィルタ部２５に出力する（ＳＴ１３）。エッジ保存型平滑化フィルタ部２５は、生成した低域通過フィルタ係数を用いてｎ＋αビットの音声信号をエッジ保存型平滑化フィルタ処理して、ｎ＋αビットの音声信号を出力する（ＳＴ１４）。 First, an n-bit audio signal is input to the frequency amplitude estimation unit 22 from the input terminal 21. The frequency / amplitude estimation unit 22 estimates the frequency and amplitude from the n-bit audio signal and outputs them to the filter coefficient generation unit 23 (ST11). The filter coefficient generation unit 23 outputs the low-pass filter coefficient to the edge preserving smoothing filter unit 25 based on the frequency and amplitude (ST12). The original data bit extension unit 24 outputs an n + α bit audio signal obtained by extending the n bit audio signal by α bits to the edge preserving smoothing filter unit 25 (ST13). The edge-preserving smoothing filter unit 25 performs edge preserving smoothing filter processing on the n + α-bit audio signal using the generated low-pass filter coefficient, and outputs an n + α-bit audio signal (ST14).

図２２に示す音声信号処理装置は、ソフトウエアで、即ちプログラムされたコンピュータで実現することもできる。その場合、コンピュータによる処理の手順は、図３５を参照して上記したのと同様である。 The audio signal processing apparatus shown in FIG. 22 can also be realized by software, that is, by a programmed computer. In this case, the processing procedure by the computer is the same as that described above with reference to FIG.

実施の形態３によれば、ｎビットの音声信号から比較的長い時間範囲で周波数と振幅を推定し、その周波数と振幅に基づいて算出された低域通過フィルタを使って平滑化することで、量子化によって発生する高調波成分を除去し、量子化による音声信号の波形歪みを補正することができ、従来よりも量子化前の音声信号波形に近づけるような音声信号処理が実現できる。またエッジ保存型平滑化フィルタを適用することにより、信号振幅が急峻に大きく変化する領域を有する音声信号の再現性を損なわないようにすることができる。 According to the third embodiment, the frequency and amplitude are estimated from an n-bit audio signal in a relatively long time range, and smoothed using the low-pass filter calculated based on the frequency and amplitude. Harmonic components generated by quantization can be removed, waveform distortion of the audio signal due to quantization can be corrected, and audio signal processing that approximates the audio signal waveform before quantization than before can be realized. Further, by applying an edge preserving smoothing filter, it is possible to prevent the reproducibility of an audio signal having a region where the signal amplitude changes sharply and greatly.

実施の形態３で説明した、音声信号の周波数及び振幅の推定結果に基づいて生成したフィルタ係数を用いたエッジ保存型平滑化は、実施の形態１、実施の形態２のいずれとも組み合わせ可能である。例えば、図２２のフィルタ係数生成部２３で生成した係数を、図５に示した係数プログラマブル判定付き加重平均部１２に供給することとすれば良い。また、図１８の音声処理装置においても、判定付き加重平均部１０として、係数プログラマブルなものを用い、該係数として、図２２のフィルタ係数生成部２３で生成した係数を用いることとしても良い。同様に、図２０の音声処理装置においても、ビット拡張及び判定付き加重平均部１７として、係数プログラマブルなものを用い、該係数として、図２２のフィルタ係数生成部２３で生成した係数を用いることとしても良い。 The edge preserving smoothing using the filter coefficient generated based on the estimation result of the frequency and amplitude of the audio signal described in the third embodiment can be combined with either the first embodiment or the second embodiment. . For example, the coefficients generated by the filter coefficient generation unit 23 in FIG. 22 may be supplied to the weighted average unit 12 with coefficient programmable determination shown in FIG. Also, in the speech processing apparatus of FIG. 18, a coefficient programmable unit may be used as the weighted average unit 10 with determination, and the coefficient generated by the filter coefficient generation unit 23 of FIG. 22 may be used as the coefficient. Similarly, in the speech processing apparatus of FIG. 20, as the weighted average unit 17 with bit extension and determination, a coefficient programmable unit is used, and the coefficient generated by the filter coefficient generation unit 23 of FIG. 22 is used as the coefficient. Also good.

本発明の活用例として、カーオーディオ、ＰＡ(ポータブルオーディオ)などの音声信号処理装置に適用できる。 As an application example of the present invention, the present invention can be applied to an audio signal processing apparatus such as a car audio and a PA (portable audio).

Claims

an original data bit extension unit for generating and outputting a sequence of n + α-bit audio signal data by extending each bit of the n-bit (n is an integer) audio signal data by α bits (α is an integer) bit;
An edge-preserving smoothing filter that smoothes a sequence of n + α-bit audio signal data generated by the bit extension ;
A frequency amplitude estimator for estimating the frequency and amplitude of the n-bit audio signal;
A filter coefficient generation unit that generates a low-pass filter coefficient based on the frequency and the amplitude,
The edge preserving smoothing filter unit performs smoothing using the filter coefficient generated by the filter coefficient generation unit,
The filter coefficient generation unit
Generating a filter coefficient of a low-pass filter that removes harmonic components of the frequency of the estimated sound signal, and the order of the low-pass filter is reduced as the estimated amplitude is increased. apparatus.

The filter coefficient generation unit
A filter coefficient table that stores a plurality of low-pass filter coefficients having different orders from the cutoff frequency,
The audio signal processing device according to claim 1 , wherein the low-pass filter coefficient is selected based on the estimated frequency and amplitude from the filter coefficient table.

The audio signal processing apparatus according to claim 1 , wherein the frequency amplitude estimation unit detects an inflection point of the audio signal and estimates a frequency and an amplitude from the detected inflection point.

The frequency amplitude estimator is
An inflection point detecting unit that detects a point at which the n-bit audio signal starts or ends a series of monotone increases and a point at which a series of monotonic decreases starts or ends as an inflection point;
A frequency estimation unit that estimates a frequency from the section length of the position of the detected inflection point;
The audio signal processing apparatus according to claim 3 , further comprising: an amplitude estimation unit that estimates a level difference of the detected inflection point as an amplitude.

The inflection point detector is
A first derivative calculating unit for calculating first derivative data of the n-bit audio signal;
The audio signal processing apparatus according to claim 4 , further comprising: a sign change point detection unit that detects, as the inflection point, a point at which a sign of the primary differential data has changed positively and a point at which the sign has changed negatively. .

The first derivative calculation unit
The n-bit audio signal at each sampling point is represented by X (i),
When the n-bit audio signal at the next sampling point is represented by X (i + 1),
D (i) = X (i + 1) −X (i)
The audio signal processing apparatus according to claim 5 , wherein D (i) obtained in step (2) is output as the first-order differential data.

The sign change point detection unit is configured such that the first derivative data is changed from a state where the sign is negative or zero to a state where the sign is positive, and a state where the sign is positive or zero. The audio signal processing apparatus according to claim 6 , wherein a point at which the sign has changed to a negative state is detected as a point at which the sign has changed.

The audio signal processing apparatus according to claim 1, wherein the edge preserving smoothing filter unit includes an ε filter unit.

The threshold ε of the ε filter unit is 2 ^αα Is
The audio signal processing apparatus according to claim 8.

The audio signal processing apparatus according to claim 8 , wherein the ε filter unit is a one-dimensional m-th order ε filter unit that performs processing on a sequence of input signal data.

The audio signal processing apparatus according to claim 10 , wherein a weight value of an average calculation performed in the one-dimensional m-th order ε filter unit can be changed.

The audio signal processing apparatus according to claim 10 , wherein the filter order m of the one-dimensional m-order ε filter unit is changeable.

The one-dimensional m-order ε filter unit is
A data storage unit that holds data in a row of n + α-bit audio signal data continuous in the time direction and simultaneously outputs the data;
Among the data stored in the data storage unit, a difference calculation unit that generates difference data between the audio signal data of the target sampling point and each of the audio signal data of the sampling points before and after the target sampling point;
For each of the sampling points before and after the target sampling point, an ε determination unit that generates determination data indicating whether or not the difference data output by the difference calculation unit is greater than a threshold;
For each of the sampling points before and after the sampling point of interest, a weighted average unit with determination that generates a weighted average value based on the determination data output by the ε determination unit and the difference data output by the difference calculation unit,
The audio signal processing apparatus according to claim 10 , further comprising: an addition unit that generates an addition value of the audio signal data of the sampling point of interest and the weighted average value data output from the weighted average unit with determination.

A comparison unit for obtaining a ratio between the audio signal data of the target sampling point input to the data storage unit or the audio signal data of the target sampling point output from the data storage unit and the data output from the data addition unit; ,
The audio signal processing apparatus according to claim 13 , wherein correction is performed on the addition value output from the data addition unit in accordance with the ratio obtained by the comparison unit.

  an original data bit extending step for generating and outputting a sequence of n + α-bit audio signal data by extending each bit of n-bit (n is an integer) audio signal data by α bits (α is an integer);
  An edge-preserving smoothing filter step for smoothing a sequence of n + α-bit audio signal data generated by the bit extension;
  A frequency amplitude estimating step for estimating the frequency and amplitude of the n-bit audio signal;
  A filter coefficient generation step for generating a low-pass filter coefficient based on the frequency and the amplitude,
  The edge preserving type smoothing filter step performs smoothing using the filter coefficient generated in the filter coefficient generation step,
  The filter coefficient generation step includes:
  A filter coefficient of a low-pass filter that removes harmonic components of the frequency of the estimated audio signal is generated, and the order of the low-pass filter decreases as the estimated amplitude increases.
  An audio signal processing method.