JP2570603B2

JP2570603B2 - Audio signal transmission device and noise suppression device

Info

Publication number: JP2570603B2
Application number: JP5293623A
Authority: JP
Inventors: 泰祐佐々田
Original assignee: NEC Corp
Current assignee: NEC Corp
Priority date: 1993-11-24
Filing date: 1993-11-24
Publication date: 1997-01-08
Anticipated expiration: 2012-01-08
Also published as: JPH07147566A

Description

DETAILED DESCRIPTION OF THE INVENTION

【０００１】[0001]

【産業上の利用分野】本発明は、音声信号伝送装置の送
信装置に利用する。特に、ノイズ重畳された広帯域音声
信号を音声帯域の回線を介して送信する送信装置の高域
に重畳されたノイズを抑圧するノイズ抑圧装置に関する
ものである。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention is used for a transmitting device of a voice signal transmitting device. In particular, the present invention relates to a noise suppression device that suppresses noise superimposed on a high frequency band of a transmission device that transmits a broadband audio signal on which noise is superimposed via a line in an audio band.

【０００２】[0002]

【従来の技術】従来、音声信号伝送装置は、広帯域音声
信号に重畳されたノイズを除去することなく伝送してい
た。2. Description of the Related Art Conventionally, an audio signal transmission apparatus has transmitted without removing noise superimposed on a wideband audio signal.

【０００３】[0003]

【発明が解決しようとする課題】しかし、このような従
来の音声信号伝送装置では、音声信号に重畳されたノイ
ズを除去することなく伝送していたために、音声の通話
品質が劣化する問題点があった。However, in such a conventional audio signal transmitting apparatus, since the noise superimposed on the audio signal is transmitted without being removed, the voice communication quality deteriorates. there were.

【０００４】ここで、音声信号をディジタル信号処理し
てピッチを得、音声信号の雑音重畳区間をピッチで区切
って同期加算し、こうして得た雑音除去音声で音声信号
の雑音重畳区間を補間する提案があるが（特開昭６３−
２６１２３号公報）、広帯域音声信号の高域周波数成分
の雑音を抑圧するものではない。Here, a pitch is proposed by digitally processing a speech signal to obtain a pitch, a noise superimposed section of the speech signal is divided by the pitch, synchronously added, and the noise superimposed section of the speech signal is interpolated with the noise-removed speech thus obtained. There is (Japanese
No. 26123) does not suppress the noise of the high frequency component of the wideband audio signal.

【０００５】本発明は前記の問題点を解決するもので、
広帯域音声信号に重畳されたノイズを除去し音声の品質
劣化を防止できる音声信号伝送装置を提供することを目
的とする。The present invention solves the above problems,
It is an object of the present invention to provide an audio signal transmission device capable of removing noise superimposed on a wideband audio signal and preventing deterioration of audio quality.

【０００６】[0006]

【課題を解決するための手段】本発明は、送信すべき広
帯域音声を回線に送信する送信装置と、この回線からこ
の送信装置の送信信号を受信する受信装置とを備えた音
声信号伝送装置において、前記送信装置は、前記広帯域
音声を時系列的なディジタル信号に変換するアナログデ
ィジタル変換器、このアナログディジタル変換器の出力
ディジタル信号を周波数分割された信号に変換する変換
手段、この変換手段の出力信号の高域周波数成分を間引
きして帯域圧縮する圧縮手段、この圧縮手段の出力信号
を時系列的なディジタル信号に変換する逆変換手段、お
よびこの逆変換手段の出力信号をアナログ信号に変換す
るディジタルアナログ変換器を含むノイズ抑圧装置を備
えたことを特徴とする。According to the present invention, there is provided an audio signal transmitting apparatus comprising: a transmitting apparatus for transmitting a wideband voice to be transmitted to a line; and a receiving apparatus for receiving a transmission signal of the transmitting apparatus from the line. An analog-to-digital converter for converting the wideband voice into a time-series digital signal, a conversion unit for converting a digital signal output from the analog-to-digital converter into a frequency-divided signal, and an output of the conversion unit. Compression means for thinning out the high frequency components of the signal and band-compressing the signal; inverse conversion means for converting the output signal of the compression means into a time-series digital signal; and conversion of the output signal of the inverse conversion means into an analog signal A noise suppression device including a digital-to-analog converter is provided.

【０００７】また、本発明は、前記変換手段は、前記ア
ナログディジタル変換器からのディジタル信号から有限
個のデータを取り出しフレーム処理および窓かけ処理を
行いこの処理されたフレーム内のデータに対する周波数
領域への変換を高速フーリエ変換により行う高速フーリ
エ変換演算回路を含み、前記逆変換手段は、前記高速フ
ーリエ変換演算回路の出力信号を逆高速フーリエ変換に
より時系列的なディジタル信号に変換しオーバラップ処
理を行う逆高速フーリエ変換演算回路を含むことができ
る。Further, according to the present invention, the conversion means takes out a finite number of data from the digital signal from the analog-to-digital converter, performs frame processing and windowing processing, and converts the data into a frequency domain for data in the processed frame. The fast Fourier transform operation circuit for performing the conversion by the fast Fourier transform, the inverse transform means converts the output signal of the fast Fourier transform operation circuit into a time-series digital signal by an inverse fast Fourier transform, and performs overlap processing. It may include an inverse fast Fourier transform operation circuit for performing the operation.

【０００８】さらに、本発明は、前記圧縮手段は、高い
周波数の間引く区間は間引き率を高くし、その区間の最
大パワーの周波数成分を残すべき有効成分として選択し
帯域を圧縮する手段を含むことができる。Further, the present invention includes a compression means for increasing a thinning rate in a section where high frequency is thinned out, selecting a frequency component having a maximum power in the section as an effective component to be left, and compressing a band. Can be.

【０００９】また、本発明の別の観点は、前記音声信号
伝送装置に利用するノイズ抑圧装置である。この装置は
前記音声信号伝送装置とは別に個別に商品として取り引
きすることができる。[0009] Another aspect of the present invention is a noise suppression device used in the audio signal transmission device. This device can be separately traded separately from the audio signal transmission device.

【００１０】[0010]

【作用】ノイズが重畳された広帯域音声をアナログディ
ジタル変換して広帯域音声の時系列信号に変換し、高速
フーリエ変換でこの時系列信号から周波数軸中の信号に
変換して高域周波数成分に対して間引きを施す。さら
に、逆高速フーリエ変換で周波数成分信号を時系列信号
に逆変換してディジタルアナログ変換することにより、
広帯域音声信号に重畳されたノイズを除去し音声の品質
劣化を防止できる。[Function] A wide-band sound on which noise is superimposed is converted from analog to digital to a time-series signal of a wide-band sound, and the time-series signal is converted into a signal in the frequency axis by a fast Fourier transform to remove a high frequency component. Thin out. Furthermore, by inversely transforming the frequency component signal into a time-series signal by inverse fast Fourier transform and performing digital-to-analog conversion,
It is possible to remove noise superimposed on the wideband audio signal and prevent deterioration of audio quality.

【００１１】[0011]

【実施例】本発明の実施例について図面を参照して説明
する。Embodiments of the present invention will be described with reference to the drawings.

【００１２】図１は本発明一実施例音声信号伝送装置の
ブロック構成図である。FIG. 1 is a block diagram of an audio signal transmission apparatus according to one embodiment of the present invention.

【００１３】図１において、音声信号伝送装置は、送信
すべき広帯域音声を入力する入力手段２０およびこの入
力手段２０の出力を回線に送信する送信手段３０を含む
送信装置４０と、この回線から送信装置４０の送信信号
（ノイズ抑圧信号）を受信する受信装置５０とを備え
る。In FIG. 1, an audio signal transmitting apparatus includes a transmitting apparatus 40 including input means 20 for inputting broadband audio to be transmitted, and transmitting means 30 for transmitting an output of the input means 20 to a line, and transmitting from the line. A receiving device 50 that receives the transmission signal (noise suppression signal) of the device 40.

【００１４】ここで本発明の特徴とするところは、送信
装置４０は、入力手段２０からの広帯域音声を時系列的
なディジタル信号に変換するアナログディジタル変換器
１１、アナログディジタル変換器１１の出力ディジタル
信号を周波数分割された信号に変換する変換手段、この
変換手段の出力信号の高域周波数成分を間引きして帯域
圧縮する圧縮手段１３、圧縮手段１３の出力信号を時系
列的なディジタル信号に変換する逆変換手段、およびこ
の逆変換手段の出力信号をアナログ信号に変換して送信
手段３０に出力するディジタルアナログ変換器５１を含
むノイズ抑圧装置１０を備えたことにある。[0014] The filtrate come when the feature of the present invention herein, the transmission device 40, an analog-digital converter 11 for converting a wideband speech from the input unit 20 in a time-series digital signal, the output of the analog-digital converter 11 Conversion means for converting a digital signal into a frequency-divided signal; compression means 13 for thinning out a high-frequency component of the output signal of the conversion means and band-compressing the signal; converting the output signal of the compression means 13 into a time-series digital signal There is provided a noise suppressing device 10 including an inverse conversion means for converting, and a digital / analog converter 51 for converting an output signal of the inverse conversion means into an analog signal and outputting the analog signal to the transmitting means 30.

【００１５】また、前記変換手段は、アナログディジタ
ル変換器１１からのディジタル信号から有限個のデータ
を取り出しフレーム処理および窓かけ処理を行いこの処
理されたフレーム内のデータに対する周波数領域への変
換を高速フーリエ変換により行う高速フーリエ変換演算
回路としてＦＦＴ演算回路１２を含み、前記逆変換手段
は、ＦＦＴ変換演算回路１２の出力信号を逆高速フーリ
エ変換により時系列的なディジタル信号に変換しオーバ
ラップ処理を行う逆高速フーリエ変換演算回路として逆
ＦＦＴ演算回路１４を含む。The conversion means takes out a finite number of data from the digital signal from the analog-to-digital converter 11 and performs frame processing and windowing processing to convert the data in the processed frame into the frequency domain at high speed. An FFT operation circuit 12 is included as a fast Fourier transform operation circuit for performing a Fourier transform, and the inverse transform means converts an output signal of the FFT transform operation circuit 12 into a time-series digital signal by an inverse fast Fourier transform to perform overlap processing. An inverse FFT operation circuit 14 is included as an inverse fast Fourier transform operation circuit to be performed.

【００１６】さらに、圧縮手段１３は、高い周波数の間
引く区間は間引き率を高くし、その区間の最大パワーの
周波数成分を残すべき有効成分として選択し帯域を圧縮
する手段を含む。Further, the compression means 13 includes means for increasing a thinning rate in a section where high frequency is thinned out, selecting a frequency component having a maximum power in the section as an effective component to be left, and compressing the band.

【００１７】このような構成の音声信号伝送装置の動作
について説明する。The operation of the audio signal transmitting apparatus having such a configuration will be described.

【００１８】図２は本発明の音声信号伝送装置の広帯域
音声のスペクトルを示す図であり、横軸は周波数を示
し、縦軸は信号パワーを示す。また、図２（ａ）はノイ
ズ付加の広帯域音声のスペクトルを示し、図２（ｂ）は
ノイズ抑圧を施した広帯域音声のスペクトルを示す。FIG. 2 is a diagram showing a spectrum of a wideband voice of the voice signal transmitting apparatus according to the present invention. The horizontal axis indicates frequency, and the vertical axis indicates signal power. FIG. 2A shows the spectrum of a broadband speech with noise added, and FIG. 2B shows the spectrum of the wideband speech with noise suppression.

【００１９】図１において、ノイズが付加された広帯域
音声信号をアナログディジタル変換器１１にて時系列の
ディジタル信号に変換し、その信号をＦＦＴ演算回路１
２で周波数軸上の信号に変換する。この時点でのスペク
トルは、図２（ａ）に示す。この周波数成分の内、高域
周波数成分に対しては周波数を間引く。周波数を間引く
区間は、図２（ａ）に示すように、間引く率が２：１の
区間から間引く率が４：１の区間へと周波数が高くなる
につれ間引く率を上げている。In FIG. 1, a wideband audio signal to which noise has been added is converted into a time-series digital signal by an analog-to-digital converter 11 and the signal is converted to an FFT operation circuit 1
In step 2, the signal is converted into a signal on the frequency axis. The spectrum at this point is shown in FIG. Of these frequency components, the frequency is thinned out for high frequency components. As shown in FIG. 2A, in the section where the frequency is thinned, the thinning rate is increased as the frequency increases from the section where the thinning rate is 2: 1 to the section where the thinning rate is 4: 1.

【００２０】有効成分として残すべき周波数成分の設定
は、間引く区間の最大パワーを選択するものとする。図
中の黒で示した周波数成分は有効成分として残す成分、
白で示す成分は、間引きにより削除する成分である。す
なわち削除される周波数成分は、削除しても広帯域音声
信号に大きく影響を与えない部分であり、ノイズ成分の
比重が大きい信号であると見なしている。これらの間引
き処理を圧縮手段１３で行う。さらに、この圧縮した周
波数信号を逆ＦＦＴ演算回路１４にて、時系列信号に戻
した後に、ディジタルアナログ変換器１５にてアナログ
信号に変換し、ノイズ抑圧した広帯域音声として出力す
る。The setting of the frequency component to be left as the effective component is to select the maximum power of the section to be thinned out. Frequency components shown in black in the figure are components to be left as effective components,
Components shown in white are components to be deleted by thinning. That is, the frequency component to be deleted is a portion that does not significantly affect the wideband audio signal even if the frequency component is deleted, and is regarded as a signal having a large specific gravity of the noise component. These thinning processes are performed by the compression means 13. Further, after the compressed frequency signal is returned to a time-series signal by the inverse FFT operation circuit 14, it is converted to an analog signal by the digital-to-analog converter 15 and output as a noise-suppressed wideband sound.

【００２１】実施例の内でノイズ抑圧装置に関しては本
実施例の対象として市場で個別に商品として取り引きす
ることができ、利用者は音声信号伝送装置の送信装置に
組み込んで使用することができる。Among the embodiments, the noise suppressing device can be individually traded in the market as a target of the present embodiment, and the user can use it incorporated in the transmitting device of the voice signal transmitting device.

【００２２】次に、本発明のノイズ抑圧装置を使用して
帯域圧縮して送信し、その送信信号を受信し帯域伸張す
る広帯域音声信号のアナログ伝送方式について説明す
る。Next, a description will be given of an analog transmission system of a wideband audio signal in which the band is compressed and transmitted using the noise suppressing apparatus of the present invention, the transmission signal is received, and the band is extended.

【００２３】このアナログ伝送方式では、広帯域音声を
電話帯域に帯域圧縮してアナログ伝送し、受信側の帯域
伸張により広帯域音声を再生する。帯域圧縮は、広帯域
音声の中高域周波数成分を間引くことにより電話帯域内
に圧縮する。帯域伸張は、圧縮音声の低域成分からピッ
チ検出を行い、ピッチの整数倍に近い周波数位置に中高
域成分を再配置して伸張する。１２ｋＨｚ帯域音声を用
いた計算機シミュレーションに基づく５段階ＭＯＳ値
（Mean Opinion Score）の主観評価結果を用いて、電話
帯域音声２．４より評価の高い３．１が得られることを
示す。この方式により、既存のアナログ電話回線を用い
て、より肉声に近い通話のできる広帯域音声会議端末を
実現できる。In this analog transmission system, a wideband voice is band-compressed into a telephone band and transmitted analog, and the wideband voice is reproduced by band expansion on the receiving side. In the band compression, broadband speech is compressed into the telephone band by thinning out middle and high frequency components. In band expansion, pitch detection is performed from the low-frequency component of the compressed voice, and the middle-high frequency component is rearranged and expanded at a frequency position close to an integral multiple of the pitch. Using a subjective evaluation result of a five-stage MOS value (Mean Opinion Score) based on a computer simulation using a 12 kHz band voice, it is shown that 3.1 having a higher evaluation than the telephone band voice 2.4 can be obtained. According to this method, it is possible to realize a broadband audio conference terminal capable of making a call closer to the real voice using an existing analog telephone line.

【００２４】通信での使用を主とした約４ｋＨｚ帯域の
電話帯域音声符号化では、回線使用の効率化をはかるべ
く従来の６４ｋｂｐｓのＰＣＭ伝送から３２ｋｂｐｓの
ＡＤＰＣＭ、１６ｋｂｐｓのＭＰＣ（Multi Pulse Code
c)、８ｋｂｐｓのＣＥＬＰ（Codebook Excited LPC) 方
式等の符号化技術へと低ビットレート化が進んでいる
（丸善発行のマルチメディア符号化の国際標準、1991.
5、安田浩編著）。また、電話帯域より広い帯域を対象
とする符号化では、７ｋＨｚ帯域を６４ｋｂｐｓで伝送
するＳＢＡＤＰＣＭ（Sub-band ADPCM) が代表的であ
る。さらに、最近では約２０ｋＨｚのオーディオ帯域を
約１／６〜１／１２に圧縮符号化できるＭＰＥＧ（Moti
on Picture Expert Group)方式もＩＳＯで標準化され
（テレビジョン学会誌、Vol.46,No.9,pp.1072-1075,199
2)、伝送端末での実用化も検討されつつある。音声通信
の分野においても、電話帯域より広い帯域を伝送するシ
ステムへの要求が高まっている。In the telephone band voice coding of about 4 kHz band mainly for use in communication, in order to improve the efficiency of line use, 32 kbps ADPCM and 16 kbps MPC (Multi Pulse Code) are used from conventional 64 kbps PCM transmission.
c), encoding techniques such as the 8 kbps CELP (Codebook Excited LPC) scheme are being reduced in bit rate (International standard for multimedia encoding published by Maruzen, 1991.
5, edited by Hiroshi Yasuda). In coding for a band wider than the telephone band, SBADPCM (Sub-band ADPCM) for transmitting a 7 kHz band at 64 kbps is typical. Furthermore, recently, MPEG (Moti) capable of compressing and encoding an audio band of about 20 kHz to about 1/6 to 1/12 is used.
on Picture Expert Group) standardized by ISO (Television Society of Japan, Vol. 46, No. 9, pp. 1072-1075, 199).
2) Practical use in transmission terminals is also being studied. In the field of voice communication, there is an increasing demand for a system that transmits a band wider than the telephone band.

【００２５】これらの圧縮技術は、すべてディジタル符
号化技術であり、加入者伝送路に適用するには、ＩＳＤ
Ｎ等のディジタル回線が対象となる。しかし、現在の加
入者線路におけるディジタル回線の普及率は高いとは言
えず、従来からのアナログ電話回線での通信が大半を占
めている。一方、アナログ電話回線で広帯域音声を伝送
する場合、ディジタル符号化された広帯域音声を、高速
モデムで用いて伝送する方式が考えられる。しかし、現
在の最高速モデムは約２０ｋｂｐｓ程度の通信スピード
であるため、実現性は低い。All of these compression techniques are digital encoding techniques, and to be applied to a subscriber transmission path, an ISD
Digital networks such as N are targeted. However, it cannot be said that the penetration rate of the digital line in the current subscriber line is high, and the communication through the conventional analog telephone line occupies most. On the other hand, when transmitting broadband voice over an analog telephone line, a method of transmitting digitally coded broadband voice using a high-speed modem is conceivable. However, since the current highest speed modem has a communication speed of about 20 kbps, its feasibility is low.

【００２６】ここでは、通常約４ｋＨｚ以下の帯域しか
伝送できない既存のアナログ電話回線を対象とした広帯
域の音声通話を可能とする伝送方式について説明する。
まず、この伝送方式の原理を説明し、次にシミュレーシ
ョンによる再生音声の主観評価結果を用いてこのアナロ
グ伝送方式の有効性を示す。Here, a description will be given of a transmission system which enables a wideband voice call for an existing analog telephone line which can normally transmit only a band of about 4 kHz or less.
First, the principle of this transmission system will be described, and then, the effectiveness of this analog transmission system will be shown using subjective evaluation results of reproduced sound by simulation.

【００２７】（Ａ）アナログ伝送方式の構成図３はアナログ伝送方式のブロック構成図であり、二線
式伝送装置に適用した例を示す。図３において、送信側
では、アナログディジタル変換器１１で広帯域音声のア
ナログ信号をディジタル信号に変換しＦＦＴ演算回路１
２でこの時系列信号を周波数領域に変換する。次に、圧
縮手段１３で周波数領域上の広帯域信号を後述する帯域
圧縮処理により電話帯域まで圧縮する。さらに、逆ＦＦ
Ｔ演算回路１４で再び時間領域に戻したあと、ディジタ
ルアナログ変換器１５でディジタル信号からアナログ信
号に変換し、アナログの圧縮音声として伝送する。(A) Configuration of Analog Transmission System FIG. 3 is a block diagram of the analog transmission system, showing an example applied to a two-wire transmission device. In FIG. 3, on the transmitting side, an analog-to-digital converter 11 converts an analog signal of a wideband voice into a digital signal, and converts the analog signal into a digital signal.
In step 2, the time series signal is converted to the frequency domain. Next, the compression means 13 compresses the wideband signal in the frequency domain to the telephone band by a band compression process described later. Furthermore, inverse FF
After returning to the time domain again by the T operation circuit 14, the digital-to-analog converter 15 converts the digital signal into an analog signal and transmits it as analog compressed voice.

【００２８】受信側では、到来する圧縮音声をアナログ
ディジタル変換し、ＥＱＬ（回線損失補償）回路５２で
回線の周波数損失性を補償した後に、ＦＦＴ演算回路５
３で周波数領域への変換を行う。伸張手段５５ではこの
信号から、ピッチ検出回路５４で検出されるピッチの値
に基づいて、後述する帯域伸張処理を施す。さらに、逆
ＦＦＴ演算回路５６で再び時間領域に戻し、ディジタル
アナログ変換器５７で伸張音声として再生する。On the receiving side, the incoming compressed sound is converted from analog to digital, and the frequency loss of the line is compensated by an EQL (line loss compensation) circuit 52.
In step 3, conversion to the frequency domain is performed. The expansion unit 55 performs a band expansion process described later based on the pitch value detected by the pitch detection circuit 54 from this signal. Further, the signal is returned to the time domain again by the inverse FFT operation circuit 56, and is reproduced by the digital-to-analog converter 57 as expanded voice.

【００２９】（Ｂ）帯域圧縮処理図４はアナログ伝送方式の帯域圧縮処理を示すフローチ
ャートである。図５はアナログ伝送方式の帯域圧縮処理
における分割区間と間引き率とを示す図である。図４に
おいて、帯域圧縮処理は、フレーム処理、窓かけ処理、
ＤＦＴ（Discrete Fourier Transform、離散フーリェ変
換）間引き処理、ＩＤＦＴ（逆ＤＦＴ）、オーバラップ
処理からなる。(B) Band Compression Processing FIG. 4 is a flowchart showing band compression processing of the analog transmission system. FIG. 5 is a diagram showing a divided section and a thinning rate in the band compression processing of the analog transmission method. In FIG. 4, band compression processing includes frame processing, windowing processing,
It comprises DFT (Discrete Fourier Transform, Discrete Fourier Transform) thinning processing, IDFT (Inverse DFT), and overlap processing.

【００３０】まず、時系列データから有限個のデータを
抜き出すフレーム処理を行い取り出されたフレーム内の
データに対する周波数領域への変換をＤＦＴにより行
う。ＤＦＴを行うにあたり、連続周期性を欠如しないよ
う時間窓関数を乗じる必要がある（窓欠け処理）。時間
窓関数には多々の窓関数があり目的に即した窓関数を選
択しなければならない（Prentice-Hall,1975、Theory a
nd applications of digital signal processing、L.R.
Rabiner and B.Gold) 。このアナログ伝送方式では、フ
レーム境界歪を軽減するオーバラップ処理を単純な加算
で実現するため、台形窓あるいは三角窓を用いる。First, frame processing for extracting a finite number of data from the time-series data is performed, and the data in the extracted frame is converted into the frequency domain by DFT. In performing DFT, it is necessary to multiply by a time window function so as not to lack continuous periodicity (window missing processing). There are many window functions in the time window function, and it is necessary to select a window function suitable for the purpose (Prentice-Hall, 1975, Theory a
nd applications of digital signal processing, LR
Rabiner and B. Gold). In this analog transmission system, a trapezoidal window or a triangular window is used in order to realize an overlap process for reducing frame boundary distortion by simple addition.

【００３１】また、帯域圧縮および帯域伸張を行うにあ
たり、音声の周波数特性を考慮した周波数間引き区間と
間引き率とを送受信の端末間で取り決めておく必要があ
る。音声の長時間平均周波数スペクトルは通常、８００
Ｈｚまでほぼ平坦で、８００Ｈｚ以上はオクターブ当た
り−１０ｄＢの傾斜を持つ周波数特性で近似されること
が多い（研究実用化報告、Vol.4 、No.2,pp.245-262,19
55.1、音声の瞬時レベル分布およびスペクトル、三浦種
敏、越川常治）。この高域減衰特性を適用し、図５に示
されるような分割区間毎の間引き率を用いる。図５中の
横軸は周波数を示し、記されている分数は間引き率を表
している。例えば、１／４が記されている分割区分で
は、周波数領域のサンプル数を１／４に間引き、１／４
の圧縮音声サンプルを得る。この操作は、１／４の帯域
幅に圧縮することを示している。１／１が記されている
低域の分割区間は間引きを行わず、中域から高域にかけ
て間引き率を上げていく。Further, when performing band compression and band expansion, it is necessary to determine a frequency thinning section and a thinning rate in consideration of the frequency characteristics of voice between the transmitting and receiving terminals. The long-term average frequency spectrum of speech is typically 800
Hz is almost flat, and 800 Hz or more is often approximated by frequency characteristics having a slope of -10 dB per octave (Research and practical use report, Vol.4, No.2, pp.245-262,19).
55.1, Instantaneous level distribution and spectrum of speech, T. Miura, J. Koshikawa). Applying this high-frequency attenuation characteristic, a thinning rate for each divided section as shown in FIG. 5 is used. The horizontal axis in FIG. 5 indicates the frequency, and the indicated fraction indicates the thinning rate. For example, in a division section in which 1/4 is described, the number of samples in the frequency domain is thinned out to 1/4, and 1/4
To obtain a compressed audio sample of. This operation indicates compression to 1/4 of the bandwidth. In the low-frequency divided section in which 1/1 is described, the thinning rate is increased from the middle range to the high range without thinning.

【００３２】このような分割区分および間引き率によ
り、帯域圧縮における周波数成分毎の移動位置が送受信
間で定められる。例えば、間引き率１／４の分割区間で
は、伸張帯域における４サンプル幅の周波数範囲に対し
て、圧縮帯域内の一つの周波数位置が定められている。
このアナログ伝送方式では、周波数成分の移動で周波数
値を逆転させることのない昇順の配置とした。The moving position for each frequency component in the band compression is determined between the transmission and the reception by the division and the thinning rate. For example, in a division section with a thinning rate of 1/4, one frequency position in the compression band is determined for a frequency range of 4 sample widths in the extension band.
In this analog transmission system, the arrangement is performed in an ascending order without inverting the frequency value by moving the frequency component.

【００３３】間引き処理では、まずＤＦＴ演算により得
られた実部および虚部の値から、その２乗和をとり周波
数成分毎のスペクトルパワーを求め、最大パワーとなっ
た成分を、伝送すべき有効な周波数成分として選び出
す。次に、その有効成分を後述する位相補正を行った後
に、送受信間で定められた電話帯域内へ移動される。例
えば、ＤＦＴ演算の周波数成分数で３２サンプルとなる
帯域幅をもつ分割区間において、８サンプルの周波数成
分となる帯域に圧縮する場合を考える。まず、４サンプ
ルごとにスペクトルパワーを求め、最大パワーとなった
１サンプルの周波数成分を有効とし、定められた電話帯
域内の周波数位置に移動させる。これを８回繰り返し、
８サンプルの有効成分を取り出すことで、１／４の帯域
圧縮を行うことになる。また、２／３の圧縮を行う分割
区間では、３サンプルごとにパワーを求め、最小パワー
でない成分の２サンプルを有効成分として圧縮を行う。In the decimation process, first, from the values of the real part and the imaginary part obtained by the DFT operation, the sum of the squares is calculated to obtain the spectral power for each frequency component. Frequency components. Next, after the effective component is subjected to a phase correction, which will be described later, the effective component is moved into a telephone band determined between transmission and reception. For example, let us consider a case in which a divided section having a bandwidth of 32 samples in the number of frequency components of the DFT operation is compressed to a frequency component of 8 samples. First, the spectrum power is obtained for every four samples, the frequency component of one sample having the maximum power is made valid, and the spectrum component is moved to a frequency position within a predetermined telephone band. Repeat this eight times,
By extracting the effective components of eight samples, the band is compressed by 1/4. In a divided section where 2/3 compression is performed, power is obtained for every three samples, and compression is performed using two samples of components that are not the minimum power as effective components.

【００３４】このような周波数成分の間引きにより帯域
を圧縮した信号をＩＤＦＴにより時系列信号に戻した
後、オーバラップ処理で隣接したフレームが重なり合う
区間データをそれぞれ加算し、連続的な圧縮信号を得
る。After the signal whose band has been compressed by thinning out such frequency components is returned to a time-series signal by IDFT, section data in which adjacent frames overlap by an overlap process are added to obtain a continuous compressed signal. .

【００３５】（Ｃ）帯域伸張処理図６はアナログ伝送方式の帯域伸張処理を示すフローチ
ャートである。図６において、帯域伸張処理も帯域圧縮
処理と同様に、有限個のデータを抜き出すフローチャー
ト処理および時間窓関数を乗じる窓かけ処理を施した後
に、ＤＦＴ演算を行う。帯域伸張は、帯域圧縮処理で移
動した周波数成分を、フレーム内のピッチ検出結果に基
づき、その整数倍の値に最も近い周波数位置へ移動させ
る。なお、周波数成分の移動においては、圧縮時と同様
に後述の位相補正を施す必要がある。ピッチ検出は、フ
レームデータに対するローパスフィルタ（ＬＰＦ）を介
して、自己相関法（オーム社発行の音声情報処理の基
礎、１９８１、斎藤收三、中田和男）により求められ
る。さらに、帯域圧縮処理と同様に、時系列信号に戻す
ＩＤＦＴ演算とオーバラップ処理を施すことで、連続的
な伸張信号を得られる。(C) Band Expansion Processing FIG. 6 is a flowchart showing the band expansion processing of the analog transmission system. In FIG. 6, similarly to the band compression process, the band expansion process performs a DFT operation after performing a flowchart process for extracting a finite number of data and a windowing process for multiplying by a time window function. In the band expansion, the frequency component moved in the band compression process is moved to a frequency position closest to an integer multiple of the frequency component based on the pitch detection result in the frame. In the movement of the frequency component, it is necessary to perform the phase correction described later, as in the compression. Pitch detection is obtained by an autocorrelation method (Basics of speech information processing issued by Ohmsha, 1981, Shozo Saito, Kazuo Nakada) via a low-pass filter (LPF) for frame data. Further, similarly to the band compression processing, a continuous expanded signal can be obtained by performing an IDFT operation for returning to a time-series signal and an overlap processing.

【００３６】（Ｄ）周波数成分の移動とオーバラップ処
理における位相補正図７はアナログ伝送方式の周波数成分移動による位相ず
れを示す図である。(D) Movement of Frequency Components and Phase Correction in Overlap Processing FIG. 7 is a diagram showing a phase shift due to movement of frequency components in the analog transmission system.

【００３７】帯域圧縮および伸張処理時に必要な周波数
成分の移動において、フレームオーバラップ処理を加算
とする場合には、周波数成分毎の位相補正処理が必要と
なる。位相補正を行わない最悪のケースでは、オーバラ
ップ区間の加算フレームで移動された周波数成分の位相
差がπとなった場合に、それらのフレームを加算するこ
とでオーバラップ区間内の周波数成分を０にすることに
なる。図７に１／２フレームオーバラップの場合におけ
る位相ずれの様子を示す。Ｊ番目フレームと（Ｊ＋１）
番目のフレーム内で周波数成分を移動した場合の信号波
形であり、オーバラップ区間において、図７（ａ）はπ
の位相差を生じる周波数成分の移動の例、図７（ｂ）は
位相差を生じない周波数成分の移動の例である。図７中
のＦ（２）、Ｆ_J（３）、Ｆ_J+1（４）等は、ＤＦＴ演
算における周波数成分であり、その周波数位置を２、
３、４のインデックスで示している。図７（ａ）では周
波数位置２の周波数成分Ｆ（２）を、各フレーム内でそ
のまま周波数位置３に移動させた時に再生される波形を
表している。Ｊ番目と（Ｊ＋１）番目フレームにおける
処理で再生される波形Ｆ_J（３）とＦ_J+1（３）のオー
バラップ区間に位相ずれπが表されている。図７（ｂ）
では、同様にＦ（４）への移動であるがオーバラップ区
間の位置ずれはない。In the case of adding the frame overlap processing to the movement of the frequency components required for the band compression and expansion processing, a phase correction processing for each frequency component is required. In the worst case where the phase correction is not performed, when the phase difference between the frequency components shifted in the addition frame in the overlap section becomes π, the frequency components in the overlap section are reduced to 0 by adding those frames. It will be. FIG. 7 shows a state of a phase shift in the case of 1/2 frame overlap. J-th frame and (J + 1)
FIG. 7A shows a signal waveform when a frequency component is moved in the フレーム th frame, and FIG.
FIG. 7B shows an example of the movement of a frequency component that does not cause a phase difference. F (2), F _J (3), F _{J + 1} (4) and the like in FIG. 7 are frequency components in the DFT operation, and the frequency position is 2,
Indicated by 3 and 4 indices. FIG. 7A shows a waveform reproduced when the frequency component F (2) at the frequency position 2 is directly moved to the frequency position 3 in each frame. A phase shift π is shown in the overlap section between the waveforms F _J (3) and F _{J + 1} (3) reproduced in the processing in the J-th and (J + 1) -th frames. FIG. 7 (b)
Then, the movement is similarly to F (4), but there is no displacement in the overlap section.

【００３８】位相差は、オーバラップ幅と周波数軸上で
の移動距離に関係しており、次のように求めることがで
きる。The phase difference is related to the overlap width and the moving distance on the frequency axis, and can be obtained as follows.

【００３９】まず、Ｎ個のデータｆ（ｎ）、（ｎ＝０、
…、Ｎ−１）とＮ個の周波数成分Ｆ（κ）、（κ＝０、
…、Ｎ−１）におけるＤＦＴおよびＩＤＦＴの定義式
は、First, N pieces of data f (n), (n = 0,
.., N−1) and N frequency components F (κ), (κ = 0,
, N-1) are defined as:

【００４０】[0040]

【数１】である。(Equation 1) It is.

【００４１】いま、フレームとして切り出す前の時間領
域ｉ、（ｉ＝０、…、∞）における入力信号ｇ（ｉ）
を、周波数位置を示す整数ｕの位置に複素数Ｇ（ｕ）の
周波数成分のみをもつものと仮定すればＩＤＦＴの定義
から、Now, the input signal g (i) in the time domain i, (i = 0,..., ∞) before being cut out as a frame.
Is assumed to have only the frequency component of the complex number G (u) at the position of the integer u indicating the frequency position, from the definition of IDFT,

【００４２】[0042]

【数２】となる。これはスペクトル１本に相当する。(Equation 2) Becomes This corresponds to one spectrum.

【００４３】時間領域ｉにおける入力信号の０からＮサ
ンプル切り出した信号を第１フレームとして、フレーム
内の時間領域ｎにおけるＦ₁（ｎ）とすれば、ｆ₁（ｎ）＝ｇ（ｉ），（ｎ＝ｉ：０≦ｎ＜Ｎ）（５）と表される。これをＤＦＴすると、式（６）に示される
ように周波数位置ｕにＧ（ｕ）の成分をもち、他の周波
数位置の成分は０となる。Assuming that a signal obtained by cutting out N samples of the input signal from 0 in the time domain i is the first frame and F ₁ (n) in the time domain n in the frame, f ₁ (n) = g (i), (N = i: 0 ≦ n <N) (5) When this is subjected to DFT, a component of G (u) is present at a frequency position u as shown in Expression (6), and components at other frequency positions become zero.

【００４４】[0044]

【数３】ここで、位置ｕの代わりに、位置γに成分Ｇ（ｕ）を有
する信号をＦ₁'(κ) とすると、(Equation 3) Here, assuming that a signal having a component G (u) at a position γ is F ₁ ′ (κ) instead of the position u,

【００４５】[0045]

【数４】となる。Ｆ₁'(κ) に対して、ＩＤＦＴを行いその信号
をｆ₁'(ｎ) で表すと、(Equation 4) Becomes IDFT is performed on F ₁ ′ (κ), and the signal is represented by f ₁ ′ (n).

【００４６】[0046]

【数５】となる。これを式（５）の条件式ｎ＝ｉを用いて、もと
の時間領域ｉに戻し、第１フレームでの再生信号ｇ₁'
(ｉ) が式（９）のとおり得られる。(Equation 5) Becomes This is returned to the original time domain i by using the conditional expression n = i in Expression (5), and the reproduced signal g ₁ ′ in the first frame is obtained.
(i) is obtained as in equation (9).

【００４７】[0047]

【数６】次に、第１フレームに対してＭサンプル（０＜Ｍ＜Ｎ）
ずれた点からを第２フレームとし、入力信号ｇ（ｉ）か
らＮサンプル切出したものをｆ₂（ｎ) として表すと次
式を得る。(Equation 6) Next, M samples (0 <M <N) for the first frame
The following equation is obtained when the shifted point is defined as the second frame, and a signal obtained by cutting out N samples from the input signal g (i) is represented as f ₂ (n).

【００４８】ｆ₂（ｎ），（ｎ＝ｉ−Ｍ：０≦ｎ≦Ｎ）（１０）ｆ₂（ｎ）をＤＦＴし、式（４）および式（１０）を適
用すると、F ₂ (n), (n = i−M: 0 ≦ n ≦ N) (10) By performing DFT on f ₂ (n) and applying equations (4) and (10),

【００４９】[0049]

【数７】となる。周波数ｕ以外の成分は０であり、ｆ₂（ｕ) だ
けが周波数成分をもつ。ここで、式（７）と同様に、次
式でＦ₂'(κ) を表す。(Equation 7) Becomes Components other than the frequency u are 0, and only f ₂ (u) has a frequency component. Here, similarly to the equation (7), F ₂ ′ (κ) is represented by the following equation.

【００５０】[0050]

【数８】Ｆ₂'(κ) に対して、ＩＤＦＴを行うその信号をＦ₂'
(ｎ) で表すと、(Equation 8) For F ₂ ′ (κ), the signal for performing IDFT is given by F ₂ ′
Expressed by (n),

【００５１】[0051]

【数９】となる。これを式（１０）の条件式ｎ＝ｉ−Ｍを用い
て、もとの時間領域ｉに戻し、第２フレームでの再生信
号ｇ₂'(ｉ) が得られる。(Equation 9) Becomes This is returned to the original time domain i by using the conditional expression n = i−M in Expression (10), and a reproduced signal g ₂ ′ (i) in the second frame is obtained.

【００５２】[0052]

【数１０】第一フレームでの再生信号ｇ₁'(ｉ) と第２フレームで
の再生信号ｇ₂'(ｉ) との位相差θは、式（９）と式
（１４）から、(Equation 10) The phase difference θ between the reproduced signal g ₁ ′ (i) in the first frame and the reproduced signal g ₂ ′ (i) in the second frame is given by Expression (9) and Expression (14).

【００５３】[0053]

【数１１】となる。ただしａｒｇは複素数から位相角をとりだす関
数である。位相差θはＭに依存するオーバラップ幅と周
波数成分の移動距離（γ−ｕ）とによるものであること
がわかる。例えば、Ｍ＝Ｎ３／４となる１／４フレーム
オーバラップの位相差θ_1/4は、 θ_1/4＝−３π（γ−ｕ）／２（１６）となる。周波数成分の移動距離０、１、２、３、４、
５、…に対しては、θ_1/4が０、π／２、π、３π／
２、０、π／２、…となる。すなわち、理想的な位相差
０を実現するためには、移動距離に対して４の剰余で処
理される位相補正が必要となる。また、Ｍ＝Ｎ／２とな
る１／２フレームオーバラップでの位相差θ_1/2は、 θ_1/2＝−π（γ−ｕ）（１７）となり、周波数成分の移動距離０、１、２、３、…に対
して、θ_1/2＝０、π、０、π、…が得られる。これ
は、ＤＦＴの演算結果における周波数位置を示すインデ
ックス（ｕおよびγ）の値が、奇数値と偶数値との間の
移動にあたる時のみπの位相差を生じると解釈できる。[Equation 11] Becomes Here, arg is a function that extracts a phase angle from a complex number. It can be seen that the phase difference θ is due to the overlap width depending on M and the moving distance (γ-u) of the frequency component. For example, the phase difference θ _1/4 of １／ frame overlap where M = N3 / 4 is θ _1/4 = −3π (γ−u) / 2 (16). Moving distances of frequency components 0, 1, 2, 3, 4,
For..., Θ _1/4 is 0, π / 2, π, 3π /
2, 0, π / 2,... That is, in order to realize an ideal phase difference of 0, it is necessary to perform a phase correction that is processed with a remainder of 4 with respect to the moving distance. The phase difference θ _1/2 at 1/2 frame overlap where M = N / 2 is θ _1/2 = −π (γ−u) (17), and the moving distance of the frequency component is 0, 1 , 2, 3,..., Θ _1/2 = 0, π, 0, π,. This can be interpreted as that a phase difference of π occurs only when the value of the index (u and γ) indicating the frequency position in the DFT operation result corresponds to the movement between the odd value and the even value.

【００５４】これらの位相補正は、基本的には、毎フレ
ーム行うことが必要である。いま、隣接するフレーム間
での位相差がθで、（Ｊ−１）番目フレームとＪ番目の
フレームとの加算に対する位相補正をＪ番目のフレーム
で行うことを考える。ただし、ＪはＪ＞１の自然数とす
る。ここで、（Ｊ＋１）番目フレームにおいてはＪ番目
フレームに対する位相差θを補正するが、Ｊ番目フレー
ム自体既に（Ｊ−１）番目フレームに対する位相差θが
補正されていることに着目すると、（Ｊ＋１）番目フレ
ームにおける位相補正は２θとなる。すなわち、（Ｊ−
１）番目フレームに対する（Ｊ＋１）番目フレームにお
ける位相補正は、１＝０、１、２、３、４、…、Ｌに対
してθ、２θ、３θ、４θ、５θ、…、（Ｌ＋１）θと
累積されることになる。These phase corrections basically need to be performed for each frame. Now, it is assumed that the phase difference between adjacent frames is θ, and the phase correction for the addition of the (J−1) th frame and the Jth frame is performed in the Jth frame. Here, J is a natural number satisfying J> 1. Here, in the (J + 1) -th frame, the phase difference θ with respect to the J-th frame is corrected, but if the J-th frame itself is already corrected for the phase difference θ with respect to the (J−1) -th frame, (J + 1) The phase correction in the ()) th frame is 2θ. That is, (J-
The phase correction in the (J + 1) -th frame with respect to the 1) -th frame is as follows: 1 = 0, 1, 2, 3, 4,..., L, θ, 2θ, 3θ, 4θ, 5θ,. Will be cumulative.

【００５５】例えば、周波数上の移動距離が１で、１／
４フレームオーバラップの場合には、θ_1/4＝π／２で
あるため、θ、２θ、３θ、４θ、５θ、…と累積する
位相補正量は、π／２、π、３π／２、０、π／２、３
π／２、…となる。同様に、１／２フレームオーバラッ
プの場合には、θ_1/2＝πであるため、θ、２θ、３
θ、４θ、５θ、…と累積する位相補正量は、π、０、
π、０、π、…となる。これは、１／２フレームオーバ
ラップの場合には１フレームおき毎に位相補正を行えば
良いことを表しており、１／４フレームオーバラップの
場合より簡単な処理で位相補正を行うことができる。For example, if the moving distance on the frequency is 1, 1 /
In the case of 4-frame overlap, since θ _1/4 = π / 2, the phase correction amounts accumulated as θ, 2θ, 3θ, 4θ, 5θ,... Are π / 2, π, 3π / 2,. 0, π / 2, 3
π / 2, ... Similarly, in the case of _1/2 frame overlap, since θ _1/2 = π, θ, 2θ, 3
.., 0,...
π, 0, π,... This means that phase correction should be performed every other frame in the case of 1/2 frame overlap, and the phase correction can be performed with simpler processing than in the case of 1/4 frame overlap. .

【００５６】（Ｅ）シミュレーションシミュレーション設定条件図８はアナログ伝送方式の１２ｋＨｚ帯域の広帯域音声
と電話帯域（３．４ｋＨｚ帯域）音声との間で帯域圧縮
および帯域伸張を行ったシミュレーションの設定条件を
示す図である。(E) Simulation Simulation Setting Conditions FIG. 8 shows simulation setting conditions in which band compression and band expansion are performed between a 12-kHz wideband voice and a telephone-band (3.4 kHz) voice in the analog transmission system. FIG.

【００５７】図９はアナログ伝送方式のシミュレーショ
ンに用いた分割帯域と間引き率との関係を示す図であ
る。（Ｂ）節で述べた周波数成分の移動位置は、図９に
より定められている。例えば、１／４の間引き率の区間
では、広帯域音声の周波数位置９６、９７、９８、９９
の四つに対して圧縮音声の周波数位置６９、また１０
０、１０１、１０２、１０３に対して７０が定められて
いることを示している。FIG. 9 is a diagram showing the relationship between the divided bands used in the simulation of the analog transmission system and the thinning rate. The moving position of the frequency component described in the section (B) is determined by FIG. For example, in the section of the の間 thinning rate, the frequency positions 96, 97, 98, 99
Frequency positions 69 and 10 for the compressed voice for the four
This indicates that 70 is defined for 0, 101, 102, and 103.

【００５８】（Ｆ）主観評価結果図１０はアナログ伝送方式の伸張音声の主観評価結果を
示す図である。(F) Subjective Evaluation Results FIG. 10 is a diagram showing the subjective evaluation results of the expanded voice of the analog transmission system.

【００５９】シミュレーションにより帯域圧縮・帯域伸
張を施された音声を５段階評価のＭＯＳ値（Mean Opini
on Score) を用いて主観評価を行った。入力信号とし
て、単独音声および、音声会議を想定した音声と背景雑
音またはＢＧＭ（Back GroundMusic)の加算信号を用い
た。背景雑音には一般的な事務所の周囲雑音を、ＢＧＭ
には弦楽四重奏の楽曲を使用している。評定者には、評
価点５の基準信号として１２ｋＨｚ帯域の信号を視聴さ
せた。また評価は、ダブルブラインド方式（Swedish Br
oadcasting Corporation(SR)Research and Development
Department,1991.5、The SR Report on The MPEG/Audi
o Subjective Listening Test 、Sten Bergman,Christe
r Grewin,Thomas Ryden)を用い、ＣＣＩＲＲｅｃ．５
６２（International Radio Consultative Committee
C.C.I.R. Recommendation No.562,1990) のグレードス
ケールにより行った。音声評価に関してエキスパートで
はない一般の評定者１４名による主観評価結果をＭＯＳ
値にて図１０に示す。The voice which has been subjected to the band compression / band expansion by the simulation is evaluated with a MOS value (Mean Opini
on Score). As an input signal, a single voice and, voice conference was speech and background noise or that assumes was using the added signal of the BGM (Back GroundMusic). Background noise includes general office ambient noise, BGM
Uses string quartet music. The evaluator watched a signal in a 12 kHz band as a reference signal of evaluation point 5. The evaluation was based on the double blind method (Swedish Br
oadcasting Corporation (SR) Research and Development
Department, 1991.5, The SR Report on The MPEG / Audi
o Subjective Listening Test, Sten Bergman, Christe
r Grewin, Thomas Ryden) using CCIR Rec. 5
62 (International Radio Consultative Committee
(CCIR Recommendation No.562,1990). MOS based on subjective evaluation results by 14 general raters who are not experts in voice evaluation
The values are shown in FIG.

【００６０】このアナログ伝送方式による伸張音声で
は、それぞれのサンプルにおいて、既存の電話帯域音声
よりも高い評価が得られている。また、サンプル自体に
高域周波数成分が少なかった男声サンプルでの評価結果
は、女声サンプルに比較するとやや低く、電話帯域音声
との差が縮まっている。しかし、帯域の広い楽曲を付加
した男声＋ＢＧＭのサンプルでは、このアナログ伝送方
式のＭＯＳ値が３．０と電話帯域音声２．２より高くな
っており、帯域を広げた本方式の優位性が認められる。In the expanded voice according to the analog transmission system, higher evaluation is obtained in each sample than in the existing telephone band voice. In addition, the evaluation result of the male voice sample in which the sample itself has few high frequency components is slightly lower than that of the female voice sample, and the difference from the telephone band voice is reduced. However, in the sample of male voice + BGM to which a song with a wide band is added, the MOS value of this analog transmission system is 3.0, which is higher than the telephone band voice 2.2, and the superiority of this system with an expanded band is recognized. Can be

【００６１】前述のように、本アナログ伝送方式は、広
帯域音声を電話帯域に帯域圧縮してアナログ伝送し、受
信側の帯域伸張により広帯域音声を再生する。帯域圧縮
では、広帯域音声の中高域周波数成分を間引くことによ
り電話帯域内に圧縮する。帯域伸張では、圧縮音声の低
域成分からピッチ検出を行い、ピッチの整数倍に近い周
波数位置に中高域成分を再配置して伸張する。As described above, in the present analog transmission system, a wideband voice is band-compressed into a telephone band and transmitted analog, and the wideband voice is reproduced by expanding the band on the receiving side. In band compression, broadband speech is compressed into the telephone band by thinning out middle and high frequency components. In the band expansion, pitch detection is performed from the low-frequency component of the compressed voice, and the mid-high frequency component is rearranged and expanded at a frequency position close to an integral multiple of the pitch.

【００６２】１２ｋＨｚ帯域音声を用いた計算機シミュ
レーションに基づく５段階ＭＯＳ値の主観評価結果を用
いて、電話帯域音声の評価２．４より評価の高い３．１
が得られることを示した。このアナログ伝送方式により
既存のアナログ電話回線を用いて、より肉声に近い通話
のできる広帯域音声会議端末が実現できる。Using a subjective evaluation result of a five-stage MOS value based on a computer simulation using a 12 kHz band voice, 3.1 which is higher in evaluation than a telephone band voice evaluation 2.4.
Was obtained. With this analog transmission system, a wideband audio conference terminal capable of making a call closer to the real voice can be realized using an existing analog telephone line.

【００６３】[0063]

【発明の効果】以上説明したように、本発明は、音声信
号に付加されたノイズを除去し音声の品質劣化を防止で
きる優れた効果がある。As described above, the present invention has an excellent effect of removing noise added to a voice signal and preventing voice quality deterioration.

[Brief description of the drawings]

【図１】本発明一実施例音声信号伝送装置のブロック構
成図。FIG. 1 is a block diagram of an audio signal transmission apparatus according to an embodiment of the present invention.

【図２】本発明の音声信号伝送装置の送信装置の広帯域
音声信号のスペクトルを示す図。FIG. 2 is a diagram showing a spectrum of a wideband audio signal of a transmission device of the audio signal transmission device of the present invention.

【図３】アナログ伝送方式のブロック構成図。FIG. 3 is a block diagram of an analog transmission system.

【図４】アナログ伝送方式の帯域圧縮処理を示すフロー
チャート。FIG. 4 is a flowchart showing band compression processing of the analog transmission system.

【図５】アナログ伝送方式の帯域圧縮処理における分割
区間と間引き率とを示す図。FIG. 5 is a diagram showing a division section and a thinning rate in a band compression process of an analog transmission system.

【図６】アナログ伝送方式の帯域伸張処理を示すフロー
チャート。FIG. 6 is a flowchart showing band extension processing of the analog transmission system.

【図７】アナログ伝送方式の周波数成分移動による位相
ずれを示す図。FIG. 7 is a diagram showing a phase shift due to frequency component movement in the analog transmission system.

【図８】アナログ伝送方式の１２ｋＨｚ帯域の広帯域音
声と電話帯域（３．４ｋＨｚ帯域）音声との間で帯域圧
縮および帯域伸張を行ったシミュレーションの設定条件
を示す図。FIG. 8 is a diagram showing setting conditions of a simulation in which band compression and band expansion are performed between a 12-kHz band wide band voice and a telephone band (3.4 kHz band) voice of the analog transmission system.

【図９】アナログ伝送方式のシミュレーションに用いた
分割帯域と間引き率との関係を示す図。FIG. 9 is a diagram illustrating a relationship between a divided band and a thinning rate used in a simulation of the analog transmission scheme.

【図１０】アナログ伝送方式の伸張音声の主観評価結果
を示す図。FIG. 10 is a diagram showing a subjective evaluation result of an expanded voice in the analog transmission system.

[Explanation of symbols]

１０ノイズ抑圧装置１１、５１アナログディジタル変換器１２、５３ＦＦＴ演算回路１３圧縮手段１４、５６逆ＦＦＴ演算回路１５、５７ディジタルアナログ変換器２０入力手段３０送信手段４０送信装置５０受信装置５２ＥＱＬ回路５４ピッチ検出回路５５伸張手段６０二線四線変換器 REFERENCE SIGNS LIST 10 noise suppression device 11, 51 analog-to-digital converter 12, 53 FFT operation circuit 13 compression means 14, 56 inverse FFT operation circuit 15, 57 digital-analog converter 20 input means 30 transmission means 40 transmission apparatus 50 reception apparatus 52 EQL circuit 54 Pitch detection circuit 55 Decompression means 60 Two-wire to four-wire converter

Claims

(57) [Claims]

1. Broadband voice to be transmitted is transmitted to a line.
A transmitting device, and receiving a transmission signal of the transmitting device from this line.
An audio signal transmission device comprising: a reception device that transmits the wideband audio in a time-series digital format.
Analog-to-digital converter that converts
The output digital signal of the log digital converter is divided by frequency
Conversion means for converting the signal into divided signals, the output of the conversion means
A compression method that thins out the high-frequency components of a signal and compresses the band
Stage, the output signal of this compression means is converted into a time-series digital signal.
Conversion means for converting to a signal, and the output of the inverse conversion means
Digital-to-analog conversion that converts signals to analog signals
Equipped with a noise suppression device, The conversion means is configured to output the analog-to-digital converter
Extraction of finite number of data from digital signal
Processing and windowing processing, and within this processed frame
Fast Fourier transform of the frequency domain
Including a fast Fourier transform operation circuit performed by The inverse transform means outputs the fast Fourier transform operation circuit.
Time-series digitization of force signal by inverse fast Fourier transform
Inverse fast Fourier to convert to overlapped signal
Including a conversion operation circuit, The compression means sets a thinning rate in a section where the high frequency is thinned.
High and leave the frequency component with the highest power in that section
Includes means to select as active ingredient and compress bandwidth That
Characteristic audio signal transmission device.

2. An analog-to-digital converter for converting a wideband voice into a time-series digital signal, a conversion unit for converting an output digital signal of the analog-to-digital converter into a frequency-divided signal, and an output signal of the conversion unit. Compression means for thinning out the high-frequency components and compressing the band, inverse conversion means for converting the output signal of the compression means into a time-series digital signal, and digital analog for converting the output signal of the inverse conversion means to an analog signal In a noise suppression device including a converter, the conversion means takes out a finite number of data from the digital signal from the analog-to-digital converter, performs frame processing and windowing processing, and performs frequency processing on the data in the processed frame. A fast Fourier transform operation circuit that performs the The inverse transform unit includes an inverse fast Fourier transform operation circuit that converts an output signal of the fast Fourier transform operation circuit into a time-series digital signal by inverse fast Fourier transform and performs an overlap process, and the compression unit includes: A noise suppression device comprising: means for increasing a thinning rate in a section where high frequency is thinned, selecting a frequency component having a maximum power in the section as an effective component to be left, and compressing a band.