JP2652371B2

JP2652371B2 - Audio coding method

Info

Publication number: JP2652371B2
Application number: JP61158008A
Authority: JP
Inventors: 博喜内山; 和作山田
Original assignee: Ricoh Co Ltd
Current assignee: Ricoh Co Ltd
Priority date: 1986-07-07
Filing date: 1986-07-07
Publication date: 1997-09-10
Anticipated expiration: 2012-09-10
Also published as: JPS6314524A

Description

【発明の詳細な説明】［技術分野］本発明は、音声符号化方法に関する。Description: TECHNICAL FIELD The present invention relates to a speech encoding method.

［従来技術］例えば、高速デジタル回線を用いて音声信号を伝送し
たり、音声応答装置のために音声信号を蓄積および合成
するなど音声信号をデジタル処理するとき、この音声信
号をなんらかの方法でデジタル信号に変換する必要があ
る。[Prior Art] For example, when a voice signal is transmitted through a high-speed digital line or when a voice signal is digitally processed such as storing and synthesizing a voice signal for a voice response device, the voice signal is converted into a digital signal by some method. Need to be converted to

基本的には音声信号は周波数帯域が0.3〜3.4KHzのア
ナログ信号であり、これをデジタル信号に変換するに
は、例えばサンプリング周波数8KHzで分解能が８ビット
のアナログ／デジタル変換器で変換すればよい（PCM（P
ulse Code Modulation）符号化）。そして、このデジタ
ル信号を元の音声信号に戻すには、サンプリング周波数
8KHzで分解能が８ビットのデジタル／アナログ変換器で
アナログ信号に変換し、さらにローパスフィルタを通し
て波形整形してやればよい。このとき、アナログ／デジ
タル変換器およびデジタル／アナログ変換器の分解能
（すなわちPCM符号のビット幅）が大きいほど再生した
音声の品質が高い。Basically, an audio signal is an analog signal having a frequency band of 0.3 to 3.4 KHz. In order to convert this into a digital signal, for example, an analog / digital converter having a sampling frequency of 8 KHz and a resolution of 8 bits may be used. (PCM (P
ulse Code Modulation). Then, to return this digital signal to the original audio signal, the sampling frequency
What is necessary is to convert to an analog signal with a digital / analog converter having a resolution of 8 bits at 8 KHz, and to further shape the waveform through a low-pass filter. At this time, the higher the resolution of the analog / digital converter and the digital / analog converter (that is, the bit width of the PCM code), the higher the quality of the reproduced sound.

ところで、このようなPCM符号化された音声信号は１
秒あたりのビット速度（データ速度；以下ビットレート
という）が64Kbpsとなり、このように高いビットレート
の音声信号を伝送するには非常に高速な伝送路を必要と
し、また、かかる音声信号を蓄積するためには厖大な記
憶容量のメモリを必要とする。そこで、従来から、音声
信号のビットレートを低減するための種々の提案がなさ
れている。By the way, such a PCM-encoded audio signal is 1
The bit rate per second (data rate; hereinafter, referred to as bit rate) is 64 Kbps, and transmitting such a high bit rate audio signal requires a very high-speed transmission path, and accumulates such an audio signal. For this purpose, a memory having a huge storage capacity is required. Therefore, various proposals have conventionally been made for reducing the bit rate of the audio signal.

その１つに、時系列的に隣接するPCM符号の差分を形
成する差分PCM符号化方式がある。この差分PCM符号化方
式は、音声波形の相関性に基づく冗長性を利用したもの
であり、隣接したサンプル間の値の変化が、多くの場合
ダイナミック・レンジの限られた範囲に含まれることか
ら、１サンプルあたりのビット数を低減することができ
る。この差分PCM符号化方式をさらに進めた適応差分PCM
符号化方式の１つである、CCITT（国際電信電話諮問委
員会）勧告による適応差分PCM方式では、32Kbpsのビッ
トレートを実現している。One of them is a differential PCM coding method for forming a difference between PCM codes adjacent in time series. This differential PCM coding method uses redundancy based on the correlation of the audio waveform, and since the value change between adjacent samples is often included in a limited dynamic range. The number of bits per sample can be reduced. Adaptive differential PCM which further advanced this differential PCM coding method
The adaptive differential PCM system, which is one of the encoding systems and recommended by CCITT (International Telegraph and Telephone Consultative Committee), achieves a bit rate of 32 Kbps.

この他には、音声信号の非定在性と線形予測可能性を
利用したAPC−AB（Adaptive Prediction Coding with A
daptive Bit Allocation）方式、または、音声分析合成
手法によるLSP（Line Spectrum Pair）方式などがあ
る。In addition, APC-AB (Adaptive Prediction Coding with APC-AB) using non-locality and linear predictability of speech signals
daptive Bit Allocation) method or LSP (Line Spectrum Pair) method using a voice analysis / synthesis method.

しかしながら、このような適応差分PCM方式、APC−AB
方式およびLSP方式は、符号化および復号化の処理が非
常に複雑であり、それらを実現するための装置は非常に
高価なものとなるという不都合がある。However, such an adaptive difference PCM method, APC-AB
The scheme and the LSP scheme have a disadvantage that encoding and decoding processes are very complicated, and a device for realizing them is very expensive.

一方、放送衛星用の高品位なPCM音声伝送方式の１つ
に準瞬時圧伸方式がある。この準瞬時圧伸方式は、PCM
符号化された音声データを時系列に所定数ごとのブロッ
クに分割し、おのおののブロックにおける信号絶対値の
最大値に対応した最上位桁をあらわすスケールデータを
識別し、その最上位桁を含む所定ビット数のデータを符
号データに整形するものであり、比較的符号化処理が簡
単で、かつ、容易に１サンプルのビット数を低減するこ
とが可能である。しかしながら、このような準瞬時圧伸
方式は、効率的には充分なものではない。On the other hand, there is a quasi-instantaneous companding method as one of high-quality PCM audio transmission methods for broadcasting satellites. This semi-instantaneous companding method uses PCM
The coded audio data is divided into blocks of a predetermined number in a time series, and scale data representing the most significant digit corresponding to the maximum value of the signal absolute value in each block is identified. The bit number data is shaped into code data, and the encoding process is relatively simple, and the bit number of one sample can be easily reduced. However, such a quasi-instantaneous drawing method is not efficient enough.

そこで、この準瞬時圧伸方式の効率を改善する手法と
して「差分PCM方式と準瞬時圧伸との結合」が考えられ
るが、一般に単に準瞬時圧伸を差分PCM方式に適用した
だけでは、圧縮時の欠落ビットが伝送誤差を生じ、受信
側の積分器で誤差が累積して受信不能となる。Therefore, as a method of improving the efficiency of the quasi-instantaneous companding method, a `` combination of the differential PCM method and the quasi-instantaneous companding method '' can be considered. The missing bit at the time causes a transmission error, and the error is accumulated in the integrator on the receiving side to make reception impossible.

次にこの点について説明する。 Next, this point will be described.

いま準瞬時圧伸を差分PCM方式に適用した符号化方式
によって、第11図（ａ）に示したような音声信号を符号
化することを考える。まず、差分PCM符号化のためにこ
の音声信号を例えば8KHzのサンプリング周波数でサンプ
リングしてサンプル間の差分値を形成する。ここでは、
隣接するサンプル間の差分値を符号付の８ビットのデー
タすなわち２の補数表現の８ビットデータであらわす。
そして、準瞬時圧伸のための条件は、８サンプルで１ブ
ロックを構成し、伝送データは１サンプルあたり３ビッ
トとする。また、スケールデータは３ビットである。Now, consider encoding a speech signal as shown in FIG. 11 (a) by an encoding method in which quasi-instantaneous companding is applied to the differential PCM method. First, the audio signal is sampled at a sampling frequency of, for example, 8 KHz for differential PCM encoding to form a difference value between samples. here,
The difference value between adjacent samples is represented by signed 8-bit data, that is, 8-bit data in 2's complement representation.
The condition for the quasi-instantaneous companding is that one block is composed of eight samples, and the transmission data is three bits per sample. The scale data is 3 bits.

ここで、８つのサンプル♯１〜♯８における差分値が
第12図（ａ）のように得られたものとする。このブロッ
ク内において差分値の絶対値が最大になるものは、サン
プル♯１であり、したがって、このときのスケール位置
POSは、このサンプル♯１のビットパタンの最上位桁で
あるビット４となり、スケール位置POSの値は（100）_２
になる。Here, it is assumed that the difference values of the eight samples # 1 to # 8 are obtained as shown in FIG. In this block, the sample having the largest absolute value of the difference value is the sample ♯1, and therefore, the scale position at this time.
POS is bit 4 which is the most significant digit of the bit pattern of sample # 1, and the value of scale position POS is (100) ₂
become.

これにより、各サンプルの伝送ビット（伝送データ；
すなわち符号データ）は、このスケール位置POSよりも
１つ上位桁で、符号値をあらわすビット５（サインビッ
ト）から３ビットのデータ、すなわち、ビット5,4,3の
データとなる。Thereby, the transmission bit (transmission data;
That is, the code data) is data of three bits from bit 5 (sign bit) representing the code value, that is, data of bits 5, 4, and 3, which is one digit higher than the scale position POS.

したがって、このブロックでは、最初にスケール位置
POSを、それに続いてサンプル♯１〜♯８の伝送ビット
を順次連続して配置して構成した伝送データ（符号デー
タ）は、同図（ｂ）に示したようなものとなる。Therefore, in this block, first the scale position
The transmission data (code data) configured by POS followed by the transmission bits of samples # 1 to # 8 sequentially and sequentially arranged is as shown in FIG.

このような符号データを復号するとき、まず、１ブロ
ック分の符号データを３ビットずつに分解し、その最初
の３ビットの値からスケール位置POSを識別する。そし
て、後続する３ビットの符号データを８ビットのデータ
に伸張するときには、スケール位置POSよりも１つ上位
桁に符号データのMSB（最上位ビット）を一致させ、そ
のMSBよりも上位桁の各桁には符号ビットの値をセット
し、そのLSB（最下位ビット）よりも下位桁の各桁には
０をセットする。When decoding such code data, first, the code data for one block is decomposed into 3 bits, and the scale position POS is identified from the value of the first 3 bits. Then, when expanding the following 3-bit code data to 8-bit data, the MSB (most significant bit) of the code data is made to coincide with the one digit higher than the scale position POS, and each of the digits higher than the MSB is set. The value of the sign bit is set in the digit, and 0 is set in each digit lower than the LSB (least significant bit).

これにより、同図（ｃ）に示したデータが、復号後の
データとして得られる。このデータと、符号化前のデー
タとを比較すると、復号データでは伝送ビットよりも下
位桁の情報（直流成分）が欠落していることがわかる
（第11図（ｂ）参照）。すなわち、情報に欠落ビットを
生じている。As a result, the data shown in FIG. 3C is obtained as decoded data. When this data is compared with the data before encoding, it is found that information (DC component) of lower digits than the transmission bit is missing in the decoded data (see FIG. 11 (b)). That is, missing bits are generated in the information.

このような復号データに基づいて音声信号を再生する
と、第11図（ｃ）に一点鎖線で示したように、欠落ビッ
ト分の誤差が蓄積して負の直流シフトを生じ、同図に破
線で示した元の波形よりも右下がりの波形となり、その
結果、適正に情報を伝送することができない。When an audio signal is reproduced based on such decoded data, as shown by a dashed line in FIG. 11 (c), errors corresponding to missing bits are accumulated and a negative DC shift is generated. The waveform becomes lower right than the original waveform shown, and as a result, information cannot be transmitted properly.

このような問題を解決する１つの方法として、「欠落
ビットのアキュムレーションによる差分圧伸PCM（DC−P
CM）」（高橋ほか、電子通信学会論文誌'84/10 Vol.J67
−B No.10）が提案されている。One solution to such a problem is to use a differential companding PCM (DC-P
CM) ”(Takahashi et al., IEICE Transactions '84 / 10 Vol.J67
−B No. 10) has been proposed.

しかしながら、この方法は15ビット程度の差分データ
を８ビット程度に圧縮する場合に有効であり、８ビット
の差分データを３ビット程度に圧縮するような低ビット
レートの符号化方式には適用できない。However, this method is effective when the difference data of about 15 bits is compressed to about 8 bits, and cannot be applied to a low bit rate coding method in which the difference data of 8 bits is compressed to about 3 bits.

すなわち、このような低ビットレートの場合にはブロ
ック間において音声波形の振幅が大幅に変化したときな
どブロック間でスケール位置が大幅に変動することがあ
り、そのためにアキュムレーションされている誤差信号
の方が伝送すべき有効なデータよりも大きな値なること
がある。かかる場合には、伝送すべきデータが誤差信号
に埋もれてしまい、適正なデータ伝送を実現できない。In other words, in the case of such a low bit rate, the scale position may fluctuate greatly between blocks, for example, when the amplitude of the audio waveform greatly changes between blocks, and therefore, the accumulated error signal May be larger than the valid data to be transmitted. In such a case, data to be transmitted is buried in the error signal, and proper data transmission cannot be realized.

［目的］本発明は、上述した従来技術の不都合を解決するため
になされたものであり、低ビットレートで、簡単な処理
により、高品質な音声を再現できる音声符号化方法を提
供することを目的としている。[Purpose] The present invention has been made in order to solve the above-described disadvantages of the related art, and has as its object to provide a speech encoding method capable of reproducing high-quality speech with a low bit rate and simple processing. The purpose is.

［構成］本発明は、PCM符号化された音声データのうち隣接す
るサンプル間で直前サンプルとの差分値を形成してその
差分値を時系列に所定数ごとのブロックに分割し、おの
おののブロックにおける差分値の絶対値の最大値に対応
した最上位桁をあらわすスケールデータを識別し、その
最上位桁を含む所定ビット数のデータを符号データに整
形して上記音声データを上記ブロック単位に圧縮する音
声符号化方法において、上記差分値の最下位ビットに１
を加算した第１の一時差分値と、上記差分値の最下位ビ
ットから１を減算した第２の一時差分値を算出し、上記
差分値に基づく第１の一時符号データと、上記第１の一
時差分値に基づく第２の一時符号データと、上記第２の
一時差分値に基づく第３の一時符号データを算出し、上
記第１の一時符号データに基づく第１の一時復号値と、
上記第２の一時符号データに基づく第２の一時復号値
と、上記第３の一時符号データに基づく第３の一時復号
値を算出し、上記第１の一時復号値と、上記第２の一時
復号値と、上記第３の一時復号値のうち、当該サンプル
における原信号の値に最も近い値をとる一時復号値を判
定し、その判定した一時復号値に対応した符号データ
を、当該サンプルにおける符号データとして選択すると
ともに、後続するブロックの最初のサンプルの差分値を
算出するときには当該ブロックの最後のサンプルの復号
値を基準とするようにしたものである。また、前記差分
値は、その絶対値の最大値を制限するようにするとよ
い。[Configuration] The present invention forms a difference value between adjacent samples in PCM-encoded audio data, and divides the difference value into a predetermined number of blocks in a time-series manner. Identifies the scale data representing the most significant digit corresponding to the maximum value of the absolute value of the difference value in the above, shapes the data of a predetermined number of bits including the most significant digit into code data, and compresses the audio data in the block unit The least significant bit of the difference value is 1
Is calculated, a second temporary difference value obtained by subtracting 1 from the least significant bit of the difference value is calculated, and first temporary code data based on the difference value is calculated, Calculating second temporary code data based on the temporary difference value and third temporary code data based on the second temporary difference value, and calculating a first temporary decoded value based on the first temporary code data;
A second temporary decoded value based on the second temporary code data and a third temporary decoded value based on the third temporary code data are calculated, and the first temporary decoded value and the second temporary decoded value are calculated. A decoded value and a temporary decoded value that takes a value closest to the value of the original signal in the sample among the third temporary decoded values are determined, and code data corresponding to the determined temporary decoded value is determined in the sample. In addition to the selection as the code data, the difference value of the first sample of the subsequent block is calculated based on the decoded value of the last sample of the block. Further, the difference value may limit a maximum value of the absolute value.

また、PCM符号化された音声データのうち隣接するサ
ンプル間で直前サンプルとの差分値を形成してその差分
値を時系列に所定数ごとのブロックに分割し、おのおの
のブロックにおける差分値の絶対値の最大値に対応した
最上位桁をあらわすスケールデータを識別し、その最上
位桁を含む所定ビット数のデータを符号データに整形し
て上記音声データを上記ブロック単位に圧縮する音声符
号化方法において、上記最上位桁およびこの最上位桁に
それぞれ隣接する桁に対応した複数のスケールデータに
それぞれ基づいて上記符号データを整形した複数の符号
データ系列を形成するとともに、上記複数の符号データ
系列について、それぞれ、上記差分値の最下位ビットに
１を加算した第１の一時差分値と、上記差分値の最下位
ビットから１を減算した第２の一時差分値を算出し、上
記差分値に基づく第１の一時符号データと、上記第１の
一時差分値に基づく第２の一時符号データと、上記第２
の一時差分値に基づく第３の一時符号データを算出し、
上記第１の一時符号データに基づく第１の一時復号値
と、上記第２の一時符号データに基づく第２の一時復号
値と、上記第３の一時符号データに基づく第３の一時復
号値を算出し、上記第１の一時復号値と、上記第２の一
時複号値と、上記第３の一時復号値のうち、当該サンプ
ルにおける原信号の値に最も近い値をとる一時復号値を
判定し、その判定した一時復号値に対応した符号データ
を、当該サンプルにおける符号データとして選択するよ
うに、上記複数の符号データ系列を補正するとともに、
それらの補正された符号データ系列のうちブロック内誤
差が最も小さくなる符号データ系列を当該ブロックの符
号データ系列として選択し、後続するブロックの最初の
サンプルの差分値を算出するときには当該ブロックで選
択した符号データ系列の最後のサンプルの復号値を基準
とするようにしたものである。Also, a difference value between adjacent samples of the PCM-encoded audio data is formed with respect to the immediately preceding sample, and the difference value is divided in time series into blocks of a predetermined number, and the absolute value of the difference value in each block is calculated. An audio encoding method for identifying scale data representing the most significant digit corresponding to the maximum value, shaping data of a predetermined number of bits including the most significant digit into code data, and compressing the audio data in block units And forming a plurality of code data sequences obtained by shaping the code data based on the most significant digit and a plurality of scale data respectively corresponding to the digits adjacent to the most significant digit, and forming the plurality of code data sequences. And a first temporary difference value obtained by adding 1 to the least significant bit of the difference value, and 1 from the least significant bit of the difference value. Calculating a second temporary difference value, the first temporary code data based on the difference value, and a second temporary code data based on the first temporary difference value, the second
Calculates third temporary code data based on the temporary difference value of
A first temporary decoded value based on the first temporary code data, a second temporary decoded value based on the second temporary code data, and a third temporary decoded value based on the third temporary code data And calculating a temporary decoded value that takes a value closest to the value of the original signal in the sample among the first temporary decoded value, the second temporary decoded value, and the third temporary decoded value. And correcting the plurality of code data sequences so as to select code data corresponding to the determined temporary decoded value as code data in the sample,
The code data sequence having the smallest error in the block is selected as the code data sequence of the block from among the corrected code data sequences, and is selected in the block when calculating the difference value of the first sample of the subsequent block. This is based on the decoded value of the last sample of the encoded data sequence.

また、前記ブロック内誤差は、おのおのの符号データ
系列のブロック内における誤差パワーを用いることがで
きる。また、前記ブロック内誤差は、おのおのの符号デ
ータ系列のブロック内の各サンプルにおける誤差の絶対
値の総和を用いることができる。また、前記差分値は、
その絶対値の最大値を制限するとよい。Further, the error power within each block of the code data sequence can be used as the intra-block error. Further, as the intra-block error, a sum of absolute values of errors in each sample in each block of the code data sequence can be used. Further, the difference value is:
It is advisable to limit the maximum value of the absolute value.

以下、添付図面を参照しながら、本発明の実施例を詳
細に説明する。Hereinafter, embodiments of the present invention will be described in detail with reference to the accompanying drawings.

第１図は、本発明の一実施例にかかる音声符号化装置
を示している。この実施例は、差分PCM符号化方式に準
瞬時圧伸を適用したものであり、準瞬時圧伸を施したさ
いにもとまるブロック化した圧縮差分データを順次復号
し、原信号と比較することで圧縮ビット数内で誤差の少
ない差分データとなるようにサンプル点毎に補正してい
る。またこの実施例では、8KHzのサンプリング周波数で
音声信号をサンプリングしてサンプル間の差分値を２の
補数表現の８ビットデータであらわし、８サンプルで準
瞬時圧伸のための１ブロックを構成し、伝送データは１
サンプルあたり３ビット、および、スケールデータは３
ビットとする。FIG. 1 shows a speech encoding apparatus according to one embodiment of the present invention. In this embodiment, the quasi-instantaneous companding is applied to the differential PCM coding method, and the block-compressed differential data which is obtained when the quasi-instantaneous companding is performed is sequentially decoded and compared with the original signal. Is corrected for each sample point so as to obtain differential data having a small error within the number of compressed bits. Further, in this embodiment, an audio signal is sampled at a sampling frequency of 8 KHz, and a difference value between samples is represented by 8-bit data in a two's complement representation, and one block for quasi-instantaneous companding is constituted by eight samples. Transmission data is 1
3 bits per sample and 3 scale data
Bit.

同図において、入力音声信号SSは、ローパスフィルタ
１によって帯域制限されたのちアナログ／デジタル変換
器２に加えられて８ビットのデジタル信号DSに変換され
る。このアナログ／デジタル変換器２は、サンプリング
周波数が8KHzで直線量子化するものである。In FIG. 1, an input audio signal SS is band-limited by a low-pass filter 1 and then applied to an analog / digital converter 2 to be converted into an 8-bit digital signal DS. This analog / digital converter 2 performs linear quantization at a sampling frequency of 8 KHz.

デジタル信号DSは、１ブロックをなす８サンプル分の
記憶容量をもつバッファ３に蓄積され、このバッファ３
に蓄積されたデジタル信号DSdは、差分データを形成す
るための加減算器４のプラス入力端に加えられている。The digital signal DS is stored in a buffer 3 having a storage capacity of 8 samples forming one block.
Is added to the plus input terminal of an adder / subtracter 4 for forming difference data.

この加減算器４から出力される９ビットの差分データ
DDsは、最大値制限回路５によって８ビットの差分デー
タDDに変換される。このようにして差分データの最大値
を制限したのは、次のような理由による。9-bit difference data output from the adder / subtracter 4
DDs is converted into 8-bit difference data DD by the maximum value limiting circuit 5. The reason for limiting the maximum value of the difference data in this way is as follows.

すなわち、差分データDDsをそのまま準瞬時圧縮した
場合、突発的に大きな差分データを含むブロックにおい
てはそれに対応した大きなスケール値が設定され、その
ために他の小さな差分データに対する準瞬時圧縮後のデ
ータの追従性が悪化する。その結果、復元した音声信号
が、聴覚上、ギクシャクした感じを与える。そこで、こ
のように差分データの最大値をある程度に制限すること
により、このような聴覚上の問題を解決することができ
る。In other words, when the difference data DDs are directly compressed in the quasi-instant manner, a large scale value corresponding to the block containing the large difference data is suddenly set, and therefore, the data following the quasi-instantaneous compression follows other small difference data. The sex worsens. As a result, the restored audio signal gives an auditory jerky feeling. Therefore, by limiting the maximum value of the difference data to a certain value, such an auditory problem can be solved.

この差分データDDは、８サンプル分の記憶容量をもつ
バッファ６、準瞬時圧縮のためのスケール値を設定する
ためのスケール値設定部７および加算器８の一入力端に
加えられている。The difference data DD is applied to a buffer 6 having a storage capacity of 8 samples, a scale value setting unit 7 for setting a scale value for quasi-instantaneous compression, and one input terminal of an adder 8.

この加算器８の出力はレジスタ９に加えられ、このレ
ジスタ９の出力は加減算器４のマイナス入力端、加算器
８の他入力端、および、８サンプル分の記憶容量をもつ
バッファ10に加えられている。The output of the adder 8 is applied to a register 9. The output of the register 9 is applied to a minus input terminal of the adder / subtractor 4, another input terminal of the adder 8, and a buffer 10 having a storage capacity of 8 samples. ing.

このようにして、最大値制限回路５によって８ビット
に制限された差分データDDの積算値が加算器８によって
形成され、このデータが差分データDDsを形成するため
の直前のサンプルのデータとして用いられている。In this way, the integrated value of the difference data DD limited to 8 bits by the maximum value limiting circuit 5 is formed by the adder 8, and this data is used as the data of the immediately preceding sample for forming the difference data DDs. ing.

バッファ６の記憶データは、サンプル毎に準瞬時圧縮
符号化する準瞬時圧縮部11に加えられている。The data stored in the buffer 6 is added to a quasi-instantaneous compression unit 11 that performs quasi-instantaneous compression encoding for each sample.

スケール値設定部７は、最大値制限回路５から出力さ
れる差分データDDの連続する８サンプルのうち、最も絶
対値の大きいものを識別し、そのビットパタンの最上位
桁を判別してそのビット位置を３ビットのスケールデー
タDKで出力する。このスケールデータDKは、準瞬時圧縮
部11、準瞬時圧縮部11から出力される圧縮差分データDC
を最適なデータに変換するための最適化処理部12、１ブ
ロック分のデータを所定の信号形式に整形するためのマ
ルチプレクサ13の一入力端、および、最適化された圧縮
差分データを伸長するための準瞬時伸長部14に加えられ
ている。The scale value setting unit 7 identifies the one having the largest absolute value among the eight consecutive samples of the difference data DD output from the maximum value limiting circuit 5, determines the most significant digit of the bit pattern, and determines the bit. The position is output as 3-bit scale data DK. The scale data DK is the quasi-instantaneous compression unit 11 and the compressed difference data DC output from the quasi-instantaneous compression unit 11.
, An input terminal of a multiplexer 13 for shaping data of one block into a predetermined signal format, and for decompressing the optimized compressed difference data. Are added to the quasi-instantaneous extension unit 14 of the first embodiment.

またバッファ10の記憶データは、レジスタ９から出力
される差分データDDを積算して形成された復号データで
あり、符号化の対象である元の音声信号（以下原信号と
いう）として最適化処理部12に加えられている。The data stored in the buffer 10 is decoded data formed by integrating the difference data DD output from the register 9, and is used as an original audio signal to be encoded (hereinafter referred to as an original signal) by an optimization processing unit. Added to 12

準瞬時圧縮部11は、バッファ６から加えられる８ビッ
トの差分データDDについて、おのおののサンプル毎に、
スケール値設定部７から加えられるスケールデータDKが
あらわすスケール位置よりも１ビット上位桁をMSBとす
る３ビットのデータを抜きだしてそれを圧縮差分データ
DCとして最適化処理部12に出力する。The quasi-instantaneous compression unit 11 calculates the difference data DD of 8 bits added from the buffer 6 for each sample.
Extracts 3-bit data with the MSB as the first digit higher than the scale position represented by the scale data DK added from the scale value setting unit 7 and compresses it.
Output to the optimization processing unit 12 as DC.

最適化処理部12は、準瞬時圧縮部11から加えられる１
ブロック分の圧縮差分データDCを順次復号し、バッファ
10から加えられる原信号と比較することにより、圧縮ビ
ット数内で誤差の少ない差分データとなるようにサンプ
ル点毎に圧縮差分データDCを補正し、これを最適化差分
データDCoとしてマルチプレクサ13の他入力端および準
瞬時伸長部14に加えている。The optimization processing unit 12 is configured to add 1
The compressed difference data DC for the block is sequentially decoded and buffered.
By comparing with the original signal added from 10, the compressed difference data DC is corrected for each sample point so that the difference data has a small error within the number of compressed bits, and the corrected difference data DCo is used as the optimized difference data DCo. It is added to the input terminal and the quasi-instantaneous extension unit 14.

マルチプレクサ13は、第２図に示したように、スケー
ル値設定部７から出力されたスケールデータDKを先頭に
配置し、それに続いて各サンプルの最適化差分データDC
oを順次配置して構成した信号を、１ブロック分の符号
化データDLとして形成して次段装置（例えばデータ伝送
装置、あるいは、データ記憶装置等）に出力する。As shown in FIG. 2, the multiplexer 13 arranges the scale data DK output from the scale value setting unit 7 at the head, and subsequently places the optimized difference data DC of each sample.
The signal constituted by sequentially arranging o is formed as encoded data DL for one block, and is output to the next-stage device (for example, a data transmission device or a data storage device).

準瞬時伸長部14は、スケール値設定部７から加えられ
るスケールデータDKが示している桁よりも１つ上位桁に
MSBが一致するように、最適化処理部12から出力される
３ビットの最適化差分データDCoを配置するとともに、
それよりも上位桁には最適化差分データDCoの符号デー
タの値を配置し、下位桁には０を配置して８ビットの復
号データDEを形成するものであり、その復号データDEは
積分部15に加えられる。The quasi-instantaneous decompression unit 14 places one digit higher than the digit indicated by the scale data DK added from the scale value setting unit 7.
The 3-bit optimized difference data DCo output from the optimization processing unit 12 is arranged so that the MSBs match,
The value of the code data of the optimized difference data DCo is arranged in the upper digit, and 0 is arranged in the lower digit to form 8-bit decoded data DE. Added to fifteen.

積分値15は、準瞬時伸長部14から出力される復号デー
タDEを積算し、符号化データDLを実際に復号化処理して
復元したときの復元データSDを形成し、この復元データ
SDをレジスタ９に出力している。レジスタ９では、この
復元データSDを、１つのブロックの処理を終了して次の
ブロックの処理を開始する直前で取り込む。The integrated value 15 is obtained by multiplying the decoded data DE output from the quasi-instantaneous decompression unit 14 and forming the decoded data SD when the encoded data DL is actually decoded and restored.
SD is output to the register 9. The register 9 captures the restored data SD immediately before the processing of one block is completed and the processing of the next block is started.

これにより、準瞬時圧伸に特有な欠落ビットによる誤
差の累積を、次のブロックの最初のサンプルデータを形
成するときに解消することができ、その結果、より正確
な符号化データDLを形成することができる。As a result, the accumulation of errors due to missing bits specific to quasi-instantaneous companding can be eliminated when forming the first sample data of the next block, thereby forming more accurate encoded data DL. be able to.

このようにして、本実施例では、最適化処理部12によ
って元の音声信号により追従するように圧縮差分データ
DCを補正するとともに、積分部15で形成した復元データ
SDによってブロック内の累積誤差を次のブロックに反映
させているので、準瞬時圧伸による低ビットレートの音
声符号化処理をより精度よく実現することができる。As described above, in the present embodiment, the compressed difference data is controlled by the optimization processing unit 12 so as to follow the original audio signal.
In addition to correcting DC, the restored data formed by the integration unit 15
Since the accumulated error in the block is reflected in the next block by SD, it is possible to more accurately realize a low bit rate speech encoding process by quasi-instantaneous companding.

次に、最適化処理部12が実行する圧縮差分データの最
適化処理について説明する。Next, the optimization processing of the compressed difference data executed by the optimization processing unit 12 will be described.

いま、第３図（ａ）に示したような音声波形を符号化
することを考える。サンプル♯０を基準としてサンプル
♯1,♯2,♯３の差分をそれぞれ８ビットで形成したと
き、この場合の差分の絶対値はサンプル♯１が最大にな
る。したがって、このときの３ビットの圧縮差分データ
はこのサンプル♯１を基準に形成され、スケール値はそ
のビットパタンの最上位桁の桁位置となる。Now, consider encoding a speech waveform as shown in FIG. 3 (a). When each of the differences of samples # 1, # 2, and # 3 is formed by 8 bits with reference to sample # 0, sample # 1 has the largest absolute value of the difference in this case. Therefore, the 3-bit compressed difference data at this time is formed based on this sample # 1, and the scale value is the digit position of the most significant digit of the bit pattern.

さて、圧縮差分データを３ビットであらわした場合、
表現できるデータはスケール値よりも１ビット下位桁の
量子化幅のものであり、したがって、おのおののサンプ
ル♯1,♯2,♯３の値は、この量子化幅で表現可能なデー
タに置換される。例えば、サンプル♯１の圧縮差分デー
タはこの表現できるデータのうち実際の値P11よりも下
の値P12（＝（010）₂;ただしこの場合LSBはスケール位
置よりも１ビット下位桁。以下同じ）になる。Now, when the compressed difference data is represented by 3 bits,
The data that can be represented has a quantization width that is one bit lower than the scale value. Therefore, the value of each sample # 1, # 2, and # 3 is replaced with data that can be represented by this quantization width. You. For example, the compressed difference data of the sample # 1 is a value P12 (= (010) ₂ ; lower than the actual value P11 of the data that can be expressed, where LSB is one bit lower digit than the scale position; the same applies hereinafter). become.

ところで、この量子化幅で表現できるデータのうち、
P12よりも１つ大きな値P13（＝（011）_２）に対応した
データの方がよりサンプル♯１の実際の値P11に近い。
したがって、この値P13をサンプル♯１の圧縮差分デー
タとすれば、復号化したときの音声信号（復号値）の誤
差を小さくすることができる。すなわち、このときの復
号値の誤差は、最大でもこの圧縮差分データの量子化幅
の1/2に抑えることができる。By the way, of the data that can be expressed by this quantization width,
Data corresponding to the value P13 (= (011) ₂ ) which is one larger than P12 is closer to the actual value P11 of the sample # 1.
Therefore, if this value P13 is used as the compressed difference data of sample # 1, the error of the decoded audio signal (decoded value) can be reduced. That is, the error of the decoded value at this time can be suppressed to a half of the quantization width of the compressed difference data at the maximum.

同様に、サンプル♯2,♯３について考えると、その復
号値が符号化前の信号の値（サンプル♯２では値P21、
サンプル♯３では値P31）に最も近くなる圧縮差分デー
タを選択すればよい。Similarly, when considering samples # 2 and # 3, the decoded value is the value of the signal before encoding (the value P21,
In sample # 3, the compressed difference data closest to the value P31) may be selected.

すなわち、この場合、サンプル♯２については、値P2
1よりも小さい値P22に基づいた復号値に較べて値P21よ
りも大きい値P23に基づいた復号値の方がより値P21に近
いので、サンプル♯１の復号値である値P13と値P23との
差分（＝（110）_２）を圧縮差分データに設定する。ま
たサンプル♯３については、値P31が圧縮差分データで
あらわし得る復号値に一致するので、サンプル♯２の復
号値である値P23と値P31との差分（＝（001）_２）を圧
縮差分データに設定する。That is, in this case, for sample # 2, the value P2
Since the decoded value based on the value P23 that is larger than the value P21 is closer to the value P21 than the decoded value based on the value P22 that is smaller than 1, the decoded values of the sample # 1 that are the values P13 and P23 are closer to the value P21. (= (110) ₂ ) is set as the compressed difference data. For sample # 3, since the value P31 matches the decoded value that can be represented by the compressed difference data, the difference (= (001) ₂ ) between the value P23, which is the decoded value of sample # 2, and the value P31 is represented by the compressed difference data. Set to.

このようにして、元の音声信号に対する追従性が向上
した圧縮差分データを形成することができる。そのため
の処理である最適化差分ビットルーチンの一例を第４図
（ａ），（ｂ）に示す。In this way, it is possible to form compressed difference data with improved tracking of the original audio signal. FIGS. 4 (a) and 4 (b) show an example of an optimized difference bit routine which is a process for that.

まず、準瞬時圧縮部11から圧縮差分データｄ（DC）を
入力し（処理101）、その圧縮差分データｄの値が、圧
縮ビット数（この場合は３ビット）であらわすことので
きる正の最大値MAX（＝（011）_２）より大きいか、ある
いは、負の最大値MIN（＝（100）_２）より小さいかを判
別し（判断102,103）、判断102の結果がYESになるとき
には圧縮差分データｄに値MAXを代入し（処理104）、判
断103の結果がYESになるときに圧縮差分データｄに値MI
Nを代入する（処理105）。First, the compressed difference data d (DC) is input from the quasi-instantaneous compression unit 11 (process 101), and the value of the compressed difference data d is a positive maximum that can be represented by the number of compressed bits (3 bits in this case). It is determined whether the value is larger than the value MAX (= (011) ₂ ) or smaller than the negative maximum value MIN (= (100) ₂ ) (decisions 102 and 103). If the result of the decision 102 is YES, the compressed difference data is determined. The value MAX is substituted for d (process 104), and when the result of the judgment 103 becomes YES, the value MI is added to the compressed difference data d.
N is substituted (process 105).

次に、圧縮差分データｄよりもLSB（＝（001）_２）だ
け小さな値dmと圧縮差分データｄよりもLSBだけ大きな
値dpを形成し（処理106,107）、値dmが値MINよりも小さ
くなった場合には値dmに値MINを代入し（判断108、処理
109）、値dpが値MAXよりも大きくなった場合には値dpに
値MAXを代入する（判断110、処理111）。Next, a value dm smaller by LSB (= (001) ₂ ) than the compressed difference data d and a value dp larger by LSB than the compressed difference data d are formed (processing 106 and 107), and the value dm becomes smaller than the value MIN. , The value MIN is substituted for the value dm (decision 108, processing
109) If the value dp is larger than the value MAX, the value MAX is substituted for the value dp (decision 110, processing 111).

このようにして値dp,dmを形成すると、スケールデー
タDKに基づいて値d,dp,dmを準瞬時伸長したときの８ビ
ットの値dd,ddp,ddmを算出する（処理112）。そして、
この値dd,ddp,ddmにそれぞれ１サンプル前のデータの復
号値da₀を加算し、おのおのの値d,dp,dmに対応したロー
カル復号値da,dap,damを形成する（処理113）。なお処
理112および113でローカル復号処理を実現しており、そ
のために、直前のサンプルの復号値da₀を記憶してお
く。When the values dp, dm are formed in this way, 8-bit values dd, ddp, ddm when the values d, dp, dm are quasi-instantaneously expanded based on the scale data DK are calculated (process 112). And
The decoded values da ₀ of the data one sample before are added to the values dd, ddp, ddm, respectively, to form local decoded values da, dap, dam corresponding to the respective values d, dp, dm (process 113). Note has realized local decoding processing in the processing 112 and 113, in order that stores the decoded value da ₀ of the previous sample.

次に、当該サンプルに対応した原信号の値daiをバッ
ファ10から読み込み、この原信号の値daiと、おのおの
のローカル復号値da,dap,damとの差の絶対値Da,Dp,Dmを
算出し（処理114）、原信号の値daiがローカル復号値da
よりも大きいかどうかを調べる（判断115）。Next, the value dai of the original signal corresponding to the sample is read from the buffer 10, and the absolute values Da, Dp, Dm of the difference between the value dai of the original signal and each of the local decoded values da, dap, dam are calculated. (Process 114), the value dai of the original signal is
It is checked whether it is greater than (decision 115).

この判断115の結果がYESになるときには、絶対値Daが
絶対値Dpよりも大きいかどうかを調べ（判断116）、こ
の判断116の結果がYESになるときには、値ｄに値dpを代
入し（処理117）処理106へと戻り、判断116の結果がNO
になるときには、値ｄを最適化差分データDCoとして出
力する（処理118）。When the result of this determination 115 is YES, it is checked whether or not the absolute value Da is greater than the absolute value Dp (determination 116). When the result of this determination 116 is YES, the value dp is substituted for the value d ( Process 117) Return to process 106, and the result of judgment 116 is NO
, The value d is output as the optimized difference data DCo (process 118).

また判断115の結果がNOになるときには、絶対値Daが
絶対値Dpよりも大きいかどうかを調べ（判断119）、こ
の判断119の結果がYESになるときには、値ｄに値dmを代
入し（処理120）処理106へと戻り、判断119の結果がNO
になるときには処理118を実行する。When the result of the determination 115 is NO, it is checked whether or not the absolute value Da is greater than the absolute value Dp (determination 119). When the result of the determination 119 is YES, the value dm is substituted for the value d ( Process 120) Return to process 106, and the result of judgment 119 is NO
When it becomes, the process 118 is executed.

すなわち、第３図（ｂ）の左側に示したように、原信
号の値daiが復号値daよりも復号値dapに近いときには、
圧縮差分データｄをLSBだけ大きい値dpに更新して、そ
の値を最適化差分データDCoとして出力する。また、そ
の反対に原信号の値daiが復号値daよりも復号値damに近
いときには、圧縮差分データｄをLSBだけ小さい値dmに
更新して、その値を最適化差分データDCoとして出力す
る。なお、１回の処理で最適な値が得られなかった場合
には、この処理を繰り返し実行する。That is, as shown on the left side of FIG. 3B, when the value dai of the original signal is closer to the decoded value dap than the decoded value da,
The compressed difference data d is updated to a value dp larger by LSB, and the value is output as optimized difference data DCo. Conversely, when the value dai of the original signal is closer to the decoded value dam than the decoded value da, the compressed difference data d is updated to a value dm smaller by LSB, and the value is output as the optimized difference data DCo. If an optimum value cannot be obtained in one process, this process is repeatedly executed.

このようにして、準瞬時圧伸で得られた圧縮差分デー
タに圧縮ビットのLSBを加算、減算する操作をサンプル
点毎に繰り返し行ない、復号値と原信号との誤差を少な
くするように差分データを補正している。In this way, the operation of adding and subtracting the LSB of the compressed bit to and from the compressed difference data obtained by the quasi-instantaneous companding is repeated for each sample point, so that the difference between the decoded value and the original signal is reduced. Has been corrected.

なお、バッファ10に原信号として記憶するデータとし
ては、最大値制限回路５に入力される前のバッファ３の
出力データを用いることもできる。The data stored in the buffer 10 as an original signal may be output data of the buffer 3 before being input to the maximum value limiting circuit 5.

したがって、例えば第５図（ａ）に示したような音声
信号（第11図（ａ）と同じ）をこのような最適化差分ビ
ットルーチンを用いた準瞬時圧縮によって符号化するこ
とを考える。Therefore, for example, consider encoding a speech signal as shown in FIG. 5 (a) (same as FIG. 11 (a)) by quasi-instantaneous compression using such an optimized difference bit routine.

まず、この音声信号の８つのサンプル♯１〜♯８にお
ける差分値が第６図（ａ）のように得られ、したがっ
て、このときのスケール位置POSは、サンプル♯１のビ
ットパタンの最上位桁であるビット４となり、スケール
位置POSの値は（100）_２になる。これにより、準瞬時圧
縮部11によって形成される圧縮差分データDCは、おのお
のサンプル♯１〜♯８について、それぞれ第６図（ｂ）
に示したものとなる。First, the difference values of the eight samples # 1 to # 8 of the audio signal are obtained as shown in FIG. 6 (a). Therefore, the scale position POS at this time is the most significant digit of the bit pattern of the sample # 1. And the value of the scale position POS is (100) ₂ . As a result, the compressed difference data DC formed by the quasi-instantaneous compression unit 11 is shown in FIG. 6 (b) for each of the samples # 1 to # 8.
It becomes what was shown in.

そして、この圧縮差分データDCは、おのおののサンプ
ル♯１〜♯８について、それぞれ上述の最適化差分ビッ
トルーチンにより第６図（ｃ）に示したような最適化差
分データDCoにそれぞれ補正される。The compressed difference data DC is corrected to the optimized difference data DCo as shown in FIG. 6C for each of the samples # 1 to # 8 by the above-described optimized difference bit routine.

その結果、後述する音声復号化装置では、この最適化
差分データDCoのおのおののサンプル♯１〜♯８の値が
第６図（ｄ）に示したように８ビットの差分データに伸
長され、この差分データに基づいて、音声信号が第５図
（ｃ）に実線で示したように復号される。ここで、上述
した圧縮差分データDCをそのまま符号化データに用いた
場合の再生された音声信号を第５図（ｃ）に一点鎖線で
示す。As a result, in the speech decoding device described later, the values of the samples # 1 to # 8 of the optimized difference data DCo are expanded to 8-bit difference data as shown in FIG. Based on the difference data, the audio signal is decoded as shown by the solid line in FIG. 5 (c). Here, the reproduced audio signal when the above-described compressed difference data DC is used as it is as the encoded data is shown by a dashed line in FIG. 5 (c).

これらの比較からわかるように、最適化差分データDC
oに基づいて再生された音声信号は、第５図（ｃ）に破
線で示した符号化前の音声信号にまとわりつくようなも
のとなり、符号化前の音声信号に対する追従性が大きく
向上している。As can be seen from these comparisons, the optimized difference data DC
The audio signal reproduced on the basis of o is clinged to the audio signal before encoding indicated by the broken line in FIG. 5 (c), and the followability to the audio signal before encoding is greatly improved. .

なお、上述した最適化差分ビットルーチン以外で、上
述したような圧縮差分データの最適化処理を実現するこ
とができる。例えば、スケール値を決定した後には、お
のおののサンプルについて、そのスケール値に対応した
量子化幅でとりうる復号値のうち最もサンプリング値に
近いものを順次選択し、その復号値に対応した圧縮差分
データを形成することで、圧縮差分データを最適化する
ことができる。Note that, other than the above-described optimization difference bit routine, the above-described optimization processing of the compressed difference data can be realized. For example, after determining the scale value, for each sample, sequentially select the decoded value that is closest to the sampled value among the decoded values that can be taken with the quantization width corresponding to the scale value, and calculate the compression difference corresponding to the decoded value. By forming the data, the compressed difference data can be optimized.

第７図は、本発明の一実施例にかかる音声復号化装置
の一例を示している。この音声復号化装置は、上述した
音声符号化装置によって符号化された符号化データDLを
復号して音声信号を出力するものである。FIG. 7 shows an example of a speech decoding apparatus according to one embodiment of the present invention. This audio decoding device decodes the encoded data DL encoded by the above-described audio encoding device and outputs an audio signal.

同図において、例えばデータ受信装置あるいはデータ
記憶装置等の前段装置（図示略）から出力された符号化
データDLは、デマルチプレクサ21に加えられ、おのおの
のブロック毎に、先頭の３ビットがスケール値SCとして
識別されて準瞬時伸長部22のスケール値入力端に加えら
れ、それ以外のコードデータ（圧縮差分データ）は、準
瞬時伸長部22のコードデータ入力端に加えられる。In the figure, coded data DL output from a preceding device (not shown) such as a data receiving device or a data storage device is applied to a demultiplexer 21, and for each block, the first three bits are scale values. It is identified as SC and is applied to the scale value input terminal of the quasi-instantaneous decompression unit 22, and the other code data (compression difference data) is applied to the code data input terminal of the quasi-instantaneous decompression unit 22.

準瞬時伸長部22は、加えられるコードデータを３ビッ
トずつに区別るとともに、８ビットデータにおいて入力
したスケールデータSCに対応したビット位置にその３ビ
ットのデータを配置し、そのコードデータよりも上位桁
には符号ビットの内容を、下位桁には０を配置して８ビ
ットデータ（第６図（ｄ）参照）に伸長し、この８ビッ
トデータを積分部23に出力する。The quasi-instantaneous decompression unit 22 distinguishes the added code data by 3 bits, arranges the 3-bit data at a bit position corresponding to the input scale data SC in the 8-bit data, and places the higher-order data above the code data. The contents of the sign bit are placed in the digits, and 0 is placed in the lower digits to expand the data into 8-bit data (see FIG. 6D).

積分部23は、順次入力される８ビットデータを積算し
て音声信号の各サンプルにおける信号値を形成し、これ
をデジタル／アナログ変換器24に出力する。The integrator 23 integrates the sequentially input 8-bit data to form a signal value in each sample of the audio signal, and outputs the signal value to the digital / analog converter 24.

デジタル／アナログ変換器24は、受入した信号値を8K
Hzの変換周波数で対応するアナログ信号（レベル信号）
に変換し、これをローパスフィルタ25に出力する。この
アナログ信号は、ローパスフィルタ25によって波形整形
されたのち、再生音声信号として次段装置（例えば音声
出力装置等）に出力される。The digital / analog converter 24 converts the received signal value to 8K
Analog signal (level signal) corresponding to the conversion frequency of Hz
And outputs this to the low-pass filter 25. The analog signal is shaped by the low-pass filter 25 and then output as a reproduced audio signal to a next-stage device (for example, an audio output device).

このように、本発明にかかる符号化データを復号する
ための音声復号化装置の構成は、非常に簡単なものとな
る。したがって、例えば、汎用の８ビットマイクロプロ
セッサを用いてこの音声復号化装置を実現することもで
き、コストを極く小さく抑えることができる。Thus, the configuration of the audio decoding device for decoding the encoded data according to the present invention is very simple. Therefore, for example, this speech decoding device can be realized using a general-purpose 8-bit microprocessor, and the cost can be kept extremely small.

ところで、準瞬時圧伸においては、１つのブロック内
において信号値の絶対値が最大になるビットパタンに基
づいてスケール位置を設定しているので、例えば、その
ブロックにおいて突発的に大きな値をとるサンプルがあ
った場合、その大きな値に対応したスケール値が設定さ
れるため、他の小さな値に対する符号化データの追従性
が悪くなり、聴覚上好ましくないという不都合を生じる
ことがある。By the way, in the quasi-instantaneous companding, since the scale position is set based on the bit pattern in which the absolute value of the signal value is maximum in one block, for example, a sample that suddenly takes a large value in that block In such a case, since a scale value corresponding to the large value is set, the ability of the encoded data to follow another small value is deteriorated, which may cause an inconvenience that it is not perceptible.

次に、このような不都合を解消できる、本発明にかか
る他の実施例について説明する。Next, another embodiment according to the present invention which can solve such a problem will be described.

され、上述した不都合を解消するには、そのブロック
において最も誤差の少ない符号化データを形成すればよ
く、そのためには適切なスケール位置（値）を設定すれ
ばよい。Therefore, in order to solve the above-mentioned inconvenience, it is only necessary to form encoded data having the least error in the block, and for that purpose, an appropriate scale position (value) may be set.

そこで、本実施例では、最初に設定したスケール位置
に基づいて準瞬時圧縮したときと、そのスケール位置よ
りも１桁上位桁をスケール位置に設定した場合にそのス
ケール位置に基づいて準瞬時圧縮したときと、その最初
のスケール位置よりも１桁下位桁をスケール位置に設定
した場合にそのスケール位置に基づいて準瞬時圧縮した
ときとで、おのおののサンプルにおいて復号値と符号化
前の音声信号の値との誤差を算出してそれらに基づいて
それぞれのスケール位置での評価値を形成し、この評価
値に基づき、それらのスケール位置のもののうち最も良
好なものを選択するようにしている。なお、この評価値
としては、おのおののサンプルにおける元の音声信号と
復号値との差の絶対値の総和や、その差を自乗した値の
総和またはその平方根（誤差パワー）を用いることがで
きる。Therefore, in the present embodiment, the quasi-instantaneous compression is performed based on the scale position when the quasi-instantaneous compression is performed based on the initially set scale position and when the digit one digit higher than the scale position is set as the scale position. When the quasi-instantaneous compression is performed on the basis of the scale position when one digit lower order than the first scale position is set as the scale position, the decoded value of each sample and the audio signal before encoding are An error with the value is calculated, and an evaluation value at each scale position is formed based on the error. Based on the evaluation value, the best one at those scale positions is selected. As the evaluation value, the sum of the absolute value of the difference between the original speech signal and the decoded value in each sample, the sum of the squared values of the difference, or the square root (error power) can be used.

すなわち、例えば第８図（ａ）に示したような音声信
号があったとき、この音声信号の各サンプル♯１〜♯８
のうち差分値が最も大きいサンプル♯１に基づいてスケ
ール値が決定され、その状態が同図であるとする。That is, for example, when there is an audio signal as shown in FIG. 8 (a), each sample # 1 to # 8 of this audio signal
It is assumed that the scale value is determined based on the sample # 1 having the largest difference value among them, and the state is as shown in FIG.

これに対し、スケール値を１つ小さくした場合にはサ
ンプル値の量子化幅が１段階小さくなるので同図（ｂ）
の状態になり、スケール値を１つ大きくした場合にはサ
ンプル値の量子化幅が１段階大きくなるので同図（ｃ）
の状態となる。On the other hand, when the scale value is reduced by one, the quantization width of the sample value is reduced by one step, and therefore, FIG.
When the scale value is increased by one, the quantization width of the sample value is increased by one step, so that FIG.
State.

また、おのおのの状態で、上述した最適化差分ビット
ルーチンによる最適化処理を実行したとき、各サンプル
における符号ビットは、次の表のようになる。Further, when the optimization process is performed by the above-described optimization difference bit routine in each state, the sign bit in each sample is as shown in the following table.

ここで、SC₀は第８図（ａ）の場合のスケール値を、S
C_-1はSC₀よりも１だけ小さい同図（ｂ）の場合のスケー
ル値を、SC₁はSC₀よりも１だけ大きい同図（Ｃ）の場合
のスケール値をそれぞれ示している。 Here, SC ₀ represents the scale value in the case of FIG.
C _-1 is the scale value of the case of FIG one less than SC ₀ (b), SC ₁ shows the scale values in the case of FIG one greater than SC ₀ (C), respectively.

このようにスケール値を変更すると、当然のことなが
ら元の音声信号と復号値との差の総和あるいは誤差パワ
ーは、そのブロックにおける音声信号の変化に対応して
いずれか１つのスケール値のものが最小となり、その最
小となるものが当該ブロックにおいて音声信号に対する
追従性が最良である。When the scale value is changed in this manner, naturally, the sum or error power of the difference between the original audio signal and the decoded value is changed to one of the scale values corresponding to the change in the audio signal in the block. The smallest one has the best followability to the audio signal in the block.

例えば、発明者等の実験によれば統計的にみて、スケ
ール値SC₀のものの誤差パワーが最も小さくなるのはブ
ロック総数のうち60％程度であり、スケール値SC_-1のも
のの誤差パワーが最も小さくなるのはブロック総数のう
ち30％程度であり、スケール値SC₁のものの誤差パワー
が最も小さくなるのはブロック総数のうち10％程度であ
る。For example, as viewed in inventors statistically According to experiments, is approximately 60% of total blocks is the error power is the smallest of those of the scale value SC _0, the error power of one of the scale value SC _-1 is most the smaller is about 30% of total blocks, the error power of one of the scale value SC ₁ of the smallest is about 10% of total blocks.

このようにして、ブロック毎に使用するスケール値を
選択することで、再生した音声信号の音質（聴覚上の）
が向上する。In this way, by selecting the scale value to be used for each block, the sound quality of the reproduced audio signal (audible)
Is improved.

この実施例にかかる音声符号化装置の一例を第９図に
示す。なお、同図において第１図と同一部分には同一符
号を付し、相当する部分にはそれと同一符号にサフィッ
クスを付加してその説明を省略する。またこの装置は、
準瞬時圧縮の１ブロック単位に作動する。FIG. 9 shows an example of the speech encoding apparatus according to this embodiment. In this figure, the same parts as those in FIG. 1 are denoted by the same reference numerals, and the corresponding parts are denoted by the same reference numerals with suffixes, and the description thereof is omitted. This device also
Operates in units of one block of quasi-instantaneous compression.

同図において、スケール値設定部7aは、最大値制限回
路５から処理される差分データDDの連続する８サンプル
からなるブロックでスケール値を判別するとともに、そ
のスケール値よりも１つ大きな値および１つ小さな値を
形成し、それぞれスケールデータDK₀,DK₁,DK_-1として出
力する。これらのスケールデータDK₀,DK₁,DK_-1は、上述
したスケール値SC₀,SC₁,SC_-1にそれぞれ相当する。In the figure, a scale value setting unit 7a determines a scale value in a block composed of continuous eight samples of difference data DD processed from the maximum value limiting circuit 5, and also sets a value larger by one than the scale value and 1 The next smaller value is formed and output as scale data DK ₀ , DK ₁ , DK _-1 respectively. These scale data DK ₀ , DK ₁ , DK _-1 correspond to the above-mentioned scale values SC ₀ , SC ₁ , SC _-1 respectively.

スケールデータDK₀は、準瞬時圧縮部11₀、最適化処理
12₀、準瞬時伸長部14₀およびセレクタ31の一入力端に加
えられ、スケールデータDK₁は、準瞬時圧縮部11₁、最適
化処理部12₁、準瞬時伸長部14₁およびセククタ31の他入
力端に加えられ、スケールデータDK_-1は、準瞬時圧縮部
11_-1、最適化処理部12_-1、準瞬時伸長部14_-1およびセレ
クタ31のさらに他の入力端に加えられている。The scale data DK ₀ is converted to a quasi-instantaneous compression unit ₁₁₀ , an optimization process
12 ₀ are added to the quasi-instantaneous expansion unit 14 ₀ and single input terminal of the selector 31, the scale data DK ₁ is quasi-instantaneous compression unit 11 _1, the optimization processing unit 12 _1, quasi-instantaneous extension portion 14 of the _first and Sekukuta 31 The scale data DK- ₁ applied to the other input terminal is
11 _-1 , an optimization processing unit 12 _-1 , a quasi-instantaneous decompression unit _14-1, and a selector 31.

準瞬時圧縮部11₀はスケールデータDK₀に基づきバッフ
ァ６の出力データから圧縮差分データDC₀を形成し、最
適化処理部12₀はこの圧縮差分データDC₀に上述した最適
化差分ビットルーチンを適用して最適化差分データDCo₀
を形成し、この最適化差分データDCo₀は準瞬時伸長部14
₀およびセレクタ32の一入力端に加えられる。準瞬時伸
長部14₀は入力される最適化差分データDCo₀をスケール
データDK₀を基準として８ビットデータのデータDE₀に伸
長し、このデータDE₀は積分部15₀によって積算されて復
号され、その復号値SD₀は比較部33の一入力端およびセ
レクタ34の一入力端に加えられている。Quasi instantaneous compression unit 11 ₀ forms a compressed difference data DC ₀ from the output data of the buffer 6 on the basis of the scale data DK _0, the optimization processing unit 12 ₀ is the optimized differential bit routine described above to the compressed difference data DC ₀ Apply and optimize difference data DCo ₀
The optimized difference data DCo ₀ is generated by the quasi-instantaneous decompression unit 14.
₀ and one input terminal of the selector 32 are added. Quasi instantaneous expansion unit 14 ₀ is extended to optimize differential data DCo ₀ input to the data DE ₀ 8-bit data on the basis of the scale data DK _0, the data DE ₀ is decoded is accumulated by the integrator 15 ₀ , The decoded value SD ₀ is applied to one input terminal of the comparison unit 33 and one input terminal of the selector 34.

また、準瞬時圧縮部11₁はスケールデータDK₁に基づき
バッファ６の出力データから圧縮差分データDC₁を形成
し、最適化処理部12₁はこの圧縮差分データDC₁に上述し
た最適化差分ビットルーチンを適用して最適化差分デー
タDCo₁を形成し、この最適化差分データDCo₁は準瞬時伸
長部14₁およびセレクタ32の一入力端に加えられる。準
瞬時伸長部14₁は入力される最適化差分データDCo₁をス
ケールデータDK₁を基準として８ビットデータのデータD
E₁に伸長し、このデータDE₁は積分部15₁によって積算さ
れて復号され、その復号値SD₁は比較部33の一入力端お
よびセレクタ34の一入力端に加えられている。The quasi-instantaneous compression unit 11 ₁ forms compressed difference data DC ₁ from the output data of the buffer 6 based on the scale data DK ₁ , and the optimization processing unit 12 ₁ adds the above-mentioned optimized difference bit to the compressed difference data DC _1. by applying the routines to form an optimized difference data DCo _1, this optimization difference data DCo ₁ is applied to one input terminal of the quasi-instantaneous decompression unit 14 ₁ and the selector 32. Data D 8-bit data quasi instantaneous decompression unit 14 ₁ optimization difference data DCo ₁ input as a reference scale data DK ₁
Extends E _1, the data DE ₁ is decoded is accumulated by the integrator 15 _1, the decoded value SD ₁ is applied to one input terminal and one input terminal of the selector 34 of the comparator 33.

同様に、準瞬時圧縮部11_-1はスケールデータDK_-1に基
づきバッファ６の出力データから圧縮差分データDC_-1を
形成し、最適化処理部12_-1はこの圧縮差分データDC_-1に
上述した最適化差分ビットルーチンを適用して最適化差
分データDCo_-1を形成し、この最適化差分データDCo_-1は
準瞬時伸長部14_-1およびセレクタ32の一入力端に加えら
れる。準瞬時伸長部14_-1は入力される最適化差分データ
DCo_-1をスケールデータDK_-1を基準として８ビットデー
タのデータDE_-1に伸長し、このデータDE_-1は積分部15_-1
によって積算されて復号され、その復号値SD_-1は比較部
33の一入力端およびセレクタ34の一入力端に加えられて
いる。Similarly, the quasi-instantaneous compression unit 11 _-1 to form a compressed difference data DC _-1 from the output data of the buffer 6 on the basis of the scale data DK _-1, the optimization processing unit 12 _-1 to the compressed difference data DC _-1 forming an optimized differential data DCo _-1 by applying an optimization difference bit routine described above, the optimization difference data DCo _-1 is applied to one input terminal of the quasi-instantaneous expansion unit 14 ₁ and the selector 32. The quasi-instantaneous decompressor _14-1 receives the input optimized difference data
The DCo _-1 extends data DE _-1 of 8-bit data on the basis of the scale data DK _-1, the data DE _-1 integral part 15 _-1
And the decoded value SD- ₁ is compared by the comparator
One input terminal 33 and one input terminal of the selector 34 are added.

また、最大値制限回路５から出力される差分データDD
は、バッファ10aによって１ブロック分が蓄積され、こ
のバッファ10aの出力は積分部15aによって順次積算され
て圧縮されていない音声信号（すなわち原信号）が形成
され、それに対応したデータSDaが最適化処理部12₀,1
2₁,12_-1および比較部33に加えられている。Also, the difference data DD output from the maximum value limiting circuit 5
Is stored in the buffer 10a, the output of the buffer 10a is sequentially integrated by the integrator 15a to form an uncompressed audio signal (that is, an original signal), and the data SDa corresponding thereto is subjected to optimization processing. Part 12 ₀ , 1
2 ₁ , 12 _-1 and the comparison unit 33.

このように、比較部33には、原信号に対応したデータ
SDa、スケール値SC₀（スケールデータDK₀）に対応した
最適化差分データDCo₀を復号したときの復号値SD₀、ス
ケール値SC₁（スケールデータDK₁）に対応した最適化差
分データDCo₁を復号したときの復号値SD₁、および、ス
ケール値SC_-1（スケールデータDK_-1）に対応した最適化
差分データDCo_-1を復号したときの復号値SD_-1が、サン
プル毎にそれぞれ加えられる。As described above, the comparison unit 33 stores the data corresponding to the original signal.
SDa, scale value SC ₀ (scale data DK ₀₎ decoded value SD ₀ when decoding the optimization difference data DCo ₀ corresponding to the scale value SC ₁ (scale data DK ₁₎ optimizing the difference data corresponding to DCo ₁ The decoded value SD ₁ obtained when decoding is performed, and the decoded value SD ₋₁ obtained when the optimized difference data DCo ₋₁ corresponding to the scale value SC ₋₁ (scale data DK ₋₁ ) are decoded are obtained for each sample. Added.

比較部33は、データSDaと復号値SD₀,SD₁,SD_-1に基づ
き、それらの復号値SD₀,SD₁,SD_-1におけるデータSDaか
らの誤差をそれぞれのサンプル毎に形成し、１ブロック
における誤差パワーをおのおのの復号値SD₀,SD₁,SD_-1に
ついて算出し、それらの中で最小値をとるものを判別す
る。Comparator 33, based on data SDa the decoded value _{_{_{SD 0, SD 1, SD -1}}} , to form their decoded value SD _0, SD _1, the SD _-1 error from the data SDa for each sample, The error power in one block is calculated for each of the decoded values SD ₀ , SD ₁ , SD ₋₁ , and the one that takes the minimum value is determined.

そして、復号値SD₀についての誤差パワーが最小値に
なるときには、セレクタ31によってスケールデータDK₀
を選択してこれをマルチプレクサ13に出力するとともに
セレクタ32によって最適化差分データDCo₀を選択してこ
れをマルチプレクサ13に出力する。またセレクタ34によ
って符号値SD₀を選択し、これをレジスタ９に取り込ま
せるデータとする。Then, when the error power for the decoded value SD ₀ becomes the minimum value, the selector 31 sets the scale data DK ₀
Is selected and output to the multiplexer 13, and the optimized difference data DCo ₀ is selected by the selector 32 and output to the multiplexer 13. The code value SD ₀ is selected by the selector 34 and is used as data to be taken into the register 9.

また、復号値SD₁についての誤差パワーが最小になる
ときには、セレクタ31によってスケールデータDK₁を選
択してこれをマルチプレクサ13に出力するとともにセレ
クタ32によって最適化差分データDCo₁を選択してこれを
マルチプレクサ13に出力する。またセレクタ34によって
符号値SD₁を選択し、これをレジスタ９に取り込ませる
データとする。When the error power for the decoded value SD ₁ is minimized, the selector 31 selects the scale data DK ₁ and outputs it to the multiplexer 13, and the selector 32 selects the optimized difference data DCo ₁ and outputs it. The signal is output to the multiplexer 13. Further, the code value SD ₁ is selected by the selector 34 and is used as data to be taken into the register 9.

同様に、復号値SD_-1についての誤差パワーが最小にな
るときには、セレクタ31によってスケールデータDK_-1を
選択してこれをマルチプレクサ13に出力するとともにセ
レクタ32によって最適化差分データDCo_-1を選択してこ
れをマルチプレクサ13に出力する。またセレクタ34によ
って符号値SD_-1を選択し、これをレジスタ９に取り込ま
せるデータとする。Similarly, when the error power of the decoded value SD- ₁ is minimized, the selector 31 selects the scale data DK- ₁ and outputs it to the multiplexer 13, and the selector 32 selects the optimized difference data DCo- ₁ . This is output to the multiplexer 13. The code value SD- ₁ is selected by the selector 34, and is used as data to be taken into the register 9.

したがって、マルチプレクサ13からは、そのブロック
において最も誤差パワーが小さくなる符号化データDLが
出力される。Therefore, the coded data DL having the smallest error power in the block is output from the multiplexer 13.

比較部33が実行する比較ルーチンの一例を第10図に示
す。FIG. 10 shows an example of the comparison routine executed by the comparison unit 33.

まず、データSDaおよび復号値SD₀,SD₁,SD_-1をサンプ
ル毎に入力するとともに（処理201）、それらの入力し
たデータに基づき、次式を演算して、復号値SD₀,SD₁,SD
_-1におのおの対応した誤差パワーRMS₀,RMS₁,RMS_-1をそ
れぞれ算出する（処理202）。First, the data SDa and the decoded values SD ₀ , SD ₁ , and SD ₋₁ are input for each sample (processing 201), and the following expression is calculated based on the input data to obtain the decoded values SD ₀ and SD _1. , SD
Respectively calculates each corresponding error power RMS _0, RMS _1, RMS _-1 to _-1 (processing 202).

ここで、ｋ＝0,1,−１、ｊはブロック内におけるサン
プル番号、SDa_jはブロック内のおのおののサンプルにお
けるデータ、SD_kjはブロック内のおのおののサンプルに
おける符号値SD_kをあらわす。 Here, k = 0, 1, -1 and j are sample numbers in the block, SDa _j is data in each sample in the block, and SD _kj is a code value SD _k in each sample in the block.

そして、いずれの誤差パワーRMS₀,RMS₁,RMS_-1が最も
小さいかを識別し（判断203,204,205）、最小の誤差パ
ワーRMS₀,RMS₁,RMS_-1に対応したスケールデータDK₀,D
K₁,DK_-1と最適化差分データDCo₀,DCo₁,DCo_-1をそれぞれ
選択する（処理206,207,208）。Then, it is determined which of the error powers RMS ₀ , RMS ₁ , RMS ₋₁ is the smallest (decisions 203, 204, 205), and the scale data DK ₀ , D corresponding to the minimum error powers RMS ₀ , RMS ₁ , RMS ₋₁
K ₁ , DK ₋₁ and optimized difference data DCo ₀ , DCo ₁ , DCo ₋₁ are selected (processing 206, 207, 208).

なお、この比較ルーチンにおいて、おのおのの復号値
SD₀,SD₁,SD_-1とデータSDaとの誤差に基づいた評価値と
しては、上述した誤差パワー以外に、例えば、おのおの
のサンプルにおけるデータSDaと復号値SD₀,SD₁,SD_-1と
の差の絶対値の総和等を用いることができる。In this comparison routine, each decrypted value
As the evaluation value based on the error between SD ₀ , SD ₁ , SD _-1 and the data SDa, in addition to the above error power, for example, the data SDa and the decoded values SD ₀ , SD ₁ , SD _-1 in each sample are used. And the sum of the absolute values of the differences.

また、この実施例によって形成された符号化データDL
を復号化する音声復号化装置は、第７図と同一のものを
用いることができる。Also, the encoded data DL formed by this embodiment
The same audio decoding device as that shown in FIG. 7 can be used for decoding.

ところで、以上説明した各実施例においては、差分PC
M符号について準瞬時圧伸を適用する場合について述べ
たが、本発明は、PCM符号について準瞬時圧伸を適用す
る場合についても、同様にして適用することができる。By the way, in each embodiment described above, the difference PC
Although the case where the quasi-instantaneous companding is applied to the M code has been described, the present invention can be similarly applied to the case where the quasi-instantaneous companding is applied to the PCM code.

なお、上述した各実施例における各種定数は、本発明
を実施する上での一例を示したものであり、それらに限
ることはない。It should be noted that the various constants in the above-described embodiments are merely examples for implementing the present invention, and the present invention is not limited to these.

［効果］以上説明したように、本発明によれば、PCM符号化さ
れた音声データのうち隣接するサンプル間で直前サンプ
ルとの差分値を形成してその差分値を時系列に所定数ご
とのブロックに分割し、おのおののブロックにおける差
分値の絶対値の最大値に対応した最上位桁をあらわすス
ケールデータを識別し、その最上位桁を含む所定ビット
数のデータを符号データに整形して上記音声データを上
記ブロック単位に圧縮する音声符号化方法において、上
記差分値の最下位ビットに１を加算した第１の一時差分
値と、上記差分値の最下位ビットから１を減算した第２
の一時差分値を算出し、上記差分値に基づく第１の一時
符号データと、上記第１の一時差分値に基づく第２の一
時符号データと、上記第２の一時差分値に基づく第３の
一時符号データを算出し、上記第１の一時符号データに
基づく第１の一時復号値と、上記第２の一時符号データ
に基づく第２の一時復号値と、上記第３の一時符号デー
タに基づく第３の一時復号値を算出し、上記第１の一時
復号値と、上記第２の一時復号値と、上記第３の一時復
号値のうち、当該サンプルにおける原信号の値に最も近
い値をとる一時復号値を判定し、その判定した一時復号
値に対応した符号データを、当然サンプルにおける符号
データとして選択するとともに、後続するブロックの最
初のサンプルの差分値を算出するときには当該ブロック
の最後のサンプルの復号値を基準とするようにしたの
で、符号化時に、準瞬時圧縮処理による圧縮データとし
て、元の音声信号により近づく値が選択され、低ビット
レートで、高音質の音声を再現することができるという
効果を得る。また、符号器側における符号化処理の段階
で、圧縮データの選択を行うので、復号器側の復号化処
理が簡単なものとなり、その結果、復号器の装置構成を
簡単化でき、復号器の装置コストを大幅に削減すること
ができるという効果も得る。[Effects] As described above, according to the present invention, a difference value between adjacent samples in PCM-encoded audio data is formed with respect to the immediately preceding sample, and the difference value is time-sequentially calculated for each predetermined number. By dividing the data into blocks, identifying scale data representing the most significant digit corresponding to the maximum absolute value of the difference value in each block, shaping data of a predetermined number of bits including the most significant digit into code data, In the audio encoding method for compressing audio data in block units, a first temporary difference value obtained by adding 1 to the least significant bit of the difference value and a second temporary difference value obtained by subtracting 1 from the least significant bit of the difference value
Are calculated, a first temporary code data based on the difference value, a second temporary code data based on the first temporary difference value, and a third temporary code data based on the second temporary difference value. Calculating temporary code data, based on the first temporary decode value based on the first temporary code data, the second temporary decode value based on the second temporary code data, and the third temporary code data; A third temporary decoded value is calculated, and a value closest to the value of the original signal in the sample among the first temporary decoded value, the second temporary decoded value, and the third temporary decoded value is calculated. The temporary decoding value to be taken is determined, the code data corresponding to the determined temporary decoding value is naturally selected as the code data in the sample, and the difference value of the first sample of the subsequent block is calculated when the difference value of the first sample of the subsequent block is calculated. sample Since the decoded value is used as a reference, a value closer to the original audio signal is selected as the compressed data by the quasi-instantaneous compression process at the time of encoding, and a high-quality sound can be reproduced at a low bit rate. The effect is obtained. Further, since the selection of the compressed data is performed at the stage of the encoding process on the encoder side, the decoding process on the decoder side is simplified, and as a result, the device configuration of the decoder can be simplified, and the The effect that the apparatus cost can be significantly reduced is also obtained.

また、PCM符号化された音声データのうち隣接するサ
ンプル間で直前サンプルとの差分値を形成してその差分
値を時系列に所定数ごとのブロックに分割し、おのおの
のブロックにおける差分値の絶対値の最大値に対応した
最上位桁をあらわすスケールデータを識別し、その最上
位桁を含む所定ビット数のデータを符号データに整形し
て上記音声データを上記ブロック単位に圧縮する音声符
号化方法において、上記最上位桁およびこの最上位桁に
それぞれ隣接する桁に対応した複数のスケールデータに
それぞれ基づいて上記符号データを整形した複数の符号
データ系列を形成するとともに、上記複数の符号データ
系列について、それぞれ、上記差分値の最下位ビットに
１を加算した第１の一時差分値と、上記差分値の最下位
ビットから１を減算した第２の一時差分値を算出し、上
記差分値に基づく第１の一時符号データと、上記第１の
一時差分値に基づく第２の一時符号データと、上記第２
の一時差分値に基づく第３の一時符号データを算出し、
上記第１の一時符号データに基づく第１の一時復号値
と、上記第２の一時符号データに基づく第２の一時復号
値と、上記第３の一時符号データに基づく第３の一時復
号値を算出し、上記第１の一時復号値と、上記第２の一
時復号値と、上記第３の一時復号値のうち、当該サンプ
ルにおける原信号の値に最も近い値をとる一時復号値を
判定し、その判定した一時復号値に対応した符号データ
を、当該サンプルにおける符号データとして選択するよ
うに、上記複数の符号データ系列を補正するとともに、
それらの補正された符号データ系列のうちブロック内誤
差が最も小さくなる符号データ系列を当該ブロックの符
号データ系列として選択し、後続するブロックの最初の
サンプルの差分値を算出するときには当該ブロックで選
択した符号データ系列の最後のサンプルの復号値を基準
とするようにしたので、符号化時に、準瞬時圧縮処理に
よる圧縮データとして、元の音声信号との誤差がより少
ない符号データ系列が選択され、低ビットレートで、高
音質の音声を再現することができるという効果を得る。
また、符号器側における符号化処理の段階で、圧縮デー
タの選択を行うので、復号器側の復号化処理が簡単なも
のとなり、その結果、復号器の装置構成を簡単化でき、
復号器の装置コストを大幅に削減することができるとい
う効果も得る。Also, a difference value between adjacent samples of the PCM-encoded audio data is formed with respect to the immediately preceding sample, and the difference value is divided in time series into blocks of a predetermined number, and the absolute value of the difference value in each block is calculated. An audio encoding method for identifying scale data representing the most significant digit corresponding to the maximum value, shaping data of a predetermined number of bits including the most significant digit into code data, and compressing the audio data in block units And forming a plurality of code data sequences obtained by shaping the code data based on the most significant digit and a plurality of scale data respectively corresponding to the digits adjacent to the most significant digit, and forming the plurality of code data sequences. And a first temporary difference value obtained by adding 1 to the least significant bit of the difference value, and 1 from the least significant bit of the difference value. Calculating a second temporary difference value, the first temporary code data based on the difference value, and a second temporary code data based on the first temporary difference value, the second
Calculates third temporary code data based on the temporary difference value of
A first temporary decoded value based on the first temporary code data, a second temporary decoded value based on the second temporary code data, and a third temporary decoded value based on the third temporary code data And calculating a temporary decoding value that takes the value closest to the value of the original signal in the sample among the first temporary decoding value, the second temporary decoding value, and the third temporary decoding value. And correcting the plurality of code data sequences so as to select code data corresponding to the determined temporary decoded value as code data in the sample,
Among the corrected code data sequences, the code data sequence with the smallest error in the block is selected as the code data sequence of the block, and when calculating the difference value of the first sample of the subsequent block, the code data sequence is selected. Since the decoded value of the last sample of the coded data sequence is used as a reference, a coded data sequence with a smaller error from the original audio signal is selected as the compressed data by the quasi-instantaneous compression process during encoding. An effect is obtained that high-quality sound can be reproduced at a bit rate.
Also, at the stage of the encoding process on the encoder side, the selection of the compressed data is performed, so that the decoding process on the decoder side is simplified, and as a result, the device configuration of the decoder can be simplified,
The effect that the apparatus cost of the decoder can be significantly reduced is also obtained.

[Brief description of the drawings]

第１図は本発明の一実施例にかかる音声符号化装置を示
したブロック図、第２図は符号化データの信号形式を例
示した信号配置図、第３図（ａ）は最適化処理を説明す
るための波形図、同図（ｂ）は最適化差分ビットルーチ
ンの作用を説明するための信号配置図、第４図（ａ），
（ｂ）は最適化差分ビットルーチンの一例を示したフロ
ーチャート、第５図は最適化処理の効果の説明図、第６
図（ａ）〜（ｄ）は最適化の様子を示した信号配置図、
第７図は本発明の一実施例にかかる音声復号化装置の一
例を示したブロック図、第８図（ａ）〜（ｃ）は本発明
の他の実施例の原理の説明図、第９図は本発明の他の実
施例にかかる音声符号化装置を示したブロック図、第10
図は比較ルーチンの一例を示したフローチャート、第11
図は従来技術の説明図、第12図（ａ）〜（ｃ）は従来技
術の符号化復号化の状況を説明するための信号配置図で
ある。 1,25……ローパスフィルタ、２……アナログ／デジタル
変換器、3,6,10,10a……バッファ、４……加減算器、５
……最大値制限回路、7,7a……スケール値設定部、８…
…加算器、９……レジスタ、11,11₀,11₁,11_-1……準瞬
時圧縮部、12,12₀,12₁,12_-1……最適化処理部、13……
マルチプレクサ、14,14₀,14₁,14_-1,22……準瞬時伸長
部、15,15₀,15₁,15_-1,15a,23……積分部、21……デマル
チプレクサ、24……デジタル／アナログ変換器、31,32,
34……セレクタ、33……比較部。FIG. 1 is a block diagram showing a speech encoding apparatus according to one embodiment of the present invention, FIG. 2 is a signal arrangement diagram illustrating a signal format of encoded data, and FIG. FIG. 4B is a waveform diagram for explaining the operation, and FIG. 4B is a signal arrangement diagram for explaining the operation of the optimized difference bit routine.
(B) is a flowchart showing an example of the optimization difference bit routine, FIG. 5 is a diagram for explaining the effect of the optimization processing, and FIG.
FIGS. 7A to 7D are signal arrangement diagrams showing the state of optimization;
FIG. 7 is a block diagram showing an example of a speech decoding apparatus according to an embodiment of the present invention. FIGS. 8 (a) to 8 (c) are illustrations of the principle of another embodiment of the present invention. FIG. 13 is a block diagram showing a speech encoding apparatus according to another embodiment of the present invention, and FIG.
FIG. 11 is a flowchart showing an example of the comparison routine, and FIG.
The figures are explanatory diagrams of the prior art, and FIGS. 12 (a) to 12 (c) are signal arrangement diagrams for explaining the state of encoding / decoding in the prior art. 1,25 low-pass filter, 2 analog / digital converter, 3, 6, 10, 10a buffer, 4 adder / subtractor, 5
…… Maximum value limiting circuit, 7,7a …… Scale value setting section, 8…
... adder, 9 ... register, 11,11 ₀ , 11 ₁ , 11 _-1 quasi-instantaneous compression unit, 12,12 ₀ , 12 ₁ , 12 _-1 ... optimization processing unit, 13 ...
Multiplexers, 14, 14 ₀ , 14 ₁ , 14 _-1 , 22... Quasi-instantaneous decompressor, 15, 15 ₀ , 15 ₁ , 15 _-1 , 15 a, 23... Integrator, 21. … Digital / analog converters 31, 32,
34 ... selector, 33 ... comparator.

Claims

(57) [Claims]

1. A method of forming a difference value between an adjacent sample of PCM-encoded audio data and an immediately preceding sample, dividing the difference value into a predetermined number of blocks in a time series, and calculating a difference between each block. Identifies the scale data representing the most significant digit corresponding to the maximum value of the absolute value of the value, and forms a predetermined number of bits of data including the most significant digit into code data, and compresses the audio data in the block unit. In the encoding method, a first temporary difference value obtained by adding 1 to the least significant bit of the difference value and a second temporary difference value obtained by subtracting 1 from the least significant bit of the difference value are calculated. First temporary code data based on
Calculating second temporary code data based on the temporary difference value and third temporary code data based on the second temporary difference value, a first temporary decoded value based on the first temporary code data, Calculating a second temporary decoded value based on the second temporary code data and a third temporary decoded value based on the third temporary code data; and calculating the first temporary decoded value and the second temporary decoded value. A decoded value and a temporary decoded value that takes a value closest to the value of the original signal in the sample among the third temporary decoded values are determined, and code data corresponding to the determined temporary decoded value is determined in the sample. A speech encoding method which is selected as code data and uses a decoded value of a last sample of a subsequent block as a reference when calculating a difference value of a first sample of a subsequent block.

2. The speech encoding method according to claim 1, wherein a maximum value of an absolute value of said difference value is limited.

3. A PCM-encoded audio data, wherein a difference value between adjacent samples is formed between adjacent samples, and the difference value is divided into a predetermined number of blocks in a time series. Identifies the scale data representing the most significant digit corresponding to the maximum value of the absolute value of the value, and forms a predetermined number of bits of data including the most significant digit into code data, and compresses the audio data in the block unit. In the encoding method, a plurality of code data sequences are formed by shaping the code data based on the most significant digit and a plurality of scale data respectively corresponding to the digits adjacent to the most significant digit, and For the code data sequence, a first temporary difference value obtained by adding 1 to the least significant bit of the difference value, and a least significant bit of the difference value Is calculated, a first temporary code data based on the difference value, a second temporary code data based on the first temporary difference value, and a second temporary difference value are calculated. , And calculates a first temporary decoded value based on the first temporary code data, a second temporary decoded value based on the second temporary code data, and a third temporary decoded value based on the second temporary code data. Calculating a third temporary decoded value based on the temporary code data, and among the first temporary decoded value, the second temporary decoded value, and the third temporary decoded value, the value of the original signal in the sample; Is determined, and the code data corresponding to the determined temporary decode value is selected as code data in the sample, and the plurality of code data sequences are corrected. Corrected code data The code data sequence in which the error in the block is the smallest among the columns is selected as the code data sequence of the block, and when calculating the difference value of the first sample of the subsequent block, the last sample of the code data sequence selected in the block is used. A speech encoding method characterized by using a decoded value of a reference as a reference.

4. A speech encoding method according to claim 3, wherein said intra-block error is an error power in each block of the code data sequence.

5. A speech coding method according to claim 3, wherein said error in a block is a sum of absolute values of errors in each sample in a block of each code data sequence. .

6. The speech encoding method according to claim 3, wherein the maximum value of the absolute value of the difference value is limited.