JPS63236415A

JPS63236415A - Voice encoding system

Info

Publication number: JPS63236415A
Application number: JP7085587A
Authority: JP
Inventors: Hiroki Uchiyama; 博喜内山
Original assignee: Ricoh Co Ltd
Current assignee: Ricoh Co Ltd
Priority date: 1987-03-25
Filing date: 1987-03-25
Publication date: 1988-10-03

Abstract

PURPOSE:To realize the high quality of voice with a low bit rate, by changing the arrangement of a code bit at every block by comparing a scale value with a threshold value, and changing bit arrangement at every scale value. CONSTITUTION:A voice waveform SS is quantized at an A/D converter 2, and is stored in a buffer 3, and a difference is taken through a maximum value limitation circuit 5, and data is sent to a scale value setting part 7. The scale value setting part 7 finds the scale value, and a bit arrangement control part compares it with the threshold value, and a quasi-moment compression part 11 takes out a prescribed number of bits of the data from the buffer 6, and rounds off low-order bits. An optimization processing part 12 performs correction so as to set an error with an original signal out of transmission bits at the minimum level. After every completion of a processing for one block, expansion and decoding are performed, and the last value is returned to a register 9, and the difference of the forefront of the next block is calculated. Scale data and code data are inputted to a multiplexer 13 at every block, then, transmission data is formed.

Description

【発明の詳細な説明】五亙立互本発明は、音声符号化方式に係り、例えば、ボイスメー
ル、ボイスメモ、各種装置のガイダンス応答装置などに
応用可能なものである。DETAILED DESCRIPTION OF THE INVENTION The present invention relates to a voice encoding system, and is applicable to, for example, voice mail, voice memo, and guidance response devices for various devices.

笠米五亙例えば、高速デジタル回線を用いて音声信号を伝送した
り、音声応答装置のために音声信号を蓄積および合成す
るなど音声信号をデジタル処理するとき、この音声信号
をなんらかの方法でデジタル信号に変換する必要がある
。For example, when digitally processing an audio signal, such as transmitting an audio signal using a high-speed digital line or storing and synthesizing audio signals for a voice response device, this audio signal is converted into a digital signal by some method. need to be converted to .

基本的には音声信号は周波数帯域が０．３〜３．４ＫＨ
ｚのアナログ信号であり、これをデジタル信号に変換す
るには、例えばサンプリング周波数８ＫＨｚで分解能が
８ビツトのアナログ／デジタル変換器で変換すればよい
（ＰＣＭ（Ｐｕｌｓｅ　Ｃｏｄａ　Ｍｏｄｕｌａｔｉｏ
ｎ）符号化）。Basically, audio signals have a frequency band of 0.3 to 3.4KH.
This is an analog signal of
n) encoding).

そして、このデジタル信号を元の音声信号に戻すには、
サンプリング周波数８にＨｚで分解能が８ビツトのデジ
タル／アナログ変換器でアナログ信号に変換し、さらに
ローパスフィルタを通して波形整形してやればよい。こ
のとき、アナログ／デジタル変換器およびデジタル／ア
ナログ変換器の分解能（すなわちＰＣＭ符号のビット［
）が大きいほど再生した音声の品質が高い。Then, to convert this digital signal back to the original audio signal,
The signal may be converted into an analog signal using a digital/analog converter with a sampling frequency of 8 Hz and a resolution of 8 bits, and then passed through a low-pass filter to shape the waveform. At this time, the resolution of the analog/digital converter and digital/analog converter (i.e., the bit [
) is higher, the quality of the reproduced audio is higher.

ところで、このようなＰＣＭ符号化された音声信号は１
秒あたりのビット速度（データ速度：以下ビットレート
という）が６４Ｋｂｐｓとなり、このように高いビット
レートの音声信号を伝送するには非常に高速な伝送路を
必要とし、また、かかる音声信号を蓄積するためには厖
大な記憶容量のメモリを必要とする。そこで、従来から
、音声信号のビットレートを低減するための種々の提案
がなされている。By the way, such a PCM encoded audio signal has 1
The bit rate per second (data rate: hereinafter referred to as bit rate) is 64 Kbps, and transmitting audio signals with such a high bit rate requires a very high-speed transmission path, and it is necessary to store such audio signals. This requires a huge amount of memory. Therefore, various proposals have been made to reduce the bit rate of audio signals.

その１つに、時系列的に隣接するＰＣＭ符号の差分を形
成する差分ＰＣＭ符号化方式がある。この差分ＰＣＭ符
号化方式は、音声波形の相関性に基づく冗長性を利用し
たものであり、隣接したサンプル間の限られた範囲に含
まれることから、１サンプルあたりのビット数を低減す
ることができる。この差分ＰＣＭ符号化方式をさらに進
めた適応差分ＰＧＭ符号化方式の１つである、ＣＣＩＴ
Ｙ（国際電信電話諮問委員会）勧告による適応差分ＰＣ
Ｍ方式では、３２Ｋｂｐｓのビットレートを実現してい
る。One of them is a differential PCM encoding method that forms a difference between chronologically adjacent PCM codes. This differential PCM encoding method utilizes redundancy based on the correlation of audio waveforms, and since it is included in a limited range between adjacent samples, it is possible to reduce the number of bits per sample. can. CCIT is one of the adaptive differential PGM encoding methods that further advances this differential PCM encoding method.
Adaptive differential PC based on Y (International Telegraph and Telephone Advisory Committee) recommendations
The M method achieves a bit rate of 32 Kbps.

この他には、音声信号の非安在性と線形予測可能性を利
用したＡＰＣ−ＡＢ（Ａｄａｐｔｉｖｅ　Ｐｒｅｄｉｃ
ｔｉｏｎ　Ｃｏ−ｄｉｎｇ　ｗｉｔｈ　Ａｄａｐｔｉｖ
ｅ　Ｂｉｔ　Ａ１１ｏｃａｔｉｏｎ）方式、または、音
声分析合成手法によるＬＳＰ（Ｌｉｎｅ　Ｓｐｅｃｔｒ
ｕｍＰａｉｒ）方式などがある。In addition to this, APC-AB (Adaptive Predic
tion Co-ding with Adaptiv
e Bit A11 location) method, or LSP (Line Spectrum) method using speech analysis and synthesis method.
umPair) method, etc.

しかしながら、このような適応差分ＰＣＭ方式、ＡＰＣ
−ＡＢ方式およびＬＳＰ方式は、符号化および復号化の
処理が非常に複雑であり、それらを実現するための装置
は非常に高価なものとなるという不都合がある。However, such adaptive differential PCM method, APC
-The AB method and the LSP method have disadvantages in that the encoding and decoding processes are very complicated, and the equipment for implementing them is very expensive.

一方、放送衛星用の高品位なＰＣＭ音声伝送方式の１つ
に準瞬時圧伸方式がある。この準瞬時圧伸方式は、ＰＣ
Ｍ符号化された音声データを時系列に所定数ごとのブロ
ックに分割し、おのおののブロツクにおける信号絶対値
の最大値に対応した最上位桁をあらわすスケールデータ
を識別し、その最上位桁を含む所定ビット数のデータを
符号データに整形するものであり、比較的符号化処理が
簡単で、かつ、容易に１サンプルのビット数を低減する
ことが可能である。しかしながら、このような準瞬時圧
伸方式は、効率的には充分なものではない。On the other hand, one of the high-quality PCM audio transmission systems for broadcasting satellites is the quasi-instantaneous companding system. This quasi-instantaneous companding method is
Divide M-encoded audio data into a predetermined number of blocks in time series, identify scale data representing the most significant digit corresponding to the maximum value of the signal absolute value in each block, and include the most significant digit. This formats data of a predetermined number of bits into code data, and the encoding process is relatively simple, and the number of bits of one sample can be easily reduced. However, such quasi-instantaneous companding method is not efficient enough.

そこで、この準瞬時圧伸方式の効率を改善する手法とし
て「差分ＰＣＭ方式と準瞬時圧伸との結合」が考えられ
るが、一般に単に準瞬時圧伸を差分ＰＣＭ方式に適用し
ただけでは、圧縮時の欠落ビットが伝送誤差を生じ、受
信側の積分器で誤差が累積して受信不能となる。Therefore, a method to improve the efficiency of this quasi-instantaneous companding method is to ``combine the differential PCM method and quasi-instantaneous companding,'' but in general, simply applying quasi-instantaneous companding to the differential PCM method does not allow compression. Missing bits in time cause transmission errors, and the errors accumulate in the integrator on the receiving side, making reception impossible.

次にこの点について説明する。Next, this point will be explained.

いま準瞬時圧伸を差分ＰＣＭ方式に適用した符号化方式
によって、第４図（ａ）に示したような音声信号を符号
化することを考える。まず、差分ＰＣＭ符号化のために
この音声信号を例えば８ＫＨｚのサンプリング周波数で
サンプリングしてサンプル間の差分値を形成する。ここ
では、隣接するサンプル間の差分値を符号付の８ビツト
のデータすなわち２の補数表現の８ビツトデータであら
れす。そして、準瞬時圧伸のための条件は、８サンプル
で１ブロツクを構成し、伝送データは１サンプルあたり
３ビツトとする。また、スケールデータは３ビツトであ
る。Let us now consider encoding an audio signal as shown in FIG. 4(a) using an encoding method in which quasi-instantaneous companding is applied to the differential PCM method. First, for differential PCM encoding, this audio signal is sampled at a sampling frequency of, for example, 8 KHz to form a difference value between samples. Here, the difference value between adjacent samples is expressed as signed 8-bit data, that is, 8-bit data in two's complement representation. The conditions for quasi-instantaneous companding are that one block consists of eight samples, and the transmission data is three bits per sample. Also, the scale data is 3 bits.

ここで、８つのサンプル＃１〜ｘ８における差分値が第
５図＜ａ＞のように得られたものとするにのブロック内
において差分値の絶対値が最大になるものは、サンプル
＃１であり、したがって、このときのスケール位置ＰＯ
８は、このサンプル＃ｌのビットパタンの最上位桁であ
るビット４となり、スケール位＠ｐｏｓの値は（１００
）、になる。Here, assuming that the difference values for the eight samples #1 to x8 are obtained as shown in Figure 5 <a>, the one with the maximum absolute value of the difference value within the block is sample #1. Yes, therefore, the scale position PO at this time
8 becomes bit 4, which is the most significant digit of the bit pattern of this sample #l, and the value of scale position @pos is (100
),become.

これにより、各サンプルの伝送ビット（伝送データ；す
なわち符号データ）は、このスケール位置ＰｏＳよりも
１つ上位桁で、符号値をあらわすビット５（サインビッ
ト）から３ビツトのデータ、すなわち、ビット５，４．
３のデータとなる。As a result, the transmission bits (transmission data; that is, code data) of each sample are one digit higher than this scale position PoS, and are 3 bits of data from bit 5 (sign bit) representing the code value, that is, bit 5. ,4.
The data will be 3.

したがって、このブロックでは、最初にスケール位置Ｐ
Ｏ８を、それに続いてサンプル＃１〜＃８の伝送ビット
を順次連続して配置して構成した伝送データ（符号デー
タ）は、同図（ｂ）に示したようなものとなる。Therefore, in this block, first scale position P
Transmission data (code data) configured by consecutively arranging O8 and the transmission bits of samples #1 to #8 in sequence is as shown in FIG. 3(b).

このような符号データを復号するとき、まず、１ブロツ
ク分の符号データを３ビツトずつに分解し。When decoding such code data, first, one block of code data is decomposed into three bits each.

その最初の３ビツトの値からスケール位置ＰＯ８を識別
する。そして、後続する３ビツトの符号データを８ビツ
トのデータに伸張するときには、スケール位置ＰＯ８よ
りも１つ上位桁に符号データのＭＳＢ　（最上位ビット
）を一致させ、そのＭＳＢよりも上位桁の各桁には符号
ビットの値をセットし、そのＬＳＢ　（最下位ビット）
よりも下位桁の各桁にはＯをセットする。The scale position PO8 is identified from the value of the first 3 bits. When expanding the following 3-bit code data to 8-bit data, the MSB (most significant bit) of the code data is matched to the digit one higher than the scale position PO8, and each digit higher than the MSB is expanded. Set the value of the sign bit in the digit, and its LSB (least significant bit)
Set O in each lower digit than .

これにより、同図（ｃ）に示したデータが、復号後のデ
ータとして得られる。このデータと、符号化前のデータ
とを比較すると、復号データでは伝送ビットよりも下位
桁の情報（直流成分）が欠落していることがわかる（第
４図（ｂ）参照）。すなわち、情報に欠落ビットを生じ
ている。As a result, the data shown in FIG. 3(c) is obtained as decoded data. Comparing this data with the data before encoding, it is found that the decoded data lacks information (DC component) of lower digits than the transmission bits (see FIG. 4(b)). In other words, there are missing bits in the information.

このような復号データに基づいて音声信号を再生すると
、第４図（Ｃ）に一点鎖線で示したように、欠落ビット
分の誤差が蓄積して負の直流シフトを生じ、同図に破線
で示した元の波形よりも右下がりの波形となり、その結
果、適正に情報を伝送することができない。When an audio signal is reproduced based on such decoded data, errors for missing bits accumulate and cause a negative DC shift, as shown by the dashed line in Fig. 4(C), The waveform becomes more downward sloping than the original waveform shown, and as a result, information cannot be transmitted properly.

このような問題を解決する１つの方法として、「欠落ビ
ットのアキュムレーションによる差分圧伸ＰＣＭ（ＤＣ
−ＰＣＭ）Ｊ　（高欄ほか、電子通信学会論文誌’８４
／１０　Ｖｏｌ、Ｊ６７−Ｂ　Ｎｏ、１０）が提案され
ている。One way to solve this problem is to use differential companding PCM (DC
-PCM) J (Takaran et al., Journal of the Institute of Electronics and Communication Engineers '84
/10 Vol. J67-B No. 10) has been proposed.

しかしながら、この方法は１５ビット程度の差分データ
を８ビット程度に圧縮する場合に有効であり、８ビツト
の差分データを３ビット程度に圧縮するような低ビツト
レートの符号化方式には適用できない。However, this method is effective when compressing differential data of about 15 bits to about 8 bits, and cannot be applied to a low bit rate encoding method that compresses differential data of 8 bits to about 3 bits.

すなわち、このような低ビツトレートの場合にはブロッ
ク間において音声波形の振幅が大幅に変化したときなど
ブロック間でスケール位置が大幅に変動することがあり
、そのためにアキュムレーションされている誤差信号の
方が伝送すべき有効なデータよりも大きな値となること
がある。かかる場合には、伝送すべきデータが誤差信号
に埋もれてしまい、適正なデータ伝送を実現できない。In other words, in the case of such a low bit rate, the scale position may vary significantly between blocks, such as when the amplitude of the audio waveform changes significantly between blocks, so the accumulated error signal is The value may be larger than the valid data to be transmitted. In such a case, the data to be transmitted will be buried in the error signal, making it impossible to achieve proper data transmission.

本出願人は、上述した従来技術の不都合を解決するため
、先に、低ビツトレートで、簡単な処理により、高品質
な音声を再現できる音声符号化方式を提供した。In order to solve the above-mentioned disadvantages of the prior art, the applicant of the present invention first provided an audio encoding system that can reproduce high-quality audio at a low bit rate and through simple processing.

第６図は、本出願人が先に提案した音声符号化装置の一
例を説明するための図で、この例は、差分ＰＣＭ符号化
方式に準瞬時圧伸を適用したものであり、準瞬時圧伸を
施したさいにもとまるブロック化した圧縮差分データを
順次復号し、原信号と比較することで圧縮ビット数内で
誤差の少ない差分データとなるようにサンプル点毎に補
正している。またこの例では、８ＫＨｚのサンプリング
周波数で音声信号をサンプリングしてサンプル間の差分
値を２の補数表現の８ビツトデータであられし、８サン
プルで準瞬時圧伸のための１ブロツクを構成し、伝送デ
ータは１サンプルあたり３ビツト、および、スケールデ
ータは３ビツトとする。FIG. 6 is a diagram for explaining an example of a speech encoding device previously proposed by the applicant. This example applies quasi-instantaneous companding to the differential PCM encoding method, and The compressed difference data that remains in blocks during companding is sequentially decoded and compared with the original signal to correct each sample point so that the difference data has fewer errors within the number of compressed bits. In this example, the audio signal is sampled at a sampling frequency of 8 KHz, and the difference value between the samples is expressed as 8-bit data in two's complement representation, and 8 samples constitute one block for quasi-instantaneous companding. The transmission data is 3 bits per sample, and the scale data is 3 bits.

同図において、入力音声信号ＳＳは、ローパスフィルタ
１によって帯域制限されたのちにアナログ／デジタル変
換器２に加えられて８ビツトのデジタル信号ＤＳに変換
される。このアナログ／デジタル変換器２は、サンプリ
ング周波数が８Ｋ）ｌｚで直線量子化するものである。In the figure, an input audio signal SS is band-limited by a low-pass filter 1 and then applied to an analog/digital converter 2 where it is converted into an 8-bit digital signal DS. This analog/digital converter 2 performs linear quantization at a sampling frequency of 8K)lz.

デジタル信号Ｏ８は、１ブロツクをなす８サンプル分の
記憶容量をもつバッファ３に蓄積され、このバッファ３
に蓄積されたデジタル信号ＤＳｄは、差分データを形成
するための加減算器４のプラス入力端に加えられている
。The digital signal O8 is accumulated in a buffer 3 having a storage capacity of 8 samples forming one block.
The digital signal DSd stored in is applied to the plus input terminal of the adder/subtractor 4 for forming differential data.

この加減算器４から出力される９ビツトの差分データＤ
Ｄｓは、最大値制限回路５によって８ビツトの差分デー
タＤＤに変換される。このようにして差分データの最大
値を制限したのは、次のような理由による。The 9-bit difference data D output from this adder/subtractor 4
Ds is converted into 8-bit differential data DD by the maximum value limiting circuit 5. The reason why the maximum value of differential data is limited in this way is as follows.

すなわち、差分データＤＤｓをそのまま準瞬時圧縮した
場合、突発的に大きな差分データを含むブロックにおい
てはそれに対応とた大きなスケール値が設定され、その
ために他の小さな差分データに対する準瞬時圧縮後のデ
ータの追従性が悪化する。その結果、復元した音声信号
が、聴覚上、ギクシャクした感じを与える。そこで、こ
のように差分データの最大値をある程度に制限すること
により、このような聴覚上の問題を解決することができ
る。In other words, when the differential data DDs is directly compressed quasi-instantaneously, a correspondingly large scale value is set for a block that suddenly contains large differential data, and therefore the data after quasi-instantaneous compression for other small differential data is Followability deteriorates. As a result, the restored audio signal gives an aurally jerky feel. Therefore, by limiting the maximum value of the difference data to a certain extent in this way, such auditory problems can be solved.

この差分データＤＤは、８サンプル分の記憶容量をもつ
バッファ６、準瞬時圧縮のためのスケール値を設定する
ためのスケール値設定部７および加算器８の一入力端に
加えられている。This difference data DD is applied to one input terminal of a buffer 6 having a storage capacity of 8 samples, a scale value setting section 7 for setting a scale value for quasi-instantaneous compression, and an adder 8.

この加算器８の出力はレジスタ９に加えられ、このレジ
スタ９の出力は加減算器４のマイナス入力端、加算器８
の他入力端、および、８サンプル分の記憶容量をもつバ
ッファ１０に加えられている。The output of this adder 8 is added to a register 9, and the output of this register 9 is applied to the minus input terminal of the adder/subtractor 4,
It is added to the other input terminal and to a buffer 10 having a storage capacity of 8 samples.

このようにして、最大値制限回路５によって８ビツトに
制限された差分データＤＤの積算値が加算器８によって
形成され、このデータが差分データＤＤｓを形成するた
めの直前のサンプルのデータとして用いられている。In this way, an integrated value of the difference data DD limited to 8 bits by the maximum value limiting circuit 5 is formed by the adder 8, and this data is used as data of the immediately preceding sample for forming the difference data DDs. ing.

バッファ６の記憶データは、サンプル毎に準瞬時圧縮符
号化する準瞬時圧縮部１１に加えられている。The data stored in the buffer 6 is applied to a quasi-instantaneous compression unit 11 which performs quasi-instantaneous compression encoding for each sample.

スケール値設定部７は、最大値制限回路５から出力され
る差分データＤＤの連続する８サンプルのうち、最も絶
対値の大きいものを識別し、そのビットパタンの最上位
桁を判別してそのビット位置を３ビツトのスケールデー
タＤＫで出力する。このスケールデータＤＫは、準瞬時
圧縮部１１．準瞬時圧縮部１１から出力される圧縮差分
データＤＣを最適なデータに変換するための最適化処理
部１２．１ブロツク分のデータを所定の信号形式に整形
するためのマルチプレクサ１３の一入力端、および、最
適化された圧縮差分データを伸張するための準瞬時伸張
部１４に加えられている。The scale value setting unit 7 identifies the one with the largest absolute value among the eight consecutive samples of the difference data DD output from the maximum value limiting circuit 5, determines the most significant digit of the bit pattern, and sets that bit. The position is output as 3-bit scale data DK. This scale data DK is stored in the quasi-instantaneous compression section 11. An optimization processing section 12 for converting compressed difference data DC output from the quasi-instantaneous compression section 11 into optimal data; one input terminal of a multiplexer 13 for shaping one block's worth of data into a predetermined signal format; It is also added to a quasi-instantaneous decompression unit 14 for decompressing the optimized compressed differential data.

またバッファ１０の記憶データは、レジスタ９から出力
される差分データＤＤを積算して形成された復号データ
であり、符号化の対象である元の音声信号（以下原信号
という）として最適化処理部１２に加えられている。Furthermore, the data stored in the buffer 10 is decoded data formed by integrating the difference data DD output from the register 9, and is treated as the original audio signal (hereinafter referred to as the original signal) to be encoded by the optimization processing unit. It has been added to 12.

準瞬時圧縮部１１は、バッファ６から加えられる８ビッ
トの差分データＤＤについて、おのおののサンプル毎に
、スケール値設定部７から加えられるスケールデータＤ
Ｋがあられすスケール位置よりも１ビツト上位桁をＭＳ
Ｂとする３ビツトのデータを抜きだしてそれを圧縮差分
データＤＣとして最適化処理部１２に出力する。The quasi-instantaneous compression unit 11 converts the 8-bit difference data DD added from the buffer 6 into scale data D added from the scale value setting unit 7 for each sample.
MS the 1 bit higher digit than the scale position where K is
The 3-bit data designated as B is extracted and outputted to the optimization processing section 12 as compressed difference data DC.

最適化処理部１２は、準瞬時圧縮部１１から加えられる
１ブロック分の圧縮差分データＤＣを順次復号し、バッ
ファ１０から加えられる原信号と比較することにより、
圧縮ビット数内で誤差の少ない差分データとなるように
サンプル点毎に圧縮差分データＤＣを補正し、これを最
適化差分データＤＣｏとしてマルチプレクサ１３の他入
力端および準瞬時伸張部１４に加えている・マルチプレクサ１３は４第７図に示したように、スケー
ル値設定部７から出力されたスケールデータＤＫを先頭
に配置し、それに続いて各サンプルの最適化差分データ
ＤＣｏを順次配置して構成した信号を、１ブロック分の
符号化データＤＬとして形成して次段装置（例えばデー
タ伝送装置、あるいは、データ記憶装置等）に出力する
。The optimization processing unit 12 sequentially decodes one block of compressed difference data DC added from the quasi-instantaneous compression unit 11 and compares it with the original signal added from the buffer 10.
The compressed difference data DC is corrected for each sample point so that it becomes difference data with few errors within the number of compressed bits, and this is added to the other input end of the multiplexer 13 and the quasi-instantaneous decompression unit 14 as optimized difference data DCo. - As shown in FIG. 7, the multiplexer 13 is constructed by arranging the scale data DK output from the scale value setting section 7 at the beginning, followed by sequentially arranging the optimized difference data DCo of each sample. The signal is formed as one block of encoded data DL and output to the next stage device (for example, a data transmission device or a data storage device, etc.).

準瞬時伸張部１４は、スケール値設定部７から加えられ
るスケールデータＤＫが示している桁よりも１つ上位桁
にＭＳＢが一致するように、最適化処理部１２から出力
される３ビツトの最適化差分データＤＣ。The quasi-instantaneous decompression unit 14 uses the optimal 3-bit data output from the optimization processing unit 12 so that the MSB matches the digit one higher than the digit indicated by the scale data DK added from the scale value setting unit 7. difference data DC.

を配置するとともに、それよりも上位桁には最適化差分
データＤＣｏの符号データの値を配置し、下位桁には０
を配置して８ビツトの復号データＤＥを形成するもので
あり、その復号データＤＥは積分部１５に加えられる。At the same time, the code data value of the optimized difference data DCo is placed in the higher digits, and 0 is placed in the lower digits.
is arranged to form 8-bit decoded data DE, and the decoded data DE is added to the integrating section 15.

積分部１５は、準瞬時伸張部１４から出力される復号デ
ータＤＥを積算し、符号化データＤＬを実際に復号化処
理して復元したときの復元データＳＤを形成し、この復
元データＳＤをレジスタ９に出力している。レジスタ９
では、この復元データＳＤを、１つのブロックの処理を
終了して次のブロックの処理を開始する直前で取り込む
。The integrating section 15 integrates the decoded data DE output from the quasi-instantaneous decompression section 14, forms restored data SD when the encoded data DL is actually decoded and restored, and stores this restored data SD in a register. It is output to 9. register 9
Now, this restored data SD is fetched just before finishing the processing of one block and starting the processing of the next block.

これにより、準瞬時圧伸に特有な欠落ビットによる誤差
の累積を、次のブロックの最初のサンプルデータを形成
するときに解消することができ、その結果、より正確な
符号化データＤＬを形成することができる。As a result, the accumulation of errors due to missing bits, which is characteristic of quasi-instantaneous companding, can be eliminated when forming the first sample data of the next block, and as a result, more accurate encoded data DL can be formed. be able to.

このようにして、本実施例では、最適化処理部１２によ
って元の音声信号により追従するように圧縮差分データ
ＤＣを補正するとともに、積分部１５で形成した復元デ
ータＳＤによってブロック内の累積誤差を次のブロック
に反映させているので、準瞬時圧伸による低ビツトレー
トの音声符号化処理をより精度よく実現することができ
る。In this way, in this embodiment, the optimization processing unit 12 corrects the compressed difference data DC so that it follows the original audio signal, and the restoration data SD formed by the integration unit 15 corrects the cumulative error within the block. Since it is reflected in the next block, low bit rate audio encoding processing using quasi-instantaneous companding can be realized with higher accuracy.

次に、最適化処理部１２が実行する圧縮差分データの最
適化処理について説明する。Next, the optimization process of compressed difference data executed by the optimization processing unit 12 will be explained.

いま、第８図（ａ）に示したような音声波形を符号化す
ることを考える。サンプル＃０を基準としてサンプル＃
１．＃２．＃３の差分をそれぞれ８ビツトで形成したと
き、この場合の差分の絶対値はサンプル＃１が最大にな
る。したがって、このときの３ビツトの圧縮差分データ
はこのサンプル＃１を４基準に形成され、スケール値は
そのビットパタンの最上位桁の桁位置となる。Now, let us consider encoding a speech waveform as shown in FIG. 8(a). Sample # with reference to sample #0
1. #2. When the difference in #3 is formed using 8 bits each, the absolute value of the difference in this case is the largest in sample #1. Therefore, the 3-bit compressed difference data at this time is formed based on sample #1 as the 4th standard, and the scale value is the most significant digit position of the bit pattern.

さて、圧縮差分データを３ビツトであられした場合、表
現できるデータはスケール値よりも１ビツト下位桁の量
子化幅のものであり、したがって、おのおののサンプル
＄１．＃２．＃３の値は、この量子化幅で表現可能なデ
ータに置換される。例えば、サンプル＃１の圧縮差分デ
ータはこの表現できるデータのうち実際の値Ｐ１１より
も下の値Ｐ１２（＝（０１０）、　：ただしこの場合Ｌ
ＳＢはスケール位置よりも１ビツト下位桁、以下同じ）
になる。Now, when the compressed difference data is made up of 3 bits, the data that can be expressed has a quantization width of 1 bit lower than the scale value, so each sample has a quantization width of $1. #2. The value #3 is replaced with data that can be expressed with this quantization width. For example, the compressed difference data of sample #1 is a value P12 (= (010), which is lower than the actual value P11 among the data that can be expressed. However, in this case, L
SB is 1 bit lower than the scale position, the same applies hereafter)
become.

ところで、この量子化幅で表現できるデータのうち、Ｐ
Ｉ３よりも１つ大きな値Ｐ１３（＝（０１１）ｚ）に対
応したデータの方がよりサンプル＃１の実際の値ｐＨに
近い。したがって、この値Ｐ１３をサンプル＃１の圧縮
差分データとすれば、復号化したときの音声信号（復号
値）の誤差を小さくすることができる。By the way, among the data that can be expressed with this quantization width, P
The data corresponding to the value P13 (=(011)z), which is one value larger than I3, is closer to the actual pH value of sample #1. Therefore, if this value P13 is used as the compressed difference data of sample #1, the error in the audio signal (decoded value) when decoded can be reduced.

すなわち、このときの復号値の誤差は、最大でもこの圧
縮差分データの量子化幅の１７２に抑えることができる
。That is, the error in the decoded value at this time can be suppressed to 172, which is the quantization width of the compressed difference data, at most.

同様に、サンプル＃２．＃３について考えると、その復
号値が符号化前の信号の値（サンプル＃２では値Ｐ２１
．サンプル＃３では値Ｐ３１）に最も近くなる圧縮差分
データを選択すればよい。Similarly, sample #2. Considering #3, its decoded value is the value of the signal before encoding (value P21 in sample #2).
．． For sample #3, the compressed difference data closest to the value P31) may be selected.

すなわち、この場合、サンプル＃２については、値Ｐ２
１よりも小さい値Ｐ２２に基づいた復号値に較べて値Ｐ
２１よりも大きい値Ｐ２３に基づいた復号値の方がより
値Ｐ２１に近いので、サンプル＃ｌの復号値である値Ｐ
１３と値Ｐ２３との差分（＝（ｏｔｏ）ｉ）を圧縮差分
データに設定する。またサンプル＃３については、値Ｐ
３１が圧縮差分データであられし得る復号値に一致する
ので、サンプル＃２の復号値である値Ｐ２３と値Ｐ３１
との差分（＝（０１０）、）を圧縮差分データに設定す
る。That is, in this case, for sample #2, the value P2
The value P compared to the decoded value based on the value P22 smaller than 1
Since the decoded value based on the value P23 larger than 21 is closer to the value P21, the value P which is the decoded value of sample #l
13 and the value P23 (=(oto)i) is set as compressed difference data. Also, for sample #3, the value P
31 matches the possible decoded value of the compressed difference data, so the value P23 and the value P31, which are the decoded values of sample #2,
Set the difference (=(010),) with the compressed difference data.

このようにして、元の音声信号に対する追従性が向上し
た圧縮差分データを形成することができる。そのための
処理である最適化差分ビットルーチンの一例を第９図（
ａ）、（’ｂ）に示す。In this way, compressed difference data with improved followability to the original audio signal can be created. An example of the optimization difference bit routine that is the process for this purpose is shown in Figure 9 (
Shown in a) and ('b).

まず、準瞬時圧縮部１１から圧縮差分データ（ＤＣ）を
入力しく処理１０１）、その圧縮差分データｄの値が、
圧縮ビット数（この場合は３ビツト）であられすことの
できる正の最大値ＭＡＸ（＝（０１１）りより大きいか
、あるいは、負の最大値ＭＩＮ（＝（１００）２）より
小さいがを判別しく判断１０２，１０３）、判断１０２
の結果がＹＥＳになるときには圧縮差分データｄに値Ｍ
ＡＸを代入しく処理１０４）、判断１０３の結果がＹＥ
Ｓになるときに圧縮差分データｄに値ＭＩＮを代入する
（処理１０５）。First, compressed difference data (DC) is input from the quasi-instantaneous compression unit 11 (step 101), and the value of the compressed difference data d is
Determine whether the number of compression bits (in this case 3 bits) is greater than the maximum positive value MAX (= (011)) or smaller than the maximum negative value MIN (= (100) 2). Judgment 102, 103), Judgment 102
When the result is YES, the value M is added to the compressed difference data d.
Process 104) to substitute AX, and the result of judgment 103 is YE.
When the value becomes S, the value MIN is assigned to the compressed difference data d (processing 105).

次に、圧縮差分データｄよりもＬＳＢ（＝（００１）２
）だけ小さな値ｄｍと圧縮差分データｄよりもＬＳＢだ
け大きな値ｄｐを形成しく処理１０６，１０７）、値ｄ
ａが値ＭＩＮよりも小さくなった場合には値ｄ＋ｕに値
ＷＩＮを代入しく判断１０８．処理１０９）、値ｃｔｐ
が値ＭＡＸよりも大きくなった場合には値ｄｐに値ＭＡ
Ｘを代入する（判断１１０゜処理１１１）。Next, from the compressed difference data d, LSB(=(001)2
Processing 106, 107) to form a value dm that is smaller than the compressed difference data d and a value dp that is larger by the LSB than the compressed difference data d.
If a becomes smaller than the value MIN, determine whether to substitute the value WIN to the value d+u 108. Process 109), value ctp
becomes larger than the value MAX, the value dp is set to the value MAX.
Substitute X (judgment 110° process 111).

このようにして値ｄｐ、ｄａ＋を形成すると、スケール
データＤＫに基づいて値ｄ、ｄｐ、ｄａを準瞬時伸張し
たときの８ビツトの値ｄｄ、ｄｄｐ、ｄｄｍを算出する
（処理１１２）。そして、この値ｄｄ、ｄｄｐ、ｄｄｍ
をそれぞれ１サンプル前のデータの復号値ｄａｏを加算
し、おのおのの値ｄ、ｄｐ、ｄｍに対応したローカル復
号値ｄａ、ｄａｐ。After forming the values dp and da+ in this way, 8-bit values dd, ddp, and ddm are calculated when the values d, dp, and da are expanded quasi-instantaneously based on the scale data DK (processing 112). And these values dd, ddp, ddm
and the decoded value dao of the data one sample before, respectively, to obtain local decoded values da and dap corresponding to the respective values d, dp, and dm.

ｄａｍを形成する（処理１１３）。なお処理１１２およ
び１１３でローカル復号処理を実現しており、そのため
に、直前のサンプルの復号値ｄａｏを記憶しておく。A dam is formed (process 113). Note that local decoding processing is realized in processes 112 and 113, and for this purpose, the decoded value dao of the immediately preceding sample is stored.

次に、当該サンプルに対応した原信号の値ｄａｉをバッ
ファ１０から読み込み、この原信号の値ｄａｉと、おの
おののローカル復号値ｄａ、ｄａｐ、ｄａｍとの差の絶
対値Ｄａ、Ｄｐ、Ｄ１１を算出しく処理１１４）、原信
号の値ｄａｉがローカル復号値ｄａよりも大きいかどう
かを調べる（判断１１５）。Next, the value dai of the original signal corresponding to the sample is read from the buffer 10, and the absolute values Da, Dp, and D11 of the differences between the value dai of the original signal and each local decoded value da, dap, and dam are calculated. Then, it is checked whether the value dai of the original signal is larger than the local decoded value da (decision 115).

この判断１１５の結果がＹＥＳになるときには、絶対値
Ｄａが絶対値ｏｐよりも大きいかどうかを調べ（判断１
１６）、この判断１１６の結果がＹＥＳになるときには
、値ｄに値ｄＰを代入しく処理１１７）処理１０６へと
戻り、判断１１６の結果がＮＯになるときには、値ｄを
最適化差分データＤＣｏとして出力する（処理１１８）
。When the result of this judgment 115 is YES, it is checked whether the absolute value Da is larger than the absolute value op (judgment 1
16) When the result of this judgment 116 is YES, the value dP is substituted for the value d. 117) The process returns to process 106, and when the result of judgment 116 is NO, the value d is assigned as the optimized difference data DCo. Output (process 118)
.

また判断１１５の結果がＮｏになるときには、絶ｌ値Ｄ
ａが絶対値Ｄｐよりも大きいかどうかを調べ（判断１１
９）、この判断１１９の結果がＹＥＳになるときには、
値ｄに値ｄｌｌを代入しく処理１２０）処理１０６へと
戻り。Moreover, when the result of judgment 115 is No, the absolute value D
Check whether a is larger than the absolute value Dp (Judgment 11
9), when the result of this judgment 119 is YES,
Assign the value dll to the value d (process 120) and return to process 106.

判断１１９の結判がＮＯになるときには処理１１８を実
行する。When the result of judgment 119 is NO, processing 118 is executed.

すなわち、第８図（ｂ）の左側に示したように。That is, as shown on the left side of FIG. 8(b).

原信号の値ｄａｉが復号値ｄａよりも復号値ｄａｐに近
いときには、圧縮差分データｄをＬＳＢだけ大きい値ｄ
ｐに更新して、その値を最適化差分データＤＣｏとして
出力する。また、その反対に原信号の値ｄａｉが復号値
ｄａよりも復号値ｄａｍに近いときには、圧縮差分デー
タｄ　３　ＬＳＢだけ小さい値ｄｍに更新して、その値
を最適化差分データＤＣｏとして出力する。なお、１回
の処理で最適な値が得られなかった場合には、この処理
を繰り返し実行する。When the value dai of the original signal is closer to the decoded value dap than the decoded value da, the compressed difference data d is set to a value d larger by the LSB.
p and outputs the value as optimized difference data DCo. Conversely, when the value dai of the original signal is closer to the decoded value dam than the decoded value da, the compressed difference data d 3 LSB is updated to a smaller value dm, and the value is output as the optimized difference data DCo. Note that if the optimum value cannot be obtained in one process, this process is repeatedly executed.

このようにして、準瞬時圧伸で得られた圧縮差分データ
に圧縮ビットのＬＳＢを加算、減算する操作をサンプル
点毎に繰り返し行ない、復号値と原信号との誤差を少な
くするように差分データを補正している。In this way, the operation of adding and subtracting the LSB of the compressed bits to the compressed difference data obtained by quasi-instantaneous companding is repeated for each sample point, and the difference data is processed to reduce the error between the decoded value and the original signal. is being corrected.

なお、バッファ１０に原信号として記憶するデータとし
ては、最大値制限回路５に入力される前のバッファ３の
出力データを用いることもできる。Note that the output data of the buffer 3 before being input to the maximum value limiting circuit 5 can also be used as the data stored in the buffer 10 as the original signal.

したがって、例えば第１０図（ａ）に示したような音声
信号（第４図（ａ）と同じ）をこのような最適化差分ビ
ットルーチンを用いた準瞬時圧縮によって符号化するこ
とを考える６まず、この音声信号の８つのサンプル＃１〜＃８におけ
る差分値が第１１図（ａ）のように得られ、したがって
、このときのスケール位置ＰＯ８は、サンプル１１のビ
ットパタンの最上位桁であるビット４となり、スケール
位置ＰＯ８の値は（１００）、になる、これにより、準
瞬時圧縮部１１によって形成される圧縮差分データＤＣ
は、おのおののサンプル＃１〜＃８について、それぞれ
第１１図（ｂ）に示したものとなる。Therefore, let us consider, for example, encoding the audio signal shown in Fig. 10(a) (same as Fig. 4(a)) by quasi-instantaneous compression using such an optimized differential bit routine6. , the difference values of the eight samples #1 to #8 of this audio signal are obtained as shown in FIG. 11(a), and therefore, the scale position PO8 at this time is the most significant digit of the bit pattern of sample 11. Bit 4 becomes bit 4, and the value of scale position PO8 becomes (100). As a result, the compressed difference data DC formed by the quasi-instantaneous compression section 11
are shown in FIG. 11(b) for each of samples #1 to #8.

そして、この圧縮差分データＤＣは、おのおののサンプ
ル１１〜ＩＪ８について、それぞれ上述の最適化差分ビ
ットルーチンにより第１１図（ｃ）に示したような最適
化差分データＤＣｏにそれぞれ補正される。This compressed difference data DC is corrected for each of the samples 11 to IJ8 into optimized difference data DCo as shown in FIG. 11(c) by the above-mentioned optimization difference bit routine.

その結果、後述する音声復号化装置では、この最適化差
分データＤＣｏのおのおののサンプル１／Ｌ−１８の値
が第１１図（ｄ）に示したように８ビツトの差分データ
に伸張され、この差分データに基づいて、音声信号が第
１０図（ｅ）に実線で示したように復号される。ここで
、上述した圧縮差分データＤＣをそのまま符号化データ
に用いた場合の再生された音声信号を第１０図（ｃ）に
一点鎖線で示す。As a result, in the audio decoding device described later, the value of each sample 1/L-18 of this optimized difference data DCo is expanded into 8-bit difference data as shown in FIG. 11(d). Based on the difference data, the audio signal is decoded as shown by the solid line in FIG. 10(e). Here, a reproduced audio signal when the above-mentioned compressed difference data DC is used as encoded data as it is is shown by a dashed line in FIG. 10(c).

これらの比較かられかるように、最適化差分データＤＣ
ｏに基づいて再生された音声信号は、第１０図（ｃ）に
破線で示した符号化前の音声信号にまとわりつくような
ものとなり、符号化前の音声信号に対する追従性が大き
く向上している。As can be seen from these comparisons, the optimized difference data DC
The audio signal reproduced based on the method o clings to the unencoded audio signal shown by the broken line in FIG. 10(c), and the followability to the unencoded audio signal is greatly improved. .

第１２図は、音声復号化装置の一例を示しているが、こ
の音声復号化装置は、上述した音声符号化装置によって
符号化された符号化データＤＬを復号して音声信号を出
力するものである。FIG. 12 shows an example of an audio decoding device, which decodes the encoded data DL encoded by the aforementioned audio encoding device and outputs an audio signal. be.

同図において、例えばデータ受信装置あるいはデータ記
憶装置等の前段装置（図示路）から出力された符号化デ
ータＯＬは、デマルチプレクサ２１に加えられ、おのお
ののブロック毎に、先頭の３ビツトがスケール値ＳＣと
して識別されて準瞬時伸張部２２のスケール値入力端に
加えられ、それ以外のコードデータ（圧縮差分データ）
は、準瞬時伸張部２２のコードデータ入力端に加えられ
る。In the figure, encoded data OL output from a preceding stage device (shown in the figure) such as a data receiving device or a data storage device is applied to a demultiplexer 21, and the first three bits of each block are converted to a scale value. The other code data (compressed difference data) is identified as SC and added to the scale value input terminal of the quasi-instantaneous decompression unit 22.
is applied to the code data input terminal of the quasi-instantaneous decompression section 22.

準瞬時伸張部２２は、加えられるコードデータを３ビツ
トずつに区切るとともに、８ビツトデータにおいて入力
したスケールデータｓｃに対応したビット位置にその３
ビツトのデータを配置し、そのコードデータよりも上位
桁には符号ビットの内容を、下位桁にはＯを配置して８
ビツトデータ（第１１図（ｄ）参照）に伸張し、この８
ビツトデータを積分部２３に出力する。The quasi-instantaneous decompression unit 22 divides the added code data into 3 bits each, and adds the 3 bits to the bit position corresponding to the input scale data sc in the 8-bit data.
Arrange the bit data, place the contents of the sign bit in the upper digits of the code data, and place O in the lower digits to make 8.
Decompress it to bit data (see Figure 11(d)), and
The bit data is output to the integrating section 23.

積分部２３は、順次入力される８ビツトデータを積算し
て音声信号の各サンプルにおける信号値を形成し、これ
をデジタル／アナログ変換器２４に出力する。The integrating section 23 integrates the sequentially input 8-bit data to form a signal value for each sample of the audio signal, and outputs this to the digital/analog converter 24.

デジタル／アナログ変換器２４は、受入した信号値を８
ＫＨｚの変換周波数で対応するアナログ信号（レベル信
号）に変換し、これをローパスフィルタ２５に出力する
。このアナログ信号は、ローパスフィルタ２５によって
波形整形されたのち、再生音声信号として次段装置（例
えば音声出方装置等）に出力される。The digital/analog converter 24 converts the received signal value into 8
It is converted into a corresponding analog signal (level signal) at a conversion frequency of KHz, and outputted to the low-pass filter 25. This analog signal is waveform-shaped by a low-pass filter 25 and then output as a reproduced audio signal to a subsequent device (for example, an audio output device).

このようにして符号化データを復号するための音声復号
化装置の構成は、非常に簡単なものとなり、したがって
、例えば、汎用の８ビツトマイクロプロセツサを用いて
この音声復号化装置を実現することもでき、コストを極
く小さく抑えることができる。The configuration of the audio decoding device for decoding encoded data in this way is extremely simple, and therefore, it is possible to realize this audio decoding device using, for example, a general-purpose 8-bit microprocessor. It is also possible to keep costs to a minimum.

ところで、準瞬時圧伸においては、１つのブロック内に
おいて信号値の絶対値が最大になるビットパタンに基づ
いてスケール位置を設定しているので、例えば、そのブ
ロックにおいて突発的に大きな値をとるサンプルがあっ
た場合、その大きな値に対応したスケール値が設定され
るため、他の小さい値に対する符号化データの追従性が
悪くなり、聴覚上好ましくないという不都合を生じるこ
とがある。By the way, in quasi-instantaneous companding, the scale position is set based on the bit pattern that maximizes the absolute value of the signal value within one block, so for example, if a sample suddenly takes a large value in that block, If there is a scale value corresponding to that large value, the scale value corresponding to that large value is set, so that the followability of the encoded data with respect to other small values becomes poor, which may cause an inconvenience that is not pleasing to the auditory sense.

次に、このような不都合を解消できるようにした例につ
いて説明する。Next, an example will be described in which such inconvenience can be resolved.

さて、上述した不都合を解消する−には、そのブロック
において最も誤差の少ない符号化データを形成すればよ
く、そのためには適切なスケール位置（値）を設定すれ
ばよい。Now, in order to eliminate the above-mentioned inconvenience, it is sufficient to form encoded data with the least error in that block, and for this purpose, it is sufficient to set an appropriate scale position (value).

そこで、最初に設定したスケール位置に基づいて準瞬時
圧縮したときと、そのスケール位置よりも１桁上位桁を
スケール位置に設定した場合にそのスケール位置に基づ
いて準瞬時圧縮したときと、その最初のスケール位置よ
りも１桁下位桁をスケール位置に設定した場合にそのス
ケール位置に基づいて準瞬時圧縮したときとで、おのお
ののサンプルにおいて復号値と符号化前の音声信号の値
との誤差を算出してそれらに基づいてそれぞれのスケー
ル位置での評価値を形成し、この評価値に基づき、それ
らのスケール位置のもののうち最も良好なものを選択す
るようにしている。なお、この評価値としては、おのお
ののサンプルにおける元の音声信号と復号値との差の絶
対値の総和や、その差を自乗した値の総和または平方根
（誤差パワー）を用いることができる。Therefore, when performing quasi-instantaneous compression based on the initially set scale position, when performing quasi-instantaneous compression based on the scale position when one digit higher than that scale position is set as the scale position, and when performing quasi-instantaneous compression based on the scale position, When the scale position is set to one digit lower than the scale position of , the error between the decoded value and the value of the audio signal before encoding is calculated for each sample when quasi-instantaneous compression is performed based on the scale position. Based on these calculations, an evaluation value at each scale position is formed, and based on this evaluation value, the best one is selected from among those at each scale position. Note that as this evaluation value, the sum of the absolute values of the differences between the original audio signal and the decoded value in each sample, the sum of the squares of the differences, or the square root (error power) can be used.

すなわち１例えば第１３図（Ｃ）に示したような音声信
号があったとき、この音声信号の各サンプル＃１〜＃８
のうち差分値が最も大きいサンプル＃１に基づいてスケ
ール値が決定され、その状態が同図であるとする。In other words, 1. For example, when there is an audio signal as shown in FIG. 13(C), each sample #1 to #8 of this audio signal
Assume that the scale value is determined based on sample #1 having the largest difference value, and the state is shown in the figure.

これに対し、スケール値を１つ小さくした場合にはサン
プル値の量子化幅が１段階小さくなるので同図（ｂ）の
状態になり、スケール値を１つ大きくした場合にはサン
プル値の量子化幅が１段階大きくなるので同図（ｃ）の
状態となる。On the other hand, when the scale value is decreased by one, the quantization width of the sample value is decreased by one step, resulting in the state shown in (b) of the same figure, and when the scale value is increased by one, the quantization width of the sample value is decreased by one step. The width increases by one step, resulting in the state shown in FIG. 3(c).

また、おのおのの状態で、上述した最適化差分ビットル
ーチンによる最適化処理を実行したとき、各サンプルに
おける符号ビットは、次の表のようになる。Furthermore, when the optimization processing using the above-mentioned optimization difference bit routine is executed in each state, the sign bit in each sample becomes as shown in the following table.

盈ここでは、第１３図（ａ）の場合のスケール値を、ＳＣ
−□はＳＣ０よりも１だけ小さい同図（ｂ）の場合のス
ケール値を、ＳＣ□はＳＣ０よりも１だけ大きい同図（
ｃ）の場合のスケール値をそれぞれ示している。Here, the scale value in the case of Fig. 13(a) is expressed as SC
-□ is the scale value in the case of (b) in the same figure, which is smaller by 1 than SC0, and SC□ is the scale value in the case of (b) in the same figure, which is larger by 1 than SC0.
The scale values for case c) are shown respectively.

このようにスケール値を変更すると、当然のことながら
元の音声信号と復号値との差の総和あるいは誤差パワー
は、そのブロックにおける音声信号の変化に対応してい
ずれが１つのスケール値のものが最小となり、その最小
となるものが当該ブロックにおいて音声信号に対する追
従性が最良である。When the scale value is changed in this way, the sum of the differences between the original audio signal and the decoded value, or the error power, naturally changes depending on the change in the audio signal in that block. The minimum value has the best followability for the audio signal in the block.

例えば、実験によれば統計的にみて、スケール値ＳＣｏ
のものの誤差パワーが最も小さくなるのはブロック総数
のうち６０％程度であり、スケール値５Ｃ−１のものの
誤差パワーが最も小さくなるのはブロック総数のうち３
０％程度であり、スケール値ＳＣ工のものの誤差パワー
が最も小さくなるのはブロック総数のうち１０％程度で
ある。For example, according to experiments, statistically speaking, the scale value SCo
The error power of the scale value 5C-1 is the lowest in about 60% of the total number of blocks, and the lowest error power of the scale value 5C-1 is 3 out of the total blocks.
It is about 0%, and the error power of the scale value SC is the smallest at about 10% of the total number of blocks.

このようにして、ブロック毎に使用するスケール値を選
択することで、再生した音声信号の音質（聴覚上の）が
向上する。By selecting the scale value to be used for each block in this manner, the sound quality (audible) of the reproduced audio signal is improved.

第１４図は、上述のごとき点を考慮した音声符号化装置
の一例を示す図で、同図において第１１図と同一部分に
は同一符号を付し、相当する部分にはそれと同一符号に
サフィクスを付加してその説明を省略する。またこの装
置は、準瞬時圧縮の１ブロック単位に作動する。FIG. 14 is a diagram showing an example of a speech encoding device that takes into account the above points. In the figure, the same parts as in FIG. will be added and its explanation will be omitted. This device also operates on a block-by-block basis with quasi-instantaneous compression.

同図において、スケール値設定部７ａは、最大値制限回
路５から出力される差分データＤＤの連続する８サンプ
ルからなるブロックでスケール値を判別するとともに、
そのスケール値よりも１つ大きな値および１つ小さな値
を形成し、それぞれスケールデータＤＫ、　、ＤＫｌ、
ＤＫ−１として出力する。これらのスケールデータＤＫ
、、ＯＫ、、０Ｋ−１は、上述したスケール値ＳＣ０，
ＳＣ□、５Ｃ−１にそれぞれ相当する。In the figure, the scale value setting unit 7a determines the scale value in a block consisting of eight consecutive samples of the difference data DD output from the maximum value limiting circuit 5, and
Forming values one larger and one smaller than the scale value, scale data DK, , DKl, respectively.
Output as DK-1. These scale data DK
,,OK,,0K-1 are the scale values SC0,
They correspond to SC□ and 5C-1, respectively.

スケールデータＤＫｌｌは、準瞬時圧縮部１１゜、最適
化処理部１２Ｉ、、準瞬時伸張部１４゜およびセレクタ
３１の一入力端に加えられ、スケールデータＤＫ工は、
準瞬時圧縮部１１い最適化処理部１２い準瞬時伸張部１
４１およびセレクタ３１の低入力端に加えられ。The scale data DKll is applied to one input terminal of the quasi-instantaneous compression unit 11°, the optimization processing unit 12I, the quasi-instantaneous decompression unit 14°, and the selector 31, and the scale data DKll is
Quasi-instantaneous compression section 11 Optimization processing section 12 Quasi-instantaneous decompression section 1
41 and the low input terminal of selector 31.

スケールデータＤＫ−１は、準瞬時圧縮部１１−１、最
適化処理部１２−１、準瞬時伸張部１４−１およびセレ
クタ３１のさらに他の入力端に加えられている。The scale data DK-1 is applied to the quasi-instantaneous compression section 11-1, the optimization processing section 12-1, the quasi-instantaneous expansion section 14-1, and other input terminals of the selector 31.

準瞬時圧縮部１１゜はスケールデータＤＫ、に基づきバ
ッファ６の出力データから圧縮差分データＤＣ，を形成
し、最適化処理部１２゜はこの圧縮差分データＤＣ，に
上述した最適化差分ビットルーチンを適用して最適化差
分データＤＣｏ、を形成し、この最適化差分データＤＣ
ｏ０は準瞬時伸張部１４゜およびセレクタ３２の一入力
端に加えられる。準瞬時伸張部１４゜は入力される最適
化差分データＤＣｏ、をスケールデータＤＫ、を基準と
して８ビツトデータのデータＤＥ０に伸張し、このデー
タＤＥ、は積分部１５゜によって積算されて復号され、
その復号値ＳＤ、は比較部３３の一入力端およびセレク
タ３４の一入力端に加えられている。The quasi-instantaneous compression unit 11° forms compressed difference data DC from the output data of the buffer 6 based on the scale data DK, and the optimization processing unit 12 applies the above-described optimized difference bit routine to this compressed difference data DC. to form optimized difference data DCo, and this optimized difference data DC
o0 is applied to the quasi-instantaneous expansion section 14° and one input terminal of the selector 32. The quasi-instantaneous decompression unit 14° decompresses the input optimization difference data DCo into 8-bit data DE0 using the scale data DK as a reference, and this data DE is integrated and decoded by the integration unit 15°.
The decoded value SD is applied to one input terminal of the comparator 33 and one input terminal of the selector 34.

また、準瞬時圧縮部１１１はスケールデータＤＫ工に基
づきバッファ６の出力データから圧縮差分データＤＣ□
を形成し、最適化処理部１２□はこの圧縮差分データＤ
Ｃ工に上述した最適化差分ビットルーチンを適用して最
適化差分データＤＣｏ工を形成し、この最適化差分デー
タＤＣｏ、は準瞬時伸張部１４１およびセレクタ３２の
一入力端に加えられる。準瞬時伸張部１４□は入力され
る最適化差分データＤＣｏ工をスケールデータＤＫ１を
基準として８ビツトデータのデータＤＥ１に伸張し、こ
のデータＤＥ１は積分部１５１によって積算されて復号
され、その復号値ＳＤ１は比較部３３の一入力端および
セレクタ３４の一入力端に加えられている。In addition, the quasi-instantaneous compression unit 111 converts the compressed difference data DC□ from the output data of the buffer 6 based on the scale data DK.
The optimization processing unit 12□ forms this compressed difference data D
The above-described optimized difference bit routine is applied to the C process to form optimized difference data DCo, and this optimized difference data DCo is applied to one input terminal of the quasi-instantaneous decompression unit 141 and the selector 32. The quasi-instantaneous expansion unit 14□ expands the input optimization difference data DCo into 8-bit data DE1 using the scale data DK1 as a reference, and this data DE1 is integrated and decoded by the integration unit 151, and its decoded value is SD1 is applied to one input terminal of the comparator 33 and one input terminal of the selector 34.

同様に、準瞬時圧縮部１１−１はスケールデータＤＫ−
０に基づきバッファ６の出力データから圧縮差分データ
ＤＣ−□を形成し、最適化処理部１２−０はこの圧縮差
分データＤＣ−１に上述した最適化差分ビットルーチン
を適用して最適化差分データＤＣｏ−ｔを形成し、この
最適化差分データＤＣｏ−０は準瞬時伸張部１４−１お
よびセレクタ３２の一入力端に加えられる。Similarly, the quasi-instantaneous compression unit 11-1 performs scale data DK-
0 from the output data of the buffer 6, and the optimization processing unit 12-0 applies the above-mentioned optimized difference bit routine to this compressed difference data DC-1 to create the optimized difference data. This optimized difference data DCo-0 is applied to the quasi-instantaneous decompression unit 14-1 and one input terminal of the selector 32.

準瞬時伸張部１４−１は入力される最適化差分データＤ
Ｃｏ−，をスケールデータＤＫ−２を基準として８ビツ
トデータのデータＤＥ−，に伸張し、このデータＤＥ−
１は積分部１５−８によって積算されて復号され、その
復号値ＳＤ−，は比較部３３の一入力端およびセレクタ
３４の一入力端に加えられている。The quasi-instantaneous decompression unit 14-1 receives the input optimization difference data D.
Co-, is expanded to 8-bit data DE-, using scale data DK-2 as a reference, and this data DE-
1 is integrated and decoded by the integrating section 15-8, and the decoded value SD-, is applied to one input terminal of the comparing section 33 and one input terminal of the selector 34.

また、最大値制限回路５から出力される差分データＤＤ
は、バッファ１０ａによって１ブロツク分が蓄積され、
このバッファ１０ａの出力は積分部１５ａによって順次
積算されて圧縮されていない音声信号（すなわち原信号
）が形成され、それに対応したデータＳＤａが最適化処
理部１２゜、１２．．１２−１および比較部３３に加え
られている。Further, the difference data DD output from the maximum value limiting circuit 5
is accumulated by one block by the buffer 10a,
The output of this buffer 10a is sequentially integrated by an integrating section 15a to form an uncompressed audio signal (that is, an original signal), and the corresponding data SDa is sent to the optimization processing sections 12°, 12. ．． 12-1 and the comparison section 33.

このように、比較部３３には、原信号に対応したデータ
ＳＤａ、スケール値ＳＣ，（スケールデータＤＫ、　）
に対応した最適化差分データＤＣｏ。を復号したときの
復号値ＳＤ、、スケール値ＳＣよ（スケールデータＤＫ
、　）に対応した最適化差分データＤＣｏ１を復号した
ときの復号値ＳＤ１、および、スケール値５Ｃ−８（ス
ケールデータＤＫ−□）に対応した最適化差分データＤ
Ｃｏ−□を復号したときの復号値ＳＤ−□が、サンプル
毎にそれぞれ加えられる。In this way, the comparator 33 has data SDa corresponding to the original signal, scale value SC, (scale data DK, )
Optimized difference data DCo corresponding to. Decoded value SD when decoding , scale value SC (scale data DK
, ), and the decoded value SD1 when the optimized differential data DCo1 corresponding to
The decoded value SD-□ obtained when Co-□ is decoded is added to each sample.

比較部３３は、データＳＤａと復号値ＳＤ、　、ＳＤ、
　、５Ｄ−１に基づき、それらの復号値ＳＤ、　、ＳＤ
、　、ＳＤ−、におけるデータＳＤａからの誤差をそれ
ぞれのサンプル毎に形成し、１ブロツクにおける誤差パ
ワーをおのおのの復号値ＳＤ、　、５０１，５Ｄ−１に
ついて算出し、それらの中で最小値をとるものを判別す
る。The comparison unit 33 compares the data SDa and the decoded values SD, ,SD,
, 5D-1, their decoded values SD, ,SD
, , SD- from the data SDa for each sample, calculate the error power in one block for each decoded value SD, , 501, 5D-1, and take the minimum value among them. Discern things.

そして、復号値Ｓ００についての誤差パワーが最小にな
るときには、セレクタ３１によってスケールデータＤＫ
、を選択してこれをマルチプレクサ１３に出力するとと
もにセレクタ３２によって最適化差分データＤＣｏ、を
選択してこれをマルチプレクサ１３に出力する。またセ
レクタ３４によって符号値ＳＤ、を選択し、これをレジ
スタ９に取り込ませるデータとする。Then, when the error power for the decoded value S00 becomes the minimum, the selector 31 selects the scale data DK.
, and outputs it to the multiplexer 13, and the selector 32 selects the optimized difference data DCo, and outputs it to the multiplexer 13. Further, the code value SD is selected by the selector 34 and is used as data to be taken into the register 9.

また、復号値ＳＤ１についての誤差パワーが最小になる
ときは、セレクタ３１によってスケールデータＤＫ工を
選択してこれをマルチプレクサ１３に出力するとともに
セレクタ３２によって最適化差分データＤＣｏ１を選択
してこれをマルチプレクサ１３に出力する。またセレク
タ３４によって符号値Ｓ０１を選択し、これをレジスタ
９に取り込ませるデータとする。Further, when the error power for the decoded value SD1 is the minimum, the selector 31 selects the scale data DK and outputs it to the multiplexer 13, and the selector 32 selects the optimized difference data DCo1 and outputs it to the multiplexer. Output to 13. Further, the code value S01 is selected by the selector 34, and is used as data to be taken into the register 9.

同様に、復号値５Ｄ−１についての誤差パワーが最小に
なるときは、セレクタ３１によってスケールデータＤに
−１を選択してこれをマルチプレクサ１３に出力すると
ともにセレクタ３２によって最適化差分デ−タＤＣｏ　
−１を選択してこれをマルチプレクサ１３に出力する。Similarly, when the error power for the decoded value 5D-1 is the minimum, the selector 31 selects -1 for the scale data D and outputs it to the multiplexer 13, and the selector 32 selects the optimized difference data DCo.
-1 is selected and output to the multiplexer 13.

またセレクタ３４によって符号値５Ｄ−４を選択し、こ
れをレジスタ９に取り込ませるデータとする。Further, the code value 5D-4 is selected by the selector 34 and is used as data to be taken into the register 9.

したがって、マルチプレクサ１３からは、そのブロック
において最も誤差パワーが小さくなる符号化データＯＬ
が出力される。Therefore, from the multiplexer 13, encoded data OL with the smallest error power in that block is output.
is output.

比較部３３が実行する比較ルーチンの一例を第１４図に
示す。まず、データＳＤａおよび復号値ＳＤａ、ＳＤ１
゜５Ｄ−２をサンプル毎に入力するとともに（処理２０
１）、それらの入力したデータに基づき、次式を演算し
て、復号値ＳＤ０．ＳＤ□、５Ｄ−１におのおの対応し
た誤差パワＰＭＳｏ、ＰＭＳ０．ＰＭＳ−１をそれぞれ
算出する（処理２０２）。FIG. 14 shows an example of a comparison routine executed by the comparison section 33. First, data SDa and decoded values SDa, SD1
゜5D-2 is input for each sample (processing 20
1) Based on the input data, calculate the following equation to obtain the decoded value SD0. Error power PMSo, PMS0. corresponding to SD□, 5D-1, respectively. Each PMS-1 is calculated (process 202).

ここで、ｋ＝ｏ、１．−１、ｊはブロック内におけるサ
ンプル番号、５Ｄａｊはブロック内のおのおののサンプ
ルにおけるデータ、　５Ｄｋｊはブロック内のおのおの
のサンプルにおける符号値ＳＤｋをあらわす。Here, k=o, 1. -1 and j represent sample numbers within the block, 5Daj represents data at each sample within the block, and 5Dkj represents the code value SDk at each sample within the block.

そして、いずれの誤差パワーＲＭＳ、　、ＲＭＳ、　、
ＲＭＳ−１が最も小さいかを識別しく判断２０３，２０
４，２０５）、最小の誤差パワーＲＭＳｏ、ＲＭＳ、　
、ＲＭＳ−１に対応したスケールデータＤＫ、　、ＤＫ
、　、ＤＫ−、と最適化差分データＤＣｏ、　。And which error power RMS, , RMS, ,
Distinguishing whether RMS-1 is the smallest 203, 20
4,205), minimum error power RMSo, RMS,
, Scale data DK corresponding to RMS-1, ,DK
, ,DK-, and the optimized difference data DCo, .

ＤＣｏｌ、ＤＣｏ−０をそれぞれ選択する（処理２０６
，２０７，２０８）。Select DCol and DCo-0 (process 206
, 207, 208).

なお、この比較ルーチンにおいて、おのおのの復号値Ｓ
Ｄ、　、ＳＤｌ、５Ｄ−１とデータＳＤａとの誤差に基
づいた評価値としては、上述した誤差パワー以外に、例
えば、おのおののサンプルにおけるデータＳＤａと復号
値ＳＤ０．ＳＤｉ、ＳＤ−，との差の総和等を用いるこ
とができる。In addition, in this comparison routine, each decoded value S
In addition to the above-mentioned error power, the evaluation value based on the error between D, , SDl, 5D-1 and the data SDa may be, for example, the data SDa and the decoded value SD0 . The sum of the differences between SDi, SD-, and the like can be used.

而して、上記音声圧縮符号化方式は、差分ＰＣＭ音声符
号化に準瞬時圧伸を適用し、その伝送ビットを補正して
自然な音声信号の圧縮を行うものであるが、この方式は
、伝送ビットレートの割付けが常に一定であり、より圧
縮効率を上げるためには、伝送ビットを一律に下げるか
サンプリング周波数を小さくするしかなく、このままで
圧縮率を上げることには無理があった。The above-mentioned audio compression encoding method applies quasi-instantaneous companding to differential PCM audio encoding and corrects the transmitted bits to compress a natural audio signal. The allocation of transmission bit rates is always constant, and the only way to increase compression efficiency is to uniformly lower the transmission bits or lower the sampling frequency, and it is impossible to increase the compression rate as is.

■−−蓋本発明は、上述のごとき実情に鑑みてなされたもので、
特に、低ビツトレートかつ簡単な処理により自然な音声
を符号化・復号化する音声符号化方式を提供することを
目的としてなされたものである。■--Lid This invention was made in view of the above-mentioned circumstances.
In particular, the purpose of this invention is to provide an audio encoding system that encodes and decodes natural audio using low bit rate and simple processing.

榎−−コ又本発明は、上記目的を達成するために、入力された音声
信号を所定サンプル数ごとのブロックに分割し、そのブ
ロック単位ごとにサンプル点での値を特徴量に変換し、
その最大値みつけ、その最上術をスケールデータとして
識別し、その桁を基準として所定のビット数のデータを
符号データに整形して、伝送を行う音声符号化方式にお
いて、スケールデータを基準に各ブロックごとの伝送ビ
ットのビット数の割当てを変えること、或いは、ＰＣＭ
符号化された音声データのうち隣接するサンプル間で直
前サンプルとの差分値を形成してその差分値を時系列に
所定数ごとのブロックに分割し、おのおののブロックに
おける差分値の絶対値の最大値に対応した最上位桁をあ
らわすスケールデータを識別し、上記最上位桁およびこ
の最上位桁にそれぞれに隣接する桁に対応した複数のス
ケールデータにそれぞれ基づいてスケールデータの示す
桁も含む所定のビット数を符号データに整形して複数の
符号データ系列を形成するとともに、おのおのの符号デ
ータ系列を構成する個別の符号データに対応する復号値
と当該符号データに対応した上記音声データとの誤差が
最も小さくなるように当該符号データを補正して上記複
数の符号データ系列をそれぞれ補正し、それらの補正さ
れた符号データ系列のうちブロック内における誤差パワ
ーが最も小さくなる符号データ系列を当該ブロックの符
号データ系列として選択し、後続するブロックの最初の
サンプルの差分値を算出するときには当該ブロックで選
択した符号データ系列の最後のサンプルの復号値を基準
とする音声符号化方式において、スケール値を閾値と比
較することで、ブロックごとの符号ビットの割当てを変
更しまたはスケール値ごとにブロックごとの符号データ
のビット割当てを変更することを特徴としたものである
６以下、本発明の実施例に基いて説明する。In order to achieve the above object, the present invention divides an input audio signal into blocks each having a predetermined number of samples, converts the values at the sample points into feature quantities for each block, and
In the audio encoding method, the maximum value is found, the highest value is identified as scale data, data of a predetermined number of bits is formatted into code data based on that digit, and each block is transmitted based on the scale data. Changing the bit allocation of transmission bits per PCM
A difference value between adjacent samples of the encoded audio data and the previous sample is formed, and the difference value is divided into a predetermined number of blocks in time series, and the maximum absolute value of the difference value in each block is calculated. The scale data representing the most significant digit corresponding to the value is identified, and the predetermined scale data including the digit indicated by the scale data is The number of bits is formatted into code data to form a plurality of code data sequences, and the error between the decoded value corresponding to the individual code data constituting each code data sequence and the audio data corresponding to the code data is Each of the plurality of code data sequences is corrected by correcting the code data so as to have the smallest error power, and among the corrected code data sequences, the code data sequence in which the error power within the block is the smallest is used as the code of the block. In an audio encoding method that uses the decoded value of the last sample of the code data series selected in the block as a reference when calculating the difference value of the first sample of the subsequent block, the scale value is used as the threshold. Based on the embodiments of the present invention, the method is characterized in that the code bit allocation for each block is changed by comparison, or the bit allocation for code data for each block is changed for each scale value. explain.

第１図は、本発明の一実施例を説明するためのブロック
線図であるが、本実施例は、第６図に示した従来技術を
改良してスケール値設定部７により設定されたスケール
値をもとに圧縮ビットの割当てを制御するビット割当て
制御部１６を付加したものであるので、図中、第６図に
示した回路と同様の作用をする部分には、第６図の場合
と同一の参照番号を付してその詳細な説明は省略する。FIG. 1 is a block diagram for explaining one embodiment of the present invention. This embodiment improves the conventional technique shown in FIG. Since the bit allocation control section 16 that controls the allocation of compression bits based on the value is added, the parts in the figure that have the same effect as the circuit shown in FIG. The same reference numerals will be used to omit the detailed explanation.

而して、通常の準瞬時圧伸では、スケールデータと数サ
ンプルの符号データが１つのブロックとして伝送される
が、その場合、符号データのビット数はあらかじめ設定
されており、全てのブロックで同一である。そこで、本
発明ではスケールデータの値によって符号データのビッ
ト数を変化させることにより、圧縮効率をさらに向上さ
せるようにしている。In normal quasi-instantaneous companding, scale data and several samples of encoded data are transmitted as one block, but in that case, the number of bits of encoded data is set in advance and is the same for all blocks. It is. Therefore, in the present invention, compression efficiency is further improved by changing the number of bits of code data depending on the value of scale data.

第１図に基いて説明すると、音声波形ＳＳはＬＰ−Ｆ　
１でＡ／Ｄ変換器のサンプリング周波数の１／２以下に
帯域圧縮されてからＡ／Ｄ変換器２で量子化される。こ
のＡ／Ｄ変換されたデータＤＳをバッファ３に蓄える。To explain based on FIG. 1, the audio waveform SS is LP-F
1, the band is compressed to 1/2 or less of the sampling frequency of the A/D converter, and then quantized by the A/D converter 2. This A/D converted data DS is stored in a buffer 3.

バッファ３から１ブロツク分のデータを最大値制限回路
５を通して差分をとり、スケール値設定部７ヘデータを
送る。スケール値設定部７では、各差分データの絶対値
をとり、その最大値から有効値から有効最大桁すなわち
スケール値を求める。そのスケール値をもとにビット割
当て制御部１６では、所定の閾値（ＴＳ）との比較が行
なわれ、スケール値が閾値より大きい場合はｍビット、
小さい場合はｎビットとするように準瞬時圧縮部１１に
制御信号を出す。One block of data is passed from the buffer 3 to the maximum value limiting circuit 5, a difference is taken, and the data is sent to the scale value setting section 7. The scale value setting section 7 takes the absolute value of each difference data, and calculates the maximum effective digit, that is, the scale value, from the maximum value. Based on the scale value, the bit allocation control unit 16 compares it with a predetermined threshold (TS), and if the scale value is larger than the threshold, m bits,
If it is smaller, a control signal is sent to the quasi-instantaneous compression unit 11 to set it to n bits.

準瞬時圧縮部１１ではバッファ６からのデータを上位か
ら閾値により決められた所定ビット数だけとり、下位ビ
ットを切り捨てる。最適化処理部では、伝送ビット（ｍ
ビット又はｎビット）内で最も原信号との誤差が小さく
なるように補正を行う。The quasi-instantaneous compression unit 11 extracts a predetermined number of bits from the upper part of the data from the buffer 6, which is determined by a threshold value, and discards the lower bits. In the optimization processing section, transmission bits (m
Correction is performed so that the error from the original signal is minimized within the bits or n bits).

−ブロック処理が終了したらそのブロックを準瞬時伸長
部１４のロジックにより伸長し積分部１５によってロー
カル復号することで、最後の値をレジスタ９に返し、次
のブロックの先頭の差分を計算する。１ブロツクごとに
スケールデータと符号データはマルチプレクサに入力さ
れて伝送データが形成される。- When block processing is completed, the block is expanded by the logic of the quasi-instantaneous expansion unit 14 and locally decoded by the integration unit 15, so that the last value is returned to the register 9 and the difference at the beginning of the next block is calculated. Scale data and code data are input to a multiplexer for each block to form transmission data.

以下、具体的に、１２ビツトのデータを４ビツトに圧縮
する場合を考え、８サンプルで、１ブロツクを形成し、
スケール値が閾値より大きい場合には４ビツト、小さい
場合には３ビツト割当を行うことにする。Below, we will specifically consider the case of compressing 12-bit data to 4-bit data, and form one block with 8 samples.
If the scale value is larger than the threshold value, 4 bits will be allocated, and if it is smaller, 3 bits will be allocated.

この場合、スケールデータとしては４ビツト必要である
からスケールデータが閾値より大きい場合には４＋４Ｘ
８＝３６ビツト、小さい場合には４＋３Ｘ８＝２８ビツ
ト、１ブロツク当たり伝送することになる。従って、６
（＝（３６−２８））ビット／ブロック節約できる。In this case, 4 bits are required as scale data, so if the scale data is larger than the threshold, 4+4X
8 = 36 bits, or in the case of a small number, 4 + 3 x 8 = 28 bits, which will be transmitted per block. Therefore, 6
(=(36-28)) bits/block can be saved.

この場合、スケール値を閾値と比較してビット割当てを
変更したが、各スケールごとに所定のビット割当てを行
ってもよい。In this case, the bit allocation is changed by comparing the scale value with the threshold value, but a predetermined bit allocation may be performed for each scale.

また、以上には、ブロック内の符号データとして、差分
データを使用したが、この化ＰＣＭ符号データをそのま
ま用いてもよい。Moreover, although differential data is used as code data within a block in the above, this encoded PCM code data may be used as is.

第２図は、第１図に示した符号化データＤＬを復号化す
る復号化方式の一例を説明するための図で、図中、４１
はディマルチプレクサ、４２はビット制御ロジック、４
３は伸長部、４４はレジスタ、４５はデジタル・アナロ
グ（Ｄ／Ａ）変換器、４６はローパスフィルター（Ｌ　
Ｐ　Ｆ）で、伝送信号ＤＬはディマルチプレクサ４１に
入力され、先頭のスケールデータがまず取り出される。FIG. 2 is a diagram for explaining an example of a decoding method for decoding the encoded data DL shown in FIG.
is a demultiplexer, 42 is a bit control logic, 4
3 is an expansion unit, 44 is a register, 45 is a digital-to-analog (D/A) converter, and 46 is a low-pass filter (L
At PF), the transmission signal DL is input to the demultiplexer 41, and the leading scale data is first taken out.

スケールデータはビット制御ロジック４２へ送られ、後
続する符号データのビット数が判断され、そのビット数
がディマルチプレクサ４１に送られスケールデータに後
続した符号データが切り出される。切り出された符号は
、スケール値をもとに伸長され順次前のデータとの加算
が行なわれて復号が行なわれる。The scale data is sent to the bit control logic 42, the number of bits of the code data that follows is determined, and the number of bits is sent to the demultiplexer 41 to cut out the code data that follows the scale data. The extracted code is expanded based on the scale value and sequentially added to previous data to perform decoding.

第３図は、本発明の他の実施例を説明するためのブロッ
ク線図であるが、この実施例は、第１４図に示した従来
技術を改良したものであるので。FIG. 3 is a block diagram for explaining another embodiment of the present invention, and this embodiment is an improvement on the prior art shown in FIG. 14.

図中、第１４図の場合と同様の作用をする部分には第１
４図の場合と同一の参照番号を付してその詳細な説明は
省略する。而して、この実施例は、図示のように、ビッ
ト割当て制御ロジック部３５を設け、スケール値設定部
７ａで求まる基準となるスケール値をもち、この値をビ
ット割当て制御ロジック部３５で判断することにより符
号ビットの割当てを決めるものである。この場合、ビッ
ト割当て制御部３５ではスケールデータの値をもとに閾
値を用いることで、または、各スケール値ごとに割当て
ビットを決定している。In the figure, the parts that have the same effect as in Figure 14 have the first
The same reference numerals as in the case of FIG. 4 are given, and detailed explanation thereof will be omitted. As shown in the figure, this embodiment is provided with a bit allocation control logic section 35, which has a reference scale value determined by the scale value setting section 7a, and this value is determined by the bit allocation control logic section 35. This determines the assignment of code bits. In this case, the bit allocation control unit 35 determines the allocated bits by using a threshold value based on the value of the scale data or for each scale value.

第３図に基いて説明すると、音声波形ＳＳはＬＰＦＩで
Ａ／Ｄ変換器２のサンプリング周波数の１／２以下に帯
域圧縮されてからＡ／Ｄ変換器で量子化される。このＡ
／Ｄ変換されたデータＤＳをバッファ３に蓄え、バッフ
ァ３から１ブロック分のデータを最大値制限回路５を通
して差分をとり、スケール値設定部７ａへデータを送る
。スケール値設定部７ａでは、各差分データの絶対値を
とり、その最大値から有効最大桁すなわちスケール値を
求める。そのスケール値をもとにビット割当て制御部３
５では、所定の閾値（ＴＳ）との比較が行なわれ、スケ
ール値が所定の閾値より大きい場合はｍビット、小さい
場合はｎビット割当てをするように準瞬時圧縮部に制御
信号を出す、また、スケール値設定部７ａで決定された
スケール値と。Explaining based on FIG. 3, the audio waveform SS is band-compressed to 1/2 or less of the sampling frequency of the A/D converter 2 by LPFI, and then quantized by the A/D converter. This A
The /D converted data DS is stored in the buffer 3, one block of data from the buffer 3 is passed through the maximum value limiting circuit 5, a difference is taken, and the data is sent to the scale value setting section 7a. The scale value setting section 7a takes the absolute value of each difference data and calculates the effective maximum digit, that is, the scale value from the maximum value. Based on the scale value, the bit allocation control unit 3
In step 5, a comparison is made with a predetermined threshold (TS), and if the scale value is larger than the predetermined threshold, a control signal is issued to the quasi-instantaneous compression unit to allocate m bits, and if it is smaller, to allocate n bits; , and the scale value determined by the scale value setting section 7a.

そのスケール値を−１した値と、スケール値を＋１した
値の３つのスケール値を設定し、各々のスケール値を用
いて、先に決定された割当てビット数に各符号データを
整形する。Three scale values, ie, a value obtained by subtracting the scale value by 1 and a value obtained by adding 1 to the scale value, are set, and each code data is shaped into the previously determined number of allocated bits using each scale value.

最適化処理部では、割当てビット数内で最も原信号との
誤差が小さくなるように符号データの補正を行う、３つ
のスケール値に対応した符号データを各々積分器部で復
号し、比較部で原信号との誤差が最も小さいスケール値
とその符号データを選択する。そのスケール値と符号デ
ータをマルチプレクサ１３に入力し、一定の新式に従っ
て伝送する。この場合、スケール値による各ブロック内
サンプル点でのビット割当ては、閾値を用いずに、スケ
ール値ごとに所定のビット数を設定してもよい。In the optimization processing section, the code data is corrected so that the error from the original signal is minimized within the allocated number of bits.The code data corresponding to the three scale values is decoded by the integrator section, and the code data is decoded by the comparison section. Select the scale value and its code data with the smallest error from the original signal. The scale value and code data are input to multiplexer 13 and transmitted according to a certain new formula. In this case, bit allocation at sample points within each block based on the scale value may be performed by setting a predetermined number of bits for each scale value without using a threshold.

象−ｍ＝１以上の説明から明らかなように、本発明によると低ビツ
トレートで高品質な音声符号化方式を実現することがで
きる。As is clear from the above description, according to the present invention, it is possible to realize a high quality audio encoding system at a low bit rate.

[Brief explanation of drawings]

第１図は１本発明による符号化処理の一実施例を説明す
るためのブロック線図、第２図は、復号化処理の一例を
説明するためのブロック線図、第３図は、本発明による
符号化処理の他の実施例を説明するためのブロック線図
、第４図は、従来技術の説明図、第５図（ａ）〜（ｃ）
は、従来技術の符号化復号化の状況を説明するための信
号配置図、第６図乃至第１５図は、本発明の先行技術を
説明するための図で、第６図は音声符号化装置を示した
ブロック線図、第７図は符号化データの信号形式を例示
した信号配置図、第８図（ａ）は最適化処理を説明する
ための波形図、同図（ｂ）は最適化差分ビットルーチン
の作用を説明するための信号配置図、第９図（ａ）、（
ｂ）は最適化差分ビットルーチンの一例を示したフロー
チャート、第１０図は最適化処理の効果の説明図、第１
１図（ａ）〜（ｄ）は最適化の様子を示した信号配置図
、第１２図は音声復号化装置の一例を示したブロック図
、第１３図（ａ）〜（ｃ）は他の音声符号化装置の例の
原理を説明するための図、第１４図は他の実施例にかか
る音声符号化装置を示したブロック図、第１５図は比較
ルーチンの一例を示したフローチャートである。１．２５・・・ローパスフィルタ、２・・・アナログ／
デジタル変換器、　　３．６．１０，１０１１，１０．
．１０−１．ＩＱａＪＯｂ。１０ｃ・・・バッファ、４・・・加減算器、５・・・最
大値制限回路、７．７ａ・・・スケール値設定部、８・
・・加算器。９・・・レジスタ、　１１，１１．．１１１，１１−□
・・・準瞬時圧縮部。１２．１２゜、１２□、１２−１・・・最適化処理部、
１３・・・マルチプレクサ、　１４，１４゜、　１４１
．１４−、　、２２・・・準瞬時伸張部。１５、１５゜、　１５１．１５−□、１５ａ、２３・・
・積分部、１６・・・ビット割当て制御部、３１，３２
，３４・・・セレクタ。３３・・・比較部、３５・・・ビット割当て制御ロジッ
ク。特許出願人　　　株式会社　リコー第　５　　図第　７７ＬＸｓｌｏ　　　　　＄＋　　　　　＄２　　　　　ミ（ｂ
）第９図第　９　図（ｂ）第１０図第　１１　図第１２図第１３図（ａ）（ｂ）第１３図（Ｃ）第１５図手続苔口正書（方式）１．事件の表示昭和６２年　特許願　第７０８５５号２、発明の名称音声符号化方式３、補正をする者事件との関係　　特許出願人オオタ　り　ナカマゴメ住所　　東京都大田区中馬込１丁目３番６号氏名（名称
）　　（６７４）株式会社リコー代表者　　浜　　１）
　　広４、代理人住　所　　　　〒２３１　横浜市中区不老町１−２−７
シヤトレーイン横浜８０７号FIG. 1 is a block diagram for explaining an example of encoding processing according to the present invention, FIG. 2 is a block diagram for explaining an example of decoding processing, and FIG. 3 is a block diagram for explaining an example of encoding processing according to the present invention. FIG. 4 is a block diagram for explaining another embodiment of the encoding process according to the present invention, and FIG. 5 is an explanatory diagram of the prior art, and FIGS.
is a signal arrangement diagram for explaining the encoding/decoding situation of the prior art, FIGS. 6 to 15 are diagrams for explaining the prior art of the present invention, and FIG. 6 is a diagram of a speech encoding device. Figure 7 is a signal arrangement diagram illustrating the signal format of encoded data, Figure 8 (a) is a waveform diagram to explain the optimization process, and Figure 8 (b) is the optimization process. Signal arrangement diagrams for explaining the action of the differential bit routine, Fig. 9(a), (
b) is a flowchart showing an example of the optimization difference bit routine; FIG. 10 is an explanatory diagram of the effect of the optimization process;
Figures 1 (a) to (d) are signal allocation diagrams showing the state of optimization, Figure 12 is a block diagram showing an example of an audio decoding device, and Figures 13 (a) to (c) are FIG. 14 is a block diagram showing a speech coding device according to another embodiment, and FIG. 15 is a flowchart showing an example of a comparison routine. 1.25...Low pass filter, 2...Analog/
Digital converter, 3.6.10, 1011, 10.
．． 10-1. IQaJOb. 10c... Buffer, 4... Adder/subtractor, 5... Maximum value limiting circuit, 7.7a... Scale value setting section, 8...
...Adder. 9... register, 11,11. ．． 111,11-□
...Quasi-instantaneous compression section. 12.12°, 12□, 12-1... optimization processing section,
13...Multiplexer, 14, 14°, 141
．． 14-, , 22... Quasi-instantaneous extension part. 15, 15°, 151.15-□, 15a, 23...
- Integration section, 16...Bit allocation control section, 31, 32
, 34...Selector. 33... Comparison unit, 35... Bit allocation control logic. Patent applicant Ricoh Co., Ltd. Figure 5 Figure 77 LX slo $+ $2 Mi(b
) Figure 9 Figure 9 (b) Figure 10 Figure 11 Figure 12 Figure 13 (a) (b) Figure 13 (C) Figure 15 Procedure Kokeguchi Orthography (Method) 1. Indication of the case 1985 Patent Application No. 70855 2, Name of the invention Audio encoding method 3, Person making the amendment Relationship to the case Patent applicant Ota Ri Nakamagome Address 1-3-6 Nakamagome, Ota-ku, Tokyo Name (Name) (674) Ricoh Co., Ltd. Representative Hama 1)
Hiro 4, Agent address: 1-2-7 Furocho, Naka-ku, Yokohama 231
Seat Train Yokohama 807

Claims

[Claims]

(1) Divide the input audio signal into blocks of a predetermined number of samples, convert the values at the sample points for each block into feature quantities, find the maximum value, and use the top digit as scale data. In the audio encoding method that identifies the digit, formats the data with a predetermined number of bits into coded data based on the digit, and transmits it, the assignment of the number of transmission bits for each block is changed based on the scale data. Characteristic voice encoding method.

(2) When allocating transmission bits for each block based on scale data, a predetermined threshold TS is set, and a large bit allocation is applied to blocks with scale data larger than this threshold TS, and a large bit allocation is applied to blocks with scale data larger than this threshold TS. 2. The audio encoding method according to claim 1, wherein a small bit is allocated to each block.

(3) When allocating transmission bits to each block based on scale data, the bit allocation to each block is performed after setting the bit allocation corresponding to each value of the scale value in advance. Claim No. (
The audio encoding method described in section 1).

(4) Using the difference value of the PCM encoded audio data as the feature quantity for each block, the error between the data obtained by decoding the encoded data shaped from the scale data and the original audio is reduced. According to claim (1), when the first value of the following block is calculated, the decoded value of the last sample of the block is used as a reference. Audio encoding method.

(5) The audio encoding method according to claim (2), characterized in that PCM encoded audio data is used as the feature amount.

(6) Form a difference value between adjacent samples of the PCM encoded audio data and the previous sample, divide the difference value into a predetermined number of blocks in time series, and calculate the difference value in each block. Identify scale data representing the most significant digit corresponding to the maximum absolute value of A predetermined number of bits, including digits, are formatted into code data to form a plurality of code data sequences, and the decoded values corresponding to individual code data constituting each code data sequence and the above-mentioned audio corresponding to the code data are generated. Correcting each of the plurality of code data series by correcting the code data so that the error with the data is minimized,
Among these corrected code data sequences, the code data sequence with the smallest error power within the block is selected as the code data sequence of the block, and when calculating the difference value of the first sample of the subsequent block, the code data sequence with the smallest error power within the block is selected. In a speech encoding method that uses the decoded value of the last sample of the selected coded data sequence as a reference, by comparing the scale value with a threshold value, the allocation of code bits for each block can be changed or the allocation of code bits for each block can be changed based on the scale value. A speech encoding method characterized by changing bit allocation of encoded data.