JPH01280800A

JPH01280800A - Coding system

Info

Publication number: JPH01280800A
Application number: JP63110860A
Authority: JP
Inventors: Hiroshi Shimura; 浩志村
Original assignee: Ricoh Co Ltd
Current assignee: Ricoh Co Ltd
Priority date: 1988-05-06
Filing date: 1988-05-06
Publication date: 1989-11-10

Abstract

PURPOSE:To perform encoding which is efficient in sound tone quality and compression by coding the maximum bit value by a completely reproducible coding method and vector-quantizing a difference value when coding a prescribed value of difference in sound signals divided into blocks and the maximum bit digit of the absolute value of the difference in each block. CONSTITUTION:Difference values of sound signals are taken at a difference calculating section 1 which inputs digital sound signals and output of the calculating section 1 is stored in a difference block buffer 2 by one block quantity. Simultaneously, the difference from the calculating section 1 is inputted to a scale value detecting section 3 where the maximum value of the absolute value of the difference and the maximum bit position is outputted as a scale value. The scale value is inputted to a vector-quantizing (VO) code book 4 and the output of the code book 4 is selected by using the scale value. Then a code book A is selected when the scale value is smaller than a prescribed value THA and another code book B is selected when the scale value is larger than the prescribed value THA but smaller than another prescribed value THB. When the scale value is larger than the prescribed value THB but smaller than a prescribed value THC, a code book C is selected.

Description

【発明の詳細な説明】技術分野本発明は、符号化方式、より詳細には、音声応答、音声
ダイアリング等のデジタル音声処理における適応ＶＱ符
号化方式に関する。TECHNICAL FIELD The present invention relates to a coding system, and more particularly to an adaptive VQ coding system in digital voice processing such as voice response, voice dialing, etc.

従来技術現在、ボイスメールなどの音声蓄積や音声信号の伝送の
ために比較的低ビットレートで高品質な音声を堤供する
音声信号符号化（圧縮）方式が各種研究されている。例
えば、電話品質音声（０，３−３，４ＫＨｚ）を３２ｋ
ｂｐｓに符号化するＣＣＩＴＴ勧告のＡ、　Ｄ　Ｐ　Ｃ
Ｍ方式、予測符号化を行うＡ　Ｐ　Ｃ−Ａ　Ｂ　（Ａｄ
ａｐｔｉｖｅ　Ｐｒｅｄｉｃｔｉｏｎ　Ｃｏｄｉｎｇｗ
ｊ、ｔｈ　Ａｄａｐｔｉｖｅ　Ｂｉｔ　Ａ１１ｏｃａｔ
ｉｏｎ）方式、マルチパルス方式、更に音声分析合成手
法によるＬＳＰ（Ｌｊ、ｎｅ　Ｓｐｅｃｔｒｕｍ　Ｐａ
１ｒ）方式なとである。しかし、これらの方式は音声品
質としては良好であるが、符号化、復号化の処理が複雑
でありかなりのハード量を有する。BACKGROUND OF THE INVENTION Currently, various audio signal encoding (compression) methods are being researched that provide high quality audio at a relatively low bit rate for audio storage and audio signal transmission such as voice mail. For example, telephone quality audio (0,3-3,4KHz) is 32k
CCITT Recommendation A, D P C encoded in bps
M method, A P C-A B (Ad
Aptive Prediction Codingw
j,th Adaptive Bit A11ocat
ion) method, multipulse method, and LSP (Lj, ne Spectrum Pa
1r) method. However, although these methods have good audio quality, the encoding and decoding processes are complex and require a considerable amount of hardware.

一方、放送衛星用の高品位なＰＣＭ音声伝送方式の一つ
に準瞬時圧伸方式がある。この方式は符号化処理が簡単
であるものの圧縮は十分に効率的なものではない。この
ため準瞬時圧伸の圧縮率を改善する一つの手法として「
差分ＰＣＭ方式と準瞬時圧伸の結合」が考えられる。一
般に、単に準瞬時圧伸を差分ＰＣＭ方式に適用しただけ
では、圧縮時の欠落ビットが伝送誤差を生じ、受信側の
積分器で誤差が累積して受信不能となる。そこで、高欄
、国中、その他はこの問題を「欠落ビットのアキュムレ
ーション」　（以下、ＤＣ−ＰＣＭ方式と呼ぶ）という
方法で解決しく欠落ピッｌ−のアキュムレーションによ
る差分圧伸ＰＣＭ　（ＤＣ−ＰＣＭ’）、信学論、　Ｖ
ｏｌ、、Ｊ６７−ＢＮαｌＯ）　、量子化ノイズを大き
く低減させる事ができたと報告している。On the other hand, one of the high-quality PCM audio transmission systems for broadcasting satellites is the quasi-instantaneous companding system. Although the encoding process is simple in this method, the compression is not sufficiently efficient. Therefore, one method to improve the compression ratio of quasi-instantaneous companding is
A combination of differential PCM method and quasi-instantaneous companding may be considered. Generally, if quasi-instantaneous companding is simply applied to the differential PCM method, missing bits during compression will cause transmission errors, and the errors will accumulate in the integrator on the receiving side, making reception impossible. Therefore, Takaran, Kuninaka, and others proposed to solve this problem with a method called "missing bit accumulation" (hereinafter referred to as DC-PCM method). , Theory of Faith, V
ol, J67-BNαlO) reported that it was possible to significantly reduce quantization noise.

本出願人は、前記高欄らの方式とは別に差分ＰＣＭ方式
と準瞬時圧伸を組み合わせる方式として１、低ビットレ
ー１へ（１サンプル当たり２ヒツト〜４ピツ）〜に圧縮
）で２、簡単な処理（ハート量小）により３、高品質な音声を再現できる２種類の音声　骨化方式の検討を行った。Apart from the method of Takanara et al., the present applicant proposed a method that combines the differential PCM method and quasi-instantaneous companding. We investigated two types of audio ossification methods that can reproduce high-quality audio through processing (small heart volume).

方式の基本的な考え方は準瞬時圧伸の際に求まる圧縮差
分データをその量子化ピッ１−内で補正するというもの
であり、方式Ｉは差分データに準瞬時圧伸を施した際求
まるブロック化した圧縮差分データを順次復号し、原信
号と比較することで圧縮ビット数内で誤差の少ない差分
データとなるようにサンプル点毎に補正する方式である
。また、方式■は方式Ｉにおいてスケール値を変化させ
量子化ステップ輻を変えることで、原信号との誤差パワ
ーが最小の圧縮差分データを選択する方式である。ここ
で、これら２種類の音声信号符号化方式の原理、シュミ
レーションの結果について説明すると、ます、準瞬時圧伸の概略、差分ＰＣＭ方式に準瞬時圧伸
を適用した時の問題点し３ついて説明すると、十２ヒッ
ｌ−の音声データを３ビツトに圧縮する場合の準瞬時圧
伸は、ＰＣＭテーデー例えば１プロツタ８サンプルごと
に分割し、このブロックの中から最大値を見つけ出しく
正でも負でも良い）、この最大値の示すビットパターン
の有効」二値桁から下位へ３ビツトだけ伝送する（最上
位ビットは符号を示している）。８個のサンプルから構
成される１フロツク内の伝送ビットの位置は同一であり
、この位置をスケール値として伝送する。The basic idea of the method is to correct the compressed difference data obtained during quasi-instantaneous companding within its quantization pitch. This is a method in which the compressed difference data is sequentially decoded and compared with the original signal to correct each sample point so that the difference data has fewer errors within the number of compressed bits. Furthermore, method (2) is a method in which the compressed difference data with the minimum error power from the original signal is selected by changing the scale value and changing the quantization step radius in method I. Here, we will explain the principles and simulation results of these two types of audio signal encoding methods. First, we will explain the outline of quasi-instantaneous companding and the problems when applying quasi-instantaneous companding to the differential PCM method. Then, quasi-instantaneous companding when compressing 12 hits of audio data to 3 bits involves dividing the PCM data into, for example, 8 samples per plotter, and finding the maximum value from this block, whether positive or negative. 3 bits are transmitted from the binary digit to the lower order (the most significant bit indicates the sign). The positions of the transmission bits within one flock consisting of eight samples are the same, and this position is transmitted as a scale value.

復号時には、切り捨てビットの位置にＯが詰められ、伝
送ヒントより上位のビットには符号ビン１〜か詰められ
てピッＩ−の伸長が行われる。この準瞬時圧伸を差分子
３　ＣＭに適用したときの様子を第３図に示す。At the time of decoding, the position of the truncated bit is padded with an O, and the bits higher than the transmission hint are padded with code bins 1 to 1, and a pip I- is expanded. FIG. 3 shows the situation when this quasi-instantaneous companding is applied to the difference molecule 3 CM.

第３図（ａ）に示した音声信号（実線）に対し、隣り合
うサンプル間での差分値を求め（第３図（ｂ’）の細線
（＃１〜＃８））、これに準瞬時圧伸を適用すると第３
図（ｂ）の太線になる。ここでは、８サンプルで１ブロ
ツクを構成する差分データ　（１２ピッ１〜信号）に準
瞬時圧伸を施し３ピントに圧縮している。このとき各サ
ンプル値は、下位ピッ１−成分が切り捨てられ■〜■と
なる。For the audio signal (solid line) shown in Figure 3(a), find the difference value between adjacent samples (thin lines (#1 to #8) in Figure 3(b')), and add it to the quasi-instantaneous When applying companding, the third
This becomes the thick line in figure (b). Here, the differential data (12 bits 1 to signal), which constitutes one block of 8 samples, is subjected to quasi-instantaneous companding and compressed to 3 points. At this time, each sample value becomes ① to ① by truncating the lower P1-component.

一方、受信側で第３図（ｂ）に示した圧縮差分データ（
■〜■）を用いて復号を行うと音声信号は第３図（ｃ）
の−点鎖線となり、復号信号には負のＤＣシフトが生じ
信号は正確に再現できない。On the other hand, on the receiving side, the compressed difference data (
When the audio signal is decoded using
A negative DC shift occurs in the decoded signal, and the signal cannot be accurately reproduced.

このため負のＤＣシフトを防ぐ一つの方法として欠落ビ
ットのアキュムレーションという方式（ＤＣ−ＰＣＭ方
式）が既に提案されている。これは準瞬時圧伸により生
じる欠落ビット成分を伝送ビットにアキュムレーション
という形式で補給することによって受信側での負のＤＣ
シフトを抑える方式である。For this reason, a method of accumulating missing bits (DC-PCM method) has already been proposed as one method for preventing negative DC shifts. This is achieved by replenishing the transmission bits with the missing bit components caused by quasi-instantaneous companding in the form of accumulation.
This method suppresses shifts.

ところでＤＣ−ＰＣＭ方式は、アキュｌル−ジョンによ
り欠落ビットを補っているものの復号波形が原波形の下
側で常に追従し、ブロック内の誤差は最大で欠落ピッ１
−の最大値となる。By the way, in the DC-PCM method, although missing bits are compensated for by acculsion, the decoded waveform always follows the lower side of the original waveform, and the error within a block is at most 1 missing bit.
− is the maximum value.

本出願人は差分ＰＣＭ方式に準瞬時圧伸を適用するとい
うことを基本とし、波形的には復号波形が原波形のまわ
りしこまとわりつくようしこ考慮して、誤差が最大でも
伝送ビットの量子化幅の１／２以下となる低ビツトレー
トかつ簡単な処理（ハード最小）により比較的高品質な
音声を提供する音声信号符号化方式を提案した。ここで
はその方式を「最適ピッ１−による差分圧伸ＰＣＭ方式
」と呼び以下その原理について説明する。The applicant basically applies quasi-instantaneous companding to the differential PCM method, and takes into account that the decoded waveform wraps around the original waveform, and even if the error is maximum, the transmission bit quantum We have proposed an audio signal encoding method that provides relatively high quality audio with a low bit rate of less than 1/2 of the encoding width and simple processing (minimum hardware). Here, this method will be referred to as the "differential companding PCM method using optimal pi1-" and its principle will be explained below.

準瞬時圧伸には次の様な特徴がある。Quasi-instantaneous companding has the following characteristics.

（１）ブロック内伝送ビットの量子化ステップはスケー
ル値により決定される。(1) The quantization step of intra-block transmission bits is determined by the scale value.

（２）各サンプル点での伝送ビットの値は小さい方の量
子化値をとる。(2) The value of the transmission bit at each sample point takes the smaller quantized value.

そこで以上の（１）、（２）を考慮し次のような補正法
を提案した。Therefore, considering (1) and (2) above, we proposed the following correction method.

第４図に方式Ｉの原理を示す。サンプル点＃Ｏを基準と
してサンプル点＃１．＃２．＃３の差分をそれぞれ１３
ビツトで形成し準瞬時圧伸により３ビツトに圧縮する場
合を考える。この場合の差分値の絶対値はサンプル点＃
］が最大になり、３ビツトの圧縮差分データはこのサン
プル＃１を基準に形成される。スケール値は＃］−のピ
ッ１へパターンの最」二値桁の桁位置となる。各々のサ
ンプル＃１，３２．＃３の値は、この量子化幅で表現可
能なデータに置換される。例えば、サンプル＃１−の圧
縮差分データは実際の値ｐＨよりも下の値Ｐ　１２（＝
（０１０））となる。ところで、この量子化幅で表現で
きるデータのうち、ＰＩ３よりも一つ大きな値Ｐ　１３
（＝（０１，ｌ））に対応したデータの方がよりサンプ
ル＃１の実際の値ｐＨに近い。そこで、このＰＩ３をサ
ンプル＃１の圧縮差分データとすれば復号化したときの
音声信号の誤差を小さくすることが出来る。この時の復
号値の誤差は、最大でもこの圧縮差分データの量子化幅
の１／２に抑えることが出来る。FIG. 4 shows the principle of method I. Sample point #1 with sample point #O as a reference. #2. 13 differences for #3 each
Consider the case where the data is formed from bits and compressed to 3 bits by quasi-instantaneous companding. The absolute value of the difference value in this case is sample point #
] becomes the maximum, and 3-bit compressed difference data is formed based on sample #1. The scale value is the digit position of the most binary digit of the pattern to the pin 1 of #]-. Each sample #1, 32. The value #3 is replaced with data that can be expressed with this quantization width. For example, the compressed difference data of sample #1- is a value P12 (=
(010)). By the way, among the data that can be expressed with this quantization width, a value P13 that is one value larger than PI3
The data corresponding to (=(01, l)) is closer to the actual pH value of sample #1. Therefore, if this PI3 is used as the compressed difference data of sample #1, the error in the audio signal when decoding can be reduced. The error in the decoded value at this time can be suppressed to at most 1/2 of the quantization width of this compressed difference data.

同様にサンプル＃２．＃３については、その復号値か符
号化前の信号の値（サンプル＃２ではＰ２１、サンプル
＃３ではＰ３１）にもっとも近くなる圧縮差分データを
選択すればよい。この場合、サンプル＃２については、
Ｐ２１よりも小さい値Ｐ２２に基づいた復号値に比へて
Ｐ２１よりも大きいＩ−’　２３に基づいた復号値の方
がよりＰ２１に近いので、サンプル＃１の復号値である
ＰＩ３とＰ２３との差分（＝（］、］、Ｏ））を圧縮差
分データに設定する。このサンプル＃３についてもＰ２
３とＰ３１の差分（＝（００１−））を圧縮差分データ
に設定する。このようにして元の音声信号に対する追従
性が向−１−シた圧縮差分データを形成することができ
る。そのための操作としては、復号値と真値との誤差が
小さくなるように圧縮差分データにそのＬ　Ｓ　Ｂを加
減算する操作を繰り返し施す。Similarly, sample #2. For #3, compressed difference data that is closest to its decoded value or the value of the signal before encoding (P21 for sample #2, P31 for sample #3) may be selected. In this case, for sample #2,
The decoded value based on I-'23, which is larger than P21, is closer to P21 than the decoded value based on P22, which is smaller than P21, so the difference between PI3, which is the decoded value of sample #1, and P23 is Set the difference (=(], ], O)) to the compressed difference data. P2 for this sample #3 as well.
The difference between 3 and P31 (=(001-)) is set as compressed difference data. In this way, it is possible to form compressed difference data that has better followability with respect to the original audio signal. As an operation for this purpose, an operation of adding and subtracting the LSB to the compressed difference data is repeatedly performed so that the error between the decoded value and the true value becomes small.

ところで差分データの変化が大きくサンプル点が上述の
３ビン１〜では表現できない時にはその３ピツ１〜で表
現できる最大値（０１１）を送って代用する。同様にマ
イナス側でこのようなことが生じた時には負の最大値（
１００）を送って代用するものとする。また、ブロック
間における欠落ヒソｌ−の発生に対しては、前ブロック
の最後のサンプル点での再生値を用いて、次のブロック
の先頭のサンプル点での差分を計算することで対処する
。By the way, when the change in the difference data is large and the sample point cannot be expressed by the above-mentioned 3 bins 1~, the maximum value (011) that can be expressed by the 3 bins 1~ is sent as a substitute. Similarly, when this happens on the negative side, the maximum negative value (
100) to be substituted. Furthermore, the occurrence of a missing histo l- between blocks is dealt with by calculating the difference at the first sample point of the next block using the reproduced value at the last sample point of the previous block.

第５図は、方式Ｉの構成を示す図で、図中、１１はＬ　
Ｐ　Ｆ、１２はＡ／Ｄ、１３は最大値制限部、１４は準
瞬時圧伸部、」５は最適化処理部、］−６はスケール値
設定部、１７はレジスタ、］８は積分部、１９は準瞬時
伸長部、２０はマルチプレクサで、まず、入力信号は８
　Ｋ　Ｈｚでサンプリングされて」−２ビツト原データ
となり１サンプル前のデータとの差分値が計算される。FIG. 5 is a diagram showing the configuration of method I, in which 11 is L
PF, 12 is A/D, 13 is maximum value limiting section, 14 is quasi-instantaneous companding section, 5 is optimization processing section, ]-6 is scale value setting section, 17 is register, ]8 is integrating section , 19 is a quasi-instantaneous expansion unit, 20 is a multiplexer, and first, the input signal is 8
The data is sampled at KHz and becomes 2-bit original data, and the difference value from the data one sample before is calculated.

差分値は例えば８サンプルで１ブロツクを構成し準瞬時
圧伸が行われる。これによって得られた圧縮差分データ
に方式Ｉを適応しサンプル点毎に補正を行う。For example, 8 samples constitute one block of difference values, and quasi-instantaneous companding is performed. Method I is applied to the compressed difference data obtained thereby, and correction is performed for each sample point.

第５図の最適化差分ビットルーチンがこの処理を行って
いる。伝送時にはマルチプレクサによって１ブロツクの
始めにスケール値が付加され、順次補正された圧縮差分
データが伝送される。復号時＝８− にはこのスケール値をもとに圧縮された差分データの伸
長と復号か行われる。The optimized difference bit routine of FIG. 5 performs this processing. During transmission, a scale value is added to the beginning of one block by a multiplexer, and the corrected compressed difference data is transmitted sequentially. At the time of decoding=8-, the compressed differential data is expanded and decoded based on this scale value.

方式Ｉで示した方式を基に各ブロック内での復号波形の
誤差パワーが最少となるようにスケール値と差分データ
を選択する方式を提案した。この方式■は、１つのブロ
ックに対し通常の準瞬時圧伸により求まるスケール値の
他に２つのスケール値を設定し、それぞれのスケール値
に対して方式Ｉを適用することでまず各サンプル点での
誤差が最小となる差分データを求め、さらにこれらの差
分データ群の中からブロック内での原音声信号との誤差
パワーが最小となる差分データを選択する方式である。Based on the method shown in Method I, we proposed a method in which scale values and differential data are selected so that the error power of the decoded waveform within each block is minimized. This method (■) first sets two scale values for one block in addition to the scale value determined by normal quasi-instantaneous companding, and then applies method I to each scale value. In this method, the difference data with the minimum error in the block is determined, and the difference data with the minimum error power with respect to the original audio signal within the block is selected from among these difference data groups.

ここで３つのスケール値とは、通常準瞬時圧伸により求
まるスケール値とこれを基準にしてスケール値−１、ス
ケール値＋１したものである。スケール値を変化させた
時の復号波形の追従の様子を第６図に示す。Here, the three scale values are the scale value normally found by quasi-instantaneous companding, and the scale value -1 and scale value +1 based on this scale value. FIG. 6 shows how the decoded waveform follows when the scale value is changed.

第７図は、方式Ｈの構成を示す図で、図中、２１はＬＰ
Ｆ、２２はＡ／Ｄ、２３は最大値制限部、２４はレジス
タ、２５はスケール設定部、２６は方式■適応ロジック
、２７は方式■適応ロジック、２８は方式Ｉ適応ロジッ
ク、２９は比較ロジック、３０はセレクタ、３１は積分
部、３２はマルチプレクサで第５図に示した方式Ｉとの
違いは、３つのスケール値を設定している点と３つの補
正された差分データの中から最良のものを選択する比較
ロジックが付加されている点である。FIG. 7 is a diagram showing the configuration of method H, in which 21 is the LP
F, 22 is A/D, 23 is maximum value limiter, 24 is register, 25 is scale setting unit, 26 is method ■ adaptive logic, 27 is method ■ adaptive logic, 28 is method I adaptive logic, 29 is comparison logic , 30 is a selector, 31 is an integrator, and 32 is a multiplexer.The difference from method I shown in FIG. The difference is that comparison logic for selecting items is added.

この方式■は、方式Ｉに比べ処理は複雑となるが、復号
信号の誤差パワーは小さくなり音声品質は良くなるもの
と考えられる。This method (2) requires more complicated processing than method I, but it is thought that the error power of the decoded signal will be smaller and the voice quality will be better.

なお比較ロジックで計算される誤差パワー（ＲＭＳｋ）
は、ブロック内の原信号と差分データを復号して得られ
る値との差の二乗和の平方根（これをＲＭＳｋとする）
で定義している。すなわち、」、ブロック内の各サンプ
ル点での原信号をｄ　ａＪ。Note that the error power (RMSk) calculated by the comparison logic
is the square root of the sum of squares of the difference between the original signal in the block and the value obtained by decoding the difference data (this is referred to as RMSk)
It is defined in That is, ``d aJ the original signal at each sample point in the block.

各スケール値に対する圧縮差分データより復号される値
をｄａｓＸＪとするとき、で計算する。When the value decoded from the compressed difference data for each scale value is dasXJ, it is calculated as follows.

ここでｋは３つのスケール値に対する添え字、ｊはサン
プル点、ｎは２ブロツク内のサンプル数を示している。Here, k is a subscript for the three scale values, j is a sample point, and n is the number of samples within two blocks.

上記最適差分ピッ１〜による差分圧縮ＰＣＭ方式は、デ
ジタル音声をブロック化し、スケール値と差分を符号化
するものであるが、上記技術によると圧縮率は２ビツト
／サンプル以下にはなりえず、またその時の音質劣化は
著しい。The differential compression PCM method using the above-mentioned optimal differential PCM blocks the digital audio and encodes the scale value and the difference, but according to the above technique, the compression rate cannot be lower than 2 bits/sample. Also, the sound quality deteriorates significantly at that time.

１−一昨本発明は、上述のごとき実情に鑑みてなされたもので、
特に、ディジタル音声信号をブロック化し、差分値とブ
ロック内の最大差分値（絶対値）の桁を符号化する符号
化方式において、音質劣化をあるレベルｔこおさえて、
低ビットレー１−（１ビツト／サンプル以下）を実現可
能とし、もって、良好な音質を保ちながら圧縮率を向上
することを目的としてなされたものである。1-The present invention was made in view of the above-mentioned circumstances,
In particular, in an encoding method that blocks a digital audio signal and encodes the difference value and the digit of the maximum difference value (absolute value) within the block, the deterioration of sound quality is suppressed to a certain level t.
This was done with the aim of making it possible to achieve a low bit rate of 1- (1 bit/sample or less), thereby improving the compression ratio while maintaining good sound quality.

構　　　成本発明は、」１記目的を達成するために、ディジタル音
声信号をブロック化し、差分の所定ビット数及びそのブ
ロック内の差分絶対値の最大ビット桁（スケール値）を
符号化する方式において、スケール値は、完全復元可能
な符号化方法で符号化し、差分はベクトル量子化（ＶＱ
）することを特徴としたものである。以下、本発明の実
施例に基いて説明する。[Structure] In order to achieve the object described in item 1, the present invention provides a method for dividing a digital audio signal into blocks and encoding a predetermined number of bits of the difference and the maximum bit digit (scale value) of the absolute value of the difference within the block. The scale value is encoded using a fully recoverable encoding method, and the difference is encoded using vector quantization (VQ
). Hereinafter, the present invention will be explained based on examples.

第１図は、本発明の一実施例を説明するためのブロック
図で、図中、１は差分計算部で、ディジタル音声信号の
差分を計算する。２は差分ブロックバッファで、差分計
算部１の出力を１ブロック分格納するバッファでスケー
ル値検出部３で検出されたビット桁より所定ビット数出
力する。３はスケール値検出部で、差分計算部１の出力
の絶対値の最大値をブロック内から検出し、その最大桁
のビット位置を出力する。４はＶＱのコードブックで、
スケール値検出部３の出力によりコードブックを選択す
る。５は類似度最大検出部でスケール値検出部３で検出
されたビット桁より下位の所定ビット数を差分ブロック
バッファ２から入力し、コードブック４の各コードとの
距離を計算し、最小距離を与えるコートのインデックス
を出力する。FIG. 1 is a block diagram for explaining one embodiment of the present invention. In the figure, 1 is a difference calculating section which calculates the difference between digital audio signals. Reference numeral 2 denotes a difference block buffer, which stores the output of the difference calculation section 1 for one block, and outputs a predetermined number of bits from the bit digit detected by the scale value detection section 3. 3 is a scale value detection unit that detects the maximum absolute value of the output of the difference calculation unit 1 from within the block, and outputs the bit position of the maximum digit. 4 is the VQ codebook,
A codebook is selected based on the output of the scale value detection section 3. 5 is a maximum similarity detection unit which inputs a predetermined number of bits lower than the bit digit detected by the scale value detection unit 3 from the difference block buffer 2, calculates the distance to each code in the codebook 4, and calculates the minimum distance. Prints the index of the given coat.

６は出力選択部で、スケール値検出部３の出力が所定の
値より大きい場合は、該スケール値検出部３と差分ブロ
ックバッファ２の出力を符号とし、それ以外はスケール
値検出部３と類似度最大検出部５の出力を符号とする。Reference numeral 6 denotes an output selection section, and when the output of the scale value detection section 3 is larger than a predetermined value, the output of the scale value detection section 3 and the difference block buffer 2 is used as a code; otherwise, it is similar to the scale value detection section 3. The output of the degree maximum detection unit 5 is assumed to be a code.

第１図において、今、ディジタル音声信号が入力され、
差分計算部において、差分がとられる。In FIG. 1, a digital audio signal is now input,
A difference calculation section calculates the difference.

その差分は、ブロック毎に差分ブロックバッファに格納
されると同時にスケール値検出部に入力され、絶対値の
最大値を検出し、その最大のビット位置をスケール値と
して出力する。そのスケール値により、差分ブロックバ
ッファは格納された差分値からスケール値のビット位置
より所定ビット数を切り出した差分値（下位がまるまっ
ている）を出力する。またスケール値は、使用するコー
ドブックを選択する。第１図では、３つのコードブック
があるが、スケール値が所定値ＴＨΔより小さい場合は
コードブックＡ、スケール値がＴＨＡ以上で所定値ＴＨ
８より小さい場合はコードブックＢ、スケール値がＴＨ
Ｂ以」−で所定値ＴＨｃより小さい場合はコードブック
Ｃを選択する。類似度最大検出部は、差分ブロックバッ
ファの出力と、コードフックの各コートとの距離計算を
行い、最小距離を与えるコートを類似度最大とし、その
インデックスを出力する。The difference is stored in the difference block buffer for each block and simultaneously input to the scale value detection section, the maximum absolute value is detected, and the maximum bit position is output as the scale value. Depending on the scale value, the difference block buffer outputs a difference value (the lower part is rounded) obtained by cutting out a predetermined number of bits from the bit position of the scale value from the stored difference value. Also, the scale value selects the codebook to be used. In FIG. 1, there are three codebooks. Codebook A is used when the scale value is smaller than a predetermined value THΔ, and codebook A is used when the scale value is greater than or equal to THA.
If smaller than 8, codebook B, scale value TH
If the value is smaller than the predetermined value THc, codebook C is selected. The maximum similarity detection unit calculates the distance between the output of the difference block buffer and each coat of the code hook, sets the coat that provides the minimum distance as the maximum similarity, and outputs its index.

出力選択部は、スケール値がＴＨｃ以」二の場合、差分
ブロックバッファの出力とスケール値を出力し、それ以
外は、類似度最大検出部のインデックスとスケール値を
出力する。The output selection section outputs the output of the difference block buffer and the scale value when the scale value is greater than or equal to THc, and otherwise outputs the index and scale value of the maximum similarity detection section.

第２図Ａ、Ｂは、本発明の動作原理を具体的に説明する
だめの図で、同図は、ブロック数が４サンプル、差分値
切出しビット数を４ビツト（符号含む）とする。今、入
力列が５．６０，７８゜７０とすると差分値が＋５５．
＋１８．−８と計算される。その絶対値の最大ビット位
置５がスケール値である。このスケール値により図のよ
うなコードブックが選択されたとすると、差分出力バッ
ファの出力との距離計算が、類似度最大検出部により行
なわれる。すなわち、類似度最大検出部５において、インデックスＯのコードとの距離＝＋畳汀１費２−０）２＋（−１−〇）′：＝３．７４
インデックス１のコードとの距離＝ｉ”４ｒσｙ＋（−１−Ｃ−３））”＝２．８３イン
デツクス２のコートとの距離＝ｆ■「：Ｃ房（））”＋（２−０）２＋（−１−３）
２＝５．３９が計算され、距離か一番小さいインデック
ス１のコートか類似度が最大とみなされ、インデックス
コを出力する。次いで出力選択部６でスケール値が出力
選択の閾値より小さいのでインデックスとスケール値を
出力する。FIGS. 2A and 2B are diagrams for specifically explaining the operating principle of the present invention. In the figures, the number of blocks is 4 samples, and the number of difference value extraction bits is 4 bits (including the sign). Now, if the input string is 5.60, 78°70, the difference value is +55.
+18. -8 is calculated. The maximum bit position 5 of the absolute value is the scale value. If a codebook as shown in the figure is selected based on this scale value, the maximum similarity detection unit calculates the distance to the output of the difference output buffer. That is, in the maximum similarity detection unit 5, the distance from the code of index O = + Tatami 1 cost 2 - 0) 2 + (-1 - 〇)': = 3.74
Distance to the code of index 1 = i"4rσy+(-1-C-3))" = 2.83 Distance to the coat of index 2 = f■ ": C tuft ()" + (2-0) 2+ (-1-3)
2=5.39 is calculated, the distance or the code with the smallest index 1 is considered to have the maximum similarity, and the index code is output. Next, since the scale value is smaller than the output selection threshold, the output selection unit 6 outputs the index and the scale value.

勃−−−一−」以」二の説明から明らかなように、本発明によると、圧
縮率が向上させることができ、特にスケール値が小さい
時、小さいコートブックを使い、スケール値が大きい時
に大きいコートブックを使うことにより、音質と圧縮の
両面で効率的である。As is clear from the explanation in Section 2, according to the present invention, the compression ratio can be improved, especially when the scale value is small, using a small coat book, and when the scale value is large. Using a large coatbook is efficient in terms of both sound quality and compression.

また変化の激しい時には、ＶＱを使わないので、音質劣
化が少ない。更に、小さいコードブックは、大きいコー
ドブックのサブセットなのでメモリ面では大きいコード
ブックのみでよい等の利点がある。Also, since VQ is not used when there are rapid changes, there is little deterioration in sound quality. Furthermore, since the small codebook is a subset of the large codebook, it has the advantage of requiring only a large codebook in terms of memory.

[Brief explanation of the drawing]

第１図は、本発明の一実施例を説明するためのブロック
図、第２図は、本発明の動作原理を具体的に説明するた
めの図、第３図乃至第７図は、従来技術を説明するため
の図である。１・・差分計算部、２−・・差分ブロックバッファ、３
・スケール値検出部、４・・・ＶＱのコー１へブック、
５・類似度最大検出部、６　出力選択部。１６一第　６　図（ａ）第６図（Ｃ）FIG. 1 is a block diagram for explaining one embodiment of the present invention, FIG. 2 is a diagram for concretely explaining the operating principle of the present invention, and FIGS. 3 to 7 are diagrams showing the prior art. FIG. 1...Difference calculation unit, 2-...Difference block buffer, 3
・Scale value detection unit, 4...Book to VQ code 1,
5. Maximum similarity detection section, 6. Output selection section. 16-Figure 6 (a) Figure 6 (C)

Claims

[Claims]

1. In a method in which a digital audio signal is divided into blocks and a predetermined number of bits of the difference and the maximum bit digit (scale value) of the absolute value of the difference within the block are encoded, the scale value is encoded using a completely recoverable encoding method. An encoding method characterized by vector quantization (VQ) for the difference.