JPH0335300A

JPH0335300A - Voice coding and decoding transmission system

Info

Publication number: JPH0335300A
Application number: JP1170374A
Authority: JP
Inventors: Hidehira Iseda; 衡平伊勢田; Kazumi Sato; 一美佐藤; Hideaki Kurihara; 秀明栗原; Fumio Amano; 文雄天野; Shigeyuki Umigami; 重之海上
Original assignee: Fujitsu Ltd
Current assignee: Fujitsu Ltd
Priority date: 1989-06-30
Filing date: 1989-06-30
Publication date: 1991-02-15

Abstract

PURPOSE:To allot an optimum information quantity to the frame under processing by respectively synthesizing and forming the local decoding signals of respective sub-frames and one frame from the coding devices of the different numbers of quantization levels extracted for every frame and transmitting the coding devices corresponding to the allotted patterns of the best characteristics. CONSTITUTION:The local decoding signals are formed from the coding devices 211 to 2nm of the different numbers of the quantization level extracted for every frame and the local decoding signals of the respective sub-frames and one frame are respectively synthesized and formed in accordance with the allotted patterns in such a manner that a specified transfer bit rate is attained. The coding devices corresponding to the allotted patterns of the best characteristics are multiplexed 24 and are transmitted. The optimum information quantity is allotted to the frame under processing by determining the allotted patterns of the information quantity in the frame from the currently processed frame in such a manner.

Description

【発明の詳細な説明】（目次）１既要産業上の利用分野従来の技術　　・・・第１４図発明が解決しようとする課題課題を解決するための手段作用第１の実施例　・・第２の実施例　・・第３の実施例　・・第４の実施例　・・発明の効果・第４．５．６図・第７図・第８．９．１０図・第１２．１３図〔概要　〕音声信号を情報圧縮し高能率に情報を伝送する方式で、
特に時間領域或いは周波数領域でビット割当パターンを
変化させる音声符号化伝送方式に関し、全ての割当パターンについて処理することなく、且つフ
レーム内の情報量割当パターンを処理されているフレー
ムから求めることにより、処理されているフレームに対
して、最適な情報量の割当ができる音声信号の高能率符
号化伝送装置を提供することを目的とし、各フレーム毎に抽出した量子化レベル数の異なる符号語
から局部復号信号を生成し、伝送ビットレートが一定に
なるような割当パターンに従って前記フレーム内の各サ
ブフレームの局部復号信号を合成し、１フレームの局部
復号信号を生成して、特性の最も良い割当パターンに対
応する符号語を多重化して送信するよう構成するもので
ある。[Detailed Description of the Invention] (Table of Contents) 1. Existing Industrial Fields of Application Prior Art...Fig. 14 Problems to be Solved by the Invention Means and Action for Solving the Problems First Embodiment... 2. Example 3. Example 4. Effect of the invention, Figure 4.5.6, Figure 7, Figure 8.9.10, Figure 12.13. Overview] A method that compresses information in audio signals and transmits information with high efficiency.
In particular, regarding audio coding transmission systems that change bit allocation patterns in the time domain or frequency domain, processing can be performed by determining the information allocation pattern within a frame from the frame being processed without processing all allocation patterns. The purpose is to provide a high-efficiency coding and transmission device for audio signals that can allocate an optimal amount of information to each frame. generate a signal, combine local decoded signals of each subframe in the frame according to an allocation pattern that makes the transmission bit rate constant, generate a local decoded signal of one frame, and select an allocation pattern with the best characteristics. The configuration is such that corresponding code words are multiplexed and transmitted.

〔産業上の利用分野　〕本発明は、音声信号を情報圧縮し高能率で情報を伝送す
る方式、特に時間領域或いは周波数領域で割当パターン
を変化させる音声符号・復号化伝送方式に関する。[Industrial Field of Application] The present invention relates to a method for compressing information in an audio signal and transmitting the information with high efficiency, and particularly to an audio encoding/decoding transmission method that changes an allocation pattern in the time domain or frequency domain.

入力音声信号の情報量圧縮を行う音声符号化伝送方式は
、適応予測と適応量子化を使用したＡＤＰＣＭと、入力
音声信号の時間的変化に応じて時間領域で適応的にビッ
ト数を割り当てる時間領域適応ビット割当を組み合わせ
情報量の圧縮を行う方式、また入力音声信号を複数の帯
域に分割し、各帯域に割当るビット数の割当パターンを
入力音声信号の時間的変化に応じてフレーム毎に適応的
にビット数を割り当てる周波数領域適応ビット割当てに
より情報圧縮を行う方式等がある。Audio coding and transmission methods that compress the information content of input audio signals are ADPCM that uses adaptive prediction and adaptive quantization, and time domain that adaptively allocates the number of bits in the time domain according to temporal changes in the input audio signal. A method that compresses the amount of information by combining adaptive bit allocation, and also divides the input audio signal into multiple bands and adapts the allocation pattern of the number of bits allocated to each band for each frame according to temporal changes in the input audio signal. There are methods for compressing information using frequency domain adaptive bit allocation, which allocates the number of bits automatically.

近年のディジタル回線の普及に伴い、その回線の有効利
用を図るため、また音声蓄積、応答システム等において
、音声情報の蓄積を行う場合には、蓄積用メモリの容量
削減が重要であるため、音声信号の情報量を圧縮し更に
高能率に符号化する方式が要求されている。With the spread of digital lines in recent years, it is important to reduce the capacity of storage memory in order to make effective use of the lines, and when storing voice information in voice storage and response systems. There is a need for a method that compresses the amount of information in a signal and encodes it with higher efficiency.

[Conventional technology]

従来のミ適応予測と適応量子化を使用したＡＤＰＣＭと
、入力信号の時間的変化に応して時間領域で適応的に情
報（ビット）を割り当てる時間領域適応ビット割当を組
み合わせ、情報量の圧縮を行う音声信号の高能率符号化
伝送方式の一例が、文献ｒ多１）原産“’１６ｋｂ／ｓ
適応ピッ［・割当予測符号化アルゴリズムの検討゛、昭
和５９年度電子通信学会通信部門全国大会、５２−４ｐ
ｐ、２−２８３−２−２８４ｊに開示されている。（第
１５図（ａ））この方法では、入力音声信号をフレーム
に分割し、更に該フレームをサブフレームに分割し、フ
レーム内の情報量が一定になるようにビ・ント割当パタ
ーンに応してサブフレーム毎の割当ビット数を変化させ
るものであり、その際のビット割当パターンを選択する
ための情報は、適応予測と適応量子化を使用しているた
め、一つ前のフレームの残差信号電力を使用するもので
あり、次にどのようなパターンの残差信号が発生するか
を予測し、その予測にもとづいて割り当てパターンを決
定するものである。It compresses the amount of information by combining conventional ADPCM using adaptive prediction and adaptive quantization with time-domain adaptive bit allocation, which adaptively allocates information (bits) in the time domain according to temporal changes in the input signal. An example of a high-efficiency coding transmission method for audio signals is described in the literature 1) "'16kb/s
Adaptive Pi [・Study of Assignment Predictive Coding Algorithm], 1985 National Conference of the Telecommunications Division of the Institute of Electronics and Communication Engineers, p. 52-4
p, 2-283-2-284j. (Figure 15(a)) In this method, the input audio signal is divided into frames, and the frames are further divided into subframes, and the input audio signal is divided into subframes according to the bit allocation pattern so that the amount of information in the frame is constant. The number of allocated bits for each subframe is changed by using adaptive prediction and adaptive quantization, so the information used to select the bit allocation pattern is based on the residual of the previous frame. It uses signal power, predicts what pattern of residual signal will be generated next, and determines the allocation pattern based on the prediction.

また、入力信号を複数の周波数帯域に分割し、各帯域へ
の割り当てビット数の割当パターンを、音声信号の時間
的変化に応してフレーム毎に適応的に変化させる符号化
伝送方式の一例が、文献ｒ林　伸二他　　バックワード
形高品質適応予測適応ビット割当符号化方弐″”　電子
通信学会論文誌８６／１０　　Ｖｏｌ、Ｊ６９−Ａ、Ｎ
ｏ、１０ｐｐ、１２３４−１．２４２Ｊに開示されてい
る。Another example is a coding transmission method that divides the input signal into multiple frequency bands and adaptively changes the allocation pattern of the number of bits allocated to each band for each frame in response to temporal changes in the audio signal. , Reference Shinji Hayashi et al. Backward type high quality adaptive prediction adaptive bit allocation coding method 2'' Journal of the Institute of Electronics and Communication Engineers 86/10 Vol. J69-A, N
o, 10pp, 1234-1.242J.

（第１５図（ｂ））この方法でも、割当パターンは前記
の時間領域での割当パターンの決定と同様に、一つ前の
フレームの残差信号電力を使用するものである。(FIG. 15(b)) In this method as well, the allocation pattern uses the residual signal power of the previous frame, similar to the determination of the allocation pattern in the time domain.

[Problem to be solved by the invention]

従って、従来の時間領域及び周波数帯域での割当パター
ンの決定方法の技術では、現在処理されているフレーム
の一つ前のフレームの残差信号から割当パターンを決定
しているため、現フレームの残差信号と次のフレームの
残差信号との性質が大きく異なっていると、前フレーム
から予測した割当パターンでは、現在のフレームのピン
トの割当には最適にならず、符号語を再生する場合に特
性の劣化を引き起こすことになる。Therefore, in the conventional time domain and frequency band allocation pattern determination methods, the allocation pattern is determined from the residual signal of the frame immediately before the currently processed frame. If the characteristics of the difference signal and the residual signal of the next frame are significantly different, the allocation pattern predicted from the previous frame will not be optimal for the focus allocation of the current frame, and will not be optimal when reproducing the codeword. This will cause deterioration of characteristics.

そこで、考えられる全ての割当パターンについて符号化
することも考えられる。この場合全てのパターンの中か
ら特性の最もよいものを選択することによって、最適な
割当パターンを選択できることになる。しかし、全ての
パターンについて符号化を行うことは装置構成の拡大、
処理量の大幅な増加等の大きな問題を生じ、全体の効率
を考慮した場合に有効とは言えない。Therefore, it may be possible to encode all possible allocation patterns. In this case, the optimal allocation pattern can be selected by selecting the one with the best characteristics from among all the patterns. However, encoding all patterns requires expansion of the device configuration.
This causes major problems such as a significant increase in the amount of processing, and cannot be said to be effective when considering overall efficiency.

本発明は、全ての割当パターンについて処理することな
く、且つフレーム内の情報量割当パターンを現在処理さ
れているフレームから求めることにより、処理されてい
るフレームに対して、最適な情報量の割当ができる音声
信号の高能率符号化伝送装置を提供することを目的とす
る。The present invention makes it possible to allocate the optimal amount of information to the frame being processed without processing all the allocation patterns and by finding the information amount allocation pattern within the frame from the frame currently being processed. The purpose of the present invention is to provide a highly efficient encoding and transmitting device for audio signals.

[Means to solve the problem]

第１図に時間領域での割当パターンを使用する音声符号
化伝送方式の原理図を示す。図中１１１はＭピッｔ−ｉ
子化レベル数の適応量子化器、■１３〜１１６はＮ−Ｍ
ビット逆量子化レベル数の適応逆量子化器、１１７はＮ
ビットレベル数の逆量子化結果に基づき動作する適応予
測器、１１２は選択・多重化部、１１８は局部復号信号
の品質評価部である。FIG. 1 shows a principle diagram of a speech coding transmission method using an allocation pattern in the time domain. 111 in the figure is M pit t-i
Adaptive quantizer for the number of childization levels, ■13 to 116 are N-M
Adaptive inverse quantizer for the number of bit inverse quantization levels, 117 is N
An adaptive predictor operates based on the result of dequantization of the number of bit levels, 112 is a selection/multiplexing unit, and 118 is a locally decoded signal quality evaluation unit.

また、第２図は周波数領域での割当パターンを使用する
音声符号化伝送方式の原理図を示すものである。図中１
は帯域分割フィルタ、２１１〜２ｎｍは量子化レベル数
の異なる符号器、２３は局部復号信号の品質評価部、２
４は選択・多重化部である。Further, FIG. 2 shows a principle diagram of a speech coding transmission system using an allocation pattern in the frequency domain. 1 in the diagram
2 is a band division filter, 211 to 2 nm is an encoder with a different number of quantization levels, 23 is a locally decoded signal quality evaluation unit, 2
4 is a selection/multiplexing section.

本発明は前記目的を達成するため、第１の手段として時
間領域での割当パターンを使用するものとして、Ｍビッ
トａ子化レベル数の適応量子化器１１１とＮ　（＜Ｍ）
ビット逆量子化レベル数の適応逆量子化器１１３とＮビ
ットレベル数の逆量子化結果に基づき動作する適応予測
器１１７からなるエンベデッド符号化器と、前記適応量
子化器１１１からの符号語を復号するＮ＋ｌ−Ｍヒツト
逆量子化しヘル数の適応逆量子化器１１４〜１１６とを
設け、前記適応逆量子化器の出力と適応予測器との出力
によりそれぞれの局部復号信号を抽出し、前記局部復号
信号をそれぞれサブフレームに分割し、伝送ビットレー
トに適応した割当パターンに従ってサブフレームごとの
局部復号信号を合成し、それぞれの合成局部復号信号と
入力音声信号とを比較しその品質を評価し、特性の最も
良い割当パターンを選択し、該選択に従って適応量子化
器からの符号語をサブフレーム毎に下位ピントを削除し
、多重化して伝送するよう構成するものである。In order to achieve the above object, the present invention uses an allocation pattern in the time domain as a first means.
An embedded encoder includes an adaptive dequantizer 113 for the number of bit dequantization levels and an adaptive predictor 117 that operates based on the dequantization result for the number of N bit levels, and a code word from the adaptive quantizer 111. Adaptive inverse quantizers 114 to 116 are provided for inversely quantizing N+l-M hits to be decoded and Hell's number, and respective locally decoded signals are extracted by the outputs of the adaptive inverse quantizers and the outputs of the adaptive predictor; Each locally decoded signal is divided into subframes, local decoded signals for each subframe are synthesized according to an allocation pattern adapted to the transmission bit rate, and the quality is evaluated by comparing each synthesized locally decoded signal with the input audio signal. , the allocation pattern with the best characteristics is selected, and the codewords from the adaptive quantizer are deleted for each subframe in accordance with the selection, and the codewords are multiplexed and transmitted.

更に、周波数領域での割当パターンを使用するものとし
て、前記各帯域毎に、前記割当パターンに存在するビッ
ト数のそれぞれ量子化レベル数の異なる複数の符号化器
２１１〜２ｎｍを設け、前記符号化器からそれぞれ局部
復号信号を抽出し、前記割当パターンの全ての組み合わ
せについての１フレーム分の局部復号信号を生成し、そ
れぞれの局部復号信号と入力音声信号とを比較しその品
質を評価し、特性の最も良い組み合わせを選択し、該選
択された割当パターンに従って該当する符号語を多重化
して伝送するよう構成するものである。Furthermore, as a device that uses an allocation pattern in the frequency domain, a plurality of encoders 211 to 2 nm are provided for each band, each having a different number of quantization levels for the number of bits existing in the allocation pattern, and extracts local decoded signals from each of the devices, generates one frame worth of local decoded signals for all combinations of the allocation patterns, compares each local decoded signal with the input audio signal to evaluate its quality, and determines the characteristics. According to the selected allocation pattern, the corresponding code words are multiplexed and transmitted.

[Effect]

本発明の前記第１の手段は時間領域における割当パター
ンの決定方式である。入力音声信号はフレーム毎に処理
される。１フレーム分の入力音声信号はＭｂｉｔの適応
量子化器１１１で量子化され、選択多重化部１１２に入
力するとともに、各Ｎ、Ｎ＋１．　　・・・・Ｍ−１，
Ｎビットの逆量子化器において必要な上位ビットをそれ
ぞれの適応逆量子化のビット数で逆量子化する。Ｍｂｉ
Ｌの適応量子化器１１１は、Ｎビットの適応逆量子化器
１１３及びＮビットの逆量子化結果に基づき動作する適
応予測器１１７によりエンベツド符号化器を構成するも
のである。エンベツド符号化器とはＳビットの量子化器
及びＴ　（＜３）ビットの逆量子化器及び適応予測器か
ら構成された符号化器であり、信号を伝送する際に符号
語の下位のＳ−Ｔビットに他の情報を載せることを可能
とした符号化方式テアリ、文献ｒＣＣＩＴＴ、Ｇ７２２
Ｊにも開示されている。該エンベツド符号化に（↑う量
子化特性例を第３図に示す。該符号化方式では例えば０
０００，０００１の４ビツトを上位３ピント０００で表
現したり、ｏｏｏｏ、ｏｏｏｔ。The first means of the present invention is a method for determining an allocation pattern in the time domain. The input audio signal is processed frame by frame. The input audio signal for one frame is quantized by an Mbit adaptive quantizer 111, inputted to a selective multiplexer 112, and input to each N, N+1 . ...M-1,
In an N-bit dequantizer, necessary upper bits are dequantized by the number of bits of each adaptive dequantization. Mbi
The L adaptive quantizer 111 constitutes an embedded encoder by an N-bit adaptive inverse quantizer 113 and an adaptive predictor 117 that operates based on the N-bit inverse quantization result. An embedded encoder is an encoder composed of an S-bit quantizer, a T (<3)-bit inverse quantizer, and an adaptive predictor. - Encoding method Tearly that makes it possible to carry other information on the T bit, document rCCITT, G722
It is also disclosed in J. Figure 3 shows an example of the quantization characteristics of the embedded encoding (↑).
The 4 bits of 000,0001 are expressed as the top 3 pinpoints of 000, oooo, ooot.

ｏｏｉｏ、ｏｏｔｉの４ビツトを上位２ビツト００で表
現するものである。The four bits of ooio and ooti are expressed by the upper two bits of 00.

また前記各適応逆量子化器の出力はＮビットの逆量子化
結果に基づき動作する適応予測器１１７の出力と加えら
れそれぞれの局部複合信号が生成され局部復号信号評価
部ｌ１８に入力される。同時に対応するｌフレーム分の
入力信号も入力される。Further, the output of each of the adaptive inverse quantizers is added to the output of the adaptive predictor 117 which operates based on the N-bit inverse quantization results to generate respective local composite signals, which are input to the local decoded signal evaluation unit l18. At the same time, input signals for corresponding l frames are also input.

局部復号信号評価部１１７では、それぞれの１フレーム
分の局部復号信号をサブフレームに分割し、複数の情報
量割当パターンに応じて、サブフレーム毎に１サンプル
当たりの情報量をＭ−Ｎｂｉｔの範囲で変化させた局部
復号信号を組み合わせて１フレーム分の局部復号信号を
合成する。この時、情報量割当パターンは、１フレーム
の情報量の総和が１フレームに許容された伝送ビット数
を越えないようなパターンとする。The local decoded signal evaluation unit 117 divides each one-frame locally decoded signal into subframes, and determines the amount of information per sample for each subframe in the range of M-N bits according to a plurality of information amount allocation patterns. The locally decoded signals changed in are combined to synthesize one frame's worth of locally decoded signals. At this time, the information amount allocation pattern is such that the total amount of information for one frame does not exceed the number of transmission bits allowed for one frame.

複数の情報量割当パターンに応して合成されたｌフレー
ム分の局部復号信号の中で、入力信号と比較して、最も
特性の良いパターンを選択し、その結果を選択多重化部
１１２に出力する。選択多重化部１１２は、符号語が階
層構造であるため、情報量割当パターンに応じて、サブ
フレーム毎にＭｂｔｔの符号語の下位ビットを削除して
パターンに刻応する符号語を生成し、１フレーム分の符
号語を多重化する。更に、局部複合信号評価部１１８で
選択された情報量割当パターンを補助情報として多重化
して伝送することになる。Among the locally decoded signals for l frames synthesized according to a plurality of information allocation patterns, the pattern with the best characteristics is selected by comparing it with the input signal, and the result is output to the selective multiplexing section 112. do. Since the codeword has a hierarchical structure, the selective multiplexing unit 112 deletes the lower bits of the Mbtt codeword for each subframe according to the information allocation pattern, and generates a codeword that corresponds to the pattern. Code words for one frame are multiplexed. Further, the information amount allocation pattern selected by the local composite signal evaluation section 118 is multiplexed and transmitted as auxiliary information.

第２の手段は周波数領域における割当パターンの決定方
式である。入力音声信号はフレーム毎に処理される。１
フレームの入力音声信号は帯域分割フィルターでｎ個の
帯域に分割される。帯域分割された信号はそれぞれ独立
なため、各４１Ｐ域での符号化による品質の劣化は独立
となり、そのため、符号化による劣化を各帯域毎に独立
に評価しても良い。各帯域の信号は、情報割当パターン
に作在する各帯域毎に割当てられる符号化ビットレート
に対応した、複数の符号化ビットレートの異なる符号化
器で並列に符号化される。複数の符号化ビットレートの
異なる符号化器はそれぞれ符号語と局部復号信号を出力
する。２３の評価部では、各帯域毎に複数の符号化ビッ
トレートの異なる符号化器の局部復号信号を評価し、そ
の評価結果を情報割当パターンを参照しながら組み合わ
せて、全ての帯域に渡って総合的に評価し、最も特性の
良い情報割当パターンを選択し、選択・多重部２４に出
力する。選択・多重部２４は、各帯域毎に情報割当パタ
ーンに対応した符号語を選択し、多重化し、更に、情報
割当パターンを多重化して伝送路に送出する。情報割当
パターンは、１フレームの総情報量が１フレームに割当
られる情報量以下になるようなパターンからなる。復号
側では、■フレーム毎に情報割当パターンに応じた復号
化器を各帯域毎に選択し、復号化処理を行った後、帯域
合成し再生信号を出力する。The second means is a method for determining an allocation pattern in the frequency domain. The input audio signal is processed frame by frame. 1
A frame input audio signal is divided into n bands by a band division filter. Since the band-divided signals are independent, the quality deterioration due to encoding in each 41P band is independent, and therefore the deterioration due to encoding may be evaluated independently for each band. The signals of each band are encoded in parallel by a plurality of encoders with different encoding bit rates corresponding to the encoding bit rates assigned to each band existing in the information allocation pattern. A plurality of encoders with different encoding bit rates each output a code word and a locally decoded signal. The evaluation unit No. 23 evaluates local decoded signals of multiple encoders with different encoding bit rates for each band, combines the evaluation results while referring to the information allocation pattern, and performs a comprehensive evaluation across all bands. The information allocation pattern with the best characteristics is selected and output to the selection/multiplexing section 24. The selection/multiplexer 24 selects and multiplexes code words corresponding to the information allocation pattern for each band, and further multiplexes the information allocation pattern and sends the multiplexed code word to the transmission path. The information allocation pattern is a pattern in which the total amount of information for one frame is less than or equal to the amount of information allocated to one frame. On the decoding side, (1) a decoder is selected for each band according to the information allocation pattern for each frame, and after performing decoding processing, band synthesis is performed and a reproduced signal is output.

本発明では、帯域毎の評価結果を情報割当パターンを参
照しながら組み合わせて、全ての帯域に渡って総合的に
評価するため、複数の符号化パターンが、ある帯域に同
一の情報量を割当た場合には、その帯域に於いて、その
情１ＩＪｆｉに対応した符号化処理は、−回となるため
、処理量の増大が防げ、又、−情報割当を行うフレーム
から割当パターンを算出するため、最適な割当が行える
。In the present invention, evaluation results for each band are combined with reference to information allocation patterns to comprehensively evaluate all bands, so multiple encoding patterns allocate the same amount of information to a certain band. In this case, in that band, the encoding process corresponding to the information 1IJfi will be performed - times, thereby preventing an increase in the amount of processing, and - Since the allocation pattern is calculated from the frame to which information is allocated, Optimal allocation can be made.

〔Example 〕

（第１の実施例）本発明における時間領域の割当パターン決定方式の実施
例１を第４図に示す。(First Embodiment) FIG. 4 shows a first embodiment of the time domain allocation pattern determination method according to the present invention.

本実施例における伝送路の伝送ビットレートは４ビツト
／サンプルとする。またサブフレーム数は２とする。こ
の場合−船釣に考えられる割当パターンは前半、後半へ
の割当ビット数が（６゜２）（５，３）（４，４）（３
，５）（２，６）の５パターンとなる。従って適応量子
化器４０１は６ビツト／サンプルとする。また適応逆量
子化器４０７〜４１１は２〜６ビツト／サンプルとする
。適応予測器４０２、適応量子化器４０１、逆量子化器
４０７〜４１１の適応則は、前記文献ｒｃｃ　ＩＴＴ、
Ｇ７２２Ｊ　で規定されテイルものを使用し、６ｂｉｔ
の下位４ｂｉｔを削除した、２ｂｉｔの符号語の逆量子
化結果を使用するものであり、適応予測器も２ビツト／
サンプルの結果に基づき動作する。また、２〜５ビツト
／サンプルの各逆量子化器４０７〜４１０はその前段に
対応する下位ビットを削除するための下位ビット削除部
４０３〜４０６を有する。The transmission bit rate of the transmission path in this embodiment is 4 bits/sample. Further, the number of subframes is assumed to be two. In this case, the possible allocation pattern for boat fishing is that the number of bits allocated to the first half and the second half is (6°2) (5, 3) (4, 4) (3
, 5) (2, 6). Therefore, the adaptive quantizer 401 has 6 bits/sample. Further, the adaptive inverse quantizers 407 to 411 are set to 2 to 6 bits/sample. The adaptive rules of the adaptive predictor 402, adaptive quantizer 401, and inverse quantizers 407 to 411 are described in the above-mentioned document RCC ITT,
Use the tail specified by G722J, 6bit
This method uses the dequantization result of a 2-bit code word with the lower 4 bits deleted, and the adaptive predictor also uses a 2-bit/2-bit code word.
Acts on sample results. Further, each of the 2 to 5 bit/sample inverse quantizers 407 to 410 has lower bit deletion units 403 to 406 for deleting lower bits corresponding to the previous stage thereof.

ｌフレームの長さは、音声信号の非定常性と、補助情報
量を考慮して設定する。The length of the l frame is set in consideration of the non-stationarity of the audio signal and the amount of auxiliary information.

局部復号信号評価部では、本方式によるビット割当パタ
ーンに応じて、サブフレーム毎に局部復号信号を切り出
し、組み合わせることで、１フレームの局部復号信号が
合成される。この各パターンに対応した局部復号信号と
入力信号との間の信号対雑音比（Ｓ／Ｎ）を計算し、Ｓ
／Ｎ最大となるパターンを選択する。In the local decoded signal evaluation section, local decoded signals are cut out for each subframe according to the bit allocation pattern according to the present method, and the locally decoded signals are combined, thereby synthesizing one frame of locally decoded signals. The signal-to-noise ratio (S/N) between the locally decoded signal and the input signal corresponding to each pattern is calculated, and the S
/N Select the maximum pattern.

選択・多重化部において前記結果に基づき、サブフレー
ム毎に下位ビットを削除、或いはそのままとし、所望の
符号語を得る。この符号語を１フレーム分多重化し、更
に情報量割当パターンを示す信号も多重化して伝送路に
送出する。Based on the result, the selection/multiplexing section deletes or leaves the lower bits unchanged for each subframe to obtain a desired code word. This code word is multiplexed for one frame, and a signal indicating the information amount allocation pattern is also multiplexed and sent to the transmission path.

以下、制御動作を実施例のフローチャトを示す第５図で
、より詳細に説明する。Hereinafter, the control operation will be explained in more detail with reference to FIG. 5, which shows a flowchart of the embodiment.

ステップｌ）入力音声信号の１フレームが入力する。（
Ｓｌ）ステップ２）前回入力の音声信号による適応予測器４０
２の出力である予測信号で誤差信号を抽出する。（３２
）ステップ３）前記誤差信号を６ビツトの適応量子化器４
０１で１サンプルを６ビツトで量子化する。（Ｓ３）ステップ４〕前記量子化された６ビツト／サンプルの信
号から各下位ビット削除部４０３で下位の１〜４ビツト
を削除し、対応する適応量子化器４０７〜４１１におい
て逆量子化し、２ビツト適応予測器４０２で得た予測信
号との和からそれぞれの局部復号化信号ａ　−ｅを抽出
され品質評価部４１２に入力する。（Ｓ４）ステップ５
）適応予測器４０２．適応量子化器４０１、逆量子化器
４０７〜４１１を２ビット／サンプルの適応逆量子化器
４０２による局部復号信号でパラメータを更新する。Step l) One frame of the input audio signal is input. (
Sl) Step 2) Adaptive predictor 40 based on the previous input audio signal
The error signal is extracted using the predicted signal that is the output of step 2. (32
) Step 3) Convert the error signal to a 6-bit adaptive quantizer 4.
01 quantizes one sample with 6 bits. (S3) Step 4] Each lower bit deletion unit 403 deletes the lower 1 to 4 bits from the quantized 6-bit/sample signal, and the corresponding adaptive quantizers 407 to 411 dequantize the signal. Each local decoded signal a to e is extracted from the sum with the prediction signal obtained by the bit adaptive predictor 402 and input to the quality evaluation section 412 . (S4) Step 5
) adaptive predictor 402. The parameters of the adaptive quantizer 401 and the inverse quantizers 407 to 411 are updated using the locally decoded signal from the 2-bit/sample adaptive inverse quantizer 402.

ステップ６）１フレーム分のサンプル敷金ての処理が終
了するまで上記動作が続けられる。（Ｓ６）ステップ７）前記各局部復号化信号ａ　”−ｅをそれぞ
れサブフレームに分割する。本実施例では２分割する。Step 6) The above operation is continued until processing of the sample deposit for one frame is completed. (S6) Step 7) Each of the locally decoded signals a''-e is divided into subframes. In this embodiment, it is divided into two.

（Ｓ７）ステップ８）サブフレーム毎の局部復号信号から情報量
割当パターンに対応するｌフレーム分の局部復号信号を
合成する。つまり信号ａの前半のサブフレームと信号ｅ
の後半のサブフレームとを合成するとビット数がサブフ
レーム毎に（２，６）となり、逆に合成すると（６，２
）が得られ、この場合のｌフレームの平均ビット数〜／
サンプル数は４ビツトの信号が得られる。(S7) Step 8) Locally decoded signals for l frames corresponding to the information amount allocation pattern are synthesized from the locally decoded signals for each subframe. In other words, the first half subframe of signal a and signal e
When combining with the second half of the subframe, the number of bits becomes (2, 6) for each subframe, and conversely, when combining
) is obtained, and the average number of bits of l frame in this case ~/
As for the number of samples, a 4-bit signal can be obtained.

同様にして信号す、ｄからビット数がサブフレーム毎に
（３，５）、（５，３）が、信号Ｃより（４，４）を得
る。（Ｓ８）ステップ９）この５パターンの局部復号信号は全て平均
ビット／サンプル数は４ビツトである。Similarly, the bit numbers (3, 5) and (5, 3) are obtained for each subframe from the signals S and d, and (4, 4) from the signal C. (S8) Step 9) The average bit/sample number of all of these five patterns of locally decoded signals is 4 bits.

こうして得られた５つの信号をそれぞれ入力音声信号ｆ
と比較し、Ｓ／Ｎをそれぞれ算出し、一番特性のよいパ
ターンを選択し、該パターンを示す信号ｇを下位ビット
削除・多重化部４１３に出力する。（Ｓ９）ステップ１０）下位ビット削除・多重化部４１３で入力
するｌサンプル毎の符号語を、前記信号ｇに従ってサブ
フサーム毎に下位の数ビットを削除する。例えば選択さ
れたパターンが（６゜２）であると前半のサブフレーム
はそのまま、後半のサブフレームは下位４ビツトを削除
するものである。（ＳＩＯ）ステップ１１）前記動作により下位ビットを制御された
符号語を多重化する。（Ｓｌｌ）ステップ１２）前記多
重化された符号語と品質評価部４１２の出力信号ｇとを
多重化し送信するものでる。（Ｓ１２）以上が符号側の制御動作である。The five signals obtained in this way are respectively input audio signals f
The S/N ratio is calculated for each pattern, the pattern with the best characteristics is selected, and a signal g indicating the selected pattern is output to the lower bit deletion/multiplexing section 413. (S9) Step 10) The lower bit deletion/multiplexing unit 413 deletes the lower several bits of the input code word for each l sample for each subftherm according to the signal g. For example, if the selected pattern is (6°2), the first half of the subframe is left as is, and the lower four bits of the second half of the subframe are deleted. (SIO) Step 11) Multiplex the codewords whose lower bits have been controlled by the above operation. (Sll) Step 12) The multiplexed code word and the output signal g of the quality evaluation unit 412 are multiplexed and transmitted. (S12) The above is the control operation on the code side.

次に前記符号側に対応した復号例の実施例を第６図に示
す。復号側では符号側に対応して２〜６ビツト逆量子化
レベル数の逆量子化器が並列に設けられている。送信さ
れてくる多重信号は分離部６０１において符号側で選択
された割当パターンを示す信号と各符号語とに分離され
る。該割当パターンを示す信号は切替え部６０２に入力
し、サブフサーム毎にどの逆量子化器に入力させるかを
制御するものであり、サブフレーム毎に切替え符号語を
任意の逆量子化器に入力させるものである。Next, FIG. 6 shows an embodiment of a decoding example corresponding to the code side. On the decoding side, inverse quantizers of 2 to 6 bit inverse quantization levels are provided in parallel corresponding to the code side. The transmitted multiplexed signal is separated by a separating section 601 into a signal indicating the allocation pattern selected on the code side and each code word. A signal indicating the allocation pattern is input to the switching unit 602 to control which inverse quantizer is input for each subframe, and a switching code word is input to any inverse quantizer for each subframe. It is something.

こうして得られた信号は予測器６０８の予測信号と合成
され再生音声信号として出力される。The signal thus obtained is combined with the predicted signal of the predictor 608 and output as a reproduced audio signal.

（第２の実施例）本発明のようにエンベデッド符号化を用いて各割当パタ
ーンでの局部復号信号を作威し、品質評価して行うビッ
ト割当は周波数領域及び時間領域の両方での適応ビット
割当を行う符号化にも適応可能であり、その実施例２を
第７図に示す。この場合のサブフレームは、各周波数帯
域毎に分割し、更に時間領域で分割されるものであり、
情報量割当パターンは、時間領域と周波数領域の２次元
のパターンとなる。例えば、高域側サブフレーム前半、
−後半、低域側サブフレーム前半、後半のようになり、
全てのサブフレームの情報量の総和が１フレームの情報
量となる。ここでの実施例では１フレームが４分割され
るものである。以下に本実施例で通常考えられる割当パ
ターンを示す。(Second Embodiment) As in the present invention, embedded coding is used to create locally decoded signals in each allocation pattern, and bit allocation is performed by quality evaluation using adaptive bits in both the frequency domain and time domain. It is also applicable to coding that performs allocation, and a second embodiment thereof is shown in FIG. In this case, the subframe is divided into each frequency band and further divided into the time domain.
The information amount allocation pattern is a two-dimensional pattern in the time domain and frequency domain. For example, the first half of the high-frequency subframe,
−The second half, the first half and the second half of the low-frequency side subframe,
The sum of the information amounts of all subframes is the information amount of one frame. In this embodiment, one frame is divided into four. The allocation patterns that are normally considered in this embodiment are shown below.

時間　　時間　　時間　　時間　　時間低域（５４”　
５４）　ゝ４５”　４５ノ　（４４）但し本実施例では
高域より低域にビット数を多く割り当てるパターンを採
用している。time time time time time time low range (54”
54) ゝ45'' 45ノ (44) However, in this embodiment, a pattern is adopted in which a larger number of bits is allocated to the low range than the high range.

入力音声信号は帯域分割フィルタ７０１を介し高域と低
域とに分割し、それぞれ対応の時間領域の低域の符号化
部７０２は２〜６ビント逆量子化レベル数の量子化器を
有し、高域の符号化部７０３の内部は示していないが、
低域の符号化部７０２と略同様の構成であり、異なるの
は２〜４レベル数の逆量子化レベル数の逆量子化器を有
するところである。ここでの量子化及び逆量子化は前記
実施例で述べたものと同様である。品質評価部７０４で
は、各逆量子化器からの局部復号信号を時間領域でのサ
ブフレーム毎に上記の割当パターンに従って各帯域毎の
局部復号信号を合成する。更に帯域合成し、１フレーム
分の４分割信号から合成した局部復号信号を生成し、入
力音声信号と比較し、Ｓ／Ｎを算出し特性の最もよいパ
ターンを選択し、該選択信号により前記実施例同様下位
ビット削除・多重化部で処理されるものである。An input audio signal is divided into a high band and a low band through a band division filter 701, and each corresponding time domain low band encoding unit 702 has a quantizer with a number of 2 to 6 bit dequantization levels. , although the inside of the high-frequency encoding section 703 is not shown,
It has substantially the same configuration as the low-frequency encoding section 702, except that it has dequantizers with dequantization levels of 2 to 4 levels. The quantization and inverse quantization here are the same as those described in the previous embodiment. The quality evaluation section 704 combines the locally decoded signals from each inverse quantizer for each subframe in the time domain into a locally decoded signal for each band according to the above allocation pattern. Furthermore, band synthesis is performed to generate a locally decoded signal synthesized from the four-divided signals of one frame, which is compared with the input audio signal, calculates the S/N, selects the pattern with the best characteristics, and uses the selected signal to perform the above-mentioned implementation. As in the example, this is processed by the lower bit deletion/multiplexing unit.

復号化部では実施例１同様に同時に送信されてくる割当
パターンを示す信号に応じて、逆量子化し帯域修正して
再生信号を得るものである。As in the first embodiment, the decoding section dequantizes and corrects the band according to the signals indicating the allocation pattern that are simultaneously transmitted to obtain a reproduced signal.

このように時間領域と周波数領域で割当を行うと、割当
パターンの数を増すことが可能となり、より細かな情報
量割当が可能となる。When allocation is performed in the time domain and frequency domain in this way, it becomes possible to increase the number of allocation patterns, and more detailed information amount allocation becomes possible.

また、従来の単純な符号化に比べより特性の優れた符号
化を、更に従来のような全ての割当パターンについて符
号化をするのではなく、必要な逆量子化器を数個用いる
ことで簡単に行なえる優れた方式である。In addition, it is possible to easily perform encoding with better characteristics than conventional simple encoding, and by using several necessary inverse quantizers instead of encoding all allocation patterns as in the conventional method. This is an excellent method that can be used in many situations.

（第３の実施例）次に周波数領域での適応ビット割当を行う方式の実施例
３を第８図に示す。８０１は第１の帯域分割フィルタ、
８０２，８０３は第１の帯域分割フィルタの出力を更に
分割する第２．第３の帯域分割フィルタである。本実施
例３では帯域を４分割するものである。その場合考えら
れるピント割当パターンの一例を第９図に示す。本実施
例では該パターンに従って制御されるものである。ここ
での伝送路の伝送ビットレートは３ビット／サンプル程
度である。(Third Embodiment) Next, a third embodiment of a system for adaptive bit allocation in the frequency domain is shown in FIG. 801 is a first band division filter;
802 and 803 are second band division filters that further divide the output of the first band division filter. This is a third band division filter. In the third embodiment, the band is divided into four. FIG. 9 shows an example of a focus assignment pattern that can be considered in that case. In this embodiment, control is performed according to the pattern. The transmission bit rate of the transmission path here is approximately 3 bits/sample.

帯域分割フィルタ８０１〜８０３には、ＱＭＦ（Ｑｕａ
ｄｒａｔｕｒｅ　ｍ１ｒｒｏｒ　ｆｉｌｔｅｒ）を使用
する。符号器にはＡＤＰＣＭを使用し、割当パターンは
音声信号の性質を考慮し、周波数の低い帯域に多めの情
報量を割当てるパターンとする。（前記第９図）　第９
図から各帯域毎に割当パターンに存在する量子化ビット
数を持つＡＤＰＣＭ符号器を並列に並べる。従って各帯
域のＡＤＰＣＭの量子化ビット数は、低い帯域から、６
．５．４ビット符号器８０４〜８０６．５＋　　４＋　
　３ビット符号器８０７〜８０９．３．２ビット符号器
８１０，８１１．３，２ビット符号器８１２，８１３と
なる。The band division filters 801 to 803 include QMF (Qua
drature m1rror filter). ADPCM is used for the encoder, and the allocation pattern is a pattern that takes into account the characteristics of the audio signal and allocates a large amount of information to a low frequency band. (Figure 9 above) 9th
As shown in the figure, ADPCM encoders having the number of quantization bits existing in the allocation pattern for each band are arranged in parallel. Therefore, the number of quantization bits of ADPCM for each band is 6
．． 5.4 bit encoder 804~806.5+ 4+
These are 3-bit encoders 807 to 809.3, 2-bit encoders 810, 811.3, and 2-bit encoders 812, 813.

以下本実施例３のフローチャートを示す第１０図に従っ
て本実施例を詳細に説明する。The present embodiment will be described in detail below with reference to FIG. 10 showing a flowchart of the third embodiment.

ステップ１）１フレームの音声信号が入力する。Step 1) One frame of audio signal is input.

（Ｔ　１　）ステップ２）帯域分割フィルタ８０１〜８０３を介し４
帯域に分割する。（Ｔ２）ステップ３）全てのＡＤＰＣＭ符号化器８０４〜８１３
でそれぞれ−フレーム分の符号化処理を行う。　（Ｔ３
）ステップ４）各符号化器８０４〜８１３は内部に復号化
機能を有するものであり、該復号化機能を介して、１フ
レーム分の局部復号信号を品質評価部８１４にそれぞれ
出力する。（Ｔ４）ステップ５）各帯域毎に、各ＡＤＰ
ＣＭの局部復号信号と対応する各帯域の入力音声信号を
比較し、それぞれの誤差信号を抽出し、その２乗和（ｌ
フレーム分）を算出する。（Ｔ５）ステップ６）前記各
ＡＤＰＣＭ対応の２乗和に聴感上の重み付けを行う。（
Ｔ６）ステップ７）前記第９図の割当パターンに従って各局部
復号信号を合成した場合の、各帯域毎の前記Ｔ６で得た
２乗和に聴感上の重み付けしたものの総和を各割当パタ
ーン毎に算出する。(T 1 ) Step 2) 4 through band division filters 801 to 803
Divide into bands. (T2) Step 3) All ADPCM encoders 804 to 813
In each case, encoding processing for -frames is performed. (T3
) Step 4) Each of the encoders 804 to 813 has an internal decoding function, and each outputs one frame's worth of locally decoded signals to the quality evaluation section 814 via the decoding function. (T4) Step 5) For each band, each ADP
The locally decoded CM signal and the corresponding input audio signal of each band are compared, each error signal is extracted, and the sum of squares (l
frame). (T5) Step 6) Perceptually weighting is performed on the sum of squares corresponding to each ADPCM. (
T6) Step 7) When each locally decoded signal is synthesized according to the allocation pattern shown in FIG. 9, calculate the sum of the sum of squares obtained in T6 above for each band with auditory weighting for each allocation pattern. do.

（Ｔ７）ステップ８）Ｔ７で求めた総和が最も小さいものを特性
の最も良い割当パターンとして選択し、該割当パターン
を示す信号を選択・多重化部８１５へ出力する。（Ｔ８
）ステップ９）選択・多重化部８１５へ入力する各ＡＤＰ
ＣＭからの符号語のうち、前記品質評価部８１４からの
信号に従って各帯域のＡ　Ｄ　Ｐ　ＣＭからの符号語を
選択し、前記該割当パターンを示す信号とともに多重化
する。（Ｔ９）ステップ１０）伝送路にｌフレーム分の
符号語データを出力する。（ＴＩＯ）ステップ１１）各帯域毎に選択されたＡＤＴ’ＣＭの内
部パラメータを各帯域内の他のＡＤＰＣＭに複写する。(T7) Step 8) The one with the smallest sum determined in T7 is selected as the allocation pattern with the best characteristics, and a signal indicating the allocation pattern is output to the selection/multiplexing section 815. (T8
) Step 9) Each ADP input to the selection/multiplexing section 815
Among the codewords from the CMs, the codewords from the ADP CM of each band are selected according to the signal from the quality evaluation section 814 and multiplexed with the signal indicating the allocation pattern. (T9) Step 10) Output code word data for l frames to the transmission path. (TIO) Step 11) Copy the internal parameters of the ADT'CM selected for each band to other ADPCMs within each band.

これはフレーム毎に各帯域に割り当てられる情報量が異
なるため、復号器との同期を取るため、各帯域毎にフレ
ームの先頭で、一つ前で選択された量子化ビット数の符
号器の内部パラメーターを該当する帯域の並列に存在す
るＡＤＰＣＭに複写するものである。（Ｔｌｌ）以上が符号側の制御である。This is because the amount of information allocated to each band differs for each frame, so in order to synchronize with the decoder, at the beginning of the frame for each band, the coder internally uses the previously selected quantization bit number. The parameters are copied to the ADPCMs existing in parallel in the corresponding band. (Tll) The above is the control on the code side.

次に前記符号側に対応した復号側の構成を第１１図に示
す。１１０１は分離部、１１０２〜１１０５は各帯域毎
の切替え部、１１０６〜１１１５はそれぞれ逆量子化レ
ベル数の異なるＡＤ　Ｐ　ＣＭ復号器、１１１６は帯域
合成フィルタである。Next, FIG. 11 shows the configuration of the decoding side corresponding to the code side. 1101 is a separation unit, 1102 to 1105 are switching units for each band, 1106 to 1115 are ADP CM decoders each having a different number of dequantization levels, and 1116 is a band synthesis filter.

分離部１１０１では、符号側で選択された割当パターン
を示す信号と各符号語が分離される。各符号語はそれぞ
れ割当ての帯域用の切替え部１１０２〜１１０５に入力
する。各切替え部で分離部１１０１で分離された符号側
で選択された割当パターンを示す信号に従って入力する
符号語を帯域毎に対応したＡＤＰＣＭ復号器に切り替え
て入力する。その出力は帯域毎に帯域合成フィルタ１１
１６に入力し合成され再生音声信号として出力される。Separation section 1101 separates the signal indicating the allocation pattern selected on the code side and each code word. Each code word is input to switching units 1102 to 1105 for assigned bands. In each switching unit, the input code word is switched and input to the ADPCM decoder corresponding to each band according to the signal indicating the allocation pattern selected on the code side separated by the separation unit 1101. The output is filtered by a band synthesis filter 11 for each band.
16, which is synthesized and output as a reproduced audio signal.

従来のように、全ての割当パターンに付いて符号化・復
号化を行い最適なものを選択する場合と本実施例の場合
では、割当パターンは、８パターンあるため固定割当の
場合に比べてＡＤＰＣＭの処理は８倍になるが、本発明
によれば、３倍になるものが２帯域と２倍になるものが
２帯域あるため、平均２，５倍の処理となる。しかし、
全てのパターンについて符号化する方法に比べれば極め
て簡単な処理で品質の良い信号の伝送が可１１ヒとなる
ものである。In the conventional case where all allocation patterns are encoded and decoded and the optimal one is selected, in the case of this embodiment, there are 8 allocation patterns, so the ADPCM However, according to the present invention, since there are two bands in which the data is tripled and two bands in which the data is doubled, the processing is 2.5 times as much on average. but,
Compared to a method in which all patterns are encoded, it is possible to transmit high-quality signals with extremely simple processing.

（第４の実施例）本発明では第１の実施例でのエンヘデント符号器及びい
くつかの逆量子化器を用いた構成の符号化器を各帯域毎
にビット数対応で設けることで多数のＡＤＰＣＭを使用
することなく、より簡単な処理で実現できる。その実施
例４を第１２図に示す。図中第８図と同一部材には同一
符号を付す。(Fourth Embodiment) In the present invention, a large number of This can be achieved with simpler processing without using ADPCM. Example 4 is shown in FIG. In the figure, the same members as in FIG. 8 are given the same reference numerals.

またＡＤＰＣＭ４〜６ビノト符号化器８０４〜８０６は
エンベデッド符号化器を有するＡＤＰＣＭ符号器９０が
、同様に８０７〜８０９は９１が、８１０．８１１は９
２が、８１２，８１３は９３がその機能を果たすもの、
である。エンベデッド符号化器を有するＡＤＰＣＭ符号
化！Ｓ９０の内部構成を第ｔ３図に示す。他のエンベデ
ッド符号化器を有するＡＤＰＣＭ符号化器９１〜９３は
量子化。Further, the ADPCM 4 to 6 binot encoders 804 to 806 have an embedded encoder, the ADPCM encoder 90 has an embedded encoder, 807 to 809 have 91, and 810.811 has 9.
2, 812, 813 is 93 that performs the function,
It is. ADPCM encoding with embedded encoder! The internal configuration of S90 is shown in FIG. t3. ADPCM encoders 91 to 93 having other embedded encoders perform quantization.

逆量子化のビット数等が異なるだけで同様の構成である
。第１３図は第４図の実施例１と略同様の構成となって
いる。異なるのは各下位ビット削除部の出力が適応逆量
子化部と選択・多重化部へ出力されることである。該構
成により実施例３同様に各帯域における割当パターンに
従ったビット数での局部復号信号が得られる。品質評価
部での処理は実施例３と全く同様のものである。The configurations are the same except for the number of bits for inverse quantization. FIG. 13 has substantially the same configuration as the first embodiment shown in FIG. The difference is that the output of each lower bit deletion section is output to an adaptive inverse quantization section and a selection/multiplexing section. With this configuration, similarly to the third embodiment, a locally decoded signal with the number of bits according to the allocation pattern in each band can be obtained. The processing in the quality evaluation section is exactly the same as in the third embodiment.

エンベデッドＡＤＰＣＭを使用すると、ＡＤＰＣＭにお
ける適応予測、適応量子化の適応更新期を共通にするこ
とが可能となるため、ＡＤＰＣＭを並列に複数個動作さ
せた場合でも、複数の逆量子化の処理が増加するだけと
なる。この場合、フレーム毎に各帯域に割当られる情報
量が異なっていても、適応予測、適応量子化の処理は共
通なため、フレームの先頭で内部パラメーターを複写す
る必要はなく、実施例３より更に処理が簡単となるもの
である。Using embedded ADPCM makes it possible to share the same adaptive update period for adaptive prediction and adaptive quantization in ADPCM, so even when multiple ADPCMs are operated in parallel, the processing of multiple dequantizations increases. Just do it. In this case, even if the amount of information allocated to each band differs from frame to frame, the processes of adaptive prediction and adaptive quantization are common, so there is no need to copy the internal parameters at the beginning of the frame, and this is further improved than in the third embodiment. This makes processing easier.

以上時間領域５周波数領域及びその両方の実施例を説明
してきたが、各品質評価部の評価の仕方としては色々考
えられる。例えばケプヌトラム距離、誤差信号の絶対値
のピーク値、誤差信号の絶対値の総和、誤差信号の二乗
和、これらの組み合わせ等が考えられるものであり本発
明は実施例に限られるものではない。Although embodiments in the time domain, frequency domain, and both have been described above, various methods of evaluation by each quality evaluation section can be considered. For example, the cepnutrum distance, the peak value of the absolute value of the error signal, the sum of the absolute values of the error signal, the sum of squares of the error signal, a combination thereof, etc. can be considered, and the present invention is not limited to the embodiments.

（発明の効果　〕本発明によれば、適応予測と適応量子化を使用したＡＤ
ＰＣＭに、特性の良い時間領域適応ビット割当処理を組
み合わせることが可能となり、音声信号の時間的変動を
効率良く利用した、高能率音声符号化伝送装置が実現で
きる。また、音声信号を複数の帯域に分割し、フレーム
毎に音声信号の性質の時間的変動に応して各帯域毎に割
り当てる情報量を変化させる音声符号化方式に於いて、
少ない処理量の増大で最適なものを選択できるため、音
声信号の周波数領域における非定常性を効率良く利用す
ることが可能となり、効率の良い音声信号の高能率符号
化伝送装置が構成できる。(Effects of the Invention) According to the present invention, AD using adaptive prediction and adaptive quantization
It becomes possible to combine PCM with time-domain adaptive bit allocation processing with good characteristics, and a highly efficient speech coding and transmission device that efficiently utilizes temporal fluctuations in speech signals can be realized. In addition, in an audio encoding method that divides an audio signal into multiple bands and changes the amount of information allocated to each band according to temporal fluctuations in the characteristics of the audio signal for each frame,
Since the optimum one can be selected with a small increase in the amount of processing, it becomes possible to efficiently utilize the non-stationarity of the audio signal in the frequency domain, and a highly efficient encoding and transmission device for the audio signal can be configured.

従って、本発明によって、より高品質に、全てのパター
ンについて処理するのに比べ極めて簡単な、手数の少な
い処理での音声符号・復号化伝送方式を提供することが
できる。Therefore, according to the present invention, it is possible to provide a speech encoding/decoding transmission system that has higher quality and requires much simpler processing than processing all patterns.

[Brief explanation of drawings]

第１図は時間領域における割当パターンを使用した音声
符号化伝送方式の原理図、第２図は周波数領域における割当パターンを使用した音
声符号化伝送方式の原理図、第３図はエンベデッド符号化に伴う量子化特性例、第４図は時間領域における割当パターンを使用した音声
符号化伝送方式の一実施例、第５図は第４図の音声符号化伝送方式のフローチャート
示す図、第６図は第４図の符号化に対応する復号化を行う音声符
号化方式の一実施例、第７図は周波数領域及び時間領域での割当パターンを使
用する音声符号化伝送方式の一実施例、第８図は周波数
領域における割当パターンを使用した音声符号化伝送方
式の一実施例、第９図は第８図の音声符号化伝送方式で
使用する割当パターンを示す図、第１０図は第８図の音声符号化伝送方式のフローチャー
ト示す図、第１１図は第８図の符号化に対応する復号化を行う音声
符号化方式の一実施例、第１２図は第８図の音声符号化伝送方式に符号化器とし
て、エンベデッド符号器を用いた場合の一実施例、第１３図は第１２図中のエンベデッド符号化を使用した
ＡＤＰＣＭ符号化器符号化器内部構図、第１４図は従来
の構成を示す図である。図中１１１１２．２４１１３〜１１６１７１１８．２３適応量子化器選択・多重化部適応逆量子化器適応予測器品質評価部 ■ ・帯域分割フィルタ１〜２ｎｍ・・符号器本発明における需透口熟詔笥号語（量子化コード）２ｂｉｔ　　　ＢｂＩｔ　　　４ｂ量子化コード第図第４藺にｂげるフローチ、−ト第図割当パタンを刀スす団Figure 1 is a diagram of the principle of a speech coding transmission method using an allocation pattern in the time domain, Figure 2 is a diagram of the principle of a speech coding transmission method using an allocation pattern in the frequency domain, and Figure 3 is a diagram of the principle of an audio coding transmission method using an allocation pattern in the frequency domain. 4 shows an example of the audio coding transmission method using an allocation pattern in the time domain, FIG. 5 shows a flowchart of the audio coding transmission method shown in FIG. 4, and FIG. FIG. 4 is an example of an audio encoding method that performs decoding corresponding to the encoding; FIG. 7 is an example of an audio encoding transmission method that uses allocation patterns in the frequency domain and time domain; The figure shows an example of an audio coding transmission method using an allocation pattern in the frequency domain, FIG. 9 shows an allocation pattern used in the audio coding transmission method of FIG. 8, and FIG. Figure 11 is an example of a voice encoding method that performs decoding corresponding to the encoding in Figure 8. Figure 12 is an example of a voice encoding transmission method that performs decoding corresponding to the encoding in Figure 8. An example in which an embedded encoder is used as an encoder, FIG. 13 shows the internal configuration of the ADPCM encoder using embedded encoding in FIG. 12, and FIG. 14 shows the conventional configuration. FIG. In the figure 11 112.24 113 to 116 17 118.23 Adaptive quantizer selection/multiplexing unit Adaptive inverse quantizer Adaptive predictor Quality evaluation unit Quantization Code Word (Quantization Code) 2bit BbIt 4b Quantization Code Diagram 4th Flowchart, -G Diagram Assignment Pattern.

Claims

[Claims]

(1) A method that compresses the amount of information of an input audio signal, in which one frame is divided into several subframes, and each subframe is given a quantum amount per sample to achieve a predetermined transmission bit rate. In an audio coding transmission method that allocates and encodes the number of quantization bits, a locally decoded signal is generated from codewords with different numbers of quantization levels extracted for each frame, and the above-mentioned decoded signal is generated according to an allocation pattern that achieves a predetermined transmission bit rate. Speech encoding characterized in that locally decoded signals of each subframe within a frame are combined to generate a locally decoded signal of one frame, and code words corresponding to an allocation pattern with the best characteristics are multiplexed and transmitted. method.

(2) A method for compressing the amount of information of an input audio signal, especially ADPCM (AdpCM) that uses adaptive prediction and adaptive quantization.
veDifferentialPulseCodeMo
In an audio encoding method that combines time-domain adaptive bit allocation that adaptively allocates bits in the time domain according to temporal changes in the input audio signal, adaptive quantization with M-bit quantization levels is used. Converter (111) and N
Adaptive inverse quantizer (<M) number of bit inverse quantization levels (1
13) and an adaptive predictor (117) that operates based on the result of inverse quantization of the number of N bit levels;
Adaptive inverse quantizer with +1 to M bit inverse quantization levels (1
14 to 116) are provided, each locally decoded signal is extracted by the output of the adaptive inverse quantizer and the output of the adaptive predictor, each of the locally decoded signals is divided into subframes, and the locally decoded signal is adapted to the transmission bit rate. synthesize local decoded signals for each subframe according to the selected allocation pattern, compare each local decoded signal with the input audio signal to evaluate its quality, select the allocation pattern with the best characteristics, and perform adaptive quantization according to the selection. A voice coding transmission method that deletes the lower bits of the code word from the device for each subftherm, multiplexes it, and transmits it.

(3) The audio coding transmission system according to claim (2), wherein when multiplexing code words, a signal indicating a selected allocation pattern is also multiplexed and transmitted.

(4) A method that compresses the amount of information of an input audio signal, in particular ADPCM encoding using adaptive prediction and adaptive quantization, and adaptively allocating the number of bits in the time domain according to temporal changes in the input audio signal. In an audio decoding method that uses inverse quantization to decode a signal encoded by audio encoding that combines time-domain adaptive bit allocation, codewords are generated according to a signal indicating an allocation pattern transmitted from the encoder side. A switching unit (602) is provided to switch the input inverse quantizer for each subframe, and each subframe is input to the inverse quantizer with the number of inverse quantization levels corresponding to the number of levels at the time of quantization and reproduced. An audio decoding method characterized by obtaining audio signals.

(5) A voice encoding/decoding transmission device comprising an encoding device based on the voice encoding transmission method according to claim (3) and a decoding device based on the voice decoding method according to claim (4).

(6) A method for compressing the amount of information of an input audio signal, in particular, dividing the input audio signal into multiple frequency bands for each frame,
In an adaptive bit allocation coding method that changes the allocation pattern of the number of bits to be allocated according to temporal changes in each band, the number of bits existing in the allocation pattern has a different number of quantization levels for each band. Multiple encoders (2
11 to 2 nm), extracts each local decoded signal from the encoder, generates one frame worth of local decoded signals for all combinations of the allocation patterns, and combines each local decoded signal with the input audio signal. 1. A speech coding and transmission system characterized in that the quality of the signals is compared, the quality thereof is evaluated, the best combination of characteristics is selected, and the corresponding codewords are multiplexed and transmitted according to the selected allocation pattern.

(7) The audio coding transmission system according to claim (6), wherein when multiplexing code words, a signal indicating a selected allocation pattern is also multiplexed and transmitted.

(8) A method for compressing the amount of information of an input audio signal, in particular, dividing the input audio signal into multiple frequency bands for each frame,
In an audio decoding method that decodes signals that are encoded and transmitted using an adaptive bit allocation encoding method that changes the allocation pattern of the number of bits allocated according to temporal changes in each band. A code word and a signal indicating a bit allocation pattern are separated, and decoded by a dequantizer having a corresponding number of dequantization levels for each frequency domain of the code word according to the signal indicating the bit allocation pattern, and decoding is performed for each frequency domain. An audio decoding method characterized in that a reproduced audio signal is obtained by synthesizing encoded signals.

(9) A voice encoding/decoding transmission device comprising an encoding device based on the voice encoding transmission method according to claim (7) and a decoding device based on the voice decoding method according to claim (8).

(10) Multiple encoders corresponding to frequency bands are based on an adaptive quantizer with M-bit quantization levels, an inverse quantizer with N-bit inverse quantization levels, and an inverse quantization result with N-bit levels. Claim (7), characterized in that the embedded encoder is composed of an adaptive predictor that operates according to Audio coding transmission method.

(11) A method that compresses the amount of information in an input audio signal. It divides the input audio signal into multiple frequency bands, changes the allocation pattern of the number of bits allocated according to the temporal changes in each band, and also uses adaptive prediction. In an audio coding transmission system that combines ADPCM encoding using adaptive quantization and time domain adaptive bit allocation that adaptively allocates the number of bits in the time domain according to temporal changes in the input audio signal, each frequency For each band, an adaptive quantizer with M bit quantization levels, an adaptive dequantizer with N (<M) bit dequantization levels, and an adaptive predictor that operates based on the dequantization results with N bit levels. and an adaptive inverse quantizer with a number of N+1 to M bit inverse quantization levels for decoding the code word from the adaptive quantizer, the output of the adaptive inverse quantizer and the adaptive prediction are provided. extracts each locally decoded signal from the output from the transmitter, divides each locally decoded signal into subframes, synthesizes the locally decoded signal for each band according to an allocation pattern adapted to the transmission bit rate, and further divides the locally decoded signal into subframes according to the allocation pattern adapted to the transmission bit rate. synthesize local decoded signals for one frame according to A voice coding transmission method that deletes the lower bits of the code word for each subftherm, multiplexes it, and transmits it.