JPH0754439B2

JPH0754439B2 - Method of synchronizing variable length frame type code decoding device

Info

Publication number: JPH0754439B2
Application number: JP61122690A
Authority: JP
Inventors: 哲田口
Original assignee: NEC Corp
Current assignee: NEC Corp
Priority date: 1986-05-27
Filing date: 1986-05-27
Publication date: 1995-06-07
Anticipated expiration: 2010-06-07
Also published as: JPS62278600A

Description

【発明の詳細な説明】〔産業上の利用分野〕本発明は可変長フレーム型符号復号化装置の同期方法に
関し、特に同期処理における通話遅延の大幅な改善を図
った可変長フレーム型符号復号化装置の同期方法に関す
る。The present invention relates to a synchronization method for a variable-length frame type code decoding device, and in particular to a variable-length frame type code decoding for greatly improving the call delay in the synchronization processing. The present invention relates to a device synchronization method.

[Conventional technology]

入力音声を基本分析フレーム（以下、単に分析フレーム
と言う）で分析し、音声パラメータとしてのスペクトル
包絡パラメータと音源情報とを抽出、これを符号化して
伝送路を介して復号化側に送出し原入力音声の合成を行
なう符号復号化装置はよく知られている。The input voice is analyzed by a basic analysis frame (hereinafter, simply referred to as an analysis frame), a spectrum envelope parameter as a voice parameter and sound source information are extracted, coded and sent to a decoding side via a transmission line. A coding / decoding device that synthesizes input speech is well known.

この場合、入力音声スペクトルの巨視的特徴を示すスペ
クトル包絡パラメータはLPC（Linear Prediction Codin
g,線形予測分析）係数もしくはその誘導形式のものが利
用され、これにはαパラメータ,Kパラメータ、あるいは
LSP（Line Spectrum Pairs,線スペクトル対）等の諸係
数が利用される。また、入力音声スペクトルの微視的特
徴を示す音源情報としては、ピッチ周波数に対応するパ
ルス周波数と白色雑音とを有声／無音情報に対応してモ
デル化して符号化，復号化するために必要なピッチ周波
数，有声／無音，音源振幅等が利用され、このほか音源
の波形情報を利用するものとして、マルチパルス，残査
波形等が符号，復号化される場合もある。In this case, the spectral envelope parameter indicating the macroscopic features of the input speech spectrum is LPC (Linear Prediction Codin
g, linear predictive analysis) coefficient or its derivation form is used, which is α parameter, K parameter, or
Various coefficients such as LSP (Line Spectrum Pairs) are used. The sound source information indicating the microscopic characteristics of the input speech spectrum is necessary for modeling the pulse frequency corresponding to the pitch frequency and the white noise corresponding to the voiced / unvoiced information and encoding / decoding. Pitch frequency, voiced / unvoiced sound, sound source amplitude, etc. are used. In addition, multi-pulses, residual waveforms, etc. may be encoded / decoded by using the sound source waveform information.

可変長フレーム型符号復号化装置は、上述した符号復号
化装置において、符号化側から復号化側に送出すべき音
声パラメータのうちスペクトル包絡パラメータを分析フ
レーム単位で忠実に伝送する代りに複数の分析フレーム
を代表する代表分析フレームを選定したうえ、この代表
分析フレームのスペクトル包絡パラメータとこの代表分
析フレームと同一のスペクトル包絡パラメータをもつも
のとして指定される分析フレーム数いわゆるリピートビ
ットとを伝送することによって伝送ビットレートの大幅
な低減を実現するものである。The variable-length frame type coding / decoding apparatus, in the above-described coding / decoding apparatus, performs a plurality of analysis instead of faithfully transmitting the spectrum envelope parameter among the speech parameters to be sent from the coding side to the decoding side in the analysis frame unit. By selecting a representative analysis frame that represents a frame and transmitting the spectrum envelope parameter of this representative analysis frame and the number of analysis frames designated as having the same spectrum envelope parameter as this representative analysis frame, so-called repeat bits. It realizes a significant reduction of the transmission bit rate.

このような可変長フレーム型符号復号化装置における可
変長フレームの構成形式にも種種の手法があるが基本的
には次のような内容が利用されている。There are various kinds of methods for constructing the variable length frame in such a variable length frame type coding / decoding device, but basically the following contents are used.

すなわち、あらかじめ設定する個数Ｍ個の分析フレーム
よりなる区分ごとにこの区分スペクトル包絡を最適近似
する最適近似関係を設定する。この区分的最適近似に利
用する関数としては、たとえば矩形関数等があり、その
設定は、分析フレームごとのスペクトル包絡パラメータ
による空間ベクトル相互間の距離の総和が最小となるよ
うな近似矩形包絡を区分ごとに見出す形式で行なわれ
る。こうして得られる近似矩形を構成する複数ｉ個の分
析フレームをもってその区分を代表する代表フレームと
する。上述したｉ個は、区分ごとにそれぞれのスペクト
ル包絡に対応して任意に設定され、また、このような最
適近似自体は、分析次数を含み分析フレームごとのスペ
クトル包絡パラメータによる空間ベクトル相互間の総当
り的ベクトル距離の総和を評価尺度とし、これを最小と
する分析フレームの組合せをDP（Dynamic Programing,
動的計画法）によって容易に求められることもよく知ら
れている。That is, an optimum approximation relationship that optimally approximates this segment spectrum envelope is set for each segment composed of a preset number M of analysis frames. As a function used for this piecewise optimal approximation, there is, for example, a rectangular function, and its setting is to divide an approximate rectangular envelope such that the total sum of distances between spatial vectors by spectral envelope parameters for each analysis frame is minimized. It is carried out in the form of finding each. A plurality of i analysis frames forming the approximate rectangle obtained in this manner are used as a representative frame representing the section. The above-mentioned i pieces are arbitrarily set corresponding to the respective spectrum envelopes for each section, and such an optimal approximation itself includes the analysis order and the total between spatial vectors by the spectrum envelope parameter for each analysis frame. The sum of the hit vector distances is used as the evaluation scale, and the combination of analysis frames that minimizes this is DP (Dynamic Programming,
It is also well known that it can be easily obtained by dynamic programming.

さて、こうして求められた可変長フレーム形式によるス
ペクトル包絡パラメータは、音源情報とともに所定の形
式で符号化され復号化側に伝送されるが、この場合、こ
れらデータ送受信におけるフレーム同期が符号化側と復
号化側との間で必要となる。Now, the spectrum envelope parameter in the variable length frame format thus obtained is encoded in a predetermined format together with the excitation information and transmitted to the decoding side. In this case, the frame synchronization in the data transmission / reception is decoded with the encoding side. It is necessary between the liquorization side and

この同期形式としては、独立同期もしくは強制同期のい
ずれかが利用されることが多いが、いずれの場合にして
も可変長フレームを構成する１区分が同期処理における
基本フレームとなって扱われている。As the synchronization format, either independent synchronization or forced synchronization is often used, but in any case, one segment forming a variable length frame is treated as a basic frame in the synchronization processing. .

すなわち、独立同期において、復号化側が符号側とのク
ロック周波数差によるスリップを除去すべく同期制御を
実施するには１区画単位のデータを蓄積しつつ、同期ず
れの場合はこの１区分単位のデータ削除によるフレーム
更新を介して同期回復を行ない、また強制同期にあって
も符号化側から復号化側に提供されるクロックの発生，
分配，伝送等にもとづく障害影響の除去を図るために１
区分単位のデータを蓄積しつつ、これも１区分シフトに
よるフレーム同期を前提として行なっているのが現状で
ある。That is, in independent synchronization, the decoding side accumulates data in units of one partition in order to perform synchronization control in order to remove slips due to the clock frequency difference from the code side. Synchronous recovery is performed through frame update by deletion, and the clock provided from the encoding side to the decoding side is generated even in forced synchronization.
In order to eliminate the effects of failures due to distribution, transmission, etc. 1
At present, the data is stored in units of divisions, and this is also premised on frame synchronization by one division shift.

[Problems to be solved by the invention]

上述した従来のこの種の可変長フレーム型符号復号化装
置の同期方法は、独立同期，強制同期を問わず１区分単
位のデータを蓄積しつつ１区分を同期フレーム単位とし
てフレーム同期を図るという方法で行なわれるため、分
析フレーム単位でフレーム同期を行なう通常の符号復号
化装置に比し著しく通話遅延が避けられないという問題
がある。この通話遅延は、可変長フレームの１区分が数
100mSEC程度になることも珍しくないことを考慮すれば
自明のことである。The above-described conventional synchronization method of this type of variable-length frame type code decoding apparatus is a method of accumulating data in units of one segment regardless of independent synchronization or forced synchronization and achieving frame synchronization in units of one synchronization frame. Therefore, there is a problem that a call delay cannot be avoided remarkably as compared with an ordinary coding / decoding device that performs frame synchronization in units of analysis frames. As for this call delay, one division of variable length frame
This is obvious considering that it is not unusual for the size to reach 100 mSEC.

本発明の目的は上述した欠点を除去し、可変長フレーム
型符号復号化装置の同期方法において、１区分よりも十
分短く、たとえば分析フレームもしくはこれに対応する
タイムスロット分のサンプル数程度のデータの蓄積のみ
でフレーム同期を図るという手段を備えることにより、
同期処理にもとづく通話遅延を著しく改善した可変長フ
レーム型符号復号化装置の同期方法を提供することにあ
る。An object of the present invention is to eliminate the above-mentioned drawbacks, and in a synchronization method of a variable length frame type coding / decoding device, sufficiently shorter than one segment, for example, an analysis frame or data corresponding to the number of samples of time slots corresponding thereto is By providing a means to achieve frame synchronization only by storing,
It is an object of the present invention to provide a synchronization method for a variable length frame type code decoding device, which significantly improves the call delay based on the synchronization processing.

[Means for solving problems]

本発明の方法は、独立同期もしくは強制同期を符号化側
との同期に利用する可変長フレーム型符号復号化装置の
同期方法において、可変長フレーム区分ごとに代表スペ
クトル包絡パラメータによって指定される有声区間もし
くは前記可変長フレーム区分ごとの無音区間あるいは前
記有声区間および無音区間の時間長を符号化側と復号化
側の同期ずれの程度に対応して適応制御しつつ符号化側
と復号化側との同期保持を行なわしめる同期保持手段を
備えて構成される。The method of the present invention is a method of synchronizing a variable length frame type coding / decoding device that utilizes independent synchronization or forced synchronization for synchronization with a coding side, and a voiced section designated by a representative spectrum envelope parameter for each variable length frame section. Alternatively, the silent side of each variable length frame section or the time length of the voiced section and the silent section is adaptively controlled according to the degree of synchronization deviation between the encoding side and the decoding side, while the encoding side and the decoding side are being controlled. It is configured to include a synchronization holding means for holding the synchronization.

〔Example〕

次に図面を参照して本発明を詳細に説明する。 The present invention will now be described in detail with reference to the drawings.

第１図は本発明の一実施例を示すブロック図であり、符
号化側１と復号化側２から成る。FIG. 1 is a block diagram showing an embodiment of the present invention, which comprises an encoding side 1 and a decoding side 2.

符号化側１はさらに、A/Dコンバータ11,窓処理器12,LSP
分析器13,音源情報分析器14,可変長フレーム選択器15,
クロック発生器16およびマルチプレクサ17等を備えて構
成される。The encoding side 1 further includes an A / D converter 11, a window processor 12, an LSP
Analyzer 13, sound source information analyzer 14, variable length frame selector 15,
The clock generator 16 and the multiplexer 17 are provided.

また、復号化側２は、デマルチプレクサ21,フレーム同
期器22,固定長フレーム化器23,メモリ24,適応制御器25,
修正メッセージ発生器26,クロック発生回路27および音
声合成器28等を備えて構成される。Further, the decoding side 2 includes a demultiplexer 21, a frame synchronizer 22, a fixed length framing device 23, a memory 24, an adaptive controller 25,
It comprises a correction message generator 26, a clock generation circuit 27, a voice synthesizer 28 and the like.

第１図に示す実施例は同期方式として独立同期を利用す
る場合を例とし、かつスペクトル包絡パラメータとして
はLSP係数を利用しているが、これは強制同期方式と
し、かつ他のLPC係数を利用するものとしても差支えな
く、基本的にはほぼ同様に実施できる。In the embodiment shown in FIG. 1, the case where independent synchronization is used as the synchronization method is used as an example, and the LSP coefficient is used as the spectrum envelope parameter, but this is the forced synchronization method and other LPC coefficients are used. It does not matter if it is done, and basically it can be carried out in substantially the same manner.

さて、入力音声はA/Dコンバータ11に供給され、高域遮
断周波数3.4KHzのLPF（Low Pass Filter）を介して不要
な高域周波数を除去したのち8KHzのサンプリング周波数
で標本化され12ビットの量子化ステップでディジタル化
される。Now, the input voice is supplied to the A / D converter 11, and after removing unnecessary high frequency through LPF (Low Pass Filter) of high frequency cutoff frequency 3.4KHz, it is sampled at 8KHz sampling frequency It is digitized in the quantization step.

窓処理器12は、A/Dコンバータ11から量子化音声信号を
受けると、あらかじめ設定する１ブロック分、たとえば
30mSEC（240サンプル）ずつを一時的にメモリに格納
し、これにあらかじめ設定した窓関数のハミング関数を
乗算して切出す窓処理を10mSECの分析フレーム周期ごと
に行なう。従って基本的な分析フレームのサイズ、即ち
分析フレーム長は10mSECとなる。Upon receiving the quantized audio signal from the A / D converter 11, the window processor 12 receives one quantized audio signal for a preset one block, for example,
Each 30mSEC (240 samples) is temporarily stored in the memory, and the window processing that multiplies this by a Hamming function of the preset window function and cuts out is performed every 10mSEC analysis frame period. Therefore, the basic analysis frame size, that is, the analysis frame length is 10 mSEC.

窓処理器12の出力はLSP分析器13と音源情報分析器14に
供給される。The output of the window processor 12 is supplied to the LSP analyzer 13 and the sound source information analyzer 14.

LSP分析器13は、入力する分析フレームごとのデータを
公知の手法によってLPC分析して所定のＳ次のαパラメ
ータを抽出し、さらにこれを利用してＳ次元の空間ベク
トルとしてのLSP係数を誘導する。本実施例ではＳとし
て偶数を選んでいる。LSP係数を次の（１）式ので示
す。The LSP analyzer 13 performs LPC analysis on the input data for each analysis frame by a known method to extract a predetermined S-order α parameter, and further uses this to derive an LSP coefficient as an S-dimensional space vector. To do. In this embodiment, S is an even number. The LSP coefficient is shown by the following equation (1).

＝（Ｐ₁,P₂，……Ｐ_s） ……（１）（１）式に示すＰ₁〜Ｐ_sは音道の開閉状態にもとづいて
得られる共振周波数の組の形式で表現される周波数領域
パラメータである。= (P ₁ , P ₂ , ... P _s ) (1) P _{1 to} P _s shown in the equation (1) are expressed in the form of a set of resonance frequencies obtained based on the open / closed state of the sound path. It is a frequency domain parameter.

LSP分析器13で得られる分析フレームごとのLSP係数は可
変長フレーム選択器15に供給される。The LSP coefficient for each analysis frame obtained by the LSP analyzer 13 is supplied to the variable length frame selector 15.

可変長フレーム選択器15は、分析フレーム単位で入力す
るスペクトル包絡パラメータの系列に対し次のようにし
て区分的最適関数近似にもとづく可変長フレーム選択を
実施する。The variable-length frame selector 15 performs variable-length frame selection based on piecewise optimal function approximation for the sequence of spectral envelope parameters input in analysis frame units as follows.

すなわち、本実施例では相続く20個の分析フレーム区
間、すなわち200mSECを１区分とし、この区分ごとに区
分的最適関数近似を矩形近似によって行なっている。こ
の場合、近似関数として矩形関数の代りに折線近似関数
等他の近似関数を利用しても勿論差支えない。That is, in the present embodiment, 20 consecutive analysis frame sections, that is, 200 mSEC are set as one section, and the piecewise optimum function approximation is performed by the rectangle approximation for each section. In this case, it is of course possible to use another approximation function such as a polygonal line approximation function instead of the rectangular function as the approximation function.

さて、区分ごとに含まれる20個の分析フレームのうちｉ
個が代表分析フレームとして選択されたとする。ここで
ｉは１から20のうち任意に選定される数である。この代
表分析フレームに属するLSP係数ベクトルを利用してこ
の区分中の他の分析フレームに属するLSP係数ベクトル
をも近似的に代表せしめるのであるが、この場合矩形関
数近似を行ない、この近似誤差による歪が最小となるよ
うにｉを選定するのである。この場合の歪は基本的には
前述した係数のベクトル距離を尺度として得られ、分析
フレームごとの係数間のベクトル距離の総和が最小であ
るものが歪最小という観点にもとづいてDP手法を利用し
て容易に求めることができる。Now, of the 20 analysis frames included in each category, i
It is assumed that the individual is selected as the representative analysis frame. Here, i is a number arbitrarily selected from 1 to 20. The LSP coefficient vector belonging to this representative analysis frame is also used to approximately represent LSP coefficient vectors belonging to other analysis frames in this section.In this case, a rectangular function approximation is performed and distortion due to this approximation error is performed. I is selected so that is minimized. The distortion in this case is basically obtained by using the vector distance of the above-mentioned coefficient as a scale, and the DP method is used based on the viewpoint that the sum of the vector distances between the coefficients of each analysis frame is the minimum. Can be easily requested.

こうして、区分ごとに次次にｉ個の代表分析フレームが
選定され、かつこれらｉ個の代表分析フレームが代表す
べき分析フレームもそれぞれ決定され、可変長フレーム
構成のLSP係数データとしてマルチプレクサ17に供給さ
れる。In this way, the next i representative analysis frames are selected for each section, and the analysis frames to be represented by these i representative analysis frames are also determined and supplied to the multiplexer 17 as LSP coefficient data having a variable length frame structure. To be done.

音源情報分析器14は、窓処理器12から受ける分析フレー
ムごとの量子化音声信号を利用し、公知の分析手法によ
ってピッチ周期，有声／無音の別および音源振幅を得て
これら音源情報データを分析フレームごとにマルチプレ
クサ17に供給する。The sound source information analyzer 14 analyzes the sound source information data by using a quantized speech signal for each analysis frame received from the window processor 12 to obtain a pitch period, voiced / unvoiced sound and sound source amplitude by a known analysis method. It is supplied to the multiplexer 17 for each frame.

クロック発生器16は、A/Dコンバータ11からマルチプレ
クサ17までのディジタル処理に必要なクロックｔ₁を提
供する。The clock generator 16 provides the clock t ₁ required for digital processing from the A / D converter 11 to the multiplexer 17.

マルチプレクサ17は、こうして入力したLSP係数データ
と音源情報データとを伝送に適する形に合成符号化した
多重化信号として伝送路100を介して復号化側２に送出
し、従ってクロックｔ₁に関する情報もこの多重化信号
に含まれて復号化側２に提供されることとなる。The multiplexer 17 sends the LSP coefficient data and the sound source information data input in this way to the decoding side 2 via the transmission path 100 as a multiplexed signal which is synthetically coded in a form suitable for transmission, and therefore also the information on the clock t _1. It is included in this multiplexed signal and provided to the decoding side 2.

復号化側２では、符号化側１のマルチプレクサ17から提
供された多重化信号をデマルチプレクサ21で受け、この
時系列データのスペクトルにもとづきクロックｔ₁を抽
出、このクロックｔ₁を利用しつつ多重化信号の分離を
行なったのち復号化して符号化側１のマルチプレクサ17
の入力状態に復元し、区分単位で提供されるLSP周波数
データと分析フレーム単位で提供される音源情報データ
はフレーム同期器22に供給する。また、デマルチプレク
サ21で抽出したクロックｔ₁はフレーム同期器22,固定長
フレーム化器23,メモリ24,修正メッセージ発生器26等に
それぞれ供給される。On the decoding side 2, the demultiplexer 21 receives the multiplexed signal provided from the multiplexer 17 on the encoding side 1, extracts the clock t ₁ based on the spectrum of this time series data, and multiplexes while using this clock t _1. After the encoded signal is separated, it is decoded and then the multiplexer 17 on the encoding side 1
The LSP frequency data provided in the unit of division and the sound source information data provided in the unit of analysis frame are supplied to the frame synchronizer 22. The clock t ₁ extracted by the demultiplexer 21 is supplied to the frame synchronizer 22, the fixed length framing device 23, the memory 24, the correction message generator 26, etc., respectively.

フレーム同期器22は、こうして提供されるクロックｔ₁
に関する情報を利用してLSP係数データを送出した区分
単位のフレームと、音源データを送出した分析フレーム
のフレーム同期を行ないつつこれらデータを受信しこれ
を固定長フレーム化器23に供給する。The frame synchronizer 22 provides the clock t ₁ thus provided.
The information is received and supplied to the fixed-length framing device 23 while performing frame synchronization between the frame of the division unit in which the LSP coefficient data is transmitted and the analysis frame in which the sound source data is transmitted by using the information on the data.

フレーム同期器23はまたフレーム同期を行ないつつ受信
するLSP係数データからはリピートビット、また音源デ
ータからは電力ビットに関するデータを抽出し、これら
を適応制御器25に供給する。上述したリピートビット
は、区分ごとに選択されたｉ行の代表分析フレームのそ
れぞれが代表する分析フレームの個数ならびに区分内順
番に関するデータである。The frame synchronizer 23 also extracts data relating to repeat bits from the LSP coefficient data received while performing frame synchronization, and power bits from the sound source data, and supplies these to the adaptive controller 25. The above-mentioned repeat bits are data relating to the number of analysis frames represented by each of the representative analysis frames in the i-th row selected for each section and the order within the section.

固定長フレーム化器23は、デマルチプレクサ21から提供
されるクロックｔ₁の駆動のもとに代表分析フレームと
リピートビット等を利用し、代表分析フレーム間の補間
を行なってLSP係数の固定フレーム別指定状態を復元す
る。すなわちLSP分析器13の出力状態に復元する。The fixed length framing device 23 uses the representative analysis frame and the repeat bit under the drive of the clock t ₁ provided from the demultiplexer 21, performs interpolation between the representative analysis frames, and fixes the LSP coefficient for each fixed frame. Restore the specified state. That is, the output state of the LSP analyzer 13 is restored.

こうして、固定長フレーム化されたLSP係数は別途分析
フレーム単位でフレーム同期器22から提供された音源情
報とともに音声パラメータとしてメモリ24に分析フレー
ム単位で格納される。In this way, the fixed length framed LSP coefficient is separately stored in the analysis frame unit in the memory 24 as a voice parameter together with the sound source information provided from the frame synchronizer 22 in the analysis frame unit.

メモリ24に格納される場合の書込みクロックはデマルチ
プレクサ21から供給され太線で示すクロックｔ₁が利用
され、これを読出して音声合成器28に供給するクロック
は復号化側２のクロック発生器27から供給される点線で
示すクロックｔ₂が利用される。これら２つのクロック
は互いに独立同期の形式で符号化側と復号化側との同期
保持に利用される。このような独立同期は、符号化側に
おけるクロックの安定性の確保や柔軟性のあるシステム
構成、効率的な故障対応といった点では優れているもの
の両クロック間の時間ずれが原因となる通話遅延が避け
られない。このことを詳述すると次のとおりである。The write clock when stored in the memory 24 is supplied from the demultiplexer 21 and the clock t ₁ indicated by the bold line is used. The clock read from this clock and supplied to the voice synthesizer 28 is supplied from the clock generator 27 on the decoding side 2. The supplied clock t ₂ shown by the dotted line is used. These two clocks are used for maintaining synchronization between the encoding side and the decoding side in the form of independent synchronization with each other. Such independent synchronization is excellent in terms of ensuring clock stability on the encoding side, a flexible system configuration, and efficient failure handling, but there is a call delay caused by a time lag between the two clocks. Inevitable. This will be described in detail below.

いま、かりに復号化側２のクロックｔ₂が符号化側のク
ロックｔ₁よりも速い方法にずれているとすると、メモ
リ24に対する書込みよりは読出し速度が速く、最終的に
は読出すべきフレームデータが無くなってしまうことに
なる。また、クロックｔ₂がクロックｔ₁よりも遅い方向
にずれている場合には読出しの終了しないうちに次のフ
レームデータの書込み命令が出されるという状態が発生
することとなる。このような場合、このずれの状態を監
視しつつ、これがある許容値を超えるこきは次の分析フ
レームの書込み、読出しに飛越しシフトし符号，復号化
側間のフレーム同期を確保している。Now, assuming that the clock t ₂ on the decoding side 2 is deviated to a method faster than the clock t ₁ on the encoding side, the reading speed is faster than the writing to the memory 24 and the frame data to be finally read out. Will be lost. Further, when the clock t ₂ is shifted in the direction slower than the clock t _1, a state occurs in which a write command for the next frame data is issued before the reading is completed. In such a case, while monitoring the state of this deviation, when the deviation exceeds a certain allowable value, it is interleaved and shifted for writing and reading of the next analysis frame to ensure frame synchronization between the coding and decoding sides.

しかしながら、可変長フレーム型の符号復号装置では、
このようにして調整されるフレーム同期が１区分を基本
フレーム単位として処理しており、これによる通話遅延
は通常の符号復号化装置に比して著しく増大したものと
なる。However, in the variable length frame type code decoding device,
The frame synchronization adjusted in this way processes one section in units of basic frames, and the call delay due to this is significantly increased as compared with a normal code decoding apparatus.

本実施例は上述したフレーム同期修正を１区分よりも十
分に短い分析フレーム単位で実施して通話遅延の大幅な
改善を図っている。In the present embodiment, the above-mentioned frame synchronization correction is carried out in units of analysis frames which are sufficiently shorter than one segment, and the call delay is greatly improved.

固定長フレーム化器23は適応制御器25から調節フレーム
に関するデータを受ける都度、メモリ24に出力すべき分
析フレームから当該分析フレームを削除する。この調節
フレームは、区分中にあって再生音質に強い影響をもつ
子音もしくは子音と母音の共存部は除き、その時間長が
本質的に不定であり従って再生音質に対する影響が十分
に少ない有声音定常部ならびに無音部分もしくはこの無
音部分と実効的に等価な音声中のポーズ部分を対象とし
て指定され、かつそのタイミングは修正メッセージ発生
器26から修正メッセージを受ける都度行なわれるものと
している。Each time the fixed-length framing device 23 receives data regarding an adjustment frame from the adaptive controller 25, the fixed-length framing device 23 deletes the analysis frame from the analysis frames to be output to the memory 24. This adjustment frame has an essentially indefinite time length except for the consonants or the coexistence part of consonants and vowels, which has a strong influence on the reproduced sound quality during division, and therefore has a sufficiently small influence on the reproduced sound quality. It is assumed that the part and the silent part or the pause part in the voice which is effectively equivalent to the silent part are designated, and the timing thereof is set every time the modification message is received from the modification message generator 26.

ところで、可変長フレーム化されたスペクトル包絡パラ
メータにおいて、代表分析フレームによって指定される
分析フレーム群の時間長が長く、従ってフレーム同期保
持のために削除される１分析フレームの影響が相対的に
少なくて済む区間のひとつが有声音定常部であり、もう
ひとつがポーズを含む無音部である。これら有声音定常
部と無音部とを音声電力の観点で評価すると、前者は明
らかに電力大であり後者は電力小である。どの分析フレ
ームを削除の対象とするかは、フレーム同期器22から提
供される電力ビットを利用し、リピートビットによって
指定される分析フレームから次のようにして選定する。By the way, in the spectrum envelope parameter of variable length frame, the time length of the analysis frame group designated by the representative analysis frame is long, and therefore the influence of one analysis frame deleted for maintaining frame synchronization is relatively small. One of the completed sections is the voiced sound stationary section, and the other is the silent section including pauses. When the voiced sound stationary part and the silent part are evaluated from the viewpoint of voice power, the former is obviously high in power and the latter is low in power. Which analysis frame is to be deleted is selected from the analysis frames designated by the repeat bits by using the power bit provided from the frame synchronizer 22 as follows.

修正メッセージ発生器26は常時クロックｔ₁とクロック
ｔ₁を受けつつ、クロック相互間の時間ずれ±Δｔを監
視する。この時間ずれ±Δｔがある許容値を超えたとき
修正メッセージ発生器26は適応制御器25に対し区分中か
ら削除すべき分析フレームを指定する調節フレームデー
タを出力せしめる修正メッセージを出力する。The correction message generator 26 constantly receives the clocks t ₁ and t ₁ , and monitors the time lag ± Δt between the clocks. When this time difference ± Δt exceeds a certain allowable value, the correction message generator 26 outputs a correction message for causing the adaptive controller 25 to output the adjustment frame data designating the analysis frame to be deleted from the section.

適応制御器25は、修正メッセージを受けると電力ビット
とリピートビットにもとづき時間的に最近傍の削除対象
分析フレームを決定、この情報を調節フレームデータと
して出力し、分析フレーム単位での同期保持を行なう。Upon receiving the modification message, the adaptive controller 25 determines the temporally nearest deletion target analysis frame based on the power bit and the repeat bit, outputs this information as adjustment frame data, and holds synchronization in analysis frame units. .

音声合成器28はクロックｔ₂を受けつつ、メモリ23から
分析フレーム単位でスペクトル包絡パラメータと音源デ
ータを供給される。The voice synthesizer 28 is supplied with the spectrum envelope parameter and the sound source data from the memory 23 in units of analysis frames while receiving the clock t ₂ .

音声合成器28は、LSP合成フィルタのフィルタ係数とし
てLSP係数を利用し、また音源データから再生した音源
信号でこのフィルタを駆動してディジタル音声信号を発
生、さらにこれをD/Aコンバータにかけてアナログ音声
信号としたうえLPFで3.4KHz以上の不要な高域成分を除
去し出力音声を得る。The voice synthesizer 28 uses the LSP coefficient as a filter coefficient of the LSP synthesis filter, drives the filter with a sound source signal reproduced from the sound source data to generate a digital voice signal, and further applies this to a D / A converter to generate an analog voice signal. In addition to the signal, the LPF removes unnecessary high frequency components above 3.4 KHz to obtain the output voice.

こうして、区分単位による可変長フレーム符号，復号化
を実施しつつ、しかもフレーム同期保持は分析フレーム
単位で処理することができる。In this way, variable length frame encoding and decoding can be performed in units of divisions, and frame synchronization retention can be processed in units of analysis frames.

なお、本実施例ではスペクトル包絡パラメータのうち代
表分析フレームによって指定される区間の時間長と無音
区間の時間長とを併用して適応制御するものとしている
が、これをそれぞれ単独に適応制御するものとしても一
向に差支えない。In this embodiment, the time length of the section designated by the representative analysis frame among the spectrum envelope parameters and the time length of the silent section are used for adaptive control, but each of them is adaptively controlled independently. However, it does not matter.

また、本実施では１区分よりも十分短い区間として分析
フレームを同期における基本フレームとして利用してい
るが、これはほぼ同様な時間長のサンプル数を対象とし
ても容易に実施しうることは明らかである。Further, in the present embodiment, the analysis frame is used as a basic frame in synchronization as a section that is sufficiently shorter than one section, but it is clear that this can be easily performed even when the number of samples with almost the same time length is used. is there.

〔The invention's effect〕

以上説明した如く本発明によれば、独立同期もしくは強
制同期を符号化側と復号化側との同期に利用する可変長
フレーム型符号化装置の同期方法において、スペクトル
包絡パラメータの代表時間長もしくは無音区間長、ある
いはこれら２つの時間長を符号化側と復号化側との同期
ずれの程度に対応して適応制御するという手都を備える
ことにより、同期ずれ修正に伴なう通話遅延を著しく低
減せしめる可変長フレーム型符号復号化装置の同期方法
が実現できるという効果がある。As described above, according to the present invention, in the synchronization method of the variable-length frame type encoder that uses independent synchronization or forced synchronization for synchronization between the encoding side and the decoding side, the representative time length of the spectrum envelope parameter or silence By providing the means to adaptively control the section length or these two time lengths according to the degree of synchronization deviation between the encoding side and the decoding side, the call delay associated with the synchronization deviation correction is significantly reduced. There is an effect that it is possible to realize a synchronization method for a variable length frame type code decoding device.

[Brief description of drawings]

第１図は本発明の一実施例を示すブロック図である。１……符号化側、２……復号化側、11……A/Dコンバー
タ、12……窓処理器、13……LSP分析器、14……音源情
報分析器、15……可変長フレーム選択器、16……クロッ
ク発生器、17……マルチプレクサ、21……デマルチプレ
クサ、22……フレーム同期器、23……固定長フレーム化
器、24……メモリ、25……適応制御器、26……修正メッ
セージ発生器、26,27……クロック発生器、28……音声
合成器、100……伝送路。FIG. 1 is a block diagram showing an embodiment of the present invention. 1 ... Encoding side, 2 ... Decoding side, 11 ... A / D converter, 12 ... Window processor, 13 ... LSP analyzer, 14 ... Sound source information analyzer, 15 ... Variable length frame Selector, 16 ... Clock generator, 17 ... Multiplexer, 21 ... Demultiplexer, 22 ... Frame synchronizer, 23 ... Fixed length framing device, 24 ... Memory, 25 ... Adaptive controller, 26 ...... Modified message generator, 26,27 …… Clock generator, 28 …… Speech synthesizer, 100 …… Transmission line.

Claims

[Claims]

1. A synchronization method of a variable-length frame type code decoding apparatus which utilizes independent synchronization or forced synchronization for synchronization between an encoding side and a decoding side, and is designated by a representative spectrum envelope parameter for each variable-length frame section. If the decoding side clock precedes the encoding side clock, the voice analysis section or the silence section for each variable length frame section, or the time length of the voiced section and the silence section is extended by the basic analysis frame period to obtain the encoding side clock. Is preceded by the decoding side clock, it is provided with a synchronization holding means for performing synchronization control between the encoding side and the decoding side while adaptively controlling by shortening the basic analysis frame and correcting the synchronization shift. A method for synchronizing a variable length frame type code decoding device, comprising: