JP3475772B2

JP3475772B2 - Audio encoding device and audio decoding device

Info

Publication number: JP3475772B2
Application number: JP06541898A
Authority: JP
Inventors: 久矢島
Original assignee: Mitsubishi Electric Corp
Current assignee: Mitsubishi Electric Corp
Priority date: 1998-03-16
Filing date: 1998-03-16
Publication date: 2003-12-08
Anticipated expiration: 2018-03-16
Also published as: JPH11259099A

Abstract

PROBLEM TO BE SOLVED: To make it possible to transmit non-speech signal of high quality without increasing a transmission speed at the time of transmitting a non-speech signal by changing over a part of processing function blocks based on the judgment result of a speech/non-speech discriminator. SOLUTION: A speech/non-speech signal discriminator 102 permanently monitors whether an input signal is a speech signal or a non-speech signal. Based on the judgment result, an operation mode of an encoder 101 is decided by change-over switches 103, 104. An encoding processing function block 105 is optimized so as to be able to compress and encode subject speech signal with a high degree of efficiency. On the other hand, an encoding processing function block 106 is optimized so as to be able to compress and encode non- speech signal, for example, a DTMF signal, with little distortion. On the receiver side, based on the judgment result of the speech/non-speech signal discriminator 102, separated by a multi-separating part 202, decoding processing function blocks 205, 206 are selected by change-over switches 203, 204 respectively. These blocks 205, 206 correspond to the blocks 105, 106.

Description

Detailed Description of the Invention

【０００１】[0001]

【発明の属する技術分野】本発明は、音声のディジタル
有線通信および無線通信において用いられる音声符号化
・復号装置に関し、特にDTMF(Dual Tone Multi-Frequen
cy)信号などの、音声周波数帯域を用いた非音声信号を
伝送する事の出来る音声符号化・復号装置に関する。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a voice encoding / decoding device used in voice digital wire communication and wireless communication, and more particularly to a DTMF (Dual Tone Multi-Frequen).
The present invention relates to a voice encoding / decoding device capable of transmitting a non-voice signal using a voice frequency band such as a cy signal.

【０００２】[0002]

【従来の技術】企業内通信においては、通信コストの低
減が最も重要な課題である。通信トラヒックの大部分を
占める音声信号の高能率伝送を実現するため、近年、8k
bit/sCS-ACELP(Conjugate-Structure Algebraic-Code-E
xcited Linear Prediction: 共役構造代数的符号励振
線形予測)音声符号化方式(ITU-T勧告G.729準拠)に代表
されるような、音声符号化・復号方式に基づく高能率音
声符号化装置を適用する事例が増えつつある。2. Description of the Related Art Reducing communication costs is the most important issue in corporate communication. In order to realize high-efficiency transmission of voice signals, which occupy most of communication traffic, in recent years 8k
bit / sCS-ACELP (Conjugate-Structure Algebraic-Code-E
xcited Linear Prediction: Conjugate structure algebraic code excitation linear prediction) Applying a high-efficiency speech coder based on speech coding / decoding, as typified by speech coding (conforming to ITU-T Recommendation G.729). The number of cases to do is increasing.

【０００３】伝送速度が8kbit/sクラスの音声符号化ア
ルゴリズムにおいては、少ない情報量で高品質な音声を
得るため入力信号を音声信号に特化した構成となってい
る。この事を上記8kbit/s CS-ACELP方式を例にとって説
明する。図２０に符号器のブロック図を、図２１に復号
器のブロック図を示す。本符号化方式は、人間の発声機
構をモデル化した符号化アルゴリズムとなっており、即
ち、人間の声道情報をモデル化した合成フィルタ412(音
声のスペクトル包絡に対応する線形フィルタ)を構成
し、人間の声帯音源に相当する符号帳に蓄えられた時系
列の信号(励振信号再生部411の出力)で駆動する事によ
って音声を再生するCELP方式に基づいている。なお、詳
細なアルゴリズムの説明は、ITU-T Recommendation G.7
29, ■Coding of Speech at 8kbit/s using Conjugate-
Structure Algebraic-Code-ExcitedLinear Prediction
(CS-ACLEP)■を参照されたい。In a voice coding algorithm having a transmission rate of 8 kbit / s class, an input signal is specialized for a voice signal in order to obtain a high quality voice with a small amount of information. This will be described by taking the above 8 kbit / s CS-ACELP method as an example. FIG. 20 shows a block diagram of the encoder, and FIG. 21 shows a block diagram of the decoder. This coding method is a coding algorithm that models a human vocalization mechanism, that is, a synthesis filter 412 (a linear filter corresponding to a spectral envelope of speech) that models human vocal tract information is configured. It is based on the CELP system in which a voice is reproduced by driving with a time-series signal (output of the excitation signal reproducing unit 411) stored in a codebook corresponding to a human vocal cord sound source. For a detailed explanation of the algorithm, see ITU-T Recommendation G.7.
29, ■ Coding of Speech at 8kbit / s using Conjugate-
Structure Algebraic-Code-Excited Linear Prediction
(CS-ACLEP) Please refer to ■.

【０００４】符号化アルゴリズムが音声に特化された構
造になると、高能率音声符号化装置を用いた伝送路にお
いて、音声周波数帯域を用いた音声信号以外の信号（例
えば、デュアルトーンで構成されたプッシュボタンに用
いられるDTMF信号、No.5シグナリング、モデム信号な
ど）の伝送特性は、伝送効率が高能率になればなるほど
低下する傾向がある。When the coding algorithm has a structure specialized for voice, a signal other than a voice signal using a voice frequency band (for example, a dual tone configuration) is used in a transmission line using a high-efficiency voice encoding device. The transmission characteristics of DTMF signals used for push buttons, No.5 signaling, modem signals, etc.) tend to deteriorate as the transmission efficiency becomes higher.

【０００５】この事を示す一例として、LSP量子化部の
詳細について、図２２、図２３を用いて説明する。図２
２は、図２０に示した符号器内のLSP量子化部309の詳細
構成であり、図２３は、図２１に示した復号器内のLSP
逆量子化部407の詳細構成である。As an example showing this, details of the LSP quantizer will be described with reference to FIGS. 22 and 23. Figure 2
2 is a detailed configuration of the LSP quantizer 309 in the encoder shown in FIG. 20, and FIG. 23 is an LSP in the decoder shown in FIG.
3 is a detailed configuration of the inverse quantization unit 407.

【０００６】CS-ACELP音声符号化方式においては、LSP
係数の量子化に３つの処理手順を踏む事で実現してい
る。即ち、LSP量子化部309は、以下に示す３つの処理機
能ブロックを有している。 (1)フレーム間で予測可能な成分を差し引いて効率的に
量子化するための移動平均(MA)予測成分計算部308 (2)ターゲットとなるLSPを、音声により学習された符号
帳を用いて大雑把に量子化を行う第１段LSP量子化符号
帳335 (3)第１段で大雑把に量子化されたターゲットLSPに対し
て、乱数系列を用いた符号帳で微調整を行う第２段LSP
量子化符号帳336In the CS-ACELP speech coding system, the LSP
It is realized by performing three processing steps for coefficient quantization. That is, the LSP quantizer 309 has the following three processing function blocks. (1) Moving average (MA) predictive component calculation unit 308 for efficient quantization by subtracting predictable components between frames (2) Target LSP using a codebook learned by speech First-stage LSP for roughly quantizing Quantization codebook 335 (3) Second-stage LSP for performing fine adjustment with a codebook using a random number sequence on the target LSP roughly quantized at the first stage
Quantization codebook 336

【０００７】(1)の移動平均予測を用いる事により周波
数特性の急激な変化の少ない、即ちフレーム間で相関性
の強い信号を効率的に量子化する事が出来る。また(2)
の学習符号帳と、(3)の乱数符号帳との併用により、数
学的な厳密性には欠けるものの、スペクトル包絡の概形
を効率よく表現する事が出来る。また、(3)の乱数符号
帳の存在により、スペクトル包絡の微妙な変化にも柔軟
に追随する事が出来る。以上の観点から、LSP量子化部3
09は音声のスペクトル包絡情報の特徴を効率よく符号化
するのに良く適した方式であるといえる。By using the moving average prediction of (1), it is possible to efficiently quantize a signal having a small change in frequency characteristics, that is, a signal having a strong correlation between frames. Also (2)
By using the learning codebook of (1) and the random codebook of (3) together, although the mathematical rigor is lacking, the outline of the spectrum envelope can be efficiently expressed. In addition, the existence of the random number codebook in (3) makes it possible to flexibly follow subtle changes in the spectrum envelope. From the above viewpoint, the LSP quantizer 3
It can be said that 09 is a method that is well suited for efficiently encoding the features of speech spectral envelope information.

【０００８】一方、非音声信号、特にDTMF信号の符号化
においては、以下のような性質を考慮する必要がある。・スペクトル包絡は音声信号と明らかに異なっている。・信号継続時間と、ポーズ時間との間でスペクトル特性
に急激な変化がある。利得も急激に変化する。ただし、
信号継続時間内に限定すれば、スペクトル特性、利得と
もに変化量が極めて小さい。・LSPの量子化歪がそのままDTMF信号の周波数歪に反映
されるため、LSP量子化歪は出来るだけ小さくする必要
がある。・DTMF信号が継続する区間においては、周波数特性は極
めて安定している。以上の観点から、上記LSP量子化部309は、DTMF信号のス
ペクトル包絡情報を符号化するのに効果的な方法である
とは言えない。On the other hand, in encoding a non-voice signal, especially a DTMF signal, it is necessary to consider the following properties. The spectral envelope is clearly different from the voice signal. There is a sharp change in the spectral characteristics between the signal duration and the pause time. The gain also changes rapidly. However,
If limited to within the signal duration, the amount of change in both spectral characteristics and gain is extremely small. -The LSP quantization distortion is directly reflected in the DTMF signal frequency distortion, so it is necessary to minimize the LSP quantization distortion. -The frequency characteristics are extremely stable in the section where the DTMF signal continues. From the above viewpoints, the LSP quantization section 309 cannot be said to be an effective method for encoding the spectrum envelope information of the DTMF signal.

【０００９】以上の例で示したように、DTMF信号のよう
な非音声信号はいくつかの観点で音声信号とは異なる性
質を有しているため、非音声信号の符号化に当たって、
特に伝送速度が低く符号化のための冗長性が少ないとい
う条件の下では、音声信号と同じ手法を用いるのは適当
とは言えない。As shown in the above example, since a non-voice signal such as a DTMF signal has a property different from that of the voice signal in some respects, when encoding the non-voice signal,
Especially under the condition that the transmission rate is low and the redundancy for encoding is small, it is not appropriate to use the same method as that for the voice signal.

【００１０】ところで、企業内通信においては、電話通
信における呼接続などのために、シグナリング伝送のた
めの信号線を別途設ける事をせず、DTMF信号等を用いて
インチャネルでシグナリング伝送を行う事が多い。この
場合、割当てられた伝送路が上記の高能率音声符号化を
用いた伝送路であれば、DTMF信号の伝送特性は悪化する
ため、呼接続が正常に出来なくなるケースが高い頻度で
発生するといった弊害がある。By the way, in intra-company communication, signaling lines are transmitted in-channel using DTMF signals, etc. without separately providing a signal line for signaling transmission for call connection in telephone communication. There are many. In this case, if the assigned transmission path is a transmission path using the high-efficiency voice coding described above, the transmission characteristics of the DTMF signal deteriorate, so that a case where call connection cannot be normally made frequently occurs. There is an evil.

【００１１】このような問題を解決する第１の手段とし
て、例えば、特開平9-81199に示されるような図２４の
装置構成がとられる事がある。この構成においては送信
側に、音声信号と、DTMF信号のような非音声信号とを識
別する手段と、DTMF信号をあらかじめ符号化したパター
ンを記憶しているメモリを、送信側と受信側とで有して
おり、本識別手段においてDTMF信号の入力を識別する
と、DTMFの番号に対応する符号化パターンを保持するメ
モリのインデックスを受信側に送信し、受信側ではその
インデックスを識別して、その番号に対応するDTMF信号
を生成するものである。As a first means for solving such a problem, for example, the device configuration shown in FIG. 24 as shown in JP-A-9-81199 may be adopted. In this configuration, the transmitting side has a means for distinguishing a voice signal from a non-voice signal such as a DTMF signal, and a memory storing a pattern in which the DTMF signal is encoded in advance. When the input of the DTMF signal is identified by this identifying means, the index of the memory holding the coding pattern corresponding to the DTMF number is transmitted to the receiving side, and the receiving side identifies the index, The DTMF signal corresponding to the number is generated.

【００１２】また、このような問題を解決する第２の手
段として、例えば、特開平9-326772に示されるような図
２５の装置構成がとられる事がある。この構成において
は、入力音声の符号化に際し、フィルタ係数を得て音声
符号化量を算出する聴覚重み付けフィルタと、聴感上の
品質を高めるための聴覚重み付けフィルタ適応部で得ら
れる反射係数を設定値と比較する反射係数判定手段と、
この反射係数が規定の範囲外であれば聴覚重み付けフィ
ルタを使用し、規定の範囲内であれば聴覚重み付けフィ
ルタを使用しない使用切換手段を備えた。Further, as a second means for solving such a problem, for example, there is a case where the device configuration of FIG. 25 as shown in Japanese Patent Laid-Open No. 9-326772 is adopted. In this configuration, when the input speech is encoded, the auditory weighting filter that obtains the filter coefficient to calculate the speech encoding amount and the reflection coefficient obtained by the auditory weighting filter adaptation unit for enhancing the perceptual quality are set values. Reflection coefficient determination means for comparing with
If the reflection coefficient is out of the specified range, the auditory weighting filter is used, and if the reflection coefficient is in the specified range, the use switching means not using the auditory weighting filter is provided.

【００１３】また更に、入力信号のレベルの変化を監視
する信号レベル監視手段を付加し、使用切換手段は反射
係数と入力信号のレベルの変化の組み合わせで聴覚重み
付けフィルタの使用切換を行うようにした。また、復号
装置側も対応する使用切換手段を備えた。Further, a signal level monitoring means for monitoring the change in the level of the input signal is added, and the use switching means switches the use of the auditory weighting filter by the combination of the reflection coefficient and the change in the level of the input signal. . Further, the decryption device side also has a corresponding use switching means.

【００１４】[0014]

【発明が解決しようとする課題】ところで、音声伝送時
において、音声／DTMF識別器が誤識別を起こす事があ
る。このとき、従来の第１の装置構成では、受信側にお
いて、音声伝送の途中に完全なデュアルトーンが生成さ
れ、即ち、会話の途中で突然DTMFの「ピー」音が挟まる
ような状況を呈する事となり、音声品質へのダメージが
大きくなる事が予想される。また、DTMFと識別されたと
きの復号器の入力が不定となるため、通常の音声伝送に
復帰後、復号器の内部状態が不連続となり、異音が発生
するといった問題もある。By the way, the voice / DTMF discriminator may make an erroneous discrimination during voice transmission. At this time, in the conventional first device configuration, the receiving side may generate a complete dual tone during the voice transmission, that is, the DTMF "beep" sound may be suddenly caught in the middle of the conversation. Therefore, it is expected that the damage to the voice quality will increase. Moreover, since the input of the decoder becomes indeterminate when it is identified as DTMF, there is a problem that the internal state of the decoder becomes discontinuous and abnormal noise is generated after returning to normal voice transmission.

【００１５】従って、音声品質をある程度のレベルに維
持するためには、音声／DTMF識別器には誤識別率が限り
なく０に近い性能の高いものが要求される。これを、例
えばDSPなどの信号処理プロセッサで実現する場合通常
処理負荷は高くなる。このような識別アルゴリズムに対
応する必要から、実現に当たっては高性能のプロセッサ
を必要とするため実現コストが高くなるなどの問題も発
生する。Therefore, in order to maintain the voice quality at a certain level, the voice / DTMF discriminator is required to have a high performance with an erroneous discrimination rate close to zero. When this is implemented by a signal processor such as a DSP, the normal processing load is high. Since it is necessary to deal with such an identification algorithm, a high-performance processor is required for implementation, which causes a problem that the implementation cost becomes high.

【００１６】また、従来の第２の装置構成においては、
伝送速度が比較的高い(16kbit/s程度)符号化アルゴリズ
ムであれば聴覚重み付けフィルタや、ポストフィルタに
よる周波数歪を改善する事で十分なDTMF伝送特性を得る
事が出来るが、これよりも更に低速度(8kbit/s以下)の
音声符号化アルゴリズムにおいては、冗長度が少なくな
る分音声信号の符号化に特化されているため、上記フィ
ルタ以外の機能ブロックで、上記フィルタによる歪以上
の符号化歪が加わってしまう事が知られている。従っ
て、本装置構成においてDTMF信号の伝送特性を十分に改
善する事が出来ない。Further, in the conventional second device configuration,
A coding algorithm with a relatively high transmission rate (about 16 kbit / s) can obtain sufficient DTMF transmission characteristics by improving the frequency distortion due to the auditory weighting filter and post filter, but it is still lower than this. In the speed (8 kbit / s or less) speech coding algorithm, since it is specialized for the coding of the speech signal due to the reduced redundancy, functional blocks other than the above filters encode more than distortion by the above filters. It is known that distortion is added. Therefore, the DTMF signal transmission characteristics cannot be sufficiently improved in this device configuration.

【００１７】本発明は、このような従来の問題を解決す
るために考えられたものであり、DTMF信号等の非音声信
号の伝送特性の改善を図りつつ、符号化アルゴリズムが
本来持っている音声伝送品質が維持された高能率音声符
号化・復号装置を、より簡便な方法で提供し、DTMF信号
等の非音声信号をインチャネルで伝送可能とする事を目
的とする。The present invention has been conceived in order to solve such a conventional problem, and improves the transmission characteristics of a non-voice signal such as a DTMF signal, and at the same time, the voice originally possessed by the encoding algorithm. It is an object of the present invention to provide a high-efficiency voice encoding / decoding device in which transmission quality is maintained by a simpler method and enable non-voice signals such as DTMF signals to be transmitted in-channel.

【００１８】[0018]

【課題を解決するための手段】本発明に係わる音声符号
化装置は、入力される信号が音声信号か、非音声信号か
を識別し、判定結果を出力する、音声／非音声識別器
と、音声の符号化に適した周波数パラメータに基づき入
力信号を符号化し、音声符号化信号を出力する第１の機
能ブロックと、上記第１の機能ブロックと同じ符号化方
式で、上記周波数パラメータとは異なる非音声の符号化
に適した周波数パラメータに基づき入力信号を逐次符号
化し、非音声符号化信号を出力する第２の機能ブロック
とを有し、入力信号を圧縮符号化する符号器と、上記音
声／非音声識別器の判定結果に応じて、入力信号の上記
符号器への供給を第１の機能ブロック、または第２の機
能ブロックのいずれかを選択する切替手段と、上記符号
器の出力と、上記音声／非音声識別器の判定結果とを多
重化し、伝送路に出力する多重化部とを備え、上記第１
の機能ブロックは、音声の符号化に適したＬＳＰ（Ｌｉ
ｎｅＳｐｅｃｔｒｕｍＰａｉｒ：線スペクトル対）量子
化部、上記第２の機能ブロックは、非音声の符号化に適
したＬＳＰ量子化部であり、上記第２の機能ブロックを
構成するＬＳＰ量子化部は、ＬＳＰの移動平均予測処理
を停止するか、または移動平均予測の効果を軽減する機
能を持つものである。 A speech coding apparatus according to the present invention comprises a speech / non-speech discriminator which discriminates whether an input signal is a speech signal or a non-speech signal and outputs a judgment result. A first functional block that encodes an input signal based on a frequency parameter suitable for audio encoding and outputs an audio encoded signal, and the same encoding method as the first functional block, and the frequency parameter is input signal based on the frequency parameters suitable for the coding of different non-speech sequentially coding <br/> the, and a second functional block for outputting a non-speech coded signal, compresses and encodes the input signal An encoder and a switching means for selecting either a first functional block or a second functional block for supplying an input signal to the encoder according to the determination result of the speech / non-speech discriminator. The output of the above encoder and A determination result of the speech / non-speech discriminator multiplexes, and a multiplexing unit to output to the transmission path, the first
The functional block of LSP (Li
neSpectrumPair: Line spectrum pair) Quantum
The second functional block is suitable for non-speech coding.
LSP quantizer, which has the second functional block
The constituent LSP quantizer is a moving average prediction process of LSP.
To stop or reduce the effect of moving average prediction
It has the ability.

【００１９】また、本発明に係わる音声符号化装置
は、入力される信号が音声信号か、非音声信号かを識別
し、判定結果を出力する、音声／非音声識別器と、音声
の符号化に適した周波数パラメータに基づき入力信号を
符号化し、音声符号化信号を出力する第１の機能ブロッ
クと、上記第１の機能ブロックと同じ符号化方式で、上
記周波数パラメータとは異なる非音声の符号化に適した
周波数パラメータに基づき入力信号を逐次符号化し、非
音声符号化信号を出力する第２の機能ブロックとを有
し、入力信号を圧縮符号化する符号器と、上記音声／非
音声識別器の判定結果に応じて、入力信号の上記符号器
への供給を第１の機能ブロック、または第２の機能ブロ
ックのいずれかを選択する切替手段と、上記符号器の出
力と、上記音声／非音声識別器の判定結果とを多重化
し、伝送路に出力する多重化部とを備え、上記第１の機
能ブロックは、音声の符号化に適したＬＳＰ量子化部、
上記第２の機能ブロックは、非音声の符号化に適したＬ
ＳＰ量子化部であり、上記第１及び第２の機能ブロック
は、量子化誤差計算に用いる重み付け係数が異なり、あ
る特定の周波数領域において、上記第２の機能ブロック
の重み付け係数を上記第１の機能ブロックの重み付け係
数より大きくするものである。 The speech coding apparatus according to the present invention discriminates whether the input signal is a speech signal or a non-speech signal.
And a voice / non-voice discriminator that outputs the determination result
The input signal based on frequency parameters suitable for encoding
A first functional block that encodes and outputs an audio encoded signal.
And the same encoding method as the first functional block above,
Suitable for non-speech coding with different frequency parameters
Sequentially encodes the input signal based on the frequency parameter,
A second functional block for outputting a voice coded signal,
And an encoder that compresses and encodes the input signal, and
The encoder of the input signal according to the determination result of the voice discriminator
Supply to the first functional block or the second functional block
Switch for selecting one of the
Force and the judgment result of the voice / non-voice discriminator are multiplexed.
And a multiplexing unit for outputting to the transmission line,
The functional block is an LSP quantizer suitable for audio coding,
The second functional block is L suitable for non-speech coding.
An SP quantizer, which is the first and second functional blocks described above.
, The weighting factors used for the quantization error calculation are different,
The second functional block in a specific frequency region
Of the weighting coefficient of the first functional block
It should be larger than the number.

【００２０】また、本発明に係わる音声復号装置は、
音声符号化装置で符号化され、伝送された多重化データ
列から、符号化信号と、音声／非音声識別信号とに分離
する多重分離部と、上記符号化信号を元の信号に復号す
る復号器とを備え、上記復号器は、伝送された音声符号
化信号を符号化した符号器に対応する第３の機能ブロッ
クと、同じく伝送された非音声符号信号を符号化した符
号器に対応する第４の機能ブロックとを有し、上記多重
データ分離部で分離された、上記音声／非音声識別器の
判定結果に応じて、上記第３の機能ブロック、または第
４の機能ブロックのいずれかを選択する切替手段を有す
るものである。 The speech decoding apparatus according to the present invention is
Multiplexed data encoded and transmitted by a voice coder
Separation from coded signal into coded signal and voice / non-voice discrimination signal
Demultiplexer and decode the encoded signal into the original signal
And a decoder for transmitting the transmitted voice code.
A third functional block corresponding to the encoder that encodes the encoded signal
And a code that encodes the same non-voice code signal transmitted.
And a fourth functional block corresponding to an encoder,
Of the voice / non-voice discriminator separated by the data separation unit
Depending on the judgment result, the third functional block or the third functional block
There is a switching means for selecting any of the four functional blocks.
It is something.

【００２１】[0021]

【００２２】[0022]

【００２３】[0023]

【００２４】[0024]

【００２５】[0025]

【００２６】[0026]

【００２７】[0027]

【発明の実施の形態】実施の形態１.以下、本発明の実
施の形態１について図面を参照しながら説明する。図１
は本発明の実施の形態１における音声符号化・復号装置
の構成を示すブロック図である。図１の送信側におい
て、101は音声信号をあるアルゴリズムに基づき高能率
に圧縮符号化する符号器、102は符号器への入力信号が
音声信号か、非音声信号（例えば、ＤＴＭＦ信号、No.5
シグナリング等)かを識別し、判定結果を出力する音声
／非音声信号識別器、103、104は切替スイッチ、105、1
06はそれぞれ符号器101の一部機能を実行する符号化処
理機能ブロック、107は識別器の判定結果と、符号器の
出力とを多重化して伝送路に出力する多重化部である。
ここで符号化処理機能ブロック105は、音声信号を対象
に高能率に圧縮符号化できるように最適化がなされてい
る。一方、符号化処理機能ブロック106は非音声信号、
例えばDTMF信号を少ない歪で圧縮符号化できるように最
適化がなされている。BEST MODE FOR CARRYING OUT THE INVENTION Embodiment 1 Hereinafter, Embodiment 1 of the present invention will be described with reference to the drawings. Figure 1
FIG. 3 is a block diagram showing a configuration of a voice encoding / decoding device according to Embodiment 1 of the present invention. In the transmission side of FIG. 1, 101 is a coder for highly efficient compression coding of a voice signal based on an algorithm, and 102 is a voice signal or a non-voice signal (for example, DTMF signal, No. Five
(Signaling etc.) and output the judgment result, a voice / non-voice signal discriminator, 103 and 104 are changeover switches, 105 and 1
Reference numeral 06 denotes an encoding processing functional block that executes a part of the functions of the encoder 101, and reference numeral 107 denotes a multiplexer that multiplexes the determination result of the discriminator and the output of the encoder and outputs the multiplexed result to the transmission path.
Here, the encoding processing function block 105 is optimized so that the audio signal can be compression-encoded with high efficiency. On the other hand, the encoding processing function block 106 is a non-voice signal,
For example, the DTMF signal is optimized so that it can be compression-encoded with little distortion.

【００２８】また、図１の受信側において、201は符号
器101と同一のアルゴリズムに基づき高能率に圧縮され
た音声符号を音声信号に復号する復号器、202は多重化
部107で多重化された音声／非音声信号識別器102の判定
結果と、符号器101の出力とを分離して出力する多重分
離部、203、204は切替スイッチ、205、206はそれぞれ復
号器201の一部機能を実行する復号処理機能ブロックで
ある。ここで、205、206はそれぞれ符号器101の符号化
処理機能ブロック105、106に対応している、または機能
的に全く同一のものである。On the receiving side of FIG. 1, 201 is a decoder for decoding a highly efficient compressed voice code into a voice signal based on the same algorithm as the encoder 101, and 202 is multiplexed by a multiplexer 107. The demultiplexing unit that separates and outputs the determination result of the speech / non-speech signal discriminator 102 and the output of the encoder 101, 203 and 204 are changeover switches, and 205 and 206 are partial functions of the decoder 201, respectively. It is a decryption processing function block to be executed. Here, 205 and 206 respectively correspond to the encoding processing functional blocks 105 and 106 of the encoder 101 or are functionally the same.

【００２９】次に、この音声符号化・復号装置の動作に
ついて説明する。図１の送信側において、音声／非音声
信号識別器102は、入力される信号が音声信号か、非音
声信号であるかを常に監視し、その判定結果に基づいて
符号器101の動作モードを決定する。音声／非音声信号
識別器102で「音声」と判定されたときは、切替スイッ
チ103を端子103A側に、同104を端子104A側にそれぞれ倒
す。その結果、符号器101の内部において、符号化処理
処理機能ブロック105が選択され、音声信号を高能率に
符号化するのに適した動作モード(以下、■音声モード
■と称する)となる。このモードにおいて、符号器101は
音声信号を符号化アルゴリズムに基づいて符号化処理を
実行し、入力音声に対応する符号を出力する。Next, the operation of this speech encoding / decoding device will be described. On the transmission side of FIG. 1, the voice / non-voice signal discriminator 102 constantly monitors whether the input signal is a voice signal or a non-voice signal, and determines the operation mode of the encoder 101 based on the determination result. decide. When the voice / non-voice signal discriminator 102 determines "voice", the changeover switch 103 is turned to the terminal 103A side and the switch 104 is turned to the terminal 104A side. As a result, the encoding processing function block 105 is selected inside the encoder 101, and the operation mode suitable for encoding the audio signal with high efficiency (hereinafter, referred to as (2) audio mode (3)) is set. In this mode, the encoder 101 executes the encoding process on the audio signal based on the encoding algorithm, and outputs the code corresponding to the input audio.

【００３０】また、音声／非音声信号識別器102で「非
音声」と判定されたときは、切替スイッチ103を端子103
B側に、同104を端子104B側にそれぞれ倒す。その結果、
符号器101の内部において、符号化処理機能ブロック106
が選択され、非音声信号、例えばDTMF信号等を少ない歪
で圧縮符号化するのに適した動作モード(以下■非音声
モード■と称する)となる。このモードにおいて、符号
器101は非音声信号、例えばDTMF信号等を符号化アルゴ
リズムに基づいて符号化処理を実行し、入力された非音
声信号に対応する符号を出力する。When the voice / non-voice signal discriminator 102 determines "non-voice", the changeover switch 103 is set to the terminal 103.
Tilt the same 104 to the B side and the terminal 104B side. as a result,
Inside the encoder 101, the encoding processing function block 106
Is selected, and an operation mode (hereinafter referred to as (1) non-voice mode (3)) suitable for compressing and encoding a non-voice signal, such as a DTMF signal, with a small distortion is selected. In this mode, the encoder 101 executes a coding process on a non-voice signal, such as a DTMF signal, based on a coding algorithm, and outputs a code corresponding to the input non-voice signal.

【００３１】音声／非音声信号識別器102の動作につい
て、一例として、識別の対象となる非音声信号にDTMF信
号を用いて説明する。DTMF信号はデュアルトーンで構成
されており、出力される信号の周波数成分は規定により
特定の値に固定されている事から、・FFT等を用いて周波数分析を行う・バンドパスフィルタを用いて特定の周波数成分を濾波
する等の方法を用いて、周波数軸上の特徴量を抽出し、DTMF
信号の持つ特徴量と一致するか否かを判定する事により
識別する事が出来る。The operation of the voice / non-voice signal discriminator 102 will be described by using a DTMF signal as a non-voice signal to be discriminated as an example. Since the DTMF signal is composed of dual tones and the frequency component of the output signal is fixed to a specific value by regulation, ・ FFT etc. is used for frequency analysis ・ Specified using a bandpass filter The feature amount on the frequency axis is extracted using a method such as filtering the frequency component of
It can be identified by determining whether or not it matches the feature quantity of the signal.

【００３２】また、DTMF信号のレベルについても、送出
レベルが規定により特定の範囲に限定されている事、レ
ベルの変動が少ない事などから、比較的レベル変動が大
きくダイナミックレンジの広い音声信号とは明らかに異
なった特徴を示す。従って、入力信号のレベルを監視す
る事により、DTMF信号識別のための補助情報として用い
る事でDTMF信号の検出精度を向上させる事も出来る。音
声／非音声信号識別器102では上記のパラメータを入力
信号を用いて独自に算出し、それらを基に判定を下して
結果を出力する機能を持つ。Regarding the level of the DTMF signal as well, since the transmission level is limited to a specific range by regulation and the level fluctuation is small, an audio signal with a relatively large level fluctuation and a wide dynamic range is used. They show distinctly different characteristics. Therefore, by monitoring the level of the input signal, it is possible to improve the detection accuracy of the DTMF signal by using it as auxiliary information for identifying the DTMF signal. The voice / non-voice signal discriminator 102 has a function of uniquely calculating the above parameters using the input signals, making a decision based on them, and outputting the result.

【００３３】多重化部107では、以上のようにして得ら
れた音声信号、或いは非音声信号が符号化されたもの
(以下、音声／非音声符号と称する)と、音声／非音声信
号識別器102の出力である入力信号の識別結果(音声信号
か、非音声信号か)を多重化して伝送路へ送出する。The multiplexing unit 107 encodes the voice signal or the non-voice signal obtained as described above.
(Hereinafter, referred to as voice / non-voice code) and the discrimination result (voice signal or non-voice signal) of the input signal which is the output of the voice / non-voice signal discriminator 102 are multiplexed and sent to the transmission path.

【００３４】一方、図１の受信側においては、まず伝送
路から受信した信号列から、多重分離部202において音
声／非音声符号と、音声／非音声信号識別器102の判定
結果とに分離する。このように信号列から取り出された
音声／非音声信号識別器102の判定結果が、「音声」で
あれば切替スイッチ203を端子203A側に、同204を端子20
4A側にそれぞれ倒す。その結果、復号器201の内部にお
いて復号処理機能ブロック205が選択され、符号器101の
音声モードに対応した復号器の動作モードとなる。この
モードにおいて、復号器201は復号アルゴリズムに基づ
いて復号処理を実行し音声信号を復号する。このとき、
符号化・復号処理はいずれも音声モードで実行されてい
るので、復号された音声信号は符号化アルゴリズムがも
つ本来の性能に見合った品質となっている。On the other hand, on the receiving side in FIG. 1, first, the signal sequence received from the transmission path is separated in the demultiplexing unit 202 into the voice / non-voice code and the determination result of the voice / non-voice signal discriminator 102. . If the determination result of the voice / non-voice signal discriminator 102 extracted from the signal sequence is “voice”, the changeover switch 203 is set to the terminal 203A side and the switch 204 is set to the terminal 20.
Push down to the 4A side. As a result, the decoding processing function block 205 is selected inside the decoder 201, and the operation mode of the decoder corresponding to the voice mode of the encoder 101 is set. In this mode, the decoder 201 performs a decoding process based on a decoding algorithm and decodes an audio signal. At this time,
Since both the encoding and decoding processes are executed in the voice mode, the quality of the decoded voice signal matches the original performance of the encoding algorithm.

【００３５】また、多重分離部202で信号列から取り出
された音声／非音声信号識別器102の判定結果が、「非
音声」であれば切替スイッチ203を端子203B側に、同204
を端子204B側にそれぞれ倒す。その結果、復号器201の
内部において復号処理機能ブロック206が選択され、符
号器101の非音声モードに対応した復号器の動作モード
となる。このモードにおいて、復号器201は復号アルゴ
リズムに基づいて復号処理を実行し非音声信号を復号す
る。このとき、符号化・復号処理はいずれも非音声モー
ドで実行されているので、復号された非音声信号は音声
モードで実行されるよりも一層歪の少ないものとなって
いる。If the determination result of the voice / non-voice signal discriminator 102 extracted from the signal sequence by the demultiplexing unit 202 is "non-voice", the changeover switch 203 is moved to the terminal 203B side,
To the terminals 204B side. As a result, the decoding processing function block 206 is selected inside the decoder 201, and the operation mode of the decoder corresponding to the non-voice mode of the encoder 101 is set. In this mode, the decoder 201 performs a decoding process based on a decoding algorithm to decode a non-voice signal. At this time, since both the encoding / decoding processing is executed in the non-speech mode, the decoded non-speech signal has less distortion than that executed in the speech mode.

【００３６】以上のように、本実施の形態に依れば、音
声信号伝送時には音声の符号化により適した通常の音声
符号化・復号アルゴリズムを用いた方法で、また、非音
声信号、特にDTMF信号等の伝送時においては、一部の処
理機能ブロックを非音声信号の符号化により適した方法
に切替えて符号化・復号処理を実行するので、非音声信
号伝送時に伝送速度を上げる事無く、高品質の非音声信
号を伝送する事が出来る。As described above, according to the present embodiment, a normal voice encoding / decoding algorithm suitable for voice encoding is used during voice signal transmission, and a non-voice signal, especially DTMF is used. When transmitting signals, etc., some processing function blocks are switched to a method more suitable for encoding non-voice signals and encoding / decoding processing is executed, so without increasing the transmission rate during non-voice signal transmission, High quality non-voice signal can be transmitted.

【００３７】また、本実施の形態においては、符号化・
復号処理の一部に変更を加えるものであり、アルゴリズ
ムの本質に関わるような切替を行うものではないため、
例えば、音声信号入力中に、音声／非音声信号識別器10
2で「非音声」と誤識別した場合でも、多少の劣化はあ
るものの、ある程度の音声伝送品質は維持できるので、
通話中に耳触りとなるような弊害は抑えられるといった
利点もある。Further, in the present embodiment, encoding /
Since it changes a part of the decryption process and does not perform switching related to the essence of the algorithm,
For example, during voice signal input, the voice / non-voice signal discriminator 10
Even if it is erroneously identified as "non-voice" in 2, there is some degradation, but some voice transmission quality can be maintained, so
There is also an advantage that it is possible to suppress the harmful effects that may be felt during a call.

【００３８】実施の形態２.以下に、本発明に係る実施
の形態２について、図２、図３を参照しながら説明す
る。本実施の形態は実施の形態１のモード切替に関し、
符号化処理機能ブロック105、106の一つの動作例につい
て詳細に述べたものである。ここで、説明を判り易くす
るために、符号化アルゴリズムの一例としてCS-ACELP方
式(ITU-T勧告G.729準拠)を用いる事とする。CS-ACELP方
式に基づく符号器の詳細なブロック図は上記に示した図
２０に、復号器の詳細なブロック図は同じく図２１に示
した通りである。なお、図１と同一符号を記した構成要
素は、上記実施の形態１の項で説明したものと同一の機
能を持つ構成要素であるため説明の重複を省く。Second Embodiment Hereinafter, a second embodiment according to the present invention will be described with reference to FIGS. 2 and 3. The present embodiment relates to the mode switching of the first embodiment,
One operation example of the encoding processing function blocks 105 and 106 has been described in detail. Here, in order to make the description easy to understand, the CS-ACELP system (conforming to ITU-T recommendation G.729) is used as an example of the encoding algorithm. The detailed block diagram of the encoder based on the CS-ACELP system is as shown in FIG. 20 described above, and the detailed block diagram of the decoder is also shown in FIG. 21. Note that the constituent elements denoted by the same reference numerals as those in FIG. 1 are constituent elements having the same functions as those described in the section of the above-described first embodiment, and therefore redundant description will be omitted.

【００３９】本実施の形態においては、送信側(符号器
側)に非音声信号、例えばDTMF信号によって学習されたL
SP量子化符号帳2(310A)を別途用意する。即ち、図２に
示すように、音声入力に最適化された符号化処理機能ブ
ロック105には、音声により学習されたLSP量子化符号帳
1(310)が対応し、非音声信号に最適化された符号化処理
機能ブロック106にはLSP量子化符号帳2(310A)が対応す
る。音声／非音声信号識別器102が「音声」と判定した
ときは、切替スイッチ103を端子103Aに、同104を端子10
4Aにそれぞれ倒し、LSP量子化符号帳1(310)に基づくLSP
のベクトル量子化を実行する。一方、音声／非音声信号
識別器102が「非音声」と判定したときは、切替スイッ
チ103を端子103Bに、同104を端子104Bにそれぞれ倒し、
LSP量子化符号帳2(310A)に基づくLSPのベクトル量子化
を実行する。In this embodiment, the L side learned by a non-voice signal, for example, a DTMF signal, on the transmission side (encoder side).
SP Quantization Codebook 2 (310A) is prepared separately. That is, as shown in FIG. 2, the LSP quantized codebook learned by voice is included in the encoding processing function block 105 optimized for voice input.
1 (310) corresponds, and the LSP quantization codebook 2 (310A) corresponds to the encoding processing function block 106 optimized for non-voice signals. When the voice / non-voice signal discriminator 102 determines that it is “voice”, the changeover switch 103 is set to the terminal 103A and the switch 104 is set to the terminal 10A.
LSP based on LSP Quantization Codebook 1 (310)
Perform vector quantization of. On the other hand, when the voice / non-voice signal discriminator 102 determines that it is “non-voice”, the changeover switch 103 is set to the terminal 103B, and the same switch 104 is set to the terminal 104B.
Executes vector quantization of LSP based on LSP quantization codebook 2 (310A).

【００４０】受信側(復号器側)においても同様に、図３
に示すように、LSP量子化符号帳1(406)に加えて、LSP量
子化符号帳2(310A)と全く同一のLSP量子化符号帳2(406
A)を用意する。即ち、音声入力に最適化された復号処理
機能ブロック205には、LSP量子化符号帳1(406)が対応
し、非音声信号に最適化された復号処理機能ブロック20
6にはLSP量子化符号帳2(406A)が、それぞれ対応する。
多重分離部202において、取り出された音声／非音声信
号識別器102の判定結果が、「音声」であれば、切替ス
イッチ203を端子203A側に、同204を端子204A側にそれぞ
れ倒し、LSP量子化符号帳1(406)から、送信されたイン
デックスに対応するLSPベクトルを抽出する。一方、多
重分離部202で取り出された音声／非音声信号識別器102
の判定結果が、「非音声」であれば、切替スイッチ203
を端子203B側に、同204を端子204B側にそれぞれ倒し、L
SP量子化符号帳2(406A)から、送信されたインデックス
に対応するLSPベクトルを抽出する。Similarly, on the receiving side (decoder side), as shown in FIG.
In addition to the LSP quantization codebook 1 (406), the LSP quantization codebook 2 (406A) that is exactly the same as the LSP quantization codebook 2 (310A).
Prepare A). That is, the decoding processing function block 205 optimized for voice input corresponds to the LSP quantization codebook 1 (406), and the decoding processing function block 20 optimized for non-voice signals is used.
LSP quantized codebook 2 (406A) corresponds to 6, respectively.
In the demultiplexing unit 202, if the extracted result of the voice / non-voice signal discriminator 102 is “voice”, the changeover switch 203 is turned to the terminal 203A side, the switch 204 is turned to the terminal 204A side, and the LSP quantum The LSP vector corresponding to the transmitted index is extracted from the coded codebook 1 (406). On the other hand, the voice / non-voice signal discriminator 102 extracted by the demultiplexing unit 202
If the result of the determination is “non-voice”, the changeover switch 203
To the terminal 203B side and the same 204 to the terminal 204B side.
The LSP vector corresponding to the transmitted index is extracted from SP quantization codebook 2 (406A).

【００４１】以上のように、本実施の形態に依れば、音
声信号が入力された場合は、音声によって学習された符
号帳を用いてLSPが量子化され、また、DTMF信号が入力
された場合は、DTMF信号によって学習された符号帳を用
いてLSPが量子化されるようにしているので、音声信号
伝送時には、より音声信号に適した符号化を、DTMF信号
伝送時には、よりDTMF信号に適した符号化を行う事が出
来る。この手段により、DTMF信号のスペクトル包絡情報
を精度良く、かつ効率的に量子化する事が出来るため、
DTMF信号伝送時の量子化歪をより低減する事ができる。As described above, according to the present embodiment, when the voice signal is input, the LSP is quantized using the codebook learned by the voice, and the DTMF signal is input. In this case, the LSP is quantized by using the codebook learned by the DTMF signal. Therefore, when transmitting a voice signal, encoding that is more suitable for the voice signal, and when transmitting the DTMF signal, a more DTMF signal is used. Appropriate encoding can be performed. By this means, the spectral envelope information of the DTMF signal can be quantized accurately and efficiently,
It is possible to further reduce quantization distortion during DTMF signal transmission.

【００４２】実施の形態３.以下に、本発明に係る実施
の形態３について図４〜図８を参照しながら説明する。
本実施の形態は実施の形態１のモード切替に関し、符号
化処理機能ブロック105、106の一つの動作例について詳
細に述べたものである。ここで、説明を判り易くするた
めに、符号化アルゴリズムの一例としてCS-ACELP方式(I
TU-T勧告G.729準拠)を用いる事とする。なお、図１と同
一符号を記した構成要素は、上記実施の形態１の項で説
明したものと同一の機能を持つ構成要素であるため説明
の重複を省く。Third Embodiment Hereinafter, a third embodiment according to the present invention will be described with reference to FIGS.
The present embodiment relates to the mode switching of the first embodiment and describes in detail one operation example of the coding processing functional blocks 105 and 106. Here, in order to make the explanation easy to understand, the CS-ACELP method (I
TU-T Recommendation G.729). Note that the constituent elements denoted by the same reference numerals as those in FIG. 1 are constituent elements having the same functions as those described in the section of the above-described first embodiment, and therefore redundant description will be omitted.

【００４３】本実施の形態は、通常のDTMF信号伝送時に
おいて、スペクトル包絡の微妙な揺らぎが発生しない事
に着目したものである。符号器の動作について図４を用
いて説明する。図４はCS-ACELP方式に基づく本発明の符
号器の処理機能ブロックの一部を示したものである。本
実施の形態においては、量子化方式の異なるLSP量子化
部を２種類持っている。即ち、LSP量子化部1(309)は、
例えば、図２２に示されたような、音声信号の入力に対
して特化された符号帳探索方式を実現する。一方、LSP
量子化部2(309A)は、非音声信号の入力に対して特化さ
れた符号帳探索方式を実現する。即ち、音声信号に最適
化された符号化処理機能ブロック105には、LSP量子化部
1(309)が対応し、非音声信号に最適化された符号化処理
機能ブロック106にはLSP量子化部2(309A)が対応する。
音声／非音声信号識別器102が「音声」と判定したとき
は、切替スイッチ103を端子103Aに、同104を端子104Aに
それぞれ倒し、LSP量子化部1(309)の量子化方式に基づ
くLSP量子化を実行する。一方、音声／非音声信号識別
器102が「非音声」と判定したときは、切替スイッチ103
を端子103Bに、同104を端子104Bにそれぞれ倒し、LSP量
子化部2(309A)の量子化方式に基づくLSP量子化を実行す
る。The present embodiment focuses on the fact that no subtle fluctuations in the spectrum envelope occur during normal DTMF signal transmission. The operation of the encoder will be described with reference to FIG. FIG. 4 shows a part of processing functional blocks of the encoder of the present invention based on the CS-ACELP system. In this embodiment, there are two types of LSP quantizers having different quantizing methods. That is, the LSP quantizer 1 (309)
For example, a codebook search method specialized for voice signal input as shown in FIG. 22 is realized. On the other hand, LSP
The quantizer 2 (309A) realizes a codebook search method specialized for the input of a non-voice signal. That is, the LSP quantizer is included in the encoding processing function block 105 optimized for the audio signal.
1 (309) corresponds, and the LSP quantizer 2 (309A) corresponds to the encoding processing functional block 106 optimized for non-voice signals.
When the voice / non-voice signal discriminator 102 determines that it is “voice”, the changeover switch 103 is set to the terminal 103A and the switch 104 is set to the terminal 104A, respectively, and the LSP based on the quantization method of the LSP quantization unit 1 (309) is used. Performs quantization. On the other hand, when the voice / non-voice signal discriminator 102 determines “non-voice”, the changeover switch 103
To terminal 103B and terminal 104 to terminal 104B, respectively, to execute LSP quantization based on the quantization method of LSP quantization section 2 (309A).

【００４４】受信側においても同様に、図５に示すよう
に、LSP逆量子化部1(407)に加えて、LSP量子化部2(309
A)に対応したLSP逆量子化部2(407A)を用意する。即ち、
音声入力に最適化された復号処理機能ブロック205に
は、LSP逆量子化部1(407)が、非音声信号に最適化され
た復号処理機能ブロック206にはLSP逆量子化部2(407A)
がそれぞれ対応する。多重分離部202において取り出さ
れた音声／非音声信号識別器102の判定結果が「音声」
であれば切替スイッチ203を端子203A側に、同204を端子
204A側にそれぞれ倒しLSP逆量子化部1(407)に基づく方
法でLSPを抽出する。LSP逆量子化部1(407)は、例えば、
図２３に示されたようなLSP量子化部1(309)に対応した
逆量子化を実現する。一方、多重分離部202で取り出さ
れた音声／非音声信号識別器102の判定結果が「非音
声」であれば切替スイッチ203を端子203B側に、同204を
端子204B側にそれぞれ倒し、LSP逆量子化部2(407A)に基
づく方法でLSPを抽出する。Similarly, on the receiving side, as shown in FIG. 5, in addition to the LSP dequantization unit 1 (407), the LSP quantization unit 2 (309
An LSP dequantization unit 2 (407A) corresponding to A) is prepared. That is,
The decoding processing function block 205 optimized for voice input has an LSP dequantization unit 1 (407), and the decoding processing function block 206 optimized for non-voice signals has an LSP dequantization unit 2 (407A).
Correspond to each. The determination result of the voice / non-voice signal discriminator 102 extracted by the demultiplexing unit 202 is “voice”
If so, the changeover switch 203 to the terminal 203A side, the same 204 terminal
The LSP is extracted to the 204A side by a method based on the LSP dequantization unit 1 (407). LSP dequantization unit 1 (407), for example,
Inverse quantization corresponding to the LSP quantization unit 1 (309) as shown in FIG. 23 is realized. On the other hand, if the determination result of the voice / non-voice signal discriminator 102 extracted by the demultiplexing unit 202 is “non-voice”, the changeover switch 203 is turned to the terminal 203B side and the switch 204 is turned to the terminal 204B side, and the LSP reverse The LSP is extracted by the method based on the quantizer 2 (407A).

【００４５】LSP量子化部2(309A)の一例として、例えば
図６に示すような手段を用いる。これはLSP量子化部1(3
09)の、第２段LSP量子化符号帳336からの信号入力を無
効にしたものである。DTMF信号のような非音声信号につ
いては、スペクトル包絡はあらかじめ判っているため、
第１段LSP量子化において、入力が想定される非音声信
号のスペクトル包絡を少ない量子化歪で表現出来るベク
トルを、第１段LSP量子化符号帳335にあらかじめ埋め込
んでおく。これにより、第１段量子化で精度良く量子化
が可能となる。また、DTMF信号入力時には、包絡情報が
微妙に変動する事は通常ありえない。既に、第１段の量
子化である程度の精度は確保されているので第２段の微
調整は不要であり、逆に周波数歪の原因ともなる事が考
えられるため無効にしておく。As an example of the LSP quantizer 2 (309A), for example, the means shown in FIG. 6 is used. This is the LSP quantizer 1 (3
The signal input from the second stage LSP quantization codebook 336 in 09) is invalidated. For non-voice signals such as DTMF signals, the spectral envelope is known in advance,
In the first stage LSP quantization, a vector capable of expressing the spectrum envelope of a non-voice signal expected to be input with a small quantization distortion is embedded in the first stage LSP quantization codebook 335 in advance. As a result, the first-stage quantization can be quantized accurately. In addition, when the DTMF signal is input, the envelope information usually does not change subtly. Since a certain degree of accuracy has already been ensured by the first-stage quantization, the second-stage fine adjustment is unnecessary, and conversely it may cause frequency distortion, so it is disabled.

【００４６】また、送信側において図６に示すような方
式を用いた場合、LSP逆量子化部2(407A)の一例として、
例えば図７に示すような方式を用いる。これもまた、LS
P逆量子化部1(407)の、第２段LSP量子化符号帳336から
の信号入力を無効にしたものである。When the method shown in FIG. 6 is used on the transmission side, as an example of the LSP dequantization unit 2 (407A),
For example, the method shown in FIG. 7 is used. This is also LS
This is a signal in which the signal input from the second stage LSP quantization codebook 336 of the P inverse quantization unit 1 (407) is invalidated.

【００４７】以上のように、本発明の実施の形態３に依
れば、DTMF信号伝送時において、第１のLSP量子化手段
でDTMF信号のスペクトル包絡情報を精度良く量子化し、
また、DTMF信号伝送時にはスペクトル包絡情報の揺らぎ
が少ない事に着目して、第２のLSP量子化手段を省略す
るようにしているので、上記符号化・復号装置における
DTMF伝送歪を低減する事が出来る。As described above, according to the third embodiment of the present invention, at the time of transmitting a DTMF signal, the first LSP quantizing means accurately quantizes the spectrum envelope information of the DTMF signal,
Also, since the second LSP quantizing means is omitted, paying attention to the fact that the fluctuation of the spectrum envelope information is small at the time of transmitting the DTMF signal.
DTMF transmission distortion can be reduced.

【００４８】或いは、図８に示すように、実施の形態２
で説明した、DTMF信号で学習されたLSP量子化符号帳2(3
10A)を別途用意し、音声／非音声信号識別器102の判定
結果に応じて使用する符号帳を切替えるという手段を併
用しても、上記実施の形態と同様の効果を得る事が出来
る。Alternatively, as shown in FIG.
LSP quantized codebook 2 (3
10A) is separately prepared and the means for switching the codebook to be used according to the determination result of the voice / non-voice signal discriminator 102 is also used, it is possible to obtain the same effect as the above-mentioned embodiment.

【００４９】実施の形態４.上記の実施の形態２及び３
では、DTMF信号伝送時において、量子化歪に起因する周
波数歪を低減するように工夫したものであるが、次に、
DTMF信号の立上り、立ち下がり時に起きるスペクトル包
絡情報の急激な変化に追随できるようにするための実施
の形態を示す。図９は、この実施の形態４におけるLSP
量子化部の内部の構成を示すブロック構成図である。な
お、図９において、図４と同一符号を記した構成要素
は、上記実施の形態３の項で説明したものと同一の機能
を持つ構成要素であるため説明の重複を省く。Embodiment 4. Embodiments 2 and 3 described above
Then, when transmitting DTMF signals, it was devised to reduce frequency distortion due to quantization distortion.
An embodiment for enabling to follow abrupt changes in spectrum envelope information occurring at the rise and fall of the DTMF signal will be shown. FIG. 9 shows the LSP according to the fourth embodiment.
FIG. 3 is a block configuration diagram showing an internal configuration of a quantization unit. Note that, in FIG. 9, the constituent elements denoted by the same reference numerals as those in FIG. 4 are constituent elements having the same functions as those described in the section of the above-mentioned third embodiment, and therefore redundant description will be omitted.

【００５０】次に、符号器の動作を説明する。本実施の
形態において、図４に示したLSP量子化に関わるブロッ
ク群の動作は、実施の形態３によるものと全く同一であ
る。実施の形態３と異なるのはLSP量子化部2(309A)の内
部のみである。図９によれば、LSP量子化部2(309A)は移
動平均予測の機能を外した構成となっている。LSPの量
子化には、第１段LSP符号帳のみを用いる。これによ
り、音声／非音声信号識別器102で入力信号がDTMF信号
を識別したら、LSPの移動平均予測処理を停止させるの
と同じ効果が得られる。Next, the operation of the encoder will be described. In this embodiment, the operation of the block group related to LSP quantization shown in FIG. 4 is exactly the same as that according to the third embodiment. The difference from the third embodiment is only the inside of the LSP quantizer 2 (309A). According to FIG. 9, the LSP quantizer 2 (309A) has a configuration without the function of moving average prediction. Only the first stage LSP codebook is used for LSP quantization. As a result, when the voice / non-voice signal discriminator 102 discriminates the DTMF signal from the input signal, the same effect as stopping the LSP moving average prediction process can be obtained.

【００５１】一方、復号器201でも、符号器101との対応
を取るため、図７に示すLSP逆量子化部2(407A)から、移
動平均予測の機能を外した単純な構成とする事で、LSP
の復号を実現する事が出来る。On the other hand, the decoder 201 also has a simple structure in which the moving average prediction function is removed from the LSP dequantization unit 2 (407A) shown in FIG. 7 in order to correspond to the encoder 101. , LSP
Can be decrypted.

【００５２】或いは、図１０に示すように、実施の形態
２で説明したように、第１段LSP量子化符号帳2(335A)を
別途用意する手段を併用する事も出来る。このとき、移
動平均の過去のLSPに対する重み付けを軽くするように
構成された移動平均予測係数符号帳2(337A)をLSP量子化
符号帳2(310A)に盛り込む事で、移動平均予測効果の軽
減を実現する事が出来る。Alternatively, as shown in FIG. 10, as described in the second embodiment, a means for separately preparing the first stage LSP quantization codebook 2 (335A) can be used together. At this time, by incorporating the moving average prediction coefficient codebook 2 (337A) configured to reduce the weighting of the past LSP of the moving average into the LSP quantization codebook 2 (310A), the moving average prediction effect is reduced. Can be realized.

【００５３】この時、復号器201でも、符号器101との対
応を取るため、図１１に示すような構成とする事で、LS
Pの復号を実現する事が出来る。At this time, the decoder 201 also has a configuration as shown in FIG.
Decoding of P can be realized.

【００５４】以上のように、本発明の実施の形態４に依
れば、DTMF信号伝送時に、LSP量子化における移動平均
予測の効果を無くす、または効果を低減するような構成
としたのでDTMF信号の立上り、立ち下がり時において、
復号されるべきDTMF信号波形の特性を改善する事が出来
る。As described above, according to the fourth embodiment of the present invention, the effect of moving average prediction in LSP quantization is eliminated or reduced during DTMF signal transmission. At the rise and fall of
It is possible to improve the characteristics of the DTMF signal waveform to be decoded.

【００５５】実施の形態５．次に、ＤＴＭＦ信号を構成
するデュアルトーンの周波数近傍の情報をより正確に符
号化する事に着目した実施の形態を示す。図１２はこの
実施の形態５におけるLSP量子化部2(309A)の内部の構成
を示すブロック構成図である。なお、図１２において、
図６と同一符号を記した構成要素は、上記実施の形態３
の項で説明したものと同一の機能を持つ構成要素である
ため説明の重複を省く。Embodiment 5. Next, an embodiment will be described which focuses on more accurately encoding information in the vicinity of the frequencies of the dual tones forming the DTMF signal. FIG. 12 is a block configuration diagram showing an internal configuration of the LSP quantization unit 2 (309A) in the fifth embodiment. In addition, in FIG.
The constituent elements denoted by the same reference numerals as those in FIG. 6 are the same as those in the third embodiment.
Since it is a component having the same function as that described in the section 1), duplicate description will be omitted.

【００５６】次に、符号器の動作を説明する。本実施の
形態において、図１２に示したLSP量子化部2(309A)の動
作は、実施の形態３によるものと量子化誤差重み付け係
数計算部2(338A)の内部動作が異なるが、その他の動作
は実施の形態３と同一である。Next, the operation of the encoder will be described. In the present embodiment, the operation of the LSP quantization unit 2 (309A) shown in FIG. 12 is different from that of the third embodiment in the internal operation of the quantization error weighting coefficient calculation unit 2 (338A), but other The operation is the same as in the third embodiment.

【００５７】量子化誤差計算に用いる重み付け係数は、
CS-ACELP方式に依れば、以下の式に示される方法で計算
されている。The weighting coefficient used for the quantization error calculation is
According to the CS-ACELP method, it is calculated by the method shown in the following formula.

【００５８】[0058]

【数１】 [Equation 1]

【００５９】ここで、ωiはi次LSPである。即ち、スペ
クトルのピークがくる周波数域については重み付け係数
を重くし、スペクトルの”谷”になっている周波数域に
ついては重み付け係数が軽くなっている。これは、スペ
クトルのピークを示す周波数域については量子化誤差の
寄与分を重くして、誤差に対する感度を鋭くする効果が
ある。Here, ωi is an i-th order LSP. That is, the weighting coefficient is made heavier in the frequency range in which the peak of the spectrum comes, and the weighting coefficient is made lighter in the frequency range at the "valley" of the spectrum. This has the effect of making the contribution of the quantization error heavy in the frequency range showing the peak of the spectrum and sharpening the sensitivity to the error.

【００６０】ここで、DTMF信号のもつデュアルトーンの
スペクトル情報は、１次から６次付近の低次のLSPに集
中している事が知られている。量子化誤差重み付け係数
計算部2(338A)は、量子化誤差重み付け係数計算部(338)
からこの事を利用して改良が加えられたものである。即
ち、１０次のLSP係数のベクトル量子化において、ベク
トル距離算出時に、１次〜６次付近の低次のLSPの誤差
については重み付けをさらに重くし（即ち、わずかの誤
差にも鋭敏に感応するよう係数感度を高める）、高次の
LSPについては重み付けを軽くする（即ち、ある程度の
誤差は許容するよう係数感度を低く抑える）構成として
ある。これは、例えば、上記式におけるw1〜w6におい
て、１以上の係数を乗じる事によって簡単に実現する事
が出来る。It is known that the dual-tone spectrum information of the DTMF signal is concentrated in the low-order LSP around the 1st to 6th orders. The quantization error weighting coefficient calculation unit 2 (338A) is a quantization error weighting coefficient calculation unit (338)
It has been improved by utilizing this fact. That is, in the vector quantization of the 10th-order LSP coefficient, when calculating the vector distance, the weighting is further weighted for the error of the low-order LSP around the 1st to 6th order (that is, even a slight error is sensitive to the error). To increase the coefficient sensitivity), higher order
The LSP is configured to be lightly weighted (that is, the coefficient sensitivity is kept low to allow some error). This can be easily realized, for example, by multiplying w1 to w6 in the above equation by a coefficient of 1 or more.

【００６１】以上のように、本発明の実施の形態５に依
れば、LSP量子化を、DTMF信号を構成するデュアルトー
ン周波数付近について、より精度を高める事が出来るの
で、LSPの量子化誤差を低減でき、よって、復号されたD
TMF信号の周波数歪を小さくする事ができる。As described above, according to the fifth embodiment of the present invention, since the LSP quantization can be made more accurate in the vicinity of the dual tone frequency forming the DTMF signal, the LSP quantization error can be increased. Can be reduced, and thus the decrypted D
It is possible to reduce the frequency distortion of the TMF signal.

【００６２】実施の形態６.以下に、本発明に係る実施
の形態６について、図１３を参照しながら説明する。本
実施の形態は、実施の形態１の音声／非音声信号識別器
102の出力である信号識別情報の多重化、即ち、多重化
部107及び多重分離部202に関する。Embodiment 6 Hereinafter, Embodiment 6 according to the present invention will be described with reference to FIG. The present embodiment is a voice / non-voice signal discriminator of the first embodiment.
The present invention relates to the multiplexing of signal identification information which is the output of 102, that is, the multiplexer 107 and the demultiplexer 202.

【００６３】次に、動作について説明する。図１３は、
あるパラメータをターゲットとして量子化するための量
子化器の構成の一つの例を示すブロック図である。量子
化値の候補となる信号を量子化符号帳340に予め蓄積し
ておく。量子化の対象となるターゲット信号が入力され
たら量子化符号帳の各候補について、ターゲット信号と
の誤差を最小自乗誤差計算部342で計算する。このよう
に、最小の自乗誤差を与える符号帳の候補を探索し、探
し出された候補に付けられたインデックスを引き出し最
適符号として量子化器から出力する。Next, the operation will be described. Figure 13
It is a block diagram which shows one example of a structure of the quantizer for quantizing a certain parameter as a target. Signals that are candidates for quantized values are stored in the quantized codebook 340 in advance. When the target signal to be quantized is input, the least square error calculator 342 calculates the error from the target signal for each candidate of the quantization codebook. In this way, a codebook candidate that gives the smallest squared error is searched, and the index attached to the found candidate is extracted and output from the quantizer as the optimum code.

【００６４】ここで、本実施の形態においては、最適符
号の探索時にある特定の制約条件を用いる。その一例と
して、音声信号伝送時には、図１３の量子化符号帳にお
いて白く示した部分(インデックス0000〜1010)のみを探
索する。また、DTMF信号伝送時においては、同じく斜線
で示した部分(インデックス1011〜1111)のみを探索す
る。このような制限を設ける事により、受信側の多重分
離部において、当該パラメータの量子化値を調べる事に
よって、音声／非音声信号識別器102の判定結果を知る
事が出来、識別パターンが多重化されたのと同等の効果
を呈する事が出来る。In this embodiment, a specific constraint condition is used when searching for the optimum code. As an example, when transmitting a voice signal, only the white parts (indexes 0000 to 1010) in the quantization codebook of FIG. 13 are searched. Also, during DTMF signal transmission, only the shaded portions (indexes 1011-1111) are searched for. By providing such a limitation, the demultiplexing unit on the receiving side can know the determination result of the voice / non-voice signal discriminator 102 by checking the quantized value of the parameter, and the discrimination pattern can be multiplexed. It can have the same effect as that done.

【００６５】以上のように、本実施の形態６に依れば、
識別のためのビットを設ける必要が無いため、伝送速度
を増やさずに音声／非音声識別器の判定結果を受信側に
送る事が出来る。As described above, according to the sixth embodiment,
Since it is not necessary to provide a bit for identification, the determination result of the voice / non-voice identification device can be sent to the receiving side without increasing the transmission rate.

【００６６】実施の形態７.上記実施の形態６では、音
声信号伝送時とDTMF信号伝送時とで異なる量子化符号を
用いる事により、音声／非音声信号識別器102の判定結
果情報を埋め込む事を可能としたものであるが、次に、
ある量子化テーブルの１つを音声／非音声識別パターン
として用いる場合の実例を、図１４、１５を用いて詳し
く説明する。なお、図１〜図１３と同一符号を記した構
成要素は、上記実施の形態１〜６の項で説明したものと
同一の機能を持つ構成要素であるため説明の重複を省
く。Seventh Embodiment In the sixth embodiment, the determination result information of the voice / non-voice signal discriminator 102 is embedded by using different quantized codes during voice signal transmission and during DTMF signal transmission. Is possible, but next,
An actual example in which one of the quantization tables is used as a voice / non-voice identification pattern will be described in detail with reference to FIGS. The constituent elements denoted by the same reference numerals as those in FIGS. 1 to 13 are constituent elements having the same functions as those described in the above-described first to sixth embodiments, and therefore redundant description will be omitted.

【００６７】本実施の形態においては、例えば、上記実
施の形態３の手段とを併用する事で、より効率の良い音
声／非音声信号伝送を実現する事が出来る。ここで、図
１４は、音声信号伝送時に用いられるLSP量子化部1(30
9)、及びLSP量子化符号帳1(310)の内部構造を示すブロ
ック図、図１５は、同じくDTMF信号伝送時に用いられる
LSP量子化部2(309A)、及びLSP量子化符号帳2(310A)の内
部構造を示すブロック図である。実施の形態６における
量子化符号帳には、本実施の形態において、第２段LSP
量子化符号帳336を対応させる。DTMF信号伝送時に割当
てられている量子化の候補は、第２段LSP量子化符号帳3
36において斜線で示されているただ１つの候補のみであ
る。残りの候補は音声信号伝送時に割当てられ上記候補
はこのモードにおいて禁止符号となる。DTMF信号伝送時
に割当てられている候補は１つしかないので、そのイン
デックスはそのままDTMF伝送モードである事を識別する
パターンとして取り扱う事が出来る。In the present embodiment, more efficient voice / non-voice signal transmission can be realized by using, for example, the means of the third embodiment. Here, FIG. 14 shows the LSP quantizer 1 (30
9) and a block diagram showing the internal structure of LSP quantized codebook 1 (310), and FIG. 15 is also used during DTMF signal transmission.
FIG. 3 is a block diagram showing an internal structure of an LSP quantization unit 2 (309A) and an LSP quantization codebook 2 (310A). In the quantized codebook according to the sixth embodiment, the second-stage LSP in the present embodiment is used.
Corresponds to the quantization codebook 336. The quantization candidates assigned during DTMF signal transmission are the second stage LSP quantization codebook 3
There is only one candidate, which is shaded in 36. The remaining candidates are assigned at the time of voice signal transmission, and the above candidates become the prohibition code in this mode. Since there is only one candidate assigned during DTMF signal transmission, its index can be treated as it is as a pattern for identifying the DTMF transmission mode.

【００６８】ここで、DTMF信号伝送時に割当てられてい
る第２段LSP符号は１つしか割当てられていないが、図
１５に示すように、第１段LSP量子化符号帳2(335A)でDT
MF信号のスペクトル包絡情報を十分に表現出来る構成と
しているので、第２段LSP量子化を実行する必要が無
く、従って第２段LSP量子化の情報を送る必要は元々無
い。よって、特性の劣化等の問題は発生しない。Here, although only one second-stage LSP code is assigned at the time of DTMF signal transmission, as shown in FIG. 15, the first-stage LSP quantization codebook 2 (335A) is used for DT.
Since the spectrum envelope information of the MF signal can be sufficiently expressed, it is not necessary to execute the second stage LSP quantization, and therefore, there is originally no need to send the information of the second stage LSP quantization. Therefore, problems such as deterioration of characteristics do not occur.

【００６９】以上説明した符号器の動作の結果、多重化
部107が構成する符号器から出力されるフレームフォー
マットは図１６の通りとなる。DTMF信号伝送モードにお
いては、第２段LSP量子化で禁止符号としたインデック
スをDTMF信号伝送識別パターンとして送っている事に特
徴がある。受信側の多重分離部202においては、第２段L
SPを常に監視し、この識別パターンと一致したらDTMF信
号伝送モードと解釈して復号器201を制御する。As a result of the operation of the encoder described above, the frame format output from the encoder formed by the multiplexing unit 107 is as shown in FIG. The DTMF signal transmission mode is characterized in that the index used as the inhibition code in the second stage LSP quantization is sent as the DTMF signal transmission identification pattern. In the demultiplexing unit 202 on the receiving side, the second stage L
The SP is constantly monitored, and if it matches this identification pattern, it is interpreted as the DTMF signal transmission mode and the decoder 201 is controlled.

【００７０】以上のように、本実施の形態７に依れば、
音声伝送時に選択できなくなる禁止符号(量子化ステッ
プ)を僅か１つに抑えたため、音声伝送時の品質をほと
んど劣化させずに、音声／非音声信号識別器102の判定
結果を伝送する信号列に埋め込む事が出来るという利点
がある。また、DTMF信号伝送時においては、DTMF信号を
符号化伝送するために特化されたアルゴリズムとなって
おり、識別パターンとなる選択禁止符号を送信するため
の十分なビット数が確保されているためDTMF信号の劣化
も無いという優れた利点がある。As described above, according to the seventh embodiment,
Since the number of prohibited codes (quantization steps) that cannot be selected during voice transmission is suppressed to only one, the signal sequence for transmitting the determination result of the voice / non-voice signal discriminator 102 hardly deteriorates the quality during voice transmission. It has the advantage that it can be embedded. Also, during DTMF signal transmission, it is an algorithm specialized for encoding and transmitting DTMF signals, and a sufficient number of bits for transmitting the selection prohibition code that is the identification pattern is secured. There is an excellent advantage that there is no deterioration of the DTMF signal.

【００７１】実施の形態８以上の実施の形態７では、分割する量子化符号帳にLSP
量子化符号帳を用いた場合について述べたが、次に、図
２０における利得量子化部316及び利得量子化符号帳317
を用いた場合の実例を、図１７及び図１８を用いて詳し
く説明する。Eighth Embodiment In the above seventh embodiment, the LSP is used as the quantization codebook to be divided.
The case of using the quantization codebook has been described. Next, the gain quantization unit 316 and the gain quantization codebook 317 in FIG.
An actual example of using will be described in detail with reference to FIGS. 17 and 18.

【００７２】本実施の形態８は、DTMF信号伝送時におい
て、DTMF信号の立上り、立ち下がり時を除けば利得の変
動が極めて少ない事に着目した手段である。識別情報の
多重化手段として、利得量子化ベクトル符号帳の一つを
禁止し、そのインデックスを識別パターンに用いる点の
他は実施の形態７の場合と同一である。音声信号伝送時
において、サブフレーム(5msec)毎に行っている利得の
量子化を、利得変動の少ないDTMF信号伝送時において
は、フレーム毎(10msec)に間引いて冗長度を減らす代わ
りに識別パターンを埋め込む事を特徴とする。The eighth embodiment is a means focusing on the fact that the fluctuation of the gain is extremely small during the transmission of the DTMF signal except when the DTMF signal rises and falls. The present embodiment is the same as the case of the seventh embodiment except that one of the gain-quantized vector codebooks is prohibited as the identification information multiplexing means and the index is used for the identification pattern. At the time of voice signal transmission, the gain quantization performed every subframe (5 msec) is thinned out every frame (10 msec) to reduce the redundancy when the DTMF signal transmission with small gain fluctuation is performed. Characterized by embedding.

【００７３】利得量子化部316で実行される、利得量子
化のタイムスケジュールを図１８に示す。音声信号伝送
モードにおいては、１フレームに２回（即ちサブフレー
ム毎に）利得の量子化を実行する。第１サブフレームで
は通常の符号帳探索を行う。第２サブフレームの利得符
号帳探索において、実施の形態６で示した量子化方法を
用いる。即ち、音声信号伝送時においては唯一の禁止ベ
クトルを除いて全探索を行い、最適ベクトルのインデッ
クスを利得量子化部316から出力する。FIG. 18 shows a time schedule for gain quantization executed by the gain quantization unit 316. In the voice signal transmission mode, gain quantization is performed twice in one frame (that is, every subframe). A normal codebook search is performed in the first subframe. In the gain codebook search of the second subframe, the quantization method shown in Embodiment 6 is used. That is, when transmitting a voice signal, a full search is performed except for the only forbidden vector, and the index of the optimum vector is output from the gain quantizer 316.

【００７４】DTMF信号伝送時においては、利得の量子化
は１フレームに１回しか実行しない。これは、例えば、
全フレームを対象とした利得の量子化を行う事で、実現
する事が出来る。符号帳探索においては、上記音声信号
伝送モードにおける第１サブフレーム時と同様、全探索
を実行する。そして、１サブフレーム分の利得量子化情
報の代わりに、識別情報として音声入力時に探索禁止と
したベクトルのインデックスを利得量子化部316から出
力する。At the time of DTMF signal transmission, gain quantization is executed only once per frame. This is, for example,
This can be achieved by quantizing the gain for all frames. In the codebook search, the full search is executed as in the first subframe in the voice signal transmission mode. Then, instead of the gain quantization information for one subframe, the gain quantization unit 316 outputs the index of the vector for which the search is prohibited at the time of voice input as identification information.

【００７５】以上説明した符号器の動作の結果、多重化
部107が構成する符号器から出力されるフレームフォー
マットは図１７の通りとなる。DTMF信号伝送モードにお
いては、音声信号伝送モードにおいて第１サブフレーム
利得量子化情報を送っていた部分に、全フレームに対す
る利得量子化符号を挿入してある。また、第２サブフレ
ームにおいて利得符号の量子化において禁止符号とした
インデックスを、DTMF信号伝送識別パターンとして送っ
ている事に特徴がある。受信側の多重分離部202におい
ては、第２フレーム利得符号を監視しこの識別パターン
と一致したら、DTMF信号伝送モードと解釈して復号器20
1を制御する。As a result of the operation of the encoder described above, the frame format output from the encoder formed by the multiplexing unit 107 is as shown in FIG. In the DTMF signal transmission mode, the gain quantization code for all frames is inserted in the portion where the first subframe gain quantization information was sent in the voice signal transmission mode. Further, it is characterized in that the index used as the inhibition code in the quantization of the gain code in the second subframe is sent as the DTMF signal transmission identification pattern. In the demultiplexing unit 202 on the receiving side, the second frame gain code is monitored, and if it matches this identification pattern, it is interpreted as the DTMF signal transmission mode and the decoder 20
Control 1

【００７６】また、復号器201において、音声伝送モー
ドにおいては、送られてきた利得符号を基に逆量子化を
行いサブフレーム毎に利得値を更新して復号処理を行
う。一方、DTMF信号伝送モード時には、１フレームに１
つの利得符号しか送られてこないため利得の更新もフレ
ーム単位となる。In the voice transmission mode, the decoder 201 performs inverse quantization on the basis of the gain code sent thereto, updates the gain value for each subframe, and performs decoding processing. On the other hand, in DTMF signal transmission mode, 1 per frame
Since only one gain code is sent, the gain is updated in frame units.

【００７７】以上のように、本実施の形態８に依れば、
音声伝送時に選択できなくなる禁止符号（量子化ステッ
プ）を僅か１つに抑えたため、音声伝送時の品質をほと
んど劣化させずに、音声／非音声信号識別器102の判定
結果を伝送する信号列に埋め込む事が出来るという利点
がある。また、DTMF信号伝送時においては、DTMF信号を
符号化伝送するために特化されたアルゴリズムとなって
おり、識別パターンとなる選択禁止符号を送信するため
の十分なビット数が確保されているため、DTMF信号の劣
化をなくする事が出来る。As described above, according to the eighth embodiment,
Since the number of prohibited codes (quantization steps) that cannot be selected during voice transmission is suppressed to only one, the signal sequence for transmitting the determination result of the voice / non-voice signal discriminator 102 hardly deteriorates the quality during voice transmission. It has the advantage that it can be embedded. Also, during DTMF signal transmission, it is an algorithm specialized for encoding and transmitting DTMF signals, and a sufficient number of bits for transmitting the selection prohibition code that is the identification pattern is secured. , It is possible to eliminate the deterioration of the DTMF signal.

【００７８】実施の形態９.以下に、本発明に係る実施
の形態９について、図１９を参照しながら説明する。本
実施の形態は、音声／非音声信号識別器102の１つの動
作例について詳細に述べたものである。なお、図１と同
一符号を記した構成要素は、上記実施の形態１の項で説
明したものと同一の機能を持つ構成要素であるため説明
の重複を省く。Ninth Embodiment The ninth embodiment according to the present invention will be described below with reference to FIG. The present embodiment describes in detail one operation example of the voice / non-voice signal discriminator 102. Note that the constituent elements denoted by the same reference numerals as those in FIG. 1 are constituent elements having the same functions as those described in the section of the above-described first embodiment, and therefore redundant description will be omitted.

【００７９】次に、動作について、図１９を用いて説明
する。実施の形態１において、DTMF信号の識別手段を数
例提示したが、大きく分けて(1)周波数成分、(2)利得成
分の２点に特徴があるという事が言える。ここで、例え
ば符号化方式にCS-ACELP方式（図２０,図２１)を用いた
音声符号化・復号装置においては、入力された音声信号
を分析して効率よく符号化するため、線形予測法に基づ
く音声信号の周波数分析を実行する符号化処理機能ブロ
ックであるLP分析部302を有している。ここで計算され
るパラメータ(例えば反射係数等)から、DTMF信号の特徴
を示す情報を引き出す事が出来る。音声／非音声信号識
別器102は、この機能を備えており、音声／非音声信号
識別器102自身が周波数分析を実行する手段を持たな
い。なお、音声／非音声信号識別器102の詳細について
は、特開平9-326772に詳細に述べられているので、ここ
での説明は省く。Next, the operation will be described with reference to FIG. In the first embodiment, several examples of the DTMF signal identification means are presented, but it can be said that they are roughly divided into two characteristics: (1) frequency component and (2) gain component. Here, in a speech encoding / decoding device using, for example, the CS-ACELP scheme (FIGS. 20 and 21) as an encoding scheme, a linear prediction method is used in order to analyze and efficiently encode an input speech signal. An LP analysis unit 302, which is a coding processing functional block that executes frequency analysis of a voice signal based on Information indicating the characteristics of the DTMF signal can be derived from the parameters (eg, reflection coefficient) calculated here. The voice / non-voice signal discriminator 102 has this function, and the voice / non-voice signal discriminator 102 itself has no means for performing frequency analysis. Since the details of the voice / non-voice signal discriminator 102 are described in detail in Japanese Patent Laid-Open No. 9-326772, the description thereof is omitted here.

【００８０】以上のように、本実施の形態９に依れば、
符号化処理で必須のパラメータを流用して、音声／非音
声信号の識別に用いている構成となっているため、信号
識別のために必要なパラメータを簡単に得る事が出来
る。従って、信号識別のための処理負荷が軽くできる。
簡便な処理で実現できるため装置構成が簡単に出来る事
が期待でき、装置実現のためのコストを低減できるとい
った利点がある。As described above, according to the ninth embodiment,
Since the parameters required for the encoding process are diverted to be used for identifying the voice / non-voice signal, the parameters required for the signal identification can be easily obtained. Therefore, the processing load for signal identification can be reduced.
Since it can be realized by simple processing, it can be expected that the device configuration can be simplified, and there is an advantage that the cost for realizing the device can be reduced.

【００８１】[0081]

【発明の効果】以上のように、本発明に依れば、音声信
号伝送時には音声の符号化により適した、通常の音声符
号化・復号アルゴリズムを用いた方法で、また、非音声
信号、特にDTMF信号等の伝送時においては、一部の処理
機能ブロックを、非音声信号の符号化により適した方法
に切替えて、符号化・復号処理を実行するので、非音声
信号伝送時に、伝送速度を上げる事無く、高品質の非音
声信号を伝送する事が出来る。As described above, according to the present invention, when a voice signal is transmitted, a method using a normal voice encoding / decoding algorithm, which is more suitable for voice encoding, is used. When transmitting DTMF signals, etc., some processing function blocks are switched to a method more suitable for encoding non-voice signals, and encoding / decoding processing is executed. High quality non-voice signals can be transmitted without raising.

【００８２】また、本発明においては、符号化・復号処
理の一部に変更を加えるものであり、アルゴリズムの本
質に関わるような切替を行うものではないため、例え
ば、音声信号入力中に、音声／非音声信号識別器102で
「非音声」と誤識別した場合でも、多少の劣化はあるも
のの、ある程度の音声伝送品質は維持できるので、通話
中に耳触りとなるような弊害は抑えられる、といった利
点もある。In addition, in the present invention, a part of the encoding / decoding process is changed, and switching that does not relate to the essence of the algorithm is not performed. / Even if the non-voice signal discriminator 102 erroneously identifies it as "non-voice", although there is some deterioration, it is possible to maintain a certain level of voice transmission quality, so that it is possible to suppress the harmful effect on the ear during a call. There are also advantages.

【００８３】また、簡便な方法で構成された、識別性能
の良くない音声／非音声信号識別器を適用しても、ある
程度の音声品質の維持が可能である事から、装置構成を
簡単に出来、実現のためのコストが低減できるなどの優
れた効果がある。Further, even if a voice / non-voice signal discriminator having a poor discriminating performance constructed by a simple method is applied, the voice quality can be maintained to a certain extent, so that the device constitution can be simplified. There is an excellent effect that the cost for realization can be reduced.

[Brief description of drawings]

【図１】実施の形態１である音声符号化・復号装置の
ブロック構成図である。FIG. 1 is a block configuration diagram of a speech encoding / decoding device according to a first embodiment.

【図２】実施の形態２である音声符号器内のLSP量子化
に関わる部分のブロック構成図である。[Fig. 2] Fig. 2 is a block configuration diagram of a portion related to LSP quantization in the speech encoder according to the second embodiment.

【図３】実施の形態２である音声復号器内のLSP逆量子
化に関わる部分のブロック構成図である。[Fig. 3] Fig. 3 is a block configuration diagram of a portion related to LSP dequantization in the speech decoder according to the second embodiment.

【図４】実施の形態３である音声符号器内のLSP量子
化に関わる部分のブロック構成図である。[Fig. 4] Fig. 4 is a block configuration diagram of a portion related to LSP quantization in the speech coder according to the third embodiment.

【図５】実施の形態３である音声復号器内のLSP逆量
子化に関わる部分のブロック構成図である。[Fig. 5] Fig. 5 is a block configuration diagram of a portion related to LSP dequantization in the speech decoder according to the third embodiment.

【図６】実施の形態３である音声符号器内の第２のLS
P量子化部、及びLSP量子化符号帳の詳細構成図である。FIG. 6 is a second LS in the speech coder according to the third embodiment.
FIG. 3 is a detailed configuration diagram of a P quantization unit and an LSP quantization codebook.

【図７】実施の形態３である音声復号器内の第２のLS
P逆量子化部、及びLSP量子化符号帳の詳細構成図であ
る。FIG. 7 is a second LS in the speech decoder according to the third embodiment.
FIG. 3 is a detailed configuration diagram of a P inverse quantization unit and an LSP quantization codebook.

【図８】実施の形態３である音声符号器内のLSP量子
化に関わる部分のブロック構成図である。[Fig. 8] Fig. 8 is a block configuration diagram of a part related to LSP quantization in the speech encoder according to the third embodiment.

【図９】実施の形態４である音声符号器内の第２のLS
P量子化部、及びLSP量子化符号帳の詳細構成図である。FIG. 9 is a second LS in the speech coder according to the fourth embodiment.
FIG. 3 is a detailed configuration diagram of a P quantization unit and an LSP quantization codebook.

【図１０】実施の形態４である音声符号器内の第２の
LSP量子化部、及びLSP量子化符号帳の詳細構成図であ
る。FIG. 10 is a block diagram showing a second embodiment of the speech coder according to the fourth embodiment.
It is a detailed block diagram of an LSP quantization part and an LSP quantization codebook.

【図１１】実施の形態４である音声復号器内の第２の
LSP逆量子化部、及びLSP量子化符号帳の詳細構成図であ
る。FIG. 11 is a block diagram showing a second embodiment of the speech decoder according to the fourth embodiment.
FIG. 3 is a detailed configuration diagram of an LSP dequantization unit and an LSP quantization codebook.

【図１２】実施の形態５である音声符号器内の第２の
LSP量子化部、及びLSP量子化符号帳の詳細構成図であ
る。FIG. 12 is a block diagram illustrating a second embodiment of the speech coder according to the fifth embodiment.
It is a detailed block diagram of an LSP quantization part and an LSP quantization codebook.

【図１３】実施の形態６である音声符号器内の量子化
符号帳の構成例の一つを示す図である。[Fig. 13] Fig. 13 is a diagram illustrating one example of a configuration of a quantization codebook in the speech coder according to the sixth embodiment.

【図１４】実施の形態７である音声符号器内の第１の
LSP量子化部、及び第１のLSP量子化符号帳の詳細構成図
である。FIG. 14 is a diagram showing a first embodiment of the speech coder according to the seventh embodiment.
It is a detailed block diagram of an LSP quantization part and a 1st LSP quantization codebook.

【図１５】実施の形態７である音声符号器内の第２の
LSP量子化部、及び第２のLSP量子化符号帳の詳細構成図
である。FIG. 15 is a block diagram illustrating a second embodiment of the speech coder according to the seventh embodiment.
It is a detailed block diagram of an LSP quantization part and a 2nd LSP quantization codebook.

【図１６】実施の形態７である音声符号化・復号装置
の信号伝送時のフレームフォーマットを示す図である。FIG. 16 is a diagram showing a frame format at the time of signal transmission of the speech encoding / decoding device according to the seventh embodiment.

【図１７】実施の形態８である音声符号化・復号装置
の信号伝送時のフレームフォーマットを示す図である。FIG. 17 is a diagram showing a frame format at the time of signal transmission of the audio encoding / decoding device according to the eighth embodiment.

【図１８】実施の形態８である音声符号化・復号装置
の利得量子化処理ブロックのタイムスケジュールを記し
た表である。FIG. 18 is a table showing a time schedule of gain quantization processing blocks of the speech encoding / decoding device according to the eighth embodiment.

【図１９】実施の形態９に係る音声符号化・復号装置
のブロック構成図である。FIG. 19 is a block configuration diagram of a speech encoding / decoding device according to a ninth embodiment.

【図２０】CS-ACELP方式に基づく符号器の内部構成を示
すブロック図である。FIG. 20 is a block diagram showing an internal configuration of an encoder based on the CS-ACELP system.

【図２１】CS-ACELP方式に基づく復号器の内部構成を示
すブロック図である。FIG. 21 is a block diagram showing an internal configuration of a decoder based on the CS-ACELP method.

【図２２】CS-ACELP方式における符号器の、LSP量子化
部、及びLSP量子化符号帳の内部詳細構成を示すブロッ
ク図である。[Fig. 22] Fig. 22 is a block diagram illustrating an internal detailed configuration of an LSP quantization unit and an LSP quantization codebook of an encoder in the CS-ACELP system.

【図２３】CS-ACELP方式における復号器の、LSP逆量子
化部、及びLSP量子化符号帳の内部詳細構成を示すブロ
ック図である。[Fig. 23] Fig. 23 is a block diagram illustrating an internal detailed configuration of an LSP dequantization unit and an LSP quantization codebook of a decoder in the CS-ACELP system.

【図２４】従来の音声符号化・復号装置の構成を示すブ
ロック図である。FIG. 24 is a block diagram showing a configuration of a conventional speech encoding / decoding device.

【図２５】従来の音声符号化・復号装置の構成を示す
ブロック図である。FIG. 25 is a block diagram showing a configuration of a conventional speech encoding / decoding device.

[Explanation of symbols]

101:符号器、 102:音声／非音声信号識別器、 10
3、104:切替スイッチ、105、106:符号化処理機能ブロッ
ク、 107:多重化部、 201:復号器、202:多重分離
部、 203、204:切替スイッチ、205、206:復号処理機能
ブロック、 301:符号化前処理部、302:LP分析部、
303:線形予測計算部、 304:聴覚重み付けフィル
タ、305:LP-LSP変換部、 306:開ループピッチ探索
部、307:非量子化LSP-LP変換部、 308:移動平均予測
成分計算部、309、309A:LSP量子化部、 310、310A:L
SP量子化符号帳、311:量子化LSP-LP変換部、 312:LP逆
フィルタ、 313:ピッチ成分計算部、314:励振信号
(周期成分)量子化部(閉ループピッチ探索部)、315、32
2:インパルス応答計算部、 316:利得量子化部、317:
利得量子化符号帳、 318:符号帳探索用ターゲット信
号計算部、319:パルス予備選択部、 320:ピッチプレ
フィルタ、321:励振信号(雑音成分)量子化部(固定符号
帳探索部)、323:利得の移動平均予測部、 324:符号
計算部、325:ピッチポストフィルタ、 326、331、33
2、333:加算器、327:量子化された励振信号計算部、
330:利得乗算器、334:最小自乗誤差計算部、 335、
335A:第１段LSP量子化符号帳、336:第２段LSP量子化符
号帳、 337、337A:移動平均予測係数符号帳、338:量
子化誤差重み付け係数計算部、 340:一般的な量子化
符号帳、341:加算器、 342:最小自乗誤差計算部、
401:ピッチラグ復号部、402:利得量子化符号帳、 40
3:利得逆量子化部、 404:利得の移動平均予測部、405:
励振信号(雑音成分)逆量子化部、 406、406A:LSP量
子化符号帳、407、407A:LSP逆量子化部、 408:ピッ
チポストフィルタ、409:LSP内挿部、 410:LSP-LP変
換部、 411:励振信号計算部、412:合成フィルタ、
413:ポストフィルタ、 414:後処理部。101: encoder, 102: voice / non-voice signal discriminator, 10
3, 104: Changeover switch, 105, 106: Encoding processing functional block, 107: Multiplexing unit, 201: Decoder, 202: Demultiplexing unit, 203, 204: Changeover switch, 205, 206: Decoding processing functional block, 301: precoding unit, 302: LP analysis unit,
303: Linear prediction calculation unit, 304: Auditory weighting filter, 305: LP-LSP conversion unit, 306: Open loop pitch search unit, 307: Non-quantized LSP-LP conversion unit, 308: Moving average prediction component calculation unit, 309 , 309A: LSP quantizer, 310, 310A: L
SP quantized codebook, 311: Quantized LSP-LP converter, 312: LP inverse filter, 313: Pitch component calculator, 314: Excitation signal
(Periodic component) Quantizer (closed-loop pitch searcher), 315, 32
2: Impulse response calculator, 316: Gain quantizer, 317:
Gain quantized codebook, 318: target signal calculation unit for codebook search, 319: pulse preliminary selection unit, 320: pitch prefilter, 321: excitation signal (noise component) quantization unit (fixed codebook search unit), 323 : Gain moving average predictor, 324: Code calculator, 325: Pitch post filter, 326, 331, 33
2, 333: adder, 327: quantized excitation signal calculation unit,
330: Gain multiplier, 334: Least square error calculator, 335,
335A: First stage LSP quantization codebook, 336: Second stage LSP quantization codebook, 337, 337A: Moving average prediction coefficient codebook, 338: Quantization error weighting coefficient calculator, 340: General quantization Codebook, 341: adder, 342: least square error calculator,
401: pitch lag decoding unit, 402: gain quantization codebook, 40
3: Gain dequantization unit, 404: Gain moving average prediction unit, 405:
Excitation signal (noise component) dequantization unit, 406, 406A: LSP quantization codebook, 407, 407A: LSP dequantization unit, 408: Pitch post filter, 409: LSP interpolation unit, 410: LSP-LP conversion Part, 411: excitation signal calculation part, 412: synthesis filter,
413: Post filter, 414: Post-processing unit.

Claims

(57) [Claims]

1. A voice / non-voice discriminator that discriminates whether an input signal is a voice signal or a non-voice signal and outputs a determination result, and an input signal based on a frequency parameter suitable for voice encoding. A first functional block that encodes and outputs a speech-encoded signal, and an input based on a frequency parameter suitable for non-speech encoding that is different from the above-mentioned frequency parameter by the same encoding method as the first functional block. Sequentially encode signal
And a second functional block for outputting a non-voice coded signal, which compresses and encodes the input signal, and the code of the input signal according to the determination result of the voice / non-voice discriminator. Switching means for selecting either the first functional block or the second functional block for supply to the audio encoder, the output of the encoder, and the determination result of the voice / non-voice discriminator, and transmission. And a multiplexing unit for outputting to a channel, the first functional block is an LS suitable for audio coding.
P (LineSpectrumPair): Line spectrum
Pair) quantizer, the second functional block is a non-voice code
Is an LSP quantizer suitable for encoding, and the LSP quantizer that constitutes the second functional block is
Stop the LSP moving average prediction process, or
A speech coding apparatus having a function of reducing the effect of uniform prediction .

2. A voice / non-voice discriminator that discriminates whether an input signal is a voice signal or a non-voice signal and outputs a determination result, and an input signal based on a frequency parameter suitable for voice encoding. A first functional block that encodes and outputs a speech-encoded signal, and an input based on a frequency parameter suitable for non-speech encoding that is different from the above-mentioned frequency parameter by the same encoding method as the first functional block. Sequentially encode signal
And a second functional block for outputting a non-voice coded signal, which compresses and encodes the input signal, and the code of the input signal according to the determination result of the voice / non-voice discriminator. Switching means for selecting either the first functional block or the second functional block for supply to the audio encoder, the output of the encoder, and the determination result of the voice / non-voice discriminator, and transmission. And a multiplexing unit for outputting to a channel, the first functional block is an LS suitable for audio coding.
P quantizer, the second functional block is a non-voice code
Is an LSP quantizer suitable for quantization, and the first and second functional blocks described above are used for quantization error calculation.
The weighting factors used are different, and
In addition, the weighting coefficient of the second functional block is
A speech coding apparatus, characterized in that the weighting coefficient is made larger than the weighting coefficient of the first functional block .

3. A demultiplexer for separating a coded signal and a speech / non-speech discrimination signal from a multiplexed data sequence coded and transmitted by the speech coder according to claim 1 or 2. And a decoder for decoding the coded signal into an original signal, the decoder including a third functional block corresponding to the encoder for coding the transmitted voice coded signal, and the same transmission. A fourth corresponding to an encoder that encodes the encoded non-voice coded signal
And a third functional block separated according to the determination result of the voice / non-speech classifier separated by the multiplex data separation unit.
Alternatively, a voice decoding device having a switching means for selecting one of the fourth functional blocks.