JPH0458300A

JPH0458300A - Vector quantizing method for sound signal

Info

Publication number: JPH0458300A
Application number: JP2170633A
Authority: JP
Inventors: Hiroko Kezuka; 毛塚　博子; Koji Yoshida; 幸司吉田
Original assignee: Matsushita Electric Industrial Co Ltd
Current assignee: Panasonic Holdings Corp
Priority date: 1990-06-28
Filing date: 1990-06-28
Publication date: 1992-02-25

Abstract

PURPOSE:To reduce the processing amount of retrieval as a whole and to reduce energy consumption by defining a relevant code vector as a representative vector when the distortion of a sound to be calculated from an input vector and a code book is lower than an adaptively set threshold value. CONSTITUTION:A discriminating means 1 discriminates whether an input sound is a sound or silence and whether the sound is a vowel or a consonant. A threshold value setting means 2 adaptively changes the threshold value corresponding to the state of the input sound. In order to select the representative vector from a code book 5, a representative vector calculating means 4 successively retrieves code vectors in the code book 5, stops the retrieval when the distorsion of the code vector and of the input vector is made lower than the threshold value, and selects the code vector at that time as the representative vector. Thus, without degrading sound quality by distortion, the operated variable for selecting the representative vector and the energy consumption can be reduced.

Description

【発明の詳細な説明】産業上の利用分野本発明はＡ／Ｄ変換された音声信号を量子化する音声信
号のベクトル量子化方法に関する。DETAILED DESCRIPTION OF THE INVENTION Field of the Invention The present invention relates to an audio signal vector quantization method for quantizing an A/D converted audio signal.

従来の技術従来、この種のベクトル量子化方法は、第５図のフロー
チャートに示す様に、まずステップＳ１において、ベク
トル量子化の対象である入力ベクトルＸ１Ｘ　＝　（Ｘ＋＋Ｘｘ、〜、　　ｘｎ　）がある一定時
間（以下フレームという）毎に入力される。BACKGROUND ART Conventionally, in this type of vector quantization method, as shown in the flowchart of FIG. 5, first in step S1, there is an input vector X1 X = (X++Xx, . It is input at fixed time intervals (hereinafter referred to as frames).

次に、入力ベクトルＸについて、所定のコードベクトル
Ｙｉ　　（ｉ＝１．２．・・・Ｎ）Ｙ’　”　（ｙ　Ｌ
、　ｙ　ｉｘ＋〜１３’ｌイ）との歪みｄｉが、例えば
次式、ｄｉ＝Σ　　Ｉｘｉ＋　　　ｙＬｌ” に基づいて、全てのコードベクトルＹｉについて計算さ
れる（ステップＳ２．Ｓ３．Ｓ４゜Ｓ２）。Next, for the input vector X, a predetermined code vector Yi (i=1.2...N)
, y ix+~13'lI) is calculated for all code vectors Yi, for example, based on the following equation, di=ΣIxi+yLl'' (steps S2.S3.S4°S2).

次に、歪みｄｉが最小になるｉを求め（ステップＳ５）
、その番号ｉを入力ベクトルＸの量子化値とする（ステ
ップＳ６）。Next, find i that minimizes the distortion di (step S5)
, the number i is set as the quantized value of the input vector X (step S6).

発明が解決しようとする課題しかしながら、上記した従来の音声信号のベクトル量子
化方法では、歪みが最小のコードベクトル（以下代表ベ
クトルという）を選択する際に、全てのコードベクトル
の歪みを計算する必要があり、そのため、演算量が多く
、かつ消費電力が多いという問題があった。Problems to be Solved by the Invention However, in the conventional vector quantization method for audio signals described above, when selecting the code vector with the minimum distortion (hereinafter referred to as the representative vector), it is necessary to calculate the distortion of all code vectors. Therefore, there were problems in that the amount of calculation was large and the power consumption was large.

本発明は、この様な従来の問題を解決するものであり、
演算量を削減し、消費電力を節約することのできるベク
トル量子化方法を提供することを目的とするものである
。The present invention solves these conventional problems,
The object of the present invention is to provide a vector quantization method that can reduce the amount of calculations and save power consumption.

課題を解決するための手段本発明は上記目的を達成するため、入力された音声信号
が有音であるか無音であるか、及び有音の場合には母音
であるか子音であるかを判定し、入力された音声信号を
ベクトル変換して入力ベクトルを形成し、上記判定結果
に応じて、コードベクトルと入力ベクトルとから算出さ
れる音声の歪みに関する閾値を設定し、複数のコードベ
クトルから成るコードブックを参照し、参照して得たコ
ードベクトルと入力ベクトルから音声の歪みを算出し、
算出された歪みと上記閾値とを比較し、閾値以上の場合
には次のコードベクトルの参照を行い、上記歪みの算出
と閾値との比較処理を繰り返し、閾値以下の場合には次
のコードベクトルの参照を停止し、当該コードベクトル
を代表ベクトルとして選定し、これによって歪みによる
音質の劣化をおさえて全体の演算量を削減するものであ
る。Means for Solving the Problems In order to achieve the above object, the present invention determines whether an input audio signal is voiced or silent, and if it is voiced, determines whether it is a vowel or a consonant. Then, vector-convert the input audio signal to form an input vector, set a threshold regarding audio distortion calculated from the code vector and the input vector according to the above judgment result, and create a vector consisting of a plurality of code vectors. Refer to the codebook, calculate the audio distortion from the code vector obtained by referring to it, and the input vector,
The calculated distortion is compared with the above threshold value, and if it is above the threshold value, the next code vector is referred to, the above distortion calculation and comparison process with the threshold value is repeated, and if it is below the threshold value, the next code vector is , and selects the code vector as a representative vector, thereby suppressing deterioration of sound quality due to distortion and reducing the overall amount of calculation.

作用本発明は上記のような構成により次のような作用を有す
る。Effects The present invention has the following effects due to the above structure.

すなわち、入力された音声が母音であるか子音であるか
無音であるかが判断され、無音や無声子音であれば歪み
が大きくても劣化は少ないので、閾値を低めに設定し、
母音であれば歪みが大きいと劣化が大きいため、閾値を
高めに設定する。この様に、入力音声の状態によって閾
値を適応的に変化させる。そして、コードブックから代
表ベクトルを選択するのに順次コードベクトルを検索し
ていき、コードブック中のコードベクトルと入力ベクト
ルとから求められる音声の歪みが閾値以下になった時点
で検索をやめ、そのコードベクトルを代表ベクトルとし
て決定する。これによって、歪みによる音質の劣化をお
さえて、コードベクトル検索による演算量及び消費電力
を全体的に削減するものである。In other words, it is determined whether the input voice is a vowel, a consonant, or silent, and if it is silent or a voiceless consonant, there will be little deterioration even if the distortion is large, so the threshold is set to a low value.
For vowels, if the distortion is large, the deterioration is large, so the threshold value is set high. In this way, the threshold value is adaptively changed depending on the state of the input audio. Then, to select a representative vector from the codebook, the codevectors are sequentially searched, and when the speech distortion determined from the codevector in the codebook and the input vector becomes less than a threshold, the search is stopped, and the search is stopped. Determine the code vector as the representative vector. This suppresses deterioration in sound quality due to distortion, and reduces the overall amount of calculation and power consumption due to code vector search.

実施例第１図は本発明のベクトル量子化方法の一実施例を示す
ブロック図である。第１図に示す実施例は、入力した音
声を母音であるか子音であるか、及び有音であるか無音
であるかを判定する有音・無音・母音・子音判定手段ｌ
と、その結果に従って閾値を変化させる閾値設定手段２
と、入力音声をベクトル変換する入力ベクトル算出手段
３と、入力ベクトルと閾値より代表ベクトルを算出する
代表ベクトル算出手段４と、それに用いるコードベクト
ルを格納したコードブック５とから構成されている。Embodiment FIG. 1 is a block diagram showing an embodiment of the vector quantization method of the present invention. The embodiment shown in FIG. 1 is a voiced/silent/vowel/consonant determining means l for determining whether an input voice is a vowel or a consonant, and whether it is voiced or silent.
and threshold setting means 2 for changing the threshold according to the result.
, an input vector calculation means 3 for vector-converting input speech, a representative vector calculation means 4 for calculating a representative vector from the input vector and a threshold value, and a codebook 5 storing code vectors used therein.

上記実施例によれば、有音・無音・母音・子音判定手段
ｌが入力された音声が有音であるか無音であるか、及び
母音であるか子音であるがを判定する。有音・無音・母
音・子音判定手段１が入力された音声を無音や無声子音
と判定した場合、歪みが大きくても音質の劣化は少ない
ので、閾値設定手段３は、閾値を低めに設定する。また
、有音・無音・母音・子音判定手段ｌが母音であると判
定した場合には、歪みが大きいと音質の劣化が太き（な
るため１、閾値設定手段３は、閾値を高めに設定する。According to the above embodiment, the voiced/silent/vowel/consonant determining means 1 determines whether the input voice is voiced or silent, and whether it is a vowel or a consonant. When the voiced/silent/vowel/consonant determining means 1 determines that the input voice is silent or a voiceless consonant, the threshold setting means 3 sets the threshold to a low value, since there is little deterioration in sound quality even if the distortion is large. . In addition, when the voiced/silent/vowel/consonant determining means 1 determines that the sound is a vowel, the greater the distortion, the greater the deterioration of the sound quality (1), the threshold setting means 3 sets the threshold to a higher value. do.

すなわち、入力音声の状態によって、閾値を適応的に変
化させる。そして、コードブック５から代表ベクトルを
選定するために、代表ベクトル算出手段４はコードブッ
ク５において順次コードベクトルを検索し、コードベク
トルと入力ベクトルの歪みが、閾値以下になった時点で
検索をやめ、そのときのコードベクトルを代表ベクトル
として選定する。これによって、歪みによる音質の劣化
を生じさせることなく、代表ベクトル選定のための演算
量と消費電力を削減することができる。That is, the threshold value is adaptively changed depending on the state of the input audio. Then, in order to select a representative vector from the codebook 5, the representative vector calculation means 4 sequentially searches for codevectors in the codebook 5, and stops the search when the distortion between the codevector and the input vector becomes less than a threshold value. , select the code vector at that time as the representative vector. As a result, the amount of calculation and power consumption for selecting a representative vector can be reduced without deteriorating the sound quality due to distortion.

第２図は第１図に示す実施例を具体化する回路例を示す
ブロック図である。第２図において１１は音声信号をデ
ィジタル信号に変換するＡ／Ｄ変換器、１２は中央処理
装置（ＣＰＵ）、ｌ　３はＣＰＵ１２の実行プログラム
や複数のコードベクトルで構成されているコードブック
が格納されたリードオンリメモリ（ＲＯＭ）、１４は入
力ベクトルなどが格納されるランダムアクセスメモリ（
ＲＡＭ）である。FIG. 2 is a block diagram showing an example of a circuit embodying the embodiment shown in FIG. In Fig. 2, 11 is an A/D converter that converts audio signals into digital signals, 12 is a central processing unit (CPU), and 3 stores an execution program for the CPU 12 and a codebook made up of multiple code vectors. 14 is a random access memory (ROM) in which input vectors are stored.
RAM).

また、第３図は第２図に示すＣＰＵ１２の動作を示すフ
ローチャートであり、第４図はＲＯＭ１３とＲＡＭ１４
の記憶内容を示す説明図であり、図示するようにＲＯＭ
１３にはあらかじめ複数個のコードベクトルから構成さ
れているコードブックが格納され、ＲＡＭ１４には入力
音声から算出された入力ベクトルが格納される。3 is a flowchart showing the operation of the CPU 12 shown in FIG. 2, and FIG. 4 is a flowchart showing the operation of the CPU 12 shown in FIG.
is an explanatory diagram showing the memory contents of the ROM.
13 stores in advance a codebook composed of a plurality of code vectors, and RAM 14 stores input vectors calculated from input speech.

次に、第２図に示す回路の動作について、第３図に示す
フローチャート及び第４図に示すＲＯＭ１３とＲＡＭ１
４の記憶内容を示す説明図を用いて説明する。Next, regarding the operation of the circuit shown in FIG. 2, the flowchart shown in FIG. 3 and the ROM 13 and RAM 1 shown in FIG.
This will be explained using an explanatory diagram showing the storage contents of No. 4.

Ａ／Ｄ変換器１１によってディジタル信号に変換された
音声信号は、ＣＰＵ１２において、フレーム毎に次の処
理を施される。The audio signal converted into a digital signal by the A/D converter 11 is subjected to the following processing for each frame in the CPU 12.

まずはじめに、入力されたディジタル信号は、ステップ
Ｓ１２において、そのフレーム内の信号に音声が有るか
無いかが判定され（無音・有音判定）、音声があると判
定された場合、その音声が無声音であるか有声音である
かが判定される（子音母音判定）。First, the input digital signal is judged in step S12 as to whether or not there is sound in the signal within the frame (silence/sound determination). If it is determined that there is sound, the sound is unvoiced. It is determined whether there is a voiced sound or not (consonant/vowel determination).

次に、ステップＳ１３において、ステップＳ１２で得ら
れた判定結果に従って、歪みの許容範囲を設定する閾値
を決定する。ステップＳ１２で得られた結果が、例えば
無音や無声子音であった場合には、代表ベクトルとの歪
みが大きくてもあまり音質の劣化はないため、閾値を低
めに設定する０反対に母音や母音定常部などと判定され
た場合には、歪みが大きいと音質が劣化してしまうため
、閾値を高めに設定し、代表ベクトルとの差が小さくな
るようにする。Next, in step S13, a threshold value for setting an allowable range of distortion is determined according to the determination result obtained in step S12. If the result obtained in step S12 is, for example, silence or a voiceless consonant, the sound quality will not deteriorate much even if the distortion with the representative vector is large. If it is determined that it is a stationary portion, the sound quality will deteriorate if the distortion is large, so the threshold value is set high so that the difference from the representative vector becomes small.

次に、ステップＳ１４において入力されたディジタル信
号をベクトル量子化に適した形に変換し、入力ベクトル
ＸＸ＝　（ｘ＋、ｘ＊＋”・＋ｘ＊　）を算出する。そして、第４図に示す様にＲＯＭ１３に格
納されているコードブックの先読から順に、入力ベクト
ルＸとコードベクトルＹｉＹｔ　＝　（３’　ｌｌ＋　
Ｖ　１１＋　”・＋　３’　ｉｎ）との歪みを計算する
（ステップ５１５）。次に、ステップＳ１６において計
算の結果得られた歪みがステップ３１３において設定さ
れた閾値よりも大きいか小さいかが判定される（ステッ
プ５１６）、設定された閾値以下であれば、コードブッ
クの途中であっても検索を終えて、そのコードベクトル
を代表ベクトルとして決定する。設定された閾値以上で
あれば、次のコードベクトル（ｉ＋１番目のコードベク
トル）の歪みを計算し、ステップＳ１５．Ｓ１６に示す
工程を歪みが閾値以下になるまで繰り返す。コードブッ
クの終りまで、閾値以下のコードブックが見つからなか
った場合は、検索したコードベクトルの中で一番歪みの
小さいものを代表ベクトルとして決定する。Next, in step S14, the input digital signal is converted into a form suitable for vector quantization, and the input vector X Input vector X and code vector YiYt = (3' ll+
V 11+ "·+ 3' in) is calculated (step 515). Next, in step S16, it is determined whether the distortion obtained as a result of the calculation is larger or smaller than the threshold value set in step 313. (step 516), if it is less than the set threshold, the search is finished even if it is in the middle of the codebook, and that code vector is determined as the representative vector.If it is more than the set threshold, the next code is selected. Calculate the distortion of the vector (i+1th code vector), and repeat the steps shown in steps S15 and S16 until the distortion becomes less than the threshold.If no codebook with less than the threshold is found until the end of the codebook, search Among the code vectors obtained, the one with the smallest distortion is determined as the representative vector.

このように、上記実施例によれば、入力信号の性質によ
って閾値を設定し、コードベクトルが閾値以下であれば
コードベクトルの検索を終え、それを代表ベクトルとし
て決定するため、検索のために必要な演算量及び消費電
力を削減することができ、また代表ベクトルとの差によ
る音質の劣化を防止することができる。As described above, according to the above embodiment, a threshold value is set depending on the nature of the input signal, and if the code vector is less than or equal to the threshold value, the code vector search is completed and it is determined as the representative vector. The amount of calculation and power consumption can be reduced, and deterioration of sound quality due to the difference from the representative vector can be prevented.

発明の効果本発明は上記実施例より明らかなように、入力音声の性
質により閾値を適応的に設定し、入力ベクトルとコード
ブックから求められる音声の歪みが適応的に設定した閾
値よりも低い場合は検索をやめ、当該コードベクトルを
代表ベクトルとするようにしたものであり、入力音声の
性質により許容できる歪みの範囲が異なるため、音質の
劣化を少なくして、コードブック検索による処理量を全
体的に少な（し、消費電力を削減できるという効果を有
する。Effects of the Invention As is clear from the above embodiments, the present invention adaptively sets a threshold value depending on the nature of the input speech, and when the distortion of the speech obtained from the input vector and the codebook is lower than the adaptively set threshold value. In this method, the search is stopped and the code vector is used as the representative vector, and since the range of allowable distortion varies depending on the nature of the input voice, this reduces the deterioration of sound quality and reduces the overall amount of processing by codebook search. It has the effect of reducing power consumption.

[Brief explanation of drawings]

第１図は本発明の音声信号のベクトル量子化方法の一実
施例を示すブロック図、第２図は第１図に示す実施例を
ＣＰＵを用いて具体化した回路構成を示すブロック図、
第３図は第２図に示すＣＰＵの処理内容を示すフローチ
ャート、第４図は第２図に示すＲＯＭとＲＡＭの記憶内
容の一例を示す説明図、第５図は従来の音声信号のベク
トル量子化方法を示すフローチャートである。１・・・有音・無音・母音・子音判定手段、２・・・閾
値設定手段、３・・・入力ベクトル算出手段、４・・・
代表ベクトル算出手段、５・・・コードブック、１１・
・・Ａ／Ｄ変換器、１２・・・中央処理装置（ＣＰＵ）
、１３・・・ＲＯＭ、１４・・・ＲＡＭ代理人　弁理士　粟野重孝　ばか１名第１第２図第図第図FIG. 1 is a block diagram showing an embodiment of the vector quantization method for audio signals of the present invention, and FIG. 2 is a block diagram showing a circuit configuration in which the embodiment shown in FIG. 1 is implemented using a CPU.
Fig. 3 is a flowchart showing the processing contents of the CPU shown in Fig. 2, Fig. 4 is an explanatory diagram showing an example of the storage contents of the ROM and RAM shown in Fig. 2, and Fig. 5 is a conventional vector quantum of an audio signal. 2 is a flowchart showing a method of DESCRIPTION OF SYMBOLS 1... Voiced/silent/vowel/consonant determination means, 2... Threshold setting means, 3... Input vector calculation means, 4...
Representative vector calculation means, 5... code book, 11.
...A/D converter, 12...Central processing unit (CPU)
, 13...ROM, 14...RAM Agent Patent Attorney Shigetaka Awano One Idiot 1 Figure 2 Figure Figure

Claims

[Claims]

Determines whether the input audio signal is voiced or silent, and if it is voiced, determines whether it is a vowel or consonant, and converts the input audio signal into a vector to form an input vector. , according to the above judgment result, set a threshold value for speech distortion calculated from the code vector and the input vector, refer to a codebook consisting of multiple code vectors, and calculate the code vector and input vector obtained by reference. Calculate the distortion of the voice from , compare the calculated distortion with the threshold, and if it is greater than the threshold, refer to the next code vector, repeat the process of calculating the distortion and comparing with the threshold,
If the value is below the threshold, the reference to the next code vector is stopped and that code vector is selected as the representative vector, thereby suppressing the deterioration of sound quality due to distortion and reducing the overall amount of calculation. Vector quantization method.