JPH11296196A

JPH11296196A - Sound encoding method and sound encoder

Info

Publication number: JPH11296196A
Application number: JP10100961A
Authority: JP
Inventors: Yasuhisa Shimazaki; 靖久島崎; Yuji Hatano; 雄治波多野; Junichi Nishimoto; 順一西本; Koshi Yamada; 孔司山田; Miki Takeuchi; 幹竹内; Hiroyuki Tanigawa; 博之谷川; Hidetoshi Sekine; 英敏関根
Original assignee: Hitachi Ltd
Current assignee: Hitachi Ltd
Priority date: 1998-04-13
Filing date: 1998-04-13
Publication date: 1999-10-29

Abstract

PROBLEM TO BE SOLVED: To reduce power consumption at the time of sound encoding processing by reducing the operation quantity of sound encoding processing, when the sound is encoded by using a code note consisting of the correspondence tables of sound source parameters and codes. SOLUTION: When sound encoding is started, a sound parameter 101d is read out from a code note 101, the sound is reproduced by using a synthesis filter 300, an error from the inputted sound is evaluated by an error evaluation device 301, the evaluated value is compared with a threshold value ε, when the former is smaller than the latter, and a code 101c corresponding to sound source parameters at the time is outputted to the outside. When the evaluated value is larger than the threshold value ε, it is stored in a register 102b, and updated to the minimum successively or all are stored.

Description

DETAILED DESCRIPTION OF THE INVENTION

【０００１】[0001]

【発明の属する技術分野】本発明は、ディジタル携帯電
話に用いられる音声符号化方法および音声符号化処理装
置に関し、特に符号化処理速度を向上するとともに、符
号化処理時の消費電力を低減することが可能な音声符号
化方法および音声符号化処理装置に関する。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a voice coding method and a voice coding apparatus used in a digital portable telephone, and more particularly to an improvement in coding speed and a reduction in power consumption during coding processing. The present invention relates to an audio encoding method and an audio encoding processing device capable of performing the following.

【０００２】[0002]

【従来の技術】近年、加入者の伸びが著しいディジタル
自動車電話・携帯電話では、音声の帯域圧縮を行う音声
符号化技術が用いられている。従来より、ディジタル自
動車電話・携帯電話に使用される音声符号化方式として
は、例えば『日本音響学会誌５１巻１０号（１９９
５）、p784〜789』に記載されているような装置が用い
られている。この音声符号化装置では、音声源パラメー
タとそれに対する符号の組が多数格納された符号帳(Ｒ
ＯＭ）を設け、その先頭番地より音声パラメータを順次
読み出し、合成フィルタを通過して音声を再生させる。
その合成された音声と入力音声との誤差を誤差評価装置
で評価することにより、誤差が最小であると判断された
音源パラメータに対応する符号を人部に出力させる方法
である。この方法では、符号帳（ＲＯＭ）から全ての音
源パラメータを読み出した後でなければ、最小の誤差の
ものが判別できない。2. Description of the Related Art In recent years, voice coding techniques for compressing voice bandwidth have been used in digital automobile telephones and mobile telephones in which the number of subscribers has increased remarkably. 2. Description of the Related Art Conventionally, as a speech encoding method used for a digital car phone / cellular phone, for example, “Acoustic Society of Japan, Vol.
5), pp. 784-789 ”. In this speech coding apparatus, a codebook (R) storing a large number of sets of speech source parameters and codes corresponding thereto is stored.
OM), the audio parameters are sequentially read from the start address, and the audio is reproduced through the synthesis filter.
In this method, an error between the synthesized voice and the input voice is evaluated by an error evaluation device, and a code corresponding to the sound source parameter determined to have the minimum error is output to a human unit. In this method, the minimum error cannot be determined unless all the sound source parameters have been read from the codebook (ROM).

【０００３】[0003]

【発明が解決しようとする課題】自動車電話・携帯電話
用音声符号化の各種方式の基礎になるものとして、ＣＥ
ＬＰ（ＣｏｄｅＥｘｃｉｔｅｄＬｉｎｅａｒＰｒ
ｅｄｉｃｔｉｏｎ）方式がある。このＣＥＬＰ方式で
は、符号帳として格納された種々の信号の系列と入力音
声の対応付けという形で音声を符号化する。この対応付
けの処理は、合成による分析手法に従って、符号帳の全
ての信号系列について音声を合成し、聴感上、最も良い
系列を選択することにより行われている。つまり、前述
の音声符号化装置のように、符号帳（ＲＯＭ）から全て
の音源パラメータを読み出す必要があり、従って音質の
良い符号化を行うためには演算量が膨大となり、その結
果、非常に高速な信号処理装置を必要とし、また、処理
に伴う消費電力も大きくなる、という問題があった。そ
こで、本発明の目的は、このような従来の課題を解決
し、音声符号化処理の演算量を低減するとともに、音声
符号化処理時の消費電力を低減することが可能な音声符
号化方法および音声符号化処理装置を提供することにあ
る。SUMMARY OF THE INVENTION As a basis of various systems of voice coding for car phones and mobile phones, CE is used.
LP (Code Excited Linear Pr)
Edition) system. In the CELP method, speech is encoded in the form of associating various signal sequences stored as a codebook with input speech. This association process is performed by synthesizing voices for all signal sequences in the codebook in accordance with an analysis method based on synthesis, and selecting the best audible sequence. That is, as in the above-described speech coding apparatus, it is necessary to read out all the sound source parameters from the codebook (ROM). Therefore, the amount of calculation becomes enormous in order to perform coding with good sound quality. There is a problem that a high-speed signal processing device is required, and power consumption for processing is increased. Therefore, an object of the present invention is to solve such a conventional problem, reduce the amount of calculation of the audio encoding process, and reduce the power consumption during the audio encoding process. An object of the present invention is to provide a speech encoding processing device.

【０００４】[0004]

【課題を解決するための手段】上記目的を達成するた
め、本発明の音声符号化方法では、(1)音声を合成する
ための音源パラメータと、該音源パラメータに対応する
符号の複数組を保持する符号帳を設けて、該符号帳から
読み出した音源パラメータから生成された音声と入力さ
れた音声とを比較し、その誤差を評価することにより、
その評価結果に基づいて出力する符号を選択する音声符
号化方法において、上記符号帳を用いて音声符号化処理
を行う際に、誤差の評価結果と定数とを比較してその大
小関係を判定し、その判定結果により符号を出力する
か、あるいは符号帳を分割して、分割された一部の符号
帳からのみ音源パラメータを読み出すことにより符号を
出力するか、あるいは符号帳から読み出されるアドレス
を別の記憶装置に記憶して、そのアドレスから読み出さ
れた音源パラメータに対応する符号を出力するか、のう
ちの１つを用いることにより、符号帳全体をアクセスす
ることなく、処理の途中でも最適な符号を出力させて音
声符号化処理を完了させることを特徴としている（図
３，図４，図８の各実施例参照）。また、(2)上記誤差
の評価結果と定数との大小関係を判定し、判定結果を符
号帳のエントリ数分だけ保持しておき、複数の判定結果
のうちの最小の値に対応する符号を出力するか、あるい
は判定結果を符号帳のエントリ数以下の予定容量分だけ
保持しておき、予定容量分を超えてからは判定結果が最
小のものに更新していき、最終的に最小を値に対応する
符号を出力することを特徴としている（図３の第１の実
施例参照）。また、本発明の音声符号化処理装置では、
(3)上記符号帳を階層的に備え、上位の階層に属する符
号帳は下位に属する符号帳よりも小さな容量を持ち、上
記音声符号化処理を行う際には上位の階層にある符号帳
を用いることを特徴としている（図４の第２の実施例参
照）。さらに、上記誤差の評価結果を保持する記憶装置
と、該記憶装置に記憶されたデータのうち最大値のもの
と最小値のものを判定する装置と、上位の階層に属する
符号帳と下位の階層に属する符号客のデータのやりとり
を管理する装置を有することを特徴としている（図４の
第２の実施例参照）。また、(4)上位の階層にある符号
帳と下位の階層にある符号帳との間にデータの入れ替え
を行う際には、前回の音声符号化処理結果に基づいて、
上位階層の符号帳の中で最も誤差の多いと判定されたデ
ータを下位階層の符号帳上のものと入れ替えることを特
徴としている（図４の第２の実施例参照）。In order to achieve the above object, in the speech encoding method of the present invention, (1) a sound source parameter for synthesizing speech and a plurality of sets of codes corresponding to the sound source parameter are stored. By providing a codebook to perform, by comparing the voice generated from the sound source parameters read from the codebook and the input voice, by evaluating the error,
In a speech coding method for selecting a code to be output based on the evaluation result, when performing a speech coding process using the codebook, the evaluation result of the error is compared with a constant to determine the magnitude relationship. The code is output according to the determination result, or the codebook is divided, and the code is output by reading out the excitation parameters only from some of the divided codebooks, or the address read from the codebook is separated. Or output the code corresponding to the sound source parameter read out from the address, or use one of them, so that the entire codebook is not accessed and the optimum It is characterized in that the speech encoding process is completed by outputting a suitable code (see each embodiment of FIGS. 3, 4 and 8). Also, (2) determining the magnitude relation between the evaluation result of the error and the constant, holding the determination results for the number of codebook entries, and determining the code corresponding to the minimum value of the plurality of determination results. Output or hold the judgment result for the planned capacity equal to or less than the number of entries in the codebook, and after exceeding the planned capacity, update the judgment result to the smallest one, and finally set the minimum value to (See the first embodiment in FIG. 3). Further, in the speech encoding processing device of the present invention,
(3) The codebook is provided in a hierarchical manner, and the codebooks belonging to the upper layer have a smaller capacity than the codebooks belonging to the lower layer. (See the second embodiment in FIG. 4). Further, a storage device for holding the evaluation result of the error, a device for determining a maximum value and a minimum value of data stored in the storage device, a codebook belonging to an upper layer and a lower layer (See the second embodiment in FIG. 4). Also, (4) when performing the exchange of data between the codebook in the upper layer and the codebook in the lower layer, based on the previous speech encoding processing result,
It is characterized in that data determined to have the largest error in the upper layer codebook is replaced with data in the lower layer codebook (see the second embodiment in FIG. 4).

【０００５】(5)上位階層にある符号帳と下位階層にあ
る符号帳との間でデータの入れ替えを行う際に、これま
での音声符号化処理結果の履歴に基づいて、上位階層の
符号帳の中で最も誤差の多いと判定されたデータを下位
階層の符号帳のものと入れ替えても良い（図４の第２の
実施例参照）。また、(6)音声符号化処理の終了時に
は、上位階層および下位階層の符号帳の内容を他の記憶
装置に退避し、次の音声符号化処理の開始時に上記記憶
装置の内容を上位階層および下位階層の符号帳に読み出
すように構成されていることも特徴としている（図４の
第２の実施例参照）。また、(7)音声符号化処理の終了
時には、上位階層および下位階層の符号帳の内容を他の
記憶装置に退避する際には、この音声符号化処理装置の
各使用者毎に記憶できるように個人符号帳が構成されて
いることも特徴としている（図６の第３の実施例参
照）。(8)上記個人符号帳が不揮発性の半導体記憶装置
で構成されていることを特徴としている（図６の第３実
施例参照）。また、(9)上記誤差の評価結果を保持する
記憶装置と、該記憶装置に記憶されたデータのうち最大
値のものと最小値のものを判定する装置と、符号帳のデ
ータ格納アドレスの一部を保持する記憶装置と、該アド
レス記憶装置に登録されるアドレスを管理する管理装置
を有することも特徴としている（図８の第４の実施例参
照）。さらに、(10)上記符号帳のデータ格納アドレスの
一部を保持する記憶装置には、主に誤差の評価結果の値
が小さい符号に対応するアドレスを保持することも特徴
としている（図８の第４の実施例参照）。(5) When exchanging data between a codebook in an upper layer and a codebook in a lower layer, a codebook of an upper layer is determined based on the history of the speech encoding processing results so far. The data determined to have the largest error among them may be replaced with the data in the lower-order codebook (see the second embodiment in FIG. 4). (6) At the end of the audio encoding process, the contents of the upper-layer and lower-layer codebooks are saved to another storage device, and at the start of the next audio encoding process, the contents of the storage device are saved to the upper-layer and the lower layer. It is also characterized in that it is configured to be read out to a lower layer codebook (see the second embodiment in FIG. 4). (7) When the contents of the codebooks of the upper layer and the lower layer are saved in another storage device at the end of the audio encoding process, the contents can be stored for each user of the audio encoding device. It is also characterized in that a personal codebook is configured (see the third embodiment in FIG. 6). (8) The personal codebook is constituted by a nonvolatile semiconductor memory device (see the third embodiment in FIG. 6). (9) A storage device for holding the error evaluation result, a device for determining the maximum value and the minimum value among the data stored in the storage device, and a code book data storage address. It is also characterized by having a storage device for holding the unit and a management device for managing addresses registered in the address storage device (see the fourth embodiment in FIG. 8). Further, (10) the storage device for storing a part of the data storage address of the codebook is mainly characterized by storing an address corresponding to a code having a small error evaluation result value (see FIG. 8). Refer to the fourth embodiment).

【０００６】[0006]

【発明の実施の形態】以下、本発明の実施例を、図面に
より詳細に説明する。図１は、本発明の基本的な構成を
示す音声符号化処理装置のブロック図である。図１にお
いて、１００は音声信号を符号に変換するための音声符
号化処理装置であって、従来と同じく、符号帳から読み
出した音源パラメータから音声を再生するための合成フ
ィルタと、再生された音声と入力された音声との間の誤
差を評価する誤差評価装置とを具備したものである。ま
た、１０１は、従来と同じか、または本発明により新た
に改良された構成の符号帳であって、音声の基になる音
源パラメータとそれに対応する符号の組を多数組登録し
たものであって、本発明では上位階層と下位階層に分離
したものもある。また、１０２は本発明により新たに設
けられたものであって、誤差判定装置とレジスタ、ある
いは最小誤差判定装置とレジスタと一次／二次符号帳管
理装置、あるいはこれらに加えて個人符号帳、あるいは
アドレス表等を具備したもので、符号帳１０１を制御す
るためのものである。Embodiments of the present invention will be described below in detail with reference to the drawings. FIG. 1 is a block diagram of a speech encoding processing device showing a basic configuration of the present invention. In FIG. 1, reference numeral 100 denotes an audio encoding processing device for converting an audio signal into a code, and a synthesis filter for reproducing audio from a sound source parameter read from a codebook and a reproduced audio And an error evaluation device for evaluating an error between the input and the input voice. Reference numeral 101 denotes a codebook having the same configuration as that of the related art or newly improved according to the present invention, in which a large number of sets of sound source parameters and codes corresponding thereto are registered. According to the present invention, there is a configuration in which an upper hierarchy and a lower hierarchy are separated. Reference numeral 102 denotes a device newly provided according to the present invention, which includes an error determination device and a register, or a minimum error determination device and a register and a primary / secondary codebook management device, or a personal codebook in addition thereto. It has an address table and the like, and is for controlling the codebook 101.

【０００７】図２は、本発明の一実施例を示す音声符号
化方法の動作フローチャートである。音声符号化処理装
置１００は音声符号化の開始が指示されると（ステップ
１０ａ）、音声入力を受け付け（ステップ１１ａ）、同
時に符号帳から符号、音源パラメータ組（以下、エント
リと呼ぶ）を読み出す（ステップ１２ａ）。読み出した
音源パラメータを用いて音声を合成し、入力された音声
との誤差を音声符号化処理装置１００の誤差評価装置で
評価する（ステップ１３ａ）。その結果、得られた値が
ある決められたしきい値よりも大きい場合には（ステッ
プ１４ａ）、符号帳から次のエントリを読み出し（ステ
ップ１２ａ）、その音源パラメータを用いて再度音声を
合成し、入力音声との誤差を評価する（ステップ１３
ａ）。誤差評価の結果がしきい値よりも大きければ、上
記の処理を繰り返し行い、しきい値より小さければ、符
号帳を制御してその時の音源パラメータに対応する符号
を外部に出力する（ステップ１５ａ）。音声符号化終了
が指示されるまで、音声入力からここまでの処理を繰り
返し行う（ステップ１６ａ）。本実施例で従来の方法と
異なる点は、誤差がしきい値以下であるか否かを判定す
る処理であって、しきい値以下になった時点で符号を出
力し、それ以降の符号帳からの読み出しを終了する点で
ある。従来では、符号帳の全ての音源パラメータを読み
出していたのに比べて、本実施例では途中で処理を終了
できるので、高速に符号化を行えるとともに、処理時の
消費電力を低減することができる。FIG. 2 is an operation flowchart of a speech encoding method according to an embodiment of the present invention. When instructed to start speech encoding (step 10a), speech encoding processing apparatus 100 accepts speech input (step 11a), and at the same time, reads a code and a sound source parameter set (hereinafter referred to as an entry) from a codebook (step S11a). Step 12a). A speech is synthesized using the read sound source parameters, and an error from the input speech is evaluated by an error evaluation device of the speech encoding processing device 100 (step 13a). As a result, if the obtained value is larger than a predetermined threshold value (step 14a), the next entry is read from the codebook (step 12a), and speech is synthesized again using the sound source parameters. , Evaluate the error with the input voice (step 13)
a). If the error evaluation result is larger than the threshold value, the above processing is repeated, and if smaller than the threshold value, the codebook is controlled and the code corresponding to the excitation parameter at that time is output to the outside (step 15a). . Until the end of the voice encoding is instructed, the processing from the voice input to this point is repeated (step 16a). The difference between the present embodiment and the conventional method is a process of determining whether or not the error is equal to or less than a threshold value. This is the point at which reading from. Compared to reading all the excitation parameters of the codebook conventionally, the processing can be terminated in the middle in the present embodiment, so that high-speed encoding can be performed and power consumption during processing can be reduced. .

【０００８】図３は、本発明の第１の実施例を示す音声
符号化処理装置の構成図である。図３において、１０１
は符号帳、１０１ｃは符号帳１０１に保持されている符
号、１０１ｄはその符号に対応付けられた音源パラメー
タであり、これら２つが１組となって１つのエントリを
構成している。３００は符号帳１０１から読み出した音
源パラメータに基づき音声を合成する合成フィルタ、３
０１は合成フィルタ３００により合成された音声と、外
部から入力された音声の誤差を評価するための誤差評価
装置である。３０２は符号帳１０１からのエントリ読み
出し装置、３０３は符号帳１０１から外部に出力される
符号１０１ｃの選択装置である。１０２は本発明で新た
よ設けられた符号帳制御装置であって、しきい値εと誤
差評価装置３０１から出力される誤差評価結果との大小
判定を行う誤差判定装置１０２ａ、誤差判定装置１０２
ａの判定結果を保持するレジスタ１０２ｂ、誤差判定装
置１０２ａの出力またはレジスタ１０２ｂの出力のいず
れかを選択して出力する選択装置１０２ｃとを具備して
いる。符号制御装置１０２は、比較が終了する毎に符号
帳１０１からのエントリ読み出し装置３０２を制御する
とともに、比較の結果に基づいて符号帳１０１から外部
に出力される符号１０１ｃの選択装置３０３を制御す
る。ここでレジスタ１０２ｂを符号帳の全ての組が記憶
できる容量にし、誤差差判定装置１０２ａが誤差出力と
しきい値εとを比較して、誤差出力がしきい値εより大
きいものを全てレジスタ１０２ｂに記憶し、全ての誤差
出力がしきい値εより大きい場合には、その中の最小の
ものに対応する符号を出力するように選択装置３０３を
制御する第１の方法と、レジスタ１０２ｂを符号帳の全
ての組を記憶できない小さい容量にして、比較の結果、
誤差がそれまでレジスタ１０２ｂに記憶している値のう
ちで最小のときには、順次、その最小値で更新していく
第２の方法とがある。FIG. 3 is a block diagram of a speech coding processing apparatus according to a first embodiment of the present invention. In FIG.
Is a codebook, 101c is a code held in the codebook 101, 101d is a sound source parameter associated with the code, and these two constitute one set to form one entry. Reference numeral 300 denotes a synthesis filter that synthesizes speech based on the sound source parameters read from the codebook 101, and 3
Reference numeral 01 denotes an error evaluation device for evaluating an error between the voice synthesized by the synthesis filter 300 and a voice input from the outside. 302 is a device for reading entries from the codebook 101, and 303 is a device for selecting the code 101c output from the codebook 101 to the outside. Reference numeral 102 denotes a codebook control device newly provided in the present invention, which is an error determination device 102a for performing a magnitude determination between the threshold value ε and an error evaluation result output from the error evaluation device 301, and an error determination device 102
a register 102b for holding the determination result of a, and a selection device 102c for selecting and outputting either the output of the error determination device 102a or the output of the register 102b. The code control device 102 controls the device 302 for reading entries from the codebook 101 each time the comparison is completed, and controls the device 303 for selecting the code 101c output from the codebook 101 to the outside based on the result of the comparison. . Here, the register 102b is set to have a capacity capable of storing all the sets of the codebook, and the error difference determination device 102a compares the error output with the threshold ε. If all error outputs are greater than the threshold value ε, the first method of controlling the selection device 303 to output the code corresponding to the smallest one of them is stored in the register 102b. Is made small enough to store all the sets of
When the error is the minimum value among the values stored in the register 102b, there is a second method of sequentially updating the error with the minimum value.

【０００９】さらに、図３の動作を詳述する。音声符号
化処理が開始されると、最初に符号帳１０１の先頭エン
トリが読み出される。合成フィルタ３００では、読み出
したエントりの音源パラメータから音声を合成し、誤差
評価装置３０１で入力された音声との誤差評価を行う。
その際に、各音声の平均誤差電力を計算しても良いし、
各音声を周波数変換した後で平均誤差を計算しても良
い。また、平均誤差電力対ピーク電力比を評価の指標に
用いても良い。誤差評価の結果得られた値と、あるしき
い値εとの大小関係により以下の処理が分れる。すなわ
ち、誤差評価値がしきい値εよりも大きい場合には符号
帳の次のエントリを読み出し、音声合成と誤差評価処理
を繰り返し行う。また、誤差評価値がしきい値εよりも
小さい場合には上記処理を打ち切り、その時点での符号
１０１ｃを入力音声を符号化したものとして選択し、外
部に出力する。その際に、符号帳１０１の全体に渡って
処理を行っても誤差評価値がしきい値εよりも小さくな
らなかった場合には、誤差評価値が最もしきい値εに近
いものに対応する符号が入力音声を符号化したものとし
て選択され、外部に出力される。なお、しきい値εは誤
差評価装置３０１における誤差評価の方法に従って用意
すべき値であって、固定された値であっても良いし、外
部から信号として入力される値であってもよい。以上の
ように構成することにより、本発明を用いた音声符号化
処理装置では、平均で考えた場合、符号帳全体をアクセ
スすることなく、符号化処理を行うことが可能となる。
つまり、音声符号化処理時の演算量および消費電力を大
幅に低減することができる。Further, the operation of FIG. 3 will be described in detail. When the audio encoding process is started, first, the first entry of the codebook 101 is read. The synthesis filter 300 synthesizes speech from the read sound source parameters of the entry, and evaluates an error with the speech input by the error evaluation device 301.
At that time, the average error power of each voice may be calculated,
The average error may be calculated after frequency conversion of each voice. Further, the ratio of the average error power to the peak power may be used as an evaluation index. The following processing can be identified based on the magnitude relationship between the value obtained as a result of the error evaluation and a certain threshold value ε. That is, when the error evaluation value is larger than the threshold value ε, the next entry in the codebook is read, and the speech synthesis and the error evaluation process are repeatedly performed. If the error evaluation value is smaller than the threshold value ε, the above processing is terminated, and the code 101c at that time is selected as a code obtained by encoding the input voice, and is output to the outside. At this time, if the error evaluation value does not become smaller than the threshold value ε even if the processing is performed over the entire codebook 101, the error evaluation value corresponds to the one closest to the threshold value ε. The code is selected as a coded version of the input speech and output to the outside. The threshold value ε is a value to be prepared according to the error evaluation method in the error evaluation device 301, and may be a fixed value or a value input as a signal from outside. With the above-described configuration, the speech encoding processing device using the present invention can perform the encoding process without accessing the entire codebook, when considered on average.
That is, it is possible to greatly reduce the amount of calculation and the power consumption during the audio encoding process.

【００１０】図４は、本発明の第２の実施例を示す音声
符号化処理装置の構成図である。図４において、１０１
ａはエントリの一部を保持する一次符号帳、１０１ｂは
一次符号帳１０１ａに保持されているもの以外のエント
リを保持する二次符号帳である。３０２は一次符号帳１
０１ａからのエントリ読み出し装置、３０３は一次符号
帳１０１ａから外部に出力される符号の選択装置であ
る。符号帳制御装置４００は、誤差評価装置３０１から
出力される誤差評価結果を保持するレジスタ４００ａ、
レジスタ４００ａ中の値の中から最小のものを判定する
ための最小誤差判定装置４００ｂ、一次符号帳と二次符
号帳に保持されているエントリを管理し、その入れ替え
を制御する一次／二次符号帳管理装置４００ｃからな
る。符号帳制御装置４００は、最小誤差判定装置４００
ｂが最小値を判定する毎に、符号帳１０１から外部に出
力される符号の選択装置３０３を制御し、また一次／二
次符号帳管理装置４００ｃにより一次符号帳と二次符号
帳のエントリの入れ替えを行うエントリ入れ替え装置４
０１を制御する。３００は一次符号帳１０１ａから読み
出した音源パラメータに基づいて音声を合成する合成フ
ィルタ、３０１は合成フィルタ３００により合成された
音声と、外部から入力された音声の誤差を評価するため
の誤差評価装置である。FIG. 4 is a block diagram of a speech coding processing apparatus according to a second embodiment of the present invention. In FIG.
a is a primary codebook holding a part of entries, and 101b is a secondary codebook holding entries other than those stored in the primary codebook 101a. 302 is a primary codebook 1
A device for reading entries from 01a, 303 is a device for selecting a code output from the primary codebook 101a to the outside. Codebook control device 400 includes a register 400a that holds an error evaluation result output from error evaluation device 301,
A minimum error determination device 400b for determining the smallest one among the values in the register 400a, a primary / secondary code for managing entries held in the primary codebook and the secondary codebook, and controlling the replacement thereof The book management device 400c is provided. The codebook control device 400 includes a minimum error determination device 400
Each time b determines the minimum value, it controls the selecting device 303 of the code output from the codebook 101 to the outside, and the primary / secondary codebook management device 400c controls the entry of the primary codebook and the secondary codebook. Entry swapping device 4 for swapping
01 is controlled. Reference numeral 300 denotes a synthesis filter for synthesizing speech based on the sound source parameters read from the primary codebook 101a. Reference numeral 301 denotes an error evaluation device for evaluating an error between the speech synthesized by the synthesis filter 300 and speech input from the outside. is there.

【００１１】一次符号帳１０１ａは二次符号帳１０１ｂ
よりも小さな容量のメモリで構成されており、また一次
符号帳１０１ａと二次符号帳１０１ｂはエントリ入れ替
え装置４０１を介してエントリの入れ替えを行うことが
できる。合成フィルタ３００には、一次符号帳１０１ａ
の先頭番地から最後尾まで順に読み出された音源パラメ
ータが送出され、そこで合成された音声と入力音声の誤
差評価が誤差評価装置３０１で行われた後、誤差評価結
果が逐次符号帳制御装置４００に送られる。符号帳制御
装置４００では、誤差評価装置３０１からの出力信号を
レジスタ４００ａに保存し、最小誤差判定装置４００ｂ
でその中の最小値に対応するエントリを判定する。最小
誤差判定装置４００ｂから出力される信号は、符号選択
装置３０３に送られて、符号選択装置３０３で対応する
符号が選択され、外部に出力される。本実施例では、一
次符号帳１０１ａと二次符号帳１０１ｂに分割すること
により、一次符号帳１０１ａのみのエントリを読み出す
だけで符号化処理が終了するので、第１の実施例と同じ
ように符号化処理の高速化と、消費電力の低減化の効果
が得られる。[0011] The primary codebook 101a is a secondary codebook 101b.
The primary codebook 101a and the secondary codebook 101b can exchange entries via the entry exchange device 401. The synthesis filter 300 includes the primary codebook 101a
Are transmitted in order from the first address to the last address, and the error evaluation between the synthesized voice and the input voice is performed by the error evaluator 301, and the error evaluation result is sequentially transmitted to the codebook controller 400. Sent to In the codebook control device 400, the output signal from the error evaluation device 301 is stored in the register 400a, and the minimum error determination device 400b
Determines the entry corresponding to the minimum value among them. The signal output from the minimum error determination device 400b is sent to the code selection device 303, where the corresponding code is selected and output to the outside. In the present embodiment, since the encoding process is completed only by reading the entry of the primary codebook 101a by dividing into the primary codebook 101a and the secondary codebook 101b, the encoding is performed in the same manner as in the first embodiment. The effect of speeding up the conversion process and reducing power consumption can be obtained.

【００１２】図５は、図４における符号帳制御装置の動
作と符号帳の入れ替え動作のフローチャートである。音
声符号化処理が開始されると（ステップ１０ｂ）、最初
に一次符号帳１０１ａの先頭アドレスからエントリを読
み出す（ステップ１１ｂ）。次に、そのエントリ中の音
源パラメータからある決められた手順に従って音声を合
成する。合成音声と外部から入力された音声の誤差を判
定し、その評価結果をレジスタ４００ａに入力して保存
する（ステップ１２ｂ）。次に、一次符号帳１０１ａの
次のアドレスからエントリを読み出し、先頭アドレスか
ら読み出した時と同じ動作を行う。一次符号帳１０１ａ
の最後尾に到達するまで、以上の動作を繰り返し行う
（ステップ１３ｂ）。このようにすれば、一次符号帳１
０１ａの最後尾にあるエントリの処理が終了した後、レ
ジスタ４００ａには一次符号帳１０１ａの全ての音源パ
ラメータの評価結果が保存されていることになる。その
一連の評価結果のうち、最小誤差判定装置４００ｂにお
いて最も誤差が小さいと判定されている音源パラメータ
に対応する符号が、入力された音声を符号化したものと
して選択され、外部に出力される（ステップ１４ｂ）。
ここで、最小誤差判定装置４００ｂにおける判定結果
は、一次／二次符号帳管理装置４００ｃにも送出され
る。一次／二次符号帳管理装置４００ｃでは、ある決め
られた手順に従って判定結果を処理し、その結果に基づ
いてエントリ入れ替え装置４０１に対して制御信号を発
行する。エントリ入れ替え装置４０１は、一次／二次符
号帳管理装置４００ｃからの制御信号に従って、一次符
号帳１０１ａの誤差最大のエントリと二次符号帳１０１
ｂ上のものとを入れ替える（ステップ１５ｂ）。なお、
二次符号帳１０１ｂのうち最も誤差の小さい音源パラメ
ータは、予め管理装置４００ｃにおいて判別されている
ものとする。以上で、１回の音声符号化処理が終了し
（ステップ１６ｂ）、次の音声符号化処理に移る。FIG. 5 is a flowchart of the operation of the codebook control device and the codebook exchanging operation in FIG. When the speech encoding process is started (step 10b), first, an entry is read from the head address of the primary codebook 101a (step 11b). Next, speech is synthesized according to a predetermined procedure based on the sound source parameters in the entry. An error between the synthesized voice and the voice input from the outside is determined, and the evaluation result is input to the register 400a and stored (step 12b). Next, an entry is read from the next address of the primary codebook 101a, and the same operation as when reading from the first address is performed. Primary codebook 101a
The above operation is repeated until the end of is reached (step 13b). By doing so, the primary codebook 1
After the processing of the entry at the end of 01a is completed, the register 400a stores the evaluation results of all the excitation parameters of the primary codebook 101a. From the series of evaluation results, the code corresponding to the sound source parameter determined to have the smallest error in the minimum error determination device 400b is selected as a coded version of the input voice, and output to the outside ( Step 14b).
Here, the determination result in the minimum error determination device 400b is also sent to the primary / secondary codebook management device 400c. The primary / secondary codebook management device 400c processes the determination result according to a predetermined procedure, and issues a control signal to the entry replacement device 401 based on the result. In accordance with a control signal from the primary / secondary codebook management device 400c, the entry replacement device 401 changes the entry of the primary codebook 101a with the maximum error and the secondary codebook 101.
Replace with the one on b (step 15b). In addition,
It is assumed that the excitation parameter having the smallest error in the secondary codebook 101b is determined in advance by the management device 400c. Thus, one speech encoding process is completed (step 16b), and the process proceeds to the next speech encoding process.

【００１３】なお、図５において、一次／二次符号帳管
理装置４００ｃにおいて行われる処理手順は以下のよう
にしてもよい。すなわち、最小誤差判定装置４００ｂに
おける判定結果から誤差最大であったことが判明したエ
ントリにフラグを付しておき、何回かの音声符号化処理
の後、そのフラグの数が規定数に達したものを二次符号
帳１０１ｂ上のエントリと入れ替える対象にする。二次
符号帳１０１ｂ上のエントリも同様のフラグを持ってお
り、最もフラグの少ないエントリを入れ替えの対象とす
る。以上のように構成することにより、音声符号化処理
中に使用する符号帳が小容量の一次符号帳１０１ａのみ
に限定されるので、処理の高速化のみならず、低消費電
力化も達成できる。さらに、入力された音声をより少な
い誤差で符号化できるように一次符号帳１０１ａの内容
を動作中に更新し、最適化していくために、小容量の一
次符号帳１０１ａと二次符号帳１０１ｂとを合わせて符
号帳全体を構成しているため、本発明を用いていない従
来の符号帳と符号帳の内容とを完全に一致させることが
でき、本発明を用いることによる不都合は生じない。In FIG. 5, the procedure performed in the primary / secondary codebook management device 400c may be as follows. That is, a flag is added to an entry that has been found to have the largest error from the result of the determination by the minimum error determination device 400b, and after a number of speech encoding processes, the number of flags has reached the specified number. The object is replaced with an entry on the secondary codebook 101b. Entries on the secondary codebook 101b also have similar flags, and the entry with the least number of flags is to be replaced. With the above-described configuration, the codebook used during the speech encoding process is limited to only the small-capacity primary codebook 101a, so that not only high-speed processing but also low power consumption can be achieved. Further, in order to update and optimize the contents of the primary codebook 101a during operation so that the input speech can be encoded with a smaller error, the primary codebook 101a and the secondary codebook 101b have a small capacity. To form the entire codebook, the contents of the conventional codebook not using the present invention and the contents of the codebook can be completely matched, and no inconvenience is caused by using the present invention.

【００１４】図６は、本発明の第３の実施例を示す音声
符号化処理装置の構成図である。本実施例の特徴は、各
使用者毎の符号帳の状態を記憶しておく個人符号帳６０
１を設けた点である。６００は符号帳制御装置であっ
て、図４における符号帳制御装置４００とほぼ同じ機能
を有しており、さらに外部から入力される個人識別信号
に応じて個人符号帳６０１を制御する機能を有する。６
０１は各話者の音声符号化処理の終了時における一次符
号帳１０１ａ、二次符号帳１０１ｂ、符号帳制御装置６
００の状態を保持する個人符号帳である。個人符号帳６
０１は、一次符号帳１０１ａ、二次符号帳１０１ｂ、符
号帳制御装置６００の状態を複数の話者に対応してそれ
ぞれ保持するため、大容量の記憶装置で構成されてい
る。話者が決まった時点で、その話者を識別するための
個人識別信号が符号帳制御装置６００に入力されるの
で、それに基づいて符号帳選択信号が生成され、その信
号が個人符号帳６０１に入力されることにより、その使
用者に対する前回の一次符号帳１０１ａと二次符号帳１
０１ｂの状態に基づいてエントリ入れ替え装置４０１で
一次／二次符号帳間で入れ替えが行われる。これによ
り、その使用者に適合した符号化が行われることにな
る。FIG. 6 is a block diagram of a speech coding processing apparatus according to a third embodiment of the present invention. The feature of this embodiment is that a personal codebook 60 for storing the state of the codebook for each user is provided.
1 is provided. A codebook control device 600 has almost the same function as the codebook control device 400 in FIG. 4 and further has a function of controlling the personal codebook 601 according to a personal identification signal input from the outside. . 6
01 is the primary codebook 101a, the secondary codebook 101b, and the codebook control device 6 at the end of the speech encoding process of each speaker.
This is a personal codebook that holds the state of 00. Personal codebook 6
Reference numeral 01 is a large-capacity storage device for holding the states of the primary codebook 101a, the secondary codebook 101b, and the codebook control device 600 for each of a plurality of speakers. When the speaker is determined, a personal identification signal for identifying the speaker is input to codebook control device 600, and a codebook selection signal is generated based on the signal, and the signal is stored in personal codebook 601. By being input, the previous primary codebook 101a and secondary codebook 1
Based on the state of 01b, the entry exchange device 401 performs exchange between the primary / secondary codebooks. As a result, encoding suitable for the user is performed.

【００１５】図７は、図６における音声符号化処理の動
作フローチャートである。音声符号化処理の開始に先立
ち（ステップ１１ｃ）、話者を特定するための個人識別
信号が符号帳制御装置６００に対して入力される（ステ
ップ１２ｃ）。符号帳制御装置６００では、入力された
個人識別信号から符号帳選択信号を生成し、個人符号帳
６０１の対応する部分を選択する。選択された部分は、
一次符号帳１０１ａ、二次符号帳１０１ｂおよび符号帳
制御装置６００の所定の位置にそれぞれロードされる
（ステップ１３ｃ）。その後は、図５に示す手順に従っ
て音声符号化が行われる（ステップ１４ｃ）。すなわ
ち、符号帳制御装置６００の内部は、図４の符号帳制御
装置４００と同じように、レジスタ、最小誤差判定装
置、一次／二次符号帳管理装置が設置されており、一次
帳号帳１０１ａの先頭アドレスからエントリが読み出さ
れた後、合成音声と外部から入力された音声の誤差を判
定し、その評価結果をレジスタに入力して保存する。一
次符号帳１０１ａの最後尾に到達するまで、以上の動作
を繰り返し行う。レジスタに保存された評価結果のう
ち、最小誤差判定装置において最も誤差が小さいと判定
されている音源パラメータに対応する符号が外部に出力
される。FIG. 7 is a flowchart showing the operation of the speech encoding process in FIG. Prior to the start of the voice encoding process (step 11c), a personal identification signal for specifying a speaker is input to the codebook control device 600 (step 12c). The codebook control device 600 generates a codebook selection signal from the input personal identification signal, and selects a corresponding part of the personal codebook 601. The selected part is
The primary codebook 101a, the secondary codebook 101b, and the codebook control device 600 are loaded at predetermined positions (step 13c). Thereafter, speech coding is performed according to the procedure shown in FIG. 5 (step 14c). That is, like the codebook control device 400 of FIG. 4, a register, a minimum error determination device, and a primary / secondary codebook management device are installed inside the codebook control device 600. After the entry is read out from the head address of the above, the error between the synthesized voice and the voice input from the outside is determined, and the evaluation result is input to the register and stored. The above operation is repeated until the end of the primary codebook 101a is reached. Among the evaluation results stored in the register, a code corresponding to a sound source parameter determined to have the smallest error by the minimum error determination device is output to the outside.

【００１６】この間、一次符号帳１０１ａ、二次符号帳
１０１ｂ、符号帳制御装置６００の内部状態は動作時に
順次変化していく。次に、音声符号化処理の終了が指示
されると（ステップ１５ｃ）、その時点で一次符号帳１
０１ａ、二次符号帳１０１ｂおよび符号帳制御装置６０
０の内部状態が個人符号帳６０１にストアされる（ステ
ップ１６ｃ）。このようにして、前回の音声符号化処理
中に最適化された一次符号帳１０１ａを最初から利用す
ることができるようになる。さらに、個人符号帳６０１
には個人識別信号により区別される複数の状態が保持さ
れているため、本発明の音声符号化処理装置が複数の人
物に利用される場合でも、何等問題は生じない。さら
に、個人符号帳６０１を強誘電体メモリ等の不揮発性記
憶装置で構成すれば、電源が遮断された場合でも個人符
号帳の内容を保持することができ、かつ高速動作が可能
となる。また、誤差判定装置４００ｂは、上位の階層の
符号帳１０１ａと下位の階層の符号帳１０１ｂとの間で
データの入れ替えを行う場合、それまでの音声符号化処
理結果の履歴に基づいて、上位の階層の符号帳１０１ａ
の中で最も誤差が多いと判定されたデータを下位の階層
の符号帳１０１ｂ上のものと入れ替える。また、符号帳
管理装置４００ｃは、音声符号化処理の終了時に、上位
の階層と下位の階層の符号帳１０１ａ，ｂの内容を他の
記憶装置（図示省略）に退避させ、次の音声符号化処理
の開始時に該記憶装置の内容を該上位の階層および下位
の階層の符号帳１０１ａ，ｂに読み出すように制御する
こともできる。During this time, the internal states of the primary codebook 101a, the secondary codebook 101b, and the codebook control device 600 change sequentially during operation. Next, when the end of the voice encoding process is instructed (step 15c), the primary codebook 1 is stored at that time.
01a, secondary codebook 101b, and codebook control device 60
The internal state of 0 is stored in the personal codebook 601 (step 16c). In this way, the primary codebook 101a optimized during the previous speech encoding process can be used from the beginning. Further, the personal codebook 601
Holds a plurality of states distinguished by a personal identification signal, so that there is no problem even when the speech coding apparatus of the present invention is used by a plurality of persons. Furthermore, if the personal codebook 601 is configured by a nonvolatile storage device such as a ferroelectric memory, the contents of the personal codebook can be retained even when the power is turned off, and high-speed operation can be performed. In addition, when the data is exchanged between the upper layer codebook 101a and the lower layer codebook 101b, the error determination device 400b determines the upper layer codebook based on the history of the voice encoding processing results up to that time. Hierarchical codebook 101a
Are replaced with the data on the codebook 101b of the lower layer. Further, at the end of the speech encoding process, the codebook management device 400c saves the contents of the codebooks 101a and 101b of the upper hierarchy and the lower hierarchy to another storage device (not shown), and stores the next speech encoding. At the start of the processing, the contents of the storage device may be controlled to be read out to the codebooks 101a and 101b of the upper layer and the lower layer.

【００１７】図８は、本発明の第４の実施例を示す音声
符号化処理装置の構成図である。本実施例の特徴は、符
号帳制御装置内にアドレス表を保持して、使用する符号
帳１０１上のエントリをアドレス表に登録されているも
のに限定することにより、処理の高速化とともに低消費
電力化を図るものである。符号帳制御装置８００には、
誤差評価装置３０１から出力される誤差評価結果を保持
するレジスタ８００ａ、レジスタ８００ａ中の値の中か
ら最小のものを判定するための最小誤差判定装置８００
ｂ、読み出すエントリの符号帳１０１上のアドレスが保
持されているアドレス表８００ｃ、アドレス表８００ｃ
の内容を管理するアドレス表管理装置８００ｄが備えら
れている。そして、符号帳１０１からのエントリ読み出
し装置３０２を制御して、符号帳１０１から外部に出力
される符号の選択装置３０３を制御する。アドレス表８
００ｃには、符号帳１０１のエントリ格納アドレスの一
部が保持されている。最初に、アドレス表８００ｃから
読み出されたアドレスに対応する符号帳１０１のエント
リが読み出される。次に、そのエントリ中の音源パラメ
ータからある決められた手順に従って音声を合成する。
合成音声と外部から入力された音声の誤差を判定し、そ
の評価結果をレジスタ８００ａに入力して保存する。さ
らに、アドレス表８００ｃから読み出された次のアドレ
スからエントリを読み出し、最初と同じ動作を行う。ア
ドレス表８００ｃの最後尾に到達するまで、以上の動作
を繰り返し行う。FIG. 8 is a block diagram of a speech coding apparatus according to a fourth embodiment of the present invention. The feature of the present embodiment is that the address table is held in the codebook control device, and the entries on the codebook 101 to be used are limited to those registered in the address table. It is intended to use electric power. The codebook control device 800 includes:
A register 800a for holding an error evaluation result output from the error evaluation device 301, and a minimum error determination device 800 for determining a minimum value from the values in the register 800a.
b, an address table 800c holding an address of the entry to be read on the codebook 101, and an address table 800c
Is provided with an address table management device 800d for managing the contents of the address table. Then, it controls an entry reading device 302 from the codebook 101 and controls a selecting device 303 for a code output from the codebook 101 to the outside. Address table 8
00c holds a part of the entry storage address of the codebook 101. First, the entry of the codebook 101 corresponding to the address read from the address table 800c is read. Next, speech is synthesized according to a predetermined procedure based on the sound source parameters in the entry.
An error between the synthesized voice and the voice input from outside is determined, and the evaluation result is input to the register 800a and stored. Further, the entry is read from the next address read from the address table 800c, and the same operation as the first is performed. The above operation is repeated until the end of the address table 800c is reached.

【００１８】このようにすることで、アドレス表８００
ｃの最後尾に登録されているアドレスから読み出したエ
ントリの処理が終了した後、レジスタ８００ａにはアド
レス表８００ｃに登録されている全ての音源パラメータ
の評価結果が保存されていることになる。一連の評価結
果のうち、最小誤差判定装置８００ｂにおいて最も誤差
が小さいと判定されている音源パラメータに対応する符
号が、入力された音声を符号化したものとして選択さ
れ、外部に出力される。なお、最小誤差判定装置８００
ｂにおける判定結果は、アドレス表管理装置８００ｄに
も送られる。アドレス表管理装置８００ｄでは、ある決
められた手順に従って判定結果を処理し、その結果に基
づいてアドレス表８００ｃに登録されているアドレスを
更新する。このように、本実施例では、音声符号化処理
中に使用する符号帳１０１上のエントリが小容量のアド
レス表にアドレスが登録されているものだけに限定され
るので、処理の高速化と低消費電力化が実現できる。さ
らに、入力された音声をより少ない誤差で符号化できる
ようにアドレス表の登録内容を動作中に更新して、最適
化しているため、小容量のアドレス表により音声符号化
を管理することによって音質が劣化することはない。ま
た、アドレス表は符号帳のエントリアドレスの一部を保
持しているだけであるから、従来の符号帳と符号帳の内
容を完全に一致させることができ、本発明を用いても不
都合は生じない。さらに、符号帳には一切手を加えずに
小容量のアドレス表を用いるだけであるため、回路量も
増大することはない。By doing so, the address table 800
After the processing of the entry read from the address registered at the end of c is completed, the register 800a stores the evaluation results of all the sound source parameters registered in the address table 800c. From the series of evaluation results, the code corresponding to the sound source parameter determined to have the smallest error by the minimum error determination device 800b is selected as a coded version of the input speech and output to the outside. Note that the minimum error determination device 800
The determination result in b is also sent to the address table management device 800d. The address table management device 800d processes the determination result according to a predetermined procedure, and updates the address registered in the address table 800c based on the result. As described above, in the present embodiment, the entries in the codebook 101 used during the audio encoding process are limited to only those whose addresses are registered in the small-capacity address table. Power consumption can be realized. Furthermore, since the registered contents of the address table are updated during operation and optimized so that the input voice can be coded with a smaller error, the voice quality is managed by managing the voice coding with a small-capacity address table. Does not deteriorate. Further, since the address table only holds a part of the entry addresses of the codebook, it is possible to completely match the contents of the conventional codebook with the contents of the codebook. Absent. Furthermore, since only a small-capacity address table is used without any modification to the codebook, the amount of circuits does not increase.

【００１９】[0019]

【発明の効果】以上説明したように、本発明によれば、
音声を合成するための音源パラメータとそれに対応する
符号の組を複数保持する符号帳を用いて、音声信号を符
号化する場合に、符号帳全体をアクセスすることなく符
号化処理を終了させるので、音声符号化処理の演算量を
低減させることができ、その結果、符号化処理を高速化
し、かつ符号化処理時の消費電力を低減することが可能
になる。As described above, according to the present invention,
When a speech signal is encoded using a codebook that holds a plurality of pairs of sound source parameters and corresponding codes for synthesizing speech, since the encoding process is terminated without accessing the entire codebook, As a result, it is possible to reduce the calculation amount of the audio encoding process, thereby speeding up the encoding process and reducing the power consumption during the encoding process.

[Brief description of the drawings]

【図１】本発明の音声符号化処理装置の基本的構成図で
ある。FIG. 1 is a basic configuration diagram of a speech encoding processing device according to the present invention.

【図２】本発明の第１の実施例を示す音声符号化方法の
動作フローチャートである。FIG. 2 is an operation flowchart of a speech encoding method according to the first embodiment of the present invention.

【図３】本発明の第１の実施例を示す音声符号化処理装
置の構成図である。FIG. 3 is a configuration diagram of a speech encoding processing device according to a first embodiment of the present invention.

【図４】本発明の第２の実施例を示す音声符号化処理装
置の構成図である。FIG. 4 is a configuration diagram of a speech encoding processing device according to a second embodiment of the present invention.

【図５】図４に示す音声符号化処理装置の動作フローチ
ャートである。5 is an operation flowchart of the speech encoding processing device shown in FIG.

【図６】本発明の第３の実施例を示す音声符号化処理装
置の構成図である。FIG. 6 is a configuration diagram of a speech encoding processing device according to a third embodiment of the present invention.

【図７】図６に示す音声符号化処理装置の動作フローチ
ャートである。FIG. 7 is an operation flowchart of the speech encoding processing device shown in FIG. 6;

【図８】本発明の第４の実施例を示す音声符号化処理装
置の構成図である。FIG. 8 is a configuration diagram of a speech encoding processing device according to a fourth embodiment of the present invention.

[Explanation of symbols]

１００…音声符号化処理装置、１０１…符号帳、１０２
…符号帳制御装置、１０１ａ…一次符号帳、１０１ｂ…
二次符号帳、１０２，４００，６００，８００…符号帳
制御装置、１０２ａ…誤差判定装置、４００ａ，８００ａ
…レジスタ、４００ｂ，８００ｂ…最小誤差判定装置、
４００ｃ…一次／二次符号帳管理装置、８００ｃ…アド
レス表、８００ｄ…アドレス表管理装置、３００…合成
フィルタ、３０１…誤差評価装置、３０２…エントリ読
み出し装置、３０３…符号選択装置、６０１…個人符号
帳。100: voice encoding processing device; 101: codebook, 102
... codebook control device, 101a ... primary codebook, 101b ...
Secondary codebook, 102, 400, 600, 800 ... codebook control device, 102a ... error determination device, 400a, 800a
... Register, 400b, 800b ... Minimum error determination device
400c: primary / secondary codebook management device, 800c: address table, 800d: address table management device, 300: synthesis filter, 301: error evaluation device, 302: entry reading device, 303: code selection device, 601: personal code Book.

フロントページの続き (72)発明者山田孔司東京都小平市上水本町五丁目20番１号株式会社日立製作所半導体事業部内 (72)発明者竹内幹東京都小平市上水本町五丁目20番１号株式会社日立製作所半導体事業部内 (72)発明者谷川博之東京都小平市上水本町五丁目20番１号株式会社日立製作所半導体事業部内 (72)発明者関根英敏東京都小平市上水本町五丁目20番１号株式会社日立製作所半導体事業部内Continuing on the front page (72) Inventor, Koji Yamada 5-2-1, Josuihonmachi, Kodaira-shi, Tokyo Inside Semiconductor Division, Hitachi, Ltd. (72) Miki Takeuchi 5-chome, Josuihoncho, Kodaira-shi, Tokyo No. 1 Hitachi Semiconductor Co., Ltd. Semiconductor Division (72) Inventor Hiroyuki Tanikawa 5--20-1, Kamisumihonmachi, Kodaira-shi, Tokyo Incorporated Hitachi Semiconductor Co., Ltd. (72) Inventor Hidetoshi Sekine Kodaira, Tokyo 5-20-1, Kamizuhoncho Inside Semiconductor Division, Hitachi, Ltd.

Claims

[Claims]

1. A sound source parameter which is sequentially read out from a codebook in which a plurality of pairs of a sound source parameter for synthesizing a voice and a code corresponding to the sound source parameter are registered, and a voice reproduced by synthesizing the sound source parameter and inputted. In a speech coding method for selecting and determining a code by evaluating an error with a speech, when the error evaluation value between the speech read and reproduced from the codebook and the input speech is smaller than a threshold, the code is read out. Or outputs the code corresponding to the sound source parameter, or divides the codebook into two, and sequentially reads out and reproduces the error evaluation value between the input voice and the voice read and reproduced sequentially from only one of the divided codebooks. The code corresponding to the sound source parameter of the smallest one is output, or the sound source parameter read from the codebook is registered in an address table stored separately. By outputting the code corresponding to the sound source parameter of the smallest one of the error evaluation values of the reproduced voice and the input voice by limiting the voice code without accessing the entire codebook, A speech coding method characterized by completing a coding process.

2. A codebook in which a plurality of pairs of a sound source parameter for synthesizing speech and a code corresponding to the sound source parameter are registered, and a sound source parameter read out from the codebook is synthesized to generate a speech. And an error evaluator for evaluating an error between the input speech and the input speech. In a speech encoding processor for selecting and determining a code to be output, a magnitude relationship between an evaluation result value of the error evaluator and a constant is determined. An error determination device; and a code selection device for outputting a code corresponding to a sound source parameter corresponding to an evaluation result value determined to be smaller than a constant in the determination by the error determination device to the outside. Audio coding processor.

3. The speech coding apparatus according to claim 2, wherein, in addition to the error determination apparatus, results determined by the error determination apparatus are held for the number of entries in the codebook, or the code is stored. A storage device that holds a number smaller than the number of entries in the book and sequentially updates them to a minimum value, and outputs a code corresponding to the minimum value among a plurality of determination results stored in the storage device to the outside A speech coding processing device characterized by performing the following.

4. A code book in which a plurality of pairs of sound source parameters for synthesizing speech and codes corresponding to the sound source parameters are registered, and a sound source parameter read out from the code book is synthesized, and a generated speech is generated. And an error evaluator that evaluates an error between the input voice and the input voice. In the voice coding processing device that selects and determines a code to be output, two or more upper layers having a smaller capacity than lower layers A codebook having a hierarchy, a storage device for reading out excitation parameters only from the codebook of the upper hierarchy, and holding an evaluation result of an error between a speech generated by synthesizing the excitation parameters and an input speech; Among the evaluation results stored in the device, the maximum value and the minimum value are determined, the code corresponding to the minimum value is output to the outside, and the code and voice corresponding to the maximum value are output. Pa An error determination device for moving a set of parameters from the upper layer to a codebook of a lower layer; and a codebook management device for managing exchange of data between a codebook belonging to the upper layer and a codebook belonging to a lower layer. And a speech coding processing device comprising:

5. The speech coding apparatus according to claim 4, wherein the error determination apparatus is configured to replace data between an upper-layer codebook and a lower-layer codebook. A code that is determined to have the largest error in the upper-layer codebook based on the history of the voice-encoding processing result of Processing equipment.

6. The speech encoding device according to claim 4, wherein the codebook management device stores the contents of the codebooks of the upper layer and the lower layer in another storage device when the speech encoding process ends. A voice encoding processing device that controls the contents of the storage device to be read into the upper-level and lower-level codebooks at the start of the next audio encoding process.

7. The speech encoding processing device according to claim 4, wherein the speech encoding processing is performed as a storage device that saves the contents of a codebook of a higher hierarchy and a lower hierarchy at the end of the speech encoding processing. An audio encoding processing device, which is a personal codebook in which contents are stored for each user who uses the encoding processing device.

8. The speech encoding processing device according to claim 4, wherein said codebook is constituted by a nonvolatile semiconductor memory device. apparatus.

9. A code book in which a plurality of pairs of sound source parameters for synthesizing speech and codes corresponding to the sound source parameters are registered, and a sound source parameter read out from the code book is synthesized, and a generated speech is generated. And a speech encoding processor for selecting and determining a code to be output by using an error evaluator for evaluating an error between the input speech and a speech. A storage device for holding the evaluation result of the error, and a storage device for storing the error evaluation result. An error determination device that determines the maximum value and the minimum value of the evaluation results obtained, an address table storage device that holds a part of the data storage address of the codebook, and a registration in the address table storage device And a management device for managing addresses to be processed.

10. The speech encoding processing device according to claim 9, wherein said address table storage device mainly holds an address corresponding to a code having a small error evaluation result value. Encoding processing device.