JP3205161B2

JP3205161B2 - Audio coding device

Info

Publication number: JP3205161B2
Application number: JP03340394A
Authority: JP
Inventors: 正米崎
Original assignee: Panasonic Corp; Matsushita Electric Industrial Co Ltd
Current assignee: Panasonic Corp; Panasonic Holdings Corp
Priority date: 1994-03-03
Filing date: 1994-03-03
Publication date: 2001-09-04
Anticipated expiration: 2016-09-04
Also published as: JPH07244499A

Description

DETAILED DESCRIPTION OF THE INVENTION

【０００１】[0001]

【産業上の利用分野】本発明は、高能率音声圧縮が必要
なディジタル電話やディジタル録音器に使用する音声符
号化器に関するものである。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a voice coder used for digital telephones and digital recorders which require high efficiency voice compression.

【０００２】[0002]

【従来の技術】近年、データを伝送または蓄積する媒体
が有限であることから、高音声品質かつ高圧縮率を実現
する音声符号化装置が望まれている。特に、入力音声の
性質を適応的に反映することができる適応コードブック
を導入することが高能率化を実現するためには重要であ
る。2. Description of the Related Art In recent years, since a medium for transmitting or storing data is limited, a speech encoding apparatus which realizes a high speech quality and a high compression rate has been desired. In particular, it is important to introduce an adaptive codebook capable of adaptively reflecting the characteristics of the input speech in order to achieve high efficiency.

【０００３】以下、従来の分析合成型音声符号化装置の
一例について説明する。図４は、従来の音声符号化装置
のブロック図を示す。図４において、４１は音声を入力
する音声入力器である。４２は基本周波数抽出器であ
り、入力音声の基本周波数を抽出する。４３は抽出され
た基本周波数に従って高調波周波数の振幅値を算出する
周波数振幅算出器である。[0003] An example of a conventional analysis-synthesis type speech coding apparatus will be described below. FIG. 4 shows a block diagram of a conventional speech coding apparatus. In FIG. 4, reference numeral 41 denotes a voice input device for inputting voice. Reference numeral 42 denotes a fundamental frequency extractor, which extracts a fundamental frequency of the input voice. Reference numeral 43 denotes a frequency amplitude calculator that calculates an amplitude value of a harmonic frequency according to the extracted fundamental frequency.

【０００４】以上のように構成された音声符号化装置に
おいて、音声入力器４１により入力された図５に示す波
形の音声から、基本周波数抽出器４２により、図６に示
すパワー・スパクトルと基本周波数が抽出される。その
後、この基本周波数を基にして周波数振幅算出器４２に
より図７に示す高調波周波数の振幅を求める。[0004] In the speech coding apparatus configured as described above, the fundamental frequency extractor 42 extracts the power spectrum and the fundamental frequency shown in FIG. 6 from the speech having the waveform shown in FIG. Is extracted. Thereafter, the amplitude of the harmonic frequency shown in FIG. 7 is obtained by the frequency amplitude calculator 42 based on the fundamental frequency.

【０００５】図８は合成音声波形を示す。FIG. 8 shows a synthesized speech waveform.

【０００６】[0006]

【発明が解決しようとする課題】しかしながら、上記の
従来の分析合成型音声符号化装置では、入力音声を分析
する際に入力音声の位相情報を用いないため、図５に示
す入力音声波形と図８に示す合成音声波形がまったく異
なり、直接比較することができない。そのため、各フレ
ーム毎に分析によって得たパラメータを、それぞれ、あ
らかじめ用意されている固定コードブックによって量子
化しなければならず、量子化効率が悪いという問題を有
していた。However, in the above-mentioned conventional analysis-synthesis type speech coding apparatus, since the phase information of the input speech is not used when analyzing the input speech, the input speech waveform shown in FIG. 8, the synthesized speech waveforms are completely different and cannot be directly compared. Therefore, the parameters obtained by the analysis for each frame must be quantized by a fixed codebook prepared in advance, and the quantization efficiency is low.

【０００７】本発明は上記従来の問題を解決するもの
で、音声の特徴を反映し適応的に変化する適応コードブ
ックを導入することによって効率の良い量子化をする優
れた音声符号化装置を提供することを目的とする。The present invention solves the above-mentioned conventional problem, and provides an excellent speech coding apparatus that performs efficient quantization by introducing an adaptive codebook that adaptively changes while reflecting the features of speech. The purpose is to do.

【０００８】[0008]

【課題を解決するための手段】本発明は上記目的を達成
するために、音声を入力する音声入力器と、入力された
音声の自己相関関数を算出する入力音声自己相関関数算
出器と、入力音声の基本周波数を抽出する基本周波数抽
出器と、前記抽出した基本周波数とパラメータ読み取り
器によって得られたベクトルパラメータとを基に入力音
声を分析し、パラメータの抽出を行う音声分析器と、前
記抽出されたパラメータに従って合成音声を生成する音
声合成器と、前記生成された合成音声の自己相関関数を
求める合成音声自己相関関数算出器と、前記合成音声自
己相関関数をコードベクトルとしてもつ適応コードブッ
クと、自己相関関数を求めた点におけるパラメータの変
化率を抽出するパラメータ算出器と、前記変化率パラメ
ータをコードベクトルとしてもつパラメータコードブッ
クと、前記入力音声の自己相関関数に最も近い関数のコ
ード番号を前記適応コードブックから求める適応コード
ブック探索器と、前記求められたコード番号で示されて
いるベクトルパラメータを前記パラメータコードブック
から読み込むパラメータ読み取り器とを備える構成にし
た。In order to achieve the above object, the present invention provides a voice input device for inputting voice, an input voice autocorrelation function calculator for calculating an autocorrelation function of the input voice, and an input device. A fundamental frequency extractor for extracting a fundamental frequency of speech, a speech analyzer for analyzing an input speech based on the extracted fundamental frequency and a vector parameter obtained by a parameter reader and extracting parameters, and A speech synthesizer that generates a synthesized speech according to the generated parameters, a synthesized speech autocorrelation function calculator that obtains an autocorrelation function of the generated synthesized speech, and an adaptive codebook having the synthesized speech autocorrelation function as a code vector. A parameter calculator for extracting a change rate of a parameter at a point where an autocorrelation function is obtained, and a code vector for calculating the change rate parameter. A parameter codebook having the same as the input code, an adaptive codebook searcher for obtaining a code number of a function closest to the autocorrelation function of the input voice from the adaptive codebook, and a vector parameter indicated by the obtained code number. And a parameter reader for reading from the parameter code book.

【０００９】[0009]

【作用】この構成によって、合成音声の自己相関関数に
よって構成されている適応コードブックを、入力音声の
自己相関関数によって探索し、この探索によって得られ
るコード番号の変化率パラメータを用いて入力音声を分
析するから、直接入力音声と合成音声を比較することが
できる。つまり、入力音声の特徴に追随し変化する適応
コードブックを用いることが可能となり、入力音声の特
徴に従ってパラメータを効率的に量子化することができ
る。With this configuration, an adaptive codebook composed of the autocorrelation function of the synthesized speech is searched by the autocorrelation function of the input speech, and the input speech is converted using the change rate parameter of the code number obtained by the search. Because of the analysis, the input speech can be directly compared with the synthesized speech. That is, it is possible to use an adaptive codebook that changes following the characteristics of the input voice, and the parameters can be efficiently quantized according to the characteristics of the input voice.

【００１０】[0010]

【実施例】以下本発明の一実施例について、図面を参照
しながら説明する。An embodiment of the present invention will be described below with reference to the drawings.

【００１１】図１において、１１は音声を入力する音声
入力器、１２は基本周波数抽出器であり、入力音声の基
本周波数を抽出する。１３は音声分析器であり、抽出し
た基本周波数を基に入力音声を分析する。１４は入力音
声自己相関関数算出器であり、入力された音声の自己相
関関数を算出する。１５は適応コードブック探索器であ
り、上記自己相関関数に近いコードベクトルを後述の適
応コードブック２０から選択する。１６はパラメータ読
み取り器であり、得られたコード番号の変化率パラメー
タをパラメータコードブック１７から読み出す。このパ
ラメータコードブック１７は、音声合成に用いる合成パ
ラメータの変化率をパラメータ算出器１８によって算出
した変化率パラメータによって構成されている。１９は
音声合成器であり、音声分析器１３によって求められた
パラメータに従って合成音声を生成する。適応コードブ
ック２０は、音声合成器１９によって生成された音声の
自己相関関数を合成音声自己相関関数算出器２１で算出
したものを蓄積している。In FIG. 1, reference numeral 11 denotes a voice input unit for inputting voice, and 12 denotes a fundamental frequency extractor, which extracts a fundamental frequency of the input voice. Reference numeral 13 denotes a voice analyzer that analyzes an input voice based on the extracted fundamental frequency. Reference numeral 14 denotes an input speech autocorrelation function calculator, which calculates an autocorrelation function of the input speech. An adaptive codebook search unit 15 selects a code vector close to the autocorrelation function from an adaptive codebook 20 described later. Reference numeral 16 denotes a parameter reader which reads out the obtained code number change rate parameter from the parameter code book 17. This parameter codebook 17 is composed of a change rate parameter calculated by a parameter calculator 18 for a change rate of a synthesis parameter used for speech synthesis. Reference numeral 19 denotes a speech synthesizer, which generates a synthesized speech according to the parameters obtained by the speech analyzer 13. The adaptive codebook 20 stores the autocorrelation function of the speech generated by the speech synthesizer 19 calculated by the synthesized speech autocorrelation function calculator 21.

【００１２】次に、上記のように構成された音声符号化
装置について、図１、図３を用いてその動作を説明す
る。Next, the operation of the speech coding apparatus configured as described above will be described with reference to FIGS.

【００１３】まず、音声入力器１１によって入力された
音声の自己相関関数を入力音声自己相関関数算出器１４
で算出する。この自己相関関数に近いコードベクトルを
適応コードブック探索器１５により、適応コードブック
２０から探索する。この適応コードブック２０には、音
声合成器１９によって生成された図２（ａ）に示す合成
音声の自己相関関数を合成音声自己相関関数算出器２１
で求めた図２（ｂ）に示すものが蓄積されている。First, an autocorrelation function of the speech input by the speech input device 11 is calculated by an input speech autocorrelation function calculator 14.
Is calculated by The adaptive codebook searcher 15 searches the adaptive codebook 20 for a code vector close to the autocorrelation function. The adaptive code book 20 includes an autocorrelation function of the synthesized speech generated by the speech synthesizer 19 shown in FIG.
The one shown in FIG.

【００１４】一方、基本周波数分析器１２により入力さ
れた音声の基本周波数を抽出し、これに基づいて音声分
析器１３で入力音声を分析する。ここで、入力音声を分
析する際、適応コードブック探索器１５で選択されたコ
ード番号の変化率パラメータを、パラメータコードブッ
ク１７からパラメータ読み取り器１６で読み取り、この
パラメータと先に求めた基本周波数によって入力音声を
分析する。On the other hand, the fundamental frequency of the input speech is extracted by the fundamental frequency analyzer 12, and the input speech is analyzed by the speech analyzer 13 based on the extracted fundamental frequency. Here, when analyzing the input speech, the change rate parameter of the code number selected by the adaptive codebook searcher 15 is read from the parameter codebook 17 by the parameter reader 16, and the parameter and the fundamental frequency obtained earlier are used. Analyze the input speech.

【００１５】次に、音声分析器１３の動作について説明
する。パラメータコードブック１７は、パラメータ算出
器１８によって求められる変化率パラメータで構成され
るもので、図３に示すように自己相関関数算出器２１で
算出する自己相関関数を求める一定区間Ｌにおけるパラ
メータの変化率を表している。従って、パラメータ読み
取り器１６によって抽出されたパラメータは、分析フレ
ームでのパラメータの変化率を示している。Next, the operation of the voice analyzer 13 will be described. The parameter code book 17 is composed of the rate-of-change parameters obtained by the parameter calculator 18, and as shown in FIG. 3, the parameter change in a certain section L for obtaining the auto-correlation function calculated by the auto-correlation function calculator 21. Represents the rate. Therefore, the parameters extracted by the parameter reader 16 indicate the rate of change of the parameters in the analysis frame.

【００１６】音声分析器１３では、基本周波数分析器１
２によって得られた基本周波数を中心に、上記パラメー
タの変化率に従って推移させた時の線スペクトルを基に
音声を分析する。In the voice analyzer 13, the fundamental frequency analyzer 1
The voice is analyzed on the basis of the line spectrum when the frequency is changed in accordance with the rate of change of the above-mentioned parameter, centering on the fundamental frequency obtained in Step 2.

【００１７】以上から明らかなように、本実施例による
音声符号化装置においては、適応コードブック手法を導
入し、これにより得られる変化率パラメータを用いて、
音声の性質を反映したコードブックによる効率的な量子
化が可能になり、また、音声を動的に分析することがで
きる点で優れた効果が得られる。As is apparent from the above, in the speech coding apparatus according to the present embodiment, an adaptive codebook method is introduced, and a change rate parameter obtained by the adaptive codebook method is used.
An excellent effect is obtained in that efficient quantization can be performed by a code book reflecting the characteristics of speech, and speech can be dynamically analyzed.

【００１８】以上のように本実施例によれば、自己相関
関数をコードブックとする適応コードブックと、それに
対応するパラメータコードブックを設けることにより音
声の特徴を含んだコードブックによって量子化し、合成
パラメータの変化を考慮した分析をすることが可能とな
るので、効率的かつ動的に分析し符号化することができ
る。As described above, according to the present embodiment, an adaptive codebook having an autocorrelation function as a codebook and a parameter codebook corresponding to the adaptive codebook are quantized and synthesized by a codebook containing speech characteristics. Since the analysis can be performed in consideration of the change in the parameter, the analysis and encoding can be performed efficiently and dynamically.

【００１９】[0019]

【発明の効果】以上のように本発明は、合成音声の自己
相関関数算出器とその自己相関関数をコードベクトルと
する適応コードブック、および、音声合成パラメータを
コードベクトルとするパラメータコードブックを設ける
ことにより、入力音声の特徴に即した動的な音声分析が
可能になり、優れた高能率音声符号化装置を実現でき
る。As described above, the present invention provides an autocorrelation function calculator for a synthesized speech, an adaptive codebook using the autocorrelation function as a code vector, and a parameter codebook using a speech synthesis parameter as a code vector. This enables dynamic speech analysis according to the characteristics of the input speech, and realizes an excellent high-efficiency speech encoding device.

[Brief description of the drawings]

【図１】本発明の実施例における音声符号化装置のブロ
ック図FIG. 1 is a block diagram of a speech encoding apparatus according to an embodiment of the present invention.

【図２】本実施例における合成音声と適応コードブック
の関係を示す説明図FIG. 2 is an explanatory diagram showing a relationship between a synthesized speech and an adaptive codebook in the embodiment.

【図３】本実施例における合成音声とパラメータコード
ブックとの関係を示す説明図FIG. 3 is an explanatory diagram showing a relationship between a synthesized speech and a parameter code book in the embodiment.

【図４】従来の音声符号化装置のブロック図FIG. 4 is a block diagram of a conventional speech encoding device.

【図５】従来における入力音声波形図FIG. 5 is a conventional input voice waveform diagram.

【図６】従来における入力音声のパワー・スペクトルと
基本周波数を示す説明図FIG. 6 is an explanatory diagram showing a power spectrum and a fundamental frequency of an input voice in the related art.

【図７】従来における高調波周波数の振幅値を示す説明
図FIG. 7 is an explanatory diagram showing an amplitude value of a harmonic frequency in the related art.

【図８】従来における合成音声波形図FIG. 8 is a conventional synthetic speech waveform diagram.

[Explanation of symbols]

１１音声入力器１２基本周波数抽出器１３音声分析器１４入力音声自己相関関数算出器１５適応コードブック探索器１６パラメータ読み取り器１７パラメータコードブック１８パラメータ算出器１９音声合成器２０適応コードブック２１合成音声自己相関関数算出器４１音声入力器４２基本周波数抽出器４３周波数振幅算出器 Reference Signs List 11 voice input device 12 fundamental frequency extractor 13 voice analyzer 14 input voice autocorrelation function calculator 15 adaptive codebook searcher 16 parameter reader 17 parameter codebook 18 parameter calculator 19 voice synthesizer 20 adaptive codebook 21 synthesized voice Autocorrelation function calculator 41 voice input device 42 fundamental frequency extractor 43 frequency amplitude calculator

───────────────────────────────────────────────────── フロントページの続き (58)調査した分野(Int.Cl.⁷，ＤＢ名) G10L 19/00 - 19/14 H03M 7/30 H04B 14/04 ──────────────────────────────────────────────────続き Continued on the front page (58) Field surveyed (Int. Cl. ⁷ , DB name) G10L 19/00-19/14 H03M 7/30 H04B 14/04

Claims

(57) [Claims]

A voice input device for inputting voice; an input voice autocorrelation function calculator for calculating an autocorrelation function of the input voice; a fundamental frequency extractor for extracting a fundamental frequency of the input voice; A voice analyzer that analyzes the input voice based on the obtained fundamental frequency and the vector parameter obtained by the parameter reader and extracts the parameter;
A speech synthesizer for generating a synthesized speech according to the extracted parameters, a synthesized speech autocorrelation function calculator for obtaining an autocorrelation function of the generated synthesized speech, and an adaptive code having the synthesized speech autocorrelation function as a code vector Book, a parameter calculator for extracting a change rate of a parameter at a point where an autocorrelation function is obtained, a parameter code book having the change rate parameter as a code vector, and a code of a function closest to the autocorrelation function of the input voice. An audio coding apparatus comprising: an adaptive codebook searcher for obtaining a number from the adaptive codebook; and a parameter reader for reading a vector parameter indicated by the obtained code number from the parameter codebook.