JPH08286696A - Speech speed conversion and decoding method - Google Patents

Speech speed conversion and decoding method

Info

Publication number
JPH08286696A
JPH08286696A JP7093744A JP9374495A JPH08286696A JP H08286696 A JPH08286696 A JP H08286696A JP 7093744 A JP7093744 A JP 7093744A JP 9374495 A JP9374495 A JP 9374495A JP H08286696 A JPH08286696 A JP H08286696A
Authority
JP
Japan
Prior art keywords
voice
voice signal
pitch period
speech speed
speed conversion
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
JP7093744A
Other languages
Japanese (ja)
Inventor
Hiroko Yoshida
博子 吉田
Koji Yoshida
幸司 吉田
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Panasonic Holdings Corp
Original Assignee
Matsushita Electric Industrial Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Matsushita Electric Industrial Co Ltd filed Critical Matsushita Electric Industrial Co Ltd
Priority to JP7093744A priority Critical patent/JPH08286696A/en
Publication of JPH08286696A publication Critical patent/JPH08286696A/en
Pending legal-status Critical Current

Links

Abstract

PURPOSE: To provide a speech speed conversion and decoding method which can convert and reproduce a speech speed by adding only slight quantity of operation. CONSTITUTION: By using a voice decoding method reproducing a voice signal in which a voice signal is coded using a voice coding method including a parameter relating to a pitch period in a coding data and it is recorded in a memory 1 while it is decoded, a voice signal is compressed and expanded by interpolating and thinning waveforms of a decoded voice signal in a time base of the voice signal by a speech speed conversion processing means 5 based on a pitch period of a parameter relating to a pitch period being a coding data, and a speech speed conversion is performed.

Description

【発明の詳細な説明】Detailed Description of the Invention

【0001】[0001]

【産業上の利用分野】本発明は、留守番電話機等で用い
られる話速変換、例えば、音声信号を符号化して固体メ
モリに録音し、これを復号する際、音声のピッチ(声の
高さ)を変えずに話速(声の速さ)を変えて出力する所謂
早聞き,遅聞きをするための話速変換復号化方法に関す
る。
BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to speech rate conversion used in an answering machine or the like, for example, when a voice signal is encoded and recorded in a solid-state memory and is decoded, the pitch of the voice (pitch of voice). The present invention relates to a speech speed conversion / decoding method for performing so-called fast listening and slow listening in which the speech speed (voice speed) is changed without changing the output.

【0002】[0002]

【従来の技術】従来、この種の音声信号符号化,復号化
方式に用いられる音声のピッチを変えずに話速を変えて
音声を再生する復号化方法としては、図2に示されるこ
の従来方法を適用した装置に基づくものが知られてい
る。図中、1は符号化した音声信号を記憶させた固体メ
モリ等のメモリ、2はメモリ1から読み出した符号化音
声信号を復号するための復号化処理手段、3は復号化し
た音声信号を一時的にストアするバッファ、4はバッフ
ァ3から読み出した復号化音声信号よりピッチ周期を算
出するためのピッチ周期算出手段、5はピッチ周期算出
手段4により算出されピッチ周期に基づき、前記復号化
音声信号の話速を変換する話速変換処理手段であり話速
変換された音声信号を出力する。
2. Description of the Related Art Conventionally, as a decoding method for reproducing a voice by changing the speech speed without changing the pitch of the voice used in this type of voice signal encoding / decoding system, the conventional method shown in FIG. Those based on devices to which the method is applied are known. In the figure, 1 is a memory such as a solid-state memory for storing an encoded audio signal, 2 is a decoding processing means for decoding the encoded audio signal read from the memory 1, and 3 is a temporary decoded audio signal. A buffer for storing data 4 in order to calculate a pitch cycle from the decoded speech signal read from the buffer 3, and 5 in accordance with the pitch cycle calculated by the pitch cycle calculating means 4 based on the pitch cycle. It is a voice speed conversion processing means for converting the voice speed of the voice and outputs a voice signal whose voice speed has been converted.

【0003】この方法では、バッファ3から読み出した
復号化音声信号から、自己相関法やケプストラム法等の
推定処理方法により前記ピッチ周期算出手段4により話
速変換に必要なピッチ周期を算出し、この算出したピッ
チ周期を用い、例えばポインタ制御による重複加算法
(PICOLA:Pointer Interval Control OverlapAnd
Add),時間領域調波構造伸縮法(TDHS:Time-Domai
n Harmonic Scaling)等音声信号の時間軸上で音声波形
を挿入したり間引いたりして、話速を変換する方法によ
り話速変換処理手段5によって話速を変換するものであ
る。このようにすれば、復号化音声信号から新たにピッ
チ周期を算出することによって、所望の話速変換を行う
ことができるのである。
In this method, the pitch period required for speech speed conversion is calculated by the pitch period calculating means 4 from the decoded speech signal read from the buffer 3 by an estimation processing method such as an autocorrelation method or a cepstrum method. Overlap addition method by pointer control using the calculated pitch period
(PICOLA: Pointer Interval Control OverlapAnd
Add), time-domain harmonic structure expansion / contraction method (TDHS: Time-Domai)
n Harmonic Scaling) etc., the speech speed is converted by the speech speed conversion processing means 5 by a method of converting the speech speed by inserting or thinning the speech waveform on the time axis of the speech signal. In this way, the desired speech speed conversion can be performed by newly calculating the pitch period from the decoded voice signal.

【0004】[0004]

【発明が解決しようとする課題】しかしながら、この従
来の話速変換方法においては、ピッチ周期を新たに算出
することが大きな問題点であり、これを行うために演算
量や命令ROMが増えてしまい、特に、音声信号の圧縮
率が高くなる場合(話速を速くする場合)などは、通常の
3倍以上の復号処理を行わなければならず、処理が追い
つかなくなるという問題があった。
However, in this conventional speech speed conversion method, a new problem is that the pitch period is newly calculated, and in order to do this, the amount of calculation and the instruction ROM increase. Especially, when the compression rate of the audio signal becomes high (when the speech speed is increased), the decoding process must be performed three times or more as much as usual, which causes a problem that the process cannot catch up.

【0005】本発明は上記従来の問題点を解決するもの
で、僅かな演算量の追加で話速を変換して再生できる話
速変換復号化方法を提供するものである。
The present invention solves the above-mentioned conventional problems, and provides a speech speed conversion / decoding method capable of converting the speech speed for reproduction by adding a small amount of calculation.

【0006】[0006]

【課題を解決するための手段】本発明は上記目的を達成
するために、符号化データにピッチ周期に関連するパラ
メータを含む音声符号化方法を用いて音声信号を符号化
し、メモリ等に記録すると共にそれを復号化して音声信
号を再生する音声復号化方法を用い、復号化した音声信
号を、前記符号化データであるピッチ周期に関連するパ
ラメータが示すピッチ周期に基づき、その音声信号の時
間軸上において波形を挿入,間引きして音声信号を圧
縮,伸長し、話速変換を行うようにしたものである。
In order to achieve the above object, the present invention encodes a voice signal using a voice encoding method in which encoded data includes a parameter related to a pitch period and records it in a memory or the like. And a voice decoding method for decoding it to reproduce a voice signal, and decoding the decoded voice signal based on the pitch period indicated by the parameter related to the pitch period which is the encoded data, and the time axis of the voice signal. In the above, the waveform is inserted and thinned to compress and expand the voice signal to convert the speech speed.

【0007】[0007]

【作用】従来の方法においては、自己相関法や、ケプス
トラム法等により演算したピッチ周期を用いて前記の時
間軸上で音声信号を圧縮,伸長する話速変換のピッチ周
期を制御し、話速変換を行うものであったが、通常、前
記の時間軸上で音声信号を圧縮,伸長する形式の話速変
換は、制御されるピッチ周期が多少違っていても、大き
な品質の劣化を来すことはないので、本発明においては
前記音声信号の符号化,復号化方式で求めた通常20ms等
のフレーム単位のピッチ周期をこれに用いることによ
り、演算量を増やさずに話速変換復号化を行うことがで
きる。
In the conventional method, the pitch period of the speech speed conversion for compressing and expanding the voice signal on the time axis is controlled by using the pitch period calculated by the autocorrelation method, the cepstrum method, etc. Although the conversion is performed, usually, the speech speed conversion of the format in which the audio signal is compressed and expanded on the time axis described above causes a large deterioration in quality even if the pitch period to be controlled is slightly different. Therefore, in the present invention, the speech period conversion / decoding can be performed without increasing the calculation amount by using the pitch period of the frame unit such as 20 ms, which is usually obtained by the encoding / decoding method of the voice signal. It can be carried out.

【0008】[0008]

【実施例】以下、本発明の一実施例について図1を参照
しながら説明する。なお、前記従来のものと同一の部分
については同一の符号を付すものとする。図中、1はピ
ッチ周期やラグ等のピッチ周期に関連するパラメータを
符号化信号に持つ符号化方法で符号化された音声信号を
記憶する固体メモリ等のメモリ、2はメモリ1から読み
出した符号化音声信号を復号するための復号化処理手
段、3は復号化した音声信号と復号化時に用いたピッチ
周期に関連するパラメータを一時的にストアするRAM
等を用いたバッファ、5は、前記ピッチ周期に関連する
パラメータが表すピッチ周期に基づき、前記復号化音声
信号の話速を変換する話速変換処理手段であり話速変換
された音声信号を出力する。
DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS An embodiment of the present invention will be described below with reference to FIG. It should be noted that the same parts as those of the above-mentioned conventional one are denoted by the same reference numerals. In the figure, 1 is a memory such as a solid-state memory for storing an audio signal encoded by an encoding method having a parameter related to the pitch period such as a pitch period or a lag in the encoded signal, and 2 is a code read from the memory 1. Decoding processing means for decoding the deciphered speech signal, 3 is a RAM for temporarily storing the deciphered speech signal and parameters relating to the pitch period used at the time of decoding.
And the like, the buffer 5 is a voice speed conversion processing means for converting the voice speed of the decoded voice signal based on the pitch period represented by the parameter related to the pitch period, and outputs the voice signal whose voice speed is converted. To do.

【0009】次にその動作について説明する。まず、ピ
ッチ周期を使用するTC−WVQ(Transform Coding Wi
th Weighted Vector Quantization)方式,RPE−LT
P(Regular Pulse Excitation - Long Term Predictio
n)方式,ラグ等のピッチ周期を使用するVSELP(Vec
tor Sum Excited Linear Prediction)方式等、符号化信
号にピッチ周期やラグ等のピッチ周期に関連するパラメ
ータを持つ符号化方法で予め音声信号を符号化して格納
したROMや、一時的に格納したRAM等よりなるメモ
リ1から読み出した符号化音声信号を復号化処理手段2
により復号し、復号された前記音声信号および、その際
に復号化したピッチ周期に関連するパラメータを複数フ
レーム分、一時的にバッファ3に格納しておく。そして
話速変換処理手段5では、バッファ3に格納されている
音声信号と、その音声信号を復号化する際に用いたピッ
チ周期に関連するパラメータのピッチ周期を用いて前記
ポインター制御による重複加算法(PICOLA),時間
領域調波構造伸縮法(TDHS)等、ピッチ波形単位で音
声信号を時間軸上で間引いたり、同じ波形または、類似
波形を挿入したりする方法により、再生する音声の速度
を任意の速度に調整する。
Next, the operation will be described. First, TC-WVQ (Transform Coding Wi
th Weighted Vector Quantization) method, RPE-LT
P (Regular Pulse Excitation-Long Term Predictio
n) method, VSELP (Vec
tor Sum Excited Linear Prediction) method, such as a ROM in which a voice signal is encoded and stored in advance by an encoding method that has a parameter related to the pitch period such as pitch period or lag in the encoded signal, RAM that is temporarily stored, etc. The coded audio signal read from the memory 1 including the decoding processing means 2
The decoded voice signal and the parameters related to the pitch period decoded at that time are temporarily stored in the buffer 3 for a plurality of frames. Then, the speech speed conversion processing means 5 uses the voice signal stored in the buffer 3 and the pitch period of the parameter related to the pitch period used when the voice signal is decoded, by the pointer control overlapping addition method. (PICOLA), time domain harmonic structure expansion / contraction method (TDHS), etc., the speed of the reproduced voice is reduced by decimating the audio signal on the time axis in pitch waveform units or inserting the same or similar waveform. Adjust to any speed.

【0010】このように復号化処理で復号化した音声信
号と、その音声信号を復号化する際に用いたピッチ周期
に関連するパラメータを話速変換処理に用いることによ
り、復号化音声の再生速度を任意の速度に調整すること
ができる。
By using the speech signal thus decoded in the decoding processing and the parameter relating to the pitch period used in decoding the speech signal in the speech speed conversion processing, the reproduction speed of the decoded speech is reproduced. Can be adjusted to any speed.

【0011】[0011]

【発明の効果】本発明は以上の実施例から明らかなよう
に、復号化処理で復号化した音声信号と、その音声信号
を復号化する際に用いたピッチ周期に関連するパラメー
タを話速変換処理に用いることにより、話速変換処理用
のピッチ周期を新たに算出する必要がないため、演算量
とプログラムサイズをあまり増やさないで復号化音声の
再生速度を任意の速度に調整することができる。
As is apparent from the above embodiments, the present invention converts the voice signal decoded by the decoding process and the voice speed conversion parameter related to the pitch period used when decoding the voice signal. Since it is not necessary to newly calculate the pitch period for the voice speed conversion processing by using it for the processing, the reproduction speed of the decoded voice can be adjusted to an arbitrary speed without increasing the calculation amount and the program size. .

【図面の簡単な説明】[Brief description of drawings]

【図1】本発明の話速変換復号化方法を実施し得る装置
の一例を示す回路ブロック図である。
FIG. 1 is a circuit block diagram showing an example of an apparatus that can implement a speech speed conversion decoding method of the present invention.

【図2】従来の話速変換復号化方法を実施し得る装置の
一例を示す回路ブロック図である。
FIG. 2 is a circuit block diagram showing an example of an apparatus that can implement a conventional speech speed conversion / decoding method.

【符号の説明】[Explanation of symbols]

1…メモリ、 2…復号化処理手段、 3…バッファ、
4…ピッチ周期算出手段、 5…話速変換処理手段。
1 ... Memory, 2 ... Decoding processing means, 3 ... Buffer,
4 ... Pitch cycle calculation means, 5 ... Speech speed conversion processing means.

Claims (1)

【特許請求の範囲】[Claims] 【請求項1】 符号化データにピッチ周期に関連するパ
ラメータを含む音声符号化方法を用いて音声信号を符号
化し、メモリ等に記録すると共にそれを復号化して音声
信号を再生する音声復号化方法を用い、復号化した音声
信号を、前記符号化データであるピッチ周期に関連する
パラメータが示すピッチ周期に基づき、その音声信号の
時間軸上において波形を挿入,間引きして音声信号を圧
縮,伸長し、話速変換を行うことを特徴とする話速変換
復号化方法。
1. A voice decoding method in which a voice signal is encoded by using a voice encoding method in which encoded data includes a parameter related to a pitch period, recorded in a memory or the like, and decoded to reproduce the voice signal. Based on the pitch period indicated by the parameter related to the pitch period which is the encoded data, the decoded voice signal is inserted and thinned out on the time axis of the voice signal to compress and expand the voice signal. Then, the speech speed conversion decoding method is characterized by performing speech speed conversion.
JP7093744A 1995-04-19 1995-04-19 Speech speed conversion and decoding method Pending JPH08286696A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
JP7093744A JPH08286696A (en) 1995-04-19 1995-04-19 Speech speed conversion and decoding method

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
JP7093744A JPH08286696A (en) 1995-04-19 1995-04-19 Speech speed conversion and decoding method

Publications (1)

Publication Number Publication Date
JPH08286696A true JPH08286696A (en) 1996-11-01

Family

ID=14090938

Family Applications (1)

Application Number Title Priority Date Filing Date
JP7093744A Pending JPH08286696A (en) 1995-04-19 1995-04-19 Speech speed conversion and decoding method

Country Status (1)

Country Link
JP (1) JPH08286696A (en)

Similar Documents

Publication Publication Date Title
JP3946812B2 (en) Audio signal conversion apparatus and audio signal conversion method
JP2001344905A (en) Data reproducing device, its method and recording medium
US6678650B2 (en) Apparatus and method for converting reproducing speed
JPH09330097A (en) Voice reproducing device
JPH08286696A (en) Speech speed conversion and decoding method
JP3303580B2 (en) Audio coding device
JPH05303399A (en) Audio time axis companding device
JPH03233500A (en) Voice synthesis system and device used for same
JP2905215B2 (en) Recording and playback device
JP2860991B2 (en) Audio storage and playback device
JPH0854895A (en) Reproducing device
JP2861005B2 (en) Audio storage and playback device
JP3930596B2 (en) Audio signal encoding method
JP2709198B2 (en) Voice synthesis method
JPH0235320B2 (en)
JPH02135931A (en) Signal processing method
JPH04249300A (en) Method and device for voice encoding and decoding
JP3947191B2 (en) Prediction coefficient generation device and prediction coefficient generation method
JPH11311997A (en) Sound reproducing speed converting device and method therefor
JP2937091B2 (en) Musical tone generating apparatus and musical tone generating method capable of reproducing compressed waveform data in the reverse direction
JP2001265392A (en) Voice coding device and its method
JPH08305393A (en) Reproducing device
JPH10124097A (en) Voice recording and reproducing device
JPH01197793A (en) Speech synthesizer
JP2007101644A (en) Voice reproducing apparatus