JPH03198000A - Voice time-base compressing and encoding device - Google Patents

Voice time-base compressing and encoding device

Info

Publication number
JPH03198000A
JPH03198000A JP63056812A JP5681288A JPH03198000A JP H03198000 A JPH03198000 A JP H03198000A JP 63056812 A JP63056812 A JP 63056812A JP 5681288 A JP5681288 A JP 5681288A JP H03198000 A JPH03198000 A JP H03198000A
Authority
JP
Japan
Prior art keywords
waveform
time axis
compressed
frame
pitch
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
JP63056812A
Other languages
Japanese (ja)
Other versions
JPH0782354B2 (en
Inventor
Masaya Takahashi
真哉 高橋
Kunio Nakajima
中島 邦男
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Mitsubishi Electric Corp
Original Assignee
Mitsubishi Electric Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Mitsubishi Electric Corp filed Critical Mitsubishi Electric Corp
Priority to JP63056812A priority Critical patent/JPH0782354B2/en
Publication of JPH03198000A publication Critical patent/JPH03198000A/en
Publication of JPH0782354B2 publication Critical patent/JPH0782354B2/en
Anticipated expiration legal-status Critical
Expired - Lifetime legal-status Critical Current

Links

Abstract

PURPOSE:To prevent a voice from deteriorating by using a compressed waveform signal as a voice waveform signal with a constant pitch cycle and connecting the remaining voice waveform signal to the compressed waveform signal without compression. CONSTITUTION:An in-frame time-base compressing means 4 sections an input voice waveform 3 into constant-length analytic frames, and pitch cycles in the analytic frames are found by, for example, autocorrelation analysis and utilized to compress voice waveforms in the analytic frames on the time basis, thereby finding a compressed waveform 5. Further, this device is equipped with an in-frame time-base expanding means 15 which expands the compressed waveform 14 into a voice waveform with original analytic frame length by utilizing the pitch cycles 6 after the signal is decoded in analytic frame units. Then a frame which is longer than plural pitches of the input voice waveform is set, a waveform of plural (p) pitches in this frame is compressed into a waveform of less integral (q) pitches, and the remaining voice waveform signal is connected to the compressed waveform with the integral pitch without being compressed. Consequently, even if a transmission error of pitch cycles is generated, its influence stays in the sectio.

Description

【発明の詳細な説明】 〔産業上の利用分野〕 この発明は、一定長の分析フレーム範囲内でピッチ周期
に基く音声波形の時間軸圧縮伸長を行う音声時間軸圧縮
符号化装置に関するものである。
[Detailed Description of the Invention] [Industrial Application Field] The present invention relates to an audio time axis compression/encoding device that performs time axis compression/expansion of an audio waveform based on a pitch period within a fixed length analysis frame range. .

〔従来の技術〕[Conventional technology]

従来のこの種の装置として第4図に示すようなものがあ
った。この因は、I OC’82 (IEEE Int
ernational Communication 
C1onference 、 19B2 )での発表論
文R,V、0oxeta’1.’An Impleme
ntation ofTime Domain Har
monic Scaling with Applic
ation to 5peech Coding”(1
0082,pp、4G、1.1−4 )のFig、lに
示されたものと同様なもので、図において、(1)はマ
イクロホン、(2)はA/D変換器、(3)は入力音声
波形、Q侍は時間軸圧縮手段、(ホ)はピッチ周期抽出
手段、(5)は圧縮波形、(6)はピッチ周期、(7)
は符号化手段、(8)はピッチ符号化手段、(9)はマ
ルチプレクサ、Qlは伝送路、(ロ)はデマルチプレク
サ、(2)は復号化手段、(2)はピッチ復号化手段、
Q4は復号後圧縮波形、(財)は時間軸伸長手段、αり
は伸長波形、aηはD/A変換器、(至)はスピーカで
ある。
A conventional device of this type is shown in FIG. The reason for this is I OC'82 (IEEE Int
national communication
Paper presented at C1onference, 19B2) R, V, 0oxeta'1. 'An Impleme
ntation of Time Domain Har
monic scaling with application
ation to 5peech Coding” (1
0082, pp. 4G, 1.1-4). In the figure, (1) is a microphone, (2) is an A/D converter, and (3) is a Input audio waveform, Q Samurai is time axis compression means, (E) is pitch period extraction means, (5) is compressed waveform, (6) is pitch period, (7)
is an encoding means, (8) is a pitch encoding means, (9) is a multiplexer, Ql is a transmission line, (b) is a demultiplexer, (2) is a decoding means, (2) is a pitch decoding means,
Q4 is a compressed waveform after decoding, (F) is a time axis expansion means, α is an expanded waveform, aη is a D/A converter, and (To) is a speaker.

〔従来装置の動作〕[Operation of conventional device]

次に動作について説明する0先ず第4図の送信部につい
て説明する。マイクロホン(1)から入力された音声は
、A/D変換器(2)で離散データ系列(以後入力せ声
波形(3)と呼ぶ)に変換される。ピッチ周期抽出手段
員は例えば自己相関分析によシ入力音声波形(3)のピ
ッチ周期(6)を求める。時間軸圧縮手段−はピッチ周
期抽出手段翰で抽出したピッチ周期(6)に基き、連続
する2つのピッチ区間毎に入力音声波形(3)を時間軸
上で1/2に圧縮し、圧縮波形(5)を求める。ここで
時間軸圧縮手段α傷で行われる時間軸圧縮の詳細を第5
図に示す0第5図において、(5a)4−i連続する2
ピッチ区間の入力音声波形0 (n)でろ、り(Ob)
は入力音声波形S (n)を時間軸上で1/2圧縮した
圧縮波形5C(n)である。今入力音声波形8 (n)
のピッチ周期をTpとすると、圧縮波形5c(n)は次
の山、(2)式によって求める03c(n)= (1−
Wc(n) )−5(n)+ Wc(n) ・S (n
 + Tp )   tl)wc(n)=(n −1/
2 )/Tp           t2)n=1,2
.*zjT。
Next, the operation will be explained.First, the transmitter shown in FIG. 4 will be explained. Voice input from the microphone (1) is converted into a discrete data series (hereinafter referred to as input voice waveform (3)) by an A/D converter (2). The pitch period extraction means obtains the pitch period (6) of the input speech waveform (3), for example, by autocorrelation analysis. The time axis compression means compresses the input speech waveform (3) to 1/2 on the time axis for every two consecutive pitch sections based on the pitch period (6) extracted by the pitch period extraction means Kan, and generates a compressed waveform. Find (5). Here, the details of the time axis compression performed by the time axis compression means α scratch are explained in the fifth section.
In Figure 5, (5a) 4-i consecutive 2
Input audio waveform of pitch section 0 (n) (Ob)
is a compressed waveform 5C(n) obtained by compressing the input audio waveform S (n) by 1/2 on the time axis. Now input audio waveform 8 (n)
If the pitch period of is Tp, the compressed waveform 5c(n) is the following peak, 03c(n)=(1-
Wc(n) )-5(n)+ Wc(n) ・S(n
+ Tp ) tl) wc(n) = (n −1/
2)/Tp t2)n=1,2
.. *zjT.

ここで(II 、 (2)式のWC(n)は入力音声波
形5(n)にかける窓関数でおり、第5図は入力音声波
形S (n)に窓関数wc(n)をかけ圧縮波形5c(
n)を求める手順を示している。
Here, (II), WC(n) in equation (2) is a window function that is applied to the input audio waveform 5(n), and in Figure 5, the input audio waveform S (n) is multiplied by the window function wc(n) and compressed. Waveform 5c (
The procedure for finding n) is shown.

今、第4図の時間軸圧縮手段a値で上記の方法により求
まった圧縮波形(5)は符号化手段(7)で例えば1サ
ンプル毎にパルス符号化され、またピッチ周期(6)も
ピッチ符号化手段(8)で符号化される。マルチプレク
サ(9)は符号化された圧縮波形清報とピッチ周期清報
を多重化して伝送路(Inに出力する。
Now, the compressed waveform (5) obtained by the above method using the time axis compression means a value shown in FIG. It is encoded by the encoding means (8). The multiplexer (9) multiplexes the encoded compressed waveform signal and the pitch period signal and outputs the multiplexed signal to the transmission line (In).

次に第4図の受信部について説明する。伝送路Qlを伝
搬した符号化された圧縮波形i*=iとピッチ周期清報
は、デマルチプレクサt)1)で分離される。
Next, the receiving section shown in FIG. 4 will be explained. The encoded compressed waveform i*=i and the pitch period information propagated through the transmission path Ql are separated by a demultiplexer t)1).

圧縮波形清報は復号化手段(2)で符号化手段(7)に
対応した復号処りで復号化され復号後圧縮波形α→とな
る。同様にピッチ周期清報もピッチ復号化手段0で復号
化される。時間軸伸長手段121)はピッチ復号化手段
qgで復号化されたピッチ周期((ilに基き、このピ
ッチ周期単位に復号18E縮彼形C−4)を時間軸上で
2倍に伸長し、伸長波形OQを求める。時間軸伸長手段
12]Jで行われる時間軸伸長の計、!tIを第6図に
示す。第6図において、(6a)は連続する3ピッチ区
間(r−1,r、r+1.番目)の復号後圧縮波形合c
(n)であり、(6b)は復号後圧縮波形◇c(n)の
r番目のピッチ区間を2倍に伸長した伸長波形◇。(n
)である。今、復号後圧縮波形介c(n)においてr番
目のピッチ区間の先頭地点をn=1とすると、伸長波形
e 、(n)は次の(3)、(4)式によって求める。
The compressed waveform information is decoded by the decoding means (2) in a decoding process corresponding to the encoding means (7), and becomes a compressed waveform α→ after decoding. Similarly, the pitch period information is also decoded by the pitch decoding means 0. The time axis expansion means 121) expands the pitch period decoded by the pitch decoding means qg ((based on il, decoded 18E contraction C-4 in pitch period units) on the time axis, Determine the expanded waveform OQ. The total time axis expansion performed by the time axis expansion means 12]J, !tI, is shown in FIG. 6. In FIG. r, r+1.th) decoded compressed waveform sum c
(n), and (6b) is the compressed waveform after decoding ◇ An expanded waveform ◇ in which the r-th pitch section of c(n) is expanded by twice. (n
). Now, if the starting point of the r-th pitch section in the decoded compressed waveform c(n) is set to n=1, the expanded waveforms e and (n) are determined by the following equations (3) and (4).

G、(n)= <1− We(n) >檜c(n)+ 
we(n) 4c(n −Tp>   (3)We (
n)=(n −1/ 2 )/2T p       
       (4)n=1,2.  ・・・・2Tp (3) 、 (4)式のwe(n)は復号後王都波形◇
c(n)にかける窓関数であり、第6図は復号後圧縮波
形5c(n)に窓関数W、(n)をかけ、伸長波形S、
←)を求める手順を示している。
G, (n) = <1- We(n) > Cypress c(n)+
we(n) 4c(n −Tp> (3) We (
n)=(n-1/2)/2T p
(4) n=1,2. ...2Tp (3), we(n) in equations (4) is the royal capital waveform after decoding◇
It is a window function that is applied to c(n), and in FIG.
←).

第4図の時間軸伸長手段31)で上記の方法によシ求ま
った伸長波形αQは、D/A変換器0ηでアナログ信号
に変換され、スピーカQ8!から出力される0〔発明が
解決しようとする課題〕 従来の音声時間軸圧縮符号化装置は以上の様に構成され
ておシ、その復号化部の時間軸伸長手段は復号?1kE
E縮波形を伝送路より伝送されたピッチ周期の2倍の長
さに逐次伸長するので、伝送路で発生した伝送誤りによ
って一凌ピツチ周期が誤伝送されると、そのピッチ周期
区間のみに限らずそれ以後時間軸伸長手段で伸長される
伸長波形総てに誤りが波及し、音質に著しい劣化を生ず
るという課題があった。
The expanded waveform αQ obtained by the above method by the time axis expansion means 31) in FIG. [Problem to be Solved by the Invention] The conventional audio time axis compression encoding device is configured as described above, and the time axis expansion means of the decoding section is used for decoding. 1kE
Since the E-condensed waveform is sequentially expanded to twice the length of the pitch period transmitted from the transmission line, if one pitch period is erroneously transmitted due to a transmission error that occurs on the transmission line, it is limited to only that pitch period section. There is a problem in that the error spreads to all expanded waveforms that are subsequently expanded by the time axis expansion means, resulting in significant deterioration in sound quality.

また、符号化、復号化手段で行う音声符号化復号化処理
を、ある一定長の分析フレーム単位で行う場合、この分
析フレーム区間とピッチ周期単位で行う時間軸圧縮伸長
の区間の整合が取れないため、音声符号化復号化処理と
時1′&lJ@圧縮伸長処理を個別の区間で別々に実行
しなければならず処理が複雑で処理遅延を生ずる可能性
があるという課題がおった。
Furthermore, when the audio encoding/decoding process performed by the encoding/decoding means is performed in units of analysis frames of a certain length, the analysis frame interval and the interval of time axis compression/expansion performed in pitch cycle units cannot be matched. Therefore, a problem arises in that the audio encoding/decoding process and the compression/decompression process must be performed separately in separate sections, making the process complicated and potentially causing a processing delay.

〔課題を解決するための手段〕[Means to solve the problem]

この発明に係る音声時間@圧縮符号化装置は、一定長の
分析フレーム毎にその分析フレームの範囲内でピッチ周
期に基く音声波形の時間軸圧縮を行う時間軸圧縮手段と
、この分析フレーム毎に時間軸圧縮された波形を元の分
析フレーム長まで時間軸伸長する時間軸伸長手段を備え
たものである。
The audio time @compression encoding device according to the present invention includes a time axis compression means for compressing the time axis of an audio waveform based on a pitch period within the range of the analysis frame for each analysis frame of a fixed length; The apparatus is equipped with a time axis expansion means for expanding the time axis of the time axis compressed waveform to the original analysis frame length.

〔作用〕[Effect]

この発明に訃ける時間軸圧縮手段は、一定長の分析フレ
ーム毎に、その分析フレーム内で連続して存在する数ピ
ツチ周期分の音声波形について、連接しな2ピッチ周期
区間毎に両ピッチ周期区間の波形の類似性を利用して時
間軸上で圧縮する。
The time axis compression means according to the present invention, for each analysis frame of a certain length, converts the audio waveform of several pitch periods continuously existing within the analysis frame into two pitch periods in every two non-contiguous pitch period sections. Compression is performed on the time axis using the similarity of the waveforms of the sections.

またこの発明における時間軸伸長手段は、前記時間軸圧
縮手段で分析フレーム毎に時間軸圧縮された圧縮音声波
形を、前記ピッチ周側に基き時間軸上で伸長し元の分析
フレーム長の音声波形にする。
Further, the time axis expansion means in the present invention expands the compressed audio waveform, which has been time axis compressed for each analysis frame by the time axis compression means, on the time axis based on the pitch circumference side to form the audio waveform of the original analysis frame length. Make it.

〔実施例〕〔Example〕

第1図はこの発明の一実施例を示す構成図であり、!1
1〜(31、(5)〜04) 、 M〜J119は第4
図における従来装置と同一のものであり、説明を省略す
る0第1図において(4)はフレーム内時間軸圧縮手段
、(ト)はフレーム内時間軸伸長手段である。
FIG. 1 is a block diagram showing an embodiment of the present invention. 1
1-(31, (5)-04), M-J119 is the 4th
In FIG. 1, (4) is an intra-frame time axis compression means, and (g) is an intra-frame time axis expansion means, which is the same as the conventional apparatus shown in the figure, and the explanation thereof will be omitted.

第1図でフレーム内時間軸圧縮手段(4)は入力音声波
形(3)を一定長の分析フレーム毎に区切シその分析フ
レーム内のピッチ周期を例えば自己相関分析によって求
め、このピッチ周期を利用してその分析フレーム内の音
声波形を時間軸上で圧縮し、圧縮波形(5)を求める。
In Fig. 1, the intra-frame time axis compression means (4) divides the input speech waveform (3) into analysis frames of a certain length, determines the pitch period within each analysis frame by, for example, autocorrelation analysis, and utilizes this pitch period. Then, the audio waveform within the analysis frame is compressed on the time axis to obtain a compressed waveform (5).

また、攻めたピッチ周期(6)はピッチ符号化手段(8
)で符号化される。
Furthermore, the aggressive pitch period (6) is determined by the pitch encoding means (8).
) is encoded.

フレーム内時間軸伸長手段(へ)は、分析フレーム毎に
、復号後圧縮波形0.0をピッチ周期(6)を利用して
元の分析フレーム長を持つ音声波形に伸長する。
For each analysis frame, the intra-frame time axis expansion means expands the decoded compressed waveform 0.0 into an audio waveform having the original analysis frame length using the pitch period (6).

第2図、第3図に本実施例におけるフレーム内時間軸E
EM手段(4)とフレーム内時間軸伸長手段(至)が実
行する分析フレーム毎の時間軸圧縮伸長の詳細を示す。
Figures 2 and 3 show the intra-frame time axis E in this embodiment.
The details of the time axis compression/expansion for each analysis frame executed by the EM means (4) and the intra-frame time axis expansion means (to) are shown.

第2図は、入力音声波形のピッチ周期T、が分析フレー
ム長Nに対してN/4(Tp≦N/3であるときの時間
軸圧縮伸長を示している。ここで分析フレーム内での晩
間軸EE縮が行える条件として分析フレーム長Nが富に
最大ピッチ周期1°、(max)の2倍以上であること
が必賛である。
Figure 2 shows time axis compression/expansion when the pitch period T of the input speech waveform is N/4 (Tp≦N/3) with respect to the analysis frame length N. As a condition for performing late-night axis EE reduction, it is essential that the analysis frame length N is at least twice the maximum pitch period of 1°, (max).

フレーム内時間軸圧縮+段(4)は、入力音声波形(2
a)の第1番目の2ピッチ周期区間を前記+11 、 
t2)式によって1ピッチ周期分に時間@圧縮する0分
析フレームの残りの音声波形には2ピッチ周期区間分の
長さがないのでこの区間の時間軸圧縮は行わす、時間軸
圧縮した音声波形にそのままつぎ合わせでFE縮i皮形
(2b)を求める。ここで入力音声波形は(N −T、
)/’Hに圧縮される。
The intra-frame time axis compression + stage (4) uses the input audio waveform (2
The first 2-pitch period section of a) is +11 above,
t2) The remaining audio waveform of the 0 analysis frame whose time is compressed to 1 pitch period by the formula does not have the length of 2 pitch period sections, so time axis compression of this section is performed.Time axis compressed audio waveform The FE shrunken skin shape (2b) is obtained by piecing together the two as they are. Here, the input audio waveform is (N − T,
)/'H.

フレーム内時間軸伸長手段αeは復号後FEII波形(
2C)の第1番目の1ピッチ間期分の音声波形を前記(
31、(4) <によって2ピッチ周期分に伸長し、復
号後IE圧縮形の残りの区間を七のままつぎ合わせて、
元の分析フレーム長Nを持つ伸長波形(2(1)を求め
る。
The intra-frame time axis expansion means αe generates the decoded FEII waveform (
The audio waveform for the first 1 pitch period of 2C) is
31, (4) Expand it to 2 pitch periods by
Find the expanded waveform (2(1)) with the original analysis frame length N.

第3図は、入力音声波形のピッチ周期Tpが分析フレー
ム長Nに対してN15 <Tp <−であるときの番 時間@圧縮伸長を示している。
FIG. 3 shows the cycle time@compression/expansion when the pitch period Tp of the input speech waveform satisfies N15 <Tp <- with respect to the analysis frame length N.

この場合フレーム内時間軸圧縮手段(4)は入力音声波
形(3a)の第1番目と第2番目の2ピッチ周期区間を
それぞれ前記+1.1 、 !2)式によって1ピッチ
周期分に時間軸圧縮する。第2図の場合と同様に分析フ
レームの残りの音声波形については時間軸圧縮を行わず
、時間軸圧縮した音声波形にそのままつぎ合わせて圧縮
波形(3b)を得る。ここで入力音声波形は(N −2
Tp)/N K 圧縮される。
In this case, the intra-frame time axis compression means (4) converts the first and second 2-pitch period sections of the input audio waveform (3a) into the above +1.1, !, respectively. 2) The time axis is compressed to one pitch period using the formula. As in the case of FIG. 2, the remaining audio waveforms of the analysis frame are not subjected to time axis compression, but are simply spliced to the time axis compressed audio waveform to obtain a compressed waveform (3b). Here, the input audio waveform is (N −2
Tp)/N K compressed.

フレーム内時間軸伸長手段叫は、先づ復号後圧縮波形(
3C)の第1番目のピッチ周期区間(図(3c)の■)
を前記(3)、 (4)式によって2倍に伸長する。
The intra-frame time axis expansion means first decodes the compressed waveform (
3C) first pitch period section (■ in Figure (3c))
is expanded twice using equations (3) and (4) above.

次に第2番目のピッチ周期区間(図(3C)の■)を、
存在しない第3番目のピッチ周期区間を第2番目のピッ
チ周期区間で置換した置換波形(3d)を用いて2倍に
伸長する。そして、上記で求まった復号後音声波形(3
C)の第1番目と第2番目のピッチ周期区間の2倍伸長
波形と、復号後音声波形(3C)の残りの区間をつなぎ
合わせ、伸長波形(3e)を求める0 以上第2図。第3図に示した様な手1頃で、一定長の分
析フレーム内でピッチ周期を利用した時間軸FE、縮伸
長を行う。
Next, the second pitch period section (■ in Figure (3C)) is
A replacement waveform (3d) in which the non-existent third pitch period section is replaced with the second pitch period section is used to expand the waveform twice. Then, the decoded audio waveform (3
The doubled expanded waveform of the first and second pitch period sections in C) and the remaining section of the decoded audio waveform (3C) are connected to obtain the expanded waveform (3e). At the beginning, as shown in FIG. 3, time axis FE and compression/expansion are performed using the pitch period within an analysis frame of a constant length.

次に本発明によるフレーム内時間軸圧縮手段は時間軸圧
縮率がピッチ周期によって変化するので、この時間軸圧
縮率の変化について考える0今、分析フレーム長内で存
在する2倍のピッチ周期区間の数をkとし、分析フレー
ム長N、ピッチ周期T、とすると となる。kは分析フレーム内で2ピッチ周期区間単位で
行う時間軸圧縮の回数であり、TpXkが実際に時間軸
圧縮される長さとなる。よって時間軸圧縮率Rは となる。
Next, since the time axis compression rate of the in-frame time axis compression means according to the present invention changes depending on the pitch period, let us consider the change in this time axis compression rate. Let the number be k, the analysis frame length N, and the pitch period T. k is the number of times of time axis compression performed in units of two pitch period sections within the analysis frame, and TpXk is the length of time axis compression actually performed. Therefore, the time axis compression ratio R is as follows.

k=1.2.3のときのピッチ周期T、と時間軸圧縮率
の存在範囲を次に示す0 (7)式より時間軸圧縮4Rの存在範囲は次式で定式化
できる。
The range in which the pitch period T and the time axis compression rate exist when k=1.2.3 is shown below. From equation (7), the range in which the time axis compression 4R exists can be formulated as the following equation.

でありその時の時間軸圧縮率Rは である。従って本発明によるフレーム内時間軸圧縮手段
は、分析フレーム内の入力音声波形を(9)式で示した
時間軸圧縮率の範囲内で時間軸圧縮することができる。
And the time axis compression ratio R at that time is. Therefore, the intra-frame time axis compression means according to the present invention can time axis compress the input audio waveform within the analysis frame within the range of the time axis compression ratio shown by equation (9).

なお、上記実施例では符号化手段(力はパルス符号化を
行うものとしたが、他のいかなる音声符号化法を行うも
のとしても良く、特に一定長の分析フレーム単位に符号
化を行う方式を用いる場合、フレーム内時間軸圧縮手段
で設定する分析フレームとこの符号化方式の分析フレー
ムとを同一にしても良い。このとき、復号化手段α〔で
の復号化法は符号化法に順するものとする0 〔発明の効果〕 以上のようにこの発明によれば、ピッチ周期に基づいた
音声波形の時間軸圧縮伸長を一定長の分析フレーム範囲
内で行えるので、伝送路での伝送誤りである分析フレー
ムのピッチ周期が誤って伝送されてもこの誤りによる伸
長波形への直接の影響をその分析フレーム内におさえる
ことができ、しかも符号化手段が一定長の分析フレーム
毎に音声符号化を行うときは、この分析フレームと前記
時間軸圧縮伸長の分析フレームを同一化し処理を簡単化
できるという効果がある。
In addition, in the above embodiment, the encoding means (power) performs pulse encoding, but any other audio encoding method may be used. When used, the analysis frame set by the intra-frame time axis compression means and the analysis frame of this encoding method may be the same.In this case, the decoding method in the decoding means α follows the encoding method. [Effects of the Invention] As described above, according to the present invention, since the time axis compression/expansion of the audio waveform based on the pitch period can be performed within the analysis frame range of a certain length, transmission errors on the transmission path can be prevented. Even if the pitch period of a certain analysis frame is erroneously transmitted, the direct effect of this error on the expanded waveform can be suppressed within that analysis frame, and moreover, the encoding means performs voice encoding for each analysis frame of a certain length. When this is done, this analysis frame and the analysis frame of the time axis compression/expansion are made the same, thereby simplifying the processing.

【図面の簡単な説明】[Brief explanation of drawings]

第1図はこの発明の音声時間軸圧縮符号化装置の一実施
例を示す全体構成図、第2図、第3図はこの発明の一実
施例での処理を示す波形説明図、第4図は従来の音声時
間軸圧縮符号化装置の一例を示す全体構成図、第5図は
従来の音声時間軸圧縮符号化装置の時間軸圧縮処理を示
す説明図、第6図は従来の音声時間軸圧縮符号化装置の
時間軸伸長処理を示す説明図である0 図において、(1)はマイクロホン、(2)はA/D変
換i、(3)は入力音声波形、(4)はフレーム内時間
軸圧縮手段、(5)は圧縮波形、(6)はピッチ周期、
(7)は符号化手段、(8)はピッチ符号化手段、(9
)はマルチプレクサ、Qlは伝送路、(11)はデマル
チプレクサ、(イ)は復号化手段、(至)はピッチ復号
化手段、α→は復号後圧細波形、(至)はフレーム内時
間軸伸長手段、a・は伸長波形、Q′hはD/A変換器
、(至)はスピーカ、(2a)は入力音声波形、(2b
)は圧縮波形、(20)は復号後圧細波形、(2d)は
伸長波形、(3a)は入力音声波形、(3b)は圧縮波
形、(3C)は復号後圧細波形、(3d)は置換波形、
(3e)は伸長波形、α9は時間軸圧縮手段、(1)は
ピッチ周期抽出手段、@Eは時間軸伸長手段、(5a)
は入力音声波形、(5b)は圧縮波形、(6a)は復号
後圧細波形、(61))は伸長波形である。 なお、各図中の同一符号は同−又は相当部分を示す。
FIG. 1 is an overall configuration diagram showing an embodiment of the audio time axis compression encoding device of the present invention, FIGS. 2 and 3 are waveform explanatory diagrams showing processing in an embodiment of the invention, and FIG. 4 is an overall configuration diagram showing an example of a conventional audio time axis compression encoding device, FIG. 5 is an explanatory diagram showing the time axis compression process of the conventional audio time axis compression encoding device, and FIG. 0 is an explanatory diagram showing the time axis expansion process of the compression encoding device. In the figure, (1) is the microphone, (2) is the A/D conversion i, (3) is the input audio waveform, and (4) is the time within the frame. Axial compression means, (5) compression waveform, (6) pitch period,
(7) is an encoding means, (8) is a pitch encoding means, (9
) is the multiplexer, Ql is the transmission line, (11) is the demultiplexer, (a) is the decoding means, (to) is the pitch decoding means, α→ is the compressed waveform after decoding, (to) is the time axis within the frame Decompression means, a. is the decompressed waveform, Q'h is the D/A converter, (to) is the speaker, (2a) is the input audio waveform, (2b
) is the compressed waveform, (20) is the compressed waveform after decoding, (2d) is the expanded waveform, (3a) is the input audio waveform, (3b) is the compressed waveform, (3C) is the compressed waveform after decoding, (3d) is the replacement waveform,
(3e) is an expanded waveform, α9 is a time axis compression means, (1) is a pitch period extraction means, @E is a time axis expansion means, (5a)
is an input audio waveform, (5b) is a compressed waveform, (6a) is a compressed waveform after decoding, and (61) is an expanded waveform. Note that the same reference numerals in each figure indicate the same or corresponding parts.

Claims (1)

【特許請求の範囲】[Claims] 入力音声波形を時間軸上で圧縮して符号化する送信部と
、この圧縮符号化された音声波形を復号化して時間軸上
で伸長する受信部から成る音声時間軸圧縮伸長装置にお
いて、入力音声波形を、一定長の分析フレーム毎にこの
分析フレームの範囲内でこの入力音声波形のこの分析フ
レームにおけるピッチ周期を利用して時間軸上で圧縮す
るフレーム内時間軸圧縮手段を送信部に備え、この時間
軸圧縮手段で時間軸圧縮された音声波形を、前記分析フ
レーム毎に前記ピッチ周期に基き前記分析フレーム長を
持つ音声波形に伸長するフレーム内時間軸伸長手段を受
信部に備えることを特徴とした音声時間軸圧縮符号化装
置。
In an audio time-base compression/expansion device that consists of a transmitter that compresses and encodes an input audio waveform on the time axis, and a receiver that decodes and expands the compressed-encoded audio waveform on the time axis, the input audio The transmitting unit includes an intra-frame time axis compression means for compressing the waveform on the time axis within the range of the analysis frame for each analysis frame of a fixed length by using the pitch period in this analysis frame of the input audio waveform, The reception section is characterized by comprising an intra-frame time axis expansion means for expanding the audio waveform time-axis compressed by the time axis compression means into an audio waveform having the analysis frame length based on the pitch period for each analysis frame. Audio time axis compression encoding device.
JP63056812A 1988-03-09 1988-03-09 Audio time base compression device, audio time base expansion device and audio time base compression and expansion device Expired - Lifetime JPH0782354B2 (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
JP63056812A JPH0782354B2 (en) 1988-03-09 1988-03-09 Audio time base compression device, audio time base expansion device and audio time base compression and expansion device

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
JP63056812A JPH0782354B2 (en) 1988-03-09 1988-03-09 Audio time base compression device, audio time base expansion device and audio time base compression and expansion device

Publications (2)

Publication Number Publication Date
JPH03198000A true JPH03198000A (en) 1991-08-29
JPH0782354B2 JPH0782354B2 (en) 1995-09-06

Family

ID=13037788

Family Applications (1)

Application Number Title Priority Date Filing Date
JP63056812A Expired - Lifetime JPH0782354B2 (en) 1988-03-09 1988-03-09 Audio time base compression device, audio time base expansion device and audio time base compression and expansion device

Country Status (1)

Country Link
JP (1) JPH0782354B2 (en)

Also Published As

Publication number Publication date
JPH0782354B2 (en) 1995-09-06

Similar Documents

Publication Publication Date Title
US5619197A (en) Signal encoding and decoding system allowing adding of signals in a form of frequency sample sequence upon decoding
JP3352406B2 (en) Audio signal encoding and decoding method and apparatus
AU616349B2 (en) Speech coding and decoding apparatus
JPH0955778A (en) Bandwidth widening device for sound signal
JPH0636158B2 (en) Speech analysis and synthesis method and device
JP2003150198A (en) Voice encoding device and voice decoding device
JPS63192100A (en) Multi-pulse encoder
JP2834260B2 (en) Speech spectral envelope parameter encoder
JPH03198000A (en) Voice time-base compressing and encoding device
JPH0236628A (en) Transmission system and transmission/reception system for voice signal
JP2644789B2 (en) Image transmission method
JPH01233835A (en) Voice time base compression coding device
JP2744618B2 (en) Speech encoding transmission device, and speech encoding device and speech decoding device
JPS60107933A (en) Adpcm encoding device
JPH03241399A (en) Voice transmitting/receiving equipment
JPH0526376B2 (en)
JPH0294832A (en) Voice coding and decoding system
JPH0232397A (en) Sound signal processor
KR960003626B1 (en) Decoding method of transformed coded audio signal for people hard of hearing
JPH0736486A (en) Speech encoding device
JPH0546199A (en) Speech encoding device
JPH0730462A (en) Method and device for transmitting sound signal
JPH045700A (en) Voice encoding/decoding device
JPH01293398A (en) Speech encoding system
JPH0832544A (en) Image coding and decoding device

Legal Events

Date Code Title Description
FPAY Renewal fee payment (event date is renewal date of database)

Free format text: PAYMENT UNTIL: 20080906

Year of fee payment: 13

EXPY Cancellation because of completion of term
FPAY Renewal fee payment (event date is renewal date of database)

Free format text: PAYMENT UNTIL: 20080906

Year of fee payment: 13