JPH03198000A

JPH03198000A - Voice time-base compressing and encoding device

Info

Publication number: JPH03198000A
Application number: JP63056812A
Authority: JP
Inventors: Masaya Takahashi; 真哉高橋; Kunio Nakajima; 中島　邦男
Original assignee: Mitsubishi Electric Corp
Current assignee: Mitsubishi Electric Corp
Priority date: 1988-03-09
Filing date: 1988-03-09
Publication date: 1991-08-29
Anticipated expiration: 2010-09-06
Also published as: JPH0782354B2

Abstract

PURPOSE:To prevent a voice from deteriorating by using a compressed waveform signal as a voice waveform signal with a constant pitch cycle and connecting the remaining voice waveform signal to the compressed waveform signal without compression. CONSTITUTION:An in-frame time-base compressing means 4 sections an input voice waveform 3 into constant-length analytic frames, and pitch cycles in the analytic frames are found by, for example, autocorrelation analysis and utilized to compress voice waveforms in the analytic frames on the time basis, thereby finding a compressed waveform 5. Further, this device is equipped with an in-frame time-base expanding means 15 which expands the compressed waveform 14 into a voice waveform with original analytic frame length by utilizing the pitch cycles 6 after the signal is decoded in analytic frame units. Then a frame which is longer than plural pitches of the input voice waveform is set, a waveform of plural (p) pitches in this frame is compressed into a waveform of less integral (q) pitches, and the remaining voice waveform signal is connected to the compressed waveform with the integral pitch without being compressed. Consequently, even if a transmission error of pitch cycles is generated, its influence stays in the sectio.

Description

【発明の詳細な説明】〔産業上の利用分野〕この発明は、一定長の分析フレーム範囲内でピッチ周期
に基く音声波形の時間軸圧縮伸長を行う音声時間軸圧縮
符号化装置に関するものである。[Detailed Description of the Invention] [Industrial Application Field] The present invention relates to an audio time axis compression/encoding device that performs time axis compression/expansion of an audio waveform based on a pitch period within a fixed length analysis frame range. .

[Conventional technology]

従来のこの種の装置として第４図に示すようなものがあ
った。この因は、Ｉ　ＯＣ’８２　（ＩＥＥＥ　Ｉｎｔ
ｅｒｎａｔｉｏｎａｌ　Ｃｏｍｍｕｎｉｃａｔｉｏｎ　
Ｃ１ｏｎｆｅｒｅｎｃｅ　、　１９Ｂ２　）での発表論
文Ｒ，Ｖ、０ｏｘｅｔａ’１．’Ａｎ　Ｉｍｐｌｅｍｅ
ｎｔａｔｉｏｎ　ｏｆＴｉｍｅ　Ｄｏｍａｉｎ　Ｈａｒ
ｍｏｎｉｃ　Ｓｃａｌｉｎｇ　ｗｉｔｈ　Ａｐｐｌｉｃ
ａｔｉｏｎ　ｔｏ　５ｐｅｅｃｈ　Ｃｏｄｉｎｇ”（１
００８２，ｐｐ、４Ｇ、１．１−４　）のＦｉｇ、ｌに
示されたものと同様なもので、図において、（１）はマ
イクロホン、（２）はＡ／Ｄ変換器、（３）は入力音声
波形、Ｑ侍は時間軸圧縮手段、（ホ）はピッチ周期抽出
手段、（５）は圧縮波形、（６）はピッチ周期、（７）
は符号化手段、（８）はピッチ符号化手段、（９）はマ
ルチプレクサ、Ｑｌは伝送路、（ロ）はデマルチプレク
サ、（２）は復号化手段、（２）はピッチ復号化手段、
Ｑ４は復号後圧縮波形、（財）は時間軸伸長手段、αり
は伸長波形、ａηはＤ／Ａ変換器、（至）はスピーカで
ある。A conventional device of this type is shown in FIG. The reason for this is I OC'82 (IEEE Int
national communication
Paper presented at C1onference, 19B2) R, V, 0oxeta'1. 'An Impleme
ntation of Time Domain Har
monic scaling with application
ation to 5peech Coding” (1
0082, pp. 4G, 1.1-4). In the figure, (1) is a microphone, (2) is an A/D converter, and (3) is a Input audio waveform, Q Samurai is time axis compression means, (E) is pitch period extraction means, (5) is compressed waveform, (6) is pitch period, (7)
is an encoding means, (8) is a pitch encoding means, (9) is a multiplexer, Ql is a transmission line, (b) is a demultiplexer, (2) is a decoding means, (2) is a pitch decoding means,
Q4 is a compressed waveform after decoding, (F) is a time axis expansion means, α is an expanded waveform, aη is a D/A converter, and (To) is a speaker.

[Operation of conventional device]

次に動作について説明する０先ず第４図の送信部につい
て説明する。マイクロホン（１）から入力された音声は
、Ａ／Ｄ変換器（２）で離散データ系列（以後入力せ声
波形（３）と呼ぶ）に変換される。ピッチ周期抽出手段
員は例えば自己相関分析によシ入力音声波形（３）のピ
ッチ周期（６）を求める。時間軸圧縮手段−はピッチ周
期抽出手段翰で抽出したピッチ周期（６）に基き、連続
する２つのピッチ区間毎に入力音声波形（３）を時間軸
上で１／２に圧縮し、圧縮波形（５）を求める。ここで
時間軸圧縮手段α傷で行われる時間軸圧縮の詳細を第５
図に示す０第５図において、（５ａ）４−ｉ連続する２
ピッチ区間の入力音声波形０　（ｎ）でろ、り（Ｏｂ）
は入力音声波形Ｓ　（ｎ）を時間軸上で１／２圧縮した
圧縮波形５Ｃ（ｎ）である。今入力音声波形８　（ｎ）
のピッチ周期をＴｐとすると、圧縮波形５ｃ（ｎ）は次
の山、（２）式によって求める０３ｃ（ｎ）＝　（１−
Ｗｃ（ｎ）　）−５（ｎ）＋　Ｗｃ（ｎ）　・Ｓ　（ｎ
　＋　Ｔｐ　）　　　ｔｌ）ｗｃ（ｎ）＝（ｎ　−１／
２　）／Ｔｐ　　　　　　　　　　　ｔ２）ｎ＝１，２
．＊ｚｊＴ。Next, the operation will be explained.First, the transmitter shown in FIG. 4 will be explained. Voice input from the microphone (1) is converted into a discrete data series (hereinafter referred to as input voice waveform (3)) by an A/D converter (2). The pitch period extraction means obtains the pitch period (6) of the input speech waveform (3), for example, by autocorrelation analysis. The time axis compression means compresses the input speech waveform (3) to 1/2 on the time axis for every two consecutive pitch sections based on the pitch period (6) extracted by the pitch period extraction means Kan, and generates a compressed waveform. Find (5). Here, the details of the time axis compression performed by the time axis compression means α scratch are explained in the fifth section.
In Figure 5, (5a) 4-i consecutive 2
Input audio waveform of pitch section 0 (n) (Ob)
is a compressed waveform 5C(n) obtained by compressing the input audio waveform S (n) by 1/2 on the time axis. Now input audio waveform 8 (n)
If the pitch period of is Tp, the compressed waveform 5c(n) is the following peak, 03c(n)=(1-
Wc(n) )-5(n)+ Wc(n) ・S(n
+ Tp ) tl) wc(n) = (n −1/
2)/Tp t2)n=1,2
．． *zjT.

ここで（ＩＩ　、　（２）式のＷＣ（ｎ）は入力音声波
形５（ｎ）にかける窓関数でおり、第５図は入力音声波
形Ｓ　（ｎ）に窓関数ｗｃ（ｎ）をかけ圧縮波形５ｃ（
ｎ）を求める手順を示している。Here, (II), WC(n) in equation (2) is a window function that is applied to the input audio waveform 5(n), and in Figure 5, the input audio waveform S (n) is multiplied by the window function wc(n) and compressed. Waveform 5c (
The procedure for finding n) is shown.

今、第４図の時間軸圧縮手段ａ値で上記の方法により求
まった圧縮波形（５）は符号化手段（７）で例えば１サ
ンプル毎にパルス符号化され、またピッチ周期（６）も
ピッチ符号化手段（８）で符号化される。マルチプレク
サ（９）は符号化された圧縮波形清報とピッチ周期清報
を多重化して伝送路（Ｉｎに出力する。Now, the compressed waveform (5) obtained by the above method using the time axis compression means a value shown in FIG. It is encoded by the encoding means (8). The multiplexer (9) multiplexes the encoded compressed waveform signal and the pitch period signal and outputs the multiplexed signal to the transmission line (In).

次に第４図の受信部について説明する。伝送路Ｑｌを伝
搬した符号化された圧縮波形ｉ＊＝ｉとピッチ周期清報
は、デマルチプレクサｔ）１）で分離される。Next, the receiving section shown in FIG. 4 will be explained. The encoded compressed waveform i*=i and the pitch period information propagated through the transmission path Ql are separated by a demultiplexer t)1).

圧縮波形清報は復号化手段（２）で符号化手段（７）に
対応した復号処りで復号化され復号後圧縮波形α→とな
る。同様にピッチ周期清報もピッチ復号化手段０で復号
化される。時間軸伸長手段１２１）はピッチ復号化手段
ｑｇで復号化されたピッチ周期（（ｉｌに基き、このピ
ッチ周期単位に復号１８Ｅ縮彼形Ｃ−４）を時間軸上で
２倍に伸長し、伸長波形ＯＱを求める。時間軸伸長手段
１２］Ｊで行われる時間軸伸長の計、！ｔＩを第６図に
示す。第６図において、（６ａ）は連続する３ピッチ区
間（ｒ−１，ｒ、ｒ＋１．番目）の復号後圧縮波形合ｃ
（ｎ）であり、（６ｂ）は復号後圧縮波形◇ｃ（ｎ）の
ｒ番目のピッチ区間を２倍に伸長した伸長波形◇。（ｎ
）である。今、復号後圧縮波形介ｃ（ｎ）においてｒ番
目のピッチ区間の先頭地点をｎ＝１とすると、伸長波形
ｅ　、（ｎ）は次の（３）、（４）式によって求める。The compressed waveform information is decoded by the decoding means (2) in a decoding process corresponding to the encoding means (7), and becomes a compressed waveform α→ after decoding. Similarly, the pitch period information is also decoded by the pitch decoding means 0. The time axis expansion means 121) expands the pitch period decoded by the pitch decoding means qg ((based on il, decoded 18E contraction C-4 in pitch period units) on the time axis, Determine the expanded waveform OQ. The total time axis expansion performed by the time axis expansion means 12]J, !tI, is shown in FIG. 6. In FIG. r, r+1.th) decoded compressed waveform sum c
(n), and (6b) is the compressed waveform after decoding ◇ An expanded waveform ◇ in which the r-th pitch section of c(n) is expanded by twice. (n
). Now, if the starting point of the r-th pitch section in the decoded compressed waveform c(n) is set to n=1, the expanded waveforms e and (n) are determined by the following equations (3) and (4).

Ｇ、（ｎ）＝　＜１−　Ｗｅ（ｎ）　＞檜ｃ（ｎ）＋　
ｗｅ（ｎ）　４ｃ（ｎ　−Ｔｐ＞　　　（３）Ｗｅ　（
ｎ）＝（ｎ　−１／　２　）／２Ｔ　ｐ　　　　　　　
　　　　　　　（４）ｎ＝１，２．　　・・・・２Ｔｐ（３）　、　（４）式のｗｅ（ｎ）は復号後王都波形◇
ｃ（ｎ）にかける窓関数であり、第６図は復号後圧縮波
形５ｃ（ｎ）に窓関数Ｗ、（ｎ）をかけ、伸長波形Ｓ、
←）を求める手順を示している。G, (n) = <1- We(n) > Cypress c(n)+
we(n) 4c(n −Tp> (3) We (
n)=(n-1/2)/2T p
(4) n=1,2. ...2Tp (3), we(n) in equations (4) is the royal capital waveform after decoding◇
It is a window function that is applied to c(n), and in FIG.
←).

第４図の時間軸伸長手段３１）で上記の方法によシ求ま
った伸長波形αＱは、Ｄ／Ａ変換器０ηでアナログ信号
に変換され、スピーカＱ８！から出力される０〔発明が
解決しようとする課題〕従来の音声時間軸圧縮符号化装置は以上の様に構成され
ておシ、その復号化部の時間軸伸長手段は復号？１ｋＥ
Ｅ縮波形を伝送路より伝送されたピッチ周期の２倍の長
さに逐次伸長するので、伝送路で発生した伝送誤りによ
って一凌ピツチ周期が誤伝送されると、そのピッチ周期
区間のみに限らずそれ以後時間軸伸長手段で伸長される
伸長波形総てに誤りが波及し、音質に著しい劣化を生ず
るという課題があった。The expanded waveform αQ obtained by the above method by the time axis expansion means 31) in FIG. [Problem to be Solved by the Invention] The conventional audio time axis compression encoding device is configured as described above, and the time axis expansion means of the decoding section is used for decoding. 1kE
Since the E-condensed waveform is sequentially expanded to twice the length of the pitch period transmitted from the transmission line, if one pitch period is erroneously transmitted due to a transmission error that occurs on the transmission line, it is limited to only that pitch period section. There is a problem in that the error spreads to all expanded waveforms that are subsequently expanded by the time axis expansion means, resulting in significant deterioration in sound quality.

また、符号化、復号化手段で行う音声符号化復号化処理
を、ある一定長の分析フレーム単位で行う場合、この分
析フレーム区間とピッチ周期単位で行う時間軸圧縮伸長
の区間の整合が取れないため、音声符号化復号化処理と
時１′＆ｌＪ＠圧縮伸長処理を個別の区間で別々に実行
しなければならず処理が複雑で処理遅延を生ずる可能性
があるという課題がおった。Furthermore, when the audio encoding/decoding process performed by the encoding/decoding means is performed in units of analysis frames of a certain length, the analysis frame interval and the interval of time axis compression/expansion performed in pitch cycle units cannot be matched. Therefore, a problem arises in that the audio encoding/decoding process and the compression/decompression process must be performed separately in separate sections, making the process complicated and potentially causing a processing delay.

[Means to solve the problem]

この発明に係る音声時間＠圧縮符号化装置は、一定長の
分析フレーム毎にその分析フレームの範囲内でピッチ周
期に基く音声波形の時間軸圧縮を行う時間軸圧縮手段と
、この分析フレーム毎に時間軸圧縮された波形を元の分
析フレーム長まで時間軸伸長する時間軸伸長手段を備え
たものである。The audio time @compression encoding device according to the present invention includes a time axis compression means for compressing the time axis of an audio waveform based on a pitch period within the range of the analysis frame for each analysis frame of a fixed length; The apparatus is equipped with a time axis expansion means for expanding the time axis of the time axis compressed waveform to the original analysis frame length.

[Effect]

この発明に訃ける時間軸圧縮手段は、一定長の分析フレ
ーム毎に、その分析フレーム内で連続して存在する数ピ
ツチ周期分の音声波形について、連接しな２ピッチ周期
区間毎に両ピッチ周期区間の波形の類似性を利用して時
間軸上で圧縮する。The time axis compression means according to the present invention, for each analysis frame of a certain length, converts the audio waveform of several pitch periods continuously existing within the analysis frame into two pitch periods in every two non-contiguous pitch period sections. Compression is performed on the time axis using the similarity of the waveforms of the sections.

またこの発明における時間軸伸長手段は、前記時間軸圧
縮手段で分析フレーム毎に時間軸圧縮された圧縮音声波
形を、前記ピッチ周側に基き時間軸上で伸長し元の分析
フレーム長の音声波形にする。Further, the time axis expansion means in the present invention expands the compressed audio waveform, which has been time axis compressed for each analysis frame by the time axis compression means, on the time axis based on the pitch circumference side to form the audio waveform of the original analysis frame length. Make it.

〔Example〕

第１図はこの発明の一実施例を示す構成図であり、！１
１〜（３１、（５）〜０４）　、　Ｍ〜Ｊ１１９は第４
図における従来装置と同一のものであり、説明を省略す
る０第１図において（４）はフレーム内時間軸圧縮手段
、（ト）はフレーム内時間軸伸長手段である。FIG. 1 is a block diagram showing an embodiment of the present invention. 1
1-(31, (5)-04), M-J119 is the 4th
In FIG. 1, (4) is an intra-frame time axis compression means, and (g) is an intra-frame time axis expansion means, which is the same as the conventional apparatus shown in the figure, and the explanation thereof will be omitted.

第１図でフレーム内時間軸圧縮手段（４）は入力音声波
形（３）を一定長の分析フレーム毎に区切シその分析フ
レーム内のピッチ周期を例えば自己相関分析によって求
め、このピッチ周期を利用してその分析フレーム内の音
声波形を時間軸上で圧縮し、圧縮波形（５）を求める。In Fig. 1, the intra-frame time axis compression means (4) divides the input speech waveform (3) into analysis frames of a certain length, determines the pitch period within each analysis frame by, for example, autocorrelation analysis, and utilizes this pitch period. Then, the audio waveform within the analysis frame is compressed on the time axis to obtain a compressed waveform (5).

また、攻めたピッチ周期（６）はピッチ符号化手段（８
）で符号化される。Furthermore, the aggressive pitch period (6) is determined by the pitch encoding means (8).
) is encoded.

フレーム内時間軸伸長手段（へ）は、分析フレーム毎に
、復号後圧縮波形０．０をピッチ周期（６）を利用して
元の分析フレーム長を持つ音声波形に伸長する。For each analysis frame, the intra-frame time axis expansion means expands the decoded compressed waveform 0.0 into an audio waveform having the original analysis frame length using the pitch period (6).

第２図、第３図に本実施例におけるフレーム内時間軸Ｅ
ＥＭ手段（４）とフレーム内時間軸伸長手段（至）が実
行する分析フレーム毎の時間軸圧縮伸長の詳細を示す。Figures 2 and 3 show the intra-frame time axis E in this embodiment.
The details of the time axis compression/expansion for each analysis frame executed by the EM means (4) and the intra-frame time axis expansion means (to) are shown.

第２図は、入力音声波形のピッチ周期Ｔ、が分析フレー
ム長Ｎに対してＮ／４（Ｔｐ≦Ｎ／３であるときの時間
軸圧縮伸長を示している。ここで分析フレーム内での晩
間軸ＥＥ縮が行える条件として分析フレーム長Ｎが富に
最大ピッチ周期１°、（ｍａｘ）の２倍以上であること
が必賛である。Figure 2 shows time axis compression/expansion when the pitch period T of the input speech waveform is N/4 (Tp≦N/3) with respect to the analysis frame length N. As a condition for performing late-night axis EE reduction, it is essential that the analysis frame length N is at least twice the maximum pitch period of 1°, (max).

フレーム内時間軸圧縮＋段（４）は、入力音声波形（２
ａ）の第１番目の２ピッチ周期区間を前記＋１１　、　
ｔ２）式によって１ピッチ周期分に時間＠圧縮する０分
析フレームの残りの音声波形には２ピッチ周期区間分の
長さがないのでこの区間の時間軸圧縮は行わす、時間軸
圧縮した音声波形にそのままつぎ合わせでＦＥ縮ｉ皮形
（２ｂ）を求める。ここで入力音声波形は（Ｎ　−Ｔ、
）／’Ｈに圧縮される。The intra-frame time axis compression + stage (4) uses the input audio waveform (2
The first 2-pitch period section of a) is +11 above,
t2) The remaining audio waveform of the 0 analysis frame whose time is compressed to 1 pitch period by the formula does not have the length of 2 pitch period sections, so time axis compression of this section is performed.Time axis compressed audio waveform The FE shrunken skin shape (2b) is obtained by piecing together the two as they are. Here, the input audio waveform is (N − T,
)/'H.

フレーム内時間軸伸長手段αｅは復号後ＦＥＩＩ波形（
２Ｃ）の第１番目の１ピッチ間期分の音声波形を前記（
３１、（４）　＜によって２ピッチ周期分に伸長し、復
号後ＩＥ圧縮形の残りの区間を七のままつぎ合わせて、
元の分析フレーム長Ｎを持つ伸長波形（２（１）を求め
る。The intra-frame time axis expansion means αe generates the decoded FEII waveform (
The audio waveform for the first 1 pitch period of 2C) is
31, (4) Expand it to 2 pitch periods by
Find the expanded waveform (2(1)) with the original analysis frame length N.

第３図は、入力音声波形のピッチ周期Ｔｐが分析フレー
ム長Ｎに対してＮ１５　＜Ｔｐ　＜−であるときの番時間＠圧縮伸長を示している。FIG. 3 shows the cycle time@compression/expansion when the pitch period Tp of the input speech waveform satisfies N15 <Tp <- with respect to the analysis frame length N.

この場合フレーム内時間軸圧縮手段（４）は入力音声波
形（３ａ）の第１番目と第２番目の２ピッチ周期区間を
それぞれ前記＋１．１　、　！２）式によって１ピッチ
周期分に時間軸圧縮する。第２図の場合と同様に分析フ
レームの残りの音声波形については時間軸圧縮を行わず
、時間軸圧縮した音声波形にそのままつぎ合わせて圧縮
波形（３ｂ）を得る。ここで入力音声波形は（Ｎ　−２
Ｔｐ）／Ｎ　Ｋ　圧縮される。In this case, the intra-frame time axis compression means (4) converts the first and second 2-pitch period sections of the input audio waveform (3a) into the above +1.1, !, respectively. 2) The time axis is compressed to one pitch period using the formula. As in the case of FIG. 2, the remaining audio waveforms of the analysis frame are not subjected to time axis compression, but are simply spliced to the time axis compressed audio waveform to obtain a compressed waveform (3b). Here, the input audio waveform is (N −2
Tp)/N K compressed.

フレーム内時間軸伸長手段叫は、先づ復号後圧縮波形（
３Ｃ）の第１番目のピッチ周期区間（図（３ｃ）の■）
を前記（３）、　（４）式によって２倍に伸長する。The intra-frame time axis expansion means first decodes the compressed waveform (
3C) first pitch period section (■ in Figure (3c))
is expanded twice using equations (3) and (4) above.

次に第２番目のピッチ周期区間（図（３Ｃ）の■）を、
存在しない第３番目のピッチ周期区間を第２番目のピッ
チ周期区間で置換した置換波形（３ｄ）を用いて２倍に
伸長する。そして、上記で求まった復号後音声波形（３
Ｃ）の第１番目と第２番目のピッチ周期区間の２倍伸長
波形と、復号後音声波形（３Ｃ）の残りの区間をつなぎ
合わせ、伸長波形（３ｅ）を求める０以上第２図。第３図に示した様な手１頃で、一定長の分
析フレーム内でピッチ周期を利用した時間軸ＦＥ、縮伸
長を行う。Next, the second pitch period section (■ in Figure (3C)) is
A replacement waveform (3d) in which the non-existent third pitch period section is replaced with the second pitch period section is used to expand the waveform twice. Then, the decoded audio waveform (3
The doubled expanded waveform of the first and second pitch period sections in C) and the remaining section of the decoded audio waveform (3C) are connected to obtain the expanded waveform (3e). At the beginning, as shown in FIG. 3, time axis FE and compression/expansion are performed using the pitch period within an analysis frame of a constant length.

次に本発明によるフレーム内時間軸圧縮手段は時間軸圧
縮率がピッチ周期によって変化するので、この時間軸圧
縮率の変化について考える０今、分析フレーム長内で存
在する２倍のピッチ周期区間の数をｋとし、分析フレー
ム長Ｎ、ピッチ周期Ｔ、とするととなる。ｋは分析フレーム内で２ピッチ周期区間単位で
行う時間軸圧縮の回数であり、ＴｐＸｋが実際に時間軸
圧縮される長さとなる。よって時間軸圧縮率Ｒはとなる。Next, since the time axis compression rate of the in-frame time axis compression means according to the present invention changes depending on the pitch period, let us consider the change in this time axis compression rate. Let the number be k, the analysis frame length N, and the pitch period T. k is the number of times of time axis compression performed in units of two pitch period sections within the analysis frame, and TpXk is the length of time axis compression actually performed. Therefore, the time axis compression ratio R is as follows.

ｋ＝１．２．３のときのピッチ周期Ｔ、と時間軸圧縮率
の存在範囲を次に示す０（７）式より時間軸圧縮４Ｒの存在範囲は次式で定式化
できる。The range in which the pitch period T and the time axis compression rate exist when k=1.2.3 is shown below. From equation (7), the range in which the time axis compression 4R exists can be formulated as the following equation.

でありその時の時間軸圧縮率Ｒはである。従って本発明によるフレーム内時間軸圧縮手段
は、分析フレーム内の入力音声波形を（９）式で示した
時間軸圧縮率の範囲内で時間軸圧縮することができる。And the time axis compression ratio R at that time is. Therefore, the intra-frame time axis compression means according to the present invention can time axis compress the input audio waveform within the analysis frame within the range of the time axis compression ratio shown by equation (9).

なお、上記実施例では符号化手段（力はパルス符号化を
行うものとしたが、他のいかなる音声符号化法を行うも
のとしても良く、特に一定長の分析フレーム単位に符号
化を行う方式を用いる場合、フレーム内時間軸圧縮手段
で設定する分析フレームとこの符号化方式の分析フレー
ムとを同一にしても良い。このとき、復号化手段α〔で
の復号化法は符号化法に順するものとする０〔発明の効果〕以上のようにこの発明によれば、ピッチ周期に基づいた
音声波形の時間軸圧縮伸長を一定長の分析フレーム範囲
内で行えるので、伝送路での伝送誤りである分析フレー
ムのピッチ周期が誤って伝送されてもこの誤りによる伸
長波形への直接の影響をその分析フレーム内におさえる
ことができ、しかも符号化手段が一定長の分析フレーム
毎に音声符号化を行うときは、この分析フレームと前記
時間軸圧縮伸長の分析フレームを同一化し処理を簡単化
できるという効果がある。In addition, in the above embodiment, the encoding means (power) performs pulse encoding, but any other audio encoding method may be used. When used, the analysis frame set by the intra-frame time axis compression means and the analysis frame of this encoding method may be the same.In this case, the decoding method in the decoding means α follows the encoding method. [Effects of the Invention] As described above, according to the present invention, since the time axis compression/expansion of the audio waveform based on the pitch period can be performed within the analysis frame range of a certain length, transmission errors on the transmission path can be prevented. Even if the pitch period of a certain analysis frame is erroneously transmitted, the direct effect of this error on the expanded waveform can be suppressed within that analysis frame, and moreover, the encoding means performs voice encoding for each analysis frame of a certain length. When this is done, this analysis frame and the analysis frame of the time axis compression/expansion are made the same, thereby simplifying the processing.

[Brief explanation of drawings]

第１図はこの発明の音声時間軸圧縮符号化装置の一実施
例を示す全体構成図、第２図、第３図はこの発明の一実
施例での処理を示す波形説明図、第４図は従来の音声時
間軸圧縮符号化装置の一例を示す全体構成図、第５図は
従来の音声時間軸圧縮符号化装置の時間軸圧縮処理を示
す説明図、第６図は従来の音声時間軸圧縮符号化装置の
時間軸伸長処理を示す説明図である０図において、（１）はマイクロホン、（２）はＡ／Ｄ変
換ｉ、（３）は入力音声波形、（４）はフレーム内時間
軸圧縮手段、（５）は圧縮波形、（６）はピッチ周期、
（７）は符号化手段、（８）はピッチ符号化手段、（９
）はマルチプレクサ、Ｑｌは伝送路、（１１）はデマル
チプレクサ、（イ）は復号化手段、（至）はピッチ復号
化手段、α→は復号後圧細波形、（至）はフレーム内時
間軸伸長手段、ａ・は伸長波形、Ｑ′ｈはＤ／Ａ変換器
、（至）はスピーカ、（２ａ）は入力音声波形、（２ｂ
）は圧縮波形、（２０）は復号後圧細波形、（２ｄ）は
伸長波形、（３ａ）は入力音声波形、（３ｂ）は圧縮波
形、（３Ｃ）は復号後圧細波形、（３ｄ）は置換波形、
（３ｅ）は伸長波形、α９は時間軸圧縮手段、（１）は
ピッチ周期抽出手段、＠Ｅは時間軸伸長手段、（５ａ）
は入力音声波形、（５ｂ）は圧縮波形、（６ａ）は復号
後圧細波形、（６１））は伸長波形である。なお、各図中の同一符号は同−又は相当部分を示す。FIG. 1 is an overall configuration diagram showing an embodiment of the audio time axis compression encoding device of the present invention, FIGS. 2 and 3 are waveform explanatory diagrams showing processing in an embodiment of the invention, and FIG. 4 is an overall configuration diagram showing an example of a conventional audio time axis compression encoding device, FIG. 5 is an explanatory diagram showing the time axis compression process of the conventional audio time axis compression encoding device, and FIG. 0 is an explanatory diagram showing the time axis expansion process of the compression encoding device. In the figure, (1) is the microphone, (2) is the A/D conversion i, (3) is the input audio waveform, and (4) is the time within the frame. Axial compression means, (5) compression waveform, (6) pitch period,
(7) is an encoding means, (8) is a pitch encoding means, (9
) is the multiplexer, Ql is the transmission line, (11) is the demultiplexer, (a) is the decoding means, (to) is the pitch decoding means, α→ is the compressed waveform after decoding, (to) is the time axis within the frame Decompression means, a. is the decompressed waveform, Q'h is the D/A converter, (to) is the speaker, (2a) is the input audio waveform, (2b
) is the compressed waveform, (20) is the compressed waveform after decoding, (2d) is the expanded waveform, (3a) is the input audio waveform, (3b) is the compressed waveform, (3C) is the compressed waveform after decoding, (3d) is the replacement waveform,
(3e) is an expanded waveform, α9 is a time axis compression means, (1) is a pitch period extraction means, @E is a time axis expansion means, (5a)
is an input audio waveform, (5b) is a compressed waveform, (6a) is a compressed waveform after decoding, and (61) is an expanded waveform. Note that the same reference numerals in each figure indicate the same or corresponding parts.

Claims

[Claims]

In an audio time-base compression/expansion device that consists of a transmitter that compresses and encodes an input audio waveform on the time axis, and a receiver that decodes and expands the compressed-encoded audio waveform on the time axis, the input audio The transmitting unit includes an intra-frame time axis compression means for compressing the waveform on the time axis within the range of the analysis frame for each analysis frame of a fixed length by using the pitch period in this analysis frame of the input audio waveform, The reception section is characterized by comprising an intra-frame time axis expansion means for expanding the audio waveform time-axis compressed by the time axis compression means into an audio waveform having the analysis frame length based on the pitch period for each analysis frame. Audio time axis compression encoding device.