JP5031006B2

JP5031006B2 - Scalable decoding apparatus and scalable decoding method

Info

Publication number: JP5031006B2
Application number: JP2009204962A
Authority: JP
Inventors: 正浩押切
Original assignee: Panasonic Corp; Matsushita Electric Industrial Co Ltd
Current assignee: Panasonic Corp; Panasonic Holdings Corp
Priority date: 2009-09-04
Filing date: 2009-09-04
Publication date: 2012-09-19
Anticipated expiration: 2023-09-30
Also published as: JP2010020333A

Abstract

PROBLEM TO BE SOLVED: To provide a scalable coder and a scalable decoder that reduce circuit scale and the amount of computation required for coding. SOLUTION: A frequency area conversion part 103 performs frequency analysis, with an analysis length of 2×Na, on a signal sampled at a sampling rate Fx and calculates a first spectrum S1(k) (0≤k<Na). A band extension part 104 extends the effective frequency band of the first spectrum S1(k) to 0≤k<Nb so that a new spectrum can be assigned at and above frequency k=Na. An extension spectrum-imparting part 105 assigns an externally supplied extension spectrum S1'(k) (Na≤k<Nb) to the extended frequency band. A spectrum information specification part 106 outputs, as coding codes, the information required to specify the extension spectrum S1'(k) within the spectrum supplied by the extension spectrum-imparting part 105. COPYRIGHT: (C)2010,JPO&INPIT

Description

The present invention relates to a scalable decoding device and a scalable decoding method.

Today, many different sampling rates are in use: 44.1 kHz for compact discs; 32 kHz or 48 kHz for DAT (Digital Audio Tape), digital VTR, and satellite television; and 48 kHz or 96 kHz for DVD audio signals. Therefore, when the internal sampling rate of the decoder in a playback or recording device differs from the sampling rate of the data to be decoded, the sampling rate must be converted. A conventional device that performs this sampling rate conversion is disclosed, for example, in Patent Document 1.

In recent years, transmission capacity in networks has improved greatly with the spread of ADSL (Asymmetric Digital Subscriber Line) and optical fiber in wired systems and the practical deployment of W-CDMA (Wideband - Code Division Multiple Access) and wireless LAN in wireless systems. Accordingly, voice communication is now expected to deliver a greater sense of presence and higher quality by widening the signal band.

At present, typical methods for encoding narrowband signals include G.726 and G.729, standardized by the ITU (International Telecommunication Union). Typical methods for encoding wideband signals include G.722 and G.722.1 of the ITU-T (International Telecommunication Union Telecommunication Standardization Sector) and AMR-WB of 3GPP (The 3rd Generation Partnership Project).

More recently, speech coding methods have been required to provide a scalable function so that they can be used in a variety of network environments such as IP (Internet Protocol) networks. A scalable function is the ability to decode a speech signal even from only part of the encoded code. With this function, a high-quality speech signal is decoded from the entire encoded code over a channel in good condition, while over a channel in poor condition only part of the encoded code is transmitted, which reduces the frequency of packet loss. It also brings benefits such as more efficient use of network resources in multipoint communication.
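As a toy illustration of the scalable function, and not the coding scheme of any standard named above, the following sketch encodes samples in two layers and decodes either from the first layer alone or from both. The function names, the step sizes, and the use of uniform scalar quantization are all assumptions made for illustration.

```python
def encode_layers(samples, coarse_step=0.5, fine_step=0.05):
    """Hypothetical two-layer encoder: layer 1 is coarse, layer 2 codes the residual."""
    layer1 = [round(s / coarse_step) for s in samples]
    decoded1 = [q * coarse_step for q in layer1]
    # The second layer encodes only the error left by the first layer.
    layer2 = [round((s - d) / fine_step) for s, d in zip(samples, decoded1)]
    return layer1, layer2


def decode_layers(layer1, layer2=None, coarse_step=0.5, fine_step=0.05):
    """Decode from layer 1 only, or refine with layer 2 when it was received."""
    out = [q * coarse_step for q in layer1]
    if layer2 is not None:
        out = [o + q * fine_step for o, q in zip(out, layer2)]
    return out
```

Decoding with `layer2=None` corresponds to receiving only part of the encoded code: the result is still a usable signal, only with a larger quantization error.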

To realize a high-quality coding method with this scalable function, coding must be performed using signals of various sampling rates. For example, a signal with a sampling rate of 8 kHz is encoded with a method such as G.726 or G.729 standardized by the ITU-T, and the resulting error signal is further encoded in the 16 kHz sampling-rate domain; this improves quality by extending the signal band and provides scalability.

FIG. 22 is a block diagram showing a typical configuration of a conventional coding apparatus that performs scalable coding. In this example, the number of layers is N = 3, the sampling rate of the signal handled in layer n is denoted FS(n), and FS(1) = 16 kHz, FS(2) = 24 kHz, and FS(3) = 32 kHz.

An acoustic signal (speech signal, audio signal, etc.) input to the downsampling unit 12 via the input terminal 11 is downsampled from a sampling frequency of 32 kHz to 16 kHz and supplied to the first layer encoding unit 13. The first layer encoding unit 13 determines a first encoded code so that the perceptual distortion between the input acoustic signal and the decoded signal generated after encoding is minimized. The first encoded code is sent to the multiplexing unit 26 and to the first layer decoding unit 14. The first layer decoding unit 14 generates a first layer decoded signal using the first encoded code. The upsampling unit 15 upsamples the first layer decoded signal from 16 kHz to 24 kHz and supplies it to the subtractor 18 and the adder 21.

The acoustic signal input to the downsampling unit 16 via the input terminal 11 is downsampled from 32 kHz to 24 kHz and supplied to the delay unit 17. The delay unit 17 delays the downsampled signal by a predetermined time. The subtractor 18 takes the difference between the output signal of the delay unit 17 and the output signal of the upsampling unit 15 to generate a second layer residual signal, which is supplied to the second layer encoding unit 19. The second layer encoding unit 19 encodes the second layer residual signal so as to improve perceptual quality, determines a second encoded code, and supplies it to the multiplexing unit 26 and the second layer decoding unit 20. The second layer decoding unit 20 performs decoding using the second encoded code and generates a second layer decoded residual signal. The adder 21 sums the first layer decoded signal and the second layer decoded residual signal to generate a second layer decoded signal. The upsampling unit 22 upsamples the second layer decoded signal from 24 kHz to 32 kHz and supplies it to the subtractor 24.

Further, the acoustic signal input to the delay unit 23 via the input terminal 11 is delayed by a predetermined time and supplied to the subtractor 24. The subtractor 24 takes the difference between the output signal of the delay unit 23 and the output signal of the upsampling unit 22 to generate a third layer residual signal, which is supplied to the third layer encoding unit 25. The third layer encoding unit 25 encodes the third layer residual signal so as to improve perceptual quality, determines a third encoded code, and supplies it to the multiplexing unit 26. The multiplexing unit 26 multiplexes the encoded codes obtained from the first layer encoding unit 13, the second layer encoding unit 19, and the third layer encoding unit 25 and outputs the result via the output terminal 27.

Patent Document 1: JP 2000-68948 A

However, a conventional coding apparatus that realizes a scalable function on the basis of a time-domain coding method such as G.726, G.729, or AMR-WB, as described above, must convert the sampling rates of various signals (the example above requires the downsampling unit 12, the upsampling unit 15, the downsampling unit 16, and the upsampling unit 22). This complicates the configuration of the coding apparatus and increases the amount of computation required for coding. The circuit configuration of the decoding apparatus that decodes the signal encoded by this coding apparatus also becomes complicated, and the amount of computation required for decoding increases.

The present invention has been made in view of these points, and its object is to provide a scalable decoding device and a scalable decoding method that can reduce circuit scale and the amount of computation.

The scalable decoding device of the present invention comprises: receiving means for receiving information generated by encoding a speech signal or an audio signal with a scalable encoding device, the information including first encoded information relating to a first band, which is a band of the speech signal or the audio signal below a predetermined frequency, and second encoded information relating to a second band, which is a band of the audio signal above a predetermined frequency; first decoding means for decoding the first encoded information to generate a time-domain signal at a first sampling rate corresponding to the first band of the speech signal or the audio signal; and second decoding means for decoding the second encoded information in the frequency domain to generate a decoded spectrum of the second band and for using the decoded spectrum of the second band to generate a decoded signal at a third sampling rate, obtained by sampling-rate conversion from a predetermined second sampling rate that is higher than the first sampling rate. The second decoding means comprises: first transform means for obtaining a spectrum of the first band, by frequency-domain transform, from the time-domain signal at the first sampling rate obtained by the first decoding means; replication means for replicating the spectrum at a specific position in the spectrum of the first band; spectrum generation means for generating, using the second encoded information and the replicated spectrum, the decoded spectrum of the second band, which extends the bandwidth of the decoded spectrum of the first band, and for adding the decoded spectrum of the second band to the decoded spectrum of the first band to generate an extended decoded spectrum; and time-domain signal generation means for obtaining a spectrum of a predetermined band either by inserting zeros into a first high-band portion, which is adjacent to the maximum frequency of the extended decoded spectrum and lies outside the extended decoded spectrum, or by deleting a second high-band portion, which is adjacent to the maximum frequency and lies inside the extended decoded spectrum, and for generating from the spectrum of the predetermined band, by time-domain transform, a time-domain signal at the third sampling rate as the decoded signal.

The scalable decoding method of the present invention comprises: a receiving step of receiving information generated by encoding a speech signal or an audio signal with a scalable encoding device, the information including first encoded information relating to a first band, which is a band of the speech signal or the audio signal below a predetermined frequency, and second encoded information relating to a second band, which is a band of the audio signal above a predetermined frequency; a first decoding step of decoding the first encoded information to generate a time-domain signal at a first sampling rate corresponding to the first band of the speech signal or the audio signal; and a second decoding step of decoding the second encoded information in the frequency domain to generate a decoded spectrum of the second band and of using the decoded spectrum of the second band to generate a decoded signal at a third sampling rate, obtained by sampling-rate conversion from a predetermined second sampling rate that is higher than the first sampling rate. The second decoding step comprises: a first transform step of obtaining a spectrum of the first band, by frequency-domain transform, from the time-domain signal at the first sampling rate obtained in the first decoding step; a replication step of replicating the spectrum at a specific position in the spectrum of the first band; a spectrum generation step of generating, using the second encoded information and the replicated spectrum, the decoded spectrum of the second band, which extends the bandwidth of the decoded spectrum of the first band, and of adding the decoded spectrum of the second band to the decoded spectrum of the first band to generate an extended decoded spectrum; and a time-domain signal generation step of obtaining a spectrum of a predetermined band either by inserting zeros into a first high-band portion, which is adjacent to the maximum frequency of the extended decoded spectrum and lies outside the extended decoded spectrum, or by deleting a second high-band portion, which is adjacent to the maximum frequency and lies inside the extended decoded spectrum, and of generating from the spectrum of the predetermined band, by time-domain transform, a time-domain signal at the third sampling rate as the decoded signal.

According to the present invention, circuit scale can be reduced and the amount of computation can also be reduced.

FIG. 1 is a block diagram showing the main configuration of the spectrum coding apparatus according to Embodiment 1.
FIG. 2 shows (a) the first spectrum and (b) the spectrum after the effective frequency band has been extended.
FIG. 3 is a diagram explaining, in principle, the effect of the process of extending the effective frequency band of a spectrum.
FIG. 4 is a block diagram showing the main configuration of the radio transmission apparatus according to Embodiment 1.
FIG. 5 is a block diagram showing the internal configuration of the encoding apparatus according to Embodiment 1.
FIG. 6 is a block diagram showing the internal configuration of the spectrum encoding unit according to Embodiment 1.
FIG. 7 is a block diagram showing a variation of the spectrum encoding unit according to Embodiment 1.
FIG. 8 is a block diagram showing the main configuration of the radio reception apparatus according to Embodiment 1.
FIG. 9 is a block diagram showing the internal configuration of the decoding apparatus according to Embodiment 1.
FIG. 10 is a block diagram showing the internal configuration of the spectrum decoding unit according to Embodiment 1.
FIG. 11 is a diagram explaining the processing performed in the band extension unit according to Embodiment 1.
FIG. 12 is a diagram showing how a decoded signal is generated from the spectrum through the processing in the combining unit and the time domain conversion unit according to Embodiment 1.
FIG. 13 shows (a) a block diagram of the main configuration of the transmitting side when the encoding apparatus according to Embodiment 1 is applied to a wired communication system, and (b) a block diagram of the main configuration of the receiving side when the decoding apparatus according to Embodiment 1 is applied to a wired communication system.
FIG. 14 is a block diagram showing the main configuration of the decoding apparatus according to Embodiment 2.
FIG. 15 is a block diagram showing the internal configuration of the spectrum decoding unit according to Embodiment 2.
FIG. 16 is a diagram explaining the processing of the correction unit according to Embodiment 2 in more detail.
FIG. 17 is a diagram explaining the processing of the correction unit according to Embodiment 2 in more detail.
FIG. 18 is a diagram further explaining the operation of the spectrum decoding unit according to Embodiment 2.
FIG. 19 is a diagram further explaining the operation of the spectrum decoding unit according to Embodiment 2.
FIG. 20 is a diagram showing the main configuration of the communication system according to Embodiment 3.
FIG. 21 is a diagram showing the main configuration of the communication system according to Embodiment 4.
FIG. 22 is a block diagram showing a typical configuration of a conventional coding apparatus that performs scalable coding.

The gist of the present invention is that, instead of performing sampling-rate conversion (in particular, upsampling) on an input signal in the time domain, the effective frequency band of the spectrum is extended in the frequency domain, thereby obtaining a signal equivalent to one obtained by upsampling the time-domain signal.

Hereinafter, embodiments of the present invention will be described in detail with reference to the accompanying drawings.

(Embodiment 1)
FIG. 1 is a block diagram showing the main configuration of spectrum coding apparatus 100 according to Embodiment 1 of the present invention.

The spectrum coding apparatus 100 according to the present embodiment has a sampling rate conversion unit 101, an input terminal 102, a spectrum information specifying unit 106, and an output terminal 107. The sampling rate conversion unit 101 in turn has a frequency domain transform unit 103, a band extension unit 104, and an extended spectrum adding unit 105.

A signal sampled at the sampling rate Fx is input to the spectrum coding apparatus 100 via the input terminal 102.

The frequency domain transform unit 103 performs frequency analysis of this signal with an analysis length of 2·Na, thereby converting the time-domain signal into a frequency-domain signal (frequency domain transform), and calculates the first spectrum S1(k) (0 ≦ k < Na). It then supplies the first spectrum S1(k) to the band extension unit 104. The frequency analysis here uses the modified discrete cosine transform (MDCT). The MDCT analyzes each analysis frame overlapped by half with the preceding and following frames, and uses an orthogonal basis that is an odd function over the first half of the analysis frame and an even function over the second half, so that distortion between frames is cancelled. The discrete Fourier transform (DFT), the discrete cosine transform (DCT), or the like may also be used for the frequency analysis.
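The MDCT step can be sketched as follows. This is a minimal, direct-form sketch under stated assumptions: it uses a rectangular window and a 1/N inverse scaling so that plain overlap-add cancels the aliasing, whereas a practical codec would normally apply a window and a fast algorithm. The function names are hypothetical and do not come from the patent.

```python
import math


def mdct(frame):
    """MDCT of a frame of length 2*N; returns N coefficients (here N plays the role of Na)."""
    n_half = len(frame) // 2
    return [
        sum(frame[n] * math.cos(math.pi / n_half * (n + 0.5 + n_half / 2) * (k + 0.5))
            for n in range(2 * n_half))
        for k in range(n_half)
    ]


def imdct(coeffs):
    """Inverse MDCT; returns 2*N time-aliased samples, scaled by 1/N."""
    n_half = len(coeffs)
    return [
        sum(coeffs[k] * math.cos(math.pi / n_half * (n + 0.5 + n_half / 2) * (k + 0.5))
            for k in range(n_half)) / n_half
        for n in range(2 * n_half)
    ]


def overlap_add_middle(signal, n_half):
    """Rebuild samples n_half..2*n_half-1 from two frames that overlap by half."""
    first = imdct(mdct(signal[0:2 * n_half]))
    second = imdct(mdct(signal[n_half:3 * n_half]))
    # The aliasing terms of the two frames have opposite signs and cancel here.
    return [first[n_half + i] + second[i] for i in range(n_half)]
```

A frame of 2·Na samples yields Na coefficients, which matches the index range 0 ≦ k < Na of the first spectrum, and the overlap-add of two half-overlapping frames recovers the shared samples.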

The band extension unit 104 reserves a new region (frequency band) so that a new spectrum can be assigned at and above frequency k = Na of the input first spectrum S1(k), and extends the effective frequency band of the first spectrum S1(k) to 0 ≦ k < Nb. This process of extending the effective frequency band is described in detail later.

The extended spectrum adding unit 105 assigns an externally input extended spectrum S1'(k) (Na ≦ k < Nb) to the frequency band extended by the band extension unit 104 and outputs the result to the spectrum information specifying unit 106.

The spectrum information specifying unit 106 outputs, as an encoded code via the output terminal 107, the information needed to specify the extended spectrum S1'(k) within the spectrum supplied by the extended spectrum adding unit 105. This encoded code is, for example, information representing the subband energies of the extended spectrum S1'(k) or information representing the effective frequency band. Details are also given later.
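One kind of information mentioned here, the subband energy of the extended spectrum, can be sketched as below. The equal-width subband layout, the function name, and the parameter `num_subbands` are assumptions for illustration; this passage does not specify how the subbands are laid out or how the energies are quantized.

```python
def subband_energies(spectrum, na, nb, num_subbands):
    """Energy of the extended part S1'(k), Na <= k < Nb, per equal-width subband."""
    width = (nb - na) // num_subbands
    if width == 0 or (nb - na) % num_subbands:
        raise ValueError("extension band must divide evenly into subbands")
    return [
        # Sum of squared coefficients inside the j-th subband of the extension.
        sum(c * c for c in spectrum[na + j * width: na + (j + 1) * width])
        for j in range(num_subbands)
    ]
```

Only the coefficients at k ≧ Na contribute, so the result describes the extended spectrum alone and not the first spectrum.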

Next, the process by which the band extension unit 104 extends the effective frequency band of the first spectrum S1(k) is described in detail with reference to FIG. 2.

FIG. 2(a) shows the first spectrum S1(k) supplied from the frequency domain transform unit 103, and FIG. 2(b) shows the spectrum S1(k) after its effective frequency band has been extended by the band extension unit 104. The band extension unit 104 reserves a region in which new spectrum information can be stored, in the frequency band where the frequency k of the first spectrum S1(k) lies in the range Na ≦ k < Nb. The size of this new region is Nb − Na.
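The reservation of the new region and the later assignment of the extended spectrum can be sketched as simple list operations. This is an illustrative sketch: filling the reserved region with zeros before assignment is an assumption, and both function names are hypothetical.

```python
def extend_band(first_spectrum, nb):
    """Reserve Nb - Na new bins above k = Na (zero-filled by assumption)."""
    na = len(first_spectrum)
    if nb < na:
        raise ValueError("Nb must be at least Na")
    return list(first_spectrum) + [0.0] * (nb - na)


def add_extended_spectrum(extended, na, extension):
    """Place S1'(k), Na <= k < Nb, into the reserved region."""
    if len(extension) != len(extended) - na:
        raise ValueError("extension must contain exactly Nb - Na coefficients")
    return list(extended[:na]) + list(extension)
```

The first Na coefficients are left untouched, so the original spectrum occupies 0 ≦ k < Na of the result and the new region has size Nb − Na.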

Here, Nb is determined by the relationship among the sampling rate Fx of the signal supplied externally via the input terminal 102, the analysis length 2·Na of the frequency domain transform unit 103, and the sampling rate Fy of the signal decoded by the decoding unit (not shown). Specifically, Nb is set by the following expression:

$$N_b = N_a \cdot \frac{F_y}{F_x} \qquad \text{(Expression 1)}$$

When Nb is already fixed, the sampling rate Fy of the signal decoded by the decoding unit is determined by the following expression:

$$F_y = F_x \cdot \frac{N_b}{N_a} \qquad \text{(Expression 2)}$$

For example, when the encoding unit is designed with Na = 128 and Fx = 16 kHz and the decoding unit is to generate a decoded signal with Fy = 32 kHz, Nb must be set to 128·32/16 = 256. In this case, therefore, the region 128 ≦ k < 256 is reserved. As another example, when the encoding unit is designed with Na = 128, Nb = 384, and Fx = 8 kHz, the sampling rate of the decoded signal generated by the decoding unit is Fy = 8·384/128 = 24 kHz.
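The two worked examples above follow the relations Nb = Na·Fy/Fx and Fy = Fx·Nb/Na. A small sketch that reproduces them is given below; the function names are hypothetical, and rejecting a non-integer Nb is a design choice of this sketch and not something the text states.

```python
from fractions import Fraction


def extended_length(na, fx_hz, fy_hz):
    """Nb = Na * Fy / Fx, computed exactly; raises if the result is not an integer."""
    nb = Fraction(na) * Fraction(fy_hz) / Fraction(fx_hz)
    if nb.denominator != 1:
        raise ValueError("Na * Fy / Fx is not an integer for these parameters")
    return int(nb)


def decoded_rate(na, nb, fx_hz):
    """Fy = Fx * Nb / Na, returned as an exact fraction in Hz."""
    return Fraction(fx_hz) * nb / na
```

With Na = 128, Fx = 16 kHz, and Fy = 32 kHz the first function gives Nb = 256, and with Na = 128, Nb = 384, and Fx = 8 kHz the second gives Fy = 24 kHz, matching the examples in the text.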

FIG. 3 is a diagram explaining, in principle, the effect of the process of extending the effective frequency band of the spectrum performed in the band extension unit 104. FIG. 3(a) shows the spectrum Sa(k) obtained when a signal with sampling rate Fx is frequency-analyzed with an analysis length of 2·Na. The horizontal axis represents frequency and the vertical axis represents spectrum intensity.

By the Nyquist theorem, the effective frequency band of the signal is 0 to Fx/2. Since the analysis length is 2·Na, the frequency index k ranges over 0 ≦ k < Na, and the frequency resolution of the spectrum Sa(k) is Fx/(2·Na). FIG. 3(b), on the other hand, shows the spectrum Sb(k) obtained by upsampling the same signal to the sampling rate Fy and then performing frequency analysis with an analysis length of 2·Nb; the effective frequency band of the signal is extended to 0 to Fy/2, and the frequency index k ranges over 0 ≦ k < Nb. When Nb satisfies (Expression 1), the frequency resolution Fy/(2·Nb) of the spectrum Sb(k) equals Fx/(2·Na). That is, the spectra Sa(k) and Sb(k) are equal in the band 0 ≦ k < Na. Conversely, the spectrum Sb(k) obtained by widening the band of the spectrum Sa(k) (0 ≦ k < Na) to Nb matches the spectrum obtained by upsampling the signal from sampling rate Fx to sampling rate Fy and then performing frequency analysis with an analysis length of 2·Nb. Using this principle, a spectrum equivalent to that of the upsampled signal can be obtained without upsampling in the time domain.
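The equality of the two frequency resolutions can be written out in one line. Substituting the relation $N_b = N_a F_y / F_x$, which is the relation the worked examples for Nb follow, gives:

```latex
\frac{F_y}{2 N_b}
  = \frac{F_y}{2 \, N_a F_y / F_x}
  = \frac{F_x}{2 N_a}
```

For instance, with Fx = 16 kHz and Na = 128 the resolution is 16000/256 = 62.5 Hz, and with Fy = 32 kHz and Nb = 256 it is 32000/512 = 62.5 Hz, so bin k represents the same frequency in both spectra.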

In this way, the sampling rate conversion unit 101 converts the input time-domain signal into a frequency-domain signal and extends the effective frequency band of the resulting spectrum, thereby obtaining a spectrum equivalent to the one that would be obtained by frequency-transforming a signal upsampled in the time domain.
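The equivalence can be checked numerically on a small example. The sketch below is an assumption-laden illustration: it swaps the MDCT for a plain DCT-II (which the text lists as a possible alternative) because its inverse needs no overlap-add, extends the spectrum with zeros, and applies a gain of Nb/Na that belongs to this unnormalised DCT convention. All function names are hypothetical.

```python
import math


def dct2(x):
    """Unnormalised DCT-II: X[k] = sum_n x[n] * cos(pi * (n + 0.5) * k / N)."""
    n_len = len(x)
    return [
        sum(x[n] * math.cos(math.pi * (n + 0.5) * k / n_len) for n in range(n_len))
        for k in range(n_len)
    ]


def idct2(spec):
    """Inverse of dct2 for a spectrum of length M."""
    m_len = len(spec)
    return [
        (spec[0] + 2.0 * sum(spec[k] * math.cos(math.pi * (m + 0.5) * k / m_len)
                             for k in range(1, m_len))) / m_len
        for m in range(m_len)
    ]


def upsample_in_frequency_domain(x, nb):
    """Extend the spectrum of x from Na to Nb bins and return Nb time samples."""
    na = len(x)
    extended = dct2(x) + [0.0] * (nb - na)  # reserved band left empty
    gain = nb / na                          # compensates the longer inverse transform
    return [gain * v for v in idct2(extended)]
```

For an input made of a single DCT basis cosine sampled at Na points, the output is the same cosine sampled at Nb points, which is what upsampling that component in the time domain would produce.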

Since the signal output from the sampling rate conversion unit 101 is a frequency-domain signal, a time domain conversion unit may be provided to convert it back to the time domain when a time-domain signal is required. In the example above, the sampling rate conversion unit 101 is installed inside the spectrum coding apparatus 100, so the signal is input to the spectrum information specifying unit 106 as a frequency-domain signal, without being converted back to the time domain, and the encoded code is generated from it.

Here, the coding rate of the encoded code output from the spectrum information specifying unit 106 varies depending on the choice of extended spectrum input to the extended spectrum adding unit 105 and on how the spectrum information is specified in the spectrum information specifying unit 106. In other words, part of the processing in the sampling rate conversion unit 101 also strongly affects the coding. This means that the spectrum coding apparatus 100 performs sampling-rate conversion of the input signal and coding at the same time.

For simplicity, the description here has used the example in which the extended spectrum providing unit 105 adds the extended spectrum to the original spectrum. However, the processing performed by the spectrum information specifying unit 106 is to output, as the encoded code, the information needed to specify the extended spectrum. It is therefore sufficient that the extended spectrum to be added is specified, and the extended spectrum need not actually be added.

Although upsampling has been described here as an example of sampling rate conversion, the above principle also applies to downsampling.

FIG. 4 is a block diagram showing the main configuration of a wireless transmission device 130 in which the encoding device 120 according to the present embodiment is installed on the transmitting side of a wireless communication system.

The wireless transmission device 130 includes the encoding device 120, an input device 131, an A/D conversion device 132, an RF modulation device 133, and an antenna 134.

The input device 131 converts a sound wave W11 audible to the human ear into an analog electrical signal and outputs it to the A/D conversion device 132. The A/D conversion device 132 converts this analog signal into a digital signal and outputs it to the encoding device 120 (signal S1). The encoding device 120 encodes the input digital signal S1 to generate an encoded signal and outputs it to the RF modulation device 133 (signal S2). The RF modulation device 133 modulates the encoded signal S2 to generate a modulated encoded signal and outputs it to the antenna 134. The antenna 134 transmits the modulated encoded signal as a radio wave W12.

FIG. 5 is a block diagram showing the internal configuration of the encoding device 120. Here, the case of hierarchical coding (scalable coding) is described as an example.

The encoding device 120 includes an input terminal 121, a downsampling unit 122, a first layer encoding unit 123, a first layer decoding unit 124, a delay unit 126, a spectrum encoding unit 100a, a multiplexing unit 127, and an output terminal 128.

An acoustic signal S1 at sampling rate Fy is input to the input terminal 121. The downsampling unit 122 downsamples the signal S1 received via the input terminal 121 to generate and output a signal at sampling rate Fx. The first layer encoding unit 123 encodes the downsampled signal and outputs the resulting encoded code to both the multiplexing unit (multiplexer) 127 and the first layer decoding unit 124. The first layer decoding unit 124 generates a first layer decoded signal from this encoded code.

Meanwhile, the delay unit 126 applies a delay of a predetermined length to the signal S1 received via the input terminal 121. The length of this delay equals the time delay that arises when the signal passes through the downsampling unit 122, the first layer encoding unit 123, and the first layer decoding unit 124. The spectrum encoding unit 100a performs spectrum encoding using the signal S3 at sampling rate Fx output from the first layer decoding unit 124 and the signal S4 at sampling rate Fy output from the delay unit 126, and outputs the generated encoded code S5 to the multiplexing unit 127. The multiplexing unit 127 multiplexes the encoded code obtained by the first layer encoding unit 123 with the encoded code S5 obtained by the spectrum encoding unit 100a, and outputs the result as the output code S2 via the output terminal 128. This output code S2 is supplied to the RF modulation device 133.

FIG. 6 is a block diagram showing the internal configuration of the spectrum encoding unit 100a. The spectrum encoding unit 100a has the same basic configuration as the spectrum encoding device 100 shown in FIG. 1. Identical components are given identical reference numerals, and their description is omitted.

The spectrum encoding unit 100a is characterized in that it assigns the extended spectrum S1'(k) (Na ≤ k < Nb) using the spectrum of the input signal S4 at sampling rate Fy. Because this provides a target signal for determining the extended spectrum S1'(k), the accuracy of the extended spectrum S1'(k) improves, which in turn improves quality.

The frequency domain conversion unit 112 performs frequency analysis, with an analysis length of 2·Nb, on the signal S4 at sampling rate Fy received via the input terminal 111, and obtains the second spectrum S2(k) (0 ≤ k < Nb). Here, the sampling frequencies Fx and Fy and the analysis lengths Na and Nb are assumed to satisfy the relationship given by (Expression 1).

The spectrum information specifying unit 106 determines the encoded code that represents the extended spectrum S1'(k). Here, the extended spectrum S1'(k) is determined using the second spectrum S2(k) obtained by the frequency domain conversion unit 112. The spectrum information specifying unit 106 determines the encoded code in two steps: a step of determining the shape of the extended spectrum S1'(k), and a step of determining its gain.

First, the step of determining the shape of the extended spectrum S1'(k) is described.

In this step, the extended spectrum S1'(k) is determined using the band 0 ≤ k < Na of the first spectrum S1(k). Specifically, the first spectrum S1(k) located a fixed value C away on the frequency axis is copied into the extended spectrum S1'(k), as in the following expression:

$$S1'(k) = S1(k - C) \quad (Na \le k < Nb)$$

Here, C is a predetermined fixed value that must satisfy C ≤ Na. With this method, no information representing the shape of the extended spectrum S1'(k) is output as an encoded code.
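The fixed-offset copy can be sketched as follows. This is a minimal illustration assuming the copy has the form S1'(k) = S1(k − C). The function name and the lower-bound check on C (needed so that k − C stays inside 0 ≤ k − C < Na) are assumptions added for the sketch and are not stated in the description.

```python
def extend_by_fixed_shift(s1, na, nb, c):
    """Return S1'(k) for Na <= k < Nb, copied from S1(k - C)."""
    # C <= Na comes from the description; C >= Nb - Na is an added
    # safeguard so that every source index k - C lies below Na.
    if not (nb - na <= c <= na):
        raise ValueError("C must satisfy Nb - Na <= C <= Na")
    return [s1[k - c] for k in range(na, nb)]
```

Because C is known to both encoder and decoder in advance, nothing about the shape has to be transmitted in this variant.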

As another method, instead of the fixed value C described above, a variable T that takes values in a predetermined range T_MIN to T_MAX may be used, and the value T' of T at which the shapes of the extended spectrum S1'(k) and the second spectrum S2(k) are most similar may be output as part of the encoded code. In this case, the extended spectrum S1'(k) is expressed as:

$$S1'(k) = S1(k - T') \quad (Na \le k < Nb)$$
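A search for T' might look like the sketch below. The description does not say how similarity is measured, so the normalized cross-correlation used here is an assumption, as are the function name and the requirement that every candidate T keeps k − T inside 0 ≤ k − T < Na.

```python
import math

def search_best_shift(s1, s2, na, nb, t_min, t_max):
    """Return the T in [t_min, t_max] whose copy S1(k - T) best matches S2(k)."""
    target = s2[na:nb]
    best_t, best_score = None, -math.inf
    for t in range(t_min, t_max + 1):
        candidate = [s1[k - t] for k in range(na, nb)]
        energy = sum(v * v for v in candidate)
        if energy == 0.0:
            continue  # an all-zero candidate carries no shape information
        cross = sum(a * b for a, b in zip(candidate, target))
        # Assumed similarity measure: correlation normalized by the
        # candidate's norm (the target's norm is the same for every T).
        score = cross / math.sqrt(energy)
        if score > best_score:
            best_t, best_score = t, score
    return best_t
```

Only the index of the winning T needs to be sent, which is why this variant costs a few bits more than the fixed-C variant.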

Next, the step of determining the gain of the extended spectrum S1'(k), performed by the spectrum information specifying unit 106, is described.

The gain of the extended spectrum S1'(k) is determined so that its power matches the power of the second spectrum S2(k) in the band Na ≤ k < Nb. Specifically, the power deviation V is calculated according to the following expression:

[Equation image not reproduced: definition of the power deviation V]

The index obtained by quantizing this value is output as an encoded code via the output terminal 107.
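One way to realize this power matching is sketched below. The exact expression for V is not available in this text, so the square root of the power ratio used here is an assumption. The nearest-neighbour quantizer and its codebook are likewise hypothetical.

```python
import math

def power_deviation(s1_ext, s2, na, nb):
    """Assumed form: V = sqrt(power of S2 in [Na, Nb) / power of S1')."""
    p_target = sum(v * v for v in s2[na:nb])
    p_ext = sum(v * v for v in s1_ext)
    return math.sqrt(p_target / p_ext) if p_ext > 0.0 else 0.0

def quantize(value, codebook):
    """Return the index of the codebook entry closest to value."""
    return min(range(len(codebook)), key=lambda i: abs(codebook[i] - value))
```

With this definition, multiplying S1'(k) by V gives the extended band the same power as the target band of S2(k).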

Alternatively, the extended spectrum S1'(k) may be divided into a plurality of subbands, with the encoded code determined independently for each subband. In that case, in the step of determining the shape of the extended spectrum S1'(k), the T' given by (Expression 4) may be determined for each subband and output as an encoded code, or a single common T' may be determined and output as an encoded code. In the step of determining the gain of the extended spectrum S1'(k), a power deviation V(j) is calculated for each subband, and the index obtained by quantizing this value is output as an encoded code via the output terminal 107. The power deviation for each subband is given by the following expression:

[Equation image not reproduced: definition of the per-subband power deviation V(j)]

Here, j denotes the subband number, BL(j) denotes the frequency index corresponding to the minimum frequency of the j-th subband, and BH(j) denotes the frequency index corresponding to the maximum frequency of the j-th subband. Outputting an encoded code for each subband in this way makes it possible to realize a scalable function.
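The per-subband variant can be sketched in the same spirit. As before, the square-root power ratio is an assumed form for V(j), and the function name and the representation of subbands as inclusive (BL(j), BH(j)) pairs in absolute frequency indices are choices made for this illustration.

```python
import math

def subband_power_deviations(s1_ext, s2, na, bands):
    """Compute an assumed V(j) for each subband.

    s1_ext holds S1'(k) for k >= Na (so S1'(k) is s1_ext[k - na]);
    bands is a list of inclusive (BL(j), BH(j)) index pairs.
    """
    deviations = []
    for bl, bh in bands:
        p_target = sum(s2[k] ** 2 for k in range(bl, bh + 1))
        p_ext = sum(s1_ext[k - na] ** 2 for k in range(bl, bh + 1))
        deviations.append(math.sqrt(p_target / p_ext) if p_ext > 0.0 else 0.0)
    return deviations
```

Since each V(j) is coded on its own, a decoder that receives only the codes for the lower subbands can still reconstruct those subbands, which is one way to read the scalable function mentioned above.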

Apart from the mode shown in FIG. 6, in which the second spectrum S2(k) is calculated, a mode in which the signal at sampling rate Fy is subjected to LPC analysis (spectrum encoding unit 100b), as shown in FIG. 7, may also be used. That is, LPC coefficients are obtained by LPC analysis of the signal at sampling rate Fy, and the extended spectrum S1'(k) can be determined using these LPC coefficients. In this configuration, the LPC coefficients are converted to spectral information by a DFT, and the extended spectrum S1'(k) can be determined using this spectrum.

Thus, the encoding device of the present embodiment makes it possible to reduce the circuit scale of the encoding device and also the amount of computation required for encoding.

In addition to the above effects, applying the encoding device of the present embodiment to scalable coding yields the following further effects.

When sampling rate conversion is performed in the time domain, as in the conventional approach, the input signal must be passed through a low-pass filter (hereinafter, LPF) to avoid aliasing. In general, filtering in the time domain introduces a time delay in the output signal relative to the input signal. When an FIR filter is used as the LPF, the filter order must be increased to obtain a steep cutoff characteristic, which greatly increases the amount of computation and also introduces a time delay corresponding to half the filter order in samples.

For example, when a 256th-order filter is applied to a signal with sampling frequency Fs = 24 kHz, the sampling rate conversion alone introduces a delay of more than 5 ms. When applied to two-way voice communication, such a delay is a problem because the other party's responses seem slow.
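The figure quoted above follows from the half-order delay of an FIR filter, as the short calculation below shows. The function name is hypothetical, and the filter is assumed to be linear-phase so that its delay is exactly half its order.

```python
def fir_delay_ms(order, fs_hz):
    """Delay of a linear-phase FIR filter: order / 2 samples, in milliseconds."""
    return (order / 2) / fs_hz * 1000.0

# 256th-order filter at 24 kHz: 128 samples of delay, a little over 5 ms.
delay_ms = fir_delay_ms(256, 24000)
```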

When an IIR filter is used as the LPF, a steep cutoff characteristic can be obtained even with a relatively low order, and the delay does not become as large as with an FIR filter. However, unlike an FIR filter, an IIR filter cannot be designed so that the delay is constant across all frequencies. In scalable coding, when the signal after sampling rate conversion is subtracted from the input signal, a fixed delay must be applied to the input signal to match the time delay of the converted signal. With an IIR LPF, the delay is not constant with respect to frequency, so this subtraction cannot be performed accurately.

The encoding device of the present embodiment can resolve these problems that arise in scalable coding.

FIG. 8 is a block diagram showing the main configuration of a wireless reception device 180 that receives the signal transmitted from the wireless transmission device 130.

The wireless reception device 180 includes an antenna 181, an RF demodulation device 182, a decoding device 170, a D/A conversion device 183, and an output device 184.

The antenna 181 receives the digital encoded acoustic signal as the radio wave W12, generates a digital received encoded acoustic signal as an electrical signal, and supplies it to the RF demodulation device 182. The RF demodulation device 182 demodulates the received encoded acoustic signal from the antenna 181 to generate a demodulated encoded acoustic signal S11 and supplies it to the decoding device 170.

The decoding device 170 receives the digital demodulated encoded acoustic signal S11 from the RF demodulation device 182, decodes it to generate a digital decoded acoustic signal S12, and supplies this to the D/A conversion device 183. The D/A conversion device 183 converts the digital decoded acoustic signal S12 from the decoding device 170 into an analog decoded acoustic signal and supplies it to the output device 184. The output device 184 converts the analog decoded acoustic signal, an electrical signal, into vibrations of the air and outputs it as a sound wave W13 audible to the human ear.

FIG. 9 is a block diagram showing the internal configuration of the decoding device 170. Here too, the case of decoding a hierarchically coded signal is described as an example.

The decoding device 170 includes an input terminal 171, a separation unit 172, a first layer decoding unit 173, a spectrum decoding unit 150, and an output terminal 176.

The hierarchically coded code S11 is input to the input terminal 171 from the RF demodulation device 182. The separation unit 172 separates the demodulated encoded acoustic signal S11 received via the input terminal 171 into an encoded code for the first layer decoding unit 173 and an encoded code for the spectrum decoding unit 150. The first layer decoding unit 173 uses the encoded code obtained by the separation unit 172 to produce a decoded signal at sampling rate Fx, and supplies this decoded signal S13 to the spectrum decoding unit 150. The spectrum decoding unit 150 performs the spectrum decoding described later on the encoded code S14 separated by the separation unit 172 and the signal S13 at sampling rate Fx generated by the first layer decoding unit 173, generates a decoded signal S12 at sampling rate Fy, and outputs it via the output terminal 176.

FIG. 10 is a block diagram showing the internal configuration of the spectrum decoding unit 150.

The spectrum decoding unit 150 includes input terminals 152 and 153, a frequency domain conversion unit 154, a band extension unit 155, a decoding unit 156, a combining unit 157, a time domain conversion unit 158, and an output terminal 159.

A signal S13 sampled at sampling rate Fx is input to the input terminal 152. The encoded code S14 for the extended spectrum S1'(k) is input to the input terminal 153.

The frequency domain conversion unit 154 performs frequency analysis, with an analysis length of 2·Na, on the time-domain signal S13 received from the input terminal 152 and calculates the first spectrum S1(k). The modified discrete cosine transform (MDCT) is used as the frequency analysis method. In the MDCT, each analysis frame overlaps its preceding and following neighbors by half, and an orthogonal basis is used that is an odd function over the first half of the analysis frame and an even function over the second half, so that distortion between frames is cancelled. The first spectrum S1(k) obtained in this way is supplied to the band extension unit 155. The discrete Fourier transform (DFT), the discrete cosine transform (DCT), or the like can also be used as the frequency analysis method.
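The cancellation of inter-frame distortion by half-overlapped MDCT frames can be illustrated with a direct, unoptimized sketch. The sine window, the 2/N scaling of the inverse transform, and all function names are choices made for this illustration. The description does not specify a window.

```python
import math

def sine_window(length):
    """Sine window; satisfies w[n]^2 + w[n + N]^2 = 1 for length = 2N."""
    return [math.sin(math.pi * (n + 0.5) / length) for n in range(length)]

def mdct(frame):
    """Windowed MDCT: 2N time samples -> N coefficients."""
    two_n = len(frame)
    n_half = two_n // 2
    w = sine_window(two_n)
    return [sum(w[n] * frame[n]
                * math.cos(math.pi / n_half * (n + 0.5 + n_half / 2) * (k + 0.5))
                for n in range(two_n))
            for k in range(n_half)]

def imdct(coeffs):
    """Windowed inverse MDCT: N coefficients -> 2N time-aliased samples."""
    n_half = len(coeffs)
    two_n = 2 * n_half
    w = sine_window(two_n)
    return [2.0 / n_half * w[n]
            * sum(coeffs[k]
                  * math.cos(math.pi / n_half * (n + 0.5 + n_half / 2) * (k + 0.5))
                  for k in range(n_half))
            for n in range(two_n)]

def analyze_synthesize(x, n_half):
    """MDCT each half-overlapped frame, invert it, and overlap-add."""
    out = [0.0] * len(x)
    for start in range(0, len(x) - 2 * n_half + 1, n_half):
        y = imdct(mdct(x[start:start + 2 * n_half]))
        for i, v in enumerate(y):
            out[start + i] += v
    return out
```

Each inverse-transformed frame on its own contains aliasing, but samples covered by two overlapping frames are reconstructed exactly once the frames are added.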

The band extension unit 155 reserves a region at and above frequency k = Na of the input first spectrum S1(k) in which a new spectrum can be placed, so that the band of the first spectrum S1(k) becomes 0 ≤ k < Nb. The band-extended first spectrum S1(k) is output to the combining unit 157.

Meanwhile, the decoding unit 156 decodes the encoded code S14 for the extended spectrum S1'(k) received via the input terminal 153 to obtain the extended spectrum S1'(k), and outputs it to the combining unit 157.

The combining unit 157 combines the first spectrum S1(k) supplied from the band extension unit 155 with the extended spectrum S1'(k). The combination is performed by inserting the extended spectrum S1'(k) into the band Na ≤ k < Nb of the first spectrum S1(k). The first spectrum S1(k) obtained by this processing is output to the time domain conversion unit 158.
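The band extension and combining steps amount to simple array operations, as the following sketch suggests. The function names are hypothetical, and filling the reserved region with zeros is an assumption (the description only says that a region is reserved).

```python
def extend_band(s1, nb):
    """Reserve bins Na <= k < Nb after the first spectrum (zeros assumed)."""
    if nb < len(s1):
        raise ValueError("Nb must not be smaller than Na")
    return list(s1) + [0.0] * (nb - len(s1))

def combine(s1_extended, s1_ext, na):
    """Insert the extended spectrum S1'(k) into bins Na <= k < Nb."""
    if len(s1_ext) != len(s1_extended) - na:
        raise ValueError("S1' must contain exactly Nb - Na bins")
    combined = list(s1_extended)
    combined[na:] = s1_ext
    return combined
```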

The time domain conversion unit 158 applies a time-domain conversion corresponding to the inverse of the frequency-domain conversion performed in the spectrum encoding unit 100a and, after multiplication by an appropriate window function and overlap-add, generates the time-domain signal S12. The time-domain signal S12 generated in this way is output as the decoded signal via the output terminal 159.

Next, the processing performed by the band extension unit 155 is described with reference to FIG. 11.

FIG. 11(a) shows the first spectrum S1(k) supplied from the frequency domain conversion unit 154. FIG. 11(b) shows the spectrum obtained as a result of the processing by the band extension unit 155, in which a region that can store new spectral information is reserved in the band Na ≤ k < Nb of the frequency index k. The size of this new region is Nb − Na. Nb depends on the relationship among the sampling rate Fx of the signal supplied from the input terminal 152, the analysis length 2·Na of the frequency domain conversion unit 154, and the sampling rate Fy of the signal decoded by the spectrum decoding unit 150, and can be set according to the following expression:

$$Nb = Na \cdot \frac{Fy}{Fx}$$

When Nb is already fixed, the sampling rate Fy of the signal decoded by the spectrum decoding unit 150 is determined by the following expression:

$$Fy = Fx \cdot \frac{Nb}{Na}$$

For example, when the sampling rate of the input signal is Fx = 16 kHz and the analysis length of the frequency domain conversion unit 154 is Na = 128, generating a decoded signal at sampling rate Fy = 32 kHz in the spectrum decoding unit 150 requires the band extension unit 155 to set Nb = 128·32/16 = 256. In this case, therefore, the band extension unit 155 reserves the region 128 ≤ k < 256. As another example, when the sampling rate of the input signal is Fx = 8 kHz, the analysis length of the frequency domain conversion unit 154 is Na = 128, and the extension amount of the band extension unit 155 is Nb = 384, the sampling rate of the decoded signal generated by the spectrum decoding unit 150 is Fy = 8·384/128 = 24 kHz.
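The two worked examples can be reproduced with a few lines of code. The function names are hypothetical, and rejecting non-integer Nb is a choice made for this sketch.

```python
def extended_bins(na, fx_hz, fy_hz):
    """Nb = Na * Fy / Fx; this sketch requires the result to be an integer."""
    nb, remainder = divmod(na * fy_hz, fx_hz)
    if remainder:
        raise ValueError("Na * Fy / Fx is not an integer")
    return nb

def decoded_rate_hz(fx_hz, na, nb):
    """Fy = Fx * Nb / Na."""
    return fx_hz * nb / na

nb_first_example = extended_bins(128, 16000, 32000)    # bins for 16 kHz -> 32 kHz
fy_second_example = decoded_rate_hz(8000, 128, 384)    # rate for Nb = 384 at 8 kHz
```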

FIG. 12 shows how the spectrum is processed by the combining unit 157 and the time domain conversion unit 158 to generate the decoded signal.

The combining unit 157 inserts the extended spectrum S1'(k) (Na ≤ k < Nb) into the band Na ≤ k < Nb of the band-extended first spectrum S1(k), and sends the resulting combined first spectrum S1(k) (0 ≤ k < Nb) to the time domain conversion unit 158. The time domain conversion unit 158 generates a time-domain decoded signal, so that a decoded signal at sampling rate FS (= Fx·Nb/Na) is obtained.

Thus, the decoding device of the present embodiment can decode a signal encoded by the encoding device according to the present embodiment.

The description here has used the example in which the encoding device or decoding device according to the present embodiment is applied to a wireless communication system. As described below, however, the encoding device or decoding device according to the present embodiment can also be applied to a wired communication system.

FIG. 13(a) is a block diagram showing the main configuration of the transmitting side when the encoding device according to the present embodiment is applied to a wired communication system. Components identical to those shown in FIG. 4 are given the same reference numerals, and their description is omitted.

The wired transmission device 140 includes the encoding device 120, the input device 131, and the A/D conversion device 132, and its output is connected to a network N1.

The input terminal of the A/D conversion device 132 is connected to the output terminal of the input device 131. The input terminal of the encoding device 120 is connected to the output terminal of the A/D conversion device 132. The output terminal of the encoding device 120 is connected to the network N1.

The input device 131 converts the sound wave W11 audible to the human ear into an analog electrical signal and supplies it to the A/D conversion device 132. The A/D conversion device 132 converts the analog signal into a digital signal and supplies it to the encoding device 120. The encoding device 120 encodes the incoming digital signal to generate a code and outputs it to the network N1.

FIG. 13(b) is a block diagram showing the main configuration of the receiving side when the decoding device according to the present embodiment is applied to a wired communication system. Components identical to those shown in FIG. 8 are given the same reference numerals, and their description is omitted.

The wired reception device 190 includes a reception device 191 connected to the network N1, the decoding device 170, the D/A conversion device 183, and the output device 184.

The input terminal of the reception device 191 is connected to the network N1. The input terminal of the decoding device 170 is connected to the output terminal of the reception device 191. The input terminal of the D/A conversion device 183 is connected to the output terminal of the decoding device 170. The input terminal of the output device 184 is connected to the output terminal of the D/A conversion device 183.

The reception device 191 receives the digital encoded acoustic signal from the network N1, generates a digital received acoustic signal, and supplies it to the decoding device 170. The decoding device 170 receives the received acoustic signal from the reception device 191, decodes it to generate a digital decoded acoustic signal, and supplies this to the D/A conversion device 183. The D/A conversion device 183 converts the digital decoded acoustic signal from the decoding device 170 into an analog decoded acoustic signal and supplies it to the output device 184. The output device 184 converts the analog decoded acoustic signal, an electrical signal, into vibrations of the air and outputs it as a sound wave W13 audible to the human ear.

Thus, the above configuration provides wired transmission and reception devices that have the same operation and effects as the wireless transmission and reception devices described above.

(Embodiment 2)
FIG. 14 is a block diagram showing the main configuration of a decoding device 270 according to Embodiment 2 of the present invention. The decoding device 270 has the same basic configuration as the decoding device 170 shown in FIG. 9. Identical components are given identical reference numerals, and their description is omitted.

The present embodiment is characterized in that a decoded signal is generated at a desired sampling rate by modifying the maximum frequency index Nb of the combined first spectrum S1(k) (0 ≤ k < Nb) to a desired value Nc.

The spectrum decoding unit 250 performs spectrum decoding using the encoded code S14 separated by the separation unit 172, the signal S13 at sampling rate Fx generated by the first layer decoding unit 173, and the coefficient Nc (signal S21) received via the input terminal 271. It then outputs the resulting decoded signal at sampling rate Fy via the output terminal 176. When the analysis length of the frequency-domain conversion in the spectrum decoding unit 250 is 2·Na, the sampling rate Fy of the decoded signal is given by Fy = Fx·Nc/Na.

FIG. 15 is a block diagram showing the internal configuration of the spectrum decoding unit 250.

The coefficient Nc input via the input terminal 271 is supplied to the correction unit 251 and the time domain conversion unit 158a.

The correction unit 251 corrects the effective band of the first spectrum S1(k) (0 ≤ k < Nb) supplied from the combining unit 157 to 0 ≤ k < Nc, based on the coefficient Nc (signal S21) supplied via the input terminal 271. It then supplies the band-corrected first spectrum S1(k) (0 ≤ k < Nc) to the time domain conversion unit 158a.

In accordance with the coefficient Nc supplied via the input terminal 271, the time domain conversion unit 158a applies the transform, with analysis length 2·Nc, to the first spectrum S1(k) (0 ≤ k < Nc) supplied from the correction unit 251, multiplies the result by an appropriate window function, performs overlap-add, and outputs the resulting time domain signal via the output terminal 159. The sampling rate of this decoded signal is FS = Fx·Nc/Na.

FIGS. 16 and 17 are diagrams for explaining the processing of the correction unit 251 in more detail.

FIG. 16 shows the processing of the correction unit 251 when Nc < Nb. The band of the first spectrum S1(k) (signal S21) supplied from the combining unit 157 is 0 ≤ k < Nb. The correction unit 251 therefore deletes the spectrum in the range Nc ≤ k < Nb so that the band of the first spectrum S1(k) becomes 0 ≤ k < Nc. The resulting first spectrum S1(k) (0 ≤ k < Nc) (signal S22) is supplied to the time domain conversion unit 158a, which generates the time domain decoded signal S23. The sampling rate of the decoded signal S23 is FS = Fx·Nc/Na.

FIG. 17 likewise shows the processing of the correction unit 251, but for the case Nc > Nb. The band of the first spectrum S1(k) (signal S25) supplied from the combining unit 157 is 0 ≤ k < Nb, as in FIG. 16. The correction unit 251 extends the band by the range Nb ≤ k < Nc so that the band of the first spectrum S1(k) becomes 0 ≤ k < Nc, and assigns a specific value (for example, zero) to the added range. The resulting first spectrum S1(k) (0 ≤ k < Nc) (signal S26) is supplied to the time domain conversion unit 158a, which generates the time domain decoded signal S27. The sampling rate of the decoded signal S27 is FS = Fx·Nc/Na.
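The two cases handled by the correction unit 251 (deletion when Nc < Nb, extension with a fixed value when Nc > Nb) can be sketched as a single function. This is only a minimal illustration, assuming the spectrum is held as a plain list of Nb coefficients; the name `correct_band` and the `fill` parameter are hypothetical.

```python
def correct_band(spectrum, nc, fill=0.0):
    """Return a copy of `spectrum` whose effective band is 0 <= k < nc.

    spectrum: coefficients S1(k) for 0 <= k < Nb (Nb = len(spectrum))
    nc:       desired number of coefficients Nc
    fill:     value assigned to Nb <= k < Nc when the band is extended
    """
    if nc < 0:
        raise ValueError("Nc must be non-negative")
    nb = len(spectrum)
    if nc <= nb:
        # Case of FIG. 16 (Nc < Nb): delete the range Nc <= k < Nb.
        return list(spectrum[:nc])
    # Case of FIG. 17 (Nc > Nb): extend the band and fill Nb <= k < Nc.
    return list(spectrum) + [fill] * (nc - nb)

print(correct_band([1.0, 2.0, 3.0, 4.0], 2))  # truncation
print(correct_band([1.0, 2.0], 4))            # zero extension
```

When Nc equals Nb the spectrum is passed through unchanged, which matches the frames that need no processing in the description of FIG. 19.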

The operation of the spectrum decoding unit 250 is described further with reference to FIGS. 18 and 19.

First, assume that the encoded code input via the input terminal 153 varies from frame to frame. Specifically, assume that the first spectrum S1(k) output from the combining unit 157 can take one of the three bands shown in FIG. 18, namely 0 ≤ k < Na (band R1), 0 ≤ k < Nb1 (band R2), and 0 ≤ k < Nb2 (band R3), where Na < Nb1 < Nb2, and that one of these bands is selected for each frame.

FIG. 19(a) is a diagram for explaining the operation of the spectrum decoding unit 250 when the coefficient Nc is equal to Nb2, and FIG. 19(b) is a diagram for explaining its operation when the coefficient Nc is equal to Nb1.

In these figures, the band of the spectrum obtained in the i-th frame is shown as one of R1, R2, and R3. Process 1 inserts zero values into the band Nb1 ≤ k < Nb2, Process 2 inserts zero values into the band Na ≤ k < Nb2, Process 3 deletes the band Nb1 ≤ k < Nb2, and Process 4 inserts zero values into the band Na ≤ k < Nb1.

First, the case of FIG. 19(a) is described.

In this figure, in the 0th to 1st frames and the 7th to 8th frames, the spectrum band is R3, that is, the band of the first spectrum S1(k) is 0 ≤ k < Nb2. The correction unit 251 therefore outputs the first spectrum S1(k) (0 ≤ k < Nb2) to the time domain conversion unit 158a without any processing.

In the 2nd to 4th frames and the 9th frame, the spectrum band is R2, that is, the band of the first spectrum S1(k) is 0 ≤ k < Nb1. The correction unit 251 therefore extends the band of the first spectrum S1(k) to Nb2, inserts zero values into the band Nb1 ≤ k < Nb2, and then outputs the first spectrum S1(k) (0 ≤ k < Nb2) to the time domain conversion unit 158a.

In the 5th to 6th frames, on the other hand, the spectrum band is R1, that is, the band of the first spectrum S1(k) is 0 ≤ k < Na. The correction unit 251 therefore extends the band of the first spectrum S1(k) to Nb2, inserts zero values into the range Na ≤ k < Nb2, and then outputs the first spectrum S1(k) (0 ≤ k < Nb2) to the time domain conversion unit 158a.

Next, the case of FIG. 19(b) is described.

In this figure, in the 2nd to 4th frames and the 9th frame, the spectrum band is R2, that is, the band of the first spectrum S1(k) is 0 ≤ k < Nb1. The correction unit 251 therefore outputs the first spectrum S1(k) (0 ≤ k < Nb1) to the time domain conversion unit 158a without any processing.

In the 0th to 1st frames and the 7th to 8th frames, the spectrum band is R3, that is, the band of the first spectrum S1(k) is 0 ≤ k < Nb2. The correction unit 251 therefore deletes the band Nb1 ≤ k < Nb2 and then outputs the first spectrum S1(k) (0 ≤ k < Nb1) to the time domain conversion unit 158a.

In the 5th to 6th frames, on the other hand, the spectrum band is R1, that is, the band of the first spectrum S1(k) is 0 ≤ k < Na. The correction unit 251 therefore extends the band of the first spectrum S1(k) to Nb1, inserts zero values into the band Na ≤ k < Nb1, and then outputs the first spectrum S1(k) (0 ≤ k < Nb1) to the time domain conversion unit 158a.
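The frame-by-frame behaviour described for FIG. 19(a) and FIG. 19(b) can be reproduced with a short Python sketch. The frame-to-band assignment follows the text (frames 0 to 1 and 7 to 8 are R3, frames 2 to 4 and 9 are R2, frames 5 to 6 are R1); the numeric values of Na, Nb1 and Nb2 and all function names are assumptions chosen only for illustration.

```python
# Hypothetical band edges; only the ordering Na < Nb1 < Nb2 comes from the text.
NA, NB1, NB2 = 4, 6, 8
BANDS = {"R1": NA, "R2": NB1, "R3": NB2}

# Band received in frames 0..9, as described for FIG. 19.
FRAME_BANDS = ["R3", "R3", "R2", "R2", "R2", "R1", "R1", "R3", "R3", "R2"]

def plan_correction(width, nc):
    """Decide what the correction unit does for a spectrum of `width` bins."""
    if width == nc:
        return ("pass", None)              # no processing
    if width < nc:
        return ("zero_fill", (width, nc))  # insert zeros in width <= k < nc
    return ("delete", (nc, width))         # delete nc <= k < width

def plan_frames(nc):
    """Return the per-frame action for a fixed coefficient Nc."""
    return [plan_correction(BANDS[band], nc) for band in FRAME_BANDS]

for label, nc in (("FIG. 19(a), Nc = Nb2", NB2), ("FIG. 19(b), Nc = Nb1", NB1)):
    print(label)
    for i, action in enumerate(plan_frames(nc)):
        print("  frame", i, FRAME_BANDS[i], action)
```

With Nc = Nb2 the zero-fill actions correspond to Processes 1 and 2, and with Nc = Nb1 the delete and zero-fill actions correspond to Processes 3 and 4; in every frame the output has exactly Nc coefficients.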

Thus, according to this embodiment, even when the effective frequency band of the received first spectrum S1(k) varies over time, a decoded signal of the desired sampling rate can be obtained stably by supplying an appropriate coefficient Nc.

(Embodiment 3)
FIG. 20 is a diagram showing the main configuration of a communication system according to Embodiment 3 of the present invention.

The feature of this embodiment is that it copes with the case where the effective frequency band of the first spectrum S1(k) received on the receiving side varies over time depending on the state of the communication network (the communication environment).

The hierarchical encoding unit 301 applies the hierarchical encoding processing described in Embodiment 1 to an input signal of sampling rate Fy and generates a scalable encoded code. Here, the generated encoded code is assumed to consist of information on the band 0 ≤ k < Ne (R31), information on the band Ne ≤ k < Nf (R32), and information on the band Nf ≤ k < Ng (R33). The hierarchical encoding unit 301 supplies this encoded code to the network control unit 302.

The network control unit 302 transfers the encoded code supplied from the hierarchical encoding unit 301 to the hierarchical decoding unit 303. In doing so, the network control unit 302 discards part of the encoded code to be transferred, depending on the state of the network. The encoded code input to the hierarchical decoding unit 303 is therefore one of the following: an encoded code consisting of information R31 to R33 when nothing is discarded; an encoded code consisting of information R31 and R32 when the encoded code of information R33 is discarded; or an encoded code consisting of information R31 alone when the encoded codes of information R32 and R33 are discarded.
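The three outcomes listed above can be modelled as dropping zero, one, or two of the highest layers. The following minimal sketch does this, assuming hypothetical band edges Ne, Nf, Ng and hypothetical function names; the rule that the lowest layer R31 is always forwarded is inferred from the three cases in the text rather than stated as a requirement.

```python
# Hypothetical band edges in frequency-index units; only Ne < Nf < Ng matters.
NE, NF, NG = 160, 240, 320

# (name, lower edge, upper edge) for each piece of layered information.
LAYERS = [("R31", 0, NE), ("R32", NE, NF), ("R33", NF, NG)]

def forward_layers(layers, num_discarded):
    """Drop the `num_discarded` highest layers before forwarding."""
    if not 0 <= num_discarded < len(layers):
        # Assumption: the lowest layer is never discarded.
        raise ValueError("at least the lowest layer must be forwarded")
    return layers[:len(layers) - num_discarded]

def effective_band(received):
    """Return (lower, upper) index bounds covered by the received layers."""
    return (received[0][1], received[-1][2])

for dropped in range(3):
    received = forward_layers(LAYERS, dropped)
    print([name for name, _, _ in received], effective_band(received))
```

The upper bound returned by `effective_band` is the quantity that varies from frame to frame at the receiver, which is the situation the correction by the coefficient Nc in Embodiment 2 is meant to absorb.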

The hierarchical decoding unit 303 generates a decoded signal by applying the hierarchical decoding method described in Embodiment 1 or Embodiment 2 to the supplied encoded code. When Embodiment 1 is applied to the hierarchical decoding unit 303, the sampling rate Fz of the output decoded signal is Fy (since Fz = Fy·Ng/Ng). When Embodiment 2 is applied, the sampling rate of the decoded signal can be set through the desired coefficient Nc, and the sampling rate Fz of the decoded signal is Fy·Nc/Ng.

Thus, according to this embodiment, even when the effective frequency band of the first spectrum S1(k) received on the receiving side varies over time depending on the state of the communication network, the receiving side can stably obtain a decoded signal of the desired sampling rate.

(Embodiment 4)
FIG. 21 is a diagram showing the main configuration of a communication system according to Embodiment 4 of the present invention.

The feature of this embodiment is that, even when one encoded code generated by one hierarchical encoding unit is transmitted simultaneously to a plurality of hierarchical decoding units that can decode at different sampling rates (that is, have different decoding capabilities), the receiving side can handle it and each decoding unit obtains a decoded signal at its own sampling rate.

The hierarchical encoding unit 401 applies the encoding processing described in Embodiment 1 to an input signal of sampling rate Fy and generates a scalable encoded code. Here, the generated encoded code is assumed to consist of information on the band 0 ≤ k < Nh (R41), information on the band Nh ≤ k < Ni (R42), and information on the band Ni ≤ k < Nj (R43). The hierarchical encoding unit 401 supplies this encoded code to each of the first hierarchical decoding unit 402-1, the second hierarchical decoding unit 402-2, and the third hierarchical decoding unit 402-3.

The first hierarchical decoding unit 402-1, the second hierarchical decoding unit 402-2, and the third hierarchical decoding unit 402-3 each generate a decoded signal by applying the hierarchical decoding method described in Embodiment 1 or Embodiment 2 to the supplied encoded code. The first hierarchical decoding unit 402-1 decodes with the coefficient Nc = Nj, the second hierarchical decoding unit 402-2 decodes with the coefficient Nc = Ni, and the third hierarchical decoding unit 402-3 decodes with the coefficient Nc = Nh.

The first hierarchical decoding unit 402-1 performs decoding with the coefficient Nc = Nj and generates a decoded signal. The sampling rate F1 of this decoded signal is Fy (since F1 = Fy·Nj/Nj).

The second hierarchical decoding unit 402-2 performs decoding with the coefficient Nc = Ni and generates a decoded signal. The sampling rate F2 of this decoded signal is Fy·Ni/Nj.

The third hierarchical decoding unit 402-3 performs decoding with the coefficient Nc = Nh and generates a decoded signal. The sampling rate F3 of this decoded signal is Fy·Nh/Nj.
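To make the relation between the three decoding units concrete, the sketch below computes F1, F2 and F3 from a single encoded code. The values Fy = 48000 Hz, Nh = 160, Ni = 320 and Nj = 480 are hypothetical, chosen so that the results are round numbers, and the function name is an assumption.

```python
from fractions import Fraction

def decoder_rates(fy_hz, nj, nc_by_decoder):
    """Return the output sampling rate Fy * Nc / Nj for each decoding unit."""
    if nj <= 0:
        raise ValueError("Nj must be positive")
    return {name: Fraction(fy_hz) * nc / nj for name, nc in nc_by_decoder.items()}

# Hypothetical values; only Nh < Ni < Nj is implied by the band layout.
FY, NH, NI, NJ = 48000, 160, 320, 480

rates = decoder_rates(FY, NJ, {"402-1": NJ, "402-2": NI, "402-3": NH})
for name, rate in rates.items():
    print(name, float(rate), "Hz")
```

All three units receive the same encoded code; only the coefficient Nc used locally differs, so the encoder does not need to know which rate each receiver supports.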

Thus, according to this embodiment, the transmitting side can transmit the encoded code without regard to the decoding capability of the receiving side, so the load on the communication network can be kept low. In addition, decoded signals at these multiple sampling rates can be generated with a simple configuration and a small amount of computation.

The present invention can also be applied to a communication terminal apparatus and a base station apparatus in a mobile communication system, which makes it possible to provide a communication terminal apparatus and a base station apparatus having the same operational effects as described above.

Although the present invention has been described here using a hardware configuration as an example, it can also be implemented in software.

The present invention has the effect of realizing scalable coding with a simple configuration and a small amount of computation, and is applicable to communication systems such as IP networks.

103, 112, 154 Frequency domain conversion unit
104, 155 Band extension unit
105 Extended spectrum giving unit
106 Spectrum information specifying unit
113 LPC analysis unit
156 Decoding unit
157 Combining unit
158 Time domain conversion unit
251 Correction unit

Claims

A scalable decoding device comprising:
receiving means for receiving information that is generated by encoding a speech signal or audio signal with a scalable encoding device and that includes first encoded information relating to a first band, which is a band below a predetermined frequency of the speech signal or audio signal, and second encoded information relating to a second band, which is a band above the predetermined frequency;
first decoding means for decoding the first encoded information to generate a time-domain signal of a first sampling rate corresponding to the first band of the speech signal or audio signal; and
second decoding means for decoding the second encoded information in the frequency domain to generate a decoded spectrum of the second band and, using the decoded spectrum of the second band, generating a decoded signal of a third sampling rate obtained by sampling-rate conversion of a predetermined second sampling rate higher than the first sampling rate,
wherein the second decoding means includes:
first transform means for obtaining a spectrum of the first band by frequency-domain transform from the time-domain signal of the first sampling rate obtained by the first decoding means;
replicating means for replicating the spectrum at a specific position of the spectrum of the first band;
spectrum generating means for generating, using the second encoded information and the replicated spectrum, a decoded spectrum of the second band that extends the bandwidth of the decoded spectrum of the first band, and for generating an extended decoded spectrum by adding the decoded spectrum of the second band to the decoded spectrum of the first band; and
time-domain signal generating means for obtaining a spectrum of a predetermined band either by inserting zeros into a first high-band portion that adjoins the maximum frequency of the extended decoded spectrum and lies outside the extended decoded spectrum, or by deleting a second high-band portion that adjoins the maximum frequency and lies inside the extended decoded spectrum, and for generating, by time-domain transform from the spectrum of the predetermined band, a time-domain signal of the third sampling rate as the decoded signal.

A scalable decoding method comprising:
a receiving step of receiving information that is generated by encoding a speech signal or audio signal with a scalable encoding device and that includes first encoded information relating to a first band, which is a band below a predetermined frequency of the speech signal or audio signal, and second encoded information relating to a second band, which is a band above the predetermined frequency;
a first decoding step of decoding the first encoded information to generate a time-domain signal of a first sampling rate corresponding to the first band of the speech signal or audio signal; and
a second decoding step of decoding the second encoded information in the frequency domain to generate a decoded spectrum of the second band and, using the decoded spectrum of the second band, generating a decoded signal of a third sampling rate obtained by sampling-rate conversion of a predetermined second sampling rate higher than the first sampling rate,
wherein the second decoding step includes:
a first transform step of obtaining a spectrum of the first band by frequency-domain transform from the time-domain signal of the first sampling rate obtained in the first decoding step;
a replicating step of replicating the spectrum at a specific position of the spectrum of the first band;
a spectrum generating step of generating, using the second encoded information and the replicated spectrum, a decoded spectrum of the second band that extends the bandwidth of the decoded spectrum of the first band, and of generating an extended decoded spectrum by adding the decoded spectrum of the second band to the decoded spectrum of the first band; and
a time-domain signal generating step of obtaining a spectrum of a predetermined band either by inserting zeros into a first high-band portion that adjoins the maximum frequency of the extended decoded spectrum and lies outside the extended decoded spectrum, or by deleting a second high-band portion that adjoins the maximum frequency and lies inside the extended decoded spectrum, and of generating, by time-domain transform from the spectrum of the predetermined band, a time-domain signal of the third sampling rate as the decoded signal.