JP3510493B2

JP3510493B2 - Audio signal encoding / decoding method and recording medium recording the program

Info

Publication number: JP3510493B2
Application number: JP25178398A
Authority: JP
Inventors: 高志三浦; 史彦板垣; 深雪川島
Original assignee: 株式会社ハドソン
Priority date: 1998-08-24
Filing date: 1998-08-24
Publication date: 2004-03-29
Anticipated expiration: 2018-08-24
Also published as: JP2000068856A

Description

Detailed Description of the Invention

【０００１】[0001]

【発明の属する技術分野】本発明は音声信号の符号／復
号方法及びそのプログラムを記録した記録媒体に関し、
更に詳しくは音声（音楽，背景音，コンピュータ合成音
等を含む）信号をコンピュータ処理可能な符号データに
データ圧縮符号化処理してこれを一旦記憶手段に蓄積
し、かつ必要な時に前記記憶手段から読み出した符号デ
ータをデータ伸長復号化処理して音声信号を再生する用
途に適用して好適なるものである。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to an audio signal encoding / decoding method and a recording medium having a program recorded thereon.
More specifically, a voice (including music, background sound, computer-synthesized sound, etc.) signal is subjected to data compression coding into computer-processable code data, which is temporarily stored in the storage means, and when necessary, from the storage means. It is suitable to be applied to the purpose of reproducing the audio signal by subjecting the read code data to the data decompression / decoding process.

【０００２】[0002]

【従来の技術】従来、音声信号をコンピュータ処理する
場合には１６ビットの音声データを対数変換法（Ａ−ｌ
ａｗ，μ−ｌａｗ等）により８ビットに変換したり、又
はＡＤＰＣＭ（Adaptive Differential PCM)により４ビ
ットに圧縮する等の方法が知られている。近年、野球ゲ
ーム等ではＴＶ放送と同様にゲームの進行に合わせてア
ナウンサーの実況放送が行われるものがあり、この様な
音声の収録・蓄積にはデータ圧縮率の高いＡＤＰＣＭ方
式が採用されている。2. Description of the Related Art Conventionally, when a voice signal is processed by a computer, 16-bit voice data is subjected to a logarithmic conversion method (A-1).
There are known methods such as conversion into 8 bits by aw, μ-law, etc., or compression into 4 bits by ADPCM (Adaptive Differential PCM). In recent years, in some baseball games and the like, live broadcasts of announcers are performed in accordance with the progress of the game as in the case of TV broadcasting, and the ADPCM method with a high data compression rate is adopted for recording and accumulating such voice. .

【０００３】図８は従来のＡＤＰＣＭ方式を説明する図
で、ＡＰＣＭ（Adaptive PCM) とＤＰＣＭ（Differenti
al PCM) の特徴を組み合わせた場合を示している。図８
（Ａ）は符号装置のブロック図で、この様な装置はハー
ドウェア構成又はＤＳＰを使用したソフトウェア構成に
より実現可能である。図において、入力の音声信号ｘ
（ｎ）は加算器１１で予測値ｘ”（ｎ）を差し引かれ、
差信号ｄ（ｎ）となる。この予測値ｘ”（ｎ）は適応予
測器１６により線形予測法を用いて次式により求められ
る。FIG. 8 is a diagram for explaining a conventional ADPCM system, which is APCM (Adaptive PCM) and DPCM (Differentientication).
al PCM) is shown in combination. Figure 8
(A) is a block diagram of a coding device, and such a device can be realized by a hardware configuration or a software configuration using a DSP. In the figure, the input audio signal x
(N) is subtracted from the predicted value x ″ (n) by the adder 11,
The difference signal d (n) is obtained. The predicted value x ″ (n) is calculated by the adaptive predictor 16 using the linear prediction method according to the following equation.

【０００４】[0004]

【数１】ここで、ａ_iは予測係数であり、音声信号ｘ’（ｎ）の
長時間自己相関関数Ｒ_i（ｉ＝０，１，２，…，ｐ）を
用いて予測誤差ｄ（ｎ）の自乗を最小とする様に求めら
れる。この時、加算器１５は量子化差信号ｄ’（ｎ）の
逆量子化値と予測値ｘ”（ｎ）との和により音声信号
ｘ’（ｎ）を再生しており、量子化誤差が累積しない構
成となっている。[Equation 1] Here, a _i is a prediction coefficient, and the long-term autocorrelation function R _i (i = 0, 1, 2, ..., P) of the speech signal x ′ (n) is used to square the prediction error d (n). Is required to be minimized. At this time, the adder 15 is reproducing the audio signal x ′ (n) by the sum of the dequantized value of the quantized difference signal d ′ (n) and the predicted value x ″ (n), and the quantization error is It does not accumulate.

【０００５】更に、差信号ｄ（ｎ）は量子化器１２でス
テップサイズ△（ｎ）により量子化され、量子化差信号
ｄ’（ｎ）となる。このステップサイズ△（ｎ）は、ス
テップサイズ適応制御器１４により１サンプル前のステ
ップサイズ△（ｎ−１）に係数ｐを乗じることで、次式
により求められる。Further, the difference signal d (n) is quantized by the quantizer 12 with a step size Δ (n), and becomes a quantized difference signal d '(n). This step size Δ (n) is obtained by the following equation by multiplying the step size Δ (n−1) one sample before by the coefficient p by the step size adaptive controller 14.

【０００６】[0006]

【数２】ここで、係数ｐは１サンプル前の符号ｃ（ｎ−１）によ
り決定される。[Equation 2] Here, the coefficient p is determined by the code c (n-1) one sample before.

【０００７】更に、量子化差信号ｄ’（ｎ）は符号器１
３で符号化され、符号データｃ（ｎ）となり、用途に応
じて通信相手にリアルタイムで送信され、又は一旦メモ
リに記憶される。Further, the quantized difference signal d '(n) is the encoder 1
3 is coded to be coded data c (n), which is transmitted in real time to a communication partner or temporarily stored in a memory according to the purpose.

【０００８】図８（Ｂ）は復号装置のブロック図で、こ
の様な装置はハードウェア構成又はＤＳＰを使用したソ
フトウェア構成により実現可能である。図において、復
号側では、上記符号側と同様のアルゴリズムに従い、入
力の符号データｃ（ｎ）と自己が生成管理するステップ
サイズ△（ｎ）とから差信号ｄ’（ｎ）を再生し、これ
に予測値ｘ”（ｎ）を加算して最終的に音声再生信号
ｘ’（ｎ）を得る。更に、図示しないが、この音声再生
信号ｘ’（ｎ）は波形再生フィルタで音声信号ｘ（ｎ）
にＤ／Ａ変換されて後、アンプで増幅され、スピーカに
加えられる。従って、高いデータ圧縮率が得られ、多く
のアナウンス情報を効率良くメモリに蓄積し、再生でき
る。FIG. 8B is a block diagram of a decoding device, and such a device can be realized by a hardware structure or a software structure using a DSP. In the figure, the decoding side reproduces the difference signal d ′ (n) from the input coded data c (n) and the step size Δ (n) generated and managed by itself according to the same algorithm as the above-mentioned coding side. Finally, the predicted value x ″ (n) is added to the audio reproduction signal x ′ (n) to obtain the audio reproduction signal x ′ (n). n)
After being D / A converted to, it is amplified by the amplifier and added to the speaker. Therefore, a high data compression rate can be obtained, and a large amount of announcement information can be efficiently stored in the memory and reproduced.

【０００９】[0009]

【発明が解決しようとする課題】しかし、近年この種の
アナウンス情報（一般には歓声等の背景音も含む）は益
々増加する傾向にあり、大容量のＣＤ−ＲＯＭを使用す
る場合は良いが、容量に制限のあるＲＯＭカートリッジ
等を使用する様な場合には符号データの更なる削減が必
要となる。この場合に、もし音声信号のサンプリング周
波数を下げれば、符号データも削減出来るが、音質が犠
牲となってしまう。However, in recent years, this type of announcement information (generally including background sounds such as cheers) tends to increase more and more, which is good when a large capacity CD-ROM is used. If a ROM cartridge or the like having a limited capacity is used, it is necessary to further reduce the code data. In this case, if the sampling frequency of the audio signal is lowered, the code data can be reduced, but the sound quality is sacrificed.

【００１０】従って、本発明の目的は、音質を損なわず
に符号データを更に削減できる音声信号の符号／復号方
法及びそのプログラムを記録した記録媒体を提供するこ
とにある。It is therefore an object of the present invention to provide a voice signal coding / decoding method capable of further reducing code data without degrading sound quality, and a recording medium recording the program thereof.

【００１１】[0011]

【課題を解決するための手段】上記の課題は例えば図１
の方法により解決される。即ち、本発明（１）の音声信
号の符号方法は、音声信号を所定のブロック長で分割
し、各ブロック信号をスペクトル分析してブロック信号
毎の実質的な周波数帯域を求め、その先頭ブロックより
連続して実質同一の周波数帯域と見做せる複数のブロッ
ク信号を順次連結し、得られたグループ信号毎に必要最
低限のサンプリング周波数で音声信号のサンプリングを
行い、得られた全サンプリングデータ系列につきデータ
圧縮符号化処理を行うものである。The above-mentioned problem is solved, for example, by referring to FIG.
It is solved by the method of. That is, according to the audio signal coding method of the present invention (1), the audio signal is divided into predetermined block lengths, each block signal is spectrum-analyzed to obtain a substantial frequency band for each block signal, Consecutively connect multiple block signals that can be regarded as having substantially the same frequency band in succession, sample audio signals at the minimum required sampling frequency for each group signal obtained, and Data compression encoding processing is performed.

【００１２】ところで、音声波形の長時間平均スペクト
ル分布については８００Ｈｚ以下の周波数成分が大部分
（約８割）であり、低域（２００Ｈｚ以下）を除いては
男女の差は少なく、かつ言語（日本語，英語等）による
差も少ないことが知られている。しかし、これを短時間
（数十〜数百ｍｓオーダ）のレンジでみると、音声波形
のスペクトル分布（占有帯域）も様々に変化している。By the way, in the long-term average spectral distribution of the speech waveform, the frequency components below 800 Hz are the majority (about 80%), and there is little difference between men and women except for the low range (below 200 Hz), and the language ( It is known that there is little difference between Japanese and English. However, when this is observed in a short-time range (several tens to several hundreds of ms), the spectrum distribution (occupied band) of the voice waveform changes variously.

【００１３】またゲーム機等では、アナウンサーが常時
喋り続けている訳ではなく、アナウンスと次のアナウン
スとの間に隙間が空く。またアナウンサーと野球解説者
との会話は、通常の会話と同様に、所定の間を以て一定
のリズムで行われる。更には、各アナウンスの間に観客
の歓声や、ブラスバンドの音楽等が挿入される場合も少
なくない。この時、ブラスバンドの音楽を所要の音質で
聴かせるには高い周波数成分までカバーしなくてはなら
ない。この様に、音声（サウンド）波形の短時間スペク
トル分布（所要帯域）は様々に変化しており、従って、
更なる帯域圧縮の余地がある。Further, in a game machine or the like, the announcer does not always speak, and there is a gap between the announcement and the next announcement. Also, the conversation between the announcer and the baseball commentator is performed at a constant rhythm over a predetermined period, as in a normal conversation. Furthermore, the cheers of the audience and brass band music are often inserted between the announcements. At this time, in order to hear the brass band music with a desired sound quality, it is necessary to cover high frequency components. In this way, the short-time spectrum distribution (required band) of the voice (sound) waveform changes variously, and therefore,
There is room for further band compression.

【００１４】そこで、本発明（１）においては、入力の
音声信号を所定のブロック長Ｌで分割し、各ブロック信
号をスペクトル分析してブロック信号毎の実質的な周波
数帯域ｋ1 ，ｋ1 ，ｋ1 ，ｋ1 ，ｋ2 ，ｋ2 ，ｋ2 ，ｋ
1 ，…，等を求める。ここでは、ｋ1 ＜ｋ2 である。更
に、その先頭ブロックより、連続して実質同一の周波数
帯域と見做せる複数のブロック信号を順次連結し、得ら
れたグループ信号｛（ｋ1 ，ｋ1 ，ｋ1 ，ｋ1 のグルー
プ），（ｋ2 ，ｋ2 ，ｋ2 のグループ），…，等｝毎に
必要最低限のサンプリング周波数２ｋ1 ，２ｋ2 ，…，
等で音声信号のサンプリングを行い、得られた全サンプ
リングデータ系列につきＡＤＰＣＭ等によるデータ圧縮
符号化処理を行う。Therefore, in the present invention (1), the input voice signal is divided by a predetermined block length L, the block signals are spectrally analyzed, and the substantial frequency bands k1, k1, k1, k1, k2, k2, k2, k
Find 1,…, etc. Here, k1 <k2. Further, from the first block, a plurality of block signals which can be regarded as consecutively substantially the same frequency band are sequentially concatenated to obtain a group signal {(k1, k1, k1, k1 group), (k2, k2 , K2 groups), ..., Etc.} and the minimum required sampling frequencies 2k1, 2k2 ,.
Etc., the audio signal is sampled, and the data compression encoding processing by ADPCM or the like is performed on the obtained entire sampling data series.

【００１５】従って、グループ信号毎に所要帯域を確保
でき、よって常に高い音質を維持できる。一方、所要帯
域が低い所ではサンプリングデータ数も少ないので、こ
うして、音質を損なわずに符号データを更に削減でき
る。即ち、トータルのデータ圧縮率を更に上げることが
できる。Therefore, a required band can be ensured for each group signal, so that high sound quality can always be maintained. On the other hand, since the number of sampling data is small in a place where the required bandwidth is low, the code data can be further reduced without impairing the sound quality. That is, the total data compression rate can be further increased.

【００１６】好ましくは、本発明（２）においては、上
記本発明（１）において、前記音声信号に代えて、予め
一定のサンプリング周波数Ｆ_S（例えば４０ＫＨｚ）で
サンプリングされた音声データを処理対象となし、かつ
順次連結されたグループデータ毎に必要最低限のサンプ
リング周波数ｆ_s（＜Ｆ_S）で音声データのリサンプリ
ングを行うものである。Preferably, in the present invention (2), in the present invention (1), instead of the audio signal, audio data preliminarily sampled at a constant sampling frequency F _S (eg 40 KHz) is to be processed. None, and re-sampling of audio data is performed at the minimum necessary sampling frequency f _s (<F _S ) for each group data that is sequentially connected.

【００１７】従って、予めディジタル化された音声（サ
ウンド）データに対しても、音質を損なわずに符号デー
タを更に削減できる。Therefore, even for voice (sound) data which has been digitized in advance, the code data can be further reduced without impairing the sound quality.

【００１８】また本発明（３）の音声信号の復号方法
は、上記本発明（１）の方法によりデータ圧縮符号化さ
れた音声符号データであって、グループ信号毎に必要最
低限の周波数でサンプリングされたサンプリング周波数
と、データサイズと、符号データとの各情報を含むも
の、を入力して、該各グループ信号毎の音声符号データ
を対応する各サンプリング周期で順次データ伸長復号化
処理するものである。The audio signal decoding method of the present invention (3) is data compression-encoded by the method of the above invention (1).
Audio coded data that is required for each group signal.
Sampling frequency sampled at a low frequency
And each information of data size and code data
Is input and the audio code data for each group signal is sequentially subjected to data decompression decoding processing at each corresponding sampling cycle.

【００１９】本発明（３）によれば、本発明（１）の方
法によりデータ圧縮符号化された音声符号データであっ
て、グループ信号毎に必要最低限の周波数でサンプリン
グされたサンプリング周波数と、データサイズと、符号
データとの各情報を含むもの、を入力して、該各グルー
プ信号毎の音声符号データを対応する各サンプリング周
期で順次データ伸長復号化処理する構成により、各所要
帯域の音声信号が適正に再生される。また、この様な音
声信号の復号方法は上記本発明（１）又は（２）の符号
方法と組み合わせることで、最適なる音声符号システム
を構成できる。According to the present invention (3), the present invention (1)
It is voice code data that is data compression coded by the
The minimum frequency required for each group signal.
Sampling frequency, data size, and sign
Enter the data and each information,
The audio signal in each required band is properly reproduced by the configuration in which the audio code data for each audio signal is sequentially decompressed and decoded at each corresponding sampling period. In addition, such an audio signal decoding method can be combined with the encoding method of the present invention (1) or (2) to configure an optimal audio encoding system.

【００２０】また本発明（４）のコンピュータ読み取り
可能な記録媒体は、上記本発明（１）〜（３）の何れか
１つに記載の音声信号の符号／復号方法をコンピュータ
に実行させるためのプログラムを記録したものである。A computer-readable recording medium according to the present invention (4) is for causing a computer to execute the method of encoding / decoding an audio signal according to any one of the above-mentioned inventions (1) to (3). It is a record of the program.

【００２１】[0021]

【発明の実施の形態】以下、添付図面に従って本発明に
好適なる実施の形態を詳細に説明する。なお、全図を通
して同一符号は同一又は相当部分を示すものとする。DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS Preferred embodiments of the present invention will be described in detail below with reference to the accompanying drawings. The same reference numerals denote the same or corresponding parts throughout the drawings.

【００２２】図２は実施の形態によるサウンドデータ符
号装置の構成を示す図で、図において、１は本装置の主
制御・処理を行うＣＰＵ，２はＣＰＵ１が実行する例え
ば図３，図４のサウンドデータ符号処理プログラムや必
要なデータを記憶するためのＲＡＭ，ＲＯＭ，ＥＥＰＲ
ＯＭ等から成る主メモリ、３はＣＲＴ又は液晶等から成
る表示部（ＤＳＰ）、４は処理情報を入力するためのキ
ーボード（ＫＢＤ）、５は２次記憶装置としてのハード
ディスク装置（ＨＤＤ）、５ａは処理前のサウンドデー
タを記憶しているサウンドデータファイル、５ｂは処理
後の符号データを記憶する符号データファイルである。FIG. 2 is a diagram showing the configuration of the sound data encoding device according to the embodiment. In FIG. 2, 1 is a CPU that performs main control / processing of the device, and 2 is a CPU that the CPU 1 executes, for example, as shown in FIGS. RAM, ROM, EEPR for storing sound data code processing program and necessary data
A main memory including an OM or the like, a display unit (DSP) including a CRT or a liquid crystal, a keyboard (KBD) for inputting processing information, a hard disk device (HDD) 5a as a secondary storage device, and 5a. Is a sound data file that stores unprocessed sound data, and 5b is a code data file that stores processed code data.

【００２３】なお、図示しないが、予め外部で作成され
たサウンドデータを読み込むための読込装置及び内部で
作成した符号データを外部のＲＯＭカートリッジ４２や
ＣＤ−ＲＯＭ４３に書き込むための書込装置を接続可能
である。また、本明細書を通して音声，音楽，背景音等
の混在したサウンド信号のことを単に音声信号と呼ぶこ
とがある。Although not shown, a reading device for reading sound data created externally in advance and a writing device for writing code data created internally to an external ROM cartridge 42 or CD-ROM 43 can be connected. Is. In addition, a sound signal in which voice, music, background sound, and the like are mixed may be simply referred to as an audio signal throughout this specification.

【００２４】図において、予めマイク７（又はその他の
録音機器，サウンド合成機器等）からのサウンド信号は
所定周波数Ｆ_s（例えば４０ＫＨｚ固定）のクロック信
号ＳＣＫによりサンプリング（Ａ／Ｄ変換）され、サウ
ンドデータファイル５ａに記憶されている。In the figure, a sound signal from the microphone 7 (or other recording device, sound synthesizing device, etc.) is sampled (A / D converted) by a clock signal SCK having a predetermined frequency F _s (for example, fixed at 40 KHz), and sound is generated. It is stored in the data file 5a.

【００２５】符号装置は、このサウンドデータを所定の
ブロック長で分割し、各ブロックデータをスペクトル分
析してブロックデータ毎の実質的な周波数帯域を求め、
その先頭ブロックより連続して実質同一の周波数帯域と
見做せる複数のブロックデータを順次連結し、得られた
１のブロックデータを含む各グループデータ毎に必要最
低限のサンプリング周波数でサウンドデータのリサンプ
リングを行い、得られた全サンプリングデータ系列につ
き例えばＡＤＰＣＭによるデータ圧縮符号化処理を行
い、これを符号データファイル５ｂに格納する。このサ
ウンド符号データは、ゲーム用のＲＯＭカートリッジ４
２又はＣＤ−ＲＯＭ４３に３ＤＣＧアニメーションデー
タと共に書き込まれ、サウンドデータの情報源として使
用される。The coding device divides the sound data into blocks of a predetermined length, and spectrally analyzes each block data to obtain a substantial frequency band for each block data.
It was obtained by sequentially connecting a plurality of block data that can be regarded as the substantially same frequency band continuously from the first block.
The sound data is resampled at the minimum required sampling frequency for each group data including one block data, and the obtained all sampling data series is subjected to data compression encoding processing by, for example, ADPCM, and this is a code data file. Store in 5b. This sound code data is stored in the game ROM cartridge 4
2 or written in the CD-ROM 43 together with 3DCG animation data and used as an information source of sound data.

【００２６】図３，図４は実施の形態によるサウンドデ
ータ符号処理のフローチャート（１），（２）、また図
５，図６は実施の形態によるサウンドデータ符号処理の
イメージ図（１），（２）であり、以下これらの図を参
照して実施の形態によるサウンドデータ符号処理を詳細
に説明する。3 and 4 are flowcharts (1) and (2) of the sound data coding process according to the embodiment, and FIGS. 5 and 6 are image diagrams (1) and (2) of the sound data coding process according to the embodiment. ), And the sound data encoding process according to the embodiment will be described in detail below with reference to these drawings.

【００２７】図３において、ステップＳ１ではスペクト
ルの実質的な存在有／無を決定するための閾値ｔ_hを設
定する。ここで、「実質的な」とは、復号サウンドに所
要の音質を得るのに必要な、と言うことを意味する。ス
テップＳ２ではスペクトル分析を行うサウンドデータの
ブロック長Ｌ（例えばＬ＝１００〜１０００ワードの範
囲で選択）を設定する。ステップＳ３ではあるサウンド
データファイルを開き、サウンドデータのデータ長ｌ_en
を取得する。[0027] In FIG. 3, to set the threshold t _h for determining the substantial presence / absence of the spectrum in step S1. Here, “substantially” means that the decoded sound is necessary to obtain a desired sound quality. In step S2, a block length L of sound data to be subjected to spectrum analysis (for example, L = 100 to 1000 words is selected in the range) is set. In step S3, a sound data file is opened and the data length of the sound data is l _en
To get.

【００２８】図５（Ａ）にサウンドデータファイルの記
憶構造を示す。サウンドデータファイルには、野球ゲー
ム等における各シーンに対応して、アナウンサーの実況
放送の音声がその背景音（歓声，音楽等）と共に所定周
波数Ｆ_s（例えば４０ＫＨｚ固定）で予めサンプリング
された一連のサウンドデータが記録されている。そのデ
ータ長ｌ_enはゲームのシーン（アナウンス内容）毎に異
なり、最後のサウンドデータの後ろにＥＯＦコードが付
加されている。なお、このデータ長ｌ_enの情報は処理の
容易のために予め求められ、サウンドデータファイルの
一部に書き込まれている。FIG. 5A shows the storage structure of the sound data file. The sound data file includes a series of pre-sampled voices of the announcer's live broadcast corresponding to each scene in a baseball game and the like, together with the background sound (cheers, music, etc.) at a predetermined frequency F _s (for example, 40 KHz fixed). Sound data is recorded. The data length l _en differs for each scene (announcement content) of the game, and an EOF code is added after the last sound data. The information of the data length l _en is obtained in advance for easy processing and is written in a part of the sound data file.

【００２９】図３に戻り、ステップＳ４ではサウンドデ
ータをブロック長Ｌで分割すると共に、ｍ＝［ｌen／
Ｌ］＋２となし、かつｍ個の配列Ｄ［ｍ］をデータ
「０」で初期化する。ここで、演算［ｌen／Ｌ］は小数
点以下の切り捨てを意味する。また＋２は端数ブロック
等をカバーするためにｍの値に余裕を持たせたものであ
る。Returning to FIG. 3, in step S4, the sound data is divided by the block length L, and m = [len /
L] +2, and m arrays D [m] are initialized with data “0”. Here, the operation [len / L] means rounding down after the decimal point. Further, +2 is a value in which m has a margin in order to cover a fractional block or the like.

【００３０】図５（Ｂ）にブロック分割の態様を示す。
データ長ｌ_enの一連のサウンドデータをブロック長Ｌで
分割して考え、夫々をブロックデータとする。その際に
は、最後の端数ブロックも１つのブロックとして取り扱
われる。FIG. 5B shows a mode of block division.
A series of sound data having a data length of l _en is divided by the block length L to be considered, and each is used as block data. In that case, the last fractional block is also treated as one block.

【００３１】なお、後述するスペクトル分析では、各ブ
ロックデータをそのまま切り出す（即ち、矩形窓を乗じ
る）と歪んだスペクトルが観測されることになる。そこ
で、実際上は、この影響を低減するためにハミング窓
（Hamming window) 等を乗じて切り出す。またハミング
窓を使用すると両端で波形が減衰するため、処理区間
（各ブロック）を一部重複させながら移動させる方法を
採る。但し、ここでは説明の簡単のため、ブロックデー
タをそのまま切り出す方法で説明を行う。In the spectrum analysis described later, when each block data is cut out as it is (that is, multiplied by a rectangular window), a distorted spectrum is observed. Therefore, in practice, a Hamming window or the like is used to cut out in order to reduce this effect. Moreover, since the waveform is attenuated at both ends when the Hamming window is used, a method of moving the processing sections (each block) while partially overlapping them is adopted. However, for simplification of description, a method of cutting out the block data as it is will be described here.

【００３２】図３に戻り、ステップＳ５ではブロック番
号を計数するためのカウンタｉ＝０に初期化する。ステ
ップＳ６では１ブロック分のサウンドデータを読み込
む。ステップＳ７ではＥＯＦコードか否かを判別する。
ＥＯＦコードでない場合は、ステップＳ８で当該ブロッ
クのサウンドデータが１ブロック長Ｌより短いか否かを
判別する。短い場合はステップＳ９で短い分をデータ
「０」で満たす。また短く無い場合は上記ステップＳ９
の処理をスキップする。ステップＳ１０では１ブロック
分のサウンドデータをスペクトル分析する。Returning to FIG. 3, in step S5, a counter i for counting the block number is initialized to 0. In step S6, one block of sound data is read. In step S7, it is determined whether or not it is an EOF code.
If it is not the EOF code, it is determined in step S8 whether or not the sound data of the block is shorter than one block length L. If it is short, the short portion is filled with data "0" in step S9. If it is not short, the above step S9
Skip the process of. In step S10, one block of sound data is spectrally analyzed.

【００３３】図５（Ｃ）に１ブロックデータのサンプリ
ング値ｓ（０）〜ｓ（Ｌ−１）Ｔを示す。１例のサンプ
リング周期Ｔ＝２５μｓ（Ｆ_s＝４０ＫＨｚ）相当であ
るが、本符号化処理は実時間で発生するサンプリングデ
ータを取り扱う訳ではないので、各サンプリング値の取
得とそのスペクトル分析は高速に得られる。ここで、離
散値から成るサウンド信号ｓ（ｎＴ）は次式により表さ
れる。FIG. 5C shows sampling values s (0) to s (L-1) T of one block data. The sampling cycle T of one example is equivalent to T = 25 μs (F _s = 40 KHz), but since this encoding processing does not handle sampling data generated in real time, acquisition of each sampling value and its spectrum analysis are performed at high speed. can get. Here, the sound signal s (nT) consisting of discrete values is expressed by the following equation.

【００３４】[0034]

【数３】離散信号ｓ（ｎＴ）の離散フーリェ変換Ｆ（ｓ）は次式
により表される。[Equation 3] The discrete Fourier transform F (s) of the discrete signal s (nT) is expressed by the following equation.

【００３５】[0035]

【数４】これを基本周波数ｆ₀＝１／ＬＴをパラメータとして表
すと次式となる。[Equation 4] If this is expressed by using the fundamental frequency f ₀ = 1 / LT as a parameter, the following equation is obtained.

【００３６】[0036]

【数５】更に、このスペクトル分析の演算を行列の形で表すと次
式となる。[Equation 5] Further, the calculation of this spectrum analysis is expressed in the form of a matrix as follows.

【００３７】[0037]

【数６】なお、実際の演算はＦＦＴ（Fast Fourier Transform)
により高速に行う。[Equation 6] The actual calculation is FFT (Fast Fourier Transform)
To do faster.

【００３８】図５（Ｄ）にブロックサウンド信号ｓ（ｎ
Ｔ）のスペクトル値Ｆ（０）〜Ｆ（（Ｌ−１）ｆ₀）を
示す。なお、この図はブロック長Ｌ＝１００とした場合
を示しており、基本周波数ｆ₀＝４００Ｈｚとなってい
る。また、この例では高い周波数から低い周波数に向か
った場合に、Ｆ（ｋｆ₀）のスペクトル値が閾値ｔ_hを
上回っている。FIG. 5D shows the block sound signal s (n
The spectral values F (0) to F ((L-1) f ₀ ) of T) are shown. This figure shows the case where the block length L = 100, and the fundamental frequency is f ₀ = 400 Hz. Further, in the case towards the lower frequencies from a high frequency in this example, the spectral values of F (kf ₀₎ exceeds the threshold value t _h.

【００３９】図３に戻り、ステップＳ１１では高い周波
数から低い周波数に向かい、Ｆ（ｋｆ₀）＞ｔ_hとなる
時のスペクトル位置ｋを検出する。このスペクトル位置
ｋは当該ブロック信号ｓ（ｎＴ）に含まれる実質的な最
高周波数成分の周波数（以下、これを帯域とも呼ぶ）に
対応している。ステップＳ１２では配列Ｄ［ｉ］（最初
はＤ［０］）に上記検出したスペクトル位置ｋを保存す
る。ステップＳ１３ではブロックカウンタｉに＋１し、
上記ステップＳ６の処理に戻る。[0039] Returning to FIG. 3, towards the lower frequency from In step S11 high frequencies, and detect the spectrum position k when the _{_{F (kf 0)> t h}} . The spectral position k corresponds to the frequency of the substantially highest frequency component included in the block signal s (nT) (hereinafter, also referred to as a band). In step S12, the detected spectral position k is stored in the array D [i] (initially D [0]). In step S13, the block counter i is incremented by 1,
The process returns to step S6.

【００４０】ステップＳ６では次の１ブロックデータを
読み込み、上記同様の処理を行う。こうして、やがて、
ステップＳ７の判別でＥＯＦコードが検出されると、全
ブロック分のスペクトル分析が終了したことになる。こ
れにより、処理はステップＳ１４に進み、その時のカウ
ンタｉの内容（スペクトル分析したブロック数）をレジ
スタＩに退避する。そして、図４の処理に進む。In step S6, the next one block data is read and the same processing as above is performed. In this way,
When the EOF code is detected in the determination in step S7, it means that the spectrum analysis for all blocks is completed. As a result, the process proceeds to step S14, and the contents of the counter i (the number of blocks analyzed by spectrum) at that time are saved in the register I. Then, the process proceeds to FIG.

【００４１】図４において、ステップＳ２１ではブロッ
ク番号を計数するためのカウンタｉ＝０に初期化する。
ステップＳ２２では前側のブロックと実質的に同一帯域
と見做せるブロックの数を計数するためのカウンタｊ＝
１に初期化する。ステップＳ２３ではレジスタＫに配列
Ｄ［ｉ］（最初はＤ［０］）の内容（例えばｋ₁）をセ
ットする。ステップＳ２４ではブロックカウンタｉに＋
１して後、レジスタＭに配列Ｄ［ｉ］（最初はＤ
［１］）の内容（例えばｋ₁）をセットする。ステップ
Ｓ２５ではＫ≒Ｍか否かを判別する。In FIG. 4, in step S21, a counter i for counting the block number is initialized to 0.
In step S22, a counter j = for counting the number of blocks that can be regarded as having substantially the same band as the block on the front side.
Initialize to 1. In step S23, the contents (for example, k ₁ ) of the array D [i] (initially D [0]) is set in the register K. In step S24, + is added to the block counter i
After that, an array D [i] (first D
The contents of [1]) (for example, k ₁ ) are set. In step S25, it is determined whether K≈M.

【００４２】なお、このＫ≒Ｍか否かの判別方法には幾
通りか考えられる。１つは厳密にＫ＝Ｍの場合にＫ≒Ｍ
と判定し、また他の１つはＫ−α≦Ｍ≦Ｋ＋α（但し、
αは１，２，…，等の整数）の場合にＫ≒Ｍと判定する
場合である。因みに、前者の場合は厳密に同一を要求す
るために実質同一帯域と見做される連続するブロック数
が少なくなり、また後者の場合はある程度までの相違を
同一と見做すために実質同一帯域と見做される連続する
ブロック数が多くなる。There are several possible methods for determining whether or not K≈M. One is K ≈ M when strictly K = M
And the other one is K−α ≦ M ≦ K + α (however,
α is an integer such as 1, 2, ... By the way, in the former case, the number of consecutive blocks that are regarded as substantially the same band is reduced because they require exactly the same, and in the latter case, the substantially same band is regarded as the difference to some extent is regarded as the same. The number of consecutive blocks that are considered to be large increases.

【００４３】いずれにしても、上記ステップＳ２５の判
別でＫ≒Ｍの場合はステップＳ２６で同一ブロック数カ
ウンタｊに＋１し、ステップＳ２４に戻る。ステップＳ
２４ではカウンタｉに＋１して後、レジスタＭに配列Ｄ
［ｉ］（今度はＤ［２］）の内容（例えばｋ1 ）を読み
出す。こうして、実質的に同一帯域と見做せる複数のブ
ロック列を検出し、やがて、ステップＳ２５の判別でＫ
≒Ｍでない場合は、ステップＳ２７の処理に進む。In any case, if K≈M in the determination in step S25, the same block number counter j is incremented by 1 in step S26, and the process returns to step S24. Step S
At 24, the counter i is incremented by 1, and then the array D is placed in the register M.
The content (eg, k1) of [i] (this time D [2]) is read. In this way, a plurality of block strings that can be regarded as being substantially in the same band are detected, and eventually K is determined in the determination in step S25.
If not ≈M, the process proceeds to step S27.

【００４４】図６（Ａ）に同一帯域と見做されたリサン
プルブロック列（グループデータ）の抽出例を示す。こ
こでは、例えば最初の４ブロック分の帯域は夫々ｋ₁、
次の３ブロック分の帯域は夫々ｋ₂（＞ｋ₁）、次の４
ブロック分の帯域は夫々ｋ₃（＞ｋ₂）と判別されてい
る。FIG. 6A shows an example of extracting a resample block sequence (group data) considered to be in the same band. Here, for example, the bands for the first four blocks are k ₁ and
The bands for the next three blocks are k ₂ (> k ₁ ) and the next 4 bands, respectively.
The band for each block is determined to be k ₃ (> k ₂ ).

【００４５】図４に戻り、ステップＳ２７ではｆ_s＝ｇ
（Ｋ）の関係により、新しいサンプリング周波数ｆ_sを
決定する。ここで、サンプリング周波数ｆ_sとレジスタ
Ｋの内容（最初はｋ₁）とは予め関数（又は不図示のテ
ーブル）ｇにより関係付けられており、例えばｆ_s＝２
ｋ₁ｆ₀の関係にある。ステップＳ２８ではｆ_c＝ｈ
（Ｋ）の関係により、後述するローパスフィルタ処理の
カットオフ周波数ｆ_cを決定する。ここで、カットオフ
周波数ｆ_cとレジスタＫの内容（最初はｋ₁）とは予め
関数（又は不図示のテーブル）ｈにより関係付けられて
おり、例えばｆ_c＝ｋ₁ｆ₀の関係にある。Returning to FIG. 4, in step S27, f _s = g
A new sampling frequency f _s is determined by the relationship of (K). Here, the sampling frequency f _s and the contents of the register K (initially k ₁ ) are related in advance by a function (or a table (not shown)) g, for example, f _s = 2
There is a relationship of k ₁ f ₀ . In step S28, f _c = h
The cutoff frequency f _c of the low-pass filter processing described later is determined based on the relationship (K). Here, the cutoff frequency f _c and the contents of the register K (initially k ₁ ) are related by a function (or a table (not shown)) h in advance, for example, f _c = k ₁ f _0. .

【００４６】ステップＳ２９ではｊ（最初は４）ブロッ
ク分のサウンドデータを読み込む。この時、必要ならｊ
ブロック分のサウンドデータに対応するハミング窓を乗
じる。ステップＳ３０では、ＵとＤを互いに素な整数と
して、ｆ_c≒Ｆ_s・Ｕ／Ｄと近似できる様な最小のＵと
Ｄとを決定する。例えば、今、カットオフ周波数ｆ_c＝
２６．７ＫＨｚであったとすると、Ｕ＝２，Ｄ＝３とす
ることで、ｆ_c≒Ｆ_s・Ｕ／Ｄ＝４０ＫＨｚ×２／３＝
２６．７ＫＨｚを満足する。そして、ステップＳ３１で
は公知のマルチレート・フィルタ処理によりｊブロック
分のサウンドデータ（グループデータ）のリサンプリン
グ処理を行う。In step S29, sound data for j (initially 4) blocks is read. At this time, if necessary, j
Multiply the humming window corresponding to the sound data for blocks. In step S30, U and D are set to be relatively prime integers, and the minimum U and D that can be approximated to f _c ≈F _s · U / D are determined. For example, now the cutoff frequency f _c =
If it is 26.7 KHz, by setting U = 2 and D = 3, f _c ≈F _s · U / D = 40 KHz × 2/3 =
It satisfies 26.7 KHz. Then, in step S31, resampling processing of sound data (group data) for j blocks is performed by a known multi-rate filter processing.

【００４７】図６（Ｂ）〜図６（Ｄ）にマルチレート・
フィルタ処理のイメージを示す。図６（Ｂ）において、
まずアップサンプラ（Ｕ）により、４ブロック分の各サ
ウンドデータｓ（０）〜ｓ（Ｌ−１）Ｔの各中央にデー
タ「０」を挿入する。これにより、信号帯域は２倍に拡
大される。図６（Ｃ）において、次にカットオフ周波数
ｆ_cのローパスフィルタ（ｆ_c）により、上記アップサ
ンプリング後のサウンドデータをフィルタ（帯域制限）
処理する。これにより、元の４ブロック分の各サウンド
データｓ（０）〜ｓ（Ｌ−１）Ｔの間を滑らかに繋ぐよ
うな新たなデータが補間される。図６（Ｄ）において、
次にダウンサンプラ（Ｄ）により、上記フィルタ出力の
サウンドデータをダウンサンプル（この例では２つ置き
に抽出）する。こうして得られたリサンプリング後のサ
ウンドデータは、元の４ブロック長分のサウンド信号を
最初からｆ_s＝２６．７ＫＨｚ（３７．５μｓ間隔）で
サンプリングしたサウンドデータと略等価になってい
る。In FIG. 6 (B) to FIG. 6 (D), the multi-rate
The image of filter processing is shown. In FIG. 6 (B),
First, the upsampler (U) inserts data "0" at the center of each sound data s (0) to s (L-1) T for four blocks. As a result, the signal band is doubled. In FIG. 6 (C), the the next cut-off frequency f _c by the low-pass filter (f _c), the filter sound data after the up-sampling (band-limited)
To process. As a result, new data is smoothly interpolated between the original four blocks of sound data s (0) to s (L-1) T. In FIG. 6 (D),
Next, the downsampler (D) downsamples (every two in this example) the sound data of the filter output. The sound data after resampling obtained in this manner is substantially equivalent to the sound data obtained by sampling the original sound signal of four block lengths from the beginning at f _s = 26.7 KHz (37.5 μs interval).

【００４８】図４に戻り、ステップＳ３２では上記リサ
ンプルされたサウンドデータを例えばＡＤＰＣＭ方式に
よりデータ圧縮符号化処理する。このＡＤＰＣＭによる
符号処理は上記図８（Ａ）で述べたものと同様で良い。
ステップＳ３３ではｊブロック分のデータサイズ（ｊ×
Ｌに相当）の情報と新しいサンプリング周波数ｆs の情
報とを出力する。ステップＳ３４では引き続き上記ＡＤ
ＰＣＭ符号処理した符号データｃ（ｎ）を出力する。こ
れらの情報（ｊ×Ｌに相当），ｆs 及びその符号データ
ｃ（ｎ）は互いに関係を付けられて符号データファイル
５ｂに書き込まれ、後述する復号処理で利用される。ス
テップＳ３５ではｉ＜Ｉ（＝最大ブロック数）か否かを
判別する。ｉ＜Ｉの場合はステップＳ２２に戻り、現時
点のカウンタｉが指すブロックを次のグループデータの
先頭ブロックと考えて上記同様の処理を繰り返す。こう
して、やがてｉ＜Ｉでなくなると、１サウンドデータ分
の符号処理を終了する。Returning to FIG. 4, in step S32, the resampled sound data is subjected to data compression encoding processing by, for example, the ADPCM system. The code processing by the ADPCM may be the same as that described with reference to FIG.
In step S33, the data size of j blocks (j ×
( Corresponding to L) and new sampling frequency fs information is output. In step S34, the above AD is continued.
The code data c (n) subjected to the PCM code processing is output. These pieces of information ( corresponding to j × L), fs, and their coded data c (n) are associated with each other and written in the coded data file 5b, and are used in the decoding process described later. In step S35, it is determined whether i <I (= maximum number of blocks). If i <I, the process returns to step S22, and the block indicated by the current counter i is regarded as the first block of the next group data, and the same process as above is repeated. Thus, when i <I no longer holds, the coding process for one sound data is ended.

【００４９】図７は実施の形態によるゲーム装置の構成
を示す図で、ここには実施の形態によるサウンドデータ
復号処理の構成が具備されている。FIG. 7 is a diagram showing the configuration of the game device according to the embodiment. Here, the configuration of the sound data decoding process according to the embodiment is provided.

【００５０】図において、３０はゲーム装置本体、３１
はゲーム装置の主制御（ゲームプログラム，３ＤＣＧア
ニメーションデータ及びサウンド符号データの読込制
御，ユーザによるゲーム操作データの読込制御等）並び
に３ＤＣＧアニメーションに係るゲーム処理（ゲームキ
ャラクタの移動処理等）を行うＣＰＵ（ゲームプロセッ
サ）、３２はＣＰＵ３１が使用するＲＡＭ，ＲＯＭ，Ｅ
ＥＰＲＯＭ等からなる主メモリ（ＭＭ）、３３は入力の
３ＤＣＧアニメーションデータを２Ｄ画面に透視変換す
るための３Ｄアクセラレータ、３４はＲＯＭカートリッ
ジ４２又はＣＤ−ＲＯＭ４３から逐次読み出したサウン
ド符号データを一時的に保持するバッファ（ＢＵＦ）、
３５は後述する操作パッド４１をゲーム装置に収容する
周辺インタフェース（ＰＩＦ）、３６はＲＯＭカートリ
ッジ４２をゲーム装置に着脱自在に収容可能なＲＯＭカ
ートリッジインタフェース（ＲＯＭ−ＣＩＦ）、３７は
ＣＤ−ＲＯＭ４３をゲーム装置に着脱自在に収容し、か
つ駆動可能なＣＤ−ＲＯＭドライバ、３８は、ゲームプ
ログラムのオンラインダウンロードや他のゲーム装置と
の間で対戦ゲームを行うために、本ゲーム装置を不図示
の公衆網に接続するための通信制御部（ＣＯＭ）、３９
はサウンド符号データをデータ伸長復号処理するＡＤＰ
ＣＭ復号部である。In the figure, 30 is the main body of the game device, and 31 is
Is a CPU that performs main control of the game device (game program, reading control of 3DCG animation data and sound code data, reading control of game operation data by user, etc.) and game processing related to 3DCG animation (moving processing of game character, etc.) ( Game processor), 32 is RAM, ROM, E used by the CPU 31
A main memory (MM) including an EPROM, 33 is a 3D accelerator for perspectively converting input 3DCG animation data into a 2D screen, and 34 is a temporary storage of sound code data sequentially read from the ROM cartridge 42 or the CD-ROM 43. Buffer (BUF),
Reference numeral 35 denotes a peripheral interface (PIF) for accommodating an operation pad 41 described later in the game device, 36 denotes a ROM cartridge interface (ROM-CIF) capable of accommodating a ROM cartridge 42 detachably in the game device, and 37 denotes a CD-ROM 43 for the game. A CD-ROM driver 38 that can be detachably accommodated in the device and can be driven is provided by a public network (not shown) for this game device in order to download a game program online or to play a battle game with another game device. Communication control unit (COM) for connecting to
Is an ADP for decompressing and decoding sound code data
It is a CM decoding unit.

【００５１】更に、４１はユーザがゲーム操作やパラメ
ータ設定等を行うための各種制御キー（スタートキー，
選択キー，十字キー等）や、マウス，ジョイスティック
等の位置入力手段を備える操作パッド、４２はゲームプ
ログラム（処理手順，ゲームパラメータ，３ＤＣＧアニ
メーションデータ，サウンド符号データ等）をマスクＲ
ＯＭ等に記録しているＲＯＭカートリッジ（ＲＯＭ−
Ｃ）、４３は前記と同様のゲームプログラムを記録して
いるＣＤ−ＲＯＭ、４４は公衆網に接続する通信線、４
５はゲームを表示するためのＣＲＴや液晶等から成る表
示部（ＤＳＰ）、そして、４６は音声を含むサウンド信
号を出力するためのスピーカである。Further, reference numeral 41 denotes various control keys (start key,
An operation pad provided with position input means such as a mouse and a joystick, and a mask R for a game program (processing procedure, game parameters, 3DCG animation data, sound code data, etc.).
ROM cartridge (ROM-
C), 43 are CD-ROMs recording the same game programs as above, 44 is a communication line connected to the public network, 4
Reference numeral 5 is a display unit (DSP) made up of a CRT or liquid crystal for displaying a game, and 46 is a speaker for outputting a sound signal including voice.

【００５２】ＡＤＰＣＭ復号部３９において、２５は波
形再生フィルタ（例えばカットオフ周波数２０ＫＨｚ）
等を備えるベースバンド処理部（ＢＢＣ）、２６はサウ
ンドデータのリサンプリング周波数ｆ_sに対応する処理
タイミング信号ＳＣＫを生成するタイミング制御部であ
る。その他の構成は上記図８（Ｂ）の復号装置と同様で
良い。なお、この様なＡＤＰＣＭ復号部３９はハードウ
ェア構成又はＤＳＰを使用したソフトウェア構成により
実現可能である。In the ADPCM decoding unit 39, 25 is a waveform reproduction filter (for example, a cutoff frequency of 20 KHz).
A baseband processing unit (BBC) 26 including the above is a timing control unit that generates a processing timing signal SCK corresponding to the resampling frequency f _s of the sound data. Other configurations may be similar to those of the decoding device shown in FIG. Note that such an ADPCM decoding unit 39 can be realized by a hardware configuration or a software configuration using a DSP.

【００５３】図において、ＣＰＵ３１は、ゲームの進行
に伴い、ＲＯＭカートリッジ４２又はＣＤ−ＲＯＭ４３
から読み出した、対応するシーンの３ＤＣＧアニメーシ
ョンデータを３Ｄアクセラレータ３３に供給すると共
に、該シーンに対応するサウンド符号データｃ（ｎ）を
適宜バッファ３４に転送する。このサウンド符号データ
ｃ（ｎ）は、上記図２のサウンドデータ符号装置により
グループデータ毎にリサンプル処理され、その後にＡＤ
ＰＣＭ方式によりデータ圧縮符号化された一連のサウン
ド符号データである。タイミング制御部２６は、サウン
ド符号データｃ（ｎ）に付加された新しいサンプリング
周波数ｆ_sの情報に従い対応する周波数のタイミング信
号ＳＣＫを生成し、かつ該サウンド符号データｃ（ｎ）
に付加されたデータサイズ（ｊ×Ｌに相当）の情報に従
いバッファ３４から当該データサイズ分のサウンド符号
データｃ（ｎ）を逐次読み出す。In the figure, the CPU 31 has a ROM cartridge 42 or a CD-ROM 43 as the game progresses.
The 3DCG animation data of the corresponding scene read out from the 3D accelerator 33 is supplied to the 3D accelerator 33, and the sound code data c (n) corresponding to the scene is appropriately transferred to the buffer 34. This sound code data c (n) is resampled for each group data by the sound data coding device shown in FIG.
It is a series of sound code data that is data compression coded by the PCM system. The timing control unit 26 generates a timing signal SCK having a corresponding frequency according to the information of the new sampling frequency f _s added to the sound code data c (n), and the sound code data c (n).
According to the information of the data size (corresponding to j × L) added to, the sound code data c (n) of the data size is sequentially read from the buffer 34.

【００５４】こうして、最初のｊ（＝４）ブロック分の
サウンド符号データｃ（ｎ）は周波数２ｋ₁ｆ₀に相当
するタイミング信号ＳＣＫに従ってデータ伸長復号処理
される。このタイミング信号ＳＣＫはベースバンド処理
部（Ｄ／Ａ変換部）２５にも提供され、所要帯域ｋ₁ｆ
₀のサウンド信号が再生される。以下同様にして、次の
ｊ（＝３）ブロック分のサウンド符号データｃ（ｎ）は
周波数２ｋ₂ｆ₀（＞２ｋ₁ｆ₀）のタイミング信号Ｓ
ＣＫに従ってデータ伸長復号処理され、次のｊ（＝４）
ブロック分のサウンド符号データｃ（ｎ）は周波数２ｋ
₃ｆ₀（＞２ｋ₂ｆ₀）のタイミング信号ＳＣＫに従っ
てデータ伸長復号処理される。こうして、常に高い音質
の維持されたサウンド信号がスピーカ４６に加えられ
る。In this way, the sound code data c (n) for the first j (= 4) blocks is subjected to data decompression decoding processing according to the timing signal SCK corresponding to the frequency 2k ₁ f ₀ . The timing signal SCK is also provided to the baseband processing unit (D / A conversion unit) 25, and the required band k ₁ f
A sound signal of ₀ is played. In the same manner, the sound code data c (n) for the next j (= 3) blocks is the timing signal S of the frequency 2k ₂ f ₀ (> 2k ₁ f ₀ ).
Data expansion decoding processing is performed according to CK, and the next j (= 4)
Sound code data c (n) for a block has a frequency of 2k
Data expansion decoding processing is performed according to the timing signal SCK of ₃ f ₀ (> 2k ₂ f ₀ ). In this way, a high-quality sound signal that is always maintained is applied to the speaker 46.

【００５５】なお、上記実施の形態ではＡＤＰＣＭによ
る符号／復号方式への適用例を述べたが、本発明による
適応的サンプリングレート変換方法に係る部分は、他の
様々な音声符号／復号方式（波形符号方式，生成源符号
方式等）と組み合わせることが可能であることは明らか
である。In the above embodiment, an example of application to the coding / decoding system by ADPCM is described, but the part related to the adaptive sampling rate conversion method according to the present invention is various other speech coding / decoding systems (waveforms). It is clear that it can be combined with a coding system, a source coding system, etc.).

【００５６】また、上記実施の形態では予め所定周波数
Ｆ_s（例えば４０ＫＨｚ）でサンプリングされた音声デ
ータを入力として、これを所要帯域が実質的に同一なグ
ループデータ毎に必要最低限のサンプリング周波数ｆ_s
でリサンプリングする場合を述べたが、アナログ音声信
号を直接の入力として、これを所要帯域が実質的に同一
なグループ信号毎に必要最低限のサンプリング周波数ｆ
_sでサンプリングする様に構成しても良いことは明らか
である。Further, in the above embodiment, the voice data sampled at the predetermined frequency F _s (for example, 40 KHz) is input, and this is used as the minimum sampling frequency f for each group data having substantially the same required band. _s
Although the case where the resampling is performed is described above, the analog voice signal is directly input, and this is used as the minimum sampling frequency f for each group signal having substantially the same required band.
_s may be configured so as to sampling by is clear.

【００５７】また、上記実施の形態では実質同一帯域と
見做せる各ブロックデータを繋げてグループデータ化し
たが、このグループデータ化を行わずに、ブロックデー
タ単位で音声データのリサンプル処理及び符号／復号処
理を行うように装置を構成しても良い。Further, in the above-mentioned embodiment, the block data which can be regarded as substantially the same band is connected to form the group data. However, without performing the group data, the re-sampling process and the code of the audio data are performed in the unit of the block data. / The device may be configured to perform the decoding process.

【００５８】また、上記本発明に好適なる実施の形態を
述べたが、本発明思想を逸脱しない範囲内で各部の構
成、制御、処理及びこれらの組合せの様々な変更が行え
ることは言うまでも無い。Although the preferred embodiments of the present invention have been described above, it goes without saying that various changes can be made to the configuration, control, processing, and combinations thereof of each part without departing from the spirit of the present invention. There is no.

【００５９】[0059]

【発明の効果】以上述べた如く本発明によれば、入力の
音声信号をブロック分割すると共に、各ブロック信号を
必要最低限のサンプリング周波数でディジタル処理する
ため、音質を犠牲にせずに、高データ圧縮率の音声符号
／復号処理を実現できる。As described above, according to the present invention, the input voice signal is divided into blocks and each block signal is digitally processed at the minimum necessary sampling frequency. Therefore, high data quality can be achieved without sacrificing sound quality. A compression rate voice encoding / decoding process can be realized.

[Brief description of drawings]

【図１】本発明の原理を説明する図である。FIG. 1 is a diagram illustrating the principle of the present invention.

【図２】実施の形態によるサウンドデータ符号装置の構
成を示す図である。FIG. 2 is a diagram showing a configuration of a sound data encoding device according to an embodiment.

【図３】実施の形態によるサウンドデータ符号処理のフ
ローチャート（１）である。FIG. 3 is a flowchart (1) of sound data coding processing according to the embodiment.

【図４】実施の形態によるサウンドデータ符号処理のフ
ローチャート（２）である。FIG. 4 is a flowchart (2) of sound data coding processing according to the embodiment.

【図５】実施の形態によるサウンドデータ符号処理のイ
メージ図（１）である。FIG. 5 is an image diagram (1) of sound data coding processing according to the embodiment.

【図６】実施の形態によるサウンドデータ符号処理のイ
メージ図（２）である。FIG. 6 is an image diagram (2) of sound data coding processing according to the embodiment.

【図７】実施の形態によるゲーム装置の構成を示す図で
ある。FIG. 7 is a diagram showing a configuration of a game device according to an embodiment.

【図８】従来のＡＤＰＣＭ方式を説明する図である。FIG. 8 is a diagram illustrating a conventional ADPCM system.

[Explanation of symbols]

１ＣＰＵ２主メモリ３表示部（ＤＳＰ）４キーボード（ＫＢＤ）５ハードディスク装置（ＨＤＤ）５ａサウンドデータファイル５ｂ符号データファイル３９ＡＤＰＣＭ復号部４２ＲＯＭカートリッジ（ＲＯＭ−Ｃ）４３ＣＤ−ＲＯＭ 1 CPU 2 main memory 3 Display (DSP) 4 keyboard (KBD) 5 Hard disk drive (HDD) 5a Sound data file 5b code data file 39 ADPCM Decoding Unit 42 ROM cartridge (ROM-C) 43 CD-ROM

───────────────────────────────────────────────────── フロントページの続き (58)調査した分野(Int.Cl.⁷，ＤＢ名) H03M 3/00 - 11/00 G10L 19/00 ─────────────────────────────────────────────────── ─── Continuation of the front page (58) Fields surveyed (Int.Cl. ⁷ , DB name) H03M 3/00-11/00 G10L 19/00

Claims

(57) [Claims]

1. An audio signal is divided into a predetermined block length,
A group signal obtained by spectrally analyzing each block signal to obtain a substantial frequency band for each block signal, sequentially connecting a plurality of block signals that can be regarded as substantially the same frequency band consecutively from the first block An audio signal coding method characterized in that an audio signal is sampled at a required minimum sampling frequency for each time, and a data compression coding process is performed on the obtained all sampling data series.

2. Resampling of audio data at a minimum required sampling frequency for each group data that is sequentially processed, instead of processing the audio data, which is previously sampled at a constant sampling frequency. The method of encoding an audio signal according to claim 1, wherein

3. Data compression coding according to the method of claim 1.
Audio coded data that is required for each group signal
Sampling frequency sampled at the minimum frequency
It also includes information such as number, data size, and coded data.
Is input and the audio code data for each group signal is sequentially subjected to data decompression decoding processing at each corresponding sampling cycle.

4. A computer-readable recording medium having recorded therein a program for causing a computer to execute the method of encoding / decoding an audio signal according to claim 1. Description: