JP4254960B2

JP4254960B2 - Audio data encoding apparatus and audio data reproducing apparatus

Info

Publication number: JP4254960B2
Application number: JP2004351162A
Authority: JP
Inventors: 弘充松浦
Original assignee: Sony Computer Entertainment Inc
Current assignee: Sony Interactive Entertainment Inc
Priority date: 2004-12-03
Filing date: 2004-12-03
Publication date: 2009-04-15
Anticipated expiration: 2024-12-03
Also published as: JP2006162756A

Description

本発明は、音声データの記録および再生に係り、特に、ループ部分を含む音声データの再生時の処理負荷を軽減する記録および再生技術に関する。 The present invention relates to recording and reproduction of audio data, and more particularly to a recording and reproduction technique for reducing processing load during reproduction of audio data including a loop portion.

パーソナルコンピュータ、ゲーム機、音楽再生装置等のデジタル音声データ再生機能を備えた情報処理装置では、符号化された音声データを再生時にアナログ音声データに変換して音声出力することが行なわれている。 In an information processing apparatus having a digital voice data playback function such as a personal computer, a game machine, a music playback device, etc., encoded voice data is converted into analog voice data during playback and output as voice.

符号化された音声データは、音楽用ＣＤのように非圧縮のものもあるが、多くは、圧縮された状態で読み出され、再生時に伸長され音声出力される。 The encoded audio data may be uncompressed like a music CD, but most of the encoded audio data is read in a compressed state, decompressed during reproduction, and output as audio.

音声圧縮方式としては、種々の方式が提案され実用化されているが、時系列に並んだサンプルデータを複数のブロックに分割し、直交変換、例えば、ＤＣＴ（Discrete Consine Transform）、ＭＤＣＴ（Modified Discrete Consine Transform）等を施して圧縮を行なう方式が広く知られている。このとき、音声波形の連続性を高めるため、隣り合う複数のブロックを組にして、ブロックをオーバーラップさせながら順次圧縮することが行なわれている。このような圧縮方式の代表的なものとして、いわゆるミニディスク（ＭＤ）等で採用されているＡＴＲＡＣ（Adaptive TRans form Acoustic Coding）が挙げられる（特許文献１参照）。 Various methods have been proposed and put to practical use as audio compression methods, but sample data arranged in time series is divided into a plurality of blocks, and orthogonal transformation, for example, DCT (Discrete Consine Transform), MDCT (Modified Discrete) is performed. A method of performing compression by applying (Consine Transform) is widely known. At this time, in order to increase the continuity of the speech waveform, a plurality of adjacent blocks are grouped and sequentially compressed while overlapping the blocks. A typical example of such a compression method is ATRAC (Adaptive TRans form Acoustic Coding) employed in a so-called minidisc (MD) (see Patent Document 1).

図７（ａ）は、ブロック単位で音声データの圧縮を行なう様子を示す図である。本図では、時系列に並んだサンプルデータが所定個数毎に区切られてブロック（オリジナルブロック）「Ａ０」「Ａ」「Ｂ」「Ｃ」「Ｄ」「Ｄ０」…が生成されている。 FIG. 7A is a diagram illustrating a state in which audio data is compressed in units of blocks. In this figure, sample data arranged in time series is divided every predetermined number to generate blocks (original blocks) “A0” “A” “B” “C” “D” “D0”.

そして、連続する２つのブロックを組として、ブロックをオーバーラップさせながら圧縮処理を行ない、エンコードブロックが生成される。ここでは、オリジナルブロック「Ａ０」「Ａ」からエンコードブロック「Ｘ１」が生成され、オリジナルブロック「Ａ」「Ｂ」からエンコードブロック「Ｘ２」が生成されている。 Then, an encoding block is generated by performing compression processing with two consecutive blocks as a set and overlapping the blocks. Here, an encoding block “X1” is generated from the original blocks “A0” and “A”, and an encoding block “X2” is generated from the original blocks “A” and “B”.

音声データの再生時には、エンコードブロック「Ｘ１」「Ｘ２」からデコードブロック「ａ」が生成される。このデコードブロック「ａ」はオリジナルブロックの「Ａ」に対応するため、デコードブロック「ａ」のサンプルをＤ／Ａ変換して音声を再生することができる。以下同様にして、デコードブロック「ｂ」「ｃ」…を得ることができるため、音声が連続的に再生される。
特開平８−２８７６１２号公報 At the time of reproduction of audio data, a decode block “a” is generated from the encode blocks “X1” and “X2”. Since the decoded block “a” corresponds to “A” of the original block, the sample of the decoded block “a” can be D / A converted to reproduce the sound. Similarly, since the decoded blocks “b”, “c”,... Can be obtained, the sound is continuously reproduced.
JP-A-8-287612

ところで、音声データをループ再生させたい場合がある。例えば、図７（ｂ）に示すように、サンプルデータの所定の位置（ＬＳ）から所定の位置（ＬＥ）までの音声を、ある条件が満たされるまで繰り返し再生するような場合である。ここでは、ループのスタート位置であるＬＳがオリジナルブロック「Ａ」に含まれ、ループのエンド位置であるＬＥがオリジナルブロック「Ｄ」に含まれているものとする。 By the way, there is a case where audio data is desired to be reproduced in a loop. For example, as shown in FIG. 7B, the sound from the predetermined position (LS) to the predetermined position (LE) of the sample data is repeatedly reproduced until a certain condition is satisfied. Here, it is assumed that LS that is the start position of the loop is included in the original block “A”, and LE that is the end position of the loop is included in the original block “D”.

この場合、再生時には、図７（ｃ）に示すように、デコードブロック「ｄ」のＬＥに対応する位置まで再生を行なうと、続けて、デコードブロック「ａ」のＬＳに対応する位置から再生を行なうことになる。 In this case, as shown in FIG. 7C, when reproduction is performed up to the position corresponding to LE of the decoded block “d”, reproduction is continued from the position corresponding to LS of the decoded block “a”. Will do.

デコードブロック「ｄ」は、エンコードブロック「Ｘ４」「Ｘ５」とから生成されるが、エンコードブロック「Ｘ４」は、デコードブロック「ｃ」の生成の際にすでに読み込んでいるため、「Ｘ５」を読み込んで処理を行なえば足りる。一方、デコードブロック「ａ」は、エンコードブロック「Ｘ１」「Ｘ２」から生成されるが、ループで戻ってきた際には、両方のエンコードブロックを読み込んで処理を行なわなくてはならない。このため、一時的にデコード処理の負荷が増大する。 The decode block “d” is generated from the encode blocks “X4” and “X5”, but since the encode block “X4” has already been read when the decode block “c” is generated, “X5” is read. It's enough if you do the process. On the other hand, the decode block “a” is generated from the encode blocks “X1” and “X2”, but when returning in a loop, both the encode blocks must be read and processed. For this reason, the load of the decoding process temporarily increases.

さらに、ループの際には、デコードブロック「ｄ」に引き続いて再生されるデコードブロック「ａ」は、デコードブロック「ｄ」の再生が終わるまでに生成されていなければならない。ところが、デコードブロック「ｄ」は、ブロック全体が再生されずに、ＬＥの位置までしか再生されない。このため、より短時間でデコードブロック「ａ」を生成しなければならず、上述の処理負荷が一層大きくなっていた。 Further, in the loop, the decoded block “a” reproduced subsequent to the decoded block “d” must be generated before the reproduction of the decoded block “d” is completed. However, the decoded block “d” is reproduced only up to the position of LE without reproducing the entire block. For this reason, the decode block “a” must be generated in a shorter time, and the processing load described above is further increased.

このループの際の処理負荷の増大は、携帯型ゲーム機、ポータブル音楽再生装置のように携帯性が重視され、処理速度に制約のある音声データ再生装置において特に問題となっている。 The increase in processing load at the time of the loop is a problem particularly in an audio data reproducing apparatus in which portability is important and a processing speed is restricted like a portable game machine and a portable music reproducing apparatus.

本発明は、ループ部分を含む音声データの再生時における処理負荷を軽減させることを目的とする。 An object of the present invention is to reduce the processing load at the time of reproducing audio data including a loop portion.

上記課題を解決するため、本発明の第１の態様によれば、
ループスタート位置とループエンド位置とが指定されたループ部分を含む音声データをエンコードする音声データエンコード装置であって、
音声データをブロックに分割するブロック分割手段と、
連続する複数のブロックを組としてエンコードを行なうエンコード手段とを備え、
前記エンコード手段は、さらに、
ループエンド位置を含むブロックとループスタート位置を含むブロックとを組としてエンコードを行なうことを特徴とする音声データエンコード装置が提供される。 In order to solve the above problems, according to the first aspect of the present invention,
An audio data encoding device that encodes audio data including a loop portion in which a loop start position and a loop end position are specified,
A block dividing means for dividing the audio data into blocks;
Encoding means for encoding a set of a plurality of consecutive blocks as a set,
The encoding means further comprises:
There is provided an audio data encoding apparatus characterized in that encoding is performed by combining a block including a loop end position and a block including a loop start position.

また、上記課題を解決するため、本発明の第２の態様によれば、
ループスタート位置とループエンド位置とが指定されたループ部分を含む音声データを再生する音声データ再生装置であって、
ブロック単位のエンコード音声データを順次読み込み、連続する複数のブロックを組としてデコードして音声データを生成するデコード手段と、
ループの有無を判定する判定手段とを備え、
前記デコード手段は、
生成する音声データにループエンド位置が含まれる場合であって、前記判定手段がループ有と判定した場合には、次に読み込むべきブロックに代え、ループエンド位置を含むブロックとループスタート位置を含むブロックとを組としてエンコードされたブロックを読み込んでデコードを行なうことを特徴とする音声データ再生装置が提供される。 Moreover, in order to solve the said subject, according to the 2nd aspect of this invention,
An audio data reproducing device for reproducing audio data including a loop portion in which a loop start position and a loop end position are designated,
Decoding means for sequentially reading encoded audio data in units of blocks and decoding a plurality of consecutive blocks as a set to generate audio data;
Determination means for determining the presence or absence of a loop,
The decoding means includes
If the audio data to be generated includes a loop end position and the determination means determines that there is a loop, the block including the loop end position and the block including the loop start position are used instead of the block to be read next. An audio data reproducing apparatus is provided that reads and decodes a block encoded as a set.

ここで、前記デコード手段は、
生成する音声データにループエンド位置が含まれる場合であって、前記判定手段がループ有と判定した場合には、さらに、ループスタート位置を含む音声データの次のブロックを生成するために用いるエンコードブロックを読み込んでデコードを行なうことができる。 Here, the decoding means includes
If the audio data to be generated includes a loop end position, and the determination means determines that there is a loop, the encoding block used for generating the next block of the audio data including the loop start position Can be read and decoded.

本発明の実施の形態について図面を参照して説明する。図１は、本発明の実施形態の構成の概要を示す図である。本図に示すように、本実施形態では、音声データエンコード装置１００がエンコードした音声データを、音声データ再生装置２００がデコードして再生する処理を行なう。 Embodiments of the present invention will be described with reference to the drawings. FIG. 1 is a diagram showing an outline of the configuration of an embodiment of the present invention. As shown in the figure, in this embodiment, the audio data reproducing apparatus 200 performs a process of decoding and reproducing the audio data encoded by the audio data encoding apparatus 100.

具体的には、音声データエンコード装置１００は、記録対象の音声データ１１を読み込み、エンコード処理部１１０を用いて圧縮等の処理を施して、エンコードデータ２０に変換する。ここで、音声データ１１は、あらかじめ所定のレートでサンプリングされているものとする。もちろん、アナログ音声データとして音声データエンコード装置１００に入力し、音声データエンコード装置１００においてＡ／Ｄ変換するようにしてもよい。 Specifically, the audio data encoding apparatus 100 reads the audio data 11 to be recorded, performs a process such as compression using the encoding processing unit 110, and converts it into encoded data 20. Here, it is assumed that the audio data 11 is sampled at a predetermined rate in advance. Of course, analog audio data may be input to the audio data encoding device 100 and A / D converted by the audio data encoding device 100.

本実施形態において、音声データ１１は、ループ部分を含んでおり、あらかじめループスタート位置とループエンド位置が指定されている。ループスタート位置とループエンド位置とは、例えば、音声データ１１におけるアドレスあるいはサンプル番号等で示されており、ループ情報１２として音声データ１１とともに読み込まれる。なお、ループ情報１２は、音声データ１１のヘッダ部分等に付加するようにしてもよいし、音声データ１１とは別個に存在するようにしてもよい。 In the present embodiment, the audio data 11 includes a loop portion, and a loop start position and a loop end position are designated in advance. The loop start position and the loop end position are indicated by, for example, an address or a sample number in the audio data 11 and are read together with the audio data 11 as loop information 12. Note that the loop information 12 may be added to the header portion or the like of the audio data 11 or may exist separately from the audio data 11.

ループ部分は、音声データ再生装置２００での音声データ１１の再生時において、あらかじめ定められた条件にしたがってループの有無が判断される。 When the audio data reproduction apparatus 200 reproduces the audio data 11, the loop portion determines whether or not there is a loop according to a predetermined condition.

ループ有無の判断の結果、ループを行なうと判断された場合には、ループ情報１２にしたがってループを行なう。すなわち、ループエンド位置まで再生するとループスタート位置まで戻って再生を続ける。一方、ループを行なわないと判断された場合には、ループ情報１２に関わらず、ループは行なわない。すなわち、ループエンド位置まで再生すると、それ以降の音声データ１１の再生を続ける。 If it is determined as a result of the loop presence / absence that a loop is to be performed, the loop is performed according to the loop information 12. That is, when the playback is performed up to the loop end position, the playback is resumed up to the loop start position. On the other hand, when it is determined that the loop is not performed, the loop is not performed regardless of the loop information 12. That is, when the playback is performed up to the loop end position, the subsequent playback of the audio data 11 is continued.

１つの音声データ１１についてループ部分は複数含まれていてもよい。この場合、それぞれのループ部分についてループスタート位置とループエンド位置とがループ情報として指定される。複数のループ部分は、それぞれ異なる条件でループの有無が判断されるようにすることができる。 A plurality of loop portions may be included for one audio data 11. In this case, a loop start position and a loop end position are designated as loop information for each loop portion. The plurality of loop portions can be determined whether or not there is a loop under different conditions.

音声データ１１のどの範囲についてループ部分とするかは、音声データ１１の内容に基づいて、開発者が任意に定めることができる。 The developer can arbitrarily determine which range of the audio data 11 is the loop portion based on the contents of the audio data 11.

音声データエンコード装置１００のエンコード処理部１１０は、音声データ１１を複数のブロックに分割して、連続する所定数のブロックを一単位として、直交変換（例えば、ＭＤＣＴ）を用いたエンコードを行ない、ブロックをオーバラップさせ、順次エンコードデータを生成する。なお、エンコード処理部１１０の具体的な処理については、後述する。 The encoding processing unit 110 of the audio data encoding apparatus 100 divides the audio data 11 into a plurality of blocks, performs encoding using orthogonal transform (for example, MDCT) with a predetermined number of consecutive blocks as a unit, and blocks Are sequentially encoded to generate encoded data. Note that specific processing of the encoding processing unit 110 will be described later.

エンコードデータ２０は、音声データ１１のループ処理を制御する制御用プログラム３０とともにＣＤ−ＲＯＭ等の記録媒体３００に記録される。記録媒体３００への記録処理は、音声データエンコード装置１００で行なうようにしてもよいし、他の装置で行なうようにしてもよい。 The encoded data 20 is recorded on a recording medium 300 such as a CD-ROM together with a control program 30 that controls the loop processing of the audio data 11. The recording process on the recording medium 300 may be performed by the audio data encoding apparatus 100 or may be performed by another apparatus.

なお、エンコードデータ２０と制御用プログラム３０とは、別々の記録媒体３００に記録するようにしてもよい。また、記録媒体３００は、ＣＤ−ＲＯＭに限られず、ＤＶＤ−ＲＯＭ等の光ディスク、磁気ディスク、半導体記録装置等であってもよい。あるいは、インターネット等の通信回線を介して音声データエンコード装置１００から音声データ再生装置２００にエンコードデータ２０を出力するようにしてもよい。 Note that the encoded data 20 and the control program 30 may be recorded on separate recording media 300. The recording medium 300 is not limited to a CD-ROM, and may be an optical disc such as a DVD-ROM, a magnetic disc, a semiconductor recording device, or the like. Alternatively, the encoded data 20 may be output from the audio data encoding apparatus 100 to the audio data reproducing apparatus 200 via a communication line such as the Internet.

制御用プログラム３０は、音声データ再生装置２００で実行されるプログラムである。音声データ再生装置２００は、制御用プログラム３０に基づいて、例えば、ユーザからの操作に応じて、エンコードデータ２０をデコードして再生される音声データ（以下「音声データ１１ａ」）のループの有無を決定する。具体的な例としては、ユーザからキー入力等の所定の操作を受け付けるまで、音声データ１１ａのループ部分を繰り返し再生し、所定の操作を受け付けると、ループ再生を終了し、ループ部分以降の音声データ１１ａを再生することが挙げられる。 The control program 30 is a program executed by the audio data playback device 200. Based on the control program 30, the audio data reproducing apparatus 200 determines whether or not there is a loop of audio data (hereinafter “audio data 11 a”) that is reproduced by decoding the encoded data 20 in accordance with an operation from the user, for example. decide. As a specific example, the loop portion of the audio data 11a is repeatedly reproduced until a predetermined operation such as key input is received from the user, and when the predetermined operation is received, the loop reproduction is terminated and the audio data after the loop portion is received. 11a is mentioned.

音声データ再生装置２００は、制御用プログラム３０等を実行するプログラム実行部２１０とエンコードデータ２０のデコードを行なうデコード処理部２２０とを備えている。また、音声データ再生装置２００は、デコードされた音声データ１１ａを再生し、音声出力する機能を備えている。これにより、音声データ再生装置２００は、エンコードデータ２０をほぼリアルタイムでデコードし、再生することができる。デコード処理部２２０の具体的な処理については後述する。 The audio data reproduction device 200 includes a program execution unit 210 that executes the control program 30 and the like, and a decode processing unit 220 that decodes the encoded data 20. The audio data reproduction device 200 has a function of reproducing the decoded audio data 11a and outputting the audio. As a result, the audio data reproducing apparatus 200 can decode and reproduce the encoded data 20 almost in real time. Specific processing of the decoding processing unit 220 will be described later.

なお、音声データエンコード装置１００は、音声処理機能を備えたパーソナルコンピュータ、ワークステーション、その他の一般的な情報処理装置を用いて構成することができる。もちろん、音声データ記録に特化した専用のシステムを用いて構成するようにしてもよい。 Note that the audio data encoding apparatus 100 can be configured using a personal computer, a workstation, or other general information processing apparatus having an audio processing function. Of course, a dedicated system specialized for recording audio data may be used.

音声データ再生装置２００も、音声処理機能を備えたパーソナルコンピュータ、ワークステーション、その他の一般的な情報処理装置を用いて構成することができる。もちろん、音声データ再生に特化した専用のシステムを用いて構成するようにしてもよい。 The audio data reproducing apparatus 200 can also be configured using a personal computer, a workstation, or other general information processing apparatus having an audio processing function. Of course, a dedicated system specialized for audio data reproduction may be used.

なお、本発明は、音声データ再生装置２００として、ＰＤＡ（Personal Digital Assistant）、携帯型ゲーム機、ポータブル音楽再生装置のように携帯性が重視され、処理速度に制約のある情報処理装置を用いた場合に特に効果的に適用することができる。 Note that the present invention uses an information processing apparatus that places importance on portability and restricts processing speed, such as a PDA (Personal Digital Assistant), a portable game machine, and a portable music playback apparatus, as the audio data playback apparatus 200. It can be applied particularly effectively in cases.

情報処理装置は、音声データ記録用あるいは音声データ再生用に開発されたアプリケーションプログラムを中央処理装置（ＣＰＵ）が実行することにより、音声データエンコード装置１００あるいは音声データ再生装置２００として機能することができる。これらのアプリケーションプログラムは、ＣＤ−ＲＯＭ等の記録媒体に記録させることで市場に流通させることができる。また、インターネット等のコンピュータネットワークを介して流通させることもできる。 The information processing apparatus can function as the audio data encoding apparatus 100 or the audio data reproducing apparatus 200 when a central processing unit (CPU) executes an application program developed for recording audio data or reproducing audio data. . These application programs can be distributed in the market by being recorded on a recording medium such as a CD-ROM. It can also be distributed via a computer network such as the Internet.

つぎに、音声データエンコード装置１００における音声データのエンコード処理について図２のフロー図を参照して説明する。 Next, audio data encoding processing in the audio data encoding apparatus 100 will be described with reference to the flowchart of FIG.

まず、音声データエンコード装置１００は、音声データ１１とループ情報１２とを読み込む（Ｓ１０１）。ここで、音声データ１１は、図３（ａ）に示すように、サンプルデータが時系列に並んだものであり、ループ情報１２によりループスタート（ＬＳ）位置と、ループエンド（ＬＥ）位置とが指定されている。 First, the audio data encoding apparatus 100 reads the audio data 11 and the loop information 12 (S101). Here, as shown in FIG. 3A, the audio data 11 is sample data arranged in a time series, and the loop start (LS) position and the loop end (LE) position are determined by the loop information 12. It is specified.

つぎに、エンコード処理部１１０は、音声データ１１をブロックに分割する（Ｓ１０２）。ブロック分割は、例えば、音声データ１１の先頭から所定のサンプル毎に区切ることで行なわれる。ここでは、各ブロックのサンプル数は一定とするが、所定の規則にしたがって可変とするようにしてもよい。図３（ｂ）は、音声データ１１をブロック（オリジナルブロック）に分割した様子を示している。すなわち、音声データ１１が、オリジナルブロック「Ａ０」「Ａ」「Ｂ」「Ｃ」「Ｄ」「Ｄ０」…に分割されている。 Next, the encoding processing unit 110 divides the audio data 11 into blocks (S102). The block division is performed, for example, by dividing every predetermined sample from the head of the audio data 11. Here, the number of samples in each block is constant, but may be variable according to a predetermined rule. FIG. 3B shows a state in which the audio data 11 is divided into blocks (original blocks). That is, the audio data 11 is divided into original blocks “A0” “A” “B” “C” “D” “D0”.

一般に、ループ部分とブロック分割位置とは無関係に定められる。ループスタート位置とループエンド位置とは、それらの音声データにおける相対位置と各ブロックに含まれるサンプル数とに応じて、それぞれ、いずれかのオリジナルブロックに含まれることになる。 In general, the loop portion and the block division position are determined independently. The loop start position and the loop end position are included in any one of the original blocks according to the relative position in the audio data and the number of samples included in each block.

ここでは、ループスタート位置がオリジナルブロック「Ａ」に含まれ、ループエンド位置がオリジナルブロック「Ｄ」に含まれているものとする。なお、ループスタート位置およびループエンド位置がそれぞれどのオリジナルブロックに含まれるかは、各ブロックに含まれるサンプル数から容易に求めることができる。 Here, it is assumed that the loop start position is included in the original block “A” and the loop end position is included in the original block “D”. It should be noted that which original block each includes the loop start position and the loop end position can be easily obtained from the number of samples included in each block.

エンコード処理部１１０は連続するオリジナルブロックを単位として音声圧縮（エンコード）を行なう（Ｓ１０３）。ここでは、ループ情報の有無にかかわらず、時系列に並ぶオリジナルブロックそれぞれについてエンコード処理を行なう。 The encoding processing unit 110 performs audio compression (encoding) in units of continuous original blocks (S103). Here, the encoding process is performed for each of the original blocks arranged in time series regardless of the presence or absence of the loop information.

エンコード処理では、連続する２つのオリジナルブロックから１つのエンコードブロックを生成するものとする。ただし、それぞれのオリジナルブロックはオーバーラップされ、前後のエンコードブロック生成に用いられる。 In the encoding process, one encoded block is generated from two consecutive original blocks. However, each original block is overlapped and used to generate preceding and subsequent encoded blocks.

すなわち、図３（ｃ）に示すように、連続するオリジナルブロック「Ａ０」「Ａ」とからエンコードブロック「Ｘ１」を生成し、オリジナルブロック「Ａ」「Ｂ」とからエンコードブロック「Ｘ２」を生成する。以下同様にして、エンコードブロック「Ｘ３」「Ｘ４」「Ｘ５」…を生成する。 That is, as shown in FIG. 3C, the encoding block “X1” is generated from the continuous original blocks “A0” and “A”, and the encoding block “X2” is generated from the original blocks “A” and “B”. To do. Similarly, encode blocks “X3”, “X4”, “X5”,... Are generated.

このような圧縮方式としては、ＡＴＲＡＣ（Adaptive TRans form Acoustic Coding）が代表的であるが、本発明は、これに限られない。また、連続する３以上のオリジナルブロックから１つのエンコードブロックを生成するようにしてもよいし、デコード時に３以上のエンコードブロックから１つのデコードブロックを生成するようにしてもよい。 As such a compression method, ATRAC (Adaptive TRans form Acoustic Coding) is typical, but the present invention is not limited to this. In addition, one encode block may be generated from three or more consecutive original blocks, or one decode block may be generated from three or more encode blocks during decoding.

つぎに、エンコード処理部１１０は、ループ用のエンコードブロックを生成する（Ｓ１０４）。ここで、ループ用のエンコードブロックは、ループエンド位置が含まれるオリジナルブロックとループスタート位置が含まれるオリジナルブロックとから生成するようにする。このとき、ループ時には、ループエンド位置からループスタート位置への順で再生すべきであるから、ループエンド位置が含まれるブロックを前にし、ループスタート位置が含まれるブロックを後ろにしてエンコードを行なう。 Next, the encoding process part 110 produces | generates the encoding block for loops (S104). Here, the encoding block for the loop is generated from the original block including the loop end position and the original block including the loop start position. At this time, at the time of the loop, since playback should be performed in the order from the loop end position to the loop start position, encoding is performed with the block including the loop end position in front and the block including the loop start position in the back.

図４は、ループ用のエンコードブロックを生成する様子を示す図である。本例では、ループエンド位置はオリジナルブロック「Ｄ」に含まれ、ループスタート位置はオリジナルブロック「Ａ」に含まれているため、ループ用エンコードブロック「Ｘ」は、オリジナルブロック「Ｄ」と「Ａ」とから生成される。このため、ループ用ブロックとしてオリジナルブロック「Ｄ」「Ａ」が並んでいる。なお、ループの際にはブロック「Ｄ」のＤ１に対応する部分を再生すると、ブロック「Ａ」のＡ１に対応する部分が再生されることになる。 FIG. 4 is a diagram illustrating a state in which an encoding block for a loop is generated. In this example, since the loop end position is included in the original block “D” and the loop start position is included in the original block “A”, the loop encoding block “X” includes the original blocks “D” and “A”. Is generated. Therefore, the original blocks “D” and “A” are arranged as loop blocks. In the loop, when the portion corresponding to D1 of the block “D” is reproduced, the portion corresponding to A1 of the block “A” is reproduced.

ここで、ループ時にはカットされるＤ１部分とＡ１部分以外のデータについては、もとのオリジナルブロック「Ｄ」および「Ａ」のデータをそのまま用いることができる。このようにすることで、音声波形の連続性を高めることができる。もちろん、無音等のダミーデータを用いるようにしてもよい。 Here, the original blocks “D” and “A” can be used as they are for the data other than the portions D1 and A1 that are cut during the loop. By doing in this way, the continuity of a speech waveform can be improved. Of course, dummy data such as silence may be used.

音声データエンコード装置１００は、時系列に並んだエンコードブロックとループ用のエンコードブロックとをエンコードデータ２０として出力する（Ｓ１０５）。このとき、ループ情報もあわせて出力する。ループ情報は、エンコードデータ２０とは別のファイルとして出力するようにしてもよいし、エンコードデータ２０のヘッダ部分に付加して出力するようにしてもよい。いずれにしても、エンコードデータ２０とループ情報とは関連性を保って扱われる。また、エンコードデータ２０には、必要に応じて、エンコード時のエンコードディレイ（遅延情報）も含めるようにする。 The audio data encoding apparatus 100 outputs the encoded blocks arranged in time series and the encoded block for loop as encoded data 20 (S105). At this time, the loop information is also output. The loop information may be output as a file different from the encoded data 20 or may be output by being added to the header portion of the encoded data 20. In any case, the encoded data 20 and the loop information are handled while maintaining the relevance. Further, the encode data 20 includes an encode delay (delay information) at the time of encoding as necessary.

上述のように、音声データエンコード装置１００が出力したエンコードデータ２０は、制御用プログラム３０とともに記録媒体３００に記録され、市場に流通される。 As described above, the encoded data 20 output from the audio data encoding apparatus 100 is recorded on the recording medium 300 together with the control program 30 and distributed to the market.

つぎに、音声データ再生装置２００における再生処理について図５のフロー図を参照して説明する。 Next, playback processing in the audio data playback device 200 will be described with reference to the flowchart of FIG.

ユーザの操作等に基づいて記録媒体３００に記録された音声データの再生を開始すると（Ｓ２０１）、音声データ再生装置２００は、エンコードデータ２０の先頭のエンコードブロックおよびループ情報を読み込む（Ｓ２０２）。 When reproduction of audio data recorded on the recording medium 300 is started based on a user operation or the like (S201), the audio data reproduction device 200 reads the first encoded block and loop information of the encoded data 20 (S202).

まず、ループを行なわない場合の再生について説明する。ループを行なわない場合とは、再生対象のブロック（デコード後のブロック：デコードブロック）にループエンド位置が含まれていない場合（Ｓ２０３：Ｎ）、あるいは、ループエンド位置が含まれていても（Ｓ２０３：Ｙ）、ループを行なわない状態の場合（Ｓ２０８：Ｎ）である。なお、ループを行なうかどうかは、上述のように、制御用プログラム３０にしたがって、プログラム実行部２１０が判断する。また、デコード後のブロックにループエンド位置が含まれているかどうかは、再生済のブロック数とブロックに含まれるサンプル数とループ情報とから判断することができる。 First, reproduction when no loop is performed will be described. The case where the loop is not performed means that the block to be reproduced (block after decoding: decoded block) does not include the loop end position (S203: N), or even if the loop end position is included (S203). : Y), the case where no loop is performed (S208: N). Whether or not to perform the loop is determined by the program execution unit 210 according to the control program 30 as described above. Whether the decoded block contains a loop end position can be determined from the number of reproduced blocks, the number of samples included in the block, and loop information.

ループを行なわない場合には、従来どおり、つぎのエンコードブロックを読み込み（Ｓ２０５）、連続するエンコードブロックを用いてデコードを行ない、デコードブロックを生成する（Ｓ２０５）。 When the loop is not performed, the next encoding block is read as usual (S205), and the decoding is performed using the continuous encoding block to generate the decoding block (S205).

すなわち、図３（ｄ）に示すように、エンコードブロック「Ｘ１」「Ｘ２」からデコードブロック「ａ」を生成し、エンコードブロック「Ｘ２」「Ｘ３」からデコードブロック「ｂ」を生成していく。なお、デコードブロック「ａ」「ｂ」…は、オリジナルブロック「Ａ」「Ｂ」…に対応する音声データである。 That is, as shown in FIG. 3D, a decode block “a” is generated from the encode blocks “X1” and “X2”, and a decode block “b” is generated from the encode blocks “X2” and “X3”. The decoded blocks “a”, “b”,... Are audio data corresponding to the original blocks “A”, “B”,.

そして、デコードブロックを順次Ｄ／Ａ変換し、音声出力することで音声データが再生される（Ｓ２０６）。以上の処理は、エンコードブロックが終了するまで続けられる（Ｓ２０７、Ｓ２１５）。 Audio data is reproduced by sequentially D / A converting the decoded blocks and outputting the audio (S206). The above processing is continued until the encoding block is completed (S207, S215).

つぎに、ループを行なう場合の再生について説明する。ループを行なう場合とは、再生対象のデコードブロックにループエンド位置が含まれており（Ｓ２０３：Ｙ）、かつ、ループを行なう状態の場合（Ｓ２０８：Ｙ）である。 Next, reproduction when looping will be described. The case where the loop is performed is a case where the loop end position is included in the decoding block to be reproduced (S203: Y) and the state where the loop is performed (S208: Y).

この場合、音声データ再生装置２００は、つぎのエンコードブロックではなく、ループ用のエンコードブロックを読み込む（Ｓ２０９）。そして、読み込み済みのエンコードブロックと新たに読み込んだループ用のエンコードブロックとからデコードブロックを生成する（Ｓ２１１）。 In this case, the audio data reproducing device 200 reads the loop encoding block instead of the next encoding block (S209). Then, a decode block is generated from the read encode block and the newly read loop encode block (S211).

この処理について、さらに、図６を参照して説明する。本例では、再生対象がデコードブロック「ｄ」の場合にループが発生する。デコードブロック「ｃ」の処理の際に、すでにエンコードブロック「Ｘ４」を読み込んでいるため、ループがない場合には、エンコードブロック「Ｘ５」を読み込むことになるが、ループを行なう場合には「Ｘ５」ではなく、ループ用のエンコードブロック「Ｘ」を読み込む。 This process will be further described with reference to FIG. In this example, a loop occurs when the reproduction target is the decode block “d”. Since the encode block “X4” has already been read when the decode block “c” is processed, the encode block “X5” is read when there is no loop, but when the loop is performed, “X5” is read. ”Instead of the loop encoding block“ X ”.

そして、図６（ａ）に示すように、「Ｘ４」と「Ｘ」とからデコードブロック「ｄ」を生成する。デコードブロック「ｄ」のうち、再生すべきはｄ１の部分であるので、音声データ再生装置２００は、ｄ１の部分を再生し、それ以降はカットする（Ｓ２１１）。 Then, as shown in FIG. 6A, a decode block “d” is generated from “X4” and “X”. Of the decoded block “d”, the portion to be reproduced is the portion of d1, so the audio data reproducing device 200 reproduces the portion of d1, and cuts after that (S211).

音声データ再生装置２００は、ループスタート位置を含むデコードブロック「ａ」を得るためにエンコードブロックを読み込む。本来、デコードデータ「ａ」を生成するためにはエンコードブロック「Ｘ１」「Ｘ２」を読み込まなければならないが（図３（ｄ）参照）、ここでは、エンコードブロック「Ｘ」をすでに読み込んでいるため、エンコードブロック「Ｘ２」を読み込んで処理すれば足りる（Ｓ２１２）。 The audio data reproducing device 200 reads the encode block to obtain the decode block “a” including the loop start position. Originally, in order to generate the decoded data “a”, the encoding blocks “X1” and “X2” must be read (see FIG. 3D), but here the encoding block “X” has already been read. It is sufficient to read and process the encoding block “X2” (S212).

そして、図６（ｂ）に示すように、「Ｘ」と「Ｘ２」とからデコードブロック「ａ」を生成する（Ｓ２１３）。デコードブロック「ａ」のうち、再生すべきはａ１の部分であるので、音声データ再生装置２００は、ａ１の部分のみを再生する（Ｓ２１４）。これは、ループ情報を参照することで容易に判断することができる。 Then, as shown in FIG. 6B, a decode block “a” is generated from “X” and “X2” (S213). In the decoded block “a”, the portion to be reproduced is the portion a1, and therefore the audio data reproducing device 200 reproduces only the portion a1 (S214). This can be easily determined by referring to the loop information.

このように、本発明によれば、ループ用のエンコードブロックを用意しているため、ループの際の処理負荷を軽減することができる。 As described above, according to the present invention, since the encoding block for the loop is prepared, the processing load at the time of the loop can be reduced.

図６（ｂ）は、この様子を示している。すなわち、デコードブロック「ｄ」から「ａ」へループで戻る際に、オリジナルブロック「Ｄ」「Ａ」を用いて生成したループ用のエンコードブロック「Ｘ」を用いているため、図７（ｃ）のようにエンコードブロック「Ｘ１」「Ｘ２」の両方を読み込む必要はない。なぜならば、図６（ｂ）において「Ｘ」は、「ｄ」の処理の際にすでに読み込んでいるため、「Ｘ２」のみを読み込んで処理すれば足りるからである。 FIG. 6B shows this state. That is, when the loop is returned from the decoding block “d” to “a”, the encoding block “X” for the loop generated using the original blocks “D” and “A” is used. Thus, it is not necessary to read both the encoding blocks “X1” and “X2”. This is because “X” in FIG. 6B has already been read in the process of “d”, and therefore it is sufficient to read and process only “X2”.

ここで、連続する３つのオリジナルブロックを用いて１ブロックのエンコードデータを生成するエンコード方式を用いた場合について簡単に説明する。図４で示した例のようにオリジナルブロック「Ｄ」から「Ａ」に戻るとすると、「ＣＤＡ」「ＤＡＢ」の組で２つのループ用エンコードブロックを生成するようにする。そして、再生の際には、ループ部分にこれらのエンコードブロックを用いてエンコード処理を行なえばよい。
なお、エンコード時、デコード時に遅延が発生する圧縮方式があるが、このような圧縮方式を用いている場合には、デコード時の出力サンプルが遅延を持って出力される。このときは、カットする位置を遅延サンプル分後ろにずらすことで、所望の再生が可能となる。 Here, a brief description will be given of a case where an encoding method for generating encoded data of one block using three consecutive original blocks is used. When returning from the original block “D” to “A” as in the example shown in FIG. 4, two loop encoding blocks are generated with a set of “CDA” and “DAB”. Then, at the time of reproduction, encoding processing may be performed using these encoding blocks in the loop portion.
There is a compression method in which a delay occurs at the time of encoding and decoding. When such a compression method is used, output samples at the time of decoding are output with a delay. At this time, the desired reproduction can be performed by shifting the cutting position backward by the delayed sample.

本発明の実施形態の構成の概要を示す図である。It is a figure which shows the outline | summary of a structure of embodiment of this invention. 音声データエンコード装置１００におけるエンコード処理について説明するフロー図である。5 is a flowchart for explaining an encoding process in the audio data encoding apparatus 100. FIG. 音声データ１１のブロックについて説明する図である。It is a figure explaining the block of the audio | voice data 11. FIG. ループ用のエンコードブロックを生成する様子を示す図である。It is a figure which shows a mode that the encoding block for loops is produced | generated. 音声データ再生装置２００における再生処理について説明するフロー図である。FIG. 10 is a flowchart for explaining playback processing in the audio data playback device 200. ループの際のブロック単位の処理を説明する図である。It is a figure explaining the process of the block unit in the case of a loop. ブロック単位で音声データを圧縮する方式について説明する図である。It is a figure explaining the system which compresses audio | voice data by a block unit.

Explanation of symbols

１１…音声データ
１２…ループ情報
２０…エンコードデータ
３０…制御用プログラム
１００…音声データエンコード装置
１１０…エンコード処理部
２００…音声データ再生装置
２１０…プログラム実行部
２２０…デコード処理部
３００…記録媒体 DESCRIPTION OF SYMBOLS 11 ... Audio data 12 ... Loop information 20 ... Encoding data 30 ... Control program 100 ... Audio data encoding device 110 ... Encoding processing unit 200 ... Audio data reproduction device 210 ... Program execution unit 220 ... Decoding processing unit 300 ... Recording medium

Claims

An audio data encoding apparatus that encodes audio data including a loop portion in which a loop start position and a loop end position are specified,
A block dividing means for dividing the audio data into blocks;
Encoding means for generating a first encoded block for each set by encoding a plurality of consecutive blocks as a set,
The encoding means further comprises:
An audio data encoding apparatus characterized in that a second encoded block is generated for a set of blocks by encoding a block including a loop end position and a block including a loop start position as a set.

The audio data encoding device according to claim 1,
In the second encoding block, any data other than the data from the beginning of the block including the loop end position to the loop end position and the data from the loop start position to the end of the block including the loop start position is arbitrarily set. An audio data encoding apparatus characterized by allowing data.

An audio data reproducing device for reproducing audio data including a loop portion in which a loop start position and a loop end position are designated,
In the audio data divided into predetermined blocks, each of the first encoded blocks encoded as a set of a plurality of continuous blocks is sequentially read, and the audio is obtained by decoding the plurality of continuous first encode blocks as a set. Decoding means for generating data;
Determination means for determining whether or not a condition for performing loop reproduction is satisfied,
The decoding means includes
When the audio data generated using the next first encoding block includes a loop end position and the determination means determines that the condition for performing loop reproduction is satisfied, In place of the first encoding block, audio data including the loop end position is obtained by reading and decoding the second encoding block encoded as a set of the block including the loop end position and the block including the loop start position. An audio data reproducing apparatus characterized by generating

The audio data reproducing device according to claim 3,
The decoding means includes
After the audio data including the loop end position is generated by decoding the second encoded block, the first encoded block encoded as a set of the block including the loop start position and the next block is read. An audio data reproducing apparatus that generates audio data including a loop start position by performing decoding.

A program for causing an information processing apparatus to function as an audio data encoding apparatus that encodes audio data including a loop portion in which a loop start position and a loop end position are specified,
In the information processing apparatus,
A block division function that divides audio data into blocks, and an encoding function that generates a first encoded block for each set by encoding a plurality of consecutive blocks as a set.
The encoding function further includes:
A program for generating a second encoded block for a set of blocks by encoding a block including a loop end position and a block including a loop start position as a set.

The program according to claim 5,
In the second encoding block, any data other than the data from the beginning of the block including the loop end position to the loop end position and the data from the loop start position to the end of the block including the loop start position is arbitrarily set. A program characterized by allowing data.

A program for causing an information processing apparatus to function as an audio data reproducing apparatus that reproduces audio data including a loop portion in which a loop start position and a loop end position are designated,
In the information processing apparatus,
In the audio data divided into predetermined blocks, each of the first encoded blocks encoded as a set of a plurality of continuous blocks is sequentially read, and the audio is obtained by decoding the plurality of continuous first encode blocks as a set. A decoding function that generates data and a determination function that determines whether the conditions for loop playback are satisfied,
The decoding function is
If the audio data generated using the next first encoding block includes a loop end position and the determination means determines that the condition for performing loop reproduction is satisfied, Instead of the next first encoding block, a second encoding block encoded as a set of a block including a loop end position and a block including a loop start position is read and decoded, so that audio including the loop end position is read. A program characterized by generating data.

The program according to claim 7,
The decoding function is
After the audio data including the loop end position is generated by decoding the second encoded block, the first encoded block encoded as a set of the block including the loop start position and the next block is read. A program that generates audio data including a loop start position by performing decoding.

An information processing apparatus-readable recording medium on which the program according to any one of claims 5 to 8 is recorded.

A recording medium readable by an information processing device,
Information for designating the loop start position and loop end position of the audio data;
For audio data divided into blocks, a first encoded block encoded as a set of a plurality of consecutive blocks;
For the audio data divided into blocks, a second encoded block encoded as a set of a block including the loop end position and a block including the loop start position is recorded,
The information processing apparatus is
A decoding step of sequentially reading the first encoding block from the recording medium and generating audio data by decoding a plurality of consecutive first encoding blocks as a set;
A determination step for determining whether or not a condition for performing loop playback is satisfied,
In the decoding step,
When it is determined that the loop end position is included in the audio data generated using the next first encoding block and the condition for performing the loop reproduction is satisfied in the determination step, the next Instead of the first encoding block, the second encoding block is read from the recording medium and decoded, thereby generating audio data including a loop end position ,
Recording medium characterized Rukoto said determining step is further a program for realizing the recording to the information processing apparatus.

An audio data encoding method in an audio data encoding apparatus for encoding audio data including a loop portion in which a loop start position and a loop end position are specified,
The audio data encoding device is
A block dividing step for dividing the audio data into blocks;
Performing an encoding step of generating a first encoding block for each set by encoding a plurality of consecutive blocks as sets,
In the encoding step,
An audio data encoding method comprising: encoding a block including a loop end position and a block including a loop start position as a set to generate a second encoded block for the set of blocks.

An audio data decoding method in an audio data reproducing apparatus for reproducing audio data including a loop portion in which a loop start position and a loop end position are designated,
The audio data decoding device comprises:
In the audio data divided into predetermined blocks, the first encoded block encoded as a set of a plurality of continuous blocks is sequentially read, and the audio data is decoded by decoding the plurality of continuous first encode blocks as a set. A decoding step to generate;
A determination step for determining whether or not a condition for performing loop playback is satisfied,
In the decoding step,
When it is determined that the loop end position is included in the audio data generated using the next first encoding block and the condition for performing the loop reproduction is satisfied in the determination step, the next Instead of the first encoding block, the second encoding block encoded as a set of the block including the loop end position and the block including the loop start position is read and decoded, so that the audio data including the loop end position is obtained. An audio data decoding method comprising generating the audio data.

An audio data decoding method of claim 1 2,
The audio data decoding device, in the decoding step,
After the audio data including the loop end position is generated by decoding the second encoded block, the first encoded block encoded as a set of the block including the loop start position and the next block is read. An audio data decoding method comprising: generating audio data including a loop start position by performing decoding.