JP3035948B2

JP3035948B2 - Audio data playback method

Info

Publication number: JP3035948B2
Application number: JP2021113A
Authority: JP
Inventors: 健久多良木
Original assignee: Sony Corp
Current assignee: Sony Corp
Priority date: 1990-01-31
Filing date: 1990-01-31
Publication date: 2000-04-24
Anticipated expiration: 2015-04-24
Also published as: JPH03226800A

Description

【発明の詳細な説明】〔産業上の利用分野〕本発明は、準瞬時圧縮によりPCMデータ化された音声
信号を再生する音声データ再生方法に関する。Description: TECHNICAL FIELD The present invention relates to an audio data reproducing method for reproducing an audio signal converted into PCM data by quasi-instantaneous compression.

[Summary of the Invention]

本発明は、準瞬時圧縮によりPCMデータ化された音声
信号を再生する音声データ再生方法において、人間の声
が記録される箇所の会話及びフレーズの切れ目の部分の
音声信号を、強制的に直前の音声データとは無関係なモ
ードのPCMデータとし、この直前の音声データとは無関
係なモードのPCMデータが得られるタイミングから再生
するようにし、常に良好な音声が再生されるようにした
ものである。The present invention relates to an audio data reproducing method for reproducing an audio signal converted into PCM data by quasi-instantaneous compression. The PCM data is in a mode irrelevant to the audio data, and is reproduced from the timing at which PCM data in a mode irrelevant to the immediately preceding audio data is obtained, so that good audio is always reproduced.

[Conventional technology]

従来、音声信号をPCMデータとしてデジタル信号化す
る場合、効率良くデータ化するために、例えば特願昭63
−301544号に記載のように、準瞬時圧縮によりPCMデー
タ化することが提案されている。即ち、例えば第５図に
示す如く、ブロック化されて形成されたPCMデータのデ
ータ圧縮率を、このブロック毎に変化させるもので、圧
縮データの他に付属情報（圧縮に関するパラメータ）を
付加するもので、デコーダ側（再生側）のフィルタはII
R（巡回型）デジタル・フィルタ構成となる。Conventionally, when converting an audio signal into a digital signal as PCM data, in order to efficiently convert the data, for example, Japanese Patent Application No. Sho 63
As described in -301544, conversion to PCM data by quasi-instantaneous compression has been proposed. That is, as shown in FIG. 5, for example, the data compression ratio of the block-formed PCM data is changed for each block, and additional information (compression-related parameters) is added in addition to the compressed data. And the filter on the decoder side (reproduction side) is II
An R (cyclic) digital filter configuration is obtained.

ここで、各ブロックの構成を第６図に示すと、１ブロ
ック分のデータとしては、１バイトのヘッダ情報（パラ
メータ情報）RFと８バイトのサンプル用データD_A0〜D_B3
で構成されている。ヘッダ情報RFは、４ビットのレンジ
情報と、２ビットのフィルタ選択情報と、それぞれ１ビ
ットの２つのフラグ情報、例えばループの開始点を含む
ブロックであることを示す情報（ループ・スタート・フ
ラグLSF）及びループの終端点のブロックを示す情報
（ループ・エンド・フラグLEF）とで構成されている。
ここで、１サンプルの波高値データは、ビット圧縮され
て４ビットで表されており、データD_A0〜D_B3中には16サ
ンプル分の４ビット・データD_A0H〜D_B3Lが含まれてい
る。Here, illustrating the configuration of each block in FIG. 6, 1 as the block of data, 1-byte header information (parameter information) RF and 8 bytes of sample data D _A0 to D _B3
It is composed of The header information RF includes 4-bit range information, 2-bit filter selection information, and two 1-bit flag information, for example, information indicating a block including a loop start point (loop start flag LSF). ) And information (loop end flag LEF) indicating the block at the end point of the loop.
Here, the peak value data of one sample is bit-compressed and represented by 4 bits, and the data D _{A0 to} D _B3 include 4-bit data D _{A0H to} D _{B3L for} 16 samples. .

そして、２ビットのフィルタ選択情報により、デコー
ダ側のフィルタが、０次フィルタと１次フィルタと２次
フィルタとが選択されるようにしてある。即ち、このフ
ィルタ選択情報により、再生側のIIRフィルタの巡回数
が０から２まで選択される。この場合、巡回数０のとき
は、巡回が全く行われず前のブロックのデータと無相関
の所謂ストレートPCMモードである。Then, the filter on the decoder side is selected from the 0th-order filter, the 1st-order filter, and the 2nd-order filter according to the 2-bit filter selection information. That is, the repetition number of the IIR filter on the reproduction side is selected from 0 to 2 based on the filter selection information. In this case, when the number of tours is 0, the so-called straight PCM mode is performed in which no tour is performed and there is no correlation with the data of the previous block.

[Problems to be solved by the invention]

ところで、このように準瞬時圧縮されたデータを最初
から再生するときには、IIRフィルタによる巡回等の処
理で元のビット数のデータに戻され、良好な音声が再生
されるが、ランダム再生等で途中から再生させると、再
生し始めたときのデータに設定された所定の巡回が行わ
れず、不自然な音声が再生されてしまう虞れがあった。
即ち、IIRフィルタにより巡回させるときには、所定の
ブロックのデータを巡回させたものに次のブロックのデ
ータを加算して、元のデータを得るため、前のブロック
のデータがないと元のデータが復元されず、途中から再
生させたときにはこのように元のデータが復元されず、
聴くに耐えない雑音が再生されてしまう。By the way, when the data thus compressed is reproduced from the beginning, the data is restored to the original number of bits by processing such as cycling by an IIR filter, and a good sound is reproduced. When the reproduction is started from the beginning, the predetermined tour set in the data at the time of starting the reproduction is not performed, and there is a possibility that an unnatural sound may be reproduced.
That is, when the data is circulated by the IIR filter, the data of the next block is added to the circulated data of the predetermined block, and the original data is obtained. However, when played back from the middle, the original data is not restored in this way,
Noise that cannot be heard is reproduced.

本発明の目的は、このように準瞬時圧縮されたデータ
を任意の箇所から再生させたときに、良好な音声を再生
させることにある。It is an object of the present invention to reproduce good sound when such quasi-instantaneously compressed data is reproduced from an arbitrary position.

[Means for solving the problem]

本発明の音声データ再生方法は、準瞬時圧縮によりPC
Mデータ化された連続した音声信号を再生する音声デー
タ再生方法において、人間の声が記録される箇所の会話
及びフレーズの切れ目の部分の音声信号を、強制的に直
前の音声データとは無関係なモードのPCMデータとし、
このPCMデータを任意の箇所から再生するとき、直前の
音声データとは無関係なモードのPCMデータが得られる
までミューティングを行い、この直前の音声データとは
無関係なモードのPCMデータが得られるタイミングから
再生するようにしたものである。The audio data reproducing method of the present invention uses a quasi-instantaneous compression
In the audio data reproducing method of reproducing a continuous audio signal converted into M data, the audio signal at the part where the human voice is recorded and the part at the break of the phrase are forcibly changed to have no relation to the immediately preceding audio data. Mode PCM data,
When playing this PCM data from any point, muting is performed until PCM data in a mode unrelated to the previous audio data is obtained, and PCM data in a mode unrelated to the previous audio data is obtained It is made to play from.

また、人間の声以外の音声信号を、人間の声の音声信
号とは別のチャンネルで常に直前の音声データとは無関
係なモードのPCMデータとし、上述した再生を行うとき
に、このPCMデータを準瞬時圧縮によりPCMデータ化され
た人間の声のPCMデータと混合して再生するようにした
ものである。In addition, an audio signal other than a human voice is always set as PCM data in a mode different from the immediately preceding audio data on a channel different from the audio signal of the human voice, and when performing the above-described reproduction, this PCM data is used. It is designed to be mixed with PCM data of human voice converted to PCM data by quasi-instantaneous compression and reproduced.

さらにまた、人間の声が記録されたチャンネルを含む
準瞬時圧縮によりPCMデータ化された複数チャンネルのP
CMデータを同時に任意の箇所から再生するとき、人間の
声が記録されたチャンネルが直前の音声データとは無関
係なモードのPCMデータとなったタイミングから再生す
ると共に、他のチャンネルのPCMデータを、同じタイミ
ングから強制的に直前の音声データとは無関係なモード
と見なして再生するようにしたものである。Furthermore, a plurality of channels of PCM data converted to PCM data by quasi-instantaneous compression including a channel in which a human voice is recorded.
When playing CM data from an arbitrary location at the same time, the channel where the human voice is recorded is played from the timing when the PCM data in the mode irrelevant to the previous audio data is played back, and the PCM data of other channels is At the same timing, the reproduction is forcibly regarded as a mode unrelated to the immediately preceding audio data.

[Action]

このようにしたことで、任意の箇所からランダム再生
を行うときには、直前の音声データとは無関係なモード
のPCMデータとなったタイミングから再生されるので、
直前の音声データと関連のある音声データが途中から再
生されて不自然な音声が再生されることがなく、常に良
好な音声が再生される。By doing this, when random playback is performed from an arbitrary point, playback is performed from the timing at which PCM data in a mode irrelevant to the previous audio data is played,
The sound data related to the immediately preceding sound data is not reproduced from the middle and unnatural sound is not reproduced, and good sound is always reproduced.

〔Example〕

以下、本発明の音声データ再生方法の一実施例を、第
１図〜第４図を参照して説明する。An embodiment of the audio data reproducing method according to the present invention will be described below with reference to FIGS.

先ず第２図は、サンプリング前のアナログ楽音信号波
形の一例を示している。この第２図のアナログ波形から
も明らかなように、一般の楽器音の発音開始直後におい
ては、ピアノの打鍵ノイズや管楽器のブレスノイズ等の
非音程成分が含まれることにより、波形の周期性が不明
瞭な部分であるフォルマント部分FRが生じていることが
多く、その後、楽音の音程（ピッチ、音高）に対応する
基本周期で同じ波形が繰り返し現れるようになる。この
繰り返し波形の所定のｎ周期分（ｎは整数）をルーピン
グ区間LPとし、この区間LPの始点及び終点をそれぞれル
ーピング開始点LP_S及びルービング終端点LP_E（ルーピン
グポイント）と表している。そして上記フォルマント部
分FR及びルーピング区間LPのアナログ波形に対応するデ
ィジタル・データを記憶媒体に記録し、再生時にはフォ
ルマント部分FRの再生に続いてルーピング区間LPを繰り
返し再生することにより、任意の長時間にわたって音楽
を発生させることができる。First, FIG. 2 shows an example of an analog tone signal waveform before sampling. As is clear from the analog waveform of FIG. 2, immediately after the start of the sounding of a general musical instrument sound, the periodicity of the waveform is reduced due to the inclusion of non-pitch components such as a keystroke noise of a piano and a breath noise of a wind instrument. A formant portion FR, which is an unclear portion, often occurs, and thereafter, the same waveform repeatedly appears in a basic cycle corresponding to a musical interval (pitch, pitch). Predetermined n cycles of the repeating waveform (n is an integer) a a looping section LP, represents this section LP start point and looping start point the end point, respectively LP _S and Rubingu end point LP _E (looping points). Then, the digital data corresponding to the analog waveform of the formant part FR and the looping section LP is recorded on a storage medium, and at the time of reproduction, the looping section LP is repeatedly reproduced following the reproduction of the formant part FR, for an arbitrary long time. Can generate music.

ここで、このようなルーピング区間LPを含む楽音デー
タをメモリ等の記憶媒体にストアするに先立ってビット
圧縮符号化することにより、データ量の低減を図るわけ
であるが、本実施例においては、本件出願人が先に特開
昭61−158217号公報、特開昭62−8629号公報あるいは特
開昭62−3516号公報等において提案しているビット圧縮
符号化方式、すなわち波高値データの所定数のサンプル
（ｈサンプル）毎にブロック化しこのブロック単位で最
適のビット圧縮を施すような高能率符号化方式を用いる
ものとし、この高能率ビット圧縮符号化方式について、
第１図を参照しながら概略的に説明する。Here, the music data including such a looping section LP is bit-compressed and encoded prior to being stored in a storage medium such as a memory, so that the data amount is reduced. The bit compression encoding method proposed by the present applicant in Japanese Patent Application Laid-Open Nos. 61-158217, 62-8629 or 62-3516, that is, a method of determining the peak value data. It is assumed that a high-efficiency encoding method is used in which blocks are formed for every number of samples (h samples) and optimal bit compression is performed in units of these blocks.
This will be schematically described with reference to FIG.

この第１図において、高能率ビット圧縮符号化システ
ムは、記録側のエンコーダ（10）と、再生側のデコーダ
（30）とにより構成されており、エンコーダ（10）の入
力端子（11）には、楽音信号の波高値データｘ（ｎ）が
供給されている。In FIG. 1, the high-efficiency bit compression encoding system includes an encoder (10) on the recording side and a decoder (30) on the reproducing side, and an input terminal (11) of the encoder (10) is , The peak value data x (n) of the tone signal.

この入力信号（の波高値データ）ｘ（ｎ）は予測器
（12）及び加算器（13）で構成されたFIR（有限インパ
ルス応答型）ディジタル・フィルタ（14）に供給され、
予測器（12）からの予測信号（の波高値データ）
（ｎ）は加算器（13）に減算信号として送られている。
加算器（13）においては、入力信号ｘ（ｎ）から予測信
号（ｎ）が減算されることによって、予測誤差信号あ
るいは広義の差分出力ｄ（ｎ）が出力される。予測器
（12）は、一般に過去のｐ個の入力ｘ（ｎ−ｐ）,x（ｎ
−ｐ＋１），‥‥,x（ｎ−１）の１次結合により予測値
（ｎ）を算出するものである。なお、FIRフィルタ（1
4）を、以下エンコード・フィルタと称す。This input signal (peak value data) x (n) is supplied to an FIR (finite impulse response) digital filter (14) composed of a predictor (12) and an adder (13).
Predicted signal from the predictor (12) (peak value data)
(N) is sent to the adder (13) as a subtraction signal.
The adder (13) subtracts the prediction signal (n) from the input signal x (n) to output a prediction error signal or a difference output d (n) in a broad sense. The predictor (12) generally has p past inputs x (n-p), x (n
−p + 1), ‥‥, x (n−1) to calculate the predicted value (n). Note that the FIR filter (1
4) is hereinafter referred to as an encoding filter.

このビット圧縮符号化システムにおいては、入力信号
（波高値データ）ｘ（ｎ）を、所定数のサンプル（ｈサ
ンプル）毎にブロック化して、各ブロック毎に最適の特
性のエンコード・フィルタ（14）を選択するようにして
いる。これは互いに異なる特性を有する複数の（例えば
４個の）エンコード・フィルタを予め設けておき、これ
らのフィルタのうち最適の特性の、すなわち最も高い圧
縮率を得ることのできるようなフィルタを選択すること
で実現し得るものである。ただし、一般のディジタル・
フィルタの構成上は、第１図に示す１個のエンコード・
フィルタ（14）の予測器（12）の係数の組を複数組（例
えば４組）係数メモリ等に記憶させておき、これらの係
数の組を時分割的に切り換え選択することで、実質的に
複数のエンコード・フィルタのうちの１つを選択するの
と等価な動作を行わせることが多い。具体例としては、
０次、１次、２種類の２次の計４種類のエンコード・フ
ィルタを用意し、これらの４種類のフィルタのいずれを
選択したかを示す２ビットのフィルタ選択情報（モード
選択情報）を各ブロック毎に伝送するようにすればよ
い。なお、０次のエンコード・フィルタを選択すること
は、入力PCM信号がそのまま出力されるストレートPCMモ
ードを選択することに相当する。In this bit compression encoding system, an input signal (peak value data) x (n) is divided into blocks for a predetermined number of samples (h samples), and an encoding filter (14) having optimum characteristics for each block. To choose. In this method, a plurality of (for example, four) encoding filters having different characteristics are provided in advance, and a filter having an optimum characteristic, that is, a filter capable of obtaining the highest compression ratio is selected from these filters. It can be realized by doing. However, general digital
In terms of the configuration of the filter, one encoding unit shown in FIG.
A plurality of sets (for example, four sets) of coefficients of the predictor (12) of the filter (14) are stored in a coefficient memory or the like, and these sets of coefficients are switched and selected in a time-sharing manner. In many cases, an operation equivalent to selecting one of a plurality of encoding filters is performed. As a specific example,
A total of four types of encoding filters, ie, 0 order, 1 order, and 2 types of secondary, are prepared, and 2-bit filter selection information (mode selection information) indicating which of these 4 types of filters has been selected is provided. What is necessary is just to transmit by block. Selecting a zero-order encoding filter corresponds to selecting a straight PCM mode in which an input PCM signal is output as it is.

次に、予測誤差としての差分出力ｄ（ｎ）は、加算器
（21）を介し、利得Ｇのシフタ（15）と量子化器（16）
とよりなるビット圧縮器に送られ、例えば浮動小数点
（フローティング・ポイント）表示形態における指数部
が利得Ｇに、仮数部が量子化器（16）からの出力にそれ
ぞれ対応するような圧縮処理あるいはレンジング処理が
施される。すなわち、シフタ（16）により例えば16ビッ
トの入力データを利得Ｇに応じたビット数だけシフトす
ることでレンジを切り換え、量子化器（16）によりビッ
ト・シフトされたデータの一定ビット数、例えば４ビッ
トを取り出すような再量子化を行っている。ここで、ノ
イズ・シェイピング回路（ノイズ・シェイパ）（17）
は、量子化器（16）の出力と入力との誤差分いわゆる量
子化誤差を加算器（18）で得て、この量子化誤差を利得
G^-1のシフタ（19）を介し予測器（20）に送って、量子
化誤差の予測信号を加算器（21）に減算信号として帰還
するようないわゆるエラー・フィードバックを行う。こ
のようにして量子化器（16）による再量子化とノイズ・
シェイピング回路（17）によるエラー・フィードバック
とが施され、出力端子（22）より出力（ｎ）が取り出
される。Next, a difference output d (n) as a prediction error is passed through an adder (21) to a gain G shifter (15) and a quantizer (16).
Compression processing or ranging in which, for example, the exponent part in the floating point display form corresponds to the gain G, and the mantissa part corresponds to the output from the quantizer (16). Processing is performed. That is, the range is switched by shifting the input data of, for example, 16 bits by the shifter (16) by the number of bits according to the gain G, and the fixed number of bits of the data bit-shifted by the quantizer (16), for example, 4 Requantization is performed to extract bits. Here, the noise shaping circuit (noise shaper) (17)
Obtains a so-called quantization error by an adder (18) corresponding to an error between an output and an input of the quantizer (16), and obtains the quantization error by a gain.
The signal is sent to the predictor (20) via the G- ¹ shifter (19), and so-called error feedback is performed in which the prediction signal of the quantization error is fed back to the adder (21) as a subtraction signal. In this way, requantization by the quantizer (16) and noise
The error feedback is performed by the shaping circuit (17), and the output (n) is taken out from the output terminal (22).

ところで、加算器（21）からの出力ｄ′（ｎ）は差分
出力ｄ（ｎ）よりノイズ・シェイパ（17）からの量子化
誤差の予測信号（ｎ）を減算したものであり、利得Ｇ
のシフタ（15）からの出力ｄ″（ｎ）は利得Ｇと出力加
算器（21）からの出力ｄ′（ｎ）を乗算したものであ
る。また、量子化器（16）からの出力（ｎ）は、量子
化の過程における量子誤差ｅ（ｎ）とシフタ（15）から
の出力ｄ″（ｎ）を加算したものとなり、ノイズ・シェ
イパ（17）の加算器（18）において量子化誤差ｅ（ｎ）
が取り出される。この量子化誤差ｅ（ｎ）は、利得G^-1
のシフタ（19）を介し、過去のｒ個の入力の１次結合を
とる予測器（20）を介することにより量子化誤差の予測
信号（ｎ）となる。The output d '(n) from the adder (21) is obtained by subtracting the prediction signal (n) of the quantization error from the noise shaper (17) from the difference output d (n).
The output d "(n) from the shifter (15) is obtained by multiplying the gain G by the output d '(n) from the output adder (21). The output from the quantizer (16) ( n) is the sum of the quantum error e (n) in the quantization process and the output d ″ (n) from the shifter (15), and the quantization error in the adder (18) of the noise shaper (17). e (n)
Is taken out. This quantization error e (n) is calculated by the gain G ^-1
And a prediction signal (n) of a quantization error by passing through a predictor (20) that takes a linear combination of the past r inputs through the shifter (19).

音源データは、以上のようなエンコード処理が施さ
れ、量子化器（16）からの出力（ｎ）となって出力端
子（22）を介して取り出される。The sound source data is subjected to the above-described encoding processing, becomes an output (n) from the quantizer (16), and is taken out via the output terminal (22).

次に予測・レンジ適応回路（24）からは、最適フィル
タ選択情報としてのモード選択情報が出力されて、エン
コード・フィルタ（14）の例えば予測器（12）および出
力端子（27）に送られ、また、利得Ｇおよび利得G^-1あ
るいはビット・シフト量を決定するためのレンジ情報が
出力されて、各シフタ（15），（19）および出力端子
（26）に送られている。これらのモード（フィルタ）選
択情報およびレンジ情報は、圧縮に関するパラメータと
も称される。Next, from the prediction / range adaptation circuit (24), mode selection information as optimal filter selection information is output and sent to, for example, a predictor (12) and an output terminal (27) of the encoding filter (14). Further, the gain G and the gain G- ¹ or the range information for determining the bit shift amount are output and sent to the shifters (15) and (19) and the output terminal (26). These mode (filter) selection information and range information are also referred to as parameters relating to compression.

次に、再生側のデコーダ（30）の入力端子（31）に
は、エンコーダ（10）の出力端子（22）からの出力
（ｎ）が伝送され、あるいは記録、再生されることによ
って得られた信号′（ｎ）が供給されている。この入
力信号′（ｎ）は利得G^-1のシフタ（32）を介して加
算器（33）に送られている。加算器（33）からの出力
ｘ′（ｎ）は予測器（34）に送られて予測信号′
（ｎ）となり、この予測信号′（ｎ）は加算器（33）
に送られてシフタ（32）からの出力″（ｎ）と加算さ
れる。この加算出力がデコード出力′（ｎ）として出
力端子（35）より出力される。Next, the output (n) from the output terminal (22) of the encoder (10) is transmitted to the input terminal (31) of the decoder (30) on the reproduction side, or obtained by being recorded and reproduced. Signal '(n) is provided. This input signal '(n) is sent to the adder (33) via the shifter (32) having a gain of G- ¹ . The output x '(n) from the adder (33) is sent to the predictor (34) and
(N), and the predicted signal '(n) is added to the adder (33).
To be added to the output "(n) from the shifter (32). The added output is output from the output terminal (35) as a decoded output '(n).

また、エンコーダ（10）の各出力端子（26）および
（27）より出力され、伝送あるいは記録，再生されたレ
ンジ情報およびモード選択信号は、デコーダ（30）の各
入力端子（36）および（37）にそれぞれ入力されてい
る。そして、入力端子（36）からのレンジ情報はシフタ
（32）に送られて利得G^-1を決定し、入力端子（37）か
らのモード選択情報は予測器（34）に送られて予測特定
を決定する。この予測器（34）の予測特性は、エンコー
ダ（10）の予測器（12）の特性に等しいものが選択され
る。The range information and mode selection signal output from the output terminals (26) and (27) of the encoder (10) and transmitted, recorded, or reproduced are input to the input terminals (36) and (37) of the decoder (30). ). The range information from the input terminal (36) is sent to the shifter (32) to determine the gain G- ^1, and the mode selection information from the input terminal (37) is sent to the predictor (34) to specify the prediction. To determine. The prediction characteristic of the predictor (34) is selected to be equal to the characteristic of the predictor (12) of the encoder (10).

このデコーダ（30）において、シフタ（32）からの出
力″（ｎ）は、入力信号′（ｎ）と利得G^-1を乗算
したものである。また、加算器（33）の出力′（ｎ）
は、シフタ（32）からの出力″（ｎ）と予測信号′
（ｎ）を加算したものである。In the decoder (30), the output "(n) from the shifter (32) is obtained by multiplying the input signal '(n) by the gain G ^-1 . The output' (n) of the adder (33) )
Is the output "(n) from the shifter (32) and the prediction signal '
(N) is added.

このような構成のビット圧縮符号化システムにおい
て、例えば第２図に示した楽器音の記録時には、第２図
のルーピング区間LPのルーピング開始点LP_Sから始まる
ブロック（ルーピング開始ブロック）の最初の所定数の
ワードを、ストレートPCMデータのワードとする。この
所定数としては、エンコード・フィルタ（14）の次数
（予測フィルタの次数）の最大値と同じかあるいはそれ
以上とすればよく、具体的ではエンコード・フィルタ
（14）の最大次数が二次であるから、２個以上のストレ
ートPCMデータのワードＷを配置するようにすればよ
い。このためには、例えばエンコード出力端子（22）の
直前に切換スイッチを設け量子化器（16）からの圧縮デ
ータとストレートPCMデータとを切換選択して出力可能
と成し、ルーピング開始点LP_Sからの所定数ワードの間
は切換スイッチによりストレートPCMデータを選択して
出力するように構成すればよい。なお、レンジ情報はブ
ロックで独立であるから、元のサンプルのワード長の例
えば16ビットのストレートPCMデータをそのまま出力す
る代わりに、同じブロックのレンジ情報に対応するビッ
トだけ圧縮した例えば４ビットのストレートPCMデータ
を出力することもできる。In such a configuration bit compression coding system, for example, at the time of recording of the instrument sound as shown in FIG. 2, the first predetermined block (looping start block) starting from the looping start point LP _S looping section LP of FIG. 2 Let the number words be words of straight PCM data. This predetermined number may be equal to or greater than the maximum value of the order of the encoding filter (14) (order of the prediction filter). More specifically, the maximum order of the encoding filter (14) is quadratic. Therefore, it is sufficient to arrange the words W of two or more straight PCM data. For this purpose, for example, a changeover switch is provided immediately before the encode output terminal (22) so that the compressed data from the quantizer (16) and the straight PCM data can be selected and output, and the looping start point LP _S It is sufficient to select and output straight PCM data by a changeover switch during a predetermined number of words from. Since the range information is independent for each block, instead of outputting the 16-bit straight PCM data having the word length of the original sample as it is, for example, a 4-bit straight data obtained by compressing only the bits corresponding to the range information of the same block is used. It can also output PCM data.

そして本例においては、会話データを音声データとし
て記録（記憶）させる際には、各会話文の先頭部分及び
フレーズの切れ目の部分をストレートPCMデータとして
おく。即ち、例えば「こんにちはいいお天気ですね
さようなら」と発音される会話データの場合、第３図に
示す如く、各文の先頭部分である「こんにちは」の先頭
部ａと、「いいお天気ですね」の先頭部ｂと、「さよう
なら」の先頭部ｃとを、ストレートPCMデータとして記
録し、再生時における第１図に示すデコーダ（30）の端
子（35）に供給されるデコード出力として、予測器（3
4）と加算器（33）による巡回が行われていない音声デ
ータとする。In this example, when the conversation data is recorded (stored) as voice data, the beginning part of each conversation sentence and the break between phrases are set as straight PCM data. In other words, it is, for example, "Hello, good weather
In the case of conversation data to be pronounced goodbye ", as shown in FIG. 3, which is the first part of each statement and the top part a of" Hello ", and the top portion (b) of the" sounds good weather ", of" goodbye " The head c is recorded as straight PCM data, and as a decode output supplied to the terminal (35) of the decoder (30) shown in FIG.
It is assumed that the audio data has not been looped by 4) and the adder (33).

そして、このような会話データを再生させる際に、任
意の箇所から再生を行う所謂ランダム再生を行うときに
は、必ずストレートPCMデータとして記録された箇所か
ら再生させる。即ち、第４図のフローチャートに示す如
く、再生時に音声データをデコードしたときに、このと
きのデコードがランダム再生で任意の箇所からの再生
か、或いは前のデータからの連続した再生であるか否か
を判別し、連続した再生であるときには再生動作を継続
して行われせる。そして、ランダム再生であるときに
は、再生音声データ中のモード選択信号を判別し、スト
レートPCMデータのモード選択信号が検出されるまで再
生音声の出力を禁止させるミューティングを行う。そし
て、ストレートPCMデータのモード選択信号が検出され
ると、ミュートを解除し、再生動作を開始させる。When reproducing such conversation data, when performing so-called random reproduction in which reproduction is performed from an arbitrary position, reproduction is always performed from a position recorded as straight PCM data. That is, as shown in the flowchart of FIG. 4, when audio data is decoded at the time of reproduction, whether the decoding at this time is reproduction from an arbitrary position by random reproduction or continuous reproduction from previous data is determined. It is determined whether the playback operation is continuous and the playback operation is continued. When random reproduction is performed, a mode selection signal in the reproduced audio data is determined, and muting is performed to inhibit output of the reproduced audio until a mode selection signal of straight PCM data is detected. When the mode selection signal of the straight PCM data is detected, the mute is released and the reproducing operation is started.

このようにしてランダム再生を行うようにしたこと
で、ランダム再生が開始される箇所は、直前の音声デー
タとは無関係なモードのPCMデータが得られるタイミン
グであるので、本来巡回されて出力されるPCMデータが
巡回されずに出力されることがなく、再生開始時の出力
音声が乱れたものになる虞れがない。なお、実際の会話
データには、会話文の先頭やフレーズの切れ目が頻繁に
現れるので、ミューティングが行われる時間は僅かであ
る場合が多い。Since random playback is performed in this manner, the point at which random playback is started is the timing at which PCM data in a mode irrelevant to the immediately preceding audio data is obtained, so that the data is cyclically output. The PCM data is not output without being circulated, and there is no possibility that the output sound at the start of the reproduction will be disturbed. In actual conversation data, since the beginning of a conversation sentence or a break in a phrase frequently appears, muting is often performed in a short time.

ここで、このような準瞬時圧縮によりPCMデータ化さ
れた音声信号の実際の使用状態について説明すると、例
えば映画が記録されたビデオディスクに映像信号と共に
記録する音声信号を、上述したように圧縮されたPCMデ
ータとする。この場合、音声信号として複数チャンネル
用意し、登場人物の会話を、複数カ国語でそれぞれ別の
チャンネルのPCMデータとして記録する。また、背景音
をこれとは別のチャンネルでPCMデータとして記録す
る。そして、再生時には用意された複数カ国語の中から
何れかを選択し、この選択したチャンネルの言語のPCM
データと背景音のチャンネルのPCMデータとをデコード
し、両チャンネルの音声信号を混合して再生させる。こ
のようにして圧縮されたPCMデータにより複数チャンネ
ル設定することで、音声データ量を大幅に減らすことが
でき、複数チャンネル分のビデオディスクへの記録が可
能になる。Here, an actual use state of an audio signal converted into PCM data by such quasi-instantaneous compression will be described.For example, an audio signal recorded together with a video signal on a video disc on which a movie is recorded is compressed as described above. PCM data. In this case, a plurality of channels are prepared as audio signals, and conversations of characters are recorded as PCM data of different channels in a plurality of languages. The background sound is recorded as PCM data on another channel. At the time of playback, one of the prepared languages is selected, and the PCM of the language of the selected channel is selected.
The data and PCM data of the background sound channel are decoded, and the audio signals of both channels are mixed and reproduced. By setting a plurality of channels using the PCM data compressed in this way, the amount of audio data can be significantly reduced, and recording on a video disc for a plurality of channels becomes possible.

そして、このようにして記録・再生を行うものに上述
した本例の音声データ再生方法を適用することで、ラン
ダム再生時には必ずストレートPCMモードの箇所から再
生され、再生開始時の会話の再生音声が乱れることがな
い。この場合、背景音のPCMデータを常時ストレートPCM
モードにして記録することで、背景音の再生も良好に行
われる。Then, by applying the above-described audio data reproducing method of the present example to a device that performs recording / reproduction in this manner, at the time of random reproduction, reproduction is always performed from the straight PCM mode, and the reproduction voice of the conversation at the start of reproduction is reproduced. There is no disturbance. In this case, the background sound PCM data is always straight PCM
By recording in the mode, the background sound can be reproduced well.

また、背景音のPCMデータも準瞬時圧縮によりPCMデー
タ化して記録した場合には、ランダム再生時に会話デー
タのチャンネルがストレートPCMモードとなる会話文の
先頭やフレーズの切れ目から再生させ、このときの背景
音のPCMデータが巡回させるモードであっても強制的に
ストレートPCMモードと見なして再生させれば良い。こ
のときには、背景音の最初の部分が多少歪むが、会話文
と異なり内容を聞き取る必要がないので、ほとんど支障
がない。Also, if the PCM data of the background sound is also converted into PCM data by semi-instantaneous compression and recorded, the channel of the conversation data at the time of random playback is played from the beginning of the conversation sentence or the break of the phrase in which the PCM mode is straight PCM mode. Even in the mode in which the PCM data of the background sound circulates, the playback may be forcibly regarded as the straight PCM mode. In this case, the first part of the background sound is slightly distorted, but unlike the conversational sentence, there is no need to listen to the contents, so there is almost no problem.

なお、本発明は上述実施例に限らず、その他種々の構
成が取り得ることは勿論である。Note that the present invention is not limited to the above-described embodiment, but may take various other configurations.

〔The invention's effect〕

本発明によると、人間の声が記録される箇所の会話及
びフレーズの切れ目の部分の音声信号を、直前の音声デ
ータとは無関係な所謂ストレートPCMモードのPCMデータ
とし、任意の箇所からランダム再生を行うときに、この
ストレートPCMモードのPCMデータとなったタイミングか
ら再生させるようにしたので、直前の音声データと関連
のある音声データが途中から再生されて不自然な音声が
再生されることがなく、常に良好な音声が再生される。
この場合、人間の声が記録されたチャンネルを含む複数
チャンネルのPCMデータを同時に任意の箇所からランダ
ム再生するときにも、この人間の声が記録されたチャン
ネルがストレートPCMモードのPCMデータとなったタイミ
ングから再生させるようにしたので、常に良好な音声が
再生される。According to the present invention, the speech signal of the part where the conversation of the human voice is recorded and the break of the phrase is set as PCM data in a so-called straight PCM mode irrelevant to the immediately preceding audio data, and random reproduction is performed from an arbitrary position. When performing, since it was made to play from the timing that became the PCM data of this straight PCM mode, audio data related to the immediately preceding audio data is played from the middle and unnatural audio is not played , Always good sound is played.
In this case, even when randomly reproducing the PCM data of a plurality of channels including the channel where the human voice was recorded from an arbitrary position at the same time, the channel where the human voice was recorded became the PCM data in the straight PCM mode. Since the reproduction is performed from the timing, a good sound is always reproduced.

[Brief description of the drawings]

第１図は本発明の一実施例による音声データ再生方法に
基づいたシステムの構成図、第２図及び第３図は一実施
例のデータ例を示す説明図、第４図は一実施例の説明に
供するフローチャート図、第５図及び第６図は音声デー
タ例を示す説明図である。（30）はデコーダ、（33）は加算器、（34）は予測器で
ある。FIG. 1 is a block diagram of a system based on an audio data reproducing method according to one embodiment of the present invention, FIGS. 2 and 3 are explanatory diagrams showing data examples of one embodiment, and FIG. FIGS. 5 and 6 are flowcharts for explanation, and are explanatory diagrams showing examples of audio data. (30) is a decoder, (33) is an adder, and (34) is a predictor.

フロントページの続き (56)参考文献特開昭62−3516（ＪＰ，Ａ) 特開昭60−113300（ＪＰ，Ａ) 特開昭58−215697（ＪＰ，Ａ) 特開平３−265309（ＪＰ，Ａ) 特許2674161（ＪＰ，Ｂ２) 特公平６−1903（ＪＰ，Ｂ２) 特公平６−101709（ＪＰ，Ｂ２) 特公昭59−12187（ＪＰ，Ｂ２) 特公平６−44200（ＪＰ，Ｂ２) 特公平８−21200（ＪＰ，Ｂ２) (58)調査した分野(Int.Cl.⁷，ＤＢ名) G10L 19/00 G10H 7/02 G11B 20/10 301 H03M 1/16 H04B 14/04 ＩＮＳＰＥＣ（ＤＩＡＬＯＧ) ＪＩＣＳＴファイル（ＪＯＩＳ) ＷＰＩ（ＤＩＡＬＯＧ)Continuation of front page (56) References JP-A-62-3516 (JP, A) JP-A-60-113300 (JP, A) JP-A-58-215697 (JP, A) JP-A-3-265309 (JP) , A) Japanese Patent No. 2674161 (JP, B2) Japanese Patent Publication No. 6-1903 (JP, B2) Japanese Patent Publication No. 6-101709 (JP, B2) Japanese Patent Publication No. 59-12187 (JP, B2) Japanese Patent Publication No. 6-44200 (JP, B2) JP, B2) JP 8-21200 (JP, B2) (58) Fields investigated (Int. Cl. ⁷ , DB name) G10L 19/00 G10H 7/02 G11B 20/10 301 H03M 1/16 H04B 14 / 04 INSPEC (DIALOG) JICST file (JOIS) WPI (DIALOG)

Claims

(57) [Claims]

An audio data reproducing method for reproducing a continuous audio signal converted into PCM data by quasi-instantaneous compression, wherein a speech at a place where a human voice is recorded and an audio signal at a break between phrases are forcibly transmitted. The PCM data in a mode irrelevant to the immediately preceding audio data is used.When this PCM data is reproduced from an arbitrary position, muting is performed until PCM data in a mode irrelevant to the immediately preceding audio data is obtained. An audio data reproduction method that reproduces from a timing at which PCM data in a mode irrelevant to the audio data is obtained.

2. An audio signal other than a human voice is converted into PCM data in a mode different from that of the immediately preceding audio data on a channel different from that of the human voice audio signal. 2. The audio data reproducing method according to claim 1, wherein the audio data is reproduced by being mixed with the digitized human voice PCM data.

3. A multi-channel PCM converted into PCM data by quasi-instantaneous compression including a channel in which a human voice is recorded.
When playing data from an arbitrary location at the same time, the channel on which the human voice was recorded becomes the PCM data in a mode irrelevant to the previous audio data, and the PCM data of other channels are played back at the same time. 2. The audio data reproducing method according to claim 1, wherein the reproduction is forcibly regarded as a mode unrelated to the immediately preceding audio data from the timing.