JPH1011096A

JPH1011096A - Karaoke device

Info

Publication number: JPH1011096A
Application number: JP8178536A
Authority: JP
Inventors: Yuji Senba; 祐二仙場
Original assignee: Yamaha Corp
Current assignee: Yamaha Corp
Priority date: 1996-06-19
Filing date: 1996-06-19
Publication date: 1998-01-16
Anticipated expiration: 2016-06-19
Also published as: JP3261982B2

Abstract

PROBLEM TO BE SOLVED: To shorten a transfer time and reduce the storage capacity of music data by decoding voice data on the basis of a voice spectrum pattern read out on the basis of index information, and auxiliary information. SOLUTION: A communication interface 6 reproduces sent music data into the original music data according to its communication system and outputs the data to a hard disk drive 5, and sends a history of KARAOKE performances stored in the hard disk drive 5, etc., to a host computer 90 according to the communication system. A vector quantization data decoding means 71 converts index information included in vector quantization voice waveform data into a spectrum pattern according to a code book 81 in response to an indication for a voice track in the music data, decodes the original digital voice waveform data on the basis of the converted spectrum pattern and auxiliary information, and outputs the data to a mixer circuit 12. The code book 81 converts the index information to the spectrum pattern of the voice waveform data.

Description

DETAILED DESCRIPTION OF THE INVENTION

【０００１】[0001]

【発明の属する技術分野】この発明は、符号化された楽
曲データに基づいて楽音を生成する音源を内蔵したカラ
オケ装置に係り、特に楽曲データに含まれる音声データ
などのデータ符号化に変更を加えたカラオケ装置に関す
る。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a karaoke apparatus having a built-in sound source for generating a musical tone based on encoded music data, and particularly to a data encoding method for audio data included in music data. Karaoke apparatus.

【０００２】[0002]

【従来の技術】最も単純なカラオケ装置は、楽曲をアナ
ログ信号として記録したテープを再生することによって
楽曲を再生するものであったが、電子技術の発達に伴っ
て、テープがＣＤ（ＣｏｍｐａｃｔＤｉｓｋ）やＬＤ
（ＬａｓｅｒＤｉｓｋ）に代わり、記録される信号も
アナログ信号からディジタル信号に代わり、それらに記
録されるデータも楽曲データだけでなく映像データや歌
詞データなどの種々の情報が付加されるようになってき
た。そして、最近では、ＣＤやＬＤに代えて、通信回線
（一般の電話回線やＩＳＤＮ回線）を介して楽曲データ
を取り込み、それを音源とシーケンサを用いて演奏する
通信型のカラオケ装置が急速に普及してきた。この通信
型のカラオケ装置には、再生する楽曲データをその都度
通信回線を介して取り込んで再生する非蓄積型のもの
と、取り込んだ楽曲データを内蔵の記憶装置（ハードデ
ィスクなど）に蓄積しておき、必要な時に読み出して再
生する蓄積型とがある。現在では通信コストの点から、
蓄積型のカラオケ装置が主流になっている。このような
通信型のカラオケ装置には、通信時間（通信コスト）や
記憶容量を極力低く抑えるために、１曲当たりの楽曲デ
ータのデータ量を少なくするために最新のデータ圧縮方
法や通信方法が導入されている。例えば、楽曲データと
して、ＣＤやＬＤに記録されていたディジタルデータを
そのまま使用して通信していたのでは、通信時間や通信
コストの点から通信型のカラオケ装置は成り立たない。
そこで、通信型のカラオケ装置は、楽曲データの中の演
奏に関するデータや歌詞に関するデータなどをＭＩＤＩ
（ＭｕｓｉｃａｌＩｎｓｔｒｕｍｅｎｔＤｉｇｉｔａ
ｌＩｎｔｅｒｆａｃｅ）規格に準拠したデータ（以下
「ＭＩＤＩデータ」という）に符号化し、このＭＩＤＩ
データに符号化することのできない音声データなどをＡ
ＤＰＣＭ（ＡｄａｐｔｉｖｅＤｉｆｆｅｒｅｎｔｉａ
ｌＰｕｌｓｅＣｏｄｅＭｏｄｕｌａｔｉｏｎ）デ
ータに符号化することによって、１曲当たりの楽曲デー
タのデータ量を少なくして通信を行っていた。2. Description of the Related Art The simplest karaoke apparatus reproduces music by reproducing a tape in which the music is recorded as an analog signal. However, with the development of electronic technology, the tape has been replaced with a CD (Compact Disk). And LD
Instead of (Laser Disk), the signal to be recorded is also changed from an analog signal to a digital signal, and the data recorded therein is not only music data but also various information such as video data and lyrics data. Was. Recently, a communication type karaoke apparatus which takes in music data via a communication line (general telephone line or ISDN line) instead of a CD or LD and plays it using a sound source and a sequencer has rapidly spread. I've been. This communication type karaoke apparatus has a non-storage type in which music data to be reproduced is fetched and reproduced through a communication line each time, and a karaoke apparatus in which fetched music data is stored in a built-in storage device (such as a hard disk). There is a storage type that reads out and reproduces when necessary. At present, in terms of communication costs,
Storage type karaoke devices have become mainstream. In such a communication type karaoke apparatus, the latest data compression method and communication method are used in order to minimize the communication time (communication cost) and the storage capacity and to reduce the data amount of music data per music. Has been introduced. For example, if communication is performed using digital data recorded on a CD or LD as music data as it is, a communication-type karaoke apparatus cannot be established in terms of communication time and communication cost.
Therefore, the communication type karaoke apparatus transmits MIDI-related data and lyrics-related data in the music data to MIDI.
(Musical Instrument Digita
l Interface) data (hereinafter referred to as “MIDI data”).
A voice data that cannot be encoded into data
DPCM (Adaptive Differentia)
Communication was performed by reducing the data amount of music data per music by encoding the data into l Pulse Code Modulation (lPulse Code Modulation) data.

【０００３】[0003]

【発明が解決しようとする課題】ところが、ＡＤＰＣＭ
データは、圧縮してあるとはいっても、ＭＩＤＩデータ
に比べるとそのデータ量が比較的大きいため、蓄積型の
カラオケ装置の記憶装置内の大半（約３分の２程度）を
占有し、カラオケ装置内に蓄積可能な楽曲データの数を
制限する要因の一つとなっていた。また、これは、楽曲
データを通信する場合の通信時間（通信コスト）の短縮
化を制限する要因でもあった。この発明は上述の点に鑑
みてなされたものであり、通信回線などの信号線を介し
て楽曲データを転送する際に、その転送時間を大幅に短
縮化できると共に楽曲データの蓄積容量を少なくするこ
とのできるカラオケ装置を提供することを目的とする。SUMMARY OF THE INVENTION However, ADPCM
Even though the data is compressed, the data amount is relatively large as compared with the MIDI data. Therefore, the data occupies most (about two-thirds) in the storage device of the storage type karaoke apparatus, and This is one of the factors that limit the number of music data that can be stored in the device. This is also a factor that limits shortening of communication time (communication cost) when communicating music data. The present invention has been made in view of the above points, and when transferring music data via a signal line such as a communication line, the transfer time can be greatly reduced and the storage capacity of the music data can be reduced. It is an object of the present invention to provide a karaoke apparatus that can perform the karaoke.

【０００４】[0004]

【課題を解決するための手段】この発明に係るカラオケ
装置は、転送されてきた楽曲データを複数記憶する記憶
手段と、選曲操作に基づき、該選曲された楽曲データの
伴奏データを読み出して伴奏楽音を発生するとともに、
該伴奏楽音に同期して音声データを読み出し発音するカ
ラオケ装置において、前記音声データをベクトル量子化
波形データであるインデックス情報とスペクトル包絡に
関する補助情報とから構成するとともに、前記インデッ
クス情報と音声スペクトルパターンとの関係を記憶した
テーブル記憶手段と、前記インデックス情報をもとに前
記テーブル記憶手段から音声スペクトルパターンを読み
出す読み出し手段と、前記読み出した音声スペクトルパ
ターンと前記補助情報に基づいて音声データを復号する
復号化手段とを備えたものである。従来のカラオケ装置
は、ＡＤＰＣＭデータ圧縮技術によってデータ圧縮され
た音声波形データ（ＡＤＰＣＭ音声波形データ）に基づ
いてカラオケ演奏処理を行っていた。この発明では、Ａ
ＤＰＣＭデータ圧縮技術よりも高い圧縮率でデータを圧
縮することのできるベクトル量子化技術を採用する。こ
のベクトル量子化技術はＡＤＰＣＭデータ圧縮技術の約
３倍の圧縮率で音声波形データを圧縮することができる
ものであり、音声波形データを、その音声スペクトルパ
ターンを特定するインデックス情報とスペクトル包絡に
関する補助情報とに圧縮する。この発明では、ベクトル
量子化技術によってデータ圧縮された音声波形データ
（ベクトル量子化音声波形データ）が通信回線などの信
号線を介して転送されて来る。そこで、カラオケ装置
は、転送されてきたベクトル量子化音声波形データを元
の音声波形データに復号し、復号された音声波形データ
を用いてカラオケ演奏を行う。これによって、従来のＡ
ＤＰＣＭ音声波形データを転送する場合に比べてその通
信時間を大幅に短縮化することができる。また、カラオ
ケ装置は、受信したベクトル量子化音声波形データをそ
のままハードディスク装置などの記憶媒体に記憶できる
ので、楽曲データの蓄積容量を少なくすることもでき
る。A karaoke apparatus according to the present invention has a storage means for storing a plurality of transferred music data and an accompaniment music tone by reading accompaniment data of the selected music data based on a music selection operation. Together with
In a karaoke apparatus for reading and producing audio data in synchronization with the accompaniment music, the audio data is composed of index information as vector quantized waveform data and auxiliary information relating to a spectrum envelope, and the index information and the audio spectrum pattern are included. Table reading means for storing the relationship of the above, reading means for reading a voice spectrum pattern from the table storing means based on the index information, and decoding for decoding voice data based on the read voice spectrum pattern and the auxiliary information. Means. A conventional karaoke apparatus has performed karaoke performance processing based on audio waveform data (ADPCM audio waveform data) that has been data-compressed by the ADPCM data compression technique. In the present invention, A
A vector quantization technique capable of compressing data at a higher compression rate than the DPCM data compression technique is employed. This vector quantization technique is capable of compressing audio waveform data at approximately three times the compression rate of the ADPCM data compression technique, and converts the audio waveform data into index information for specifying the audio spectrum pattern and auxiliary information relating to the spectrum envelope. Compress with information. According to the present invention, audio waveform data (vector quantized audio waveform data) compressed by the vector quantization technique is transferred via a signal line such as a communication line. Therefore, the karaoke apparatus decodes the transferred vector quantized audio waveform data into the original audio waveform data, and performs a karaoke performance using the decoded audio waveform data. Thereby, the conventional A
The communication time can be greatly reduced as compared with the case where the DPCM audio waveform data is transferred. Further, the karaoke apparatus can store the received vector quantized audio waveform data in a storage medium such as a hard disk apparatus as it is, so that the storage capacity of music data can be reduced.

【０００５】[0005]

【発明の実施の形態】以下、添付図面を参照して、この
発明の実施の形態を詳細に説明する。図１はこの発明に
係るカラオケ装置７０の一実施の形態の全体構成を示す
概略ブロック図である。この実施の形態ではカラオケ装
置７０が通信インターフェイス６及び通信ネットワーク
８０を介してホストコンピュータ９０に接続され、ホス
トコンピュータ９０から配信された楽曲データを受信
し、内蔵ハードディスクに記憶する蓄積型のカラオケ装
置７０について説明する。なお、この実施の形態では、
ホストコンピュータ９０がディジタル音声波形データ１
〜ｎをベクトル量子化技術によって圧縮し、圧縮された
ディジタル音声波形データ（以下、「ベクトル量子化音
声波形データ」とする）をヘッダ部及びＭＩＤＩデータ
部に付加して図２（Ａ）のような楽曲データを構成し、
それを所定の通信方式に従って通信ネットワーク８０を
介してカラオケ装置７０に送信する。カラオケ装置７０
は、受信したベクトル量子化音声波形データをハードデ
ィスク装置５に順次記憶する場合について説明する。な
お、このベクトル量子化技術については、図４にて後述
する。Embodiments of the present invention will be described below in detail with reference to the accompanying drawings. FIG. 1 is a schematic block diagram showing the overall configuration of a karaoke apparatus 70 according to an embodiment of the present invention. In this embodiment, a karaoke apparatus 70 is connected to a host computer 90 via a communication interface 6 and a communication network 80, receives music data distributed from the host computer 90, and stores the music data in a built-in hard disk. Will be described. In this embodiment,
When the host computer 90 receives the digital audio waveform data 1
.. N are compressed by a vector quantization technique, and the compressed digital audio waveform data (hereinafter referred to as “vector-quantized audio waveform data”) is added to a header portion and a MIDI data portion, as shown in FIG. Composed music data,
It is transmitted to the karaoke apparatus 70 via the communication network 80 according to a predetermined communication method. Karaoke device 70
Will be described for the case where the received vector quantized audio waveform data is sequentially stored in the hard disk device 5. This vector quantization technique will be described later with reference to FIG.

【０００６】このカラオケ装置７０は、マイクロプロセ
ッサユニット（ＣＰＵ）１、プログラムメモリ（ＲＯ
Ｍ）２、ワーキングメモリ（ＲＡＭ）３からなるマイク
ロコンピュータシステムの制御の下で各種の処理を実行
するようになっている。ＣＰＵ１は、このカラオケ装置
７０全体の動作を制御する。このＣＰＵ１に対して、デ
ータ及びアドレスバス２１を介してプログラムメモリ
（ＲＯＭ）２、ワーキングメモリ（ＲＡＭ）３、パネル
インターフェイス４、ハードディスク装置（ＨＤＤ）
５、通信インターフェイス６、ベクトル量子化データ復
号化手段７１、コードブック８１、音源回路１０、エフ
ェクト手段１４、映像作成回路１６及び背景映像再生回
路１８が接続されている。なお、これ以外にも、ＭＩＤ
Ｉインターフェイス回路やＬＤ（ＣＤ）チェンジャーか
らなる背景映像再生装置などの付属装置が接続されてい
るが、ここでは説明を省略する。The karaoke apparatus 70 includes a microprocessor unit (CPU) 1 and a program memory (RO).
M) 2 and various processes are executed under the control of a microcomputer system including a working memory (RAM) 3. The CPU 1 controls the operation of the karaoke apparatus 70 as a whole. For this CPU 1, a program memory (ROM) 2, a working memory (RAM) 3, a panel interface 4, a hard disk device (HDD) via a data and address bus 21.
5, communication interface 6, vector quantized data decoding means 71, codebook 81, sound source circuit 10, effect means 14, video creation circuit 16, and background video playback circuit 18 are connected. In addition, MID
Attached devices such as an I interface circuit and a background video reproducing device including an LD (CD) changer are connected, but the description is omitted here.

【０００７】プログラムメモリ２はＣＰＵ１のシステム
関連のプログラム、またハードディスク装置５に記憶さ
れたシステム関連のプログラムをロードするプログラ
ム、各種のパラメータやデータなどを記憶しているもの
であり、リードオンリメモリ（ＲＯＭ）で構成されてい
る。ワーキングメモリ３は、ハードディスク装置５から
ロードされたシステムプログラム、またＣＰＵ１がプロ
グラムを実行する際に発生する各種のデータを一時的に
記憶するものであり、ランダムアクセスメモリ（ＲＡ
Ｍ）の所定のアドレス領域がそれぞれ割り当てられ、レ
ジスタやフラグ等として利用される。パネルインターフ
ェイス４は、カラオケ装置７０のパネル（図示せず）上
に設けられた各種操作子やリモコン装置（図示せず）な
どからの指令信号をＣＰＵ１の処理可能な信号に変換し
てデータ及びアドレスバス２１に出力する。The program memory 2 stores a system-related program of the CPU 1, a program for loading a system-related program stored in the hard disk device 5, various parameters and data, and the like. ROM). The working memory 3 temporarily stores a system program loaded from the hard disk device 5 and various data generated when the CPU 1 executes the program.
M) predetermined address areas are respectively allocated and used as registers and flags. The panel interface 4 converts command signals from various controls and a remote control device (not shown) provided on a panel (not shown) of the karaoke apparatus 70 into signals that can be processed by the CPU 1, and converts them into data and addresses. Output to the bus 21.

【０００８】ハードディスク装置５は、カラオケ装置７
０のシステムプログラムや楽曲データを記憶するもので
あり、例えば数百Ｍから数Ｇの記憶容量のもので構成さ
れる。この実施の形態では、ハードディスク装置５に記
憶される楽曲データの中に含まれる音声データはベクト
ル量子化技術によって圧縮されている。なお、このハー
ドディスク装置５に記憶される楽曲データは通信ネット
ワーク８０を介して取り込まれるだけではなく、図示し
ていないフロッピーディスクドライバやＣＤ−ＲＯＭド
ライバなどから読み込まれて記録されてもよいことは言
うまでもない。通信インターフェイス６は、通信ネット
ワーク８０を介して送信されてきた楽曲データをその通
信方式に従って元の楽曲データに再現し、ハードディス
ク装置５に出力したり、ハードディスク装置５に記憶さ
れたカラオケ演奏の履歴等を通信方式に従ってホストコ
ンピュータ９０へ送信する。ベクトル量子化データ復号
化手段７１は、楽曲データ中の音声トラックの指示に従
いベクトル量子化音声波形データに含まれるインデック
ス情報をコードブック８に基づいてスペクトルパターン
に変換し、変換されたスペクトルパターンと補助情報と
に基づいて元のディジタル音声波形データを復号化し、
ミキサ回路１２へ出力する。コードブック８１は、イン
デックス情報を音声波形データのスペクトルパターンに
変換するための変換テーブルである。コードブック８
は、専用のメモリで構成されていてもよいし、ハードデ
ィスク装置５の適当な場所に記憶されていてもよい。コ
ードブック８のデータは、通信ネットワーク８０を介し
て取り込まれてもよいし、図示していないフロッピーデ
ィスクドライバやＣＤ−ＲＯＭドライバなどから読み込
まれて記録されてもよい。すわなち、この実施の形態に
係るカラオケ装置７０では、ベクトル量子化技術によっ
て圧縮された音声波形データを含む楽曲データを通信回
線８０を介して取り込み、それをハードディスク装置５
に記憶し、曲の演奏にあたっては、その音声データをベ
クトル量子化データ復号化手段７１に出力して、コード
ブック８１に記憶されたスペクトルパターンを参照の
元、音声再生を実現している。[0008] The hard disk device 5 includes a karaoke device 7
It stores zero system programs and music data, and has a storage capacity of several hundred M to several G, for example. In this embodiment, audio data included in the music data stored in the hard disk device 5 is compressed by a vector quantization technique. It is needless to say that the music data stored in the hard disk device 5 is not only taken in through the communication network 80 but may be read and recorded from a floppy disk driver or a CD-ROM driver (not shown). No. The communication interface 6 reproduces the music data transmitted via the communication network 80 into the original music data in accordance with the communication system, and outputs the original music data to the hard disk device 5 or the karaoke performance history stored in the hard disk device 5. To the host computer 90 according to the communication method. The vector quantized data decoding means 71 converts the index information included in the vector quantized audio waveform data into a spectrum pattern based on the codebook 8 according to the instruction of the audio track in the music data, and converts the converted spectrum pattern and the auxiliary Decoding the original digital audio waveform data based on the information and
Output to the mixer circuit 12. The code book 81 is a conversion table for converting the index information into a spectrum pattern of the audio waveform data. Codebook 8
May be configured by a dedicated memory, or may be stored in an appropriate location of the hard disk device 5. The data of the code book 8 may be taken in via the communication network 80, or may be read and recorded from a floppy disk driver or a CD-ROM driver (not shown). That is, in the karaoke apparatus 70 according to the present embodiment, the music data including the audio waveform data compressed by the vector quantization technique is fetched via the communication line 80, and the karaoke apparatus 70 receives the music data.
When performing a music, the audio data is output to the vector quantization data decoding means 71, and audio reproduction is realized based on the spectral pattern stored in the code book 81.

【０００９】音源回路１０は、複数チャンネルで楽音信
号の同時発生が可能であり、データ及びアドレスバス２
１を経由して与えられた楽音トラック上のＭＩＤＩ規格
に準拠したデータを入力し、このデータに基づいた楽音
信号を生成し、それをミキサ回路１２に出力する。音源
回路１０において複数チャンネルで楽音信号を同時に発
音させる構成としては、１つの回路を時分割で使用する
ことによって複数の発音チャンネルを形成するようなも
のや、１つの発音チャンネルが１つの回路で構成される
ような形式のものであってもよい。また、音源回路１０
における楽音信号発生方式はいかなるものを用いてもよ
い。例えば、発生すべき楽音の音高に対応して変化する
アドレスデータに応じて波形メモリに記憶した楽音波形
サンプル値データを順次読み出すメモリ読み出し方式
（波形メモリ方式）、又は上記アドレスデータを位相角
パラメータデータとして所定の周波数変調演算を実行し
て楽音波形サンプル値データを求めるＦＭ方式、あるい
は上記アドレスデータを位相角パラメータデータとして
所定の振幅変調演算を実行して楽音波形サンプル値デー
タを求めるＡＭ方式等の公知の方式を適宜採用してもよ
い。また、これらの方式以外にも、自然楽器の発音原理
を模したアルゴリズムにより楽音波形を合成する物理モ
デル方式、基本波に複数の高調波を加算することで楽音
波形を合成する高調波合成方式、特定のスペクトル分布
を有するフォルマント波形を用いて楽音波形を合成する
フォルマント合成方式、ＶＣＯ、ＶＣＦ及びＶＣＡを用
いたアナログシンセサイザ方式等を採用してもよい。ま
た、専用のハードウェアを用いて音源回路を構成するも
のに限らず、ＤＳＰとマイクロプログラムを用いて音源
回路を構成するようにしてもよいし、ＣＰＵとソフトウ
ェアのプログラムで音源回路を構成するようにしてもよ
い。The tone generator circuit 10 is capable of simultaneously generating a musical tone signal on a plurality of channels.
1 to input data conforming to the MIDI standard on the musical tone track given, generate a musical tone signal based on this data, and output it to the mixer circuit 12. The tone generator circuit 10 can simultaneously generate tone signals on a plurality of channels by using a single circuit in a time-division manner to form a plurality of tone channels, or a single tone channel can be constituted by a single circuit. It may be of the type as described below. The sound source circuit 10
May be used as the tone signal generation system. For example, a memory reading method (waveform memory method) for sequentially reading out tone waveform sample value data stored in a waveform memory in accordance with address data that changes in accordance with a pitch of a musical tone to be generated, or a phase angle parameter An FM method for executing a predetermined frequency modulation operation as data to obtain tone waveform sample value data, or an AM method for executing a predetermined amplitude modulation operation using the address data as phase angle parameter data to obtain tone waveform sample value data, or the like. May be appropriately adopted. In addition to these methods, a physical model method that synthesizes a musical sound waveform by an algorithm that simulates the sounding principle of a natural musical instrument, a harmonic synthesis method that synthesizes a musical sound waveform by adding a plurality of harmonics to a fundamental wave, A formant synthesis method of synthesizing a musical tone waveform using a formant waveform having a specific spectral distribution, an analog synthesizer method using VCO, VCF, and VCA may be employed. In addition, the tone generator circuit is not limited to the one that uses the dedicated hardware, and the tone generator circuit may be configured using a DSP and a microprogram. Alternatively, the tone generator circuit may be configured using a CPU and a software program. It may be.

【００１０】ミキサ回路１２は、音源回路１０からの楽
音信号と、ベクトル量子化データ復号化手段７１からの
音声信号と、マイク１３からの音声信号とをミキシング
し、それをエフェクト手段１４に出力する。エフェクト
手段１４は、ミキサ回路１２からの楽音信号、音声信号
に残響やリバーブなどの効果を付与して音響出力装置１
５に出力する。エフェクト手段１４は、楽曲データの中
の効果制御トラックの制御データに応じて効果の種類や
程度を制御する。音響出力装置１５は、エフェクト手段
１４からの楽音信号及び音声信号をアンプ及びスピーカ
からなるサウンドシステムを介して発音する。[0010] The mixer circuit 12 mixes the tone signal from the tone generator circuit 10, the audio signal from the vector quantized data decoding means 71, and the audio signal from the microphone 13, and outputs it to the effect means 14. . The effect means 14 adds an effect such as reverberation or reverb to the tone signal and the audio signal from the mixer circuit 12 to provide the sound output device 1 with the effect.
5 is output. The effect means 14 controls the type and degree of the effect according to the control data of the effect control track in the music data. The sound output device 15 emits a tone signal and a sound signal from the effect means 14 via a sound system including an amplifier and a speaker.

【００１１】映像作成回路１６は、歌詞トラックに記録
されているＭＩＤＩデータに基づいて作成された文字コ
ードと、その表示箇所に関する文字データと、歌詞の表
示時間に関する表示時間データと、歌詞の表示色を曲進
行に合わせて順次変化させるためのワイプシーケンスデ
ータとに基づいてモニタ画面に表示される歌詞映像を作
成する。背景映像再生回路１８は、演奏される楽曲のジ
ャンルに対応した所定の背景映像をＣＤ−ＲＯＭ１７か
ら選択的に再生し、それを映像ミキサ回路１９に出力す
る。映像ミキサ回路１９は、背景映像再生回路１８から
の背景映像に映像作成回路１６からの歌詞映像をスーパ
ーインポーズで重ね合わせて、映像出力回路２０に出力
する。映像出力回路２０は、映像ミキサ回路１９によっ
てミキシングされた背景映像と歌詞映像の合成画像をモ
ニタ画面上に表示する。The video creation circuit 16 is provided with a character code created based on the MIDI data recorded on the lyrics track, character data relating to the display location, display time data relating to the display time of the lyrics, and the display color of the lyrics. Is generated on the monitor screen on the basis of the wipe sequence data for sequentially changing the song according to the progress of the music. The background video reproduction circuit 18 selectively reproduces a predetermined background video corresponding to the genre of the music to be played from the CD-ROM 17 and outputs it to the video mixer circuit 19. The video mixer circuit 19 superimposes the lyrics video from the video creating circuit 16 on the background video from the background video reproducing circuit 18 in a superimposed manner, and outputs the superimposed lyrics video to the video output circuit 20. The video output circuit 20 displays a composite image of the background video and the lyrics video mixed by the video mixer circuit 19 on a monitor screen.

【００１２】図２は図１のカラオケ装置７０が通信ネッ
トワークを介して受信して記憶する１曲分の楽曲データ
の構成例を示す図である。楽曲データは図２（Ａ）に示
すようにヘッダ部３１、ＭＩＤＩデータ部３２及び音声
データ部３３からなる。ヘッダ部３１は、この楽曲デー
タに関する種々のデータからなり、具体的には、楽曲
名、楽曲の属するジャンル名、発売日、演奏時間などの
データである。これ以外にも、ヘッダ部３１には通信さ
れた日付やアクセスされた日付や回数などの付随的な情
報が記録される場合がある。ＭＩＤＩデータ部３２は、
楽音トラック、歌詞トラック、音声トラック及び効果制
御トラックからなる。楽音トラックには、楽曲に応じた
メロディパート、伴奏パート、リズムパートなどの演奏
データが記録される。演奏データは、ＭＩＤＩ規格に準
拠したデータであり、イベントとイベントとの間の時間
間隔を示すデュレーションタイムデータΔｔと、そのイ
ベントの種類（発音開始命令、発音停止命令など）を示
すステータスデータと、発音開始又は発音停止される音
高を指定するピッチ指定データと、発音時の音量を指定
する音量指定データとからなる。音量指定データはステ
ータスデータが発音開始命令の場合に付与される。歌詞
トラックには、図示していないモニタ画面に表示される
べき歌詞に関するデータがＭＩＤＩのシステム・エクス
クルーシブ・メッセージ形式で記録される。すなわち、
この歌詞トラックに記録されるＭＩＤＩデータは、表示
する歌詞に対応した文字コード及びその表示箇所に関す
る文字データと、歌詞の表示時間に関する表示時間デー
タと、歌詞の表示色を曲進行に合わせて順次変化させる
ためのワイプシーケンスデータとからなる。音声トラッ
クには、音声データ部に記録されている音声波形データ
の発生に関するデータが図２（Ｂ）に示すようなＭＩＤ
Ｉのシステム・エクスクルーシブ・メッセージ形式で記
録される。すなわち、この音声トラックに記録されるＭ
ＩＤＩデータは、音声波形データの発生タイミングに関
するデータと、そのタイミングで発生されるべき音声波
形データを指定するデータと、その音声データの音量、
ピッチを指定するデータとからなる。効果制御トラック
には、エフェクト手段１４の制御に関するＭＩＤＩデー
タが記憶される。歌詞トラック及び効果制御トラックも
図２（Ｂ）のようなＭＩＤＩ規格に準拠したデータとし
て送信され、ハードディスク装置５に記憶される。FIG. 2 is a diagram showing an example of the configuration of one piece of music data received and stored by the karaoke apparatus 70 of FIG. 1 via a communication network. The music data includes a header section 31, a MIDI data section 32, and an audio data section 33 as shown in FIG. The header section 31 is composed of various data related to the music data, and specifically, is data such as a music title, a genre name to which the music belongs, a release date, a performance time, and the like. In addition, the header section 31 may record additional information such as a communication date, a date accessed, and a number of times. The MIDI data section 32
It consists of a music track, a lyrics track, a voice track and an effect control track. In the music track, performance data such as a melody part, an accompaniment part, and a rhythm part according to the music are recorded. The performance data is data conforming to the MIDI standard, and includes duration time data Δt indicating a time interval between events, status data indicating the type of the event (such as a sounding start command and a sounding stop command), It is composed of pitch designation data for designating a pitch at which sound is started or stopped, and volume designation data for designating a sound volume at the time of sound production. The volume designation data is given when the status data is a sound generation start command. In the lyrics track, data relating to lyrics to be displayed on a monitor screen (not shown) is recorded in a MIDI system exclusive message format. That is,
The MIDI data recorded on the lyrics track sequentially changes the character code corresponding to the lyrics to be displayed and the character data relating to the display location, the display time data relating to the display time of the lyrics, and the display color of the lyrics in accordance with the progress of the music. And wipe sequence data. In the audio track, data relating to the generation of audio waveform data recorded in the audio data section is recorded as an MID as shown in FIG.
I is recorded in the system exclusive message format. That is, M recorded on this audio track
The IDI data includes data relating to the generation timing of audio waveform data, data designating audio waveform data to be generated at that timing, the volume of the audio data,
It consists of data specifying the pitch. The effect control track stores MIDI data relating to control of the effect means 14. The lyrics track and the effect control track are also transmitted as data conforming to the MIDI standard as shown in FIG.

【００１３】図２（Ｃ）は、ベクトル量子化技術によっ
て量子化された音声データ部３３のデータ構成の一例を
示す図である。この音声データ部３３のデータ１〜ｎ
は、楽曲に対応して発生されるバックコーラス、お手本
音声およびデュエット音声などに関する音声波形データ
のスペクトル包絡に関する補助情報３７〜３９と、その
音声波形データのスペクトルパターンを特定するインデ
ックス情報３４〜３６とからなる。補助情報とインデッ
クス情報とで１フレームのデータが構成され、フレーム
の先頭にはスタートデータが、フレームの末尾にはエン
ドデータがそれぞれ記録される。図では、３フレーム分
のデータが示されているが、音声データ部３３のデータ
は実際には多数のフレームで構成される。図３は、コー
ドブック８の内容を示す図である。例えば、インデック
ス情報が『１』の場合はスペクトルパターン１が該フレ
ームのスペクトルとして読み出され、『２』の場合はス
ペクトルパターン２が該フレームのスペクトルとして読
み出される。FIG. 2C is a diagram showing an example of the data configuration of the audio data section 33 quantized by the vector quantization technique. Data 1 to n of the audio data section 33
Include auxiliary information 37 to 39 relating to the spectral envelope of audio waveform data relating to the back chorus, sample audio, duet audio, etc., generated according to the music, and index information 34 to 36 for specifying the spectral pattern of the audio waveform data. Consists of One frame of data is composed of the auxiliary information and the index information. Start data is recorded at the beginning of the frame, and end data is recorded at the end of the frame. In the figure, data for three frames is shown, but the data of the audio data unit 33 is actually composed of many frames. FIG. 3 is a diagram showing the contents of the code book 8. For example, when the index information is “1”, the spectrum pattern 1 is read as the spectrum of the frame, and when the index information is “2”, the spectrum pattern 2 is read as the spectrum of the frame.

【００１４】図４は、音声波形データをベクトル量子化
音声波形データに圧縮する動作の概略を説明する図であ
る。図４（Ａ）のような、音声波形データが存在する場
合、その音声波形データの一部を図４（Ｂ）の長方形４
０に示すような領域で切り出す。図４（Ｂ）のように切
り出された波形データは、離散的コサイン変換や離散的
フーリエ変換などの直交変換（ＭＤＣＴ）部４１によっ
て、図４（Ｃ）のような周波数領域の信号すなわちスペ
クトル信号に変換される。また、切り出されたデータ
は、線形予測分析（ＬＰＣ）部４２によって、図４
（Ｄ）のようなスペクトル包絡情報に変換される。量子
化部４３は図４（Ｄ）のようなスペクトル包絡情報と、
その音声パワー情報とを補助情報として量子化する。図
４（Ｃ）の周波数領域の信号（スペクトル信号）は、正
規化部４４によって、図４（Ｅ）のように正規化された
スペクトルパターンに変換される。ここでは、図４
（Ｄ）のスペクトル包絡情報によって図４（Ｃ）の周波
数領域の信号を除算することによって、正規化されたス
ペクトルパターンを得る場合について説明するが、これ
以外の方法で正規化してもよい。正規化されたスペクト
ルパターンは、量子化部４５によって、図３のコードブ
ック８に記憶されたスペクトルパターンのうち最も近い
スペクトルパターンに対応するインデックス情報に量子
化される。このようにして量子化部４３及び量子化部４
５によってそれぞれ量子化された補助情報とインデック
ス情報が図２（Ｃ）のように構成されて、音声データ部
のデータ１〜ｎを表すベクトル量子化音声波形データと
して、楽曲データに付加される。FIG. 4 is a diagram schematically illustrating an operation of compressing the speech waveform data into vector quantized speech waveform data. When audio waveform data exists as shown in FIG. 4A, a part of the audio waveform data is represented by a rectangle 4 in FIG.
Cut out the area as shown in FIG. The waveform data cut out as shown in FIG. 4B is subjected to an orthogonal transform (MDCT) unit 41 such as a discrete cosine transform or a discrete Fourier transform, so as to obtain a signal in the frequency domain as shown in FIG. Is converted to Further, the cut-out data is processed by a linear prediction analysis (LPC) unit 42 in FIG.
It is converted into spectrum envelope information as shown in (D). The quantization unit 43 includes spectrum envelope information as shown in FIG.
The audio power information is quantized as auxiliary information. The signal (spectral signal) in the frequency domain of FIG. 4C is converted by the normalization unit 44 into a normalized spectrum pattern as shown in FIG. Here, FIG.
A case where a normalized spectrum pattern is obtained by dividing the signal in the frequency domain of FIG. 4C by the spectrum envelope information of FIG. 4D will be described. However, normalization may be performed by any other method. The normalized spectral pattern is quantized by the quantization unit 45 into index information corresponding to the closest spectral pattern among the spectral patterns stored in the code book 8 of FIG. Thus, the quantization unit 43 and the quantization unit 4
The auxiliary information and index information respectively quantized by 5 are configured as shown in FIG. 2C, and are added to the music data as vector quantized audio waveform data representing data 1 to n of the audio data portion.

【００１５】カラオケ装置７０は、ベクトル量子化音声
波形データを音声データ部のデータとして含む楽曲デー
タを通信ネットワーク８０及び通信インターフェイス６
を介して受信し、それをハードディスク装置５に記憶す
る。楽曲演奏に当たっては音声トラックの指示に従いベ
クトル量子化データ復号化手段７１で、元のディジタル
音声波形データに復号する。図５は、ベクトル量子化デ
ータ復号化手段７１がベクトル量子化音声波形データに
基づいて元のディジタル音声波形データを復号する際の
動作の概略を説明する図である。なお、図５（Ｂ）は図
４（Ｂ）に、図５（Ｃ）は図４（Ｃ）に、図５（Ｄ）は
図４（Ｄ）に、図５（Ｅ）は図４（Ｅ）に、それぞれ対
応している。ベクトル量子化データ復号化手段７１にお
いて、正規化スペクトル再生部５１はインデックス情報
３４〜３６に基づき図３のコードブック８から図５
（Ｅ）のようなスペクトルパターンを読み出す。スペク
トル包絡再生部５２は、補助情報３７〜３９に基づいて
図５（Ｄ）のようなスペクトル包絡情報を再生する。ス
ペクトル再生部５３は、正規化スペクトル再生部５１か
らのスペクトルパターンに、スペクトル包絡再生部から
のスペクトル包絡情報を乗じることによって、図５
（Ｃ）のようなスペクトル信号を再生する。逆直交変換
部５４は、スペクトル再生部５３からのスペクトル信号
に逆直交変換処理を施すことによって、図５（Ｄ）のよ
うな元のディジタル音声波形データの一部を復号する。
復号されたディジタル音声波形データは、ミキサ回路７
１に出力される。The karaoke apparatus 70 transmits music data including vector quantized audio waveform data as data of an audio data section to the communication network 80 and the communication interface 6.
And stores it in the hard disk drive 5. In performing the music, the vector quantized data decoding means 71 decodes the original digital audio waveform data in accordance with the instruction of the audio track. FIG. 5 is a diagram for explaining an outline of the operation when the vector quantized data decoding means 71 decodes the original digital audio waveform data based on the vector quantized audio waveform data. Note that FIG. 5B is shown in FIG. 4B, FIG. 5C is shown in FIG. 4C, FIG. 5D is shown in FIG. 4D, and FIG. E) respectively. In the vector quantized data decoding means 71, the normalized spectrum reproducing unit 51 converts the codebook 8 of FIG.
A spectrum pattern as shown in (E) is read. The spectrum envelope reproducing unit 52 reproduces the spectrum envelope information as shown in FIG. 5D based on the auxiliary information 37 to 39. The spectrum reproducing unit 53 multiplies the spectrum pattern from the normalized spectrum reproducing unit 51 by the spectrum envelope information from the spectrum envelope reproducing unit to obtain a signal shown in FIG.
A spectrum signal as shown in (C) is reproduced. The inverse orthogonal transform unit 54 decodes a part of the original digital audio waveform data as shown in FIG. 5 (D) by performing an inverse orthogonal transform process on the spectrum signal from the spectrum reproducing unit 53.
The decoded digital audio waveform data is supplied to a mixer circuit 7.
1 is output.

【００１６】なお、上述の実施の形態では、音声データ
をベクトル量子化技術でデータ圧縮する場合ついて説明
したが、これ以外の背景映像などをベクトル量子化技術
で圧縮するようにしてもよい。In the above-described embodiment, the case has been described where the audio data is compressed by the vector quantization technique. However, other background video and the like may be compressed by the vector quantization technique.

【００１７】[0017]

【発明の効果】この発明によれば、カラオケの音声デー
タをベクトル量子化波形データとし、この量子化された
波形データを、端末に具備されたコードブックをもとに
音声合成するようにしたため、従来に比べて、楽曲の通
信時間を短縮し、端末の記憶装置の負担を軽減できるカ
ラオケ装置を提供できるという効果を奏する。According to the present invention, the karaoke voice data is used as vector quantized waveform data, and the quantized waveform data is voice-synthesized based on a codebook provided in the terminal. Compared with the related art, there is an effect that a karaoke apparatus that can shorten the communication time of music and reduce the load on the storage device of the terminal can be provided.

[Brief description of the drawings]

【図１】この発明に係るカラオケ装置の一実施の形態
の全体構成を示す概略ブロック図。FIG. 1 is a schematic block diagram showing an overall configuration of an embodiment of a karaoke apparatus according to the present invention.

【図２】カラオケ装置に配信される楽曲データの構成
例を示す図。FIG. 2 is a diagram showing a configuration example of music data distributed to a karaoke device.

【図３】図１のコードブックの内容の一例を示す図。FIG. 3 is a view showing an example of the contents of the code book of FIG. 1;

【図４】ベクトル量子化技術によって音声波形データ
がインデッスク情報と補助情報に量子化される際の動作
の概略を説明する図。FIG. 4 is a diagram schematically illustrating an operation when audio waveform data is quantized into index information and auxiliary information by a vector quantization technique.

【図５】ベクトル量子化技術によって圧縮されたベク
トル量子化音声波形データに基づいて元のディジタル音
声波形データを復号する際の動作の概略を説明する図。FIG. 5 is a diagram schematically illustrating an operation of decoding original digital audio waveform data based on vector quantized audio waveform data compressed by a vector quantization technique.

[Explanation of symbols]

１…ＣＰＵ、２…ＲＯＭ、３…ＲＡＭ、４…パネルイン
ターフェイス、５…ハードディスク装置、６…通信イン
ターフェイス、７１…ベクトル量子化データ復号化手
段、８１…コードブック、１０…音源回路、１２…ミキ
サ回路、１３…マイク、１４…エフェクト手段、１５…
音響出力手段、１６…映像作成回路、１７…ＣＤ−ＲＯ
Ｍ、１８…背景映像再生回路、１９…映像ミキサ回路、
２０…映像出力手段、２１…データ及びアドレスバスDESCRIPTION OF SYMBOLS 1 ... CPU, 2 ... ROM, 3 ... RAM, 4 ... Panel interface, 5 ... Hard disk drive, 6 ... Communication interface, 71 ... Vector quantization data decoding means, 81 ... Code book, 10 ... Sound source circuit, 12 ... Mixer Circuit, 13 microphone, 14 effect means, 15
Sound output means, 16: video creation circuit, 17: CD-RO
M, 18: background video reproduction circuit, 19: video mixer circuit,
20 video output means, 21 data and address bus

─────────────────────────────────────────────────────
────────────────────────────────────────────────── ───

【手続補正書】[Procedure amendment]

【提出日】平成９年６月３０日[Submission date] June 30, 1997

【手続補正１】[Procedure amendment 1]

【補正対象書類名】図面[Document name to be amended] Drawing

【補正対象項目名】図３[Correction target item name] Figure 3

【補正方法】変更[Correction method] Change

【補正内容】[Correction contents]

【図３】 FIG. 3

Claims

[Claims]

1. A storage means for storing a plurality of transferred music data, and an accompaniment data of the selected music data is read out based on a music selection operation to generate an accompaniment music sound, and the music data is synchronized with the accompaniment music sound. In a karaoke apparatus for reading and generating audio data, the audio data is composed of index information as vector quantized waveform data and auxiliary information relating to a spectrum envelope, and a table storage storing a relationship between the index information and an audio spectrum pattern. Means, reading means for reading an audio spectrum pattern from the table storage means based on the index information, and decoding means for decoding audio data based on the read audio spectrum pattern and the auxiliary information. A karaoke apparatus characterized by the above-mentioned.