JP2001312288A

JP2001312288A - Music data processor

Info

Publication number: JP2001312288A
Application number: JP2000129242A
Authority: JP
Inventors: Shinichi Nakaishi; 信一中石; Tatsuya Yamaguchi; 達也山口
Original assignee: Denso Ten Ltd
Current assignee: Denso Ten Ltd
Priority date: 2000-04-28
Filing date: 2000-04-28
Publication date: 2001-11-09

Abstract

PROBLEM TO BE SOLVED: To provide a music data processor that can display the lyrics of a song recorded on a recording medium even when the medium carries no character information for the lyrics, so that karaoke singing is easily possible with any recording medium. SOLUTION: The processor comprises a first reading unit 2, constituting shock-proof means that reads music data from a recording medium 100 intermittently at a speed higher than the normal reading speed for reproduction when the music data is read and reproduced; a first DSP 6 including voice recognition means that performs voice recognition on the vocal information contained in the music data read by the first reading unit 2 to obtain character information; and a first display 11 and a first speaker 13 that output the character information recognized by the voice recognition means.

Description

DETAILED DESCRIPTION OF THE INVENTION

[0001]

[Technical Field of the Invention] The present invention relates to a music data processing device, and more particularly to a music data processing device used as an in-vehicle or home (including commercial) karaoke device or audio device that reads music data from a recording medium on which the music data is recorded and reproduces it.

[0002]

[Prior Art] A karaoke device, known as one type of conventional music data processing device, is configured to use a large-capacity recording medium such as a compact disc graphics (CD-G), LaserDisc (registered trademark) (LD), or DVD to store music data. When music data is read from the recording medium and reproduced, character information on the lyrics, recorded in advance on the recording medium as part of the music data, is read out so that the lyrics (character information) are displayed on screen along with reproduction of the music data. An audio device, known as another type of conventional music data processing device, uses a recording medium such as a compact disc (CD) or MiniDisc (MD) to store music data, and is configured to read the music data from the recording medium and reproduce it.

[0003]

[Problems to Be Solved by the Invention] However, as described above, a conventional karaoke device reads the lyrics information corresponding to the music data to be reproduced from the recording medium and displays it on screen. The lyrics information (character information) must therefore be recorded on the recording medium in advance, which inevitably increases the amount of data to be recorded on the medium. Recording lyrics information in advance is not much of a problem for a large-capacity medium such as a DVD. For media such as CDs and MDs, however, whose capacity is considerably smaller than that of a DVD, it is undesirable because it reduces the number of songs that can be recorded.

[0004] Consequently, although a conventional karaoke device can use CDs, the most widespread recording medium for music data, and MDs, which are rapidly becoming widespread, it cannot display the lyrics (character information) of the songs recorded on them. As a result, single CDs, many of which carry karaoke music data for a song in addition to the music data containing vocal information (the song with vocals), cannot at present be used effectively as karaoke recording media. Likewise, and for the same reason, there is at present no conventional audio device using a recording medium such as a CD or MD that can display on screen the lyrics (character information) of the music data it reads and reproduces.

[0005] The present invention has been made in view of the above problems. Its object is to provide a music data processing device that can output the lyrics of a song recorded on a recording medium even when the medium, such as a CD or MD, carries no lyrics (character information), so that karaoke can be enjoyed easily with any recording medium.

[0006]

[Means for Solving the Problems and Effects Thereof] To solve the above problems, a music data processing device (1) according to the present invention reads music data from a recording medium on which the music data is recorded and reproduces it, and is characterized by comprising: shock-proof means for reading the music data from the recording medium intermittently, at a speed higher than the normal reading speed for reproduction, when the music data is reproduced; voice recognition means for obtaining character information by performing voice recognition on the vocal information contained in the music data read by the shock-proof means; and first output means for outputting the character information obtained by the voice recognition means.

[0007] According to the music data processing device (1), the vocal information contained in the music data read from the recording medium by the shock-proof means is recognized by the voice recognition means, converted into character information, and output (displayed on screen and/or output as voice) by the first output means as the lyrics (character information) of the music data being reproduced. Therefore, even if the recording medium is, for example, a CD or MD on which no lyrics (character information) are recorded, the lyrics can be output together with reproduction of the music data. Because lyrics (character information) can be output regardless of the type of recording medium or its recorded content, the user can easily check lyrics and enjoy karaoke whatever recording medium is used.

[0008] Also, according to the music data processing device (1), the shock-proof means reads the music data to be reproduced from the recording medium intermittently at a speed higher than the normal reading speed for reproduction, and does not read all of the music data (an entire song) at once. The reading can therefore be done in a short time, and the timing of lyrics output can easily be matched to the reproduction of the music data. It is thus possible to realize a music data processing device that causes no problems such as delayed sound output at the start of reproduction, and that can output lyrics precisely in time with the reproduction of the music data.

[0009] A music data processing device (2) according to the present invention is characterized in that the music data processing device (1) includes first pre-reading means that, when first music data containing vocal information and accompaniment information is reproduced, reads the first music data from the recording medium intermittently at a speed higher than the normal reading speed for reproduction.

[0010] According to the music data processing device (2), because it includes the first pre-reading means that reads the first music data, containing vocal information and accompaniment information, from the recording medium when that data is reproduced, the lyrics can be output together with reproduction of the first music data even if the recording medium is, for example, a CD or MD. In addition, the time the shock-proof means needs to read the music data is extremely short.

[0011] A music data processing device (3) according to the present invention is characterized in that, in the music data processing device (2), filter means is interposed between the shock-proof means and the voice recognition means, the filter means extracting only the information in the frequency band of the vocal information contained in the first music data read by the first pre-reading means.

[0012] According to the music data processing device (3), the information in the frequency band of the vocal information, extracted from the first music data by the filter means, is recognized by the voice recognition means and converted into character information. This avoids situations in which the lyrics of the song contained in the first music data are misrecognized, so accurate lyrics can be obtained with high probability. Furthermore, by using the filter means, or by subtracting the vocal-band information obtained by the filter means from the first music data, a karaoke version of the song contained in the music data can also be created.
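The filtering and subtraction described in [0012] can be sketched as follows. This is a minimal illustration, not the patent's filter design: it assumes an ideal band-pass implemented with a naive discrete Fourier transform, and the function names (`band_pass`, `remove_band`) and any band limits passed to them are hypothetical, since the text does not specify the vocal frequency band.

```python
import cmath
import math


def dft(samples):
    """Naive discrete Fourier transform (fine for short illustrative signals)."""
    n = len(samples)
    return [sum(samples[t] * cmath.exp(-2j * math.pi * k * t / n) for t in range(n))
            for k in range(n)]


def idft(spectrum):
    """Inverse transform, returning the real part of each sample."""
    n = len(spectrum)
    return [sum(spectrum[k] * cmath.exp(2j * math.pi * k * t / n) for k in range(n)).real / n
            for t in range(n)]


def band_pass(samples, sample_rate, low_hz, high_hz):
    """Keep only components whose frequency lies in [low_hz, high_hz] (assumed vocal band)."""
    n = len(samples)
    spectrum = dft(samples)
    for k in range(n):
        # Bins above n/2 mirror the negative frequencies of a real signal.
        freq = min(k, n - k) * sample_rate / n
        if not (low_hz <= freq <= high_hz):
            spectrum[k] = 0
    return idft(spectrum)


def remove_band(samples, band):
    """Subtract the extracted band from the original to approximate a karaoke track."""
    return [s - b for s, b in zip(samples, band)]
```

In this sketch the output of `band_pass` would be handed to the voice recognition step, while `remove_band` corresponds to the subtraction that yields a karaoke version. Real vocals and accompaniment overlap in frequency, so an actual device would not separate them this cleanly.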

[0013] A music data processing device (4) according to the present invention is characterized in that, in any of the music data processing devices (1) to (3), where the recording medium records, as music data, first music data containing vocal information and accompaniment information and second music data (a karaoke track) containing only the accompaniment information of the first music data, the shock-proof means includes second pre-reading means that, when the first music data or the second music data is reproduced, reads the second music data from the recording medium intermittently at a speed higher than the normal reading speed for reproduction, and vocal information extracting means is interposed between the shock-proof means and the voice recognition means, the vocal information extracting means extracting only the vocal information contained in the first music data by taking the difference between the first music data read by the first pre-reading means and the second music data read by the second pre-reading means.

[0014] According to the music data processing device (4), the vocal information that the vocal information extracting means extracts from the difference between the first music data read by the first pre-reading means and the second music data read by the second pre-reading means is recognized by the voice recognition means and converted into character information. Misrecognition of the lyrics of the song contained in the music data being reproduced can thus be reliably avoided, further increasing the probability of obtaining accurate lyrics.
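The difference operation in [0013] and [0014] can be illustrated with a short sketch. It assumes, beyond what the text states, that both tracks are already decoded to sample lists, are time-aligned sample for sample, and share an identical accompaniment mix; the function name `extract_vocal` is hypothetical.

```python
def extract_vocal(first_music_data, second_music_data):
    """Return (vocal + accompaniment) minus accompaniment, sample by sample.

    Assumes both tracks are time-aligned and of equal length.
    """
    if len(first_music_data) != len(second_music_data):
        raise ValueError("tracks must be the same length and time-aligned")
    return [a - b for a, b in zip(first_music_data, second_music_data)]
```

For example, with an accompaniment of `[1, 2, 3, 4]` and a vocal line of `[0, 5, -5, 0]`, the vocal track is recovered from the sum of the two by subtracting the accompaniment.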

[0015] A music data processing device (5) according to the present invention is characterized in that, in any of the music data processing devices (1) to (4), where the recording medium records the first music data and character information of the song contained in the first music data, the device comprises: first reading means for reading, when the first music data is reproduced, the character information of the song corresponding to the first music data from the recording medium; and selecting means for selecting, according to the type of the recording medium or the content recorded on it, the means used to obtain the character information of the song contained in the music data to be reproduced; and the first output means includes a first output section that outputs the character information read by the first reading means.

[0016] According to the music data processing device (5), the selecting means can automatically select, according to the type of the recording medium or the content recorded on it, the most suitable means for obtaining the lyrics (character information) of the song contained in the music data from among, for example: the voice recognition means alone; the combination of the filter means and the voice recognition means; the combination of the vocal information extracting means and the voice recognition means; and the first reading means. Therefore, regardless of the type of recording medium or its recorded content, the lyrics (character information) can be obtained by the method most likely to output the accurate lyrics of the song contained in the music data.
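One way to picture the selecting means of [0016] is as a simple decision function. The sketch below is an assumption-laden illustration: the priority order (lyrics text on the disc first, then difference extraction, then filtering, then recognition alone) and the names `LyricsSource` and `select_lyrics_source` are not taken from the text, which only says that the most suitable means is chosen.

```python
from enum import Enum


class LyricsSource(Enum):
    DISC_TEXT = "first reading means"
    DIFF_THEN_RECOGNIZE = "vocal information extracting means + voice recognition means"
    FILTER_THEN_RECOGNIZE = "filter means + voice recognition means"
    RECOGNIZE_ONLY = "voice recognition means only"


def select_lyrics_source(has_lyrics_text, has_karaoke_track, filter_available=True):
    """Pick a lyrics acquisition method from what the medium offers (assumed priority)."""
    if has_lyrics_text:
        # Lyrics recorded on the medium need no recognition at all.
        return LyricsSource.DISC_TEXT
    if has_karaoke_track:
        # A karaoke track allows the vocal to be isolated by subtraction.
        return LyricsSource.DIFF_THEN_RECOGNIZE
    if filter_available:
        return LyricsSource.FILTER_THEN_RECOGNIZE
    return LyricsSource.RECOGNIZE_ONLY
```

Under these assumptions, a single CD carrying both a vocal and a karaoke track but no lyrics text would map to `LyricsSource.DIFF_THEN_RECOGNIZE`.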

[0017] A music data processing device (6) according to the present invention is characterized in that any of the music data processing devices (1) to (5) comprises: storage means for storing the character information recognized by the voice recognition means; second reading means for reading the character information stored in the storage means; character information correcting means for correcting the character information read by the second reading means in accordance with the user's instructions; and storage control means for storing the character information corrected by the character information correcting means in the storage means; and the first output means includes a second output section that outputs the character information read from the storage means by the second reading means.

[0018] According to the music data processing device (6), the character information correcting means, the storage means, and the storage control means allow the user to correct the character information recognized by the voice recognition means and save it. Even if the voice recognition means recognizes incorrect lyrics (character information) when the music data is first reproduced, the user can correct them. The second reading means and the second output section can then output accurate lyrics based on the corrected character information in a short time. Moreover, because the user can change the character information freely, the user can also enjoy parody versions of a song with lyrics of the user's own making.

[0019] A music data processing device (7) according to the present invention is characterized in that the music data processing device (6) comprises character information selection setting means that lets the user select whether or not to use the character information stored in the storage means when music data is reproduced, and the second reading means reads the character information from the storage means when the user has selected, via the character information selection setting means, to use the stored character information.

[0020] According to the music data processing device (7), when music data whose character information is stored in the storage means is reproduced, the character information selection setting means lets the user freely choose whether to output lyrics based on the stored character information. A music data processing device that always outputs the lyrics the user wants, and so satisfies the user, can thus be realized.
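The store, correct, and reuse flow of devices (6) and (7) can be sketched as below. The class `LyricsStore`, the function `lyrics_for_playback`, the track identifier, and the `recognize` callback are all hypothetical stand-ins; in particular, keeping an existing correction untouched when the user opts out of stored lyrics is a design assumption rather than something the text specifies.

```python
class LyricsStore:
    """In-memory stand-in for the storage means holding recognized or corrected lyrics."""

    def __init__(self):
        self._lyrics = {}

    def save(self, track_id, text):
        self._lyrics[track_id] = text

    def load(self, track_id):
        return self._lyrics.get(track_id)


def lyrics_for_playback(store, track_id, use_stored, recognize):
    """Return stored lyrics if the user opted in and they exist; otherwise recognize."""
    if use_stored:
        stored = store.load(track_id)
        if stored is not None:
            return stored
    text = recognize(track_id)
    # Only fill an empty slot, so a user correction is never overwritten.
    if store.load(track_id) is None:
        store.save(track_id, text)
    return text
```

A user correction then amounts to calling `store.save(track_id, corrected_text)`, after which playback with `use_stored=True` returns the corrected lyrics without running recognition again.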

[0021] A music data processing device (8) according to the present invention is characterized in that, in any of the music data processing devices (1) to (7), the first output means includes screen display means for displaying the character information on screen.

[0022] According to the music data processing device (8), the screen display means can display on screen the lyrics of the song contained in the music data being reproduced, so the user can easily check the lyrics on screen and enjoy karaoke whatever recording medium is used.

[0023] A music data processing device (9) according to the present invention is characterized in that any of the music data processing devices (1) to (8) comprises voice synthesizing means including a first voice synthesis section that synthesizes speech for the lyrics of the song contained in the music data based on the character information recognized by the voice recognition means, and the first output means includes first voice output means for outputting as voice the synthesized-speech information of the lyrics produced by the first voice synthesis section.

[0024] According to the music data processing device (9), the first voice synthesis section and the first voice output means can output as voice the lyrics based on the character information recognized by the voice recognition means. Even when the user cannot look at the screen showing the lyrics, the user can check the lyrics by ear. A music data processing device with which karaoke and the like can be enjoyed regardless of the user's situation can thus be provided.

[0025] A music data processing device (10) according to the present invention is characterized in that, in the music data processing device (9), where the recording medium records, as music data, first music data containing vocal information and accompaniment information and character information of the song relating to the first music data, the device comprises first reading means for reading the character information of the lyrics corresponding to the first music data from the recording medium when the first music data is reproduced; the voice synthesizing means includes a second voice synthesis section that synthesizes speech for the lyrics of the song relating to the music data to be reproduced based on the character information read by the first reading means; and the first output means includes second voice output means for outputting as voice the synthesized-speech information of the lyrics produced by the second voice synthesis section.

[0026] According to the music data processing device (10), even when reproducing first music data of a song whose lyrics (character information) are recorded on the recording medium in advance, the first reading means, the second voice synthesis section, and the second voice output means can output the lyrics of the song relating to the first music data as voice. The lyrics of the song relating to the music data being reproduced can therefore be output as voice regardless of the type of recording medium or its recorded content.

[0027] A music data processing device (11) according to the present invention is characterized in that the music data processing device (9) or (10) comprises: storage means for storing the character information recognized by the voice recognition means; second reading means for reading the character information stored in the storage means; character information correcting means for correcting the character information read by the second reading means in accordance with the user's instructions; and storage control means for storing the character information corrected by the character information correcting means in the storage means; the voice synthesizing means includes a third voice synthesis section that synthesizes speech for the lyrics of the song contained in the music data to be reproduced based on the character information read by the second reading means; and the first output means includes third voice output means for outputting as voice the synthesized-speech information of the lyrics produced by the third voice synthesis section.

[0028] According to the music data processing device (11), the character information correcting means, the storage means, and the storage control means allow the user to correct the character information recognized by the voice recognition means and store it in the storage means. Even if the voice recognition means recognizes incorrect lyrics (character information) when the music data is first reproduced, the user can correct them. In addition, when music data for a song whose corrected character information is stored in the storage means is reproduced again, the second reading means, the third voice synthesis section, and the third voice output means can output accurate lyrics as voice in a very short time based on the corrected character information. Moreover, because the user can change the character information freely, lyrics created by the user can also be output as voice, providing a highly entertaining music data processing device.

[0029]

[Embodiments of the Invention] Embodiments of the music data processing device according to the present invention are described below with reference to the drawings. FIG. 1 is a block diagram showing the schematic configuration of a music data processing device according to Embodiment (1). The music data processing device 1 according to Embodiment (1) comprises a first reading unit 2, a reproducing mechanism unit 3, a processing circuit unit 4, a first memory 5, a first DSP 6, a first operation unit 7, a first CPU 8, a second memory 9, a display driver 10, a first display 11, a D/A converter 12, and a first speaker 13.

[0030] The first reading unit 2 includes, for example, an optical pickup (not shown) that reads music data from a recording medium 100, such as a CD or MD, on which the music data is recorded, and a transport motor (not shown) that moves the optical pickup in the radial direction of the recording medium 100. Under the control of the reproducing mechanism unit 3, which operates in response to control signals from the first CPU 8, the first reading unit 2 also serves as a component of the shock-proof means, which reads music data from the recording medium 100 intermittently at a speed higher than the normal reading speed for reproduction (such reading is hereinafter referred to as pre-reading), temporarily stores the pre-read music data in memory, and then reproduces and outputs it.

[0031] The shock-proof means includes the first pre-reading means. The first pre-reading means pre-reads the first music data when music data of a song with vocals (including accompaniment), that is, first music data containing vocal information and accompaniment information, is reproduced.

[0032] The reproducing mechanism unit 3 is configured to perform operations such as driving and controlling a spindle servo motor (not shown), which rotates the recording medium 100 at a predetermined linear velocity, and the transport motor of the first reading unit 2. The processing circuit unit 4 performs processing such as generating various error signals (a focus error signal, a tracking error signal, and so on) from the music data read by the first reading unit 2. The first memory 5 consists of, for example, RAM that stores the first music data pre-read by the first pre-reading means and sent via the processing circuit unit 4.

[0033] In the music data processing device 1 according to Embodiment (1), the first memory 5 includes, as shown in the schematic diagram of FIG. 2, a shock-proof memory area 5a that stores several seconds to several tens of seconds of first music data 101 pre-read by the first pre-reading means. When the first music data 101 has accumulated up to the upper limit of the memory capacity of the shock-proof memory area 5a, the shock-proof means outputs a data-full signal to the first CPU 8. The shock-proof means also sends the first music data 101 stored in the shock-proof memory area 5a to the first DSP 6. When the amount stored in the shock-proof memory area 5a falls to a predetermined value, it outputs a data-empty signal to the first CPU 8, resumes pre-reading in response to an instruction signal from the first CPU 8, and again stores the first music data 101 taken in from the first reading unit 2 in the shock-proof memory area 5a.
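The data-full and data-empty behaviour described in [0033] resembles a buffer with two thresholds. The following toy simulation is a sketch under stated assumptions: frames stand in for audio data, each loop iteration reads a burst of frames (the faster, intermittent pre-read) and plays one frame (normal-speed output), and the names `ShockProofBuffer`, `simulate`, `capacity`, `resume_level`, and `burst` are hypothetical.

```python
from collections import deque


class ShockProofBuffer:
    """Toy model of shock-proof memory area 5a with full and empty thresholds."""

    def __init__(self, capacity, resume_level):
        self.capacity = capacity          # upper limit of the memory area, in frames
        self.resume_level = resume_level  # level at which "data empty" is signalled
        self.frames = deque()
        self.reading = True               # True while pre-reading is active

    def pre_read(self, source, burst):
        """Read up to `burst` frames from the iterator `source` while reading is active."""
        if not self.reading:
            return
        for _ in range(burst):
            if len(self.frames) >= self.capacity:
                self.reading = False      # "data full": pause pre-reading
                return
            frame = next(source, None)
            if frame is None:
                self.reading = False      # end of track: nothing left to read
                return
            self.frames.append(frame)

    def play(self):
        """Hand one frame to the DSP; resume pre-reading when the buffer runs low."""
        frame = self.frames.popleft() if self.frames else None
        if len(self.frames) <= self.resume_level:
            self.reading = True           # "data empty": resume pre-reading
        return frame


def simulate(total_frames=20, capacity=8, resume_level=2, burst=4):
    """Return the played frames and the reading state after each played frame."""
    source = iter(range(total_frames))
    buf = ShockProofBuffer(capacity, resume_level)
    played, states = [], []
    while True:
        buf.pre_read(source, burst)
        frame = buf.play()
        if frame is None:
            break
        played.append(frame)
        states.append(buf.reading)
    return played, states
```

In this model playback receives every frame in order while the reader alternates between active and paused phases, which is the intermittent reading the paragraph describes.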

【００３４】第１のＤＳＰ６は、第１のメモリ５より送
られてくる前記第１の音楽データから、該第１の音楽デ
ータに含まれている歌詞（文字情報）を獲得するといっ
た処理を含むデジタル信号処理を行うものである。ここ
では、例えば図３に示す概略構成ブロック図に示すよう
に、音声認識手段１４と第１の音声合成手段１５とディ
レイ手段１６とを含んで構成され、第１のメモリ５より
出力された前記第１の音楽データが、音声認識手段１４
とディレイ手段１６とにそれぞれ入力されるようになっ
ている。The first DSP 6 performs digital signal processing, including processing to acquire the lyrics (character information) contained in the first music data sent from the first memory 5. Here, as shown for example in the schematic block diagram of FIG. 3, it includes a voice recognition means 14, a first voice synthesis means 15, and a delay means 16, and the first music data output from the first memory 5 is input to both the voice recognition means 14 and the delay means 16.

【００３５】音声認識手段１４は、第１のメモリ５のシ
ョックプルーフメモリ領域５ａに記憶された第１の音楽
データ１０１に含まれているボーカル情報を音声認識し
て文字情報を取得するものである。そして、第１のＣＰ
Ｕ８からの指示信号に従い、取得した文字情報を第２の
メモリ９を介して表示ドライバ１０に、又は第１の音声
合成手段１５に、又は表示ドライバ１０及び第１の音声
合成手段１５の両方に出力するようになっている。The voice recognition means 14 acquires character information by performing voice recognition on the vocal information contained in the first music data 101 stored in the shock-proof memory area 5a of the first memory 5. In accordance with an instruction signal from the first CPU 8, it outputs the acquired character information to the display driver 10 via the second memory 9, to the first voice synthesis means 15, or to both the display driver 10 and the first voice synthesis means 15.

【００３６】第１の音声合成手段１５は、図４の模式的
説明図に示すように、音声認識手段１４から送られてく
る文字情報に基づき、第１の音楽データに含まれる歌詞
情報を音声合成して歌詞の音声合成情報を出力するよう
になっている。本実施の形態（１）では第１の音声合成
手段１５は、第１の音楽データに含まれる曲のフレーズ
の再生直前に歌詞が第１のスピーカ１３から合成音で音
声出力される（読み上げられる）ように、音声合成した
歌詞の音声合成情報をディレイ手段１６の出力側に出力
するようになっている。As shown in the schematic explanatory view of FIG. 4, the first voice synthesis means 15 synthesizes speech for the lyrics contained in the first music data, based on the character information sent from the voice recognition means 14, and outputs voice synthesis information for the lyrics. In embodiment (1), the first voice synthesis means 15 outputs the synthesized lyric voice information to the output side of the delay means 16 so that the lyrics are output (read aloud) as synthesized speech from the first speaker 13 immediately before the corresponding phrase of the song in the first music data is reproduced.

【００３７】一方、ディレイ手段１６は、第１のメモリ
５から出力された前記第１の音楽データを、第１のＣＰ
Ｕ８からの指示信号に従って所定の時間だけ遅延させて
Ｄ／Ａコンバータ１２に出力するものであり、バッファ
・メモリにより構成されている。このため、ディレイ手
段１６から出力された前記第１の音楽データは、音声認
識手段１４において取得された文字情報の第１のディス
プレイ１１からの出力と同期を図って第１のスピーカ１
３から再生されることとなる。The delay means 16, on the other hand, delays the first music data output from the first memory 5 by a predetermined time in accordance with an instruction signal from the first CPU 8 and outputs it to the D/A converter 12; it is configured by a buffer memory. As a result, the first music data output from the delay means 16 is reproduced from the first speaker 13 in synchronization with the output, on the first display 11, of the character information acquired by the voice recognition means 14.
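
A buffer-memory delay of this kind can be sketched as a fixed-length delay line. The Python below is only an illustration under stated assumptions: the class name, the per-sample interface, and the zero fill value are hypothetical, and the patent does not specify how the delay time is chosen.

```python
from collections import deque


class DelayLine:
    """Delays each sample by a fixed number of samples using a buffer memory."""

    def __init__(self, delay_samples, fill=0):
        # Pre-fill the buffer so the first outputs are silence (assumed fill value).
        self.buffer = deque([fill] * delay_samples)

    def process(self, sample):
        """Push one sample in and return the sample from delay_samples ago."""
        self.buffer.append(sample)
        return self.buffer.popleft()
```

Feeding the samples 1, 2, 3, 4, 5 through `DelayLine(3)` returns 0, 0, 0, 1, 2: the music emerges three samples late, which is the margin the recognition and synthesis stages would use.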

【００３８】第１の操作部７は、ユーザが音楽データ処
理装置１への操作信号を入力するためのものであり、第
１のＣＰＵ８に接続され、例えばスイッチ、キー、ボタ
ンあるいはタッチパネル等の手動入力手段やマイク等の
音声入力手段を含んで構成されている。手動入力手段と
しては、例えば記録媒体１００に記録されている音楽デ
ータを再生するように指示するための通常のスイッチ
（以下、再生用スイッチと記す）の他に、音楽データの
歌詞情報を画面表示するように指示するためのスイッチ
（以下、歌詞画面表示スイッチと記す）、歌詞情報を通
常の再生出力の少し前に読み上げるように指示するため
のスイッチ（カラオケ先生モードスイッチと記す）等を
装備している。またこれらのスイッチ操作を、前記音声
入力手段への音声入力によっても行えるように構成され
ている。The first operation unit 7 allows the user to input operation signals to the music data processing apparatus 1. It is connected to the first CPU 8 and includes manual input means such as switches, keys, buttons, or a touch panel, and voice input means such as a microphone. As manual input means it provides, in addition to an ordinary switch for instructing reproduction of the music data recorded on the recording medium 100 (hereinafter, the reproduction switch), a switch for instructing that the lyric information of the music data be displayed on the screen (hereinafter, the lyrics screen display switch), a switch for instructing that the lyric information be read aloud shortly before the normal reproduction output (hereinafter, the karaoke teacher mode switch), and so on. These switch operations can also be performed by voice input to the voice input means.

【００３９】第１のＣＰＵ８は、再生機構部３、第１の
メモリ５、第１のＤＳＰ６、第２のメモリ９に接続さ
れ、これら各部を制御するものであり、本実施の形態
（１）に係る音楽データ処理装置１では、処理回路部４
から送られてきたエラー信号等や、第１のメモリ５から
のデータフル信号、データエンプティ信号、第１の操作
部７から入力された操作信号等に基づいて制御動作を行
うようになっている。また第２のメモリ９は、例えばＲ
ＡＭ等で構成され、音声認識手段１４によって取得され
た文字情報を記憶するように構成されている。The first CPU 8 is connected to the reproduction mechanism unit 3, the first memory 5, the first DSP 6, and the second memory 9, and controls each of these units. In the music data processing apparatus 1 according to embodiment (1), it performs control based on the error signals and the like sent from the processing circuit unit 4, the data-full and data-empty signals from the first memory 5, the operation signals input from the first operation unit 7, and so on. The second memory 9 is configured by, for example, a RAM, and stores the character information acquired by the voice recognition means 14.

【００４０】表示ドライバ１０は、第１のＤＳＰ６から
第２のメモリ９を介して送られてくる文字情報を第１の
ディスプレイ１１に画面表示させるための画像信号を生
成する処理等を行うものとなっている。また第１のディ
スプレイ１１は、音声認識手段１４から表示ドライバ１
０を介して送られてきた文字情報の画像信号を、例えば
図２に示すように画面表示するものとなっている。The display driver 10 performs processing such as generating an image signal for displaying, on the first display 11, the character information sent from the first DSP 6 via the second memory 9. The first display 11 displays on its screen, for example as shown in FIG. 2, the image signal of the character information sent from the voice recognition means 14 via the display driver 10.

【００４１】Ｄ／Ａコンバータ１２は、第１のＤＳＰ６
におけるディレイ手段１６から出力された前記第１の音
楽データや第１の音声合成手段１５から出力された歌詞
の音声合成情報をＤ／Ａ変換して第１のスピーカ１３に
出力するものとなっている。第１のスピーカ１３は、図
２に示すように、Ｄ／Ａコンバータ１２から出力された
前記第１の音楽データや音声合成情報を再生するように
なっている。この第１のスピーカ１３及び第１のディス
プレイ１１により、音声認識手段１４によって取得され
た文字情報を音声出力及び／又は画面表示する本発明の
第１の出力手段が構成されている。The D/A converter 12 performs D/A conversion on the first music data output from the delay means 16 of the first DSP 6 and on the lyric voice synthesis information output from the first voice synthesis means 15, and outputs the result to the first speaker 13. As shown in FIG. 2, the first speaker 13 reproduces the first music data and the voice synthesis information output from the D/A converter 12. The first speaker 13 and the first display 11 constitute the first output means of the present invention, which outputs the character information acquired by the voice recognition means 14 as voice and/or displays it on a screen.

【００４２】次に、上記のごとく構成された音楽データ
処理装置１において、前記第１の音楽データとしてのボ
ーカル入りの曲を再生する際に第１のＣＰＵ８が行う動
作を、図５に示すフローチャートを用いて説明する。Next, the operation performed by the first CPU 8 when the music data processing apparatus 1 configured as described above reproduces a song with vocals as the first music data is described with reference to the flowchart shown in FIG. 5.

【００４３】電源が投入され、ある選択された曲のＣＤ
再生用スイッチがオンされると、ステップＳ１におい
て、まず歌詞画面表示スイッチがオンされているか否か
の判断を行う。ステップＳ１において、歌詞画面表示ス
イッチがオンされていると判断すると、続いてステップ
Ｓ２に進み、カラオケ先生モードスイッチがオンされて
いるか否かを判断する。When the power is turned on and the CD reproduction switch is turned on for a selected song, the first CPU 8 first determines in step S1 whether the lyrics screen display switch is on. If it determines in step S1 that the lyrics screen display switch is on, it proceeds to step S2 and determines whether the karaoke teacher mode switch is on.

【００４４】ステップＳ２において、カラオケ先生モー
ドスイッチがオンされていると判断すると、ステップＳ
３に進んでショックプルーフ手段を構成する前記第１の
先読み手段に、記録媒体１００に記録されている第１の
音楽データとしてのボーカル入りの曲を、再生時におけ
る通常の読み取り速度の２倍速以上の速度で先読みさせ
るように再生機構部３に指示を与える。先読みされたデ
ータは処理回路部４において所定の処理が施される。If it determines in step S2 that the karaoke teacher mode switch is on, it proceeds to step S3 and instructs the reproduction mechanism unit 3 so that the first pre-reading means, which constitutes the shock-proof means, pre-reads the song with vocals recorded on the recording medium 100 as the first music data at a speed at least twice the normal reading speed used for reproduction. The pre-read data is subjected to predetermined processing in the processing circuit unit 4.

【００４５】次にステップＳ４において、処理回路部４
において処理された先読みデータを、ショックプルーフ
メモリ領域５ａに記憶させる。この際、ショックプルー
フメモリ領域５ａにおけるメモリ容量の上限値まで先読
みデータが蓄積されて、第１のメモリ５からのデータフ
ル信号を取り込むと、ショックプルーフメモリ領域５ａ
に蓄積された先読みデータを第１のＤＳＰ６へ送出す
る。Next, in step S4, the pre-read data processed in the processing circuit unit 4 is stored in the shock-proof memory area 5a. When pre-read data has accumulated up to the upper limit of the memory capacity of the shock-proof memory area 5a and the first CPU 8 receives the data-full signal from the first memory 5, it causes the pre-read data accumulated in the shock-proof memory area 5a to be sent to the first DSP 6.

【００４６】また図示を省略しているが、第１のＣＰＵ
８は、ショックプルーフメモリ領域５ａに蓄積された先
読みデータが、第１のＤＳＰ６への送出によって所定の
値にまで減少するとステップＳ３に戻り、前回先読みし
た部分の続きの部分のデ−タを第１の先読み手段に先読
みさせるための制御を行うよう再生機構部３に指示を与
える。Although not shown in the flowchart, when the pre-read data accumulated in the shock-proof memory area 5a has been reduced to a predetermined value by being sent to the first DSP 6, the first CPU 8 returns to step S3 and instructs the reproduction mechanism unit 3 to perform control so that the first pre-reading means pre-reads the data that follows the previously pre-read portion.

【００４７】次いでステップＳ５に進み、第１のＤＳＰ
６に取り込まれた第１の音楽データに含まれているボー
カル情報を音声認識して文字情報に変換するように音声
認識手段１４に指示を与える。その後、ステップＳ６に
進んで文字情報を音声認識手段１４から第１の音声合成
手段１５へ送出させて、第１の音声合成手段１５におい
て前記文字情報を基にして歌詞を音声合成させる。また
ステップＳ７において、文字情報を音声認識手段１４か
ら第２のメモリ９へも送出させて、第２のメモリ９に文
字情報を記憶させる。The process then proceeds to step S5, where the voice recognition means 14 is instructed to perform voice recognition on the vocal information contained in the first music data taken into the first DSP 6 and convert it into character information. The process then proceeds to step S6, where the character information is sent from the voice recognition means 14 to the first voice synthesis means 15, which synthesizes speech for the lyrics based on that character information. In step S7, the character information is also sent from the voice recognition means 14 to the second memory 9 and stored there.

【００４８】次に、ステップＳ８に進んで、第２のメモ
リ９に記憶させた文字情報を表示ドライバ１０を介して
第１のディスプレイ１１に出力させて画面表示させると
ともに、ディレイ手段１６に送られた第１の音楽データ
及び第１の音声合成手段１５により音声合成された歌詞
の音声情報を、Ｄ／Ａコンバータ１２を介して第１のス
ピーカ１３に出力させる。Next, in step S8, the character information stored in the second memory 9 is output to the first display 11 via the display driver 10 and displayed on the screen, and the first music data sent to the delay means 16 and the lyric voice information synthesized by the first voice synthesis means 15 are output to the first speaker 13 via the D/A converter 12.

【００４９】この際、第１の音楽データが通常の再生処
理時における速度で第１のスピーカ８から再生されるよ
うにディレイ手段１６を制御し、また、第１の音声合成
手段１５から出力される歌詞の音声情報が、第１の音楽
データに含まれた曲のフレーズの直前に第１のスピーカ
１３から読み上げられるように、第１の音声合成手段１
５からの前記音声情報の出力を制御し、また文字情報の
第１のディスプレイ１１での画面表示を第１のスピーカ
１３からの前記音声情報の出力と同期させる。これらの
制御によって、第１の音楽データが通常の速度で再生さ
れるとともに、フレーズに合わせて歌詞が画面表示さ
れ、さらに１フレーズ分の第１の音楽データの再生直前
に歌詞が合成音で読み上げられる。At this time, the delay means 16 is controlled so that the first music data is reproduced from the first speaker 13 at the normal reproduction speed; the output of the voice information from the first voice synthesis means 15 is controlled so that the lyric voice information is read aloud from the first speaker 13 immediately before the corresponding phrase of the song in the first music data; and the screen display of the character information on the first display 11 is synchronized with the output of the voice information from the first speaker 13. Through this control, the first music data is reproduced at normal speed, the lyrics are displayed on the screen in step with each phrase, and the lyrics are read aloud as synthesized speech immediately before each phrase of the first music data is reproduced.
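
The timing relationship described here can be sketched as a small scheduling helper. This is a hypothetical illustration, not the patent's control logic: the function name, the use of seconds, and the `gap` parameter (an assumed pause between the end of the reading and the start of the sung phrase) are all assumptions.

```python
def schedule_phrase(phrase_start, readout_duration, gap=0.2):
    """Return (readout_start, display_start) in seconds for one phrase.

    phrase_start     -- time at which the delayed music reaches the phrase
    readout_duration -- length of the synthesized reading of the lyrics
    gap              -- assumed pause between the reading and the phrase
    """
    # Start the reading early enough that it ends `gap` seconds before the phrase.
    readout_start = max(0.0, phrase_start - gap - readout_duration)
    # The screen display is tied to the start of the voice output.
    display_start = readout_start
    return readout_start, display_start
```

For a phrase that begins at 10.0 s with a 2.0 s reading and a 0.5 s gap, both the reading and the display start at 7.5 s. The look-ahead this requires is what the delay of the music through the delay means 16 would provide.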

【００５０】一方、ステップＳ１において、歌詞画面表
示スイッチがオンされていないと判断すると、ステップ
Ｓ９に進んで、カラオケ先生モードスイッチがオンされ
ているか否かを判断する。ステップＳ９において、カラ
オケ先生モードスイッチがオンされていると判断する
と、上記したステップＳ３〜ステップＳ６と同じ動作を
行う（ステップＳ１０）。If, on the other hand, it determines in step S1 that the lyrics screen display switch is not on, it proceeds to step S9 and determines whether the karaoke teacher mode switch is on. If it determines in step S9 that the karaoke teacher mode switch is on, it performs the same operations as steps S3 to S6 described above (step S10).

【００５１】その後、ステップＳ１１に進み、ディレイ
手段１６に送られた第１の音楽データ及び第１の音声合
成手段１５において音声合成された歌詞情報を、Ｄ／Ａ
コンバータ１２を介して第１のスピーカ１３に出力させ
る。この際、通常の再生処理時における速度で第１の音
楽データが第１のスピーカ１３から再生されるようにデ
ィレイ手段１６を制御するとともに、第１の音声合成手
段１５において音声合成された歌詞情報が、第１の音楽
データに含まれた曲のフレーズの直前に第１のスピーカ
１３から読み上げられるように、第１の音声合成手段１
５からの前記音声情報の出力を制御する。これらの制御
によって、第１の音楽データが通常の速度で再生される
とともに、１フレーズ分の第１の音楽データの再生直前
に歌詞が合成音で読み上げられる。The process then proceeds to step S11, where the first music data sent to the delay means 16 and the lyric information synthesized by the first voice synthesis means 15 are output to the first speaker 13 via the D/A converter 12. At this time, the delay means 16 is controlled so that the first music data is reproduced from the first speaker 13 at the normal reproduction speed, and the output of the voice information from the first voice synthesis means 15 is controlled so that the synthesized lyric information is read aloud from the first speaker 13 immediately before the corresponding phrase of the song in the first music data. Through this control, the first music data is reproduced at normal speed and the lyrics are read aloud as synthesized speech immediately before each phrase of the first music data is reproduced.

【００５２】また、ステップＳ９において、カラオケ先
生モードスイッチがオンされていないと判断すると、ス
テップＳ１２に進み、記録媒体１００から第１の音楽デ
ータを再生するための通常の制御を行う。If it determines in step S9 that the karaoke teacher mode switch is not on, it proceeds to step S12 and performs the normal control for reproducing the first music data from the recording medium 100.

【００５３】また、歌詞画面表示スイッチがオンされて
いると判断したものの、続くステップＳ２において、カ
ラオケ先生モードスイッチがオフされていると判断する
と、上記したステップＳ３〜ステップＳ５と同じ動作を
行う（ステップＳ１３）。その後、ステップＳ１４に進
んでステップＳ７と同じように、文字情報を第２のメモ
リ９に記憶させるための制御を行う。If it has determined that the lyrics screen display switch is on but then determines in step S2 that the karaoke teacher mode switch is off, it performs the same operations as steps S3 to S5 described above (step S13). It then proceeds to step S14 and, as in step S7, performs control to store the character information in the second memory 9.

【００５４】次いで、ステップＳ１５において、第２の
メモリ９に記憶させた文字情報を表示ドライバ１０を介
して第１のディスプレイ１１に出力させて画面表示させ
るとともに、ディレイ手段１６に送られた第１の音楽デ
ータを、Ｄ／Ａコンバータ１２を介して第１のスピーカ
１３から音声出力させる。その際も、通常の再生処理時
における速度で第１の音楽データが第１のスピーカ１３
から再生されるようにディレイ手段１６を制御するとと
もに、文字情報の第１のディスプレイ１１での画面表示
を第１のスピーカ１３からの前記音声情報の出力と同期
させる。これらの制御によって、第１の音楽データが通
常の速度で再生されるとともに、フレーズに合わせて歌
詞が画面表示される。Next, in step S15, the character information stored in the second memory 9 is output to the first display 11 via the display driver 10 and displayed on the screen, and the first music data sent to the delay means 16 is output as sound from the first speaker 13 via the D/A converter 12. Here too, the delay means 16 is controlled so that the first music data is reproduced from the first speaker 13 at the normal reproduction speed, and the screen display of the character information on the first display 11 is synchronized with the output of the voice information from the first speaker 13. Through this control, the first music data is reproduced at normal speed and the lyrics are displayed on the screen in step with each phrase.
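
The four branches of the FIG. 5 flowchart described in paragraphs [0043] to [0054] can be summarized in a short Python sketch. The function name and the representation of steps as string labels are assumptions made for illustration; the step numbers themselves come from the description above.

```python
def playback_steps(lyrics_display_on, teacher_mode_on):
    """Return the FIG. 5 step labels the first CPU 8 would run, in order."""
    if lyrics_display_on and teacher_mode_on:
        # Display the lyrics and read them aloud before each phrase.
        return ["S1", "S2", "S3", "S4", "S5", "S6", "S7", "S8"]
    if lyrics_display_on:
        # Display only: S13 repeats S3 to S5, S14 repeats S7.
        return ["S1", "S2", "S13", "S14", "S15"]
    if teacher_mode_on:
        # Read aloud only: S10 repeats S3 to S6.
        return ["S1", "S9", "S10", "S11"]
    # Neither switch is on: ordinary reproduction.
    return ["S1", "S9", "S12"]
```

Calling `playback_steps(True, False)` gives the display-only path ending in step S15, while `playback_steps(False, False)` falls through to the normal reproduction of step S12.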

【００５５】なお、ユーザによりＣＤ再生用スイッチが
オンされたときに上記のごとく動作する第１のＣＰＵ８
は、ＣＤ再生用スイッチがオフとされているものの、歌
詞画面表示スイッチ及びカラオケ先生モードスイッチの
うちのいずれか一方、あるいは両方がオンとされた場合
に、例えば以下に述べるような動作を行うものとなって
いる。The first CPU 8, which operates as described above when the user turns on the CD reproduction switch, operates for example as follows when the CD reproduction switch is off but either or both of the lyrics screen display switch and the karaoke teacher mode switch are turned on.

【００５６】まず、ユーザが歌詞画面表示スイッチ及び
カラオケ先生モードスイッチのうちのいずれか一方、あ
るいは両方をオンさせると、第１のＣＰＵ８は、第２の
メモリ９から、該第２のメモリ９に文字情報が記憶され
た曲の題名のリストデータを表示ドライバ１０を介して
第１のディスプレイ１１に出力させ、画面表示させる。
ユーザによって、リスト表示された曲の題名の中から出
力させたい曲の題名が選択されると、先にユーザがスイ
ッチをオンにすることにより入力された指示信号に基づ
いて、選択された曲の文字情報を第１のディスプレイ１
１又は第１のスピーカ１３、又は第１のディスプレイ１
１及び第１のスピーカ１３に出力させて歌詞を表示ある
いは音声出力させる。First, when the user turns on either or both of the lyrics screen display switch and the karaoke teacher mode switch, the first CPU 8 reads from the second memory 9 the list data of the titles of the songs whose character information is stored in the second memory 9, outputs it to the first display 11 via the display driver 10, and displays it on the screen. When the user selects, from the listed titles, the title of the song to be output, the first CPU 8, based on the instruction signal input earlier when the user turned on the switch or switches, outputs the character information of the selected song to the first display 11, to the first speaker 13, or to both, so that the lyrics are displayed or output as voice.

【００５７】上記した実施の形態（１）に係る音楽デー
タ処理装置１によれば、記録媒体１００から読み取った
第１の音楽データに含まれているボーカル情報を、音声
認識して文字情報に変換し、歌詞情報として第１のディ
スプレイ１１に画面表示させたり、第１のスピーカ１３
から音声出力させることができる。よって、記録媒体１
００が第１の音楽データとしてボーカル入りの曲のみを
記録しており、歌詞情報が文字情報として記録されてい
ないＣＤやＭＤであっても、第１の音楽データを再生し
つつ、歌詞の確認やカラオケを容易に行うことができる
ため、娯楽性の高い音楽データ処理装置とすることがで
きる。With the music data processing apparatus 1 according to embodiment (1) described above, the vocal information contained in the first music data read from the recording medium 100 can be converted into character information by voice recognition and, as lyric information, displayed on the screen of the first display 11 or output as voice from the first speaker 13. Therefore, even if the recording medium 100 is a CD or MD on which only songs with vocals are recorded as the first music data and no lyric information is recorded as character information, the user can easily check the lyrics or sing karaoke while the first music data is reproduced, which makes for a highly entertaining music data processing apparatus.

【００５８】また、歌詞情報の表示に際しては、ショッ
クプルーフ手段が、再生のための通常の読み取り速度よ
りも高速で間欠的に記録媒体１００から第１の音楽デー
タを読み取って第１のメモリ５に蓄積する、いわゆるシ
ョックプルーフ機能により第１の音楽データの読み取り
を行っており、第１の音楽データの全データを一気に読
み取らない。これにより、第１の音楽データの読み取り
を短時間で行え、しかも第１の音楽データの再生のタイ
ミングに歌詞情報の出力のタイミングを合わせ易いとい
う点で非常に優れている。したがって、音楽データ処理
装置１は、再生開始時に音出しが遅れる等の不具合が発
生せず、また第１の音楽データの再生にタイミングを的
確に合わせて歌詞情報を出力させることができるものと
なる。When the lyric information is displayed, the first music data is read by the so-called shock-proof function, in which the shock-proof means intermittently reads the first music data from the recording medium 100 at a speed higher than the normal reading speed for reproduction and accumulates it in the first memory 5; the whole of the first music data is not read in one go. This is highly advantageous in that the first music data can be read in a short time and the output timing of the lyric information is easy to match to the reproduction timing of the first music data. The music data processing apparatus 1 therefore avoids problems such as a delayed start of sound at the beginning of reproduction, and can output the lyric information accurately timed to the reproduction of the first music data.

【００５９】また、音楽データ処理装置１では、ショッ
クプルーフ手段により、ボーカル情報及び伴奏情報を含
む第１の音楽データが読み取られ、記録媒体１００がボ
ーカル入りの曲のみを記録したＣＤやＭＤであっても、
第１の音楽データの再生に際し、歌詞情報を出力させる
ことができる。また、歌詞情報の出力に際しては、第１
の音楽データの読み取りに要する時間が極めて短くて済
むとともに、記録媒体１００から音楽データを読み取る
動作が複雑にならないといった利点もある。In the music data processing apparatus 1, the shock-proof means reads the first music data containing vocal information and accompaniment information, so that lyric information can be output when the first music data is reproduced even if the recording medium 100 is a CD or MD on which only songs with vocals are recorded. There are also the advantages that, when the lyric information is output, the time required to read the first music data is extremely short and the operation of reading the music data from the recording medium 100 does not become complicated.

【００６０】さらに、ショックプルーフ機能は、振動に
より音飛びが発生し易い車載用の音響装置等に採用され
ているものであるため、この音楽データ処理装置１を車
載用の音響装置等に適用した場合には、既存の音響装置
の構成要素を利用して音楽データ処理装置１を容易に構
成することができ、追加部品に要するコストを低く抑え
ることができる。しかも、音楽データ処理装置１では、
音声認識された文字情報に基づく歌詞情報を音声出力さ
せることが可能であるため、歌詞情報が画面表示された
第１のディスプレイ１１をユーザがたとえ視認できな
い、例えば車を運転している状況にあっても、ユーザに
歌詞情報を音声出力により伝えることができる。したが
って、音楽データ処理装置１は、車内でカラオケを楽し
むことができる車載用の装置としても非常に有効なもの
となる。Furthermore, since the shock-proof function is already used in in-vehicle audio devices and the like, in which sound skipping due to vibration is likely to occur, applying the music data processing apparatus 1 to such an in-vehicle audio device allows it to be built easily from components of the existing audio device, keeping the cost of additional parts low. Moreover, since the music data processing apparatus 1 can output as voice the lyric information based on the recognized character information, the lyric information can be conveyed to the user by voice even when the user cannot look at the first display 11 on which the lyric information is displayed, for example while driving a car. The music data processing apparatus 1 is therefore also very effective as an in-vehicle apparatus that lets users enjoy karaoke in the car.

【００６１】また、第１の操作部７に設けられたスイッ
チ等によりユーザが、歌詞を画面表示させるか音声出力
させるか、又は画面表示と音声出力の両方で出力させる
かを選択できるため、ユーザの好みに合った歌詞情報の
出力を行うことができる。In addition, the user can select, with the switches and the like provided on the first operation unit 7, whether the lyrics are displayed on the screen, output as voice, or output both ways, so the lyric information can be output in the way the user prefers.

【００６２】また、実施の形態（１）に係る音楽データ
処理装置１では、音声認識手段１４により認識された文
字情報を記憶する第２のメモリ９が装備されていること
により、記録媒体１００に記録された音楽データを再生
するとき以外、例えば音楽データの再生を終えた後に
も、第２のメモリ９に記憶されている歌詞情報を第１の
ディスプレイ１１や第１のスピーカ１３に出力させるこ
とができる。従って、音楽データの再生時に見落とし
た、あるいは聞き逃した歌詞情報を容易に確認すること
ができる。Since the music data processing apparatus 1 according to embodiment (1) is equipped with the second memory 9, which stores the character information recognized by the voice recognition means 14, the lyric information stored in the second memory 9 can be output to the first display 11 or the first speaker 13 at times other than when the music data recorded on the recording medium 100 is being reproduced, for example after reproduction of the music data has finished. Lyric information that was overlooked or missed during reproduction can therefore be checked easily.

【００６３】なお、実施の形態（１）に係る音楽データ
処理装置１では、第２のメモリ９が装備された例を説明
したが、本発明はこの例に限定されるものではない。例
えば別の実施の形態に係る音楽データ処理装置では、第
２のメモリ９が装備されていないものとし、音声認識手
段１４により認識された文字情報が直接、表示ドライバ
１０へ出力されるように構成することも可能である。こ
の場合には、第２のメモリ９が削減される分、音楽デー
タ処理装置の構成を簡略化することができる利点があ
る。Although the music data processing apparatus 1 according to embodiment (1) has been described as equipped with the second memory 9, the present invention is not limited to this example. In a music data processing apparatus according to another embodiment, for example, the second memory 9 may be omitted and the character information recognized by the voice recognition means 14 output directly to the display driver 10. This has the advantage that the configuration of the music data processing apparatus is simplified by the omission of the second memory 9.

【００６４】また、実施の形態（１）に係る音楽データ
処理装置１では、音声認識手段１４、第１の音声合成手
段１５、ディレイ手段１６を含む第１のＤＳＰ６が装備
された例を説明したが、別の実施の形態に係る音楽デー
タ処理装置では、第１の音声合成手段１５を含まない
（音声認識手段１４及びディレイ手段１６だけを含む）
第１のＤＳＰを、実施の形態（１）における第１のＤＳ
Ｐ６に替えて装備することも可能である。この場合に
も、第１の音楽データに含まれるボーカル入りの曲の歌
詞情報を画面表示できるので、記録媒体がボーカル入り
の曲のみを記録しており、歌詞情報が文字情報として記
録されていないＣＤやＭＤであっても、ボーカル入りの
曲を再生しつつ、歌詞情報の確認を容易に行えるといっ
た効果を得ることができる。Also, although the music data processing apparatus 1 according to embodiment (1) has been described as equipped with the first DSP 6 including the voice recognition means 14, the first voice synthesis means 15, and the delay means 16, a music data processing apparatus according to another embodiment may instead be equipped with a first DSP that does not include the first voice synthesis means 15 (that is, one that includes only the voice recognition means 14 and the delay means 16) in place of the first DSP 6 of embodiment (1). In this case too, the lyric information of a song with vocals contained in the first music data can be displayed on the screen, so even with a CD or MD on which only songs with vocals are recorded and no lyric information is recorded as character information, the lyric information can easily be checked while the song is reproduced.

【００６５】次に、本発明の実施の形態（２）に係る音
楽データ処理装置を説明する。実施の形態（２）に係る
音楽データ処理装置は、実施の形態（１）に係る音楽デ
ータ処理装置１とは第１のＤＳＰ、第１のＣＰＵ及び第
１の操作部の構成が相違しているが、これら第１のＤＳ
Ｐ、第１のＣＰＵ及び第１の操作部以外の構成はほぼ同
じとなっている。そのため、ここでは図１に示したブロ
ック図と、実施の形態（２）に係る音楽データ処理装置
の第１のＤＳＰの概略構成を示す図６とを用いて実施の
形態（２）に係る音楽データ処理装置の説明を行い、図
１において第１のＤＳＰ、第１のＣＰＵ、第１の操作部
及び音楽データ処理装置にのみ異なる符号を付しておく
こととする。Next, a music data processing apparatus according to embodiment (2) of the present invention is described. It differs from the music data processing apparatus 1 according to embodiment (1) in the configuration of the first DSP, the first CPU, and the first operation unit, but is otherwise almost the same. The apparatus according to embodiment (2) is therefore described using the block diagram of FIG. 1 together with FIG. 6, which shows the schematic configuration of its first DSP; in FIG. 1, different reference numerals are used only for the first DSP, the first CPU, the first operation unit, and the music data processing apparatus itself.

【００６６】図６において、実施の形態（２）に係る音
楽データ処理装置２０の第１のＤＳＰ２１は、実施の形
態（１）における第１のＤＳＰ６を構成する音声認識手
段１４と第１の音声合成手段１５とディレイ手段１６と
に加えて、バンドパスフィルタ２２、第３のメモリ２
３、カラオケ曲作成手段１７及びディレイ手段１８が装
備されており、第１のメモリ５から出力された第１の音
楽データが、バンドパスフィルタ２２とディレイ手段１
６とカラオケ曲作成手段１７とにそれぞれ入力され、前
記第１の音楽データから音声認識手段１４により文字情
報を獲得する等のデジタル信号処理を行うようになって
いる。In FIG. 6, the first DSP 21 of the music data processing apparatus 20 according to embodiment (2) is equipped with a band-pass filter 22, a third memory 23, a karaoke song creating means 17, and a delay means 18, in addition to the voice recognition means 14, the first voice synthesis means 15, and the delay means 16 that constitute the first DSP 6 of embodiment (1). The first music data output from the first memory 5 is input to the band-pass filter 22, the delay means 16, and the karaoke song creating means 17, and the first DSP 21 performs digital signal processing such as acquiring character information from the first music data by means of the voice recognition means 14.

【００６７】バンドパスフィルタ２２は、第１の読み取
り部２（図１参照）から読み取られた前記第１の音楽デ
ータ中から、該第１の音楽データに含まれているボーカ
ル情報の周波数帯域の信号のみを通過させて取り出すフ
ィルタ処理を行うものである。人の声は、おおよそ９０
Ｈｚ〜１０ｋＨｚの周波数帯域に分布する。このためバ
ンドパスフィルタ２２は、図７の説明図において、入力
された第１の音楽データとしてのボーカル入りの曲
（ａ）から、人の声が主に含まれている周波数帯域の情
報、例えば３００Ｈｚ〜３ｋＨｚの周波数帯域の情報を
取り出すことができるように構成されており（ｂ）、こ
のことによって（ｃ）において、前記第１の音楽データ
中に含まれているボーカル情報のみでほぼ構成された情
報を得ることが可能になっている。The band-pass filter 22 performs filter processing that passes and extracts, from the first music data read by the first reading unit 2 (see FIG. 1), only the signal in the frequency band of the vocal information contained in that data. The human voice is distributed over a frequency band of roughly 90 Hz to 10 kHz. As illustrated in FIG. 7, the band-pass filter 22 is therefore configured so that, from the input song with vocals serving as the first music data (a), it can extract the information in the frequency band that mainly contains the human voice, for example 300 Hz to 3 kHz (b); this yields, at (c), information consisting almost entirely of the vocal information contained in the first music data.
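
As an illustration of band-pass filtering around the 300 Hz to 3 kHz example band, the following Python sketch applies a single second-order (biquad) band-pass section using the widely published Audio EQ Cookbook coefficient formulas. The patent does not say how the band-pass filter 22 is realized, so the filter order, the function name, and the choice of centre frequency and Q from the band edges are all assumptions. A single section of this kind is gentle and only roughly separates voice from accompaniment.

```python
import math


def bandpass(samples, fs, f_low=300.0, f_high=3000.0):
    """Filter samples with one biquad band-pass section (0 dB peak gain)."""
    f0 = math.sqrt(f_low * f_high)      # geometric centre of the pass band
    q = f0 / (f_high - f_low)           # Q chosen so the band is roughly f_low..f_high
    w0 = 2.0 * math.pi * f0 / fs
    alpha = math.sin(w0) / (2.0 * q)
    a0 = 1.0 + alpha
    # Normalized coefficients; the b1 coefficient is zero for this filter type.
    b0, b2 = alpha / a0, -alpha / a0
    a1, a2 = -2.0 * math.cos(w0) / a0, (1.0 - alpha) / a0
    x1 = x2 = y1 = y2 = 0.0
    out = []
    for x in samples:
        y = b0 * x + b2 * x2 - a1 * y1 - a2 * y2
        out.append(y)
        x2, x1 = x1, x                  # shift the input history
        y2, y1 = y1, y                  # shift the output history
    return out
```

At a 44.1 kHz sampling rate, a 1 kHz tone passes through this section almost unchanged, while 50 Hz and 15 kHz tones are strongly attenuated. A practical vocal extractor would likely need a steeper filter or a different separation technique.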

【００６８】第３のメモリ２３は、バンドパスフィルタ
２２が取り出した情報（ボーカル情報）を記憶し、音声
認識手段１４及びカラオケ曲作成手段１７に出力するも
のとなっている。従って、音声認識手段１４は、第３の
メモリ２３に記憶された情報（ボーカル情報）を、音声
認識して文字情報に変換するようになっている。The third memory 23 stores the information (vocal information) extracted by the band-pass filter 22 and outputs it to the voice recognition means 14 and the karaoke song creating means 17. The voice recognition means 14 thus performs voice recognition on the information (vocal information) stored in the third memory 23 and converts it into character information.

【００６９】カラオケ曲作成手段１７は、バンドパスフ
ィルタ２２により第１の音楽データから取り出され、第
３のメモリ２３に記憶された情報を用いて、カラオケ用
の音楽データ（第２の音楽データ）を作成するものであ
る。すなわち、バンドパスフィルタ２２により処理され
る前の第１の音楽データから、第３のメモリ２３に記憶
された、ほぼボーカル情報のみからなる情報を差し引く
ことによって第１の音楽データ中に含まれる伴奏情報の
みで構成されたカラオケ用の音楽データを作成するよう
になっている。The karaoke song creating means 17 creates music data for karaoke (second music data) using the information extracted from the first music data by the band-pass filter 22 and stored in the third memory 23. Specifically, it subtracts the information stored in the third memory 23, which consists almost entirely of vocal information, from the first music data as it was before processing by the band-pass filter 22, thereby creating karaoke music data consisting only of the accompaniment information contained in the first music data.
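
The subtraction step can be sketched in a few lines of Python. The function name and the list-of-samples representation are assumptions, and the sketch presumes that the extracted vocal signal is time-aligned, sample for sample, with the original music data, which a real filter would not guarantee without compensating for its delay.

```python
def make_karaoke(original, vocal):
    """Subtract the extracted vocal signal from the original, sample by sample."""
    if len(original) != len(vocal):
        raise ValueError("signals must be time-aligned and equally long")
    return [o - v for o, v in zip(original, vocal)]
```

For example, `make_karaoke([5, 7, 9], [2, 3, 4])` returns `[3, 4, 5]`, the residue that would stand for the accompaniment.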

【００７０】音楽データ処理装置２０における第１のＣ
ＰＵ２４（図１参照）は、再生機構部３、第１のメモリ
５、第２のメモリ９及び処理回路部４に接続され、これ
ら各部を、実施の形態（１）に係る第１のＣＰＵ８と同
様に制御するものである。また、前記第１の音楽データ
中に含まれる曲の歌詞の文字情報や、音声情報、カラオ
ケ用の音楽データを作成するように第１のＤＳＰ２１の
制御を行うものとなっている。The first CPU 24 (see FIG. 1) of the music data processing apparatus 20 is connected to the reproduction mechanism unit 3, the first memory 5, the second memory 9, and the processing circuit unit 4, and controls these units in the same way as the first CPU 8 of embodiment (1). It also controls the first DSP 21 so as to create the character information of the lyrics of the songs contained in the first music data, the voice information, and the music data for karaoke.

【００７１】また、第１の操作部２５（図１参照）は、
実施の形態（１）で述べた各スイッチの他に、手動入力
手段として、例えばボーカル入りの曲からカラオケ用の
音楽データを作成して再生するように、ユーザが第１の
ＣＰＵ２４に指示するためのカラオケスイッチ（図示せ
ず）を備えたものとなっている。The first operation unit 25 (see FIG. 1) has, in addition to the switches described in embodiment (1), a karaoke switch (not shown) as a manual input means with which the user instructs the first CPU 24 to create karaoke music data from, for example, a song with vocals and reproduce it.

【００７２】図８は、上記のごとく構成された音楽デー
タ処理装置２０において、第１の音楽データとしてのボ
ーカル入りの曲を再生する際の第１のＣＰＵ２４が行う
動作の一部を示したフローチャートであり、ここでは上
記実施の形態（１）に係る音楽データ処理装置１におけ
る第１のＣＰＵ８が行う動作と相違する部分のみを示し
ている。図８において第１のＣＰＵ２４は、図５に示し
たフローチャートのステップＳ４とステップＳ５との間
に、ステップＳ２１、ステップＳ２２の動作を行うもの
となっている。FIG. 8 is a flowchart showing part of the operation performed by the first CPU 24 when the music data processing apparatus 20 configured as described above reproduces a song with vocals as the first music data; it shows only the portion that differs from the operation performed by the first CPU 8 of the music data processing apparatus 1 according to embodiment (1). In FIG. 8, the first CPU 24 performs the operations of steps S21 and S22 between steps S4 and S5 of the flowchart shown in FIG. 5.

【００７３】すなわち、図５のステップＳ４においてシ
ョックプルーフメモリ領域５ａに記憶させた第１の音楽
データを、図８のステップＳ２１において、第１のＤＳ
Ｐ２１に取り込ませてバンドパスフィルタ２２を通過さ
せてフィルタ処理を行わせる。続いて、フィルタ処理さ
れた情報を第３のメモリ２３へ出力させて、この第３の
メモリ２３にフィルタ処理後のほぼボーカル情報のみで
構成された情報を記憶させる（ステップＳ２２）。その
後、図５に示したステップＳ５に進む。That is, in step S21 of FIG. 8, the first music data stored in the shock-proof memory area 5a in step S4 of FIG. 5 is taken into the first DSP 21 and passed through the band-pass filter 22 for filtering. The filtered information is then output to the third memory 23, which stores this information consisting almost entirely of vocal information (step S22). The process then proceeds to step S5 shown in FIG. 5.
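As a rough illustration of steps S21 and S22, the following Python sketch passes a buffer of samples through a second-order band-pass filter and keeps the result. The text does not specify the filter design, so the biquad coefficient form, the 8 kHz sample rate, the 1 kHz centre frequency, the Q value, and all function and variable names are assumptions made purely for illustration.

```python
import math

def bandpass_biquad(samples, fs, f0, q):
    """Second-order band-pass filter with unity gain at f0 (illustrative only)."""
    w0 = 2.0 * math.pi * f0 / fs
    alpha = math.sin(w0) / (2.0 * q)
    a0 = 1.0 + alpha
    b0, b2 = alpha / a0, -alpha / a0            # b1 is zero for this filter
    a1, a2 = -2.0 * math.cos(w0) / a0, (1.0 - alpha) / a0
    x1 = x2 = y1 = y2 = 0.0
    out = []
    for x in samples:
        y = b0 * x + b2 * x2 - a1 * y1 - a2 * y2
        out.append(y)
        x2, x1 = x1, x
        y2, y1 = y1, y
    return out

def rms(samples):
    return math.sqrt(sum(s * s for s in samples) / len(samples))

def tone(freq, fs, n):
    return [math.sin(2.0 * math.pi * freq * i / fs) for i in range(n)]

FS = 8000                        # assumed sample rate for the demo
in_band = tone(1000, FS, FS)     # stands in for vocal-band content
out_band = tone(100, FS, FS)     # stands in for low accompaniment content

# Step S21: filter the buffered data; step S22: keep the filtered result
# (the list plays the role of the third memory 23).
third_memory = bandpass_biquad(in_band, FS, f0=1000, q=1.0)
rejected = bandpass_biquad(out_band, FS, f0=1000, q=1.0)
```

A single band-pass stage like this only approximates vocal isolation, because accompaniment energy in the same band also passes; that is consistent with the text describing the stored information as consisting "almost entirely" of vocal information.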

【００７４】なお、図８には示していないが、第１のＣ
ＰＵ２４は、上記した第１の音楽データとしてのボーカ
ル入りの曲を再生する際の制御動作において、前記カラ
オケスイッチがオンされているか否かの判断も行う。該
カラオケスイッチがユ−ザによりオンされていないと判
断すると、前述の図５に示したステップＳ８、ステップ
Ｓ１１、ステップＳ１５における場合と同様に、第１の
読み取り部２が記録媒体１００から読み取ったボーカル
情報および伴奏情報を含む第１の音楽データを再生させ
る制御を行う。Although not shown in FIG. 8, the first CPU 24 also determines, in the control operation for reproducing a song with vocals as the first music data, whether the karaoke switch is on. If it determines that the user has not turned on the karaoke switch, it performs control to reproduce the first music data, which contains the vocal information and the accompaniment information read from the recording medium 100 by the first reading unit 2, in the same way as in steps S8, S11, and S15 shown in FIG. 5.

【００７５】一方、第１のＣＰＵ２４は、前記カラオケ
スイッチがユ−ザによりオンされていると判断すると、
第３のメモリ２３に記憶させたボーカル情報を用いてカ
ラオケ用の音楽データを作成するようにカラオケ曲作成
手段１７に指示を与える。そして、図５に示したステッ
プＳ８、ステップＳ１１、ステップＳ１５に対応するス
テップでは、ボーカル情報および伴奏情報を含む第１の
音楽データに替えて、カラオケ曲作成手段１７が作成し
たボーカル情報抜きの伴奏情報のみからなるカラオケ用
の音楽データを再生させる制御を行うことになる。On the other hand, when the first CPU 24 determines that the user has turned on the karaoke switch, it instructs the karaoke song creating means 17 to create karaoke music data using the vocal information stored in the third memory 23. Then, in the steps corresponding to steps S8, S11, and S15 shown in FIG. 5, it performs control to reproduce, in place of the first music data containing both vocal and accompaniment information, the karaoke music data created by the karaoke song creating means 17, which consists only of accompaniment information with the vocal information removed.
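The branch on the karaoke switch can be sketched as follows. This is a minimal Python illustration, assuming the first music data and the stored vocal information are sample-aligned lists of equal length; the function name `select_playback_data` and the sample values are hypothetical.

```python
def select_playback_data(karaoke_switch_on, first_music, vocal_info):
    """Return the samples to reproduce, depending on the karaoke switch."""
    if not karaoke_switch_on:
        # Switch off: reproduce the original mix (vocals plus accompaniment).
        return list(first_music)
    # Switch on: subtract the stored vocal information so that only the
    # accompaniment remains (the role of the karaoke song creating means 17).
    return [m - v for m, v in zip(first_music, vocal_info)]

accompaniment = [0.5, -0.25, 0.0, 0.25]
vocal = [0.25, 0.25, -0.5, 0.0]
first_music = [a + v for a, v in zip(accompaniment, vocal)]

normal = select_playback_data(False, first_music, vocal)
karaoke = select_playback_data(True, first_music, vocal)
```

In this idealised case the subtraction recovers the accompaniment exactly; with a real band-pass estimate of the vocals the result would only approximate it.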

【００７６】以上説明したように、実施の形態（２）に
係る音楽データ処理装置２０によれば、第１のＤＳＰ２
１のバンドパスフィルタ２２によって、第１の読み取り
部２で読み取られた第１の音楽データから、該第１の音
楽データに含まれたボーカル情報のみでほぼ構成された
情報を取り出すことができ、このボーカル情報から音声
認識手段１４によって文字情報を作成することができ
る。このため、第１の音楽データとしてのボーカル入り
の曲の歌詞が誤って音声認識されるといった事態の発生
確率を低減することができ、正確に認識された歌詞情報
を第１のディスプレイ１１に表示したり、第１のスピー
カ１３から合成音により出力することができる。As described above, with the music data processing device 20 according to embodiment (2), the band-pass filter 22 of the first DSP 21 can extract, from the first music data read by the first reading unit 2, information consisting almost entirely of the vocal information contained in that data, and the voice recognition means 14 can create character information from this vocal information. This reduces the probability that the lyrics of a song with vocals are misrecognized, so that accurately recognized lyrics information can be displayed on the first display 11 or output as synthesized speech from the first speaker 13.

【００７７】また、音楽データ処理装置２０では、第３
のメモリ２３及びカラオケ曲作成手段１７が装備されて
いることにより、バンドパスフィルタ２２により得られ
た情報を利用し、第１の音楽データに含まれる情報を元
にしてカラオケ用の音楽デ−タを作成することもでき
る。よって、シングル版のＣＤのようにカラオケ曲が記
録されていないことが多いＣＤ等の記録媒体１００から
第１の音楽データとしてのボーカル入りの曲を再生する
場合にも、簡単にカラオケ用の音楽デ−タを作成するこ
とが可能になるので、記録媒体１００を選ばなくてもカ
ラオケを楽しむことができる音楽データ処理装置２０を
提供することができる。Furthermore, because the music data processing device 20 is equipped with the third memory 23 and the karaoke song creating means 17, it can also use the information obtained by the band-pass filter 22 to create karaoke music data from the information contained in the first music data. Consequently, even when a song with vocals is reproduced as the first music data from a recording medium 100 such as a CD that, unlike a single CD, often has no karaoke track recorded on it, karaoke music data can be created easily. This makes it possible to provide a music data processing device 20 with which karaoke can be enjoyed regardless of the recording medium 100.

【００７８】なお、上記した実施の形態（２）に係る音
楽データ処理装置２０では、バンドパスフィルタ２２を
通過した情報を利用し、カラオケ曲を作成するカラオケ
曲作成手段１７を装備した例を説明したが、本発明はこ
の例に限定されるものではない。別の実施の形態に係る
音楽データ処理装置では、実施の形態（２）におけるバ
ンドパスフィルタ２２とは逆の動作、つまり第１の音楽
データに含まれているボーカル情報の周波数帯域の情報
のみを前記第１の音楽データから除去する動作を行うバ
ンドストップフィルタを、カラオケ曲作成手段として装
備し、この手段によって第１の音楽データに含まれてい
るボーカル情報を取り除くことによりカラオケ用の音楽
データを作成するように構成することも可能である。The music data processing device 20 according to embodiment (2) has been described as being equipped with the karaoke song creating means 17, which creates a karaoke song using the information that has passed through the band-pass filter 22, but the present invention is not limited to this example. A music data processing device according to another embodiment may instead be equipped, as its karaoke song creating means, with a band-stop filter that performs the opposite operation to the band-pass filter 22 of embodiment (2), that is, removes from the first music data only the information in the frequency band of the vocal information, so that karaoke music data is created by removing the vocal information contained in the first music data.
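The band-stop alternative can be sketched in Python with a second-order notch filter. The filter order, the 1 kHz notch frequency, the Q value, the sample rate, and the names below are all assumptions chosen for the demonstration; the text only states that the vocal frequency band is removed.

```python
import math

def bandstop_biquad(samples, fs, f0, q):
    """Second-order notch (band-stop) filter centred on f0 (illustrative only)."""
    w0 = 2.0 * math.pi * f0 / fs
    alpha = math.sin(w0) / (2.0 * q)
    a0 = 1.0 + alpha
    b0 = b2 = 1.0 / a0
    b1 = a1 = -2.0 * math.cos(w0) / a0          # b1 and a1 coincide for a notch
    a2 = (1.0 - alpha) / a0
    x1 = x2 = y1 = y2 = 0.0
    out = []
    for x in samples:
        y = b0 * x + b1 * x1 + b2 * x2 - a1 * y1 - a2 * y2
        out.append(y)
        x2, x1 = x1, x
        y2, y1 = y1, y
    return out

def rms(samples):
    return math.sqrt(sum(s * s for s in samples) / len(samples))

FS = 8000  # assumed sample rate
vocal_band = [math.sin(2.0 * math.pi * 1000 * i / FS) for i in range(FS)]
accomp_band = [math.sin(2.0 * math.pi * 100 * i / FS) for i in range(FS)]

removed = bandstop_biquad(vocal_band, FS, f0=1000, q=1.0)   # strongly attenuated
kept = bandstop_biquad(accomp_band, FS, f0=1000, q=1.0)     # passes almost unchanged
```

Compared with the subtraction approach of embodiment (2), a band-stop filter needs no third memory, at the cost of also removing any accompaniment that shares the vocal band.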

【００７９】次に、本発明の実施の形態（３）に係る音
楽データ処理装置を説明する。図９は実施の形態（３）
に係る音楽データ処理装置の概略構成を示すブロック図
である。図９に示した実施の形態（３）に係る音楽デー
タ処理装置３０の場合、実施の形態（１）に係る音楽デ
ータ処理装置１とは、実施の形態（１）における第１の
読み取り部２、第１のＤＳＰ６、第１のＣＰＵ８のそれ
ぞれに替えて、第２の読み取り部３１、第２のＤＳＰ３
２、第２のＣＰＵ３３が装備され、第４のメモリ３４が
追加装備されている点において相違している。Next, a music data processing device according to embodiment (3) of the present invention will be described. FIG. 9 is a block diagram showing the schematic configuration of the music data processing device according to embodiment (3). The music data processing device 30 according to embodiment (3) shown in FIG. 9 differs from the music data processing device 1 according to embodiment (1) in that a second reading unit 31, a second DSP 32, and a second CPU 33 replace the first reading unit 2, the first DSP 6, and the first CPU 8 of embodiment (1), respectively, and in that a fourth memory 34 is additionally provided.

【００８０】第２の読み取り部３１は、第２の先読み手
段を含むショックプルーフ手段の構成要素ともなってい
る。この第２の先読み手段は、記録媒体１００が、第１
の音楽データとしてのボーカル入りの曲と、そのカラオ
ケ用の音楽データ、つまり前記第１の音楽データの伴奏
情報のみとを含む第２の音楽データとを記録した、例え
ばシングル版ＣＤのようなものの場合において、これら
第１の音楽データ又は第２の音楽データを再生する際
に、第１の音楽データと第２の音楽データとをそれぞ
れ、再生のための通常の読み取り速度よりも高速で間欠
的に記録媒体１００から読み取るものとなっている。The second reading unit 31 is also a component of the shock-proof means, which includes the second pre-reading means. When the recording medium 100 is one, such as a single CD, on which both a song with vocals as the first music data and second music data consisting of its karaoke version, that is, only the accompaniment information of the first music data, are recorded, the second pre-reading means reads the first music data and the second music data from the recording medium 100 intermittently, each at a speed higher than the normal reading speed for reproduction, whenever the first or second music data is reproduced.

【００８１】第４のメモリ３４は、前記第２の先読み手
段によって先読みされ、処理回路部４を介して送られて
くる前記第２の音楽データを記憶する例えばＲＡＭ等で
構成されており、第１のメモリ５と同様のメモリ容量を
有するショックプルーフメモリ領域を含むものとなって
いる。The fourth memory 34 is composed of, for example, a RAM that stores the second music data pre-read by the second pre-reading means and sent via the processing circuit unit 4, and includes a shock-proof memory area with the same memory capacity as that of the first memory 5.

【００８２】第２のＤＳＰ３２は、第１のメモリ５より
送られてくる第１の音楽データと、第４のメモリ３４よ
り送られてくる第２の音楽データとから、前記第１の音
楽データに含まれている歌詞情報（文字情報）を獲得す
る等の処理を含むデジタル信号処理を行うものである。
ここでは、例えば図１０の概略構成ブロック図におい
て、ボーカル情報抽出手段３５、第３のメモリ２３、音
声認識手段１４、第１の音声合成手段１５及びディレイ
手段１６を含んで構成され、第１のメモリ５より送られ
てくる第１の音楽データが、ボーカル情報抽出手段３５
とディレイ手段１６とにそれぞれ入力されるとともに、
第２のメモリ３４より送られてくる第２の音楽データが
ボーカル情報抽出手段３５に入力されるように構成され
ている。The second DSP 32 performs digital signal processing, including processing to obtain the lyrics information (character information) contained in the first music data from the first music data sent from the first memory 5 and the second music data sent from the fourth memory 34. As shown for example in the schematic block diagram of FIG. 10, it comprises vocal information extracting means 35, the third memory 23, the voice recognition means 14, the first voice synthesis means 15, and the delay means 16. The first music data sent from the first memory 5 is input to both the vocal information extracting means 35 and the delay means 16, and the second music data sent from the fourth memory 34 is input to the vocal information extracting means 35.

【００８３】ボーカル情報抽出手段３５は、図１１の説
明図において、第２の読み取り部３１より読み取られた
第１の音楽データとしてのボーカル入りの曲（ａ）と、
前記第２の先読み手段により読み取られたカラオケ用の
第２の音楽データ（ｂ）との差を求めて前記第１の音楽
データに含まれているボーカル情報（ｃ）のみを抽出す
る例えば比較器で構成されている。As illustrated in the explanatory diagram of FIG. 11, the vocal information extracting means 35 is composed of, for example, a comparator that takes the difference between the song with vocals (a), read by the second reading unit 31 as the first music data, and the karaoke second music data (b), read by the second pre-reading means, and thereby extracts only the vocal information (c) contained in the first music data.
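The relationship (a) - (b) = (c) can be sketched in Python as a sample-by-sample difference. The sketch assumes that the two tracks are time-aligned and that the karaoke track carries exactly the same accompaniment as the vocal track; the function name `extract_vocal` and the integer sample values are hypothetical.

```python
def extract_vocal(with_vocal, karaoke):
    """Difference of the vocal track (a) and the karaoke track (b), giving (c)."""
    if len(with_vocal) != len(karaoke):
        raise ValueError("tracks must be aligned and of equal length")
    return [a - b for a, b in zip(with_vocal, karaoke)]

karaoke_track = [100, -200, 300, -400, 500]                       # (b) accompaniment only
true_vocal = [0, 50, -50, 25, 0]                                  # vocal part used to build (a)
vocal_track = [k + v for k, v in zip(karaoke_track, true_vocal)]  # (a)

vocal_only = extract_vocal(vocal_track, karaoke_track)            # (c)
```

In practice the two recordings may differ in level or timing, in which case the difference would leave accompaniment residue; handling that is outside what the text describes.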

【００８４】第２のＤＳＰ３２における第３のメモリ２
３は、実施の形態（２）における第３のメモリ２３（図
６）と同様に、入力された情報を記憶し、音声認識手段
１４へ出力するように構成されている。すなわち、ここ
では第３のメモリ２３は、ボーカル情報抽出手段３５が
抽出したボーカル情報を記憶した後、音声認識手段１４
へ出力するものとなっている。また、第２のＤＳＰ３２
における音声認識手段１４、第１の音声合成手段１５及
びディレイ手段１６は、実施の形態（１）における音声
認識手段１４、第１の音声合成手段１５及びディレイ手
段１６と同様に構成されている。The third memory 23 in the second DSP 32, like the third memory 23 in embodiment (2) (FIG. 6), stores the information input to it and outputs it to the voice recognition means 14. Here, the third memory 23 stores the vocal information extracted by the vocal information extracting means 35 and then outputs it to the voice recognition means 14. The voice recognition means 14, first voice synthesis means 15, and delay means 16 in the second DSP 32 are configured in the same way as those in embodiment (1).

【００８５】第２のＣＰＵ３３は、再生機構部３、第１
のメモリ５、第２のメモリ９の各部に接続され、これら
各部を実施の形態（１）に係る音楽データ処理装置１の
第１のＣＰＵ８と同様に制御するとともに、第４のメモ
リ３４を、第１のメモリ５と同様に制御するものとなっ
ている。また、前記第１の音楽データ中に含まれる曲の
歌詞の文字情報や、音声情報を作成するように第２のＤ
ＳＰ３２の制御を行うものとなっている。The second CPU 33 is connected to the reproduction mechanism unit 3, the first memory 5, and the second memory 9, and controls these units in the same manner as the first CPU 8 of the music data processing device 1 according to embodiment (1); it also controls the fourth memory 34 in the same manner as the first memory 5. In addition, it controls the second DSP 32 so as to create the character information of the lyrics of the song contained in the first music data and the voice information.

【００８６】図１２は、上記のごとく構成された音楽デ
ータ処理装置３０において、ボーカル入りの曲及びカラ
オケ曲が記録されている記録媒体１００から、第１の音
楽データとしてのボーカル入りの曲を再生する際に第２
のＣＰＵ３３が行う動作の一部を示したフローチャート
であり、ここでは上記実施の形態（１）に係る音楽デー
タ処理装置１における第１のＣＰＵ８が行う動作と相違
する部分のみを示している。図１２において第２のＣＰ
Ｕ３３は、図５に示したフローチャートのステップＳ
３、ステップＳ４に替えて、ステップＳ３１〜ステップ
Ｓ３６の動作を行うものとなっている。FIG. 12 is a flowchart showing part of the operation performed by the second CPU 33 when the music data processing device 30 configured as described above reproduces a song with vocals as the first music data from a recording medium 100 on which both the song with vocals and its karaoke version are recorded. Only the portions that differ from the operation performed by the first CPU 8 in the music data processing device 1 according to embodiment (1) are shown. In FIG. 12, the second CPU 33 performs steps S31 to S36 in place of steps S3 and S4 of the flowchart shown in FIG. 5.

【００８７】すなわち、第２のＣＰＵ３３は、図５のス
テップＳ２において、歌詞画面表示スイッチがオンさ
れ、かつカラオケ先生モードスイッチがオンされている
と判断すると、図１２のステップＳ３１に進んで、記録
媒体１００に記録されている第１の音楽データとしての
ボーカル入りの曲を、再生時における通常の読み取り速
度の３倍以上で先読みさせる制御を行うよう再生機構部
３に指示を与える。この指示を受けて、再生機構部３で
は第２の読み取り部３１を制御して前記第１の音楽デー
タの先読みを行わせ、先読みされた前記第１の音楽デー
タを処理回路部４から第１のメモリ５に送出させる。That is, when the second CPU 33 determines in step S2 of FIG. 5 that the lyrics screen display switch is on and the karaoke teacher mode switch is on, it proceeds to step S31 of FIG. 12 and instructs the reproduction mechanism unit 3 to pre-read the song with vocals recorded on the recording medium 100 as the first music data at three or more times the normal reading speed for reproduction. On receiving this instruction, the reproduction mechanism unit 3 controls the second reading unit 31 to pre-read the first music data, and the pre-read first music data is sent from the processing circuit unit 4 to the first memory 5.

【００８８】次に、ステップＳ３２において、処理回路
部４から送られてきた第１の音楽データの先読みされた
データを、第１のメモリ５のショックプルーフメモリ領
域５ａに記憶させる。そしてショックプルーフメモリ領
域５ａにおけるメモリ容量の上限値まで第１の音楽デー
タが蓄積されると、ステップＳ３３に進む。Next, in step S32, the pre-read first music data sent from the processing circuit unit 4 is stored in the shock-proof memory area 5a of the first memory 5. When the first music data has accumulated up to the upper limit of the memory capacity of the shock-proof memory area 5a, the process proceeds to step S33.

【００８９】次にステップＳ３３では、記録媒体１００
に記録されている第２の音楽データとしてのカラオケ曲
を、再生時における通常の読み取り速度の３倍以上で先
読みさせる制御を行うよう再生機構部３に指示を与え
る。この指示を受けて、再生機構部３では第２の読み取
り部３１を制御して前記第２の音楽データの先読みを行
わせ、先読みされた前記第２の音楽データを処理回路部
４から第４のメモリ３４に送出させる。Next, in step S33, the second CPU 33 instructs the reproduction mechanism unit 3 to pre-read the karaoke song recorded on the recording medium 100 as the second music data at three or more times the normal reading speed for reproduction. On receiving this instruction, the reproduction mechanism unit 3 controls the second reading unit 31 to pre-read the second music data, and the pre-read second music data is sent from the processing circuit unit 4 to the fourth memory 34.

【００９０】次に、ステップＳ３４において、処理回路
部４から送られてきた第２の音楽データの先読みされた
データを、第４のメモリ３４のショックプルーフメモリ
領域に記憶させる。そしてショックプルーフメモリ領域
におけるメモリ容量の上限値まで第２の音楽データが蓄
積されると、ステップＳ３５に進む。Next, in step S34, the pre-read second music data sent from the processing circuit unit 4 is stored in the shock-proof memory area of the fourth memory 34. When the second music data has accumulated up to the upper limit of the memory capacity of that shock-proof memory area, the process proceeds to step S35.

【００９１】そしてステップＳ３５において、第１のメ
モリ５のショックプルーフメモリ領域５ａに蓄積された
第１の音楽データと、第４のメモリ３４のショックプル
ーフ領域に蓄積された第２の音楽データとの差を求め、
第１の音楽データに含まれているボーカル情報のみを抽
出する制御を第２のＤＳＰ３２のボーカル情報抽出手段
３５において行わせる。Then, in step S35, the second CPU 33 causes the vocal information extracting means 35 of the second DSP 32 to take the difference between the first music data accumulated in the shock-proof memory area 5a of the first memory 5 and the second music data accumulated in the shock-proof memory area of the fourth memory 34, and to extract only the vocal information contained in the first music data.

【００９２】また図示を省略しているが、第２のＣＰＵ
３３は、第１のメモリ５に蓄積された第１の音楽デー
タ、第４のメモリ３４に蓄積された第２の音楽データ
が、第２のＤＳＰ３２への出力によって所定の値にまで
減少すると、ステップＳ３１に戻り、前回先読みした第
１の音楽データの続きの部分を先読みさせるための制御
を行うよう再生機構部３に指示を与える。Although not shown, when the first music data accumulated in the first memory 5 and the second music data accumulated in the fourth memory 34 have decreased to a predetermined value as a result of output to the second DSP 32, the second CPU 33 returns to step S31 and instructs the reproduction mechanism unit 3 to pre-read the portion of the first music data that follows the portion pre-read previously.
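The fill-and-refill behaviour of a shock-proof memory area can be sketched with a toy Python model. It is a simplification under stated assumptions: a single buffer instead of the two (first memory 5 and fourth memory 34) used here, playback that starts while the first fill is still in progress, arbitrary block units, and a read rate of exactly three times the playback rate. The class name `ShockProofBuffer` and its parameters are hypothetical.

```python
class ShockProofBuffer:
    """Toy model of a shock-proof memory area; units are arbitrary blocks."""

    def __init__(self, capacity, refill_level):
        self.capacity = capacity
        self.refill_level = refill_level
        self.level = 0
        self.reading = True              # start by pre-reading from the medium

    def tick(self, read_rate=3, play_rate=1):
        """Advance one time unit and return the number of blocks played."""
        if self.reading:
            self.level = min(self.capacity, self.level + read_rate)
            if self.level == self.capacity:
                self.reading = False     # upper limit reached: pause reading
        played = min(play_rate, self.level)
        self.level -= played
        if self.level <= self.refill_level:
            self.reading = True          # fell to the threshold: resume pre-reading
        return played

buf = ShockProofBuffer(capacity=12, refill_level=3)
played, levels = [], []
for _ in range(100):
    played.append(buf.tick())
    levels.append(buf.level)
```

Because reading is faster than playback, the buffer never runs dry in this model, which is the property that lets reproduction continue while the reading unit is interrupted or busy with the other track.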

【００９３】ステップＳ３５の後、ステップＳ３６に進
み、第２のＤＳＰ３２のボーカル情報抽出手段３５によ
り抽出されたボーカル情報を第３のメモリ２３に記憶さ
せる。そして、図５のステップＳ５に進んで、ボーカル
情報を音声認識して文字情報に変換するように音声認識
手段１４に指示を与える。After step S35, the process proceeds to step S36, in which the vocal information extracted by the vocal information extracting means 35 of the second DSP 32 is stored in the third memory 23. Then, the process proceeds to step S5 in FIG. 5 to instruct the voice recognition unit 14 to perform voice recognition on the vocal information and convert it into character information.

【００９４】なお、上記したように第２のＣＰＵ３３
は、図５に示したフローチャートのステップＳ３、ステ
ップＳ４に替えて、ステップＳ３１〜ステップＳ３６の
動作を行うものであることから、図５に示したフローチ
ャートのステップＳ１０、ステップＳ１３のそれぞれに
おいてステップＳ３、ステップＳ４と同じ動作を行う際
にも、これらステップＳ３、ステップＳ４に替えてステ
ップＳ３１〜ステップＳ３６の動作がなされることにな
る。As described above, the second CPU 33 performs steps S31 to S36 in place of steps S3 and S4 of the flowchart shown in FIG. 5. Accordingly, where steps S10 and S13 of the flowchart shown in FIG. 5 call for the same operation as steps S3 and S4, steps S31 to S36 are likewise performed in place of steps S3 and S4.

【００９５】以上説明したように、実施の形態（３）に
係る音楽データ処理装置３０によれば、第２のＤＳＰ３
２におけるボーカル情報抽出手段３５により、第１の音
楽データとしてのボーカル入り曲と第２の音楽データと
してのカラオケ曲との差からボーカル情報が抽出され、
この抽出されたボーカル情報が音声認識されることによ
り、前記第１の音楽データに含まれるボーカル入りの曲
の歌詞の文字情報が得られるので、歌詞情報が誤って音
声認識されるといった事態の発生確率を確実に低減する
ことができる。よって、第１の音楽データを再生しつつ
正確な歌詞情報をより一層高い確率で第１のディスプレ
イ１１に画面表示したり、第１のスピーカ１３から音声
出力させることができる。As described above, with the music data processing device 30 according to embodiment (3), the vocal information extracting means 35 of the second DSP 32 extracts the vocal information from the difference between the song with vocals (first music data) and the karaoke song (second music data), and the character information of the lyrics of the song with vocals contained in the first music data is obtained by applying voice recognition to this extracted vocal information. The probability that the lyrics information is misrecognized can therefore be reliably reduced. As a result, accurate lyrics information can be displayed on the first display 11, or output as speech from the first speaker 13, with an even higher probability while the first music data is being reproduced.

【００９６】なお、実施の形態（３）に係る音楽データ
処理装置３０では、第１のメモリ５が第２のＤＳＰ３２
のディレイ手段１６に接続されている場合を例に挙げて
説明したが、本発明はこの例に限定されるものではな
い。例えば、別の実施の形態に係る音楽データ処理装置
では、第２のＤＳＰ３２のディレイ手段１６には、第１
のメモリ５と第４のメモリ３４とがそれぞれ切り換え手
段を介して接続され、該切り換え手段による切り換えに
よって第１のメモリ５からの第１の音楽データが、又は
第４のメモリ３４からの第２の音楽データがディレイ手
段１６に入力されるように構成されていてもよい。In the music data processing device 30 according to embodiment (3), the case where the first memory 5 is connected to the delay means 16 of the second DSP 32 has been described as an example, but the present invention is not limited to this. For example, in a music data processing device according to another embodiment, the first memory 5 and the fourth memory 34 may each be connected to the delay means 16 of the second DSP 32 via switching means, so that, depending on the state of the switching means, either the first music data from the first memory 5 or the second music data from the fourth memory 34 is input to the delay means 16.

【００９７】このような別の実施の形態に係る音楽デー
タ処理装置では、前記第１の音楽データを再生しつつ正
確な歌詞情報を出力させることができるばかりでなく、
第２の音楽データを再生しつつ、つまりカラオケ曲を演
奏させつつ正確な歌詞情報を出力させることも可能なも
のとなる。Such a music data processing device can output accurate lyrics information not only while reproducing the first music data but also while reproducing the second music data, that is, while a karaoke song is being played.

【００９８】次に、本発明の実施の形態（４）に係る音
楽データ処理装置を説明する。図１３は実施の形態
（４）に係る音楽データ処理装置の概略構成を示すブロ
ック図である。図１３において、実施の形態（４）に係
る音楽データ処理装置４０が、実施の形態（３）に係る
音楽データ処理装置３０と相違するところは、実施の形
態（３）における第２の読み取り部３１、第２のＤＳＰ
３２、第２のＣＰＵ３３、第１の操作部７、第１のディ
スプレイ１１、第１のスピーカのそれぞれに替えて、第
３の読み取り部４１、第３のＤＳＰ４２、第３のＣＰＵ
４３、第２の操作部４６、第２のディスプレイ４７、第
２のスピーカ４８が装備され、第５のメモリ４４、切り
換え手段４５が追加装備されている点にある。Next, a music data processing device according to embodiment (4) of the present invention will be described. FIG. 13 is a block diagram showing the schematic configuration of the music data processing device according to embodiment (4). In FIG. 13, the music data processing device 40 according to embodiment (4) differs from the music data processing device 30 according to embodiment (3) in that a third reading unit 41, a third DSP 42, a third CPU 43, a second operation unit 46, a second display 47, and a second speaker 48 replace the second reading unit 31, the second DSP 32, the second CPU 33, the first operation unit 7, the first display 11, and the first speaker of embodiment (3), respectively, and in that a fifth memory 44 and switching means 45 are additionally provided.

【００９９】第３の読み取り部４１は、ショックプルー
フ手段の構成要素となっており、また第１の音楽データ
としてのボーカル入りの曲と、この第１の音楽データに
含まれる歌詞の文字情報とを記録した、例えばＤＶＤの
ような記録媒体１００から、第１の音楽データや該第１
の音楽データに含まれる文字情報を読み取る、本発明に
おける第１の読み取り手段の構成要素ともなっている。The third reading unit 41 is a component of the shock-proof means. It is also a component of the first reading means of the present invention, which reads the first music data and the character information contained in it from a recording medium 100, such as a DVD, on which a song with vocals as the first music data and the character information of the lyrics contained in that first music data are recorded.

【０１００】第３のＤＳＰ４２は、実施の形態（２）に
おける第１のＤＳＰ２１と、実施の形態（３）における
第２のＤＳＰ３２とが組み合わされて構成されたものと
なっている。すなわち、第３のＤＳＰ４２は、図１４の
概略構成ブロック図において、バンドパスフィルタ２２
と、ボーカル情報抽出手段３５と、第３のメモリ２３
と、音声認識手段１４と、第２の音声合成手段４９と、
ディレイ手段１６とを含んで構成されている。そして、
バンドパスフィルタ２２及び音声認識手段１４を用いた
実施の形態（２）に係る場合と同様の音楽データ処理手
段（以下、実施の形態（２）の処理手段と記す）と、ボ
ーカル情報抽出手段３５及び音声認識手段１４を用いた
実施の形態（３）に係る場合と同様の音楽データ処理手
段（以下、実施の形態（３）の処理手段と記す）とのい
ずれかにより、音楽データに含まれる曲の歌詞の文字情
報を獲得できるように構成されている。The third DSP 42 combines the first DSP 21 of embodiment (2) with the second DSP 32 of embodiment (3). As shown in the schematic block diagram of FIG. 14, it comprises the band-pass filter 22, the vocal information extracting means 35, the third memory 23, the voice recognition means 14, second voice synthesis means 49, and the delay means 16. It is configured so that the character information of the lyrics of the song contained in the music data can be obtained by either of two music data processing means: one using the band-pass filter 22 and the voice recognition means 14, as in embodiment (2) (hereinafter, the processing means of embodiment (2)), and one using the vocal information extracting means 35 and the voice recognition means 14, as in embodiment (3) (hereinafter, the processing means of embodiment (3)).

【０１０１】また第２の音声合成手段４９は、音声認識
手段１４から送られてくる文字情報に基づき、第１の音
楽データに含まれる曲の歌詞情報を音声合成して歌詞情
報の音声情報化を図るものである。本実施の形態（４）
においても、第２の音声合成手段４９が、第３のＣＰＵ
４３の指示にしたがって、再生する音楽データに含まれ
る曲のフレーズの演奏直前に第２のスピーカ４８から合
成音で音声情報化された歌詞情報が音声出力される（読
み上げられる）ように、音声合成した歌詞情報をディレ
イ手段１６の出力側に出力するようになっている。The second voice synthesis means 49 synthesizes speech for the lyrics information of the song contained in the first music data, based on the character information sent from the voice recognition means 14, thereby converting the lyrics information into voice information. In embodiment (4) as well, the second voice synthesis means 49, following instructions from the third CPU 43, outputs the synthesized lyrics information to the output side of the delay means 16 so that the lyrics are output (read aloud) as synthesized speech from the second speaker 48 immediately before the corresponding phrase of the song in the music data being reproduced is played.

【０１０２】切り換え手段４５は、第１のメモリ５及び
第４のメモリ３４と、第３のＤＳＰ４２との間に介装さ
れた、例えば切り換えスイッチで構成されている。そし
て、第３のＣＰＵ４３の指示に基づき、第３のＤＳＰ４
２において音楽データを処理する手段として、バンドパ
スフィルタ手段２２及び音声認識手段１４を用いた処理
手段と、ボーカル情報抽出手段３５及び音声認識手段１
４を用いた処理手段とのどちらかに切り換えられるよう
になっている。The switching means 45 is composed of, for example, a changeover switch interposed between the first memory 5 and fourth memory 34 on one side and the third DSP 42 on the other. Based on instructions from the third CPU 43, it switches the means used to process music data in the third DSP 42 between the processing means using the band-pass filter 22 and the voice recognition means 14 and the processing means using the vocal information extracting means 35 and the voice recognition means 14.

【０１０３】第３のＣＰＵ４３は、記録媒体１００の種
類又は該記録媒体１００における記録内容に応じて、再
生する音楽データに含まれる曲の歌詞の文字情報を得る
ための手段を自動的に選択する選択手段４３ａを含んで
構成されている。例えば図１４に示すごとく、この選択
手段４３ａは、記録媒体１００の種類に応じて、前記歌
詞の文字情報を得るための手段を選択する種別対応選択
手段４３ａ₁と、記録媒体１００に記録された曲の全て
について記録内容を比較することにより、前記歌詞の文
字情報を得るための手段を選択する比較判断手段４３ａ
₂とを含むものとなっている。The third CPU 43 includes selection means 43a, which automatically selects the means for obtaining the character information of the lyrics of the song contained in the music data to be reproduced, according to the type of the recording medium 100 or the content recorded on it. As shown for example in FIG. 14, the selection means 43a includes type-based selection means 43a₁, which selects the means for obtaining the lyrics character information according to the type of the recording medium 100, and comparison judgment means 43a₂, which selects that means by comparing the recorded content of all the songs recorded on the recording medium 100.

【０１０４】なお、上述したように、本実施の形態
（４）に係る音楽データ処理装置４０は、再生する音楽
データに含まれる曲の歌詞の文字情報を得るための手段
として、予め歌詞の文字情報が記録された記録媒体１０
０から文字情報を直接読み取る第１の読み取り手段と、
上記した実施の形態（２）に係る処理手段と同様の処理
手段と、上記した実施の形態（３）に係る処理手段と同
様の処理手段とを装備している。また、記録媒体１００
の一つであるＤＶＤには、予め歌詞の文字情報が通常記
録されており、他方、アルバム版のＣＤやＭＤには、全
曲がボーカル入りの曲で記録されているものが多い。ま
たシングル版のＣＤの多くには、ボーカル入りの曲とカ
ラオケ用の曲とが記録されている。As described above, the music data processing device 40 according to embodiment (4) is equipped with three means for obtaining the character information of the lyrics of the song contained in the music data to be reproduced: the first reading means, which reads character information directly from a recording medium 100 on which lyrics character information has been recorded in advance; processing means similar to that of embodiment (2); and processing means similar to that of embodiment (3). As for the recording media 100, lyrics character information is usually recorded in advance on DVDs, whereas on many album CDs and MDs every track is a song with vocals. Many single CDs, in turn, contain both a song with vocals and its karaoke version.

【０１０５】選択手段４３ａを構成する種別対応選択手
段４３ａ₁は、例えば音楽データ処理装置４０で使用さ
れる可能性のある記録媒体１００としてＤＶＤ、ＣＤ
（アルバム版、シングル版）、ＭＤの３種類が設定され
ている場合、再生処理する記録媒体１００がＤＶＤ、Ｃ
Ｄ、ＭＤのいずれであるかにより、音楽データに含まれ
る曲の歌詞の文字情報を得るための手段として第１の読
み取り手段と、上記した実施の形態（２）に係る処理手
段と同様の処理手段と、上記した実施の形態（３）に係
る処理手段と同様の処理手段との少なくとも１つを選択
するものとなっている。When, for example, three types of recording medium 100 are set as possibly being used in the music data processing device 40, namely DVD, CD (album or single), and MD, the type-based selection means 43a₁ of the selection means 43a selects, according to whether the recording medium 100 to be reproduced is a DVD, a CD, or an MD, at least one of the first reading means, processing means similar to that of embodiment (2), and processing means similar to that of embodiment (3) as the means for obtaining the character information of the lyrics of the song contained in the music data.

【０１０６】また、選択手段４３ａを構成する比較判断
手段４３ａ₂は、記録媒体１００がＣＤであることによ
り、種別対応選択手段４３ａ₁が、上記した実施の形態
（２）に係る処理手段と同様の処理手段及び上記した実
施の形態（３）に係る処理手段と同様の処理手段の両方
を選択した場合、再生する第１の音楽データに対応する
カラオケ用の曲の第２の音楽データが、記録媒体１００
に含まれているか否かの判断を、再生する第１の音楽デ
ータと、記録媒体１００に記録されている全ての音楽デ
ータとを、曲の頭から数秒間分だけ比較することにより
行うものとなっている。そして、比較結果に基づき、上
記した実施の形態（２）に係る処理手段と同様の処理手
段あるいは上記した実施の形態（３）に係る処理手段と
同様の処理手段のいずれかを選択し、選択した処理手段
側に音楽データが入力されるように切り換え手段４５を
制御するようになっている。When the recording medium 100 is a CD and the type-based selection means 43a₁ has therefore selected both the processing means similar to that of embodiment (2) and the processing means similar to that of embodiment (3), the comparison judgment means 43a₂ of the selection means 43a determines whether second music data, that is, a karaoke version corresponding to the first music data to be reproduced, is present on the recording medium 100. It does so by comparing the first music data to be reproduced with all the music data recorded on the recording medium 100 over the first few seconds of each song. Based on the result, it selects either the processing means similar to that of embodiment (2) or the processing means similar to that of embodiment (3), and controls the switching means 45 so that the music data is input to the selected processing means.
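One way such a comparison of song openings might be carried out is sketched below in Python. The text does not say how the comparison is scored, so the cosine-similarity measure, the 0.9 threshold, the assumption that the opening seconds are accompaniment only, and the names `similarity` and `find_karaoke_track` are all illustrative assumptions.

```python
import math

def similarity(a, b):
    """Cosine similarity of two sample sequences (the shorter length is used)."""
    n = min(len(a), len(b))
    dot = sum(x * y for x, y in zip(a[:n], b[:n]))
    norm_a = math.sqrt(sum(x * x for x in a[:n]))
    norm_b = math.sqrt(sum(y * y for y in b[:n]))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)

def find_karaoke_track(target, intros, threshold=0.9):
    """Return the index of the best-matching other track, or None if none match."""
    best_index, best_score = None, threshold
    for index, intro in enumerate(intros):
        if index == target:
            continue
        score = similarity(intros[target], intro)
        if score >= best_score:
            best_index, best_score = index, score
    return best_index

# Opening samples of each track, standing in for the contents of the fifth memory 44.
intros = [
    [1.0, 2.0, 3.0, 4.0],     # track 0: song with vocals (intro)
    [4.0, -3.0, 2.0, -1.0],   # track 1: unrelated song
    [1.0, 2.0, 3.0, 4.1],     # track 2: karaoke version of track 0
]
match_for_0 = find_karaoke_track(0, intros)
match_for_1 = find_karaoke_track(1, intros)
```

A match would lead to the difference-based processing of embodiment (3); no match would fall back to the band-pass processing of embodiment (2).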

【０１０７】よって、選択手段４３ａは、例えば図１６
の説明図に示すように、記録媒体１００がアルバム版の
ＣＤあるいはＭＤである場合に実施の形態（２）に係る
処理手段と同様の処理手段を選択し、記録媒体１００が
ＤＶＤの場合に第１の読み取り手段を選択し、記録媒体
１００がシングル版のＣＤである場合に実施の形態
（３）に係る処理手段と同様の処理手段を選択するよう
に構成されたものとなっている。Thus, as shown for example in the explanatory diagram of FIG. 16, the selection means 43a is configured to select the processing means similar to that of embodiment (2) when the recording medium 100 is an album CD or an MD, the first reading means when the recording medium 100 is a DVD, and the processing means similar to that of embodiment (3) when the recording medium 100 is a single CD.
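The selection rule summarised above can be written as a small decision function. In this Python sketch the enum names, the function `select_method`, and the `has_karaoke_track` flag (standing in for the result of the comparison judgment means 43a₂, which in effect distinguishes a single CD from an album CD) are hypothetical.

```python
from enum import Enum

class Medium(Enum):
    DVD = "DVD"
    CD = "CD"
    MD = "MD"

class Method(Enum):
    READ_TEXT = "first reading means (lyrics text recorded on the medium)"
    BANDPASS = "processing means of embodiment (2)"
    DIFFERENCE = "processing means of embodiment (3)"

def select_method(medium, has_karaoke_track=False):
    """Pick the means of obtaining lyrics text for the given medium."""
    if medium is Medium.DVD:
        return Method.READ_TEXT
    if medium is Medium.CD and has_karaoke_track:
        return Method.DIFFERENCE      # e.g. a single CD with a karaoke version
    return Method.BANDPASS            # album CD or MD

dvd_choice = select_method(Medium.DVD)
single_cd_choice = select_method(Medium.CD, has_karaoke_track=True)
album_cd_choice = select_method(Medium.CD, has_karaoke_track=False)
md_choice = select_method(Medium.MD)
```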

【０１０８】また第３のＣＰＵ４３は、通常の再生処理
のための制御を行うとともに、選択手段４３ａの選択に
基づき、第１の読み取り手段又は上記した実施の形態
（２）に係る処理手段と同様の処理手段又は上記した実
施の形態（３）に係る処理手段と同様の処理手段によっ
て歌詞の文字情報が得られるように、再生機構部３、第
１のメモリ５、第２のメモリ９、第４のメモリ３４、第
３のＤＳＰ４２、第５のメモリ４４等を制御するものと
なっている。さらに、選択手段４３ａの比較判断手段４
３ａ₂の比較判断処理に用いる情報を入手すべく再生機
構部３等を制御するように構成されている。The third CPU 43 performs control for normal reproduction processing and, based on the selection made by the selection means 43a, controls the reproduction mechanism unit 3, the first memory 5, the second memory 9, the fourth memory 34, the third DSP 42, the fifth memory 44, and so on, so that the lyrics character information is obtained by the first reading means, by the processing means similar to that of embodiment (2), or by the processing means similar to that of embodiment (3). It is further configured to control the reproduction mechanism unit 3 and other units so as to obtain the information used in the comparison judgment processing of the comparison judgment means 43a₂ of the selection means 43a.

【０１０９】第５のメモリ４４は、第３のＣＰＵ４３に
おける選択手段４３ａの比較判断手段４３ａ₂が行う比
較判断処理に用いる情報を記憶するものである。例え
ば、記録媒体１００がＣＤである場合に、ＣＤに記録さ
れている全ての曲について、第３の読み取り部４１が曲
の頭から数秒間の音楽データをスキャンすることによっ
て得られた情報を記憶するようになっている。The fifth memory 44 stores the information used in the comparison judgment processing performed by the comparison judgment means 43a₂ of the selection means 43a in the third CPU 43. For example, when the recording medium 100 is a CD, it stores the information obtained when the third reading unit 41 scans the first few seconds of music data of every song recorded on the CD.

【０１１０】第２の操作部４６は、ユーザが音楽データ
処理装置４０への操作信号を入力するためのものであ
り、第３のＣＰＵ４３に接続され、例えばスイッチ、キ
ー、ボタンあるいはタッチパネル等の手動入力手段やマ
イク等の音声入力手段を含んで構成されている。手動入
力手段としては、例えば記録媒体１００に収録されてい
る音楽データを再生するように指示するための通常のス
イッチ（以下、再生用スイッチと記す）の他に、音楽デ
ータの歌詞情報を画面表示するように指示するためのス
イッチ（以下、歌詞画面表示スイッチと記す）、歌詞情
報を通常の再生出力の少し前に読み上げるように指示す
るためのスイッチ（カラオケ先生モードスイッチと記
す）、ボーカル入りの曲からカラオケ用の音楽データを
作成して再生するように、ユーザが第３のＣＰＵ４３に
指示するためのカラオケスイッチ（図示せず）を備えた
ものとなっている。またこれらのスイッチ操作を、前記
音声入力手段への音声入力によっても行えるように構成
されている。The second operation unit 46 allows the user to input operation signals to the music data processing device 40. It is connected to the third CPU 43 and comprises manual input means such as switches, keys, buttons, or a touch panel, and voice input means such as a microphone. The manual input means include, for example, an ordinary switch for instructing reproduction of the music data recorded on the recording medium 100 (hereinafter, the reproduction switch), a switch for instructing that the lyrics information of the music data be displayed on screen (hereinafter, the lyrics screen display switch), a switch for instructing that the lyrics information be read aloud slightly ahead of the normal reproduction output (hereinafter, the karaoke teacher mode switch), and a karaoke switch (not shown) with which the user instructs the third CPU 43 to create karaoke music data from a song with vocals and reproduce it. These switch operations can also be performed by voice input through the voice input means.

【０１１１】第２のディスプレイ４７は、音声認識手段
１４により得られ、表示ドライバ１０を介して送られて
きた文字情報の画像信号を画面表示するようになってい
る。第２のスピーカ４８は、第３の読み取り部４１にお
いて読み取られ、音声認識手段１４により認識された文
字情報を基に、第２の音声合成手段４９によって音声合
成された歌詞情報を音声出力し、また、通常の再生処理
における音声を出力するようになっている。[0111] The second display 47 displays on screen the image signal of the character information obtained by the voice recognition means 14 and sent via the display driver 10. The second speaker 48 outputs, as audio, the lyrics information synthesized by the second voice synthesis means 49 from the character information that was read by the third reading unit 41 and recognized by the voice recognition means 14. It also outputs the audio of normal reproduction processing.

【０１１２】次に、上記のごとく構成された音楽データ
処理装置４０において、記録媒体１００からの音楽デー
タを再生しつつ該音楽データに含まれる曲の歌詞情報を
出力する際の第３のＣＰＵ４３が行う動作を、図１７に
示すフローチャートを用いて説明する。[0112] Next, the operation performed by the third CPU 43 when the music data processing device 40 configured as described above reproduces music data from the recording medium 100 while outputting the lyrics information of the song contained in that music data will be described with reference to the flowchart shown in FIG. 17.

【０１１３】ユーザによりある曲の再生用スイッチがオ
ンされ、さらに前記歌詞画面表示スイッチあるいは前記
カラオケ先生モードスイッチの少なくとも一方がオンさ
れると、ステップＳ４１において、音楽データを再生す
る記録媒体１００を調査する。次いでこの調査結果に基
づいて、記録媒体１００の種類を、ＤＶＤ、ＣＤ、ＭＤ
の中から選択手段４３ａが判断する（ステップＳ４
２）。[0113] When the user turns on the reproduction switch for a song and also turns on at least one of the lyrics screen display switch and the karaoke teacher mode switch, the recording medium 100 from which the music data is to be reproduced is examined in step S41. Based on the result of this examination, the selecting means 43a then determines whether the recording medium 100 is a DVD, a CD, or an MD (step S42).

【０１１４】ステップＳ４２において記録媒体１００が
ＤＶＤであると判断すると、次いでステップＳ４３にお
いて、再生する音楽データに対応する文字情報をＤＶＤ
ディスクから読み取るように前記第１の読み取り手段に
指示を与える。そして、ステップＳ４４に示すように、
第１の読み取り手段により読み取られた文字情報を基に
歌詞情報を出力させる。[0114] If the recording medium 100 is determined to be a DVD in step S42, then in step S43 the first reading means is instructed to read the character information corresponding to the music data to be reproduced from the DVD disc. Then, as shown in step S44, lyrics information is output based on the character information read by the first reading means.

【０１１５】その際には、ユーザが歌詞画面表示スイッ
チ、カラオケ先生モードスイッチを操作することによっ
て入力した指示信号に従い、前記読み取られた文字情報
に基づく歌詞を第２のディスプレイ４７に画面表示さ
せ、又は第３のＤＳＰ４２における第２の音声合成手段
４９に、前記読み取られた文字情報から歌詞を音声合成
させて第２のスピーカ４８から音声出力させ、又は歌詞
を第２のディスプレイ４７に画面表示させるとともに第
２のスピーカ４８から音声出力させる制御を行う。[0115] At this time, in accordance with the instruction signal that the user inputs by operating the lyrics screen display switch and the karaoke teacher mode switch, control is performed so that the lyrics based on the read character information are displayed on the second display 47; or so that the second voice synthesis means 49 in the third DSP 42 synthesizes the lyrics from the read character information and the second speaker 48 outputs them as audio; or so that the lyrics are both displayed on the second display 47 and output as audio from the second speaker 48.

【０１１６】なお、第３のＣＰＵ４３は、ステップＳ４
４において歌詞情報を出力させる際には、音楽データも
再生させる。またそのときには、音楽データの再生によ
り出力される曲のフレーズに合わせて歌詞が画面表示さ
れ、また１フレーズ分の音楽データの再生直前に歌詞が
合成音で読み上げられるように制御を行う。[0116] When outputting the lyrics information in step S44, the third CPU 43 also reproduces the music data. At that time, it performs control so that the lyrics are displayed on screen in step with the phrases of the song output by reproducing the music data, and so that the lyrics are read aloud in a synthesized voice immediately before each phrase of music data is reproduced.
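
The timing relationship described in this paragraph can be illustrated with a small sketch. It is not taken from the specification: the function name `schedule_phrase_events`, the `(start_time, text)` phrase format, and the `lead_s` lead time are all assumptions made for illustration, and the specification says only that the reading occurs "immediately before" each phrase without giving a value.

```python
def schedule_phrase_events(phrases, lead_s=2.0):
    """Build a time-ordered list of lyric output events.

    phrases: list of (start_time_s, lyric_text) pairs (hypothetical format).
    lead_s:  assumed lead time for the synthesized reading before each phrase.
    Returns a list of (time_s, action, text) tuples sorted by time.
    """
    events = []
    for start, text in phrases:
        # Read the phrase aloud shortly before it is played.
        events.append((max(0.0, start - lead_s), "speak", text))
        # Show the phrase on screen when its playback starts.
        events.append((start, "display", text))
    events.sort(key=lambda e: e[0])
    return events
```

For example, a phrase starting at 5.0 s would be spoken at 3.0 s and displayed at 5.0 s under the assumed 2-second lead.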

【０１１７】一方、ステップＳ４２において、選択手段
４３ａが、記録媒体１００がＣＤであると判断すると、
ステップＳ４５に進み、ＣＤに記録されている全ての曲
の音楽データを、音楽データの頭から数秒間、第３の読
み取り部４１にスキャンさせるための指示を再生機構部
３に与える。次いで、スキャンによって得た情報を第５
のメモリ４４に記憶させる（ステップＳ４６）。その
後、ステップＳ４７において、第５のメモリ４４に記憶
されたスキャン情報を基に、ＣＤ内に同じ曲に関してボ
ーカル入りのものとカラオケ用のものが記録されている
か否かを判断する。[0117] On the other hand, if the selecting means 43a determines in step S42 that the recording medium 100 is a CD, the process proceeds to step S45, where the reproduction mechanism unit 3 is instructed to have the third reading unit 41 scan the first few seconds of the music data of every song recorded on the CD. The information obtained by the scan is then stored in the fifth memory 44 (step S46). Thereafter, in step S47, it is determined, based on the scan information stored in the fifth memory 44, whether the CD contains both a version with vocals and a karaoke version of the same song.

【０１１８】ステップＳ４７において、ＣＤ内に同じ曲
に関してボーカル入りのものとカラオケ用のものとが記
録されていると判断すると、再生する音楽データに含ま
れる曲の歌詞の文字情報を得るための手段として上記し
た実施の形態（３）に係る処理手段と同様の処理手段を
選択する。そして、ステップＳ４８に示すように、実施
の形態（３）における第２のＣＰＵ３３と同様の制御動
作（図１２参照）を行って、上記した実施の形態（３）
に係る処理手段と同様の処理手段に、再生する第１の音
楽データに含まれる曲の歌詞の文字情報を獲得させ、獲
得された文字情報から歌詞情報を出力させる。このと
き、第１のメモリ５から第１の音楽データが、また第４
のメモリ３５から第２の音楽データがそれぞれ、上記し
た実施の形態（３）に係る処理手段と同様の処理手段に
入力されるように切り換え手段４５にスイッチ切り換え
の指示を与える。[0118] If it is determined in step S47 that the CD contains both a version with vocals and a karaoke version of the same song, processing means similar to that of embodiment (3) described above is selected as the means for obtaining the character information of the lyrics of the song contained in the music data to be reproduced. Then, as shown in step S48, a control operation similar to that of the second CPU 33 in embodiment (3) (see FIG. 12) is performed, causing this processing means to acquire the character information of the lyrics of the song contained in the first music data to be reproduced and to output lyrics information from the acquired character information. At this time, the switching means 45 is instructed to switch so that the first music data from the first memory 5 and the second music data from the fourth memory 35 are each input to this processing means.

【０１１９】上記ステップＳ４８においても、ユーザが
歌詞画面表示スイッチ、カラオケ先生モードスイッチを
操作することによって入力した指示信号に従い、前記獲
得された文字情報に基づく歌詞を第２のディスプレイ４
７や第２のスピーカ４８に出力させる制御を行う。[0119] In step S48 as well, control is performed so that the lyrics based on the acquired character information are output to the second display 47 and/or the second speaker 48, in accordance with the instruction signal that the user inputs by operating the lyrics screen display switch and the karaoke teacher mode switch.

【０１２０】また、ステップＳ４７において、ＣＤ内に
同じ曲に関してボーカル入りのものとカラオケ用のもの
が記録されていないと判断すると、再生する音楽データ
に含まれる曲の歌詞の文字情報を得るための手段として
上記した実施の形態（２）に係る処理手段と同様の処理
手段を選択する。そして、ステップＳ４９に示すよう
に、上記した実施の形態（２）における第１のＣＰＵ２
４と同様の制御動作（図５参照）を行って、上記した実
施の形態（２）に係る処理手段と同様の処理手段に、再
生する第１の音楽データに含まれる曲の歌詞の文字情報
を獲得させ、獲得された文字情報から歌詞を出力させ
る。このとき、第１のメモリ５から第１の音楽データが
上記した実施の形態（２）に係る処理手段と同様の処理
手段に入力されるように切り換え手段４５にスイッチ切
り換えの指示を与える。[0120] If it is determined in step S47 that the CD does not contain both a version with vocals and a karaoke version of the same song, processing means similar to that of embodiment (2) described above is selected as the means for obtaining the character information of the lyrics of the song contained in the music data to be reproduced. Then, as shown in step S49, a control operation similar to that of the first CPU 24 in embodiment (2) (see FIG. 5) is performed, causing this processing means to acquire the character information of the lyrics of the song contained in the first music data to be reproduced and to output the lyrics from the acquired character information. At this time, the switching means 45 is instructed to switch so that the first music data from the first memory 5 is input to this processing means.

【０１２１】上記ステップＳ４９においても、ユーザが
歌詞画面表示スイッチ、カラオケ先生モードスイッチを
操作することによって入力した指示信号に従い、前記獲
得された文字情報に基づく歌詞を第２のディスプレイ４
７や第２のスピーカ４８に出力させる制御を行う。[0121] In step S49 as well, control is performed so that the lyrics based on the acquired character information are output to the second display 47 and/or the second speaker 48, in accordance with the instruction signal that the user inputs by operating the lyrics screen display switch and the karaoke teacher mode switch.

【０１２２】また、ステップＳ４２において、選択手段
４３ａが、記録媒体１００がＭＤであると判断した場合
にも、ステップＳ４９に進み、上記した実施の形態
（２）における第１のＣＰＵ２４と同様の制御動作（図
５参照）を行って、上記した実施の形態（２）に係る処
理手段と同様の処理手段に、再生する第１の音楽データ
に含まれる曲の歌詞の文字情報を獲得させ、獲得された
文字情報から歌詞を出力させることになる。[0122] Likewise, if the selecting means 43a determines in step S42 that the recording medium 100 is an MD, the process proceeds to step S49, where a control operation similar to that of the first CPU 24 in embodiment (2) (see FIG. 5) is performed, causing processing means similar to that of embodiment (2) to acquire the character information of the lyrics of the song contained in the first music data to be reproduced and to output the lyrics from the acquired character information.
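
The branch structure of steps S42 to S49 can be summarized in a short sketch. The Python below is illustrative only: the names `Medium`, `LyricsSource`, and `select_lyrics_source` are hypothetical and do not appear in the specification, and the boolean `has_vocal_and_karaoke_pair` merely stands in for the result of the scan comparison in steps S45 to S47, whose details are not modeled here.

```python
from enum import Enum


class Medium(Enum):
    DVD = "DVD"
    CD = "CD"
    MD = "MD"


class LyricsSource(Enum):
    # Hypothetical labels for the three ways of obtaining character information.
    RECORDED_TEXT = "character information read from the disc (S43, S44)"
    EMBODIMENT_3 = "processing as in embodiment (3) (S48)"
    EMBODIMENT_2 = "processing as in embodiment (2) (S49)"


def select_lyrics_source(medium, has_vocal_and_karaoke_pair=False):
    """Mirror the decisions of steps S42 and S47."""
    if medium is Medium.DVD:
        return LyricsSource.RECORDED_TEXT
    if medium is Medium.CD and has_vocal_and_karaoke_pair:
        return LyricsSource.EMBODIMENT_3
    # A CD without a vocal/karaoke pair, or an MD.
    return LyricsSource.EMBODIMENT_2
```

In this sketch an MD always falls through to the embodiment (2) path, matching the description that step S42 leads directly to step S49 for an MD.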

【０１２３】以上説明したように、実施の形態（４）に
係る音楽データ処理装置４０によれば、選択手段４３ａ
によって、記録媒体１００の種類又は該記録媒体１００
における記録内容に応じ、再生する音楽データに含まれ
る曲の歌詞の文字情報を得るために最適な処理が自動的
に行われる。したがって、記録媒体の種類１００や記録
内容にかかわらず、再生する音楽データに含まれる曲の
正確な歌詞情報を高い確率で出力させることができる。[0123] As described above, in the music data processing device 40 according to embodiment (4), the selecting means 43a causes the processing best suited to obtaining the character information of the lyrics of the song contained in the music data to be reproduced to be performed automatically, according to the type of the recording medium 100 or the content recorded on it. Therefore, regardless of the type of the recording medium 100 or its recorded content, accurate lyrics information for the song contained in the music data to be reproduced can be output with high probability.

【０１２４】また、音楽データ処理装置４０では、記録
されている曲の歌詞の文字情報が予め記録されているＤ
ＶＤ等の記録媒体１００の音楽データを再生する際に
も、前記第１の読み取り手段、第２の音声合成手段４９
及び第２のスピーカ４８によって、この音楽データに含
まれる曲の歌詞情報を音声出力させることができる。し
たがって、実施の形態（４）に係る音楽データ処理装置
４０によれば、記録媒体１００の種類や記録媒体１００
の記録内容にかかわらず、再生する音楽データに含まれ
る曲の歌詞情報を音声出力させることができる音楽デー
タ処理装置を提供することができる。[0124] In addition, even when reproducing music data from a recording medium 100 such as a DVD, on which the character information of the lyrics of the recorded songs is recorded in advance, the music data processing device 40 can output the lyrics information of the song contained in that music data as audio by means of the first reading means, the second voice synthesis means 49, and the second speaker 48. Therefore, embodiment (4) provides a music data processing device 40 that can output, as audio, the lyrics information of the song contained in the music data to be reproduced, regardless of the type of the recording medium 100 or its recorded content.

【０１２５】なお、実施の形態（４）に係る音楽データ
処理装置４０では、記録媒体１００の種類又は該記録媒
体１００における記録内容に応じて、再生する音楽デー
タに含まれる曲の歌詞の文字情報を得るための手段とし
て、前記第１の読み取り手段と、バンドパスフィルタ手
段２２及び音声認識手段１４を用いた上記した実施の形
態（２）に係る処理手段と同様の処理手段と、ボーカル
情報抽出手段３５及び音声認識手段１４を用いた上記し
た実施の形態（３）に係る処理手段と同様の処理手段と
を装備している場合を例に挙げて説明したが、本発明は
この例に限定されるものではない。[0125] The music data processing device 40 according to embodiment (4) has been described, by way of example, as being equipped with the following means for obtaining the character information of the lyrics of the song contained in the music data to be reproduced, used according to the type of the recording medium 100 or the content recorded on it: the first reading means; processing means similar to that of embodiment (2), using the band-pass filter means 22 and the voice recognition means 14; and processing means similar to that of embodiment (3), using the vocal information extraction means 35 and the voice recognition means 14. However, the present invention is not limited to this example.

【０１２６】例えば、別の実施の形態に係る音楽データ
処理装置では、前記文字情報を得る手段として、前記第
１の読み取り手段に加えて、音声認識手段のみを用いた
上記した実施の形態（１）に係る処理手段と同様の処理
手段と、上記した実施の形態（２）に係る処理手段と同
様の処理手段と、上記した実施の形態（３）に係る処理
手段と同様の処理手段とのうちの一つ、又は上記した実
施の形態（４）に係る処理手段と同様の処理手段との組
み合わせのような二つ以上を装備したものとすることが
可能である。[0126] For example, a music data processing device according to another embodiment may be equipped, as the means for obtaining the character information, with the first reading means plus one of the following: processing means similar to that of embodiment (1), which uses only the voice recognition means; processing means similar to that of embodiment (2); and processing means similar to that of embodiment (3). Alternatively, it may be equipped with two or more of them, as in the combination used in embodiment (4).

【０１２７】次に、本発明の実施の形態（５）に係る音
楽データ処理装置を説明する。図１８は実施の形態
（５）に係る音楽データ処理装置の概略構成を示すブロ
ック図である。この実施の形態（５）に係る音楽データ
処理装置５０が、上記した実施の形態（１）に係る音楽
データ処理装置１と相違するところは、第４のＣＰＵ５
１、第４のＤＳＰ５２、第３のディスプレイ５３、第３
のスピーカ５４及び第３の操作部５５の構成にある。[0127] Next, a music data processing device according to embodiment (5) of the present invention will be described. FIG. 18 is a block diagram showing the schematic configuration of the music data processing device according to embodiment (5). The music data processing device 50 according to embodiment (5) differs from the music data processing device 1 according to embodiment (1) in the configuration of the fourth CPU 51, the fourth DSP 52, the third display 53, the third speaker 54, and the third operation unit 55.

【０１２８】すなわち、音楽データ処理装置５０の第４
のＣＰＵ５１は、上記した実施の形態（１）における第
１のＣＰＵ８とほぼ同様の制御手段に加えて、文字情報
修正手段と、第１の記憶制御手段と、第２の記憶制御手
段と、情報読み取り手段と、文字情報選択設定手段とを
含んで構成されている。前記文字情報修正手段は、図１
９のブロック図にも示した第４のＤＳＰ５２の音声認識
手段１４により得られた文字情報を、ユーザの指示に従
い修正（あるいは変更）するものである。ここでは、文
字情報修正手段は後述するごとく、音声認識手段１４か
ら第２のメモリ９に出力されて記憶された文字情報につ
いて、ユーザの指示に従い修正を行うものとなってい
る。[0128] Specifically, the fourth CPU 51 of the music data processing device 50 comprises, in addition to control means substantially the same as those of the first CPU 8 in embodiment (1), character information correction means, first storage control means, second storage control means, information reading means, and character information selection setting means. The character information correction means corrects (or changes), in accordance with the user's instructions, the character information obtained by the voice recognition means 14 of the fourth DSP 52, which is also shown in the block diagram of FIG. 19. Here, as described later, the character information correction means corrects, in accordance with the user's instructions, the character information that has been output from the voice recognition means 14 to the second memory 9 and stored there.

【０１２９】また、前記第１の記憶制御手段は、音声認
識手段１４により得られた歌詞の文字情報を記憶させて
おく旨の指示がユーザからなされた場合に、その得られ
た歌詞の文字情報を、曲名とともに第２のメモリ９に記
憶させる制御を行うように構成されている。また前記第
２の記憶制御手段は、図２０の説明図に示すように、前
記文字情報修正手段により修正された文字情報を、該文
字情報に対応する曲名とともに第２のメモリ９に記憶保
存させる制御を行うものである。このことから、第２の
メモリ９は、音声認識手段１４により得られた文字情報
を記憶するものであるとともに、前記文字情報修正手段
により修正された文字情報をも記憶する記憶手段を兼ね
たものとなっている。[0129] The first storage control means performs control so that, when the user instructs that the character information of the lyrics obtained by the voice recognition means 14 be stored, that character information is stored in the second memory 9 together with the song title. The second storage control means, as shown in the explanatory diagram of FIG. 20, performs control so that the character information corrected by the character information correction means is stored in the second memory 9 together with the corresponding song title. The second memory 9 thus serves as storage means both for the character information obtained by the voice recognition means 14 and for the character information corrected by the character information correction means.

【０１３０】前記情報読み取り手段は、図２０に示すご
とく、第２のメモリ９に記憶されている文字情報を第２
のメモリ９から読み取る本発明における第２の読み取り
手段を構成している。本実施の形態（５）では、後述す
るごとく、ユーザの指示に従い文字情報選択設定手段に
より第２のメモリ９に記憶されている文字情報を利用す
る選択設定がなされた場合に、第２のメモリ９から文字
情報を読み取るようになっている。[0130] As shown in FIG. 20, the information reading means constitutes the second reading means of the present invention, which reads the character information stored in the second memory 9 from the second memory 9. In embodiment (5), as described later, it reads the character information from the second memory 9 when the character information selection setting means has been set, in accordance with the user's instruction, to use the character information stored in the second memory 9.

【０１３１】前記文字情報選択設定手段は、第２のメモ
リ９に保存されている文字情報を利用するか否かのユー
ザによる選択を可能にするものである。この文字情報選
択設定手段によって、第２のメモリ９に保存されている
文字情報に基づく歌詞情報を、ユーザが利用できるよう
になっている。[0131] The character information selection setting means lets the user select whether or not to use the character information stored in the second memory 9. Through it, the user can make use of lyrics information based on the character information stored in the second memory 9.

【０１３２】本実施の形態（５）における第４のＤＳＰ
５２は、上記した実施の形態（１）における第１のＤＳ
Ｐ６とは、この第１のＤＳＰ６における第１の音声合成
手段１５に替えて、図１９に示すごとく第３の音声合成
手段５６が装備されている点で相違している。第３の音
声合成手段５６は、音声認識手段１４から送られてくる
文字情報に基づき、第１の音楽データに含まれる曲の歌
詞情報を音声合成して歌詞の音声情報化を図る第１の音
声合成部を含むものであるとともに、前記情報読み取り
手段により第２のメモリ９から読み取られた文字情報に
基づいて、再生する音楽データに含まれる曲の歌詞情報
を音声合成して音声情報化を図る第３の音声合成部を含
むものとなっている。[0132] The fourth DSP 52 in embodiment (5) differs from the first DSP 6 in embodiment (1) in that, as shown in FIG. 19, it is equipped with third voice synthesis means 56 in place of the first voice synthesis means 15 of the first DSP 6. The third voice synthesis means 56 includes a first voice synthesis unit, which synthesizes the lyrics information of the song contained in the first music data into voice information based on the character information sent from the voice recognition means 14, and a third voice synthesis unit, which synthesizes the lyrics information of the song contained in the music data to be reproduced into voice information based on the character information read from the second memory 9 by the information reading means.

【０１３３】また本実施の形態（５）においても第３の
音声合成手段５６は、第４のＣＰＵ５１の指示にしたが
って、再生する音楽データに含まれる曲のフレーズの再
生直前に、歌詞情報が第３のスピーカ５４から合成音で
音声出力される（読み上げられる）ように、音声合成し
た歌詞の音声情報をディレイ手段１６の出力側に出力す
るようになっている。[0133] In embodiment (5) as well, the third voice synthesis means 56, following instructions from the fourth CPU 51, outputs the synthesized voice information of the lyrics to the output side of the delay means 16 so that the lyrics information is output (read aloud) in a synthesized voice from the third speaker 54 immediately before each phrase of the song contained in the music data to be reproduced is played.

【０１３４】第３のディスプレイ５３及び第３のスピー
カ５４は、音声認識手段１４において得られた文字情報
を出力する第１の出力部と、図２０に示すごとく、前記
情報読み取り手段により第２のメモリ９から読み取られ
た文字情報を出力する第３の出力部とを兼ねたものであ
る。このうち第３のディスプレイ５３は、表示ドライバ
１０を介して送られてきた文字情報の画像信号を画面表
示するようになっている。[0134] The third display 53 and the third speaker 54 serve both as a first output unit, which outputs the character information obtained by the voice recognition means 14, and as a third output unit, which, as shown in FIG. 20, outputs the character information read from the second memory 9 by the information reading means. Of these, the third display 53 displays on screen the image signal of the character information sent via the display driver 10.

【０１３５】また第３のスピーカ５４は、音声認識手段
１４において認識された文字情報を基に、第３の音声合
成手段５６によって音声合成された歌詞の音声情報を音
声出力するようになっており、第３の音声合成手段５６
の第１の音声合成部によって音声合成された歌詞の音声
情報を再生する第１の音声出力手段と、第３の音声合成
部によって音声合成された歌詞の音声情報を再生する第
３の音声出力手段とを兼ねたものとなっている。[0135] The third speaker 54 outputs, as audio, the voice information of the lyrics synthesized by the third voice synthesis means 56 based on the character information recognized by the voice recognition means 14. It serves both as first voice output means, which reproduces the voice information of the lyrics synthesized by the first voice synthesis unit of the third voice synthesis means 56, and as third voice output means, which reproduces the voice information of the lyrics synthesized by the third voice synthesis unit.

【０１３６】第３の操作部５５は、例えばスイッチ、ボ
タン又はキー、タッチパネル等のユーザが手動入力する
ための手段や、マイク等のユーザが音声入力するための
手段で構成されたものである。本実施の形態（５）にお
いては、手動入力するための手段として、例えば実施の
形態（１）における第１の操作部７に設けられた各スイ
ッチと、実施の形態（２）における第１の操作部２５に
設けられたスイッチとを装備したものとなっている。ま
たこれらのスイッチにより第４のＣＰＵ５１に与える指
示を、音声入力手段からの音声入力によっても行えるよ
うに構成されている。また、第２のメモリ９に記憶され
ている文字情報の修正、つまり歌詞情報の修正（変更）
をユーザが要求するための歌詞修正モードスイッチや、
第２のメモリ９に記憶されている文字情報を利用する又
は利用しないを、ユーザが選択して指示するためのメモ
リ情報利用スイッチ等も装備している。[0136] The third operation unit 55 comprises means for manual input by the user, such as switches, buttons or keys, and a touch panel, and means for voice input by the user, such as a microphone. In embodiment (5), the manual input means include, for example, the switches provided on the first operation unit 7 in embodiment (1) and the switch provided on the first operation unit 25 in embodiment (2). The instructions given to the fourth CPU 51 by these switches can also be given by voice input through the voice input means. The third operation unit 55 is further equipped with a lyrics correction mode switch, with which the user requests correction of the character information stored in the second memory 9, that is, correction (change) of the lyrics information, and a memory information use switch, with which the user selects whether or not to use the character information stored in the second memory 9.

【０１３７】次に、上記のように構成された音楽データ
処理装置５０において、第２のメモリ９に記憶された文
字情報を修正する際の第４のＣＰＵ５１が行う動作を、
図２１に示すフローチャートを用いて説明する。[0137] Next, the operation performed by the fourth CPU 51 when the character information stored in the second memory 9 is corrected in the music data processing device 50 configured as described above will be described with reference to the flowchart shown in FIG. 21.

【０１３８】ユーザにより歌詞修正モードスイッチがオ
ンされると、ステップＳ５１において、第２のメモリ９
に記憶されている歌詞の文字情報を読み取って、第３の
ディスプレイ５３に表示させる。この際、第３のディス
プレイ５３においては、例えば図２２に示すように第２
のメモリ９に文字情報が記憶されている曲名と、この曲
に対応する文字情報に基づく歌詞とのリストが画面表示
される。[0138] When the user turns on the lyrics correction mode switch, in step S51 the character information of the lyrics stored in the second memory 9 is read and displayed on the third display 53. At this time, the third display 53 shows, for example as in FIG. 22, a list of the titles of the songs whose character information is stored in the second memory 9 together with the lyrics based on the corresponding character information.

【０１３９】続いてステップＳ５２において、例えば
「修正（変更）したい曲を選択して下さい。」といった
ようなユーザに対するメッセージを、第３のディスプレ
イ５３又は第３のスピーカ５４から出力させるための制
御を行う。次に、上記メッセージの出力後、ユーザが第
３の操作部５５を操作することによって修正希望の歌詞
を指定する指示信号を入力したことを判断すると、入力
された指示信号に従い、図２２に示すように第３のディ
スプレイ５３に出力されているカーソル１０２をユーザ
が指定した歌詞まで移動させる。[0139] Next, in step S52, control is performed to output a message to the user, such as "Please select the song you want to correct (change).", from the third display 53 or the third speaker 54. After the message is output, when it is determined that the user has operated the third operation unit 55 to input an instruction signal designating the lyrics to be corrected, the cursor 102 shown on the third display 53 is moved, in accordance with that instruction signal, to the lyrics designated by the user, as shown in FIG. 22.

【０１４０】続いてステップＳ５３において、ユーザが
第３の操作部５５を操作することによって修正歌詞情報
を入力すると、カーソル１０２位置の歌詞を入力された
歌詞に修正（変更）する制御を行う。なお、図２２で
は、出力された歌詞情報の１文字分がカーソル１０２で
指定されて修正される例が表示されているが、例えば図
２３（ａ）に示すように、１度に２文字以上が下線やカ
ーソル等で指定され、その指定された部分が図２３
（ｂ）に示すように１度に修正されるように構成されて
いてもよい。[0140] Next, in step S53, when the user operates the third operation unit 55 to input corrected lyrics information, control is performed to correct (change) the lyrics at the position of the cursor 102 to the input lyrics. FIG. 22 shows an example in which a single character of the output lyrics information is designated by the cursor 102 and corrected. However, the device may instead be configured so that two or more characters are designated at once by an underline, a cursor, or the like, as shown in FIG. 23(a), and the designated portion is corrected in one operation, as shown in FIG. 23(b).

【０１４１】次いで、ユーザが第３の操作部５５を操作
することによって、歌詞の修正を終了する指示信号を入
力したか否かを判断し（ステップＳ５４）、該指示信号
が入力されていないと判断するとステップＳ５３に戻っ
て、ユーザから入力された指示信号に基づく歌詞の修正
制御を続ける。またステップＳ５４において、修正を終
了する指示信号が入力されたと判断すると、ステップＳ
５５に進み、修正する前の文字情報に対応する曲の歌詞
として修正後の文字情報を第２のメモリ９に記憶させ
る。また、その際には、ユーザに対する修正完了メッセ
ージを、第３のディスプレイ５３又は第３のスピーカ５
４に出力させる制御を行う。[0141] Next, it is determined whether the user has operated the third operation unit 55 to input an instruction signal to end the lyrics correction (step S54). If no such signal has been input, the process returns to step S53 and correction of the lyrics based on the instruction signals input by the user continues. If it is determined in step S54 that an instruction signal to end the correction has been input, the process proceeds to step S55, where the corrected character information is stored in the second memory 9 as the lyrics of the song to which the character information before correction corresponded. At that time, control is also performed to output a correction-complete message to the user on the third display 53 or the third speaker 54.
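
The correction flow of steps S53 to S55 can be sketched as follows. This is a minimal illustration under stated assumptions, not the device's implementation: a Python dict stands in for the second memory 9, the function name `correct_lyrics` is hypothetical, and each user correction is modeled as a `(start, end, replacement)` triple covering the cursor or underlined range.

```python
def correct_lyrics(memory, title, edits):
    """Apply user corrections to the stored lyrics of one song.

    memory: dict mapping song title -> lyrics text (stand-in for the second memory 9).
    edits:  iterable of (start, end, replacement); each replaces lyrics[start:end],
            so one or several characters can be corrected in a single edit.
    Returns the corrected lyrics after storing them under the same title.
    """
    lyrics = memory[title]
    for start, end, replacement in edits:
        # Step S53, repeated until the user ends the correction (step S54).
        lyrics = lyrics[:start] + replacement + lyrics[end:]
    # Step S55: store the corrected text as the lyrics of the same song.
    memory[title] = lyrics
    return lyrics
```

Storing the result under the original title reflects the description that the corrected character information replaces the lyrics of the song to which the uncorrected text corresponded.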

【０１４２】以上説明した制御動作によって、たとえ音
楽データの最初の再生時に音声認識手段１４で誤った歌
詞の文字情報が得られても、その文字情報をユーザが正
しい歌詞に修正することができる。また、ユーザが遊び
で、歌詞を自由に変更して替え歌を作成することができ
ることになる。[0142] With the control operation described above, even if the voice recognition means 14 obtains incorrect character information for the lyrics when the music data is first reproduced, the user can correct that character information to the right lyrics. The user can also, for fun, freely change the lyrics to create a parody version with substitute lyrics.

【０１４３】次に、実施の形態（５）に係る音楽データ
処理装置５０において、第１の音楽データとしてのボー
カル入りの曲を再生する際に第４のＣＰＵ５１が行う動
作を、図２４に示すフローチャートを用いて説明する。[0143] Next, the operation performed by the fourth CPU 51 when the music data processing device 50 according to embodiment (5) reproduces a song with vocals as the first music data will be described with reference to the flowchart shown in FIG. 24.

【０１４４】ユーザにより再生用スイッチがオンされる
と、ステップＳ６１において、第２のメモリ９に記憶さ
れている歌詞の文字情報を利用するメモリ情報利用スイ
ッチが、ユーザによってオンされたか否かを判断する。
ステップＳ６１においてメモリ情報利用スイッチがオン
されていないと判断すると、図５に示したフローチャー
トのステップＳ１に進み、図５に示したフローチャート
にしたがった制御動作を行う。[0144] When the user turns on the reproduction switch, it is determined in step S61 whether the user has turned on the memory information use switch, which causes the character information of the lyrics stored in the second memory 9 to be used. If it is determined in step S61 that the memory information use switch has not been turned on, the process proceeds to step S1 of the flowchart shown in FIG. 5, and the control operation follows that flowchart.

【０１４５】ただし、本実施の形態（５）においては、
前記第１の記憶制御手段を備えていることにより、図５
に示したフローチャートにおけるステップＳ６とステッ
プＳ７との間、ステップＳ１３とステップＳ１４との間
にそれぞれ、図２５に示したステップＳ７１の判断動
作、すなわち音声認識手段１４により認識された文字情
報を記憶させておく旨の指示がユーザによりなされたか
否かの判断動作を行う。そして、ステップＳ７１におい
て、文字情報を記憶させておく旨の指示がなされている
と判断すると、ステップＳ７（ステップＳ１３からステ
ップＳ７１に進んだ場合にはステップＳ１４）に進んで
文字情報を第２のメモリ９に記憶させ、文字情報を記憶
させておく旨の指示がなされていないと判断すると、ス
テップＳ７（ステップＳ１４）の動作を行わずにステッ
プＳ８（ステップＳ１５）に進む。[0145] In embodiment (5), however, because the first storage control means is provided, the determination of step S71 shown in FIG. 25, namely whether the user has instructed that the character information recognized by the voice recognition means 14 be stored, is performed between steps S6 and S7 and between steps S13 and S14 of the flowchart shown in FIG. 5. If it is determined in step S71 that such an instruction has been given, the process proceeds to step S7 (or to step S14 if step S71 was reached from step S13) and the character information is stored in the second memory 9. If it is determined that no such instruction has been given, the process skips step S7 (step S14) and proceeds to step S8 (step S15).
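
The decision added at step S71 amounts to a guarded store, sketched below. The function name `maybe_store_lyrics` and the use of a dict for the second memory 9 are assumptions made for illustration only.

```python
def maybe_store_lyrics(memory, title, recognized_text, store_requested):
    """Step S71: store the recognized lyrics only if the user asked for it.

    memory: dict mapping song title -> lyrics text (stand-in for the second memory 9).
    Returns True if the text was stored, False if storage was skipped.
    """
    if store_requested:
        # Corresponds to step S7 (or step S14).
        memory[title] = recognized_text
        return True
    # Storage is skipped and processing continues at step S8 (or step S15).
    return False
```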

【０１４６】図２４のステップＳ６１において、メモリ
情報利用スイッチがオンされていると判断すると、次い
で、ステップＳ６２に進んで、第２のメモリ９に記憶さ
れている歌詞の文字情報を読み出して、第３のディスプ
レイ５３に表示させる。この際、第３のディスプレイ５
３においては、例えば図２６に示すように第２のメモリ
９に文字情報が記憶されている曲名と、この曲に対応す
る文字情報に基づく歌詞の内容が画面表示される。次に
ステップＳ６３に進んで、第３のディスプレイ５３に表
示させた歌詞の中から１つの歌詞をユーザに選択させる
べく、例えば「利用する歌詞情報を選択して下さい」と
いったようなユーザに対するメッセージを、第３のディ
スプレイ５３又は第３のスピーカ５４に出力させるため
の制御を行う。[0146] If it is determined in step S61 of FIG. 24 that the memory information use switch has been turned on, the process proceeds to step S62, where the character information of the lyrics stored in the second memory 9 is read out and displayed on the third display 53. At this time, the third display 53 shows, for example as in FIG. 26, the titles of the songs whose character information is stored in the second memory 9 and the lyrics based on the corresponding character information. The process then proceeds to step S63, where, so that the user selects one set of lyrics from those shown on the third display 53, control is performed to output a message to the user, such as "Please select the lyrics information to use", on the third display 53 or the third speaker 54.

【０１４７】次に、上記メッセージの出力後、ユーザが
第３の操作部５５を操作することによって歌詞情報を指
定する指示信号が入力されると、第４のＣＰＵ５１は、
入力された指示信号に従い、例えば図２６に示すよう
に、第３のディスプレイ５３に出力されている１つの曲
名及びその歌詞情報を覆うカーソル１０３をユーザが指
定した歌詞情報まで移動させる。次いで図２４のステッ
プＳ６４に示すように、例えば「指定した歌詞情報でＯ
Ｋ？」といったようなユーザに対するメッセージを、第
３のディスプレイ５３又は第３のスピーカ５４に出力さ
せるための制御を行う。[0147] After the message is output, when the user operates the third operation unit 55 to input an instruction signal designating lyrics information, the fourth CPU 51 moves the cursor 103, which covers one song title and its lyrics information on the third display 53, to the lyrics information designated by the user in accordance with that instruction signal, as shown for example in FIG. 26. Then, as shown in step S64 of FIG. 24, control is performed to output a message to the user, such as "Is the designated lyrics information OK?", on the third display 53 or the third speaker 54.

【０１４８】そして、ステップＳ６５において、指定し
た歌詞でＯＫである旨の信号がユーザから入力されてい
るか否かの判断を行い、指定した歌詞でＯＫである旨の
信号が入力されていると判断すると、ステップＳ６６に
示すように、指定した歌詞に対応する曲の音楽データを
再生させつつ、ユーザが指定した歌詞を第２のディスプ
レイ４７や第２のスピーカ４８に出力させる制御を行
う。なお、その際は、音楽データの再生により演奏され
る曲のフレーズに合わせて歌詞が画面表示され、また１
フレーズ分の音楽データの再生直前に歌詞が合成音で読
み上げられるように制御を行う。またステップＳ６５に
おいて、指定した歌詞でＯＫである旨の信号が入力され
なかったと判断すると、ステップＳ６１に戻って、ステ
ップＳ６１の判断を再び行う。[0148] Then, in step S65, it is determined whether the user has input a signal confirming the designated lyrics. If such a signal has been input, then, as shown in step S66, control is performed to reproduce the music data of the song corresponding to the designated lyrics while outputting the lyrics designated by the user to the second display 47 and/or the second speaker 48. In doing so, control is performed so that the lyrics are displayed on screen in step with the phrases of the song played by reproducing the music data, and so that the lyrics are read aloud in a synthesized voice immediately before each phrase of music data is reproduced. If it is determined in step S65 that no confirming signal was input, the process returns to step S61 and the determination of step S61 is made again.
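
The decisions of steps S61 to S66 can be condensed into the following sketch. It is a simplification under stated assumptions: `choose_stored_lyrics` is a hypothetical name, a dict stands in for the second memory 9, and returning `None` covers both the fall-back to the FIG. 5 flow (switch off) and the return to step S61 (no confirmation).

```python
def choose_stored_lyrics(memory, use_memory_switch_on, selected_title, confirmed):
    """Decide whether stored lyrics are output alongside playback.

    memory: dict mapping song title -> lyrics text (stand-in for the second memory 9).
    Returns (title, lyrics) when stored lyrics should be used, otherwise None.
    """
    if not use_memory_switch_on:
        # Step S61: switch off, so the FIG. 5 flow would be followed instead.
        return None
    if selected_title not in memory:
        # No lyrics are stored under the selected title.
        return None
    if not confirmed:
        # Step S65: no confirmation, so the flow would return to step S61.
        return None
    # Step S66: output these lyrics while the song is reproduced.
    return selected_title, memory[selected_title]
```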

【０１４９】以上説明したように、実施の形態（５）に
係る音楽データ処理装置５０によれば、第４のＣＰＵ５
１が文字情報修正手段を装備しているので、たとえ音楽
データの最初の再生時に音声認識手段１４で誤った歌詞
の文字情報が得られても、その文字情報をユーザが正し
い歌詞に修正することができる。また前記第１の記憶制
御手段により、音声認識手段１４において認識された文
字情報を第２のメモリ９に記憶させておくことができる
とともに、前記第２の記憶制御手段を備えていることに
より、前記文字情報修正手段によって修正された文字情
報も第２のメモリ９に記憶させておくことができる。[0149] As described above, in the music data processing device 50 according to embodiment (5), the fourth CPU 51 is equipped with the character information correction means, so even if the voice recognition means 14 obtains incorrect character information for the lyrics when the music data is first reproduced, the user can correct that character information to the right lyrics. In addition, the first storage control means allows the character information recognized by the voice recognition means 14 to be stored in the second memory 9, and the second storage control means allows the character information corrected by the character information correction means to be stored in the second memory 9 as well.

【０１５０】また、前記第１の記憶制御手段により、ユ
ーザが希望する文字情報のみを第２のメモリ９に記憶さ
せるとができるため、不要な文字情報が第２のメモリ９
に記憶されて文字情報を記憶させたいときにメモリ容量
が不足しまっているというような事態の発生を回避する
ことができる。よって、第２のメモリ９を有効に活用す
ることができる。Further, since the first storage control means allows only the character information desired by the user to be stored in the second memory 9, it is possible to avoid a situation in which unnecessary character information has been stored in the second memory 9 and the memory capacity is insufficient when the user wants to store character information. Therefore, the second memory 9 can be used effectively.
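
As a rough illustration of the two storage paths described above (recognized character information stored only when the user wants it, and corrected character information stored afterwards), the following sketch models the second memory 9 as a small capacity-limited store. The class name `LyricMemory`, its method names, and the idea of counting capacity in songs are assumptions made for this example and are not taken from the specification.

```python
class LyricMemory:
    """Hypothetical model of a memory holding lyric character information per song."""

    def __init__(self, capacity):
        self.capacity = capacity  # maximum number of songs (assumed unit)
        self.entries = {}

    def store_recognized(self, song_id, text, user_wants_to_keep):
        # Corresponds loosely to the first storage control means:
        # store recognized text only if the user asked for it.
        if not user_wants_to_keep:
            return False
        return self._store(song_id, text)

    def store_corrected(self, song_id, corrected_text):
        # Corresponds loosely to the second storage control means:
        # store the text after the user has corrected it.
        return self._store(song_id, corrected_text)

    def _store(self, song_id, text):
        # Overwriting an existing song never needs extra capacity.
        if song_id not in self.entries and len(self.entries) >= self.capacity:
            return False
        self.entries[song_id] = text
        return True

    def read(self, song_id):
        # Returns None when nothing is stored for the song.
        return self.entries.get(song_id)
```

Skipping unwanted entries in `store_recognized` is what keeps capacity free for the songs the user actually cares about, which is the benefit the paragraph describes.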

【０１５１】さらに、第４のＣＰＵ５１が前記情報読み
取り手段を備えており、また第３のディスプレイ５３及
び第３のスピーカ５４が装備されていることにより、第
２のメモリ９に文字情報が記憶されている音楽データを
再び再生させる際には、第２のメモリ９から文字情報を
読み出すことによって歌詞情報を出力させることもでき
る。しかも、第３の音声合成手段５６の前記第３の音声
合成部により、第２のメモリ９から読み出された文字情
報を音声合成して歌詞の音声情報化を図ることができ、
この音声情報を第３のスピーカ５４から音声出力させる
ことができる。従って、第２のメモリ９に文字情報が記
憶されている音楽データを再び再生させる際において、
歌詞情報が出力されるまでの時間を短縮することができ
るとともに、正確な歌詞情報を出力させることができ
る。また、ユーザが文字情報を自由に替えられることに
より、ユーザ自身が作成した歌詞も替え歌として音声出
力できる。Further, since the fourth CPU 51 has the information reading means and the third display 53 and the third speaker 54 are provided, when music data whose character information is stored in the second memory 9 is reproduced again, the lyrics information can be output by reading the character information from the second memory 9. Moreover, the third voice synthesis unit of the third voice synthesis means 56 can synthesize speech from the character information read from the second memory 9, converting the lyrics into audio information, and this audio information can be output from the third speaker 54. Therefore, when music data whose character information is stored in the second memory 9 is reproduced again, the time until the lyrics information is output can be shortened, and accurate lyrics information can be output. Further, since the user can freely change the character information, lyrics written by the user can also be output as audio, as a version of the song with altered lyrics.

【０１５２】また、前記文字情報選択設定手段によっ
て、第２のメモリ９に文字情報が記憶されている音楽デ
ータを再び再生させる際には、第２のメモリ９に記憶さ
れている文字情報に基づく歌詞情報を出力させるか否か
をユーザが自由に選択することができる。よって、常に
ユーザが出力させたい歌詞情報を出力させることがで
き、ユーザを満足させる娯楽性の高い音楽データ処理装
置５０を実現することができる。Further, with the character information selection setting means, when music data whose character information is stored in the second memory 9 is reproduced again, the user can freely select whether or not to output lyrics information based on the character information stored in the second memory 9. Therefore, the lyrics information that the user wants can always be output, and a highly entertaining music data processing device 50 that satisfies the user can be realized.

【０１５３】なお、実施の形態（５）に係る音楽データ
処理装置５０では、上記した実施の形態（１）に係る音
楽データ処理装置１の第１のＣＰＵ７とほぼ同様の制御
手段に加えて、前記文字情報修正手段と、前記第１の記
憶制御手段と、前記第２の記憶制御手段と、前記情報読
み取り手段と、前記文字情報選択設定手段とを含む第４
のＣＰＵ５１が装備された例を説明したが、本発明はこ
の例に限定されるものではない。For the music data processing device 50 according to embodiment (5), an example has been described that is equipped with the fourth CPU 51, which includes, in addition to control means substantially the same as those of the first CPU 7 of the music data processing device 1 according to embodiment (1) described above, the character information correcting means, the first storage control means, the second storage control means, the information reading means, and the character information selection setting means. However, the present invention is not limited to this example.

【０１５４】例えば別の実施の形態に係る音楽データ処
理装置では、上記した実施の形態（２）に係る音楽デー
タ処理装置２０における第１のＣＰＵ２４とほぼ同様の
制御手段に加えて、前記文字情報修正手段と、前記第１
の記憶制御手段と、前記第２の記憶制御手段と、前記情
報読み取り手段と、前記文字情報選択設定手段とを含む
ＣＰＵが装備された構成であってもよい。For example, a music data processing device according to another embodiment may be equipped with a CPU that includes, in addition to control means substantially the same as those of the first CPU 24 in the music data processing device 20 according to embodiment (2) described above, the character information correcting means, the first storage control means, the second storage control means, the information reading means, and the character information selection setting means.

【０１５５】さらに別の実施の形態に係る音楽データ処
理装置では、上記した実施の形態（３）に係る音楽デー
タ処理装置３０における第２のＣＰＵ３３とほぼ同様の
制御手段に加えて、前記文字情報修正手段と、前記第１
の記憶制御手段と、前記第２の記憶制御手段と、前記情
報読み取り手段と、前記文字情報選択設定手段とを含む
ＣＰＵが装備された構成とすることも可能である。A music data processing device according to still another embodiment may be equipped with a CPU that includes, in addition to control means substantially the same as those of the second CPU 33 in the music data processing device 30 according to embodiment (3) described above, the character information correcting means, the first storage control means, the second storage control means, the information reading means, and the character information selection setting means.

【０１５６】これら別の実施の形態に係る音楽データ処
理装置においても、ユーザにより再生用スイッチがオン
されると、まず前記ＣＰＵは、図２４に示したステップ
Ｓ６１の判断、すなわちメモリに記憶されている歌詞の
文字情報を利用するメモリ情報利用スイッチが、ユーザ
によってオンされているか否かを判断する。ステップＳ
６１においてメモリ情報利用スイッチがオンされている
と判断すると、図２４に示したステップＳ６２以降の動
作を行う。また同図のステップＳ６１において、メモリ
情報利用スイッチがユーザによってオンされていないと
判断すると、上記した実施の形態（２）において説明し
た第１の音楽データ再生時の動作（図５及び図８参
照）、又は上記した実施の形態（３）において説明した
音楽データ再生時の動作（図５及び図１２参照）を行
う。In the music data processing devices according to these other embodiments as well, when the user turns on the reproduction switch, the CPU first makes the determination of step S61 shown in FIG. 24, that is, determines whether the memory information use switch, which causes the lyric character information stored in the memory to be used, has been turned on by the user. If it is determined in step S61 that the memory information use switch is on, the operations from step S62 onward shown in FIG. 24 are performed. If it is determined in step S61 that the memory information use switch has not been turned on by the user, the CPU performs the operation for reproducing the first music data described in embodiment (2) above (see FIGS. 5 and 8), or the operation for reproducing music data described in embodiment (3) above (see FIGS. 5 and 12).
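
The branch at step S61 can be summarized in a short sketch: if the memory information use switch is on, the stored character information is used; otherwise the device falls back to the recognition-based reproduction flow of the earlier embodiments. The function name `on_play`, the dictionary `stored_lyrics`, and the `recognize` callable are hypothetical stand-ins, and the fallback when the switch is on but nothing is stored for the song is an assumption of this sketch that the paragraph above does not address.

```python
def on_play(song_id, memory_switch_on, stored_lyrics, recognize):
    """Return (source, lyrics) for a song about to be reproduced.

    memory_switch_on models the result of the S61 determination.
    stored_lyrics is a dict standing in for the memory (hypothetical).
    recognize is a callable standing in for the recognition-based flow (hypothetical).
    """
    # S61 is "yes": continue with the stored character information (S62 onward).
    if memory_switch_on and song_id in stored_lyrics:
        return "memory", stored_lyrics[song_id]
    # S61 is "no" (or, as an assumption, nothing is stored): use recognition.
    return "recognition", recognize(song_id)
```

Reading from memory avoids waiting for recognition to produce text, which matches the shortened time to lyric output claimed for these embodiments.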

【０１５７】また、上記した実施の形態（２）において
説明した第１の音楽データ再生時の動作、上記した実施
の形態（３）において説明した音楽データ再生時の動作
のいずれにおいても、上記した実施の形態（５）の場合
と同様、図５に示したステップＳ６とステップＳ７との
間（図５に示したステップＳ１３とステップＳ１４との
間）に図２５に示したフローチャートにおけるステップ
Ｓ７１で示した判断動作を加えると、ユーザが希望する
文字情報のみをメモリに記憶させることができ、メモリ
を有効活用することができる。Further, in both the operation for reproducing the first music data described in embodiment (2) above and the operation for reproducing music data described in embodiment (3) above, adding the determination shown as step S71 in the flowchart of FIG. 25 between step S6 and step S7 shown in FIG. 5 (between step S13 and step S14 shown in FIG. 5), as in embodiment (5) above, allows only the character information desired by the user to be stored in the memory, so that the memory can be used effectively.

【０１５８】また、さらに別の実施の形態に係る音楽デ
ータ処理装置では、上記した実施の形態（４）に係る音
楽データ処理装置４０の第３のＣＰＵ４３とほぼ同様の
制御手段に加えて、前記文字情報修正手段と、前記第１
の記憶制御手段と、前記第２の記憶制御手段と、前記情
報読み取り手段と、前記文字情報選択設定手段とを含む
ＣＰＵが装備された構成であってもよい。Further, a music data processing device according to yet another embodiment may be equipped with a CPU that includes, in addition to control means substantially the same as those of the third CPU 43 of the music data processing device 40 according to embodiment (4) described above, the character information correcting means, the first storage control means, the second storage control means, the information reading means, and the character information selection setting means.

【０１５９】この別の実施の形態に係る音楽データ処理
装置では、ユーザにより再生用スイッチがオンされ、さ
らに歌詞画面表示スイッチあるいはカラオケ先生モード
スイッチの少なくとも一方がオンされると、まず前記Ｃ
ＰＵは、上記した実施の形態（４）における第３のＣＰ
Ｕ４３が行う図１７に示したステップＳ４１の動作、す
なわち音楽データを再生する記録媒体を調査する動作を
行う。その後、図２４に示したステップＳ６１の判断、
すなわちメモリに記憶されている歌詞の文字情報を利用
するメモリ情報利用スイッチが、ユーザによってオンさ
れているか否かの判断を行う。そして、ステップＳ６１
においてメモリ情報利用スイッチがオンされていると判
断すると、図２４に示したステップＳ６２以降の動作を
行う。また同図のステップＳ６１において、メモリ情報
利用スイッチがユーザによってオンされていないと判断
すると、図１７に示したステップＳ４２以降の動作を行
う。In the music data processing device according to this other embodiment, when the user turns on the reproduction switch and further turns on at least one of the lyrics screen display switch and the karaoke teacher mode switch, the CPU first performs the operation of step S41 shown in FIG. 17, which the third CPU 43 performs in embodiment (4) above, that is, the operation of checking the recording medium from which the music data is to be reproduced. It then makes the determination of step S61 shown in FIG. 24, that is, determines whether the memory information use switch, which causes the lyric character information stored in the memory to be used, has been turned on by the user. If it is determined in step S61 that the memory information use switch is on, the operations from step S62 onward shown in FIG. 24 are performed. If it is determined in step S61 that the memory information use switch has not been turned on by the user, the operations from step S42 onward shown in FIG. 17 are performed.

【０１６０】上記した３つの別の実施の形態に係る音楽
データ処理装置のいずれにおいても、記録媒体の種類等
に関係なく、文字情報が記憶されている音楽データを再
び再生させる際において、歌詞情報が出力されるまでの
時間を短縮することができるとともに、正確な歌詞情報
を出力させることができる。従って、より簡単にカラオ
ケを楽しむことができ、しかも容易に替え歌を楽しめる
非常に娯楽性の高い音楽データ処理装置を提供すること
ができる。In any of the music data processing devices according to the three other embodiments described above, regardless of the type of recording medium and the like, when music data for which character information is stored is reproduced again, the time until the lyrics information is output can be shortened, and accurate lyrics information can be output. Therefore, it is possible to provide a highly entertaining music data processing device with which the user can enjoy karaoke more easily and can also easily enjoy songs with altered lyrics.

[Brief description of the drawings]

【図１】本発明の実施の形態（１）に係る音楽データ処
理装置の概略構成の一例を示すブロック図である。FIG. 1 is a block diagram showing an example of a schematic configuration of a music data processing device according to Embodiment (1) of the present invention.

【図２】実施の形態（１）に係る音楽データ処理装置に
おいて、ショックプルーフ手段により先読みされた音楽
データの処理を説明するための図である。FIG. 2 is a diagram for explaining processing of music data pre-read by shock proof means in the music data processing device according to the embodiment (1).

【図３】実施の形態（１）に係る音楽データ処理装置の
第１のＤＳＰの概略構成を示すブロック図である。FIG. 3 is a block diagram showing a schematic configuration of a first DSP of the music data processing device according to the embodiment (1).

【図４】実施の形態（１）に係る音楽データ処理装置に
おける音声認識手段で得られた文字情報の音声合成処理
を説明するための図である。FIG. 4 is a diagram for explaining a speech synthesis process of character information obtained by speech recognition means in the music data processing device according to the embodiment (1).

【図５】実施の形態（１）に係る音楽データ処理装置に
おいて、第１の音楽データとしてのボーカル入りの曲を
再生する際の第１のＣＰＵの行う動作を示すフローチャ
ートである。FIG. 5 is a flowchart showing an operation performed by a first CPU when playing back a song with vocals as first music data in the music data processing device according to the embodiment (1).

【図６】本発明の実施の形態（２）に係る音楽データ処
理装置における第１のＤＳＰの概略構成を示すブロック
図である。FIG. 6 is a block diagram showing a schematic configuration of a first DSP in a music data processing device according to Embodiment (2) of the present invention.

【図７】実施の形態（２）に係る音楽データ処理装置に
おけるバンドパスフィルタによるフィルタ処理を説明す
るための図である。FIG. 7 is a diagram for describing filter processing by a band-pass filter in the music data processing device according to the embodiment (2).

【図８】実施の形態（２）に係る音楽データ処理装置に
おいて、第１の音楽データとしてのボーカル入りの曲を
再生する際の第１のＣＰＵの行う動作の一部を示したフ
ローチャートである。FIG. 8 is a flowchart showing a part of the operation performed by the first CPU when reproducing a song with vocals as first music data in the music data processing device according to the embodiment (2).

【図９】本発明の実施の形態（３）に係る音楽データ処
理装置の概略構成を示すブロック図である。FIG. 9 is a block diagram showing a schematic configuration of a music data processing device according to Embodiment (3) of the present invention.

【図１０】実施の形態（３）に係る音楽データ処理装置
における第２のＤＳＰの概略構成を示すブロック図であ
る。FIG. 10 is a block diagram showing a schematic configuration of a second DSP in the music data processing device according to the embodiment (3).

【図１１】実施の形態（３）に係る音楽データ処理装置
におけるボーカル情報抽出手段によるボーカル情報抽出
処理を説明するための図である。FIG. 11 is a diagram for explaining vocal information extraction processing by vocal information extraction means in the music data processing device according to the embodiment (3).

【図１２】実施の形態（３）に係る音楽データ処理装置
において、ボーカル入りの曲及びカラオケ用の曲が収録
されている記録媒体から第１の音楽データとしてのボー
カル入りの曲を再生する際の第２のＣＰＵの行う動作の
一部を示したフローチャートである。FIG. 12 is a flowchart showing a part of the operation performed by the second CPU when the music data processing device according to the embodiment (3) reproduces a song with vocals as first music data from a recording medium on which a song with vocals and a karaoke song are recorded.

【図１３】本発明の実施の形態（４）に係る音楽データ
処理装置の概略構成を示すブロック図である。FIG. 13 is a block diagram showing a schematic configuration of a music data processing device according to Embodiment (4) of the present invention.

【図１４】実施の形態（４）に係る音楽データ処理装置
における第３のＤＳＰ周辺の概略構成を示すブロック図
である。FIG. 14 is a block diagram showing a schematic configuration around a third DSP in the music data processing device according to the embodiment (4).

【図１５】実施の形態（４）に係る音楽データ処理装置
における第１の読み取り手段で得られた文字情報の音声
合成処理を説明するための図である。FIG. 15 is a diagram for describing a speech synthesis process of character information obtained by the first reading unit in the music data processing device according to the embodiment (4).

【図１６】実施の形態（４）に係る音楽データ処理装置
における第３のＣＰＵを構成する選択手段による記録媒
体の選択処理例を説明するための図である。FIG. 16 is a diagram for describing an example of a recording medium selection process by a selection unit forming a third CPU in the music data processing device according to the embodiment (4).

【図１７】実施の形態（４）に係る音楽データ処理装置
において、再生する音楽データに含まれた曲の歌詞を表
示する際の第３のＣＰＵの行う動作を示すフローチャー
トである。FIG. 17 is a flowchart showing an operation performed by a third CPU when displaying lyrics of a song included in music data to be reproduced in the music data processing device according to the embodiment (4).

【図１８】本発明の実施の形態（５）に係る音楽データ
処理装置の概略構成を示すブロック図である。FIG. 18 is a block diagram showing a schematic configuration of a music data processing device according to Embodiment (5) of the present invention.

【図１９】実施の形態（５）に係る音楽データ処理装置
における第４のＤＳＰの概略構成を示すブロック図であ
る。FIG. 19 is a block diagram showing a schematic configuration of a fourth DSP in the music data processing device according to Embodiment (5).

【図２０】実施の形態（５）に係る音楽データ処理装置
における第２の記憶制御手段の動作と情報読み取り手段
の動作を説明するための図である。FIG. 20 is a diagram for explaining the operation of the second storage control means and the operation of the information reading means in the music data processing device according to the embodiment (5).

【図２１】実施の形態（５）に係る音楽データ処理装置
における第２のメモリに記憶された文字情報を修正する
際の第４のＣＰＵの動作を示すフローチャートである。FIG. 21 is a flowchart showing an operation of a fourth CPU when correcting the character information stored in the second memory in the music data processing device according to the embodiment (5).

【図２２】実施の形態（５）に係る音楽データ処理装置
において、歌詞修正モード時に第２のメモリから読み取
られた文字情報の画面表示の一例を示す図である。FIG. 22 is a diagram showing an example of a screen display of character information read from the second memory in the lyrics correction mode in the music data processing device according to the embodiment (5).

【図２３】実施の形態（５）に係る音楽データ処理装置
において、歌詞修正モード時における歌詞修正の一例を
示す図であり、（ａ）は修正前、（ｂ）は修正後であ
る。FIG. 23 is a diagram showing an example of lyrics correction in the lyrics correction mode in the music data processing device according to the embodiment (5), where (a) is before correction and (b) is after correction.

【図２４】実施の形態（５）に係る音楽データ処理装置
において、第１の音楽データの再生に際し、第２のメモ
リに記憶されている文字情報を用いる場合の第４のＣＰ
Ｕの行う動作を示すフローチャートである。FIG. 24 is a flowchart showing the operation performed by the fourth CPU when the character information stored in the second memory is used in reproducing the first music data in the music data processing device according to the embodiment (5).

【図２５】実施の形態（５）に係る音楽データ処理装置
において、音楽データの再生に際し、第２のメモリに記
憶されている文字情報を用いない場合の第４のＣＰＵの
行う動作の一部を示すフローチャートである。FIG. 25 is a flowchart showing a part of the operation performed by the fourth CPU when the character information stored in the second memory is not used in reproducing music data in the music data processing device according to the embodiment (5).

【図２６】実施の形態（５）に係る音楽データ処理装置
において、音楽データの再生時に第２のメモリから読み
取られた文字情報の画面表示の一例を示す図である。FIG. 26 is a diagram showing an example of a screen display of character information read from a second memory when playing back music data in the music data processing device according to the embodiment (5).

[Explanation of symbols]

１、２０、３０、４０、５０　音楽データ処理装置
２　第１の読み取り部
９　第２のメモリ
１１　第１のディスプレイ
１３　第１のスピーカ
１４　音声認識手段
１５　第１の音声合成手段
２２　バンドパスフィルタ
３１　第２の読み取り部
３５　ボーカル情報抽出手段
４１　第３の読み取り部
４３　第３のＣＰＵ
４３ａ　選択手段
４７　第２のディスプレイ
４８　第２のスピーカ
４９　第２の音声合成手段
５１　第４のＣＰＵ
５３　第３のディスプレイ
５４　第３のスピーカ
５６　第３の音声合成手段

1, 20, 30, 40, 50 Music data processing device
2 First reading unit
9 Second memory
11 First display
13 First speaker
14 Voice recognition means
15 First voice synthesis means
22 Bandpass filter
31 Second reading unit
35 Vocal information extraction means
41 Third reading unit
43 Third CPU
43a Selection means
47 Second display
48 Second speaker
49 Second voice synthesis means
51 Fourth CPU
53 Third display
54 Third speaker
56 Third voice synthesis means

Continued from the front page
(51) Int.Cl.⁷, identification symbol, FI, theme code (reference): G11B 20/10 321 G10L 3/00 E 5D378 27/34 551G 9A001
F-terms (reference):
5D015 AA01 KK03 LL05
5D044 AB05 BC02 CC04 FG24 FG30
5D045 AA20 AB30
5D077 AA26 BA04 BA08 BB16 HA07 HC17 HC18
5D108 BA04 BA16 BA32 BA35 BA39 BB03 BC02 BC12 BD02 BD12 BD14 BE03
5D378 KK44 MM24 MM34 MM37 MM47 MM49 MM52 MM59 MM64 MM65 MM66 MM73 MM92 MM95 MM97 TT08 TT24 WW16
9A001 HH17 KK43 KK45

Claims

[Claims]

1. A music data processing device for reading and reproducing music data from a recording medium on which the music data is recorded, comprising: shock proof means for intermittently reading the music data from the recording medium at a speed higher than the normal reading speed for reproduction when the music data is reproduced; voice recognition means for performing voice recognition on vocal information contained in the music data read by the shock proof means to obtain character information; and first output means for outputting the character information obtained by the voice recognition means.

2. The music data processing device according to claim 1, further comprising first pre-reading means for intermittently reading, when first music data including vocal information and accompaniment information is reproduced, the first music data from the recording medium at a speed higher than the normal reading speed for reproduction.

3. The music data processing device according to claim 2, wherein filter means for extracting only information in the frequency band of the vocal information contained in the first music data read by the first pre-reading means is interposed between the shock proof means and the voice recognition means.

4. The music data processing device according to any one of claims 1 to 3, wherein, when the recording medium is one on which first music data including vocal information and accompaniment information, and second music data including only accompaniment information related to the first music data, are recorded as music data, the shock proof means includes second pre-reading means for intermittently reading the second music data from the recording medium at a speed higher than the normal reading speed for reproduction when the first music data or the second music data is reproduced, and vocal information extraction means, which obtains the difference between the first music data read by the first pre-reading means and the second music data read by the second pre-reading means and extracts only the vocal information contained in the first music data, is interposed between the shock proof means and the voice recognition means.

5. The music data processing device according to any one of claims 1 to 4, comprising: first reading means for reading, when the recording medium is one on which the first music data and character information of the lyrics of the song contained in the first music data are recorded, the character information of the song corresponding to the first music data from the recording medium at the time of reproduction; and selection means for selecting the means by which the character information of the song contained in the music data is acquired, wherein the first output means includes a first output unit that outputs the character information read by the first reading means.

6. The music data processing device according to any one of claims 1 to 5, comprising: storage means for storing the character information recognized by the voice recognition means; second reading means for reading the character information stored in the storage means; character information correcting means for correcting the character information read by the second reading means in accordance with an instruction from the user; and storage control means for storing the character information corrected by the character information correcting means in the storage means, wherein the first output means includes an output unit that outputs the character information read from the storage means by the second reading means.

7. The music data processing device according to claim 6, comprising character information selection setting means for enabling the user to select whether or not to use the character information stored in the storage means when music data is reproduced, wherein the second reading means reads the character information from the storage means when the user has selected, through the character information selection setting means, to use the character information stored in the storage means.

8. The music data processing apparatus according to claim 1, wherein said first output means includes a screen display means for displaying said character information on a screen.

9. The music data processing device according to claim 1, comprising voice synthesis means including a first voice synthesis unit for synthesizing speech of the lyrics of the song contained in the music data to be reproduced, based on the character information recognized by the voice recognition means, wherein the first output means includes first voice output means for outputting, as audio, the synthesized speech of the lyrics produced by the first voice synthesis unit.

10. The music data processing device according to claim 9, wherein, when the recording medium is one on which first music data including vocal information and accompaniment information, and character information of the lyrics of the song related to the first music data, are recorded as music data, the device comprises: first reading means for reading the character information of the lyrics corresponding to the first music data from the recording medium when the first music data is reproduced; and a second voice synthesis unit for synthesizing speech of the lyrics of the song related to the music data to be reproduced, based on the character information read by the first reading means, and wherein the first output means includes second voice output means for outputting, as audio, the synthesized speech of the lyrics produced by the second voice synthesis unit.

11. The music data processing device according to claim 9, comprising: storage means for storing the character information recognized by the voice recognition means; second reading means for reading the character information stored in the storage means; character information correcting means for correcting the character information read by the second reading means in accordance with an instruction from the user; storage control means for storing the character information corrected by the character information correcting means in the storage means; and a third voice synthesis unit for synthesizing speech of the lyrics of the song contained in the music data to be reproduced, based on the character information read by the second reading means, wherein the first output means includes third voice output means for outputting, as audio, the synthesized speech of the lyrics produced by the third voice synthesis unit.