JP2008122888A

JP2008122888A - Karaoke machine

Info

Publication number: JP2008122888A
Application number: JP2006336357A
Authority: JP
Inventors: Naotsuyo Nishikawa; 直剛西川; Osamu Soga; 修曽我; Takamasa Kazafuji; 孝誠風藤; Yoshiaki Kawagishi; 義明川岸
Original assignee: KOTO KK
Current assignee: KOTO KK
Priority date: 2006-11-14
Filing date: 2006-11-14
Publication date: 2008-05-29

Abstract

<P>PROBLEM TO BE SOLVED: To provide a karaoke machine capable of changing parameters for generating an image object according to various parameters and displaying the image object generated as the result of changing parameters. <P>SOLUTION: According to one or more of "average level", " peak level", "frequency component of the largest energy" and "energy of a specified frequency component", which are obtained by analyzing "classification of a musical instrument", "tone height", "tone intensity", " position of sound image" and "data obtained by analog-to-digital conversion of user's voice input from a microphone" included in "karaoke accompaniment music score data stored in a memory", one or more parameters of "classification", "level", "color", "shape", "coordinates" of the image object are dynamically changed. <P>COPYRIGHT: (C)2008,JPO&INPIT

Description

この発明はカラオケ装置に関し、特にカラオケ伴奏ならびにユーザが歌った音に応じてテレビ画面の映像を変化させるカラオケ装置に関する。 The present invention relates to a karaoke apparatus, and more particularly, to a karaoke apparatus that changes an image on a television screen according to a karaoke accompaniment and a sound sung by a user.

この種の従来技術が特許文献１に開示されている。This type of prior art is disclosed in Patent Document 1.

特許文献１の従来技術は、歌唱者をカメラで撮像し、その映像をマイクの音声出力信号に応じて変化させるものである。
特開平６−２５０６８３［Ｇ１０Ｋ１５／０４］ The prior art of Patent Document 1 captures a singer with a camera and changes the video in accordance with a sound output signal of a microphone.
JP-A-6-250683 [G10K 15/04]

上記の従来技術は、人の顔を撮影した映像を変化させるものであったので、画像表現に限界があった。Since the above-described conventional technique is to change an image obtained by photographing a human face, there is a limit to image expression.

それゆえに、この発明の主たる目的は、カラオケ伴奏楽譜データ、ユーザの音声、ユーザの楽器演奏に含まれる様々なパラメータに応じて、画像オブジェクトを生成するためのパラメータを変化させ、その結果生成された画像オブジェクトをカラオケの背景画像として表示可能なカラオケ装置を提供することである。また、拡張性のある、画像表現豊かなカラオケ装置を提供することである。Therefore, the main object of the present invention is to change the parameters for generating the image object according to various parameters included in the karaoke accompaniment score data, the user's voice, and the user's musical instrument performance, and the result is generated as a result. A karaoke apparatus capable of displaying an image object as a background image of karaoke is provided. Another object of the present invention is to provide an expandable karaoke apparatus rich in image expression.

請求項１の発明は、家庭用テレビ装置などに映像信号ならびに音声信号を出力可能なカラオケ装置であって、カラオケ装置に内蔵、または着脱可能なマイクロフォン、カラオケ装置に内蔵、または着脱可能なメモリ、および二次元および／または三次元で表現される画像オブジェクトを生成し、１または複数の画像オブジェクトから構成される画像をカラオケの背景画像として生成し、かつカラオケの歌詞を示す歌詞画像と合成し、映像信号として出力する画像処理手段を含み、画像処理手段は、「メモリに格納されたカラオケ伴奏楽譜データ」に含まれる「楽器の種類」、「音の高さ（ピッチ）」、「音の持続時間」、「発音の強さ（ベロシティ）」、「音像の位置（パンポット）」、ならびに「マイクロフォンから入力されたユーザの音声をアナログ−デジタル変換して得られたデータ」を解析して得られる「平均レベル」、「ピークレベル」、「最もエネルギーの大きな周波数成分」、「特定の周波数成分のエネルギー」のうちの１または複数に応じて、画像オブジェクトの「種類」、「大きさ」、「色」、「形状」、「座標」の１または複数のパラメータを動的に変化させることを特徴とするカラオケ装置である。The invention of claim 1 is a karaoke apparatus capable of outputting a video signal and an audio signal to a home television apparatus or the like, a microphone built in or detachable in the karaoke apparatus, a memory built in or detachable in the karaoke apparatus, And an image object expressed in two dimensions and / or three dimensions, an image composed of one or more image objects is generated as a background image of karaoke, and synthesized with a lyrics image indicating the lyrics of karaoke, Image processing means for outputting as a video signal, the image processing means including “musical instrument type”, “sound pitch (pitch)”, “sound duration” included in “karaoke accompaniment score data stored in memory” "Time", "Strength of pronunciation (velocity)", "Position of sound image (panpot)", and "User's sound input from microphone" 1 of “average level”, “peak level”, “frequency component with the largest energy”, or “energy of a specific frequency component” obtained by analyzing “data obtained by analog-to-digital conversion” The karaoke apparatus is characterized in that one or more parameters of “type”, “size”, “color”, “shape”, and “coordinate” of an image object are dynamically changed according to a plurality.

請求項１の発明によれば、カラオケ伴奏楽譜データならびにユーザの音声に同期した映像がカラオケの背景画像として生成されるので、ユーザは、カラオケを楽しむ際に音楽と映像の一体感を感じることができ、これによりカラオケの新たな楽しみを提供できる。また、カラオケの曲ごとに背景画像を用意する必要が無く、カラオケのコンテンツ制作のための労力を少なくすることができる。According to the first aspect of the present invention, since the karaoke accompaniment score data and the video synchronized with the user's voice are generated as the background image of karaoke, the user may feel a sense of unity between music and video when enjoying karaoke. This can provide a new karaoke experience. Also, it is not necessary to prepare a background image for each karaoke song, and the labor for producing karaoke content can be reduced.

請求項２の発明は、カラオケ装置が、さらに電子楽器を接続可能な接続ポートを備え、
「接続ポートからＭＩＤＩまたはＭＩＤＩに類似の形式で入力される楽器演奏データ」に含まれる「楽器の種類」、「音の高さ（ピッチ）」、「音の持続時間」、「発音の強さ（ベロシティ）」、「音像の位置（パンポット）」のうちの１または複数のパラメータに応じて、画像オブジェクトの「種類」、「大きさ」、「色」、「形状」、「座標」の１または複数のパラメータを動的に変化させることを特徴とするカラオケ装置である。In the invention of claim 2, the karaoke apparatus further includes a connection port to which an electronic musical instrument can be connected,
"Musical instrument performance data input from the connection port in a format similar to MIDI or MIDI""Musical instrument type", "Pitch", "Sound duration", "Sounding intensity" (Velocity), “Position of sound image (Panpot)”, and “Type”, “Size”, “Color”, “Shape”, “Coordinate” of the image object according to one or more parameters A karaoke apparatus characterized by dynamically changing one or more parameters.

請求項２の発明によれば、さらにユーザの楽器演奏に同期した映像がカラオケの背景画像として生成されるので、ユーザは、カラオケを楽しむ際に楽器演奏と映像の一体感を感じることができ、これによりカラオケの新たな楽しみを提供できる。According to the invention of claim 2, since the video synchronized with the user's musical instrument performance is generated as the background image of the karaoke, the user can feel a sense of unity between the musical instrument performance and the video when enjoying the karaoke, This can provide a new enjoyment of karaoke.

この発明によれば、カラオケの新たな楽しみを提供することが可能になる。According to the present invention, it is possible to provide new enjoyment of karaoke.

この発明の上述の目的，その他の目的，特徴および利点は、図面を参照して行う以下の実施例の詳細な説明から一層明らかとなろう。The above object, other objects, features and advantages of the present invention will become more apparent from the following detailed description of embodiments with reference to the drawings.

図１を参照して、この発明の実施例であるカラオケ装置１のカラオケ装置構成図を説明する。
図１は、カラオケ装置１ならびにカラオケ装置１に接続される機器を含むシステムの構成図です。カラオケ装置１は、家庭用テレビ３に接続されます。家庭用テレビ３の画面に歌詞字幕画像ならびに背景画像が表示され、テレビのスピーカーからカラオケの伴奏音楽ならびにマイクロフォン１１から入力されたユーザの音声が再生されます。また、カラオケ装置１は、マイクロフォン１１、キースイッチ１３、メモリカートリッジコネクタ１２、拡張ポート１７等を備えており、ハウジングの形状は、例えばマイクロフォン型をしている。
以下に各部の機能について記します。With reference to FIG. 1, the karaoke apparatus block diagram of the karaoke apparatus 1 which is an Example of this invention is demonstrated.
FIG. 1 is a configuration diagram of a system including a karaoke device 1 and devices connected to the karaoke device 1. The karaoke device 1 is connected to a home TV 3. The lyrics subtitle image and the background image are displayed on the screen of the home TV 3, and the accompaniment music of karaoke and the user's voice input from the microphone 11 are played from the TV speaker. The karaoke apparatus 1 includes a microphone 11, a key switch 13, a memory cartridge connector 12, an expansion port 17, and the like, and the shape of the housing is, for example, a microphone type.
The function of each part is described below.

マイクロフォン１１
ユーザの声をアナログ音声信号に変換します。Microphone 11
The user's voice is converted into an analog voice signal.

キースイッチ１３
カラオケ装置１を操作するための複数のキーから構成されます。本実施例では、「Ｓｌｏｗキー」、「Ｆａｓｔキー」、「ｂ（フラット）キー」、「＃（シャープ）キー」、「＋（プラス）キー」、「−（マイナス）キー」、「キャンセルキー」、「決定キー」の８つのキーより構成されます。以下に各キーの機能を示します。
・Ｓｌｏｗキー：再生中の曲のテンポを遅くします。１回押すごとにテンポが一段階遅くなります。但し、既に最も遅いテンポで曲が再生されている場合には、このキーを押しても何も起こりません。
・Ｆａｓｔキー：再生中の曲のテンポを速くします。１回押すごとにテンポが一段階速くなります。但し、既に最も早いテンポで曲が再生されている場合には、このキーを押しても何も起こりません。
・♭キー：再生中の曲のピッチ（音の高さ）を低くします。１回押すごとにピッチが一段階低くなります。但し、既に最も低いピッチで曲が再生されている場合には、このキーを押しても何も起こりません。
・＃キー：再生中の曲のピッチを高くします。１回押すごとにピッチが一段階高くなります。但し、既に最も高いピッチで曲が再生されている場合には、このキーを押しても何も起こりません。
・＋キー：マイクロフォン１１から入力されたユーザの音声のボリューム（音量）を大きくします。１回押すごとにボリュームが一段階大きくなります。但し、既に最も大きいボリュームで音声が再生されている場合には、このキーを押しても何も起こりません。
・−キー：マイクロフォン１１から入力されたユーザの音声のボリューム（音量）を小さくします。１回押すごとにボリュームが一段階大きくなります。但し、既に最も小さいボリュームで音声が再生されている場合には、このキーを押しても何も起こりません。
・キャンセルキー：メニュー画面、曲選択画面においては、選択中の項目の解除を行ったり、一段階上位のメニューに移動したりするために用いられます。曲の再生中にこのキーが押されると、曲の再生が中断され、曲選択画面に戻ります。
・決定キー：メニュー画面、曲選択画面において、選択中の項目の決定を行うために用いられます。
「♭（フラット）キー」、「＃（シャープ）キー」、「＋（プラス）キー」、「−（マイナス）キー」の４つのキーは、それぞれ左、右、上、下を示す矢印のように十字に配置され、メニュー画面、曲選択画面において、選択中の項目を上下左右に移動させる目的にも用いられます。Key switch 13
It consists of a number of keys for operating the karaoke machine 1. In this embodiment, “Slow key”, “Fast key”, “b (flat) key”, “# (sharp) key”, “+ (plus) key”, “− (minus) key”, “cancel key” ”And“ Determine key ”. The function of each key is shown below.
• Slow key: Decreases the tempo of the song being played. Each time you press the key, the tempo is lowered by one step. However, if the song is already playing at the slowest tempo, nothing happens when you press this key.
・ Fast key: Increases the tempo of the song being played. Each press will increase the tempo by one step. However, if the song is already playing at the fastest tempo, nothing happens when you press this key.
• ♭ key: Decreases the pitch (pitch) of the song being played. Each time you press it, the pitch is lowered by one step. However, if the song is already playing at the lowest pitch, nothing will happen even if you press this key.
• # key: Increases the pitch of the song being played. Each press increases the pitch by one step. However, if the song is already playing at the highest pitch, nothing will happen even if you press this key.
• + key: Increases the volume of the user's voice input from the microphone 11. Each press increases the volume by one level. However, if the sound is already being played at the highest volume, nothing will happen even if this key is pressed.
-Key: Decreases the volume of the user's voice input from the microphone 11. Each press increases the volume by one level. However, if the sound is already being played at the smallest volume, nothing will happen even if this key is pressed.
・ Cancel key: Used to cancel the currently selected item or move to the menu one level higher on the menu screen or song selection screen. If this key is pressed during song playback, song playback is interrupted and the song selection screen reappears.
・ Determination key: Used to determine the currently selected item on the menu screen and song selection screen.
The four keys “♭ (flat) key”, “# (sharp) key”, “+ (plus) key”, and “-(minus) key” are arrows indicating left, right, up, and down, respectively. It is also used for the purpose of moving the selected item up, down, left and right on the menu screen and song selection screen.

メモリカートリッジ２
デコード前動画データ、デコード前音声データ、楽譜データ、画像オブジェクトデータ、ＣＰＵ１０２が実行するプログラムコード、ＤＳＰ１０３が実行するプログラムコード等をＮＡＮＤ型フラッシュメモリまたはマスクＲＯＭなどに格納しています。Memory cartridge 2
Video data before decoding, audio data before decoding, musical score data, image object data, program code executed by CPU 102, program code executed by DSP 103, etc. are stored in NAND flash memory or mask ROM.

メモリカートリッジコネクタ１２
メモリカートリッジ２を接続するためのコネクタです。Memory cartridge connector 12
A connector for connecting the memory cartridge 2.

拡張ポート１７
電子楽器等４を接続するためのコネクタです。ＵＡＲＴをベースにしています。Expansion port 17
A connector for connecting electronic musical instruments. Based on UART.

外部マイクロフォン５
複数のユーザがカラオケを楽しむ場合、装置本体のマイクロフォン１１から音声を入力するユーザとは別のユーザの声をアナログ音声信号に変換し、出力します。External microphone 5
When multiple users enjoy karaoke, the voice of a user other than the user who inputs voice from the microphone 11 of the device is converted into an analog voice signal and output.

外部マイクロフォン入力端子１８
外部マイクロフォン５からアナログ音声信号を入力するための端子です。External microphone input terminal 18
A terminal for inputting analog audio signals from the external microphone 5.

映像出力端子１４
カラオケ装置１が生成したコンポジットビデオ信号を家庭用テレビ３に出力するための端子です。Video output terminal 14
This terminal is used to output the composite video signal generated by the karaoke machine 1 to the home TV 3.

音声出力端子１５，１６
カラオケ装置１が生成したアナログオーディオ信号を家庭用テレビ３に出力するための端子です。左チャンネルの信号を出力するための音声出力端子（左）１５と、右チャンネルの信号を出力するための音声出力端子（右）１６から構成されます。Audio output terminals 15, 16
This terminal is used to output the analog audio signal generated by the karaoke machine 1 to the home TV 3. It consists of an audio output terminal (left) 15 for outputting the left channel signal and an audio output terminal (right) 16 for outputting the right channel signal.

次に、図２を参照して、本発明のカラオケ装置１の映像出力端子１４から出力される画像信号によって、テレビ画面に描写される画面イメージの説明をする。Next, with reference to FIG. 2, the screen image drawn on the television screen by the image signal output from the video output terminal 14 of the karaoke apparatus 1 of the present invention will be described.

図２の（Ａ）
カラオケの背景画像として、動画を表示する場合の画面イメージです。動画データはＭＰＥＧ−４やＨ．２６４（ＭＰＥＧ−４／ＡＶＣ）等の形式で圧縮され、メモリカートリッジ２に格納されています。デコード（解凍）された動画データは、後述するディスプレイコントローラ１０９が扱うビデオスクリーンに表示されます。カラオケの字幕画像は、後述するグラフィックスプロセッサ１０７がフレームバッファに描画し、ディスプレイコントローラ１０９がフレームバッファに格納された画像をビデオスクリーンにオーバーレイして表示します。(A) of FIG.
This is a screen image for displaying a movie as a background image for karaoke. The moving image data is MPEG-4 or H.264. It is compressed in a format such as H.264 (MPEG-4 / AVC) and stored in the memory cartridge 2. The decoded (decompressed) video data is displayed on the video screen handled by the display controller 109 described later. The subtitle image of karaoke is drawn in the frame buffer by the graphics processor 107 described later, and the display controller 109 displays the image stored in the frame buffer as an overlay on the video screen.

図２の（Ｂ）
カラオケの背景画像として、伴奏ならびにユーザの声に応じてリアルタイムに変化する３ＤＣＧ（三次元コンピュータグラフィックス）で描画された画像オブジェクトを表示する場合の画面イメージです。この例では、伴奏に合わせて「人」を表す画像オブジェクトが踊り、マイクロフォン１１から入力されたユーザの声に合わせて画像オブジェクトが歌っているかのごとく口を動かします。画像オブジェクトのモデルデータやアニメーションデータはメモリカートリッジ２に格納されています。画像オブジェクトはグラフィックスプロセッサ１０７によって字幕画像とともにフレームバッファに描画されます。ディスプレイコントローラ１０９はフレームバッファに格納された画像を表示します。FIG. 2 (B)
This is a screen image for displaying an image object drawn in 3DCG (3D computer graphics) that changes in real time according to the accompaniment and the user's voice as the background image of karaoke. In this example, an image object representing “people” dances along with the accompaniment, and the mouth moves as if the image object is singing along with the voice of the user input from the microphone 11. Image object model data and animation data are stored in the memory cartridge 2. The image object is drawn in the frame buffer along with the subtitle image by the graphics processor 107. The display controller 109 displays the image stored in the frame buffer.

次に図３を参照して、本発明のカラオケ装置１の電気的な構成を示すブロック図を説明する。（図１で説明済みの部分については一部説明を省略します。）Next, with reference to FIG. 3, the block diagram which shows the electrical structure of the karaoke apparatus 1 of this invention is demonstrated. (Some of the parts already explained in Fig. 1 will be omitted.)

入力段アンプ２１
マイクロフォン１１から入力されたアナログ音声信号の振幅を増幅してＡＤＣ付きマイコン２２に出力します。Input stage amplifier 21
The amplitude of the analog audio signal input from the microphone 11 is amplified and output to the microcomputer 22 with ADC.

ＡＤＣ付きマイコン２２
２つのアナログ入力端子ならびにＡＤＣ（アナログ−デジタル変換器）を備えるマイコン（マイクロコントローラ）です。入力段アンプ２１ならびに外部マイクロフォン入力端子１８から入力されたアナログ音声信号をデジタル音声データに変換し、マルチメディアプロセッサＳｏＣに出力します。マルチメディアプロセッサＳｏＣ１００の第１のＵＡＲＴポート（ＵＡＲＴ＃０）に接続されます。Microcomputer 22 with ADC
A microcomputer (microcontroller) with two analog input terminals and ADC (analog-digital converter). The analog audio signal input from the input stage amplifier 21 and the external microphone input terminal 18 is converted into digital audio data and output to the multimedia processor SoC. It is connected to the first UART port (UART # 0) of the multimedia processor SoC100.

キースイッチ１３
マルチメディアプロセッサＳｏＣの第１のＧＰＩＯ（ＧｅｎｅｒａｌＰｕｒｐｏｓｅＩ／Ｏ）セット（ＧＰＩＯ＃０１１６）に接続され、各キーが押下されているか否かのステータスがマルチメディアプロセッサＳｏＣ１００に入力されます。Key switch 13
It is connected to the first GPIO (General Purpose I / O) set (GPIO # 0 116) of the multimedia processor SoC, and the status of whether each key is pressed is input to the multimedia processor SoC100.

拡張コネクタ
マルチメディアプロセッサＳｏＣ１００の第２のＵＡＲＴポート（ＵＡＲＴ＃１）に接続されます。電子楽器から入力された演奏情報をシリアル通信でマルチメディアプロセッサＳｏＣ１００に伝達します。この拡張コネクタの仕様用途は、メモリカートリッジ２に格納されたソフトウェアに応じて様々に変化します。電子楽器以外にも、カラオケ装置１本体の検査器などを接続することも可能です。Expansion connector Connected to the second UART port (UART # 1) of multimedia processor SoC100. Performance information input from an electronic musical instrument is transmitted to the multimedia processor SoC100 via serial communication. The specifications of this expansion connector vary depending on the software stored in the memory cartridge 2. In addition to electronic musical instruments, it is also possible to connect a tester of the karaoke device 1 itself.

電源回路２９
乾電池２８より＋３．３Ｖ、＋１．２Ｖ等の複数の直流電源電圧を生成し、装置内の各部に供給します。Power circuit 29
Multiple DC power supply voltages such as + 3.3V and + 1.2V are generated from dry cell 28 and supplied to each part in the equipment.

マルチメディアプロセッサＳｏＣ（ＳｙｓｔｅｍｏｎＣｈｉｐ）１００
カラオケ装置１の電源が投入される、あるいはカラオケ装置１のシステムリセットが発生すると、先ずブートＲＯＭ２３に格納されているプログラムコードを実行し、メモリカートリッジ２からプログラムコードならびにデータをメインメモリ２４にロードし、ロードされたプログラムコードの実行を開始します。プログラムコードの実行に従い、ＡＤＣ付きマイコン２２から入力されたデジタル音声データ、キースイッチ１３のステータス、電子楽器等４からの入力データに応じて、家庭用テレビ３に出力する画像、音声を生成します。生成された画像はコンポジットビデオ信号に変換され、映像出力端子１４を通じて出力されます。音声はＩ２Ｓフォーマットのデジタル音声信号としてオーディオＤＡＣ２６に出力されます。Multimedia processor SoC (System on Chip) 100
When the power of the karaoke device 1 is turned on or a system reset of the karaoke device 1 occurs, the program code stored in the boot ROM 23 is first executed, and the program code and data are loaded from the memory cartridge 2 into the main memory 24. , Start execution of the loaded program code. According to the execution of the program code, it generates images and sounds to be output to the home TV 3 according to the digital sound data input from the microcomputer 22 with ADC, the status of the key switch 13, and the input data from the electronic musical instrument 4 . The generated image is converted into a composite video signal and output through the video output terminal 14. The audio is output to the audio DAC 26 as a digital audio signal in I2S format.

ブートＲＯＭ２３
カラオケ装置１の電源が投入される、あるいはカラオケ装置１のシステムリセットが発生した際に、マルチメディアプロセッサＳｏＣ１００内のＣＰＵ１０２が実行するプログラムコードを格納しています。また、カラオケ装置１起動時に表示されるタイトル画面の画像データ、タイトル画面表示時に再生される音楽データも格納しています。Boot ROM 23
Stores the program code executed by the CPU 102 in the multimedia processor SoC 100 when the power of the karaoke device 1 is turned on or when the system reset of the karaoke device 1 occurs. It also stores the title screen image data that is displayed when the karaoke machine 1 is started and the music data that is played back when the title screen is displayed.

メインメモリ２４
マルチメディアプロセッサＳｏＣ１００の主記憶装置です。８Ｍバイト〜３２Ｍバイト程度のＳＤＲＡＭで構成されます。メモリカートリッジ２からロードされたプログラムコードならびにデータを格納する領域、生成された画像を格納するフレームバッファとしての領域、３ＤＣＧにマッピングされるテクスチャデータの格納領域などに分かれています。Main memory 24
This is the main memory of the multimedia processor SoC100. It consists of 8Mbyte to 32Mbyte SDRAM. It is divided into an area for storing program code and data loaded from the memory cartridge 2, an area as a frame buffer for storing generated images, and an area for storing texture data mapped to 3DCG.

オーディオＤＡＣ２６
マルチメディアプロセッサＳｏＣから出力されるデジタル音声信号をアナログ音声信号に変換し、出力段アンプ２７に出力します。Audio DAC26
The digital audio signal output from the multimedia processor SoC is converted into an analog audio signal and output to the output stage amplifier 27.

出力段アンプ２７
オーディオＤＡＣ２６から入力されたアナログ音声信号の振幅を増幅し、音声出力端子１５，１６を通じて家庭用テレビ３に出力します。Output stage amplifier 27
The amplitude of the analog audio signal input from the audio DAC 26 is amplified and output to the home TV 3 through the audio output terminals 15 and 16.

メモリカートリッジコネクタ１２
第２のＧＰＩＯセット（ＧＰＩＯ＃１）に接続されます。ＣＰＵ１０２はＩ／Ｏアクセスによりメモリカートリッジ２にアクセスします。Memory cartridge connector 12
Connected to the second GPIO set (GPIO # 1). The CPU 102 accesses the memory cartridge 2 by I / O access.

次に、図４を参照して、マルチメディアプロセッサＳｏＣ１００の内部構成を示すブロック図を説明する。Next, a block diagram showing the internal configuration of the multimedia processor SoC 100 will be described with reference to FIG.

ＣＣＣＰ１０１（ＣｏｎｓｕｍｅｒＣｕｓｔｏｍＳｏｌｕｔｉｏｎｓＰｌａｔｆｏｒｍ）
内部の二つのバスシステムＡＨＢ１０４ならびにＡＰＢ１２０間のバスブリッジ、２セットのＵＡＲＴ、ＧＰＩＯ＃１、外部メモリとのインターフェース（ＳＤＲＡＭコントローラ含む）を含みます。CCCP101 (Consumer Custom Solutions Platform)
It includes a bus bridge between two internal bus systems AHB104 and APB120, two sets of UART, GPIO # 1, and an external memory interface (including SDRAM controller).

ＡＨＢ１０４
主にメモリアクセス等に用いられる高速な内部バスです。ＣＰＵ１０２、ＤＳＰ１０３、ＤＭＡＣ１０６、グラフィックスプロセッサ１０７、ディスプレイコントローラ１０９、サウンドプロセッサ１１０がバスマスタとして接続されています。ＣＣＣＰ１０１を通じて、メモリバス２５やＡＰＢ１２０にアクセスを行うことも可能です。AHB104
A high-speed internal bus mainly used for memory access. The CPU 102, DSP 103, DMAC 106, graphics processor 107, display controller 109, and sound processor 110 are connected as a bus master. It is also possible to access the memory bus 25 and APB120 through CCCP101.

ＡＰＢ１２０
主に制御やＩ／Ｏアクセスに用いられる低速な内部バスです。ＤＳＰ１０３、ＤＭＡＣ１０６、グラフィックスプロセッサ１０７、ディスプレイコントローラ１０９、サウンドプロセッサ１１０、ＣＡＵ１１１、パワーマネジメントユニット１１２、タイマ１１３、メールボックス１１４、ＧＰＩＯ＃０がバススレーブとして接続されています。APB120
A low-speed internal bus mainly used for control and I / O access. The DSP 103, DMAC 106, graphics processor 107, display controller 109, sound processor 110, CAU 111, power management unit 112, timer 113, mailbox 114, GPIO # 0 are connected as bus slaves.

ＣＰＵ１０２
３２ビットＲＩＳＣ型ＣＰＵです。ブートＲＯＭ２３ならびにメインメモリ２４に格納されているプログラムコードを実行し、マルチメディアプロセッサＳｏＣ１００の全体制御等を司ります。CPU102
32-bit RISC type CPU. The program code stored in the boot ROM 23 and the main memory 24 is executed, and the overall control of the multimedia processor SoC 100 is performed.

ＤＳＰ１０３（ＤｉｇｉｔａｌＳｉｇｎａｌＰｒｏｃｅｓｓｏｒ）
演算を高速に処理するためのプロセッサです。ＤＳＰローカルメモリ１０５に格納されたプログラムコードを実行し、ＤＳＰローカルメモリ１０５を作業メモリ領域として用います。ＣＰＵ１０２との通信は、ＡＰＢ１２０に接続されたメールボックス１１４を通じて行います。ＤＭＡＣ１０６を用い、メインメモリ２４とＤＳＰローカルメモリ１０５の間でＤＭＡ転送を行うことが可能です。DSP103 (Digital Signal Processor)
A processor for high-speed processing. The program code stored in the DSP local memory 105 is executed, and the DSP local memory 105 is used as a working memory area. Communication with the CPU 102 is performed through a mailbox 114 connected to the APB 120. The DMAC 106 can be used to perform DMA transfer between the main memory 24 and the DSP local memory 105.

ＤＳＰローカルメモリ１０５
ＤＳＰ１０３によって実行されるプログラムコードを格納します。また、ＤＳＰ１０３の作業メモリ領域として用いられます。ＤＳＰローカルメモリ１０５と後述するＧＰＵローカルメモリ１０８は物理的には同じメモリブロックであり、この二つのメモリがメモリブロックの領域を分割して使用します。DSP local memory 105
Stores program code executed by DSP103. It is also used as a working memory area for the DSP 103. The DSP local memory 105 and the GPU local memory 108, which will be described later, are physically the same memory block, and these two memories divide and use the memory block area.

ＤＭＡＣ１０６（ＤｉｒｅｃｔＭｅｍｏｒｙＡｃｃｅｓｓＣｏｎｔｒｏｌｌｅｒ）メインメモリ２４とＤＳＰメモリとの間、メインメモリ２４とＧＰＵローカルメモリ１０８との間、メインメモリ２４のある領域と別の領域との間でのＤＭＡ転送を行います。DMAC 106 (Direct Memory Access Controller) performs DMA transfer between the main memory 24 and the DSP memory, between the main memory 24 and the GPU local memory 108, and between one area of the main memory 24 and another area.

グラフィックスプロセッサ１０７
３ＤＣＧ生成のための幾何演算（行列演算、アフィン変換、投影変換等）、３ＤＣＧを構成するポリゴンのセットアップ、ポリゴンならびに２Ｄ画像オブジェクトのフレームバッファへの描画を司ります。Graphics processor 107
Geometric operations for 3DCG generation (matrix operations, affine transformations, projection transformations, etc.) It is responsible for setting up polygons that make up 3DCG and drawing polygons and 2D image objects to the frame buffer.

ＧＰＵローカルメモリ１０８
グラフィックスプロセッサ１０７の扱うローカルメモリです。３ＤＣＧ描画の際のＺバッファの格納等の用途に用いられます。GPU local memory 108
This is the local memory handled by the graphics processor 107. Used for Z buffer storage when 3DCG drawing.

ディスプレイコントローラ１０９
ビデオスクリーン、フレームバッファ、４面の２Ｄキャラクタスクリーンを合成して表示画面イメージを生成します。ビデオスクリーンならびにフレームバッファの画像イメージは、メインメモリ２４に格納されます。２Ｄキャラクタスクリーンは、縦８ピクセル×横８ピクセルから成るキャラクタを格子状に配置したキャラクタの２次元配列から成る画像です。各キャラクタの画像イメージならびにキャラクタの配置情報は、メインメモリ２４に格納されます。生成された表示画面イメージは、ビデオコンポジット信号波形を表すデジタルデータストリームに変換され、ビデオＤＡＣ１１５に出力されます。Display controller 109
The display screen image is generated by combining the video screen, frame buffer, and 4D 2D character screen. The video screen and frame buffer images are stored in the main memory 24. The 2D character screen is an image consisting of a two-dimensional array of characters arranged in a grid of 8 pixels by 8 pixels. The image of each character and the character placement information are stored in the main memory 24. The generated display screen image is converted to a digital data stream representing the video composite signal waveform and output to the video DAC 115.

サウンドプロセッサ１１０
メインメモリ２４にＰＣＭデータかつ／またはＡＤＰＣＭデータとして格納されている楽器音のピッチ変換を行い、最大６４音の楽器音の合成を行いカラオケの伴奏音を再生します。また、ＣＰＵ１０２またはＤＳＰ１０３によってデコードされメインメモリ２４に格納された圧縮ＰＣＭデータを再生します。再生された音声はＩ２Ｓフォーマットのデジタル音声信号として、オーディオＤＡＣ２６に出力されます。Sound processor 110
Performs pitch conversion of instrument sounds stored as PCM data and / or ADPCM data in the main memory 24, synthesizes up to 64 instrument sounds, and plays accompaniment sounds of karaoke. The compressed PCM data decoded by the CPU 102 or the DSP 103 and stored in the main memory 24 is reproduced. The reproduced audio is output to the audio DAC 26 as a digital audio signal in I2S format.

ＣＡＵ１１１（ＣｏｎｖｅｒｓｉｏｎＡｃｃｅｌｅｒａｔｉｏｎＵｎｉｔ）
浮動小数点数と整数または固定小数点数との間の変換を高速に実行します。３ＤＣＧで描画される画像オブジェクトの座標等のパラメータ算出に用いられます。CAU111 (Conversion Acceleration Unit)
Performs fast conversion between floating-point numbers and integer or fixed-point numbers. It is used to calculate parameters such as the coordinates of image objects drawn with 3DCG.

パワーマネジメントユニット１１２
ＣＰＵ１０２ならびにＤＳＰ１０３の動作周波数を段階的に制御し、また、使用していない機能ブロックへのクロック信号の供給を停止することにより、マルチメディアプロセッサＳｏＣの電力消費を低減します。Power management unit 112
Reduces the power consumption of the multimedia processor SoC by controlling the operating frequency of the CPU 102 and DSP 103 in stages, and stopping the supply of clock signals to unused functional blocks.

タイマ１１３
供給されるクロック信号に基づき、内蔵するカウンタの値を更新します。カウント値が所定の値に一致した時、ＣＰＵ１０２かつ／またはＤＳＰ１０３に対する割り込み要求信号をアサートすることも可能です。Timer 113
The built-in counter value is updated based on the supplied clock signal. When the count value matches the predetermined value, it is possible to assert an interrupt request signal for the CPU 102 and / or the DSP 103.

メールボックス１１４
ＣＰＵ１０２とＤＳＰ１０３との間のメッセージ通信に用いられます。Mailbox 114
It is used for message communication between CPU102 and DSP103.

ＧＰＩＯ＃１
汎用目的で用いられるデジタルＩ／Ｏポートです。本願カラオケ装置においては、キースイッチ１３に接続され、各キーのステータスが入力されます。GPIO # 1
Digital I / O port used for general purposes. In this application karaoke device, it is connected to the key switch 13 and the status of each key is input.

メモリバス２５
非同期インターフェースのＲＯＭ、非同期インターフェースのＳＲＡＭ、ＮＯＲ型フラッシュメモリ、ＳＤＲＡＭが接続可能です。本願カラオケ装置においては、ブートＲＯＭ２３および８Ｍバイト〜３２ＭバイトのＳＤＲＡＭが接続されます。Memory bus 25
Asynchronous interface ROM, asynchronous interface SRAM, NOR flash memory, SDRAM can be connected. In this application karaoke device, boot ROM23 and SDRAM of 8MB to 32MB are connected.

ビデオＤＡＣ１１５
ディスプレイコントローラ１０９から入力されるジタルデータストリームをアナログ信号に変換し、コンポジットビデオ信号として出力します。Video DAC115
The digital data stream input from the display controller 109 is converted to an analog signal and output as a composite video signal.

次に、図５、図６および図７を参照して、ＣＰＵ１０２およびＤＳＰ１０３の動作フローチャートを用いて、デコードされた動画データをカラオケの背景画像として表示する場合について説明する。Ｓ１は、ステップＳ１の略である。Ｓ２〜も同様である。また、そのステップがＣＰＵ１０２の動作を表している場合は、（ＣＰＵ１０２）と記載し、ＤＳＰ１０３の動作を示している場合は、（ＤＳＰ１０３）と記載することとする。Next, with reference to FIGS. 5, 6, and 7, the case where the decoded moving image data is displayed as a karaoke background image using the operation flowcharts of the CPU 102 and the DSP 103 will be described. S1 is an abbreviation for step S1. The same applies to S2 to S2. Further, when the step represents the operation of the CPU 102, it is described as (CPU 102), and when the step represents the operation of the DSP 103, it is described as (DSP 103).

Ｓ１（ＣＰＵ１０２）図５を参照。
システム起動後、ＣＰＵ１０２はブートＲＯＭ２３の所定のアドレスに格納されているプログラムコードの実行を開始します。実行開始後、各機能ブロックのハードウェア設定の初期化、ＤＳＰ１０３ローカルメモリとして割り当てるメモリ空間の設定、メインメモリ２４におけるＶＲＡＭ領域（フレームバッファ、テクスチャデータ等を格納）の設定等を行います。S1 (CPU 102) See FIG.
After starting the system, the CPU 102 starts executing the program code stored at a predetermined address in the boot ROM 23. After the start of execution, initialization of the hardware settings of each functional block, setting of the memory space allocated as the DSP 103 local memory, setting of the VRAM area (frame buffer, texture data, etc.) in the main memory 24 are performed.

Ｓ３（ＣＰＵ１０２）
カラオケ装置としてのタイトル画面の表示を行います。表示のためのプログラムコードならびに画像データは、ブートＲＯＭ２３に予め格納されています。S3 (CPU 102)
Displays the title screen as a karaoke device. The program code and image data for display are stored in the boot ROM 23 in advance.

Ｓ５（ＣＰＵ１０２）
メモリカートリッジ２からＣＰＵ１０２が実行するプログラムコード、ならびにＤＳＰ１０３が実行するプログラムコードをメインメモリ２４にロードします。S5 (CPU 102)
The program code executed by the CPU 102 and the program code executed by the DSP 103 are loaded into the main memory 24 from the memory cartridge 2.

Ｓ７（ＣＰＵ１０２）
プログラムコードのメインメモリ２４へのロード完了を待機します。S7 (CPU 102)
Wait for program code to be loaded into main memory 24.

Ｓ９（ＣＰＵ１０２）
ロード完了後、ＣＰＵ１０２がロードされたプログラムコードの実行を開始します。S9 (CPU 102)
After loading is complete, CPU102 starts executing the loaded program code.

Ｓ１１（ＣＰＵ１０２）
ＤＭＡにより、ＤＳＰ１０３が実行するプログラムコードをＤＳＰ１０３ローカルメモリに転送します。S11 (CPU 102)
The program code executed by the DSP 103 is transferred to the DSP 103 local memory by DMA.

Ｓ１３（ＣＰＵ１０２）
ＤＭＡ転送の完了を待機します。S13 (CPU 102)
Wait for DMA transfer to complete.

Ｓ１５（ＣＰＵ１０２）
転送完了後、ＤＳＰ１０３に対しＤＳＰローカルメモリ１０５に転送されたプログラムコードの実行開始を指示します。S15 (CPU 102)
After the transfer is complete, the DSP 103 is instructed to start executing the program code transferred to the DSP local memory 105.

Ｓ１７（ＤＳＰ１０３）
ＤＳＰ１０３は、プログラムコードの実行開始後、直ちにＤＳＰ１０３の実行環境の初期化処理を行います。S17 (DSP103)
The DSP 103 initializes the execution environment of the DSP 103 immediately after the execution of the program code is started.

Ｓ２１（ＣＰＵ１０２）図６を参照。
ＣＰＵ１０２は、所定量のデコード前動画データ、所定量のデコード前音声データをメインメモリ２４にロードする必要があるかどうかを確認します。最初にこのステップが実行される際には、メインメモリ２４には両者とも全くロードされていないので、必ず両者ともにロードの必要有り、と判断されます。ＣＰＵ１０２がロードする必要ありと判断した場合は、Ｓ２３へ進み、そうでない場合は、Ｓ２５へ進む。S21 (CPU 102) See FIG.
The CPU 102 checks whether or not a predetermined amount of pre-decoding moving image data and a predetermined amount of pre-decoding audio data need to be loaded into the main memory 24. When this step is executed for the first time, since both are not loaded into the main memory 24, it is determined that both need to be loaded. If the CPU 102 determines that it is necessary to load, the process proceeds to S23, and if not, the process proceeds to S25.

Ｓ２３（ＣＰＵ１０２）
所定量のデコード前動画データ、所定量のデコード前音声データのいずれか、または両方がロードの必要有りと判断されると、それらのデータのメモリカートリッジ２からメインメモリ２４へのロードを行います。S23 (CPU102)
If it is determined that either a predetermined amount of pre-decoding video data, a predetermined amount of pre-decoding audio data, or both need to be loaded, the data is loaded from the memory cartridge 2 to the main memory 24.

Ｓ２５（ＣＰＵ１０２）
現在のプレゼンテーションタイムの確認を行います。プレゼンテーションタイムは再生開始時に０にクリアされ、ビデオ信号の垂直同期、またはタイマ１１３によるカウント値に基づいて更新されます。S25 (CPU 102)
Check the current presentation time. The presentation time is cleared to 0 at the start of playback, and is updated based on the vertical synchronization of the video signal or the count value by the timer 113.

Ｓ２７（ＣＰＵ１０２）
メインメモリ２４にロードされた音声データのデコード処理はＣＰＵ１０２によって行われます。メインメモリ２４にロードされた音声データは、ＡＡＣ、ＭＰ３等の形式で圧縮されています。ＣＰＵ１０２は、これを解凍し、リニアＰＣＭのオーディオデータストリームを生成します。S27 (CPU 102)
The CPU 102 decodes the audio data loaded in the main memory 24. The audio data loaded into the main memory 24 is compressed in a format such as AAC or MP3. The CPU 102 decompresses it and generates a linear PCM audio data stream.

Ｓ２９（ＣＰＵ１０２）
デコードされた音声データ（リニアＰＣＭ形式）を再生するよう、サウンドプロセッサ１１０に指示します。S29 (CPU102)
Instructs the sound processor 110 to play back the decoded audio data (linear PCM format).

Ｓ３１（ＣＰＵ１０２）
次に表示すべき画像フレームデータが、ＤＳＰ１０３によって既にメインメモリ２４に格納されているか否かを確認します。ＣＰＵ１０２がメインメモリ２４に格納されていると判断した場合は、Ｓ３３へ進み、そうでない場合は、Ｓ４１へ進む。S31 (CPU 102)
Next, it is confirmed whether the image frame data to be displayed is already stored in the main memory 24 by the DSP 103. If the CPU 102 determines that it is stored in the main memory 24, the process proceeds to S33, and if not, the process proceeds to S41.

Ｓ３３（ＣＰＵ１０２）
現在のプレゼンテーションタイムが、次のビデオフレームで新たな画像フレームデータを表示すべきタイミングに到達しているか否かを確認します。ＣＰＵ１０２が該タイミングであると判断した場合は、Ｓ３５へ進み、そうでない場合は、Ｓ４１へ進む。S33 (CPU102)
Check whether the current presentation time has reached the timing for displaying new image frame data in the next video frame. If the CPU 102 determines that it is the timing, the process proceeds to S35, and if not, the process proceeds to S41.

Ｓ３５（ＣＰＵ１０２）
動画の表示品質を向上させるため、画像データにディザリング処理を施す場合があります。ディスプレイコントローラ１０９が扱う色データ形式は最大で１ピクセル当り１６ビットであるため、デコードされた動画データ（通常１ピクセル当り２４ビット）をそのまま１６ビットに精度を落とす処理を行うと、色の境界縞（マッハバンド）が目立ってしまいます。デコードされた動画データの各ピクセルの色データにランダムノイズを加えた後に精度を落とす処理を行うと、人間の目にはマッハバンドが目立たなくなります。この処理はディザリングと呼ばれます。ディザリング処理を行う際に、色空間の変換を行う必要がある場合があります。Ｈ．２６４ならびにＭＰＥＧ−４といった動画コーデックでは、デコード後の動画データの色空間の形式は、色差信号ＹＣｂＣｒとなっています。ディザリングの処理は、通常ＲＧＢ色空間で行われるので、ＹＣｂＣＲからＲＧＢへの色空間変換処理がディザリング処理の前に必要となります。本実施例においては、色空間変換処理はＣＰＵ１０２によって行われます。次に表示すべき画像フレームデータとして、ＤＳＰ１０３によってメインメモリ２４に格納されているデータの各ピクセルの色空間を、ＹＣｂＣｒからＲＧＢへと変換します。S35 (CPU 102)
In order to improve the display quality of the video, the image data may be dithered. The color data format handled by the display controller 109 is a maximum of 16 bits per pixel. Therefore, if the decoded moving image data (usually 24 bits per pixel) is reduced to 16 bits as it is, the color boundary stripes (Mach band) will stand out. If random noise is added to the color data of each pixel in the decoded video data and then the accuracy is reduced, the Mach band will not be noticeable to the human eye. This process is called dithering. When performing dithering processing, it may be necessary to convert the color space. H. For video codecs such as H.264 and MPEG-4, the color space format of the decoded video data is the color difference signal YCbCr. Since dithering is usually performed in the RGB color space, color space conversion processing from YCbCR to RGB is required before dithering. In this embodiment, the color space conversion process is performed by the CPU 102. As the image frame data to be displayed next, the color space of each pixel of the data stored in the main memory 24 by the DSP 103 is converted from YCbCr to RGB.

Ｓ３７（ＣＰＵ１０２）
色空間変換後の画像フレームデータに対し、ディザリング処理を行います。画像フレームデータの各ピクセルの表示ＸＹ座標に基づき、ノイズデータをデータテーブルより抽出し、各ピクセルの色データに加算します。S37 (CPU 102)
Dithering the image frame data after color space conversion. Based on the display XY coordinates of each pixel in the image frame data, noise data is extracted from the data table and added to the color data of each pixel.

Ｓ３９（ＣＰＵ１０２）
ディザリング処理が完了した画像フレームデータを、次のビデオフレームでビデオスクリーンに表示するように設定を行います。S39 (CPU 102)
Set the image frame data that has been dithered to be displayed on the video screen in the next video frame.

Ｓ４１（ＣＰＵ１０２）
字幕画像のディスプレイリストの生成を行います。字幕画像は、フレームバッファに描画され、ビデオ信号出力時にはディスプレイコントローラ１０９によってビデオスクリーンにオーバーレイされて表示されます。本実施例では、字幕画像の各文字データは、スプライト（矩形状の二次元グラフィックスオブジェクト）としてフレームバッファに描画されます。ＣＰＵ１０２は、各スプライトの描画座標、色深度、カラールックアップテーブル指定等のパラメータを含むディスプレイリストを作成します。S41 (CPU 102)
Generate a display list of subtitle images. The subtitle image is drawn in the frame buffer and displayed on the video screen by the display controller 109 when the video signal is output. In this embodiment, each character data of the subtitle image is drawn in the frame buffer as a sprite (rectangular two-dimensional graphics object). The CPU 102 creates a display list including parameters such as drawing coordinates, color depth, and color lookup table specification for each sprite.

Ｓ４３（ＣＰＵ１０２）
Ｓ４１で作成されたディスプレイリストに基づいて字幕画像をフレームバッファに描画するよう、グラフィックスプロセッサ１０７に描画指示を与えます。メインメモリ２４のＶＲＡＭ領域には、二セットのフレームバッファが備えられ、描画用と表示用とが随時切り替えられます。グラフィックスプロセッサ１０７は、描画用に設定されたフレームバッファに対する描画を行います。S43 (CPU 102)
A rendering instruction is given to the graphics processor 107 so that the subtitle image is rendered in the frame buffer based on the display list created in S41. The VRAM area of the main memory 24 is equipped with two sets of frame buffers, which can be switched between drawing and display at any time. The graphics processor 107 performs drawing on the frame buffer set for drawing.

Ｓ４５（ＣＰＵ１０２）
コンポジットビデオ信号における垂直ブランキング期間の開始を待機します。S45 (CPU 102)
Wait for the start of the vertical blanking period in the composite video signal.

Ｓ４７（ＣＰＵ１０２）
フレームバッファかつ／またはビデオスクリーンの更新が完了していれば、ディスプレイコントローラ１０９が表示を行うために読み出すフレームバッファかつ／またはビデオスクリーン用データの切り替えを行います。S47 (CPU 102)
If the update of the frame buffer and / or video screen has been completed, the display controller 109 switches the frame buffer and / or video screen data to be read for display.

Ｓ４８（ＣＰＵ１０２）
コンポジットビデオ信号の垂直ブランキング期間の開始、またはタイマ１１３のカウント値に基づいてプレゼンテーションタイムの更新を行います。S48 (CPU 102)
The presentation time is updated based on the start of the vertical blanking period of the composite video signal or the count value of the timer 113.

次にＤＳＰ１０３の処理を説明する。
Ｓ４９（ＤＳＰ１０３）
ＤＳＰ１０３は、所定量のデコード前動画データが、メインメモリ２４に既にロードされているか否かを確認します。そして、ロードされるまで待機します。Next, the processing of the DSP 103 will be described.
S49 (DSP103)
The DSP 103 checks whether or not a predetermined amount of pre-decode video data has already been loaded into the main memory 24. Then wait for it to load.

Ｓ５１（ＤＳＰ１０３）
ＤＭＡを用い、ロード済みのデコード前動画データをメインメモリ２４からＤＳＰローカルメモリ１０５に転送します。S51 (DSP103)
Transfers pre-decoded video data from the main memory 24 to the DSP local memory 105 using DMA.

Ｓ５３（ＤＳＰ１０３）
動画のデコード処理を行います。本実施例では、ＤＳＰ１０３はＨ．２６４デコーダソフトウェアまたはＭＰＥＧ−４デコーダソフトウェアを実行します。S53 (DSP103)
Decodes the video. In this embodiment, the DSP 103 is H.264. Runs H.264 decoder software or MPEG-4 decoder software.

Ｓ５５（ＤＳＰ１０３）
ＤＭＡを用い、次に表示すべき画像フレームデータをＤＳＰローカルメモリ１０５からメインメモリ２４に転送します。S55 (DSP103)
The image frame data to be displayed next is transferred from the DSP local memory 105 to the main memory 24 using DMA.

Ｓ５７（ＤＳＰ１０３）
メールボックス１１４を用い、次に表示すべき画像フレームデータがメインメモリ２４に格納済みであることをＣＰＵ１０２に通知します。S57 (DSP103)
The mail box 114 is used to notify the CPU 102 that the image frame data to be displayed next is already stored in the main memory 24.

Ｓ５９（ＤＳＰ１０３）
次のデコード処理の単位に必要なデコード前動画データが、ＤＳＰローカルメモリ１０５中に存在するか否かを確認します。既に存在していれば、次の画像フレームデータのデコード処理を継続できます。（Ｓ４９へ戻る）そうでない場合は、Ｓ５３に戻る。S59 (DSP103)
Check whether the pre-decode video data required for the next decoding unit exists in the DSP local memory 105. If it already exists, you can continue decoding the next image frame data. (Return to S49) Otherwise, return to S53.

図７を参照して、カラオケの伴奏ならびにマイクロフォン１１から入力されたユーザの音声に応じて変化するＣＧキャラクタをカラオケの背景画像として表示する場合における、ＣＰＵ１０２ならびにＤＳＰ１０３で行われる処理の内容を示すフローチャートを説明する。（シンクロＣＧ）初期化部分の処理（Ｓ１〜Ｓ１７）は図５の動画再生のフローチャートと同じであるため、省略しています。但し、ＤＳＰ１０３で実行されるプログラムコードの内容については、動画再生の場合とは異なります。Referring to FIG. 7, a flowchart showing the contents of processing performed by CPU 102 and DSP 103 when a CG character that changes in accordance with the accompaniment of karaoke and the user's voice input from microphone 11 is displayed as a karaoke background image. Will be explained. (Synchronized CG) The initialization part processing (S1 to S17) is the same as the moving image playback flowchart of FIG. However, the content of the program code executed by the DSP 103 is different from the case of moving image playback.

Ｓ６１（ＣＰＵ１０２）
マイクから入力された音声データをＡＤＣ付きマイコン２２から受領し、メインメモリ２４に格納します。外部マイクロフォン５も用いて二人分のユーザの声が入力される場合には、二つの音声データストリームがインターリーブされてＡＤＣ付きマイコン２２から入力されます。S61 (CPU 102)
Audio data input from the microphone is received from the microcomputer 22 with ADC and stored in the main memory 24. When two users' voices are input using the external microphone 5, the two audio data streams are interleaved and input from the microcomputer 22 with ADC.

Ｓ６３（ＣＰＵ１０２）
所定量の音声データの受領が完了を待機します。ＤＳＰ１０３で行われる処理には、所定サンプル数のＰＣＭ形式の音声データが必要とされます。S63 (CPU102)
Waiting for the completion of receiving the specified amount of audio data. The processing performed by the DSP 103 requires a predetermined number of samples of PCM audio data.

Ｓ６５（ＣＰＵ１０２）
メールボックス１１４を用い、ＤＳＰ１０３に所定量の音声データが既にメインメモリ２４に格納されていることを通知します。S65 (CPU 102)
Using the mail box 114, the DSP 103 is notified that a predetermined amount of audio data is already stored in the main memory 24.

Ｓ６７（ＣＰＵ１０２）
サウンド処理用カウンタの値を更新します。本実施例では、カラオケの伴奏はＭＩＤＩベースの楽譜データに基づいて行われます。この場合、テンポや各音符の再生タイミングの管理については、ＣＰＵ１０２がサウンド処理用カウンタに基づいて行います。S67 (CPU102)
Updates the value of the sound processing counter. In this example, karaoke accompaniment is based on MIDI-based score data. In this case, the CPU 102 performs management of the playback timing of the tempo and each note based on the sound processing counter.

Ｓ６９（ＣＰＵ１０２）
楽譜データは、各音符の「楽器の種類」「再生開始タイミング」、「再生停止タイミング」、「ピッチ（音の高さ）」、「ベロシティ（発音の強さ）」、「パンポット（音像の位置）」等の情報を、時系列で格納しています。ＣＰＵ１０２は、楽譜データの読み出し位置を指し示すポインタを管理し、サウンド処理カウンタの値に応じてポインタをどこまで進めるかを制御します。このステップでは、ポインタが現在指し示す位置の楽譜データを読み出し、解釈を行います。S69 (CPU 102)
The musical score data is “musical instrument type”, “playback start timing”, “playback stop timing”, “pitch (pitch)”, “velocity (sounding intensity)”, “panpot (sound image Information such as “Position” ”is stored in chronological order. The CPU 102 manages a pointer indicating the reading position of the score data, and controls how far the pointer is advanced according to the value of the sound processing counter. In this step, the musical score data at the position pointed to by the pointer is read and interpreted.

Ｓ７０（ＣＰＵ１０２）
ポインタの値を、楽譜データから次に読み出しを行うべき位置へ更新します。S70 (CPU 102)
The pointer value is updated from the musical score data to the position to be read next.

Ｓ７１（ＣＰＵ１０２）
サウンド処理用カウンタの値に基づき、ポインタの値が今回処理を行うべき楽譜データを全て読み出し・解釈済みであるか否かを判断します。読み出し済みならＳ７３へ進み、そうでないならＳ６９へ戻る。S71 (CPU 102)
Based on the value of the sound processing counter, it is determined whether or not the pointer value has been read and interpreted all the musical score data to be processed this time. If it has been read, the process proceeds to S73, and if not, the process returns to S69.

Ｓ７３（ＣＰＵ１０２）
読み出した楽譜データの「楽器の種類」「再生開始タイミング」、「再生停止タイミング」、「ピッチ（音の高さ）」、「ベロシティ（発音の強さ）」、「パンポット（音像の位置）」の情報に応じて、描画する画像オブジェクトの「種類」、「大きさ」、「色」、「形状」、「座標」のパラメータを算出します。S73 (CPU102)
"Musical instrument type", "Playback start timing", "Playback stop timing", "Pitch (pitch)", "Velocity (sounding strength)", "Panpot (sound image position)" The "type", "size", "color", "shape", and "coordinate" parameters of the image object to be drawn are calculated according to the information of "".

Ｓ７５（ＣＰＵ１０２）
メールボックス１１４を確認し、ＤＳＰ１０３からマイク入力音声のパラメータが通知されているか否かを判断します。パラメータが通知済みならばＳ７７へ進み、そうでないなら待機する。S75 (CPU102)
Check the mailbox 114 and determine whether the DSP 103 has notified the microphone input voice parameters. If the parameter has been notified, the process proceeds to S77, and if not, waits.

Ｓ７７（ＣＰＵ１０２）
ＤＳＰ１０３から通知されるマイク入力音声のパラメータ「平均レベル」、「ピークレベル」、「最もエネルギーの大きな周波数成分」、「特定の周波数成分のエネルギー」に応じて、描画する画像オブジェクトの「種類」、「大きさ」、「色」、「形状」、「座標」のパラメータを変化させます。S77 (CPU 102)
According to the parameters “average level”, “peak level”, “frequency component with the largest energy”, and “energy of a specific frequency component” of the microphone input sound notified from the DSP 103, the “type” of the image object to be drawn, Change the "Size", "Color", "Shape" and "Coordinate" parameters.

Ｓ７９（ＣＰＵ１０２）
Ｓ７３ならびにＳ７７にて算出された画像オブジェクトのパラメータに基づき、背景として表示される画像オブジェクトのディスプレイリストを作成します。また、字幕画像のディスプレイリストも合わせて作成します。背景として表示される画像オブジェクトの内、２ＤＣＧで再現されるものはスプライト、３ＤＣＧで再現されるものはポリゴンとしてフレームバッファに描画されます。字幕画像を構成する各文字は、動画再生の場合と同様に、スプライトとしてフレームバッファに描画されます。ＣＰＵ１０２は、「各ポリゴンの描画座標、頂点色、テクスチャパターン、テクスチャ色深度、頂点テクスチャ座標等」、ならびに、「スプライトの描画座標、色深度、カラールックアップテーブル指定等」のパラメータを含むディスプレイリストを作成します。S79 (CPU 102)
Based on the image object parameters calculated in S73 and S77, a display list of image objects to be displayed as a background is created. A display list of subtitle images is also created. Of the image objects displayed as the background, those reproduced with 2DCG are drawn in the frame buffer as sprites and those reproduced with 3DCG as polygons. Each character that makes up a subtitle image is drawn as a sprite in the frame buffer, just like when playing a movie. The CPU 102 displays a display list including parameters such as “drawing coordinates of each polygon, vertex color, texture pattern, texture color depth, vertex texture coordinates, etc.” and “sprite drawing coordinates, color depth, color lookup table designation, etc.” Create a.

Ｓ８１（ＣＰＵ１０２）
Ｓ７９で作成されたディスプレイリストに基づいて、背景として表示される画像オブジェクト、ならびに、字幕画像をフレームバッファに描画するよう、グラフィックスプロセッサ１０７に描画指示を与えます。S81 (CPU 102)
Based on the display list created in S79, the graphics processor 107 is instructed to draw the image object displayed as the background and the subtitle image in the frame buffer.

Ｓ８３（ＣＰＵ１０２）
コンポジットビデオ信号における垂直ブランキング期間の開始を待機します。S83 (CPU102)
Wait for the start of the vertical blanking period in the composite video signal.

Ｓ８４（ＣＰＵ１０２）
フレームバッファの更新が完了していれば、ディスプレイコントローラ１０９が表示を行うために読み出すフレームバッファの切り替えを行います。Ｓ８４の終了後、Ｓ６１に戻ります。S84 (CPU102)
If the frame buffer update is complete, the display controller 109 switches the frame buffer to be read for display. After S84 ends, the process returns to S61.

一方、ＤＳＰ１０３の動作について説明する。
Ｓ８５（ＤＳＰ１０３）
所定量のマイク入力音声データが、メインメモリ２４に既にロードされているか否かを確認します。ロードされていた場合は、Ｓ８７に進み、そうでない場合は待機する。On the other hand, the operation of the DSP 103 will be described.
S85 (DSP103)
Check whether a predetermined amount of microphone input audio data is already loaded in the main memory 24. If it has been loaded, the process proceeds to S87, and if not, it waits.

Ｓ８７（ＤＳＰ１０３）
ＤＭＡを用い、ロード済みのマイク入力音声データをメインメモリ２４からＤＳＰローカルメモリ１０５に転送します。S87 (DSP103)
Using DMA, the loaded microphone input audio data is transferred from the main memory 24 to the DSP local memory 105.

Ｓ８９（ＤＳＰ１０３）
マイク入力音声データの「平均レベル」ならびに「ピークレベル」の算出を行います。平均レベルは、全サンプルの平均値を求めた後、平均偏差または標準偏差より求めます。ピークレベルは、全サンプルの平均値からの最大偏差より求めます。S89 (DSP103)
Calculates “average level” and “peak level” of microphone input voice data. The average level is calculated from the average deviation or standard deviation after calculating the average value of all samples. The peak level is obtained from the maximum deviation from the average value of all samples.

Ｓ９１（ＤＳＰ１０３）
マイク入力音声データに対するＦＦＴ（高速フーリエ変換）処理を行います。処理結果より、「最もエネルギーの大きな周波数成分」ならびに「特定の周波数成分のエネルギー」を算出します。S91 (DSP103)
Performs FFT (Fast Fourier Transform) processing for microphone input audio data. From the processing results, calculate the “frequency component with the largest energy” and the “energy of a specific frequency component”.

Ｓ９３（ＤＳＰ１０３）
メールボックス１１４を用い、Ｓ８９ならびにＳ９１で算出したマイク入力音声データの「平均レベル」、「ピークレベル」、「最もエネルギーの大きな周波数成分」、「特定の周波数成分のエネルギー」をＣＰＵ１０２に通知します。Ｓ９３の終了後、Ｓ８５に戻ります。S93 (DSP103)
Using the mailbox 114, the CPU 102 is notified of the “average level”, “peak level”, “frequency component with the largest energy”, and “energy of a specific frequency component” of the microphone input voice data calculated in S89 and S91. . After S93 ends, the process returns to S85.

次に、本発明のカラオケ装置を用いた、アプリケーションソフトの例を示す。Next, an example of application software using the karaoke apparatus of the present invention will be shown.

１．動画背景とシンクロＣＧの合成例
ＤＳＰ１０３は、Ｈ．２６４またはＭＰＥＧ−４のデコーダソフトウェアを実行し、圧縮された動画データを解凍し、メインメモリ２４に格納する。1. An example DSP 103 for synthesizing a moving image background and synchro CG is H.264. H.264 or MPEG-4 decoder software is executed, and the compressed moving image data is decompressed and stored in the main memory 24.

ＣＰＵ１０２は、ＭＩＤＩベースの楽譜データに基づき、サウンドプロセッサ１１０に対してカラオケの伴奏の再生指示を行う。The CPU 102 instructs the sound processor 110 to reproduce karaoke accompaniment based on the MIDI-based score data.

また、ＣＰＵ１０２は、楽譜データ中の音符のパラメータ、ならびにマイク音声のパラメータに基づき、３ＤＣＧ画像オブジェクトのパラメータを算出する。さらに算出したパラメータに基づき、３ＤＣＧ画像オブジェクトを構成する各ポリゴンのディスプレイリストを生成し、グラフィックスプロセッサ１０７に渡す。Further, the CPU 102 calculates the parameters of the 3DCG image object based on the note parameters in the score data and the microphone sound parameters. Further, based on the calculated parameters, a display list of each polygon constituting the 3DCG image object is generated and passed to the graphics processor 107.

また、ＣＰＵ１０２は、楽譜データに埋め込まれた歌詞データならびに歌詞表示制御データに基づき、字幕画像を構成する各スプライトのディスプレイリストを生成し、グラフィックスプロセッサ１０７に渡す。Further, the CPU 102 generates a display list of each sprite constituting the subtitle image based on the lyrics data embedded in the score data and the lyrics display control data, and passes it to the graphics processor 107.

グラフィックスプロセッサ１０７は、ディスプレイリストに基づき、ポリゴンで構成された３ＤＣＧ画像オブジェクト、ならびにスプライトで構成された字幕画像をフレームバッファに描画する。The graphics processor 107 draws a 3DCG image object composed of polygons and a subtitle image composed of sprites in a frame buffer based on the display list.

ディスプレイコントローラ１０９は、メインメモリ２４に格納された解凍済み動画データをビデオスクリーンの上に、フレームバッファに格納された画像をオーバーレイして表示する。The display controller 109 displays the decompressed moving image data stored in the main memory 24 on the video screen by overlaying the image stored in the frame buffer.

これにより、予め録画された動画の上に、伴奏やマイク音声に応じてリアルタイムに変化するＣＧキャラクタを合成した画像を、カラオケ背景として用いることが可能。As a result, it is possible to use, as a karaoke background, an image in which a CG character that changes in real time according to accompaniment or microphone sound is synthesized on a previously recorded video.

２．シンクロＣＧカラオケの表示例
ポリゴンで表現される３ＤＣＧ画像オブジェクトとして「人間」を表示。2. Display example of synchro CG karaoke “Human” is displayed as a 3DCG image object represented by polygons.

伴奏の楽譜データ中の、「ピアノ」の音符の「ピッチ（音の高さ）」に応じて「人間」の手の位置が上下に移動。ピッチが高ければ手の位置も高く、ピッチが低ければ手の位置も低く描画される。The position of the “human” hand moves up and down according to the “pitch (pitch)” of the “piano” notes in the accompaniment score data. If the pitch is high, the hand position is high, and if the pitch is low, the hand position is low.

「人間」が画面内でジャンプを繰り返している。マイク音声データの「ピークレベル」に応じてジャンプの高さが決定される。ピークレベルが高ければジャンプの高さが高くなり、ピークレベルが低ければジャンプの高さも低くなる。“Human” repeatedly jumps on the screen. The jump height is determined according to the “peak level” of the microphone sound data. If the peak level is high, the jump height is high, and if the peak level is low, the jump height is low.

３．伴奏やマイク音声に応じてインタラクティブに変化するカラオケ動画背景例
多数の短い動画データがメモリカートリッジ２に予め格納されている。3. A large number of short karaoke video background examples that change interactively according to accompaniment and microphone sound are stored in the memory cartridge 2 in advance.

ＣＰＵ１０２は、楽譜データ中の音符のパラメータ、ならびにマイク音声のパラメータに基づき、現在再生中の動画データの再生が完了するまでに、どの動画データを次に再生するを決定し、デコード前の動画データをメモリカートリッジ２からメインメモリ２４にロードする。The CPU 102 determines which moving image data is to be reproduced next before the reproduction of the currently reproduced moving image data based on the note parameters in the musical score data and the microphone sound parameters, and the moving image data before decoding. Is loaded from the memory cartridge 2 into the main memory 24.

ＤＳＰ１０３は、メインメモリ２４に格納された動画データを順次デコード（解凍）する。The DSP 103 sequentially decodes (decompresses) the moving image data stored in the main memory 24.

これにより、例えば、伴奏やマイク音声に応じてストーリーが変化する動画をカラオケ背景として表示できる。Thereby, for example, a moving image whose story changes according to accompaniment or microphone sound can be displayed as a karaoke background.

さらに、本発明のカラオケ装置は次のような特徴を持ってもよい。
４．例えば、「カラオケ装置を用いるボイストレーニング」を実行する場合は、次のような構成を用いて実現することが可能である。Furthermore, the karaoke apparatus of the present invention may have the following features.
4). For example, when “voice training using a karaoke device” is executed, it can be realized using the following configuration.

マルチメディアプロセッサＳｏＣ１００に搭載されているＤＳＰ１０３は、マイクから入力されデジタルデータに変換されたユーザの音声に対し、ＦＦＴ（高速フーリエ変換）を施し、最もエネルギーの大きな周波数成分を算出する。The DSP 103 mounted on the multimedia processor SoC 100 performs FFT (Fast Fourier Transform) on the user's voice input from the microphone and converted into digital data, and calculates a frequency component having the largest energy.

マルチメディアプロセッサＳｏＣ１００に搭載されているＣＰＵ１０２は、ＭＩＤＩベースの楽譜データをメモリカートリッジ２から読み出しつつ、サウンドプロセッサ１１０に指示を与え、カラオケ伴奏を再生する。The CPU 102 mounted on the multimedia processor SoC 100 gives instructions to the sound processor 110 while reading out MIDI-based score data from the memory cartridge 2 and reproduces the karaoke accompaniment.

楽譜データには、「ボイスガイドデータ」が含まれる。ボイスガイドデータは、ユーザが歌うべき音階の各音符の音の高さならびにタイミングを示すデータである。ＣＰＵ１０２はボイスガイドデータを楽譜データから読み出すが、必ずしもサウンドプロセッサ１１０を用いて再生は行わない。The score data includes “voice guide data”. The voice guide data is data indicating the pitch and timing of each note of the scale to be sung by the user. The CPU 102 reads out the voice guide data from the musical score data, but does not necessarily reproduce using the sound processor 110.

ＣＰＵ１０２は、ボイスガイドデータの各音符に対応する、１または複数の第１の楽譜画像を画面に表示するようグラフィックスプロセッサ１０７に指示を与える。音符画像の表示位置は、ボイスガイドデータの音符の音の高さに関連付けられる。例えば、音の高さが高ければ、音符画像の画面上における垂直表示位置が高くなる。The CPU 102 instructs the graphics processor 107 to display one or more first score images corresponding to each note of the voice guide data on the screen. The display position of the note image is associated with the pitch of the note of the voice guide data. For example, if the pitch of the sound is high, the vertical display position of the note image on the screen becomes high.

ボイスガイドデータの各音符に対応する音符画像の近傍に、第１の楽譜画像に一対一に対応する第２の楽譜画像を画面に表示するようグラフィックスプロセッサ１０７に指示を与える。第１の楽譜画像と第２の楽譜画像とは、例えば色または形状が異なり、ユーザが画面上でその違いを識別可能なようになされている。第２の音符画像の表示位置は、ＤＳＰ１０３から伝達された「ユーザの音声の最もエネルギーの大きな周波数成分」の高さに関連付けられる。例えば、音の高さが高ければ、第２の音符画像の画面上における垂直表示位置が高くなる。The graphics processor 107 is instructed to display on the screen a second score image corresponding to the first score image in the vicinity of the note image corresponding to each note of the voice guide data. The first score image and the second score image are different in color or shape, for example, so that the user can identify the difference on the screen. The display position of the second note image is associated with the height of “frequency component with the highest energy of the user's voice” transmitted from the DSP 103. For example, if the pitch of the sound is high, the vertical display position of the second note image on the screen becomes high.

ユーザは、画面上の二種類の音符画像の表示位置の違いから、歌うべき音階の高さと、現在歌っている音の高さとのずれをリアルタイムに認識することができるため、カラオケのトレーニングとしての効果が期待できる。The user can recognize in real time the difference between the pitch of the musical note to be sung and the pitch of the current singing sound from the difference in the display position of the two types of note images on the screen. The effect can be expected.

５．音声を利用したゲーム
・音声認識機能や英語発音評定機能を付加することにより、３ＤＣＧなどとあわせて、従来機器では実現困難なゲームコンテンツの提供が可能である。
・本発明によれば、拡張ポート１７にマイク、マラカス、タイコ等を接続することによって、音楽の演奏や各種音楽ゲームを実行可能である。5. By adding a game / speech recognition function using speech and an English pronunciation rating function, it is possible to provide game content that is difficult to achieve with conventional devices in combination with 3DCG and the like.
According to the present invention, music performance and various music games can be executed by connecting a microphone, maracas, tyco, etc. to the expansion port 17.

６．伴奏用楽器
本体拡張ポート１７に接続することにより、楽器を用いた伴奏が可能になる。マラカスや太鼓などの演奏はもとより、楽器マイナス１演奏（ちょうどよいタイミングで楽器を操作することにより、そのタイミングの音程・音色での音が鳴る）によりギターやピアノなどの音程がある楽器の演奏も可能である。6). By connecting to the accompaniment instrument main body expansion port 17, accompaniment using an instrument becomes possible. In addition to playing maracas, drums, etc., playing instruments minus 1 (playing the instrument at the right timing will produce a sound with the timing and tone of that timing) will also play an instrument with a pitch such as a guitar or piano Is possible.

７．通信カラオケ
ＲＯＭカートリッジスロットに無線ＬＡＮモジュールを差し込むことにより、ＰＣ不要の通信カラオケが実現可能である。7). By inserting a wireless LAN module into the communication karaoke ROM cartridge slot, a PC-free communication karaoke can be realized.

８．動画表示機能を活かした「カラドラマ」・「カラアニメ」
既存動画コンテンツの二次利用が可能なので、ドラマやアニメの登場人物になりきることができる「カラドラマ」や「カラアニメ」コンテンツの提供が可能である。（「カラドラマ」・「カラアニメ」とは、ドラマやアニメの登場人物のアテレコを楽しむことができるコンテンツです。）特定の人物の音声だけをミュートした状態で、その人物の台詞をアテレコすることにより、あたかも、その番組に登場しているような楽しみを得ることができます。さらに、音声認識を利用して、ユーザのアテレコの内容によりドラマの展開を変えるなど、これまでの機器では実現が困難であった、さまざまな遊びを提供することが可能です。8). “Kara-drama” and “Kara-Anime” utilizing the video display function
Since secondary use of existing video content is possible, it is possible to provide “Kara-drama” and “Kara-Anime” content that can become characters in drama and anime. ("Kara-drama" and "Kara-Anime" are contents that allow you to enjoy the drama and anime characters' telephony.) By muting only the voice of a specific person, by telcoing that person's dialogue , You can get the fun as if it were on the show. Furthermore, by using voice recognition, it is possible to provide a variety of games that were difficult to achieve with conventional devices, such as changing the development of dramas depending on the contents of the user's ateleco.

なお、本願発明のカラオケ装置の特徴をまとめると次のようになる。
本願発明を用いることにより、テレビに接続する家庭用カラオケ装置を安価に実現することができます。本願発明は、ＣＰＵ、グラフィックスプロセッサ１０７、サウンドプロセッサ１１０等のカラオケ装置に必要とされる機能を内蔵しているだけでなく、搭載されているＤＳＰ１０３により、圧縮された動画データの解凍、様々なサウンドプロセッシング等を行うことが可能です。
本願発明のカラオケ装置の特長を以下に示します。The characteristics of the karaoke apparatus of the present invention are summarized as follows.
By using the present invention, a home karaoke device connected to a television can be realized at low cost. The present invention not only incorporates functions necessary for a karaoke apparatus such as a CPU, a graphics processor 107, and a sound processor 110, but also decompresses moving image data compressed by the DSP 103 installed, Sound processing etc. can be performed.
The features of the karaoke device of the present invention are as follows.

１．動画によるカラオケの背景
本願発明のカラオケ装置に搭載されているＣＰＵ１０２ならびにＤＳＰ１０３により、Ｈ．２６４またはＭＰＥＧ−４による動画をカラオケの背景として再生することが可能です。Ｈ．２６４でＱＶＧＡ（３２０×２４０ピクセル）／１５ｆｐｓ、ＭＰＥＧ−４でＱＶＧＡ／３０ｆｐｓを達成できる。1. Background of Karaoke by Moving Image The CPU 102 and the DSP 103 mounted on the karaoke apparatus of the present invention make it possible to H.264 or MPEG-4 video can be played back as a karaoke background. H. H.264 can achieve QVGA (320 × 240 pixels) / 15 fps, and MPEG-4 can achieve QVGA / 30 fps.

２．さまざまなオーディオコーデックへの対応
ＤＳＰ１０３のミドルウェアとして、ＭＰ３、ＡＡＣＳＢＲ等の様々なオーディオコーデック（デコーダ）が既に用意されています。これにより、従来のＭＩＤＩベースの家庭用カラオケ機器では再生できなかった「ボーカル付きのオーディオトラック」や「生演奏を録音したトラック」をカラオケの伴奏として再生することが可能になりました。2. Correspondence to various audio codecs As middleware of DSP103, various audio codecs (decoders) such as MP3 and AAC SBR are already prepared. This has made it possible to play “audio tracks with vocals” and “tracks recorded with live performances” as accompaniment to karaoke, which could not be played with conventional MIDI-based home karaoke equipment.

３．伴奏、マイク音声にシンクロした３Ｄグラフィックス
本願発明のカラオケ装置に搭載されているＧＰＵの優れた３Ｄグラフィックス生成機能と、ＤＳＰ１０３で実行されるＦＦＴ等のミドルウェアとの組み合わせにより、「カラオケ伴奏」ならびに「マイクから入力されたユーザーの音声」にシンクロした３Ｄグラフィックスをカラオケの背景として表示することが可能です。3. Accompaniment, 3D graphics synchronized to mic sound By combining the excellent 3D graphics generation function of the GPU installed in the karaoke apparatus of the present invention with middleware such as FFT executed by the DSP 103, "Karaoke accompaniment" and It is possible to display 3D graphics synchronized with “user's voice input from microphone” as the background of karaoke.

図１はこの発明のカラオケ装置の接続図である。FIG. 1 is a connection diagram of the karaoke apparatus of the present invention. 図２はこの発明のカラオケ装置から出力される映像信号の表示例である。FIG. 2 is a display example of a video signal output from the karaoke apparatus of the present invention. 図３は図１に示すゲーム装置の電気的な構成を示すブロック図である。FIG. 3 is a block diagram showing an electrical configuration of the game apparatus shown in FIG. 図４は図３に示すマルチメディアプロセッサＳｏＣの電気的な構成を示すブロック図である。FIG. 4 is a block diagram showing an electrical configuration of the multimedia processor SoC shown in FIG. 図５はＣＰＵ１０２およびＤＳＰ１０３の動作を示すフロー図である。FIG. 5 is a flowchart showing the operations of the CPU 102 and the DSP 103. 図６はＣＰＵ１０２およびＤＳＰ１０３の動作を示すフロー図である。FIG. 6 is a flowchart showing the operations of the CPU 102 and the DSP 103. 図７はＣＰＵ１０２およびＤＳＰ１０３の動作を示すフロー図である。FIG. 7 is a flowchart showing the operations of the CPU 102 and the DSP 103.

Explanation of symbols

１ …カラオケ装置
２ …メモリカートリッジ
３ …家庭用テレビ
４ …電子楽器等
５ …外部マイクロフォン
１１ …マイクロフォン
１２ …メモリカートリッジコネクタ
１３ …キースイッチ
１４ …映像出力端子
１５ …音声出力端子（左）
１６ …音声出力端子（右）
１７ …拡張ポート
１８ …外部マイクロフォン入力端子
２１ …入力段アンプ
２２ …ＡＤＣ付きマイコン
２３ …ブートＲＯＭ
２４ …メインメモリ
２５ …メモリバス
２６ …オーディオＤＡＣ
２７ …出力段アンプ
２８ …乾電池
２９ …電源回路
１００…マルチメディアプロセッサＳｏＣ
１０１…ＣＣＣＰ
１０２…ＣＰＵ
１０３…ＤＳＰ
１０４…ＡＨＢ
１０５…ＤＳＰローカルメモリ
１０６…ＤＭＡＣ
１０７…グラフィックスプロセッサ
１０８…ＧＰＵローカルメモリ
１０９…ディスプレイコントローラ
１１０…サウンドプロセッサ
１１１…ＣＡＵ
１１２…パワーマネジメントユニット
１１３…タイマ
１１４…メールボックス
１１５…ビデオＤＡＣ
１１６…ＧＰＩＯ＃Ｏ
１２０…ＡＰＢDESCRIPTION OF SYMBOLS 1 ... Karaoke apparatus 2 ... Memory cartridge 3 ... Home television 4 ... Electronic musical instrument etc. 5 ... External microphone 11 ... Microphone 12 ... Memory cartridge connector 13 ... Key switch 14 ... Video output terminal 15 ... Audio | voice output terminal (left)
16: Audio output terminal (right)
17 ... expansion port 18 ... external microphone input terminal 21 ... input stage amplifier 22 ... microcomputer 23 with ADC ... boot ROM
24 ... main memory 25 ... memory bus 26 ... audio DAC
27 ... Output stage amplifier 28 ... Dry battery 29 ... Power supply circuit 100 ... Multimedia processor SoC
101 ... CCCP
102 ... CPU
103 ... DSP
104 ... AHB
105 ... DSP local memory 106 ... DMAC
107 ... Graphics processor 108 ... GPU local memory 109 ... Display controller 110 ... Sound processor 111 ... CAU
112 ... Power management unit 113 ... Timer 114 ... Mailbox 115 ... Video DAC
116 ... GPIO # O
120 ... APB

Claims

A karaoke device capable of outputting a video signal and an audio signal to a home TV device,
A microphone built into or detachable from the karaoke device;
A built-in or detachable memory in the karaoke device, and an image object expressed in two and / or three dimensions are generated, and an image composed of one or a plurality of the image objects is generated as a karaoke background image And image processing means for synthesizing with a lyric image showing lyrics of karaoke and outputting as a video signal,
The image processing means includes
“Musical instrument type”, “pitch”, “sound duration”, “sounding intensity (velocity)”, “sound image” included in the “karaoke accompaniment score data stored in the memory” "Panpot", as well as "Data obtained by analog-to-digital conversion of user's voice input from microphone", "Average level", "Peak level", "Most energy" Depending on one or more of “large frequency component” and “energy of specific frequency component”, one of “type”, “size”, “color”, “shape”, and “coordinate” of the image object A karaoke apparatus characterized by dynamically changing a plurality of parameters.

The karaoke device
In addition, it has a connection port that can connect electronic musical instruments
“Musical instrument performance data input from the connection port in a format similar to MIDI or MIDI”, “Musical instrument type”, “Pitch”, “Sound duration”, “Strong pronunciation” “Type”, “size”, “color”, “shape”, “coordinate” of the image object in accordance with one or more parameters of “sales (velocity)” and “sound image position (panpot)” The karaoke apparatus according to claim 1, wherein one or a plurality of parameters are dynamically changed.