JP6289330B2

JP6289330B2 - Video information distribution system and server

Info

Publication number: JP6289330B2
Application number: JP2014199458A
Authority: JP
Inventors: 路子新井
Original assignee: Xing Inc
Current assignee: Xing Inc
Priority date: 2014-09-29
Filing date: 2014-09-29
Publication date: 2018-03-07
Anticipated expiration: 2034-09-29
Also published as: JP2016071087A

Description

本発明は、通信回線を介して複数の通信端末装置に対する動画情報の配信を行う動画情報配信システム及びサーバに関し、特に、共演動画情報の組み合わせ候補の検索に関して、各利用者の音域に合った適切な動画情報の検索を実現するための改良に関する。 The present invention relates to a moving image information distribution system and server that distribute moving image information to a plurality of communication terminal devices via a communication line, and particularly, for searching for combination candidate of co-starring moving image information, suitable for each user's sound range. The present invention relates to an improvement for realizing a search for dynamic video information.

所定の通信回線を介して複数の通信端末装置に対する動画情報の配信を行う動画情報配信システムが知られており、近年における通信技術の向上に伴いコンテンツビジネスの中核として更なる発展が期待されている。そのような動画情報配信システムにおいては、通常、所謂動画投稿サイトにおいて所定の通信端末装置から投稿（アップロード）された複数の動画情報が所定のデータベースに記憶され、任意の通信端末装置によりその動画投稿サイトを介してのアクセスが可能とされる。それにより、前記所定の通信端末装置により投稿された動画情報が、任意の通信端末装置により閲覧可能とされる。 A moving image information distribution system that distributes moving image information to a plurality of communication terminal devices via a predetermined communication line is known, and further development is expected as the core of the content business with recent improvements in communication technology. . In such a moving image information distribution system, a plurality of pieces of moving image information posted (uploaded) from a predetermined communication terminal device in a so-called moving image posting site is usually stored in a predetermined database, and the moving image posting is performed by an arbitrary communication terminal device. Access through the site is possible. Thereby, the moving image information posted by the predetermined communication terminal device can be browsed by any communication terminal device.

前記動画情報配信システムの一態様として、本出願人は、カラオケ装置を通信端末装置として動画情報配信システムに組み込み、そのカラオケ装置による所定の演奏曲の出力に際して撮像装置により撮影された映像情報及び音声入力装置により入力された音声情報の投稿を受け付けて閲覧可能とする技術を発案し、実用化している。この技術に関して、共通の演奏曲に対応付けられた複数の動画情報を組み合わせて共演動画情報を編集し、前記カラオケ装置のみならずパーソナルコンピュータ等の通信端末装置により閲覧可能とする共演動画（コラボ動画）のサービスを展開している。例えば、特許文献１に記載されたカラオケシステムがその一例である。 As one aspect of the moving picture information distribution system, the applicant of the present application incorporates a karaoke apparatus as a communication terminal apparatus in a moving picture information distribution system, and video information and sound captured by an imaging apparatus when a predetermined performance music is output by the karaoke apparatus. A technology has been devised and put into practical use that makes it possible to receive and browse posts of voice information input by an input device. With regard to this technology, a co-star video (collaboration video) that can be viewed by not only the karaoke device but also a communication terminal device such as a personal computer by editing a plurality of video information associated with a common performance piece. ) Service. For example, the karaoke system described in Patent Document 1 is an example.

特開２０１１−０５９６１９号公報JP 2011-059619 A

しかし、前記従来の技術においては、共演動画情報の組み合わせ候補の検索に関して、必ずしも適切な動画情報を検索できないという不具合があった。すなわち、前記動画情報配信システムにおいて投稿される動画情報は膨大な数であり、例えば曲名又は歌手名等により検索をかけた場合に、共演動画情報の組み合わせ候補として数百もの動画情報が提示されることも少なくない。そのように多数の動画情報の中から、各利用者の音域に合った適切な動画情報を、共演動画情報の組み合わせ候補として検索することは困難であった。このような課題は、動画情報配信システムの機能向上を意図して本発明者が鋭意研究を継続する過程において新たに見出したものである。 However, the conventional technology has a problem in that it is not always possible to search for appropriate moving image information with respect to the search for a combination candidate of co-starring moving image information. That is, there is an enormous number of moving image information posted in the moving image information distribution system. For example, when searching by a song name or a singer name, hundreds of moving image information is presented as combinations of co-starring moving image information. There are many cases. As described above, it has been difficult to search suitable video information suitable for each user's range as a combination candidate of co-star video information from a large number of video information. Such a problem has been newly found in the process in which the present inventor continues eager research in order to improve the function of the moving image information distribution system.

本発明は、以上の事情を背景として為されたものであり、その目的とするところは、共演動画情報の組み合わせ候補の検索に関して、各利用者の音域に合った適切な動画情報の検索を実現する動画情報配信システム及びサーバを提供することにある。 The present invention has been made against the background of the above circumstances, and the purpose of the present invention is to search for suitable moving image information suitable for each user's sound range with respect to searching for a combination candidate of co-starring moving image information. It is to provide a moving image information distribution system and a server.

斯かる目的を達成するために、本第１発明の要旨とするところは、通信回線を介して複数の通信端末装置に対する動画情報の配信を行う動画情報配信システムであって、前記通信回線を介して前記通信端末装置から、予め定められた複数の演奏曲情報の何れかに対応する動画情報の投稿を受け付け、その投稿された動画情報を対応する演奏曲情報と対応付けて動画データベースに記憶させる投稿受付制御手段と、その投稿受付制御手段により受け付けられる、共通の演奏曲情報に対応付けられた複数の動画情報を組み合わせて共演動画情報を編集し、その編集された共演動画情報をその共通の演奏曲情報と対応付けて前記動画データベースに記憶させる共演動画編集制御手段と、音声入力装置から入力された音声情報に基づいて、その音声情報の入力主体である利用者の歌唱音声に係る利用者音域を特定する利用者音域特定制御手段と、前記動画データベースに記憶された複数の動画情報のうちから、その動画情報における音域が、前記利用者音域特定制御手段により特定された前記利用者音域に適合する動画情報を、前記共演動画情報の組み合わせ候補として抽出する動画抽出制御手段とを、備えたことを特徴とするものである。 In order to achieve such an object, the gist of the first invention is a moving image information distribution system for distributing moving image information to a plurality of communication terminal devices via a communication line, the communication line via the communication line. Then, posting of moving picture information corresponding to any of a plurality of pieces of predetermined performance music information is received from the communication terminal device, and the posted moving picture information is stored in the moving picture database in association with the corresponding music performance information. The co-starring video information is edited by combining the post acceptance control means and the plurality of video information associated with the common performance music information accepted by the post acceptance control means, and the edited co-star video information Based on the audio information input from the audio input device and the co-starring video editing control means stored in the video database in association with the performance music information, the audio information Among the plurality of pieces of moving image information stored in the moving image database, the sound range in the moving image information is the use range, the user sound range specifying control means for specifying the user sound range related to the singing voice of the user who is the input subject Moving image extraction control means for extracting moving image information that matches the user sound range specified by the person sound range specifying control means as a combination candidate of the co-starring moving image information is provided.

前記目的を達成するために、本第２発明の要旨とするところは、通信回線を介して複数の通信端末装置に対する動画情報の配信を行うサーバであって、前記通信回線を介して前記通信端末装置から、予め定められた複数の演奏曲情報の何れかに対応する動画情報の投稿を受け付け、その投稿された動画情報を対応する演奏曲情報と対応付けて動画データベースに記憶させる投稿受付制御手段と、その投稿受付制御手段により受け付けられる、共通の演奏曲情報に対応付けられた複数の動画情報を組み合わせて共演動画情報を編集し、その編集された共演動画情報をその共通の演奏曲情報と対応付けて前記動画データベースに記憶させる共演動画編集制御手段と、音声入力装置から入力された音声情報に基づいて、その音声情報の入力主体である利用者の歌唱音声に係る利用者音域を特定する利用者音域特定制御手段と、前記動画データベースに記憶された複数の動画情報のうちから、その動画情報における音域が、前記利用者音域特定制御手段により特定された前記利用者音域に適合する動画情報を、前記共演動画情報の組み合わせ候補として抽出する動画抽出制御手段とを、備えたことを特徴とするものである。 In order to achieve the above object, the gist of the second invention is a server for distributing moving image information to a plurality of communication terminal devices via a communication line, the communication terminal via the communication line. Post accepting control means for accepting posting of moving picture information corresponding to any of a plurality of pieces of predetermined performance music information from the apparatus, and storing the posted moving picture information in association with the corresponding performance music information in the moving picture database And editing the co-star video information by combining a plurality of video information associated with the common musical piece information accepted by the posting acceptance control means, and the edited co-star video information as the common musical piece information. Based on the voice information input from the voice input device and the co-starring video editing control means to be associated and stored in the video database, Among the plurality of moving image information stored in the moving image database, the sound range in the moving image information is a user sound range specifying control unit that specifies the user sound range related to the user's singing voice. And a moving image extraction control unit that extracts moving image information that matches the user's sound range specified as a combination candidate of the co-starring moving image information.

前記第１発明によれば、前記通信回線を介して前記通信端末装置から、予め定められた複数の演奏曲情報の何れかに対応する動画情報の投稿を受け付け、その投稿された動画情報を対応する演奏曲情報と対応付けて動画データベースに記憶させる投稿受付制御手段と、その投稿受付制御手段により受け付けられる、共通の演奏曲情報に対応付けられた複数の動画情報を組み合わせて共演動画情報を編集し、その編集された共演動画情報をその共通の演奏曲情報と対応付けて前記動画データベースに記憶させる共演動画編集制御手段と、音声入力装置から入力された音声情報に基づいて、その音声情報の入力主体である利用者の歌唱音声に係る利用者音域を特定する利用者音域特定制御手段と、前記動画データベースに記憶された複数の動画情報のうちから、その動画情報における音域が、前記利用者音域特定制御手段により特定された前記利用者に対応する利用者音域に適合する動画情報を、前記共演動画情報の組み合わせ候補として抽出する動画抽出制御手段とを、備えたものであることから、前記動画データベースに記憶された多数の動画情報のうちから、各利用者の音域にあった適切な動画情報を、共演動画情報の組み合わせ候補として簡便に抽出することができる。すなわち、共演動画情報の組み合わせ候補の検索に関して、各利用者の音域に合った適切な動画情報の検索を実現する動画情報配信システムを提供することができる。 According to the first aspect of the present invention, the posting of moving image information corresponding to any of a plurality of predetermined pieces of performance music information is received from the communication terminal device via the communication line, and the posted moving image information is handled. The post acceptance control means for storing in the video database in association with the performance music information to be performed, and the plurality of video information associated with the common performance music information accepted by the post acceptance control means are combined to edit the co-star video information And the co-star video editing control means for storing the edited co-star video information in the video database in association with the common performance music information, and based on the voice information input from the voice input device, User range specification control means for specifying the user range related to the singing voice of the user who is the input subject, and a plurality of pieces of video information stored in the video database Among them, moving image information in which the sound range in the moving image information is matched with the user sound range corresponding to the user specified by the user sound range specifying control means is extracted as a combination candidate of the co-starring moving image information Control means, and from among a large number of pieces of moving picture information stored in the moving picture database, suitable moving picture information suitable for each user's range can be easily used as a combination candidate of co-starring moving picture information. Can be extracted. That is, it is possible to provide a moving image information distribution system that realizes a search for appropriate moving image information suitable for the sound range of each user regarding the search for the combination candidate of the co-star moving image information.

前記第１発明において、好適には、前記動画抽出制御手段は、前記動画データベースに記憶された複数の動画情報のうちから、その動画情報における音域が、前記利用者音域特定制御手段により特定された前記利用者に対応する利用者音域に含まれる動画情報を、前記共演動画情報の組み合わせ候補として抽出するものである。このようにすれば、各利用者の歌唱音声に係る音域に適合し、各利用者が危なげなく歌うことができるものと思われる演奏曲に対応する動画情報を、共演動画情報の組み合わせ候補として簡便に抽出することができる。 In the first aspect of the present invention, preferably, the moving image extraction control unit specifies a sound range in the moving image information from the plurality of moving image information stored in the moving image database by the user sound range specifying control unit. The moving image information included in the user sound range corresponding to the user is extracted as a combination candidate of the co-starring moving image information. In this way, the video information corresponding to the performance music that fits the range of each user's singing voice and that each user is able to sing without danger is easily used as a combination candidate of the co-star video information. Can be extracted.

好適には、前記動画抽出制御手段は、前記動画データベースに記憶された複数の動画情報のうちから、その動画情報における音域が、前記利用者音域特定制御手段により特定された前記利用者音域に規定の許容区間を加えた判定区間に含まれる動画情報を、前記共演動画情報の組み合わせ候補として抽出するものである。このようにすれば、各利用者の歌唱音声に対応する音域を基準とする判定区間に適合し、各利用者が歌唱可能であるものと思われる演奏曲に対応する動画情報を、共演動画情報の組み合わせ候補として簡便に抽出することができる。 Preferably, the moving image extraction control means defines a sound range in the moving image information as the user sound range specified by the user sound range specifying control means from among a plurality of pieces of moving image information stored in the moving image database. Is extracted as a combination candidate of the co-starring moving picture information. In this way, the video information corresponding to the performance song that is considered to be sung by each user and that matches the determination range based on the range corresponding to each user's singing voice, Can be easily extracted as a combination candidate.

好適には、前記動画抽出制御手段は、前記動画データベースに記憶された複数の動画情報のうちから、対応する演奏曲情報の最低音から最高音までの音域が、前記利用者音域特定制御手段により特定された前記利用者に対応する利用者音域に適合する動画情報を、前記共演動画情報の組み合わせ候補として抽出するものである。このようにすれば、前記動画データベースに記憶された多数の動画情報のうちから、検索者の歌唱音声に係る利用者音域に適合する演奏曲に対応する動画情報を、共演動画情報の組み合わせ候補として簡便に抽出することができる。 Preferably, the moving image extraction control means is configured such that, from the plurality of moving image information stored in the moving image database, a range from the lowest sound to the highest sound of the corresponding performance music information is determined by the user sound range specifying control means. The moving image information suitable for the user sound range corresponding to the specified user is extracted as a combination candidate of the co-starring moving image information. If it does in this way, animation information corresponding to a performance music suitable for a user's sound range concerning a searcher's singing voice among a lot of animation information memorized by the animation database as a combination candidate of co-starring animation information It can be easily extracted.

好適には、前記投稿受付制御手段は、前記投稿された動画情報を、投稿主体である利用者に対応して前記利用者音域特定制御手段により特定された利用者音域と対応付けて前記動画データベースに記憶させるものであり、前記動画抽出制御手段は、前記動画データベースに記憶された複数の動画情報のうちから、その動画情報に対応付けられた投稿主体である利用者に対応する利用者音域が、前記利用者音域特定制御手段により特定された前記利用者に対応する利用者音域に適合する動画情報を、前記共演動画情報の組み合わせ候補として抽出するものである。このようにすれば、前記動画データベースに記憶された多数の動画情報のうちから、歌唱音声に係る利用者音域が適合する投稿者により投稿された動画情報を、共演動画情報の組み合わせ候補として簡便に抽出することができる。 Preferably, the posting acceptance control means associates the posted moving picture information with the user sound range specified by the user sound range specifying control means corresponding to the user who is the posting subject, and the moving image database. The moving image extraction control means has a user sound range corresponding to a user who is a posting subject associated with the moving image information among a plurality of moving image information stored in the moving image database. The moving image information suitable for the user sound range corresponding to the user specified by the user sound range specifying control means is extracted as a combination candidate of the co-starring moving image information. In this way, the video information posted by the contributor to which the user's range related to the singing voice is suitable among the many pieces of video information stored in the video database can be easily used as a combination candidate of the co-star video information. Can be extracted.

前記第２発明によれば、前記通信回線を介して前記通信端末装置から、予め定められた複数の演奏曲情報の何れかに対応する動画情報の投稿を受け付け、その投稿された動画情報を対応する演奏曲情報と対応付けて動画データベースに記憶させる投稿受付制御手段と、その投稿受付制御手段により受け付けられる、共通の演奏曲情報に対応付けられた複数の動画情報を組み合わせて共演動画情報を編集し、その編集された共演動画情報をその共通の演奏曲情報と対応付けて前記動画データベースに記憶させる共演動画編集制御手段と、音声入力装置から入力された音声情報に基づいて、その音声情報の入力主体である利用者の歌唱音声に係る利用者音域を特定する利用者音域特定制御手段と、前記動画データベースに記憶された複数の動画情報のうちから、その動画情報における音域が、前記利用者音域特定制御手段により特定された前記利用者に対応する利用者音域に適合する動画情報を、前記共演動画情報の組み合わせ候補として抽出する動画抽出制御手段とを、備えたものであることから、前記動画データベースに記憶された多数の動画情報のうちから、各利用者の音域にあった適切な動画情報を、共演動画情報の組み合わせ候補として簡便に抽出することができる。すなわち、共演動画情報の組み合わせ候補の検索に関して、各利用者の音域に合った適切な動画情報の検索を実現するサーバを提供することができる。 According to the second aspect of the present invention, the posting of moving image information corresponding to any of a plurality of predetermined pieces of performance music information is received from the communication terminal device via the communication line, and the posted moving image information is handled. The post acceptance control means for storing in the video database in association with the performance music information to be performed, and the plurality of video information associated with the common performance music information accepted by the post acceptance control means are combined to edit the co-star video information And the co-star video editing control means for storing the edited co-star video information in the video database in association with the common performance music information, and based on the voice information input from the voice input device, User range specification control means for specifying the user range related to the singing voice of the user who is the input subject, and a plurality of pieces of video information stored in the video database Among them, moving image information in which the sound range in the moving image information is matched with the user sound range corresponding to the user specified by the user sound range specifying control means is extracted as a combination candidate of the co-starring moving image information Control means, and from among a large number of pieces of moving picture information stored in the moving picture database, suitable moving picture information suitable for each user's range can be easily used as a combination candidate of co-starring moving picture information. Can be extracted. In other words, it is possible to provide a server that realizes a search for appropriate moving image information suitable for the sound range of each user regarding the search for the combination candidate of the co-starring moving image information.

本発明の好適な実施例である動画情報配信システムの構成を説明する図である。It is a figure explaining the structure of the moving image information delivery system which is a suitable Example of this invention. 図１に示す動画情報配信システムに備えられたカラオケ装置の構成を例示するブロック線図である。It is a block diagram which illustrates the structure of the karaoke apparatus with which the moving image information delivery system shown in FIG. 1 was equipped. 図１に示す動画情報配信システムに備えられたサーバの構成を例示するブロック線図である。It is a block diagram which illustrates the structure of the server with which the moving image information delivery system shown in FIG. 1 was equipped. 図２に示すカラオケ装置のＣＰＵ及び図３に示すサーバのＣＰＵに備えられた制御機能の要部を説明する機能ブロック線図である。It is a functional block diagram explaining the principal part of the control function with which CPU of the karaoke apparatus shown in FIG. 2 and CPU of the server shown in FIG. 3 were equipped. 図２に示すカラオケ装置等の通信端末装置に表示される共演動画の一例を示す図である。It is a figure which shows an example of the co-starring animation displayed on communication terminal devices, such as a karaoke apparatus shown in FIG. 図２に示すカラオケ装置に対応付けられた電子早見本装置のタッチパネルディスプレイに表示される動画情報推薦画面を例示する図である。It is a figure which illustrates the moving image information recommendation screen displayed on the touchscreen display of the electronic quick sample apparatus matched with the karaoke apparatus shown in FIG. 図２に示すカラオケ装置のＣＰＵにより実行される音域判定制御の一例の要部を説明するフローチャートである。It is a flowchart explaining the principal part of an example of the sound range determination control performed by CPU of the karaoke apparatus shown in FIG. 図２に示すカラオケ装置のＣＰＵにより実行される動画投稿制御の一例の要部を説明するフローチャートである。It is a flowchart explaining the principal part of an example of the moving image posting control performed by CPU of the karaoke apparatus shown in FIG. 図３に示すサーバのＣＰＵにより実行される利用者音域特定制御の一例の要部を説明するフローチャートである。It is a flowchart explaining the principal part of an example of the user range specification control performed by CPU of the server shown in FIG. 図３に示すサーバのＣＰＵにより実行される動画抽出制御の一例の要部を説明するフローチャートである。It is a flowchart explaining the principal part of an example of the moving image extraction control performed by CPU of the server shown in FIG. 図３に示すサーバのＣＰＵにより実行される動画情報管理／配信制御の一例の要部を説明するフローチャートである。It is a flowchart explaining the principal part of an example of the moving image information management / distribution control performed by CPU of the server shown in FIG.

以下、本発明の好適な実施例を図面に基づいて詳細に説明する。 Hereinafter, preferred embodiments of the present invention will be described in detail with reference to the drawings.

図１は、本発明が好適に適用される動画情報配信システム１０の構成を説明する図である。この図１に示す動画情報配信システム１０においては、カラオケボックス、スナック、旅館等の店舗１２における複数の個室１４ａ、１４ｂ、１４ｃ、・・・（以下、特に区別しない場合には単に個室１４と称する）にそれぞれ１台乃至は複数台ずつ（図１では１台ずつ）カラオケ装置１６ａ、１６ｂ、１６ｃ、・・・（以下、特に区別しない場合には単にカラオケ装置１６と称する）が設置されている。これら複数のカラオケ装置１６は、ＬＡＮ（local area network）２４及びルータ２２を介して通信回線２０に接続されており、同様にその通信回線２０に接続されたサーバ１８等との相互間でその通信回線２０を介して情報の通信が可能とされている。すなわち、本実施例の動画情報配信システム１０は、前記通信回線２０に接続された複数の前記カラオケ装置１６を備えたカラオケシステム（通信カラオケシステム）でもある。 FIG. 1 is a diagram illustrating a configuration of a moving picture information distribution system 10 to which the present invention is preferably applied. In the moving picture information distribution system 10 shown in FIG. 1, a plurality of private rooms 14a, 14b, 14c,... In a store 12 such as a karaoke box, a snack, an inn, etc. ) Are provided with one or more (one in FIG. 1) karaoke devices 16a, 16b, 16c,... (Hereinafter simply referred to as karaoke devices 16 unless otherwise distinguished). . The plurality of karaoke apparatuses 16 are connected to a communication line 20 via a LAN (local area network) 24 and a router 22, and similarly communicate with each other with a server 18 or the like connected to the communication line 20. Information can be communicated via the line 20. That is, the moving picture information distribution system 10 of the present embodiment is also a karaoke system (communication karaoke system) including a plurality of the karaoke devices 16 connected to the communication line 20.

前記動画情報配信システム１０は、複数の電子早見本装置２８ａ、２８ｂ、２８ｃ、・・・（以下、特に区別しない場合には単に電子早見本装置２８と称する）を備えており、前記カラオケ装置１６の利用に際して、各利用者（グループ）毎に１台乃至数台ずつの前記電子早見本装置２８が貸与され、各個室１４において後述するように前記カラオケ装置１６との間で対応付け処理が行われることで、そのカラオケ装置１６の遠隔操作装置として用いられるようになっている。図１に示すように、前記店舗１２内には前記複数のカラオケ装置１６を相互に接続するＬＡＮ２４が敷設されており、前記電子早見本装置２８からのカラオケ装置１６への入力は、所定のアクセスポイント２６及びＬＡＮ２４を介したＬＡＮ通信等により行われる。 The moving picture information distribution system 10 includes a plurality of electronic quick sample devices 28a, 28b, 28c,... (Hereinafter simply referred to as an electronic quick sample device 28 unless otherwise distinguished). 1 to several of the electronic sample devices 28 are lent to each user (group), and an association process is performed between each individual room 14 and the karaoke device 16 as will be described later. Therefore, it can be used as a remote control device for the karaoke device 16. As shown in FIG. 1, a LAN 24 for connecting the plurality of karaoke devices 16 to each other is laid in the store 12, and an input to the karaoke device 16 from the electronic quick sample device 28 is performed according to a predetermined access. This is performed by LAN communication via the point 26 and the LAN 24.

前記通信回線２０は、例えば公衆電話回線、ＡＤＳＬ回線、或いは光ファイバ回線等から構成されるＷＷＷ（World Wide Web）等のインターネットに接続された広域情報通信網である。前記サーバ１８は、例えば、前記動画情報配信システム１０を管理する情報配信サービス提供会社によって運営されるサーバであり、その動画情報配信システム１０における動画情報をはじめとするデジタルコンテンツ（Digital Contents）の保管や入出力管理の基本的な制御を行うと共に、前記カラオケ装置１６に対して楽曲情報（カラオケデータ）、背景映像情報、曲間情報等の配信を行うセンタ装置として機能する。前記動画情報配信システム１０を利用する各利用者毎の情報を、その利用者の識別情報（ユーザＩＤ）と対応付けて記憶するＳＮＳデータベース１０８（図４を参照）を管理するデータベースサーバとして機能する。なお、本実施例においては、前記動画情報配信システム１０のセンタ装置としての機能及びデータベースサーバとしての機能を兼ね備えた単一の前記サーバ１８を備えた構成について説明するが、それらセンタ装置及びデータベースサーバが個別のサーバとして構成されたものであってもよい。動画情報配信システムに係るサーバと、カラオケシステムに係るサーバとが、個別のサーバとして構成されたものであってもよい。すなわち、後述する前記サーバ１８による各種制御が、複数のサーバにより分散的に実行される等の態様も考えられる。 The communication line 20 is a wide area information communication network connected to the Internet such as a WWW (World Wide Web) configured by, for example, a public telephone line, an ADSL line, or an optical fiber line. The server 18 is a server operated by, for example, an information distribution service provider that manages the video information distribution system 10, and stores digital contents including digital information in the video information distribution system 10. In addition to performing basic control of input / output management, it functions as a center device that distributes music information (karaoke data), background video information, information between songs, and the like to the karaoke device 16. It functions as a database server that manages an SNS database 108 (see FIG. 4) that stores information for each user who uses the video information distribution system 10 in association with identification information (user ID) of the user. . In the present embodiment, the configuration including the single server 18 having both the function as the center device of the moving image information distribution system 10 and the function as the database server will be described. May be configured as individual servers. The server related to the video information distribution system and the server related to the karaoke system may be configured as individual servers. That is, a mode in which various controls by the server 18 described later are executed in a distributed manner by a plurality of servers is also conceivable.

図１に示すように、本実施例の動画情報配信システム１０は、前記通信回線２０を介して複数のシステム乃至通信端末装置との間で相互に情報の送受信が可能とされている。すなわち、通信端末装置としての携帯電話機３０及びタブレット３２等が中継基地局３４を介して前記通信回線２０に接続されており、それら携帯電話機３０及びタブレット３２等と前記サーバ１８等との間で相互に情報の送受信が可能とされている。前記通信回線２０には、好適には、前記携帯電話機３０及びタブレット３２等から前記カラオケ装置１６に対する選曲入力等の情報を中継する所謂プロキシサーバ（proxy server）として図示しないサーバが接続されており、前記携帯電話機３０及びタブレット３２等から前記カラオケ装置１６への情報の入力は、好適には、前記サーバを中継して行われるようになっている。通信端末装置であるパーソナルコンピュータ３６が前記通信回線２０に接続されており、そのパーソナルコンピュータ３６と前記サーバ１８等との間で相互に情報の送受信が可能とされている。後述のように、前記動画情報配信システム１０においては、前記サーバ１８から前記通信回線２０を介して前記カラオケ装置１６に対しての動画情報の配信が行われる。すなわち、本実施例において、前記カラオケ装置１６は、前記携帯電話機３０、タブレット３２、及びパーソナルコンピュータ３６等と同様に通信端末装置として機能する。 As shown in FIG. 1, the moving image information distribution system 10 according to the present embodiment can transmit and receive information to and from a plurality of systems or communication terminal devices via the communication line 20. That is, a mobile phone 30 and a tablet 32 as communication terminal devices are connected to the communication line 20 via a relay base station 34, and the mobile phone 30 and the tablet 32 and the server 18 are mutually connected. It is possible to send and receive information. The communication line 20 is preferably connected to a server (not shown) as a so-called proxy server that relays information such as music selection input to the karaoke apparatus 16 from the mobile phone 30 and the tablet 32. The input of information from the mobile phone 30 and the tablet 32 to the karaoke device 16 is preferably performed via the server. A personal computer 36, which is a communication terminal device, is connected to the communication line 20, and information can be transmitted and received between the personal computer 36 and the server 18 or the like. As will be described later, in the moving image information distribution system 10, the moving image information is distributed from the server 18 to the karaoke device 16 via the communication line 20. That is, in the present embodiment, the karaoke device 16 functions as a communication terminal device in the same manner as the mobile phone 30, tablet 32, personal computer 36, and the like.

図２は、前記カラオケ装置１６の構成を例示するブロック線図である。この図１に示すように、前記カラオケ装置１６は、中央演算処理装置であるＣＰＵ４２と、読出専用メモリであるＲＯＭ４４と、随時書込読出メモリであるＲＡＭ４６と、記憶装置であるハードディスク４８と、グラフィックスチップ（グラフィックスボード）等の映像処理部５０と、サウンドチップ（サウンドボード）等の音声処理部５２と、操作パネル５４と、表示制御部５６及び入力制御部５８を介して前記ＣＰＵ４２に接続されたタッチパネルディスプレイ６０と、ＬＡＮインターフェイス６６と、無線通信部６８と、ビデオ出力端子７０を介して前記映像処理部５０に接続された映像表示装置であるディスプレイ７２と、オーディオ入力端子７４を介して前記音声処理部５２に接続された音声入力装置であるマイクロフォン７６と、オーディオ出力端子７８を介して前記音声処理部５２に接続された音声増幅装置であるアンプ８０と、そのアンプ８０に備えられた音声出力装置であるスピーカ８２と、接続端子８８を介して前記ＣＰＵ４２に接続された撮像装置であるデジタルカメラ８６とを、備えて構成されている。 FIG. 2 is a block diagram illustrating the configuration of the karaoke apparatus 16. As shown in FIG. 1, the karaoke device 16 includes a CPU 42 as a central processing unit, a ROM 44 as a read-only memory, a RAM 46 as a write / read memory as needed, a hard disk 48 as a storage device, and a graphic. Connected to the CPU 42 through a video processing unit 50 such as a chip (graphics board), an audio processing unit 52 such as a sound chip (sound board), an operation panel 54, a display control unit 56 and an input control unit 58. The touch panel display 60, the LAN interface 66, the wireless communication unit 68, the display 72 which is a video display device connected to the video processing unit 50 via the video output terminal 70, and the audio input terminal 74. A microphone 76 that is an audio input device connected to the audio processing unit 52; An amplifier 80 that is an audio amplifying device connected to the audio processing unit 52 via an audio output terminal 78, a speaker 82 that is an audio output device provided in the amplifier 80, and the CPU 42 via a connection terminal 88. A digital camera 86, which is a connected imaging device, is provided.

前記ＣＰＵ４２は、前記ＲＡＭ４６の一時記憶機能を利用しつつ前記ＲＯＭ４４に予め記憶された所定のプログラムに基づいて電子情報を処理・制御する所謂マイクロコンピュータであり、前記カラオケ装置１６における各種制御を実行する。すなわち、前記操作パネル５４、タッチパネルディスプレイ６０、或いは電子早見本装置２８等により所定の楽曲（カラオケ演奏曲）が選曲入力された場合、その選曲入力された楽曲を前記ＲＡＭ４６等に設けられた予約曲テーブルに登録する選曲予約制御、その予約曲テーブルの演奏順に従って前記ハードディスク４８から前記ＲＡＭ４６に選曲されたカラオケ演奏曲の演奏情報及び歌詞情報（楽曲データ）を読み出す楽曲データ読出制御、楽曲の演奏進行に応じてそのＲＡＭ４６から前記音声処理部５２へ演奏情報を送信する演奏出力制御、その演奏出力制御に際して前記ＲＡＭ４６に展開された歌詞情報に基づいて歌詞文字映像を生成して前記映像処理部５０へ送信する歌詞文字映像出力制御、前記演奏出力制御に際して前記映像処理部５０を制御して所定の背景映像を再生させる背景映像出力制御、及びカラオケ演奏が行われていない間すなわち曲間において、新譜情報、選曲ランキング、店舗広告等の曲間情報を出力させる曲間情報出力制御等の基本的な制御を実行する。 The CPU 42 is a so-called microcomputer that processes and controls electronic information based on a predetermined program stored in the ROM 44 using the temporary storage function of the RAM 46, and executes various controls in the karaoke device 16. . That is, when a predetermined music piece (karaoke performance music piece) is selected and input by the operation panel 54, the touch panel display 60, the electronic quick sample device 28, or the like, the selected music piece is reserved music provided in the RAM 46 or the like. Music selection reservation control to be registered in the table, music data read control for reading out the performance information and lyrics information (music data) of the karaoke performance music selected from the hard disk 48 in accordance with the performance order of the reserved music table, music performance reading progress In response to the performance output control for transmitting performance information from the RAM 46 to the sound processing unit 52, the lyrics character image is generated based on the lyric information developed in the RAM 46 during the performance output control, and sent to the video processing unit 50. The lyric character video output control to be transmitted, the video processing in the performance output control Background video output control for controlling 50 to reproduce a predetermined background video, and inter-song information for outputting inter-song information such as new music information, music selection ranking, store advertisement, etc. while karaoke performance is not performed, that is, between songs Basic control such as output control is executed.

前記映像処理部５０は、前記ディスプレイ７２に表示される画面（映像）の描画に係る各種制御を行う。例えば、前記ＣＰＵ４２から供給されるデータに基づいてグラフィックスメモリにそのデータを書き込み、そのデータを読み出すことによって前記ビデオ出力端子７０を介して前記ディスプレイ７２に所定の画面を表示させる制御を行う。具体的には、前記カラオケ装置１６による楽曲の演奏出力（カラオケ演奏）に際して、前記ＣＰＵ４２において生成された歌詞文字映像等の文字映像（テロップ）を出力させたり、前記ハードディスク４８に記憶されたＭＰＥＧ（Moving Picture Experts Group）データ等の背景映像情報に基づいて所定の背景映像を再生（デコード）させたり、その背景映像の前面側に前記歌詞文字映像を合成させて前記ディスプレイ７２に表示させたり、その歌詞文字映像を前記楽曲の演奏進行に応じて順次色替わり表示させる等の各種表示制御を行う。なお、本実施例においては、前記映像処理部５０により前記ディスプレイ７２の表示制御を行う一方、後述する表示制御部５６により前記タッチパネルディスプレイ６０（表示装置６２）の表示制御を行う態様について説明するが、前記映像処理部５０により前記タッチパネルディスプレイ６０の表示制御をも行う態様も考えられる。この態様において、前記表示制御部５６は必ずしも設けられなくともよい。 The video processing unit 50 performs various controls related to drawing of a screen (video) displayed on the display 72. For example, based on the data supplied from the CPU 42, the data is written into the graphics memory, and the data is read out, thereby controlling the display 72 to display a predetermined screen via the video output terminal 70. Specifically, at the time of musical performance output (karaoke performance) by the karaoke device 16, a text image (telop) such as a lyric character image generated by the CPU 42 is output, or MPEG (stored in the hard disk 48). Based on the background video information such as Moving Picture Experts Group) data, a predetermined background video is reproduced (decoded), the lyrics character video is synthesized on the front side of the background video and displayed on the display 72, Various display controls are performed such as displaying lyric text images in a color-changed manner as the music progresses. In the present embodiment, a mode in which display control of the display 72 is performed by the video processing unit 50 and display control of the touch panel display 60 (display device 62) is performed by the display control unit 56 described later will be described. A mode in which display control of the touch panel display 60 is also performed by the video processing unit 50 is also conceivable. In this aspect, the display control unit 56 is not necessarily provided.

前記音声処理部５２は、ＦＭ音源やＰＣＭ音源等の各種音源を備え、前記カラオケ装置１６による音声出力に係る各種制御を行う。好適には、電子回路により音を合成し、各種音色を発生するシンセサイザ（synthesizer）を備えている。このシンセサイザは、前記ハードディスク４８から読み出されて送られて来るカラオケ演奏曲の演奏情報に基づいて楽器の演奏信号等の音楽信号を生成する。前記シンセサイザは、好適には、ＭＩＤＩ（Musical Instrument Digital Interface）端子を備えたものであり、前記演奏情報は、例えばＭＩＤＩ形式のデータである。そのＭＩＤＩデータに基づいて前記シンセサイザにより生成された音楽信号は、前記マイクロフォン７６から前記オーディオ入力端子７４を介して入力される利用者（演奏者）の歌声とミキシングされ、前記オーディオ出力端子７８を介して前記アンプ８０に供給されてそのアンプ８０により増幅されて前記スピーカ８２から出力される。 The sound processing unit 52 includes various sound sources such as an FM sound source and a PCM sound source, and performs various controls related to sound output by the karaoke apparatus 16. Preferably, a synthesizer that synthesizes sound by an electronic circuit and generates various timbres is provided. This synthesizer generates a music signal such as a musical instrument performance signal based on performance information of a karaoke performance music read from the hard disk 48 and sent. The synthesizer preferably includes a MIDI (Musical Instrument Digital Interface) terminal, and the performance information is, for example, data in the MIDI format. The music signal generated by the synthesizer based on the MIDI data is mixed with the singing voice of the user (performer) input from the microphone 76 via the audio input terminal 74, and via the audio output terminal 78. Is supplied to the amplifier 80, amplified by the amplifier 80, and output from the speaker 82.

前記操作パネル５４は、前記カラオケ装置１６の利用者が歌いたい楽曲を選択したり、楽曲の演奏出力に係る音程を調整したり、演奏と歌との音量バランスを調整したり、その他、エコー、音量、トーン等の各種調整を行うための操作ボタン（スイッチ）或いはつまみを備えた入力装置である。前記タッチパネルディスプレイ６０は、画像（映像）を表示させると共に利用者の操作に応じて前記カラオケ装置１６への操作入力を行う装置であり、そのタッチパネルディスプレイ６０に所定の画像（映像）を表示させる表示装置６２と、利用者の指や図示しない備え付けのペン等による前記タッチパネルディスプレイ６０への接触により入力を行うタッチパネル６４とを、備えている。前記表示制御部５６は、前記ＣＰＵ４２から供給される情報に基づいて前記表示装置６２に表示される画面（映像）の描画を制御する映像処理部である。前記入力制御部５８は、前記タッチパネル６４により入力される操作入力情報を前記ＣＰＵ４２等に供給する入力処理部である。以上の構成を備えていることで、前記タッチパネルディスプレイ６０は、前記ディスプレイ７２とは別に第２の映像表示装置として機能すると共に、前記カラオケ装置１６の利用者が歌いたい楽曲を選択したり、楽曲の演奏出力に係る音程を調整したり、演奏と歌との音量バランスを調整したり、その他、エコー、音量、トーン等の各種調整を行うための入力装置として機能する。 The operation panel 54 selects a song that the user of the karaoke apparatus 16 wants to sing, adjusts the pitch related to the performance output of the song, adjusts the volume balance between the performance and the song, The input device includes operation buttons (switches) or knobs for performing various adjustments such as volume and tone. The touch panel display 60 is a device that displays an image (video) and performs an operation input to the karaoke device 16 according to a user's operation. The touch panel display 60 displays a predetermined image (video) on the touch panel display 60. A device 62 and a touch panel 64 for performing input by touching the touch panel display 60 with a user's finger, an attached pen (not shown), or the like are provided. The display control unit 56 is a video processing unit that controls drawing of a screen (video) displayed on the display device 62 based on information supplied from the CPU 42. The input control unit 58 is an input processing unit that supplies operation input information input from the touch panel 64 to the CPU 42 and the like. With the above configuration, the touch panel display 60 functions as a second video display device separately from the display 72, and the user of the karaoke device 16 selects a song that the user wants to sing, It functions as an input device for adjusting the pitch related to the performance output, adjusting the volume balance between the performance and the song, and performing various adjustments such as echo, volume and tone.

前記ＬＡＮインターフェイス６６は、前記カラオケ装置１６をＬＡＮ接続端子８４を介して前記ＬＡＮ２４に接続するための接続器であり、そのように前記ＬＡＮ２４に接続されることで、前記カラオケ装置１６は、同様に前記ＬＡＮ２４に接続された前記電子早見本装置２８等の他の機器との間で情報の送受信が可能とされる。前記カラオケ装置１６が設置される店舗等に複数台のカラオケ装置が備えられている場合において、同様に前記ＬＡＮ２４に接続されたカラオケ装置相互間において情報の送受信が可能とされる。例えば、前記ＬＡＮ２４に接続されたアクセスポイント２６を介して受信される電子早見本装置２８からの選曲入力を受け付けて前記ＲＡＭ４６に設けられた予約曲テーブルに記憶したり、そのアクセスポイント２６を介して前記カラオケ装置１６から電子早見本装置２８へ所定の情報を送信したりというように、電波を介して前記カラオケ装置１６と電子早見本装置２８との間における相互の情報のやりとりが実行される。 The LAN interface 66 is a connector for connecting the karaoke device 16 to the LAN 24 via a LAN connection terminal 84. By being connected to the LAN 24 in this way, the karaoke device 16 is similarly connected to the LAN 24. Information can be transmitted to and received from other devices such as the electronic sample device 28 connected to the LAN 24. When a plurality of karaoke devices are provided in a store or the like where the karaoke device 16 is installed, information can be transmitted and received between the karaoke devices connected to the LAN 24 in the same manner. For example, the music selection input from the electronic sample device 28 received via the access point 26 connected to the LAN 24 is accepted and stored in the reserved music table provided in the RAM 46, or via the access point 26. Mutual information exchange is performed between the karaoke device 16 and the electronic quick sample device 28 via radio waves, such as transmitting predetermined information from the karaoke device 16 to the electronic quick sample device 28.

前記カラオケ装置１６は、図１に示すように、前記ＬＡＮ２４及びルータ２２等を介して前記通信回線２０に接続されており、同様にその通信回線２０に接続された他の機器との相互間でその通信回線２０を介して情報の通信が可能とされている。好適には、前記通信回線２０を介して前記サーバ１８に接続されており、そのサーバ１８から楽曲情報（カラオケデータ）、背景映像情報、曲間情報、及び動画情報等のデジタルコンテンツ（Digital Contents）の配信を受け付けるものである。すなわち、前記カラオケ装置１６は、好適には、所定の通信回線に接続されてサーバとの間で各種情報の送受信を行う通信カラオケ装置であるが、斯かる通信回線に接続されない非通信型のカラオケ装置等にも本発明は好適に適用される。 As shown in FIG. 1, the karaoke apparatus 16 is connected to the communication line 20 via the LAN 24, the router 22, and the like. Similarly, between the karaoke apparatus 16 and other devices connected to the communication line 20 Information can be communicated via the communication line 20. Preferably, it is connected to the server 18 via the communication line 20, and digital contents (Digital Contents) such as music information (karaoke data), background video information, inter-song information, and moving picture information are transmitted from the server 18. Is accepted. That is, the karaoke device 16 is preferably a communication karaoke device that is connected to a predetermined communication line and transmits / receives various information to / from the server, but is not connected to such a communication line. The present invention is also suitably applied to devices and the like.

前記無線通信部６８は、前記カラオケ装置１６と前記電子早見本装置２８等の入力装置との間の無線通信を行う。例えば、前記電子早見本装置２８等の入力装置から送信されるリモコン信号を受信するリモコン受信部として機能する。前記カラオケ装置１６と電子早見本装置２８との対応付け（くくりつけ）処理は、好適には、斯かるリモコン信号（赤外線信号）により前記無線通信部６８を介して行われる。すなわち、前記電子早見本装置２８は、それぞれ個別のシリアル番号を有しており、前記対応付け処理においては、例えばそのシリアル番号（例えば、下４桁）及び所定の接続コードを含む信号が接続通知として前記カラオケ装置１６へ送信され、前記無線通信部６８によりその接続信号を受信したカラオケ装置１６に対して前記電子早見本装置２８が対応付けられる。そのようにして前記カラオケ装置１６に対応付けられた電子早見本装置２８は、そのカラオケ装置１６の入力装置（遠隔操作装置）として機能し、その電子早見本装置２８から送信される信号が前記ＣＰＵ４２に供給されることで、前記カラオケ装置１６の利用者が歌いたい楽曲を選択したり、楽曲の演奏出力に係る音程を調整したり、演奏と歌との音量バランスを調整したり、その他、エコー、音量、トーン等の各種調整を行うための入力が受け付けられるようになっている。なお、前記対応付け処理が行われた後、前記電子早見本装置２８と前記カラオケ装置１６との間の通信は、前記ＬＡＮ２４及びアクセスポイント２６等を介したＬＡＮ通信により行われる。本実施例においては、前記カラオケ装置１６に対応付け処理の行われた電子早見本装置２８等の入力装置もそのカラオケ装置１６の一部を構成するものであるとして以下の説明を行う。 The wireless communication unit 68 performs wireless communication between the karaoke device 16 and an input device such as the electronic quick sample device 28. For example, it functions as a remote control receiving unit that receives a remote control signal transmitted from an input device such as the electronic sample device 28. The association (sticking) processing between the karaoke device 16 and the electronic quick sample device 28 is preferably performed via the wireless communication unit 68 by such a remote control signal (infrared signal). In other words, the electronic sample device 28 has an individual serial number, and in the association process, for example, a signal including the serial number (for example, the last four digits) and a predetermined connection code is notified of the connection. The electronic quick sample device 28 is associated with the karaoke device 16 that is transmitted to the karaoke device 16 and receives the connection signal by the wireless communication unit 68. The electronic quick sample device 28 thus associated with the karaoke device 16 functions as an input device (remote control device) of the karaoke device 16, and a signal transmitted from the electronic quick sample device 28 is the CPU 42. The user of the karaoke device 16 selects a song that the user wants to sing, adjusts the pitch related to the performance output of the song, adjusts the volume balance between the performance and the song, and so on. Input for making various adjustments such as volume, tone, etc. is accepted. After the association processing is performed, communication between the electronic sample device 28 and the karaoke device 16 is performed by LAN communication via the LAN 24, the access point 26, and the like. In the present embodiment, the following explanation will be given on the assumption that the input device such as the electronic quick sample device 28 subjected to the associating process with the karaoke device 16 also constitutes a part of the karaoke device 16.

前記デジタルカメラ８６は、例えばＣＣＤ（charge coupled device）等の撮像素子及びレンズを備え、そのレンズから入射される映像を撮像素子により検知し、その映像を電子情報（映像データ）として取得する所謂デジタルビデオカメラであり、少なくとも動画（時間の経過に従い変化する動きのある映像）を撮影し得るものであるが、必要に応じて静止画（スチル写真）を撮影できるように構成されたものであってもよい。このデジタルカメラ８６により撮影された映像情報は、前記接続端子８８等のインターフェイスを介して前記ＣＰＵ４２等へ供給され、例えばＡＶＩ（Audio-Video Interleaved）形式、ＭＰＥＧ（Moving Picture Experts Group）形式、ＦＬＶ（Flash Video）形式等の映像ファイルとして前記ＲＡＭ４６等に記憶される。前記デジタルカメラ８６は、必ずしも前記カラオケ装置１６の一部として備えられたものでなくともよく、例えば前記カラオケ装置１６が設置された室における所定位置に固設された別体のビデオカメラ乃至前記携帯電話機３０等に備えられた撮像装置等により撮影された映像が所定のインターフェイスを介して前記カラオケ装置１６に入力される態様も考えられる。 The digital camera 86 includes, for example, an image sensor such as a CCD (charge coupled device) and a lens, detects an image incident from the lens by the image sensor, and acquires the image as electronic information (video data). A video camera that can shoot at least moving images (moving images that change over time), but is configured to shoot still images (still photos) as needed. Also good. The video information photographed by the digital camera 86 is supplied to the CPU 42 and the like via the interface such as the connection terminal 88, and for example, AVI (Audio-Video Interleaved) format, MPEG (Moving Picture Experts Group) format, FLV (FLV) Flash video) is stored in the RAM 46 or the like as a video file. The digital camera 86 does not necessarily have to be provided as a part of the karaoke device 16. For example, the digital camera 86 is a separate video camera or mobile phone fixed at a predetermined position in a room where the karaoke device 16 is installed. A mode in which an image taken by an imaging device provided in the telephone 30 or the like is input to the karaoke device 16 via a predetermined interface is also conceivable.

図３は、前記サーバ１８の構成を説明する図である。この図３に示すように、前記サーバ１８は、中央演算処理装置であるＣＰＵ９０、読出専用メモリであるＲＯＭ９２、及び随時書込読出メモリであるＲＡＭ９４を備え、前記ＣＰＵ９０によりＲＡＭ９４の一時記憶機能を利用しつつＲＯＭ９２に予め記憶されたプログラムに従って信号処理を行う所謂コンピュータである。ＴＦＴやＰＤＰ等の映像表示装置９６と、その映像表示装置９６による映像の表示を制御するためのビデオボード（グラフィックスチップ）等の映像処理部９８と、キーボード等の入力装置１００と、その入力装置１００による入力を処理するためのインターフェイス１０２と、前記ＣＰＵ９０等を前記通信回線２０に接続するためのモデム１０４とを、備えて構成されている。 FIG. 3 is a diagram for explaining the configuration of the server 18. As shown in FIG. 3, the server 18 includes a CPU 90 that is a central processing unit, a ROM 92 that is a read-only memory, and a RAM 94 that is a write / read memory as needed. The CPU 90 uses the temporary storage function of the RAM 94. However, it is a so-called computer that performs signal processing according to a program stored in the ROM 92 in advance. An image display device 96 such as TFT or PDP, an image processing unit 98 such as a video board (graphics chip) for controlling display of images by the image display device 96, an input device 100 such as a keyboard, and its input An interface 102 for processing input by the apparatus 100 and a modem 104 for connecting the CPU 90 and the like to the communication line 20 are provided.

図３に示すように、前記サーバ１８は、楽曲データベース１０６及びＳＮＳデータベース１０８をはじめとする各種データベースを備えている。この楽曲データベース１０６は、前記動画情報配信システム１０における前記カラオケ装置１６等に配信するための多数の楽曲情報（カラオケデータ）を記憶するものであり、新しく作成された楽曲情報はこの楽曲データベース１０６に蓄積される。そして、所定の配信制御プログラムにより定期的に、或いは前記カラオケ装置１６等からの配信要求に応じて、随時新たな楽曲情報が前記サーバ１８の楽曲データベース１０６から前記通信回線２０を介して配信され、前記カラオケ装置１６等に配信される。前記カラオケ装置１６に配信された楽曲情報が、そのカラオケ装置１６に備えられた前記ハードディスク４８等に蓄積されることで、そのハードディスク４８等に前記楽曲データベース１０６と同様のデータベースが形成されるものであってもよい。 As shown in FIG. 3, the server 18 includes various databases including a music database 106 and an SNS database 108. The music database 106 stores a large number of music information (karaoke data) to be distributed to the karaoke device 16 or the like in the moving image information distribution system 10, and newly created music information is stored in the music database 106. Accumulated. Then, new music information is distributed from the music database 106 of the server 18 via the communication line 20 periodically by a predetermined distribution control program or in response to a distribution request from the karaoke device 16 or the like. It is distributed to the karaoke device 16 or the like. The music information distributed to the karaoke device 16 is stored in the hard disk 48 or the like provided in the karaoke device 16, thereby forming a database similar to the music database 106 on the hard disk 48 or the like. There may be.

前記楽曲データベース１０６は、前記カラオケ装置１６により出力可能な楽曲にそれぞれ対応する多数（例えば、数万曲分）の楽曲情報（カラオケデータ）を記憶する。この楽曲情報は、前記音声処理部５２により所定の楽器の演奏音を生成するための演奏情報と、歌詞文字映像（歌詞テロップ）を生成するための歌詞情報と、その歌詞情報に基づいて生成された歌詞文字映像を演奏の進行に合わせて順次色替わりさせてゆくための歌詞色替情報とを、含むものであり、コンテンツＩＤである各楽曲に固有の選曲番号により識別される。前記楽曲情報には、属性情報として、その楽曲の曲名、アーティスト名（歌手名）、発表年月日、曲の長さ（演奏時間）、ジャンル、テンポ、及び曲調等の情報が、例えばメタデータ（Ｍｅｔａ情報）等に記憶されている。このジャンルとは、Ｊ−ｐｏｐ（ポップス）、ロックンロール、Ｒ＆Ｂ、テクノ、レゲエ、演歌、軍歌、アニメソング、邦楽、洋楽等の演奏曲の分類や、ドラマ、夏、懐メロ、恋愛、自然、酒、海、川等の演奏曲の曲調を表すキーワード等であり、各楽曲情報はそれら複数のジャンルのうち少なくとも１つのジャンルに属するものである。前記楽曲データベース１０６には、好適には、ＭＩＤＩ形式のデータを前記演奏情報として含むスタンダードタイプの楽曲情報と、例えばＭＰＥＧ形式等のデータを前記演奏情報として含む生演奏タイプの楽曲情報とが記憶される。これらスタンダードタイプの楽曲情報及び生演奏タイプの楽曲情報は、同一の演奏曲に対応する別個の楽曲情報（それぞれ異なる選曲番号に対応）として前記楽曲データベース１０６に記憶される。 The music database 106 stores a large number (for example, tens of thousands of songs) of music information (karaoke data) corresponding to the music that can be output by the karaoke apparatus 16. The music information is generated based on performance information for generating a performance sound of a predetermined musical instrument, lyrics information for generating a lyric character image (lyric telop), and the lyrics information. Lyrics color change information for sequentially changing the color of the lyric character video in accordance with the progress of the performance, and is identified by a music selection number unique to each music piece as the content ID. The music information includes, as attribute information, information such as the song title, artist name (singer name), date of announcement, song length (performance time), genre, tempo, and tone, for example, metadata. (Meta information) and the like. This genre includes the classification of performance songs such as J-pop (pops), rock and roll, R & B, techno, reggae, enka, military song, anime song, Japanese music, Western music, drama, summer, melody, love, nature, sake , A keyword representing the tone of a musical composition such as sea, river, etc., and each piece of music information belongs to at least one genre among the plurality of genres. The music database 106 preferably stores standard type music information including MIDI format data as the performance information and live performance type music information including data such as MPEG format as the performance information. The The standard type music information and the live performance type music information are stored in the music database 106 as separate music information (corresponding to different music selection numbers) corresponding to the same performance music.

前記ＳＮＳデータベース１０８は、前記動画情報配信システム１０の利用者を対象とするソーシャルネットワークサービス（Social Network Service）に係る各種情報を記憶する。このソーシャルネットワークサービスとは、例えば、予め会員登録された会員相互間に限定して情報の閲覧等のサービスを提供する会員制のコミュニティ型のウェブサイトをいう。以下の説明において、ソーシャルネットワークサービスをＳＮＳと略いう。前記ＳＮＳデータベース１０８は、前記動画情報配信システム１０を利用する各利用者毎の、前記カラオケ装置１６を用いたカラオケ演奏に関する情報を、その利用者の識別情報（ユーザＩＤ）と対応付けて記憶する記憶装置である。このＳＮＳデータベース１０８には、ユーザＩＤにより識別される各利用者毎に、例えばその利用者の名前（ニックネーム）、生年月日、実際の年齢、性別、メールアドレス、利用者の住所又は居所に対応する地域、血液型、星座、ＳＮＳへのログイン認証に用いられるパスワード、パスワードを忘れたときのための質問及び解答、アバタ（ネット上において利用者を象徴する人型映像）に関する情報、及び利用者の歌年齢等の情報がその利用者のユーザＩＤと対応付けられて記憶される。前記歌年齢とは、利用者の楽曲の好みの傾向がどの程度の年代（何歳）に相当するものかを示す仮想的な年齢情報であり、対象となる利用者が前記カラオケ装置１６において過去に選曲（演奏）した楽曲に基づいて判断される値である。 The SNS database 108 stores various types of information related to a social network service targeted at the user of the moving picture information distribution system 10. This social network service refers to, for example, a member-based community-type website that provides services such as information browsing only between members who are registered in advance. In the following description, the social network service is abbreviated as SNS. The SNS database 108 stores information related to karaoke performance using the karaoke device 16 for each user who uses the moving picture information distribution system 10 in association with identification information (user ID) of the user. It is a storage device. This SNS database 108 corresponds to each user identified by the user ID, for example, the user's name (nickname), date of birth, actual age, gender, e-mail address, user address or residence. Area, blood type, constellation, password used for login authentication to SNS, questions and answers for forgotten passwords, information on avatars (humanoid images that symbolize users on the net), and users Is stored in association with the user ID of the user. The song age is virtual age information indicating what age (how many years) the user's preference for music is equivalent to, and the target user can use the past in the karaoke device 16. This value is determined based on the music selected (performed).

前記ＳＮＳデータベース１０８には、各利用者の前記カラオケ装置１６を用いたカラオケ演奏に関する情報として、例えばその利用者が過去に利用したカラオケ装置１６に対応する店舗１２（そのカラオケ装置１６が設置された店舗１２）に関する情報である来店履歴、その利用者が前記カラオケ装置１６によるカラオケ演奏において十八番曲或いはお気に入りとして登録した楽曲（簡易な操作により選曲入力を行い得るように設定された楽曲）に関する情報、その利用者が過去に前記カラオケ装置１６によるカラオケ演奏において選曲した選曲履歴（カラオケ装置１６において過去に選曲された楽曲の履歴）に関する情報、その利用者が前記カラオケ装置１６によるカラオケ演奏において過去に行った演奏評価の評価結果に関する情報、及びその利用者がフレンドとして登録した他の利用者に関する情報等が各利用者毎にその利用者のユーザＩＤと対応付けられて記憶される。前記フレンドとは、本実施例のＳＮＳにおいて、フレンドではない不特定多数の利用者とは一線を画す関係であることを示す身分であり、例えば、前記ＳＮＳデータベース１０８に記憶された所定の利用者に対応する情報のうち、公開レベルが「友達」とされた情報に関しては、その利用者のフレンドとして登録されている利用者は閲覧できるが、フレンドとして登録されていない利用者は閲覧できない。このフレンド登録に関する情報は、各利用者に関連する情報として前記ＳＮＳデータベース１０８に記憶されるものであってもよいし、例えばフレンド登録された利用者それぞれのユーザＩＤを別途登録するというように前記ＳＮＳデータベース１０８とは別のデータベースに記憶されるものであってもよい。好適には、フレンド登録の申し込みがあった場合に、一律にフレンド登録を行うのではなく、申し込まれた利用者の許可を待ってその申し込みに係るフレンド登録を行う。 In the SNS database 108, as information on karaoke performance using the karaoke device 16 of each user, for example, the store 12 corresponding to the karaoke device 16 used by the user in the past (the karaoke device 16 is installed). The store visit history, which is information related to the store 12), the information related to the eighteenth song or the favorite song registered by the user in the karaoke performance by the karaoke device 16 (the song set so that the song selection input can be performed by a simple operation), Information related to the music selection history (the history of songs previously selected in the karaoke device 16) that the user has selected in the karaoke performance by the karaoke device 16 in the past, the user has performed in the past in the karaoke performance by the karaoke device 16 Information on evaluation results of performance evaluation The user information concerning other users registered as friends are stored in association with the user ID of the user for each user. In the SNS of the present embodiment, the friend is an identity indicating that it has a relationship that stands out from a large number of unspecified users who are not friends. For example, the friend is a predetermined user stored in the SNS database 108. Among the information corresponding to, information whose public level is “friend” can be viewed by a user registered as a friend of the user, but cannot be viewed by a user who is not registered as a friend. The information related to friend registration may be stored in the SNS database 108 as information related to each user. For example, the user ID of each user registered as a friend is registered separately. It may be stored in a database different from the SNS database 108. Preferably, when there is an application for friend registration, the friend registration is not performed uniformly, but the friend registration related to the application is performed after waiting for the permission of the applied user.

前記ＳＮＳデータベース１０８には、前記動画情報配信システム１０の各利用者から投稿された動画情報が、その利用者のユーザＩＤと対応付けられて記憶される。例えば、前記カラオケ装置１６に備えられた図示しない撮像装置や、携帯型端末装置である前記携帯電話機３０に備えられたデジタルカメラ１４４等により撮影された動画情報（撮像データ）が、その動画情報に対応する演奏曲情報及び投稿者である利用者の識別情報（好適にはユーザＩＤ）と対応付けられて記憶される。ここで、前記演奏曲情報とは、前記動画情報に対応する演奏曲を識別するための情報であり、好適には、前記カラオケ装置１６により出力可能なその演奏曲に対応する楽曲情報の選曲番号である。すなわち、本実施例においては、前記ＳＮＳデータベース１０８が、各利用者から投稿された動画情報を対応する演奏曲情報と対応付けて記憶する動画データベースに相当する。本実施例においては、前記カラオケ装置１６を用いたカラオケ演奏に係る選曲履歴等の各種情報と、各利用者から投稿された動画情報とが、単一のデータベースである前記ＳＮＳデータベース１０８に一元的に記憶される態様について説明するが、前記カラオケ装置１６を用いたカラオケ演奏に係る選曲履歴等の各種情報を記憶するデータベースと、各利用者から投稿された動画情報を記憶する動画データベースとが、それぞれ個別のデータベースとして備えられたものであってもよい。本実施例においては、前記ＳＮＳに係るデータベースサーバとしての機能と動画情報配信サーバとしての機能を兼ね備えた単一の前記サーバ１８を備えた構成について説明するが、それらの機能が複数のサーバにより分散的に実行される等の態様も考えられる。 The SNS database 108 stores moving image information posted by each user of the moving image information distribution system 10 in association with the user ID of the user. For example, moving image information (imaging data) captured by an imaging device (not shown) provided in the karaoke device 16 or a digital camera 144 provided in the mobile phone 30 which is a portable terminal device is included in the moving image information. Corresponding musical performance information and identification information (preferably user ID) of the user who is the poster are stored in association with each other. Here, the performance music information is information for identifying a performance music corresponding to the moving image information, and preferably a music selection number of music information corresponding to the performance music that can be output by the karaoke device 16. It is. In other words, in the present embodiment, the SNS database 108 corresponds to a moving image database that stores moving image information posted by each user in association with corresponding performance music information. In this embodiment, various kinds of information such as a music selection history related to karaoke performance using the karaoke device 16 and moving picture information posted from each user are unified in the SNS database 108 which is a single database. A database for storing various information such as a music selection history related to karaoke performance using the karaoke device 16 and a video database for storing video information posted from each user are described below. Each may be provided as a separate database. In the present embodiment, a description will be given of a configuration including a single server 18 having both a function as a database server related to the SNS and a function as a moving image information distribution server. However, these functions are distributed by a plurality of servers. It is also conceivable to execute the system automatically.

図４は、前記カラオケ装置１６のＣＰＵ４２及び前記サーバ１８のＣＰＵ９０に備えられた制御機能の要部を説明する機能ブロック線図である。この図４に示すカラオケ演奏制御手段１１０、音域判定制御手段１１２、及び動画投稿制御手段１１８は、好適には、前記カラオケ装置１６のＣＰＵ４２に機能的に備えられたものである。音域記憶制御手段１１４、利用者音域特定手段１１６、投稿受付制御手段１２０、動画配信制御手段１２２、共演動画編集制御手段１２４、動画音域判定手段１２６、及び動画抽出制御手段１２８は、好適には、前記サーバ１８のＣＰＵ９０に機能的に備えられたものである。すなわち、本実施例においては、通信端末装置としての前記カラオケ装置１６に対する動画情報の配信等を行う態様について説明するが、図４に示す各制御手段は、前記動画情報配信システム１０の形態に応じて別の装置に備えられたものであってもよい。例えば、前記音域判定制御手段１１２が前記サーバ１８のＣＰＵ９０或いは前記電子早見本装置２８のＣＰＵに機能的に備えられたものであってもよい。前記動画抽出制御手段１２８が前記カラオケ装置１６のＣＰＵ４２に備えられたものであってもよい。 FIG. 4 is a functional block diagram illustrating a main part of control functions provided in the CPU 42 of the karaoke apparatus 16 and the CPU 90 of the server 18. The karaoke performance control means 110, the sound range determination control means 112, and the moving image posting control means 118 shown in FIG. 4 are preferably functionally provided in the CPU 42 of the karaoke apparatus 16. The sound range storage control means 114, the user sound range specifying means 116, the posting reception control means 120, the moving image distribution control means 122, the co-starring moving image editing control means 124, the moving image sound range determination means 126, and the moving image extraction control means 128 are preferably The CPU 90 of the server 18 is functionally provided. That is, in the present embodiment, an aspect of distributing moving image information to the karaoke device 16 as a communication terminal device will be described. However, each control unit shown in FIG. 4 depends on the form of the moving image information distribution system 10. It may be provided in another device. For example, the sound range determination control means 112 may be functionally provided in the CPU 90 of the server 18 or the CPU of the electronic quick sample device 28. The moving image extraction control means 128 may be provided in the CPU 42 of the karaoke apparatus 16.

前記カラオケ演奏制御手段１１０は、前記カラオケ装置１６によるカラオケ演奏に際しての演奏曲の出力を制御する。前記電子早見本装置２８等の入力装置により前記カラオケ装置１６に対する選曲入力操作が行われると、選曲入力された演奏曲の識別情報である選曲番号が前記ＲＡＭ４６等の予約曲テーブルに記憶される。好適には、前記電子早見本装置２８においては、選曲主体（操作主体）である利用者を切り替えて選曲入力操作を行うことができるようになっており、前記電子早見本装置２８により前記カラオケ装置１６に対する選曲入力操作が行われると、選曲入力された演奏曲の選曲番号が、その演奏曲の選曲主体である利用者のユーザＩＤと対応付けられて前記ＲＡＭ４６等の予約曲テーブルに記憶される。前記カラオケ演奏制御手段１１０は、前記ＲＡＭ４６等の予約曲テーブルにおける上位の予約曲から順に（すなわち入力順に）、その予約曲テーブルに記憶された予約曲の選曲番号に対応する楽曲情報を前記ハードディスク４８の楽曲データベースから読み出し、その楽曲情報に含まれる演奏情報に基づいて演奏曲の出力を制御する。例えば、演奏情報としてのＭＩＤＩデータに基づいて、前記音声処理部５２によりそのＭＩＤＩデータにおける楽譜情報としてのトラック乃至チャンネルに対応する楽器の演奏音（音楽情報）を出力させ、前記アンプ８０等を介して前記スピーカ８２から出力させる。斯かる処理と併行して、楽曲情報に含まれる歌詞情報に基づいて演奏曲に係る歌詞文字映像の出力を制御する。すなわち、歌詞情報に基づいて歌詞文字映像を生成し、前記映像処理部５０を介してその歌詞文字映像を前記ディスプレイ７２に表示させる。また、カラオケ演奏の進行に伴い、その歌詞文字映像を切替表示させると共に、歌詞色替情報に基づいて歌詞文字映像を順次色替え表示させる。 The karaoke performance control means 110 controls the output of a performance tune when the karaoke device 16 performs the karaoke performance. When a music selection input operation is performed on the karaoke device 16 by an input device such as the electronic quick sample device 28, a music selection number, which is identification information of the performance music input by the music selection, is stored in the reserved music table such as the RAM 46. Preferably, in the electronic quick sample device 28, the user who is the music selection main body (operation main body) can be switched and a music selection input operation can be performed. When the music selection input operation is performed on the music composition number 16, the music selection number of the performance music inputted and selected is stored in the reserved music table such as the RAM 46 in association with the user ID of the user who is the music selection subject of the performance music. . The karaoke performance control means 110 stores music information corresponding to the music selection number of the reserved music stored in the reserved music table in order from the upper reserved music in the reserved music table such as the RAM 46 (that is, in the input order). The output of the performance music is controlled based on the performance information included in the music information. For example, based on the MIDI data as performance information, the sound processing unit 52 outputs the performance sound (music information) of the musical instrument corresponding to the track or channel as the score information in the MIDI data, via the amplifier 80 or the like. Output from the speaker 82. In parallel with such processing, the output of the lyric character image related to the performance music is controlled based on the lyric information included in the music information. That is, a lyric character image is generated based on the lyric information, and the lyric character image is displayed on the display 72 via the image processing unit 50. Further, as the karaoke performance progresses, the lyric character video is switched and displayed, and the lyric character video is sequentially color-changed and displayed based on the lyric color change information.

前記音域判定制御手段１１２は、音声入力装置である前記マイクロフォン７６から入力された音声情報に基づいて、その音声情報の入力主体である利用者の歌唱音声に係る音域を判定する。本実施例において、歌唱音声に係る音域とは、その歌唱音声に関して適切に発声できている音域をいい、以下に説明するようにして判定される最低音から最高音までの音の範囲をいう。前記音域判定制御手段１１２は、好適には、前記カラオケ演奏制御手段１１０による演奏曲の出力に際して、前記マイクロフォン７６から入力された音声情報に基づいて、その演奏曲に対応する前記利用者の歌唱音声に係る音域を判定する。具体的には、前記カラオケ演奏制御手段１１０による演奏曲の出力と併行して、前記マイクロフォン７６から入力される音声に対応して、音程及びテンポを評価基準とする評価を行う。例えば、前記演奏曲の出力に際して、前記音声処理部５２を介して出力されるその演奏曲のピッチ（音程）及びテンポと、前記マイクロフォン７６により入力される音声情報のピッチ及びテンポとを比較することにより、前記マイクロフォン７６から入力される歌唱音声において、前記演奏曲の各テンポにおけるピッチが正しく発声できているか否かを判定する。そして、判定対象となる演奏曲１曲の演奏を通して、正しく発声できていたと判定された最低音から最高音までの音域を、前記演奏曲に対応する前記利用者の歌唱音声に係る音域として判定する。 The sound range determination control means 112 determines the sound range related to the singing sound of the user who is the input subject of the sound information, based on the sound information input from the microphone 76 which is a sound input device. In the present embodiment, the sound range related to the singing voice refers to a sound range that can be appropriately uttered with respect to the singing voice, and refers to a range of sound from the lowest sound to the highest sound determined as described below. The sound range determination control means 112 is preferably configured to output the performance song by the karaoke performance control means 110 based on the voice information input from the microphone 76 when the karaoke performance control means 110 outputs the song song of the user corresponding to the performance song. To determine the range of sound. Specifically, in parallel with the output of the performance tune by the karaoke performance control means 110, evaluation is performed using the pitch and tempo as evaluation criteria corresponding to the sound input from the microphone 76. For example, when the performance music is output, the pitch (pitch) and tempo of the performance music output via the audio processing unit 52 are compared with the pitch and tempo of the audio information input by the microphone 76. Thus, in the singing voice input from the microphone 76, it is determined whether or not the pitch at each tempo of the performance music can be uttered correctly. Then, the range from the lowest sound to the highest sound determined to have been uttered correctly through the performance of one performance song to be determined is determined as the range related to the user's singing voice corresponding to the performance song. .

前記音域記憶制御手段１１４は、前記カラオケ演奏制御手段１１０による演奏曲の出力に際して、前記音域判定制御手段１１２により判定された音域に関する音域情報を、前記演奏曲の識別情報すなわち選曲番号及び前記音域情報に対応する利用者の識別情報すなわちユーザＩＤと対応付けて前記ＳＮＳデータベース１０８に記憶させる。ここで、前記音域情報に対応する利用者とは、好適には、前記音域判定制御手段１１２の判定対象となった演奏曲の演奏主体（基本的には、音声情報の入力主体）としての利用者であり、具体的には、前記演奏曲（楽曲情報）の選曲主体である利用者である。前記音域情報とは、好適には、前記音域判定制御手段１１２により演奏曲１曲の演奏を通して正しく発声できていたと判定された最低音から最高音までの音域である。例えば、斯かる最低音及び最高音が、前記音域情報に相当する。 The range storage control unit 114 outputs the range information related to the range determined by the range determination control unit 112 when the karaoke performance control unit 110 outputs the performance song, and the identification information of the performance song, that is, the music selection number and the range information. Is stored in the SNS database 108 in association with user identification information corresponding to the user ID, that is, the user ID. Here, the user corresponding to the sound range information is preferably used as a performance subject (basically, a sound information input subject) of the performance tune that is the determination target of the sound range determination control means 112. Specifically, it is a user who is the main music selection subject of the performance music (music information). The sound range information is preferably a sound range from the lowest sound to the highest sound determined by the sound range determination control means 112 as having been correctly uttered through the performance of one performance piece. For example, the lowest sound and the highest sound correspond to the sound range information.

前記音域記憶制御手段１１４は、例えば、前記音域情報を、前記演奏曲の出力に係る選曲履歴に付随する情報として、その演奏曲の選曲主体である利用者のユーザＩＤと対応付けて前記ＳＮＳデータベース１０８に記憶させる。すなわち、前記カラオケ演奏制御手段１１０による演奏曲の出力に際して、前記音域判定制御手段１１２により音域の判定が行われた場合、前記出力された演奏曲の選曲履歴及びその選曲履歴に付随する情報としての前記音域情報を、その演奏曲の選曲主体である利用者のユーザＩＤと対応付けて前記ＳＮＳデータベース１０８に記憶させる。 For example, the sound range storage control unit 114 associates the sound range information with the user ID of the user who is the music selection subject of the performance music as information accompanying the music selection history related to the output of the performance music. 108 is stored. That is, when a musical range is determined by the musical range determination control unit 112 when the musical performance control unit 110 outputs a musical composition, the music selection history of the output musical composition and information accompanying the music selection history are used. The range information is stored in the SNS database 108 in association with the user ID of the user who is the subject of the music selection.

前記利用者音域特定手段１１６は、各利用者に対応して前記ＳＮＳデータベース１０８に記憶された前記音域情報に基づいて、その利用者の歌唱音声に係る音域である利用者音域を特定する。具体的には、各利用者に対応して前記ＳＮＳデータベース１０８に記憶された複数の前記音域情報に関して、規定数の音域情報に重複して含まれる音域を、前記利用者の歌唱音声に係る利用者音域として特定する。例えば、各利用者に対応して前記ＳＮＳデータベース１０８に記憶された、前記音域情報に対応付けられた複数の選曲履歴に関して、記憶された時点が新しい方から複数曲分（例えば、３曲分）の選曲履歴にそれぞれ対応する音域情報に重複（共通）して含まれる音域を、前記利用者の歌唱音声に係る利用者音域として特定する。すなわち、斯かる態様において、前記利用者音域特定手段１１６による前記利用者の歌唱音声に係る利用者音域の特定には、その利用者に対応する前記音域情報が少なくとも３つ（３曲分）必要とされる。前記利用者音域特定手段１１６は、或いは、前記音域情報に対応付けられた全ての選曲履歴に関して、各選曲履歴にそれぞれ対応する音域情報に重複して含まれる音域を、前記利用者の歌唱音声に係る利用者音域として特定するものであってもよい。或いは、各利用者に対応して前記ＳＮＳデータベース１０８に記憶された前記音域情報に対応付けられた複数の選曲履歴に関して、少なくとも１曲に対応する音域情報に含まれる音域を前記利用者の歌唱音声に係る利用者音域として特定するものであってもよい。すなわち、各利用者に対応して前記ＳＮＳデータベース１０８に記憶された全ての選曲履歴にそれぞれ対応する音域情報における最低音から最高音までの音域を、前記利用者の歌唱音声に係る利用者音域として特定するものであってもよい。前記利用者音域特定手段１１６は、以上のようにして特定された前記利用者音域を、各利用者のユーザＩＤと対応付けて前記ＳＮＳデータベース１０８に記憶させる。 The user sound range specifying means 116 specifies a user sound range that is a sound range related to the singing voice of the user based on the sound range information stored in the SNS database 108 corresponding to each user. Specifically, for a plurality of the range information stored in the SNS database 108 corresponding to each user, the range included in the specified number of range information is used for the user's singing voice. Identified as a human voice range. For example, with respect to a plurality of music selection histories associated with the range information stored in the SNS database 108 corresponding to each user, the stored time points are for a plurality of songs (for example, three songs) from the newest one. Is specified as the user's range related to the user's singing voice. That is, in such an aspect, in order to specify the user sound range related to the user's singing voice by the user sound range specifying means 116, at least three (three music pieces) of the sound range information corresponding to the user is required. It is said. The user sound range specifying means 116 or, for all the music selection histories associated with the sound range information, the sound range included in the sound range information corresponding to each music selection history is included in the user's singing voice. It may be specified as such a user sound range. Alternatively, regarding a plurality of music selection histories associated with the sound range information stored in the SNS database 108 corresponding to each user, the sound range included in the sound range information corresponding to at least one song is represented by the user's singing voice. It may be specified as the user's sound range related to. That is, the range from the lowest tone to the highest tone in the range information corresponding to all the music selection histories stored in the SNS database 108 corresponding to each user is set as the user range related to the user's singing voice. It may be specified. The user sound range specifying means 116 stores the user sound range specified as described above in the SNS database 108 in association with the user ID of each user.

前記利用者音域特定手段１１６は、好適には、各利用者に対応して前記ＳＮＳデータベース１０８に記憶された音域情報に基づいて、その利用者の歌唱音声に係る安定音域及びチャレンジ音域を判定する。ここで、安定音域とは、各利用者が危なげなく発声できる音域に相当する。チャレンジ音域とは、各利用者が少し無理をすれば発声できる音域を含む音域に相当する。好適には、前記チャレンジ音域は、前記安定音域を包含し且つその安定音域よりも広い音域に相当する。 The user range specifying means 116 preferably determines a stable range and a challenge range related to the singing voice of the user based on the range information stored in the SNS database 108 corresponding to each user. . Here, the stable sound range corresponds to a sound range that each user can utter without danger. The challenge sound range corresponds to a sound range including a sound range that can be uttered if each user makes a little effort. Preferably, the challenge sound range includes the stable sound range and corresponds to a sound range wider than the stable sound range.

前記利用者音域特定手段１１６は、好適には、各利用者に対応して前記ＳＮＳデータベース１０８に記憶された複数の前記音域情報に関して、規定数以上の音域情報に重複して含まれる音域を前記安定音域として特定し、少なくとも１つの音域情報に含まれる音域を前記チャレンジ音域として特定する。例えば、各利用者に対応して前記ＳＮＳデータベース１０８に記憶された複数の選曲履歴にそれぞれ対応する音域情報に関して、３曲分の選曲履歴にそれぞれ対応する音域情報に重複（共通）して含まれる音域を前記安定音域として特定する。或いは、全ての選曲履歴にそれぞれ対応する音域情報に重複して含まれる音域を前記安定音域として特定するものであってもよい。各利用者に対応して前記ＳＮＳデータベース１０８に記憶された複数の選曲履歴にそれぞれ対応する音域情報に関して、少なくとも１曲に対応する音域情報に含まれる音域を前記チャレンジ音域として特定する。すなわち、各利用者に対応して前記ＳＮＳデータベース１０８に記憶された全ての選曲履歴にそれぞれ対応する音域情報における最低音から最高音までの音域を前記チャレンジ音域として特定する。或いは、前記安定音域の判定に係る規定数よりも少ない音域情報に重複して含まれる音域を前記チャレンジ音域として特定する。例えば、３曲分の選曲履歴にそれぞれ対応する音域情報に重複して含まれる音域を前記安定音域として特定する態様において、２曲分の選曲履歴にそれぞれ対応する音域情報に重複して含まれる音域を前記チャレンジ音域として特定する。前記利用者音域特定手段１１６は、以上のようにして特定された前記安定音域及びチャレンジ音域を、各利用者のユーザＩＤと対応付けて前記ＳＮＳデータベース１０８に記憶させる。 Preferably, the user sound range specifying means 116, for a plurality of the sound range information stored in the SNS database 108 corresponding to each user, includes a sound range that is redundantly included in the sound range information of a specified number or more. A stable range is specified, and a range included in at least one range information is specified as the challenge range. For example, regarding the range information corresponding to each of a plurality of song selection histories stored in the SNS database 108 corresponding to each user, the range information corresponding to the song selection history for three songs is included in duplicate (common). A sound range is specified as the stable sound range. Alternatively, a sound range that is redundantly included in the sound range information corresponding to all the music selection histories may be specified as the stable range. Regarding the range information corresponding to each of a plurality of music selection histories stored in the SNS database 108 corresponding to each user, the range included in the range information corresponding to at least one song is specified as the challenge range. That is, the range from the lowest tone to the highest tone in the range information corresponding to all the music selection histories stored in the SNS database 108 corresponding to each user is specified as the challenge range. Alternatively, a range that is redundantly included in the range information less than the prescribed number related to the determination of the stable range is specified as the challenge range. For example, in the aspect in which the range that is included in the range information corresponding to the song selection history for three songs is specified as the stable range, the range that is included in the range information corresponding to the song selection history for two songs. Is identified as the challenge range. The user sound range specifying means 116 stores the stable sound range and challenge sound range specified as described above in the SNS database 108 in association with the user ID of each user.

前記動画投稿制御手段１１８は、前記カラオケ装置１６による前記サーバ１８への動画情報の投稿を制御する。例えば、前記カラオケ装置１６により、前記楽曲データベース１０６に記憶された何れかの楽曲情報に対応するカラオケ演奏が行われる際、前記デジタルカメラ８６により撮影された映像情報及び前記マイクロフォン７６により入力された音声情報を含む動画情報を記録し、その動画情報を、対応する演奏曲情報すなわち前記楽曲情報の選曲番号及び投稿者のユーザＩＤと対応付けて前記サーバ１８へ投稿（アップロード）する。好適には、前記デジタルカメラ８６により撮影された映像情報と、前記マイクロフォン７６により入力されて前記音声処理部５２によりディジタル信号に変換された音声情報とを、例えばＡＶＩ形式、ＭＰＥＧ形式、ＦＬＶ形式等のファイル形式にて統合し、映像情報及び音声情報を含む動画情報（動画ファイル）として記録する。図４に示す各制御手段は、映像情報及び音声情報をそれぞれ個別の情報（ファイル）として記録、送受信、乃至蓄積等するものであってもよいが、本実施例においてはそれらを統合して動画ファイルとして記録、送受信、乃至蓄積等する態様について説明する。すなわち、本実施例において、特に言及しない場合には、動画情報は、映像情報及びその再生に際して出力される音声に対応する音声情報を含むものである。 The moving image posting control means 118 controls posting of moving image information to the server 18 by the karaoke device 16. For example, when the karaoke device 16 performs a karaoke performance corresponding to any piece of music information stored in the music database 106, the video information photographed by the digital camera 86 and the voice input by the microphone 76. The moving image information including the information is recorded, and the moving image information is posted (uploaded) to the server 18 in association with the corresponding performance music information, that is, the music selection number of the music information and the user ID of the poster. Preferably, the video information captured by the digital camera 86 and the audio information input by the microphone 76 and converted into a digital signal by the audio processing unit 52 are, for example, AVI format, MPEG format, FLV format, etc. Are recorded as moving image information (moving image file) including video information and audio information. Each control unit shown in FIG. 4 may record, transmit / receive, or store video information and audio information as individual information (files), but in the present embodiment, they are integrated into a moving image. A mode of recording, transmission / reception, and accumulation as a file will be described. That is, in this embodiment, unless otherwise specified, the moving image information includes video information and audio information corresponding to audio output at the time of reproduction.

前記動画投稿制御手段１１８は、好適には、前記カラオケ装置１６により所定の楽曲情報に対応するカラオケ演奏が行われる際、前記デジタルカメラ８６により撮影された映像情報及び前記マイクロフォン７６により入力された音声情報を含む動画情報を記録すると共に、前記音域判定制御手段１１２により前記利用者の歌唱音声に係る音域情報を判定し、前記動画情報を、前記楽曲情報の選曲番号、投稿者のユーザＩＤ、及び前記音域情報と対応付けて前記サーバ１８へ投稿する。すなわち、前記動画情報の記録に際して行われていたカラオケ演奏に対応して前記音域判定制御手段１１２により判定された音域情報を、前記動画情報と対応付けて前記サーバ１８へ投稿する。 The video posting control means 118 is preferably configured such that when the karaoke device 16 performs a karaoke performance corresponding to predetermined music information, the video information photographed by the digital camera 86 and the voice input by the microphone 76 are used. While recording moving image information including information, the range determination control means 112 determines the range information related to the user's singing voice, and the moving image information includes the music selection number of the music information, the user ID of the poster, and Posting to the server 18 in association with the range information. That is, the sound range information determined by the sound range determination control means 112 corresponding to the karaoke performance performed at the time of recording the moving image information is posted to the server 18 in association with the moving image information.

前記投稿受付制御手段１２０は、前記通信回線２０を介して前記カラオケ装置１６をはじめとする通信端末装置から、予め定められた複数の演奏曲情報の何れかに対応する動画情報の投稿を受け付け、その投稿された動画情報を対応する演奏曲情報と対応付けて前記ＳＮＳデータベース１０８に記憶させる。例えば、前記動画投稿制御手段１１８により投稿された動画情報を、その動画情報に対応付けられた楽曲情報の選曲番号及びユーザＩＤと対応付けて前記ＳＮＳデータベース１０８に記憶させる。換言すれば、前記動画投稿制御手段１１８により投稿された動画情報を、その動画情報に対応付けられたユーザＩＤに対応する利用者を投稿主体とする投稿動画として、その動画情報に対応付けられた楽曲情報の選曲番号と対応付けて前記ＳＮＳデータベース１０８に記憶させる。前記動画情報に音域情報が対応付けられている場合には、前記動画投稿制御手段１１８により投稿された動画情報を、その動画情報に対応付けられた楽曲情報の選曲番号、ユーザＩＤ、及び音域情報と対応付けて前記ＳＮＳデータベース１０８に記憶させる。また好適には、前記投稿された動画情報を、投稿主体である利用者に対応して前記ＳＮＳデータベース１０８に記憶された利用者音域と対応付けて前記ＳＮＳデータベース１０８に記憶させる。或いは、前記投稿された動画情報に対応付けられた投稿主体である利用者のユーザＩＤをもって、後述する動画情報の抽出に際してその利用者に対応して前記ＳＮＳデータベース１０８に記憶された利用者音域を読み出すものであってもよい。 The posting acceptance control means 120 accepts posting of moving picture information corresponding to any of a plurality of pieces of predetermined performance music information from the communication terminal device such as the karaoke device 16 via the communication line 20. The posted moving image information is stored in the SNS database 108 in association with the corresponding performance music information. For example, the moving image information posted by the moving image posting control means 118 is stored in the SNS database 108 in association with the music selection number and the user ID of the music information associated with the moving image information. In other words, the moving image information posted by the moving image posting control means 118 is associated with the moving image information as a posted moving image whose user is the user corresponding to the user ID associated with the moving image information. It is stored in the SNS database 108 in association with the music selection number of the music information. When the range information is associated with the moving image information, the moving image information posted by the moving image posting control means 118 is changed to the music selection number, the user ID, and the range information of the music information associated with the moving image information. And stored in the SNS database 108. Preferably, the posted moving image information is stored in the SNS database 108 in association with the user range stored in the SNS database 108 corresponding to the user who is the posting subject. Alternatively, with the user ID of the user who is the posting subject associated with the posted moving picture information, the user sound range stored in the SNS database 108 corresponding to the user when extracting the moving picture information to be described later is used. You may read.

前記動画配信制御手段１２２は、前記ＳＮＳデータベース１０８に記憶された動画情報を前記通信回線２０を介して配信する。例えば、前記カラオケ装置１６、携帯電話機３０、タブレット３２、或いはパーソナルコンピュータ３６等、前記通信回線２０に接続された通信端末装置から前記通信回線２０を介して所定の動画情報の配信要求があった場合には、その配信要求に応じて前記ＳＮＳデータベース１０８に記憶されたその配信要求に係る動画情報を要求元である通信端末装置に前記通信回線２０を介して配信する（図４においては、携帯電話機３０に対する配信を図示している）。この動画配信制御手段１２２により各通信端末装置に配信された動画情報は、斯かる通信端末装置に備えられたブラウザ（Web browser）等によりその通信端末装置の表示部（映像表示装置）に表示される。 The moving image distribution control unit 122 distributes moving image information stored in the SNS database 108 via the communication line 20. For example, when there is a request for distribution of predetermined moving image information from the communication terminal device connected to the communication line 20, such as the karaoke device 16, the mobile phone 30, the tablet 32, or the personal computer 36, via the communication line 20. In response to the distribution request, the video information related to the distribution request stored in the SNS database 108 is distributed to the requesting communication terminal device via the communication line 20 (in FIG. 30 for delivery). The moving image information distributed to each communication terminal device by the moving image distribution control means 122 is displayed on the display unit (video display device) of the communication terminal device by a browser (Web browser) provided in the communication terminal device. The

前記動画配信制御手段１２２は、前記ＳＮＳデータベース１０８に記憶された複数の動画情報を、共演動画（コラボ動画）として前記通信回線２０を介して配信する。この共演動画とは、同一の楽曲に対応する複数（好適には１組）の動画情報を、所定の表示画面内において同期（併行）して出力させるものである。すなわち、前記動画配信制御手段１２２は、前記通信回線２０に接続された通信端末装置からその通信回線２０を介して所定の共演動画の配信要求があった場合には、その配信要求に応じて前記ＳＮＳデータベース１０８に記憶されたその配信要求に係る共演動画を表示させるためのデータを要求元である通信端末装置に前記通信回線２０を介して配信する。この共演動画を表示させるためのデータは、好適には、前記ＳＮＳデータベース１０８においてそれぞれ独立した動画情報として記憶された組み合わせ対象となる動画情報と、それらの動画情報を共演動画として同期出力させるための組み合わせ情報としての共演動画情報であるが、それら組み合わせに係る複数の動画情報が予め合成され、単一の動画情報（動画ファイル）として記憶されたものであってもよい。 The moving image distribution control means 122 distributes a plurality of pieces of moving image information stored in the SNS database 108 through the communication line 20 as a co-starring moving image (collaboration moving image). The co-starring moving image is to output a plurality (preferably one set) of moving image information corresponding to the same music piece in synchronization (in parallel) within a predetermined display screen. That is, when there is a request for distribution of a predetermined co-starring moving image via the communication line 20 from the communication terminal device connected to the communication line 20, the moving image distribution control means 122 responds to the distribution request. Data for displaying the co-starring moving image related to the distribution request stored in the SNS database 108 is distributed to the communication terminal device that is the request source via the communication line 20. The data for displaying the co-star video is preferably video information to be combined and stored as independent video information in the SNS database 108, and for synchronously outputting the video information as the co-star video. Although it is co-starring moving image information as combination information, a plurality of pieces of moving image information related to the combination may be synthesized in advance and stored as single moving image information (moving image file).

前記共演動画編集制御手段１２４は、前記投稿受付制御手段１２０により受け付けられる複数の動画情報を組み合わせて共演動画情報を編集する。すなわち、前記動画配信制御手段１２２により、前記携帯電話機３０をはじめとする通信端末装置に対して前記共演動画を配信するための情報を編集する。すなわち、前記投稿受付制御手段１２０により受け付けられる、共通の演奏曲情報に対応付けられた複数の動画情報を組み合わせて共演動画情報を編集し、その編集された共演動画情報をその共通の演奏曲情報と対応付けて前記ＳＮＳデータベース１０８に記憶させる。換言すれば、前記投稿受付制御手段１２４により受け付けられる、共通の演奏曲に対応付けられた複数の動画情報を組み合わせて共演動画情報を編集し、その編集された共演動画情報をその演奏曲の識別情報（コンテンツＩＤ）と対応付けて前記動画データベース６６に記憶させる。 The co-star video editing control unit 124 edits the co-star video information by combining a plurality of video information received by the posting reception control unit 120. That is, the moving image distribution control unit 122 edits information for distributing the co-starring moving image to communication terminal devices such as the mobile phone 30. That is, the co-star video information is edited by combining a plurality of video information associated with the common music piece information received by the posting reception control means 120, and the edited co-star video information is used as the common music piece information. And stored in the SNS database 108. In other words, the co-star video information is edited by combining a plurality of pieces of video information associated with the common musical piece accepted by the posting acceptance control unit 124, and the edited co-star video information is identified as the musical piece. It is stored in the moving image database 66 in association with information (content ID).

前記共演動画編集制御手段１２４により編集される共演動画情報とは、前記組み合わせに係る複数の動画情報を同期して（同時に）出力させるための情報であり、好適には、前記組み合わせに係る複数の動画情報が合成され、それら複数の動画情報とは別に１動画情報（動画ファイル）としての共演動画情報が作成されて前記ＳＮＳデータベース１０８に記憶される。或いは、前記ＳＮＳデータベース１０８にそれぞれ個別に（独立した動画ファイルとして）記憶された組み合わせに係る複数の動画情報が、後述する図５に示すように共演映像として同期出力されるものであってもよい。すなわち、前記共演動画情報は、前記ＳＮＳデータベース１０８にそれぞれ個別に（独立した動画ファイルとして）記憶された組み合わせに係る複数の動画情報を読み出し、それら動画情報を共演映像として同期出力させるための組み合わせ情報に相当するものであってもよい。前記共演動画編集制御手段１２４は、好適には、前記共演動画情報の属性情報として、対象となる演奏曲の識別情報（選曲番号）、その演奏曲の曲名、アーティスト名、演奏時間、投稿日時、組み合わせに係る各動画情報の投稿者の識別情報（ユーザＩＤ）、各投稿者の名前（ニックネーム）、各投稿者のコメント、共演映像のタイトル等を、対象となる共演動画情報と対応付けて前記ＳＮＳデータベース１０８に記憶させる。 The co-star video information edited by the co-star video editing control means 124 is information for synchronizing (simultaneously) outputting a plurality of video information related to the combination, and preferably a plurality of video related to the combination. The moving image information is synthesized, and apart from the plurality of moving image information, co-starred moving image information as one moving image information (moving image file) is created and stored in the SNS database 108. Alternatively, a plurality of pieces of video information related to combinations stored individually (as independent video files) in the SNS database 108 may be synchronously output as a co-star video as shown in FIG. 5 described later. . That is, the co-star video information is a combination information for reading a plurality of video information related to the combinations stored individually (as independent video files) in the SNS database 108 and synchronously outputting the video information as a co-star video. It may correspond to. The co-starring video editing control means 124 preferably has, as attribute information of the co-starring video information, identification information (music selection number) of the target performance song, song name, artist name, performance time, posting date / time of the performance song, The identification information (user ID) of each contributor of each video information related to the combination, the name (nickname) of each contributor, the comment of each contributor, the title of the co-star video, etc., in association with the target co-star video information It is stored in the SNS database 108.

前記共演動画編集制御手段１２４は、好適には、前記投稿受付制御手段１２２により先に投稿された第１の動画情報に対して、後に投稿された第２の動画情報を組み合わせて前記共演動画情報を編集する。前記動画情報配信システム１０において、好適には、前記カラオケ装置１６によるカラオケ演奏に際して、共演動画の組み合わせ対象となる第１の動画情報を出力させつつ、その出力と併行して前記第２の動画情報を記録する制御が実行される。すなわち、前記カラオケ装置１６による所定の楽曲情報の演奏に先立って、前記ＳＮＳデータベース１０８に記憶されたその楽曲情報（選曲番号）に対応する複数の動画情報のうち、前記カラオケ演奏と併行して出力される前記第１の動画情報が選択される。前記楽曲情報に基づくカラオケ演奏に際して、前記カラオケ装置１６により前記第１の動画情報が出力され、その出力と同期して前記楽曲情報に基づくカラオケ演奏が行われる。このカラオケ演奏に対応して、前記動画投稿制御手段１１８により動画情報の記録が行われ、カラオケ演奏の終了後、前記第２の動画情報として前記サーバ１８へ投稿される。そして、前記共演動画編集制御手段１２４により、前記第１の動画情報に対して、新たに投稿された前記第２の動画情報が組み合わされ、前記共演動画情報が編集される。 Preferably, the co-star video editing control unit 124 combines the first video information previously posted by the post acceptance control unit 122 with the second video information posted later, and the co-star video information. Edit. In the moving image information distribution system 10, the second moving image information is preferably output in parallel with the output of the first moving image information to be combined with the co-starring moving images when the karaoke device 16 performs the karaoke performance. Is recorded. That is, prior to the performance of the predetermined music information by the karaoke device 16, among a plurality of pieces of moving picture information corresponding to the music information (music selection number) stored in the SNS database 108, it is output in parallel with the karaoke performance. The first moving image information to be selected is selected. When performing a karaoke performance based on the music information, the first moving image information is output by the karaoke device 16, and a karaoke performance based on the music information is performed in synchronization with the output. Corresponding to this karaoke performance, moving image information is recorded by the moving image posting control means 118, and after the karaoke performance is completed, it is posted to the server 18 as the second moving image information. Then, the co-star video editing control unit 124 combines the first video information with the newly posted second video information to edit the co-star video information.

図５は、前記カラオケ装置１６、携帯電話機３０、タブレット３２、或いはパーソナルコンピュータ３６等の通信端末装置に表示される共演動画１３０の一例を示す図である。この図５に示す例においては、共通の楽曲情報に対応付けられた１対の動画情報の組み合わせに係る共演動画情報の表示について説明しているが、３つ以上の動画情報の組み合わせに係る共演動画情報も考えられる。図５に示す共演動画１３０においては、第１の動画情報１３２と、その第１の動画情報１３２の出力と同期して実行されたカラオケ演奏の出力に際して前記動画投稿制御手段１１８により記録された第２の動画情報１３４とが、所定のウェブサイト上において同期して（同時に）出力される例を示している。この図５に示すウェブサイトは、例えば、前記動画情報配信システム１０を運営する情報配信サービス提供会社により管理される所謂動画投稿サイトに相当するものであり、前記カラオケ装置１６において閲覧可能とされると共に、一般的なブラウザにより前記携帯電話機３０、タブレット３２、或いはパーソナルコンピュータ３６等の通信端末装置においても閲覧可能とされる。このウェブサイト上においては、出力される共演動画に係る共通の演奏曲の曲名及びアーティスト名が表示されると共に、再生／停止ボタン１３６、頭出しボタン１３８、再生位置決定つまみ１４０、音量設定つまみ１４２、及び表示画面の全画面表示乃至等倍表示を切り替える表示切替ボタン１４４等が表示され、各ボタン乃至つまみにより共演映像の出力に係る各種操作が可能とされている。 FIG. 5 is a diagram showing an example of a co-star movie 130 displayed on a communication terminal device such as the karaoke device 16, the mobile phone 30, the tablet 32, or the personal computer 36. In the example shown in FIG. 5, the display of co-starring video information related to a combination of a pair of video information associated with common music information has been described, but co-starring related to a combination of three or more video information Video information can also be considered. In the co-star movie 130 shown in FIG. 5, the first movie information 132 and the first movie information recorded by the movie posting control means 118 when the karaoke performance is executed in synchronization with the output of the first movie information 132 are recorded. In this example, two pieces of moving image information 134 are output synchronously (simultaneously) on a predetermined website. The website shown in FIG. 5 corresponds to, for example, a so-called video posting site managed by an information distribution service provider that operates the video information distribution system 10 and can be browsed on the karaoke apparatus 16. At the same time, it can be browsed on a communication terminal device such as the mobile phone 30, the tablet 32, or the personal computer 36 by a general browser. On this website, the name and artist name of a common performance song related to the output co-starring movie are displayed, and a play / stop button 136, a cue button 138, a playback position determination knob 140, and a volume setting knob 142 are displayed. , And a display switching button 144 for switching between full-screen display and same-size display on the display screen are displayed, and various operations related to the output of the co-starring video can be performed by each button or knob.

前記動画音域判定手段１２６は、前記ＳＮＳデータベース１０８に記憶された複数の動画情報それぞれにおける音域を判定する。ここで、前記動画情報における音域とは、その動画情報の投稿主体である利用者に関する音域、その動画情報に対応付けられた演奏曲情報に関する音域、或いはその動画情報に含まれる音声情報に関する音域等である。前記動画音域判定手段１２６は、好適には、前記動画情報の投稿主体である利用者に対応して前記ＳＮＳデータベース１０８に記憶された利用者音域を、その動画情報における音域として判定する。また好適には、前記動画情報に対応付けられて記憶された音域情報を、その動画情報における音域として判定する。また好適には、前記動画情報に対応付けられた演奏曲情報に対応する演奏曲の最低音から最高音までの音域を、その動画情報における音域として判定する。また好適には、前記動画情報に含まれる音声情報を解析すること等により、その音声情報における最低音から最高音までの音域を、その動画情報における音域として判定する。 The moving image sound range determination unit 126 determines a sound range in each of a plurality of pieces of moving image information stored in the SNS database 108. Here, the sound range in the moving image information refers to a sound range related to a user who is a posting subject of the moving image information, a sound range related to performance music information associated with the moving image information, a sound range related to audio information included in the moving image information, or the like. It is. The moving image sound range determining means 126 preferably determines the user sound range stored in the SNS database 108 corresponding to the user who is the posting subject of the moving image information as the sound range in the moving image information. Preferably, sound range information stored in association with the moving image information is determined as a sound range in the moving image information. Preferably, the sound range from the lowest sound to the highest sound of the performance music corresponding to the performance music information associated with the video information is determined as the sound range in the video information. Preferably, the sound range from the lowest sound to the highest sound in the sound information is determined as the sound range in the moving image information by analyzing sound information included in the moving image information.

前記動画抽出制御手段１２８は、前記ＳＮＳデータベース１０８に記憶された複数の動画情報のうちから、その動画情報における音域が、前記利用者音域特定制御手段により特定された前記利用者に対応する利用者音域に適合する動画情報を、前記共演動画情報の組み合わせ候補として抽出する。すなわち、前記ＳＮＳデータベース１０８に記憶された複数の動画情報のうち、前記動画音域判定手段１２６により判定されるその動画情報における音域が、抽出された動画情報を提示する等その抽出の対象となる利用者（以下、単に対象となる利用者という）に対応して前記ＳＮＳデータベース１０８に記憶された利用者音域に適合する動画情報を、その利用者に対応する前記共演動画情報の組み合わせ候補として抽出する。換言すれば、対象となる利用者が前記カラオケ装置１６によりカラオケ演奏を行う際における、前記第１の動画情報の候補として抽出する。以下に詳述する動画情報の抽出に先立ち、抽出対象となる動画情報に対応する演奏曲の曲名、歌手名、或いはジャンル等による絞り込みが行われるものであってもよい。斯かる態様において、前記動画抽出制御手段１２８は、前記ＳＮＳデータベース１０８に記憶された複数の動画情報のうちから、指定された曲名、歌手名、ジャンル等の条件に合致し、且つその動画情報における音域が、前記利用者音域特定制御手段により特定された前記利用者に対応する利用者音域に適合する動画情報を、前記共演動画情報の組み合わせ候補として抽出する。 The moving image extraction control means 128 is a user corresponding to the user whose sound range in the moving image information is specified by the user sound range specifying control means from among a plurality of pieces of moving picture information stored in the SNS database 108. The moving image information suitable for the sound range is extracted as a combination candidate of the co-starring moving image information. In other words, among the plurality of pieces of moving picture information stored in the SNS database 108, the sound range in the moving picture information determined by the moving picture sound range determination unit 126 is used for extraction, such as presenting the extracted moving picture information. Video information that matches a user's range stored in the SNS database 108 corresponding to a user (hereinafter simply referred to as a target user) is extracted as a combination candidate of the co-starring video information corresponding to the user . In other words, when the target user performs a karaoke performance with the karaoke device 16, it is extracted as a candidate for the first moving image information. Prior to the extraction of the moving image information described in detail below, narrowing down by the song name, singer name, genre, etc. of the performance song corresponding to the moving image information to be extracted may be performed. In such an aspect, the moving image extraction control means 128 matches a specified song name, singer name, genre, etc. among the plurality of moving image information stored in the SNS database 108, and the moving image information The moving image information whose sound range matches the user sound range corresponding to the user specified by the user sound range specifying control means is extracted as a combination candidate of the co-starring moving image information.

前記動画抽出制御手段１２８は、具体的には、前記ＳＮＳデータベース１０８に記憶された複数の動画情報のうちから、その動画情報の投稿主体である利用者に対応する利用者音域が、前記対象となる利用者に対応する利用者音域に適合する動画情報を、前記共演動画情報の組み合わせ候補として抽出する。或いは、前記ＳＮＳデータベース１０８に記憶された複数の動画情報のうちから、その動画情報に対応付けられて記憶された音域情報が、前記対象となる利用者に対応する利用者音域に適合する動画情報を、前記共演動画情報の組み合わせ候補として抽出する。或いは、前記ＳＮＳデータベース１０８に記憶された複数の動画情報のうちから、その動画情報に対応付けられた演奏曲情報に対応する演奏曲の最低音から最高音までの音域が、前記対象となる利用者に対応する利用者音域に適合する動画情報を、前記共演動画情報の組み合わせ候補として抽出する。或いは、前記ＳＮＳデータベース１０８に記憶された複数の動画情報のうちから、その動画情報に含まれる音声情報の最低音から最高音までの音域が、前記対象となる利用者に対応する利用者音域に適合する動画情報を、前記共演動画情報の組み合わせ候補として抽出する。更に、これらを組み合わせて動画情報の抽出を行うものであってもよい。 Specifically, the moving image extraction control unit 128 determines that the user sound range corresponding to the user who is the posting subject of the moving image information from the plurality of moving image information stored in the SNS database 108 is the target. The moving image information suitable for the user sound range corresponding to the user is extracted as a combination candidate of the co-starring moving image information. Alternatively, among the plurality of moving image information stored in the SNS database 108, the sound range information stored in association with the moving image information matches the user sound range corresponding to the target user. Are extracted as combination candidates of the co-starring moving picture information. Alternatively, from among a plurality of pieces of moving picture information stored in the SNS database 108, a range from the lowest sound to the highest sound of the musical piece corresponding to the musical piece information associated with the moving picture information is used as the target. Moving image information suitable for the user's range corresponding to the user is extracted as a combination candidate of the co-starring moving image information. Alternatively, the sound range from the lowest sound to the highest sound of the audio information included in the moving image information among the plurality of moving image information stored in the SNS database 108 is the user sound range corresponding to the target user. The suitable moving image information is extracted as a combination candidate of the co-starring moving image information. Further, the moving image information may be extracted by combining these.

前記動画抽出制御手段１２８は、好適には、前記ＳＮＳデータベース１０８に記憶された複数の動画情報のうちから、その動画情報における音域が、前記利用者音域特定制御手段１１６により特定された前記利用者に対応する利用者音域に含まれる動画情報を、前記共演動画情報の組み合わせ候補として抽出する。好適には、前記ＳＮＳデータベース１０８に記憶された複数の動画情報のうちから、その動画情報における音域が、対象となる利用者に対応して前記ＳＮＳデータベース１０８に記憶された利用者音域に包含される（その利用者音域を逸脱しない）動画情報を、前記共演動画情報の組み合わせ候補として抽出する。具体的には、前記ＳＮＳデータベース１０８に記憶された複数の動画情報のうちから、その動画情報の投稿主体である利用者に対応する利用者音域における最低音から最高音までの音域が、前記対象となる利用者に対応する利用者音域に含まれる動画情報を、前記共演動画情報の組み合わせ候補として抽出する。或いは、前記ＳＮＳデータベース１０８に記憶された複数の動画情報のうちから、その動画情報に対応付けられて記憶された音域情報における最低音から最高音までの音域が、前記対象となる利用者に対応する利用者音域に含まれる動画情報を、前記共演動画情報の組み合わせ候補として抽出する。或いは、前記ＳＮＳデータベース１０８に記憶された複数の動画情報のうちから、その動画情報に対応付けられた演奏曲情報（選曲番号）に対応する演奏曲の最低音から最高音までの音域が、前記対象となる利用者に対応する利用者音域に含まれる動画情報を、前記共演動画情報の組み合わせ候補として抽出する。或いは、前記ＳＮＳデータベース１０８に記憶された複数の動画情報のうちから、その動画情報に含まれる音声情報の最低音から最高音までの音域が、前記対象となる利用者に対応する利用者音域に含まれる動画情報を、前記共演動画情報の組み合わせ候補として抽出する。更に、これらを組み合わせて動画情報の抽出を行うものであってもよい。 Preferably, the moving image extraction control unit 128 is configured such that the sound range in the moving image information is specified by the user sound range specifying control unit 116 from among a plurality of pieces of moving image information stored in the SNS database 108. Is extracted as a combination candidate of the co-starring moving image information. Preferably, the sound range in the moving image information is included in the user sound range stored in the SNS database 108 corresponding to the target user among the plurality of moving image information stored in the SNS database 108. Moving image information (which does not deviate from the user's sound range) is extracted as a combination candidate of the co-starring moving image information. Specifically, from among a plurality of pieces of moving image information stored in the SNS database 108, a sound range from a lowest sound to a highest sound in a user sound range corresponding to a user who is a posting subject of the moving image information is the target. The moving image information included in the user sound range corresponding to the user to be extracted is extracted as a combination candidate of the co-starring moving image information. Alternatively, among the plurality of pieces of moving image information stored in the SNS database 108, the sound range from the lowest sound to the highest sound in the sound range information stored in association with the moving image information corresponds to the target user. The moving image information included in the user sound range to be extracted is extracted as a combination candidate of the co-starring moving image information. Alternatively, from among a plurality of pieces of moving image information stored in the SNS database 108, the range from the lowest sound to the highest sound of the musical piece corresponding to the musical piece information (music selection number) associated with the moving image information is The moving image information included in the user sound range corresponding to the target user is extracted as a combination candidate of the co-starring moving image information. Alternatively, the sound range from the lowest sound to the highest sound of the audio information included in the moving image information among the plurality of moving image information stored in the SNS database 108 is the user sound range corresponding to the target user. The moving image information included is extracted as a combination candidate of the co-starring moving image information. Further, the moving image information may be extracted by combining these.

前記動画抽出制御手段１２８は、好適には、前記ＳＮＳデータベース１０８に記憶された複数の動画情報のうちから、その動画情報における音域が、前記利用者音域特定制御手段１１６により特定された前記利用者に対応する利用者音域に規定の許容区間を加えた判定区間に含まれる動画情報を、前記共演動画情報の組み合わせ候補として抽出する。好適には、前記ＳＮＳデータベース１０８に記憶された複数の動画情報のうちから、その動画情報における音域が、対象となる利用者に対応して前記ＳＮＳデータベース１０８に記憶された利用者音域に規定の許容区間を加えた判定区間に包含される（その判定区間を逸脱しない）動画情報を、前記共演動画情報の組み合わせ候補として抽出する。具体的には、前記ＳＮＳデータベース１０８に記憶された複数の動画情報のうちから、その動画情報の投稿主体である利用者に対応する利用者音域における最低音から最高音までの音域が、前記対象となる利用者に対応する前記判定区間に含まれる動画情報を、前記共演動画情報の組み合わせ候補として抽出する。或いは、前記ＳＮＳデータベース１０８に記憶された複数の動画情報のうちから、その動画情報に対応付けられて記憶された音域情報における最低音から最高音までの音域が、前記対象となる利用者に対応する前記判定区間に含まれる動画情報を、前記共演動画情報の組み合わせ候補として抽出する。或いは、前記ＳＮＳデータベース１０８に記憶された複数の動画情報のうちから、その動画情報に対応付けられた演奏曲情報（選曲番号）に対応する演奏曲の最低音から最高音までの音域が、前記対象となる利用者に対応する前記判定区間に含まれる動画情報を、前記共演動画情報の組み合わせ候補として抽出する。或いは、前記ＳＮＳデータベース１０８に記憶された複数の動画情報のうちから、その動画情報に含まれる音声情報の最低音から最高音までの音域が、前記対象となる利用者に対応する前記判定区間に含まれる動画情報を、前記共演動画情報の組み合わせ候補として抽出する。更に、これらを組み合わせて動画情報の抽出を行うものであってもよい。 Preferably, the moving image extraction control unit 128 is configured such that the sound range in the moving image information is specified by the user sound range specifying control unit 116 from among a plurality of pieces of moving image information stored in the SNS database 108. The moving image information included in the determination section obtained by adding a prescribed allowable section to the user sound range corresponding to is extracted as a combination candidate of the co-starring moving image information. Preferably, from among a plurality of pieces of moving image information stored in the SNS database 108, a sound range in the moving image information is defined in a user sound range stored in the SNS database 108 corresponding to a target user. The moving image information included in the determination interval including the allowable interval (without departing from the determination interval) is extracted as a combination candidate of the co-starring moving image information. Specifically, from among a plurality of pieces of moving image information stored in the SNS database 108, a sound range from a lowest sound to a highest sound in a user sound range corresponding to a user who is a posting subject of the moving image information is the target. The moving image information included in the determination section corresponding to the user is extracted as a combination candidate of the co-starring moving image information. Alternatively, among the plurality of pieces of moving image information stored in the SNS database 108, the sound range from the lowest sound to the highest sound in the sound range information stored in association with the moving image information corresponds to the target user. The moving image information included in the determination section is extracted as a combination candidate of the co-starring moving image information. Alternatively, from among a plurality of pieces of moving image information stored in the SNS database 108, the range from the lowest sound to the highest sound of the musical piece corresponding to the musical piece information (music selection number) associated with the moving image information is The moving image information included in the determination section corresponding to the target user is extracted as a combination candidate of the co-starring moving image information. Alternatively, the sound range from the lowest sound to the highest sound of the audio information included in the moving image information among the plurality of moving image information stored in the SNS database 108 is in the determination section corresponding to the target user. The included video information is extracted as a combination candidate of the co-starring video information. Further, the moving image information may be extracted by combining these.

前記許容区間は、好適には、予め定められた所定（一定）の音域に相当する。例えば、低音側及び高音側それぞれに加えられる一定の音域（例えば、それぞれ１音）に相当する。すなわち、前記動画抽出制御手段１２８は、好適には、各利用者に対応して前記ＳＮＳデータベース１０８に記憶されたその利用者の歌唱音声に対応する利用者音域に、前記許容区間として低音側及び高音側それぞれに一定の音域を加えた判定区間と、前記ＳＮＳデータベース１０８に記憶された複数の動画情報それぞれにおける音域とを、比較し、前記判定区間に含まれる（その判定区間を逸脱しない）動画情報を、前記共演動画情報の組み合わせ候補として抽出する。 The allowable section preferably corresponds to a predetermined (constant) sound range that is determined in advance. For example, it corresponds to a certain sound range (for example, one sound each) applied to each of the low sound side and the high sound side. That is, the moving image extraction control means 128 is preferably configured such that the user's range corresponding to each user's singing voice stored in the SNS database 108 corresponds to the low frequency side and A determination section in which a certain range is added to each of the high-pitched sound sides and a sound range in each of a plurality of pieces of moving image information stored in the SNS database 108 are compared, and a moving image included in the determination section (not deviating from the determination section) Information is extracted as a combination candidate of the co-starring moving picture information.

前記許容区間は、好適には、前記動画抽出制御手段１２８による動画情報の抽出の対象となる利用者の性別に応じて異なる値が定められるものである。すなわち、前記動画抽出制御手段１２８は、好適には、前記利用者が男性である場合には、前記利用者音域特定手段１１６により特定されたその利用者に対応する利用者音域の少なくとも低音側に前記許容区間を加え、前記利用者が女性である場合には、前記利用者音域特定手段１１６により特定されたその利用者に対応する利用者音域の少なくとも高音側に前記許容区間を加えることで前記判定区間を定める。或いは、前記利用者が男性である場合には、前記利用者音域特定手段１１６により特定されたその利用者に対応する利用者音域の低音側に加えられる前記許容区間を、前記利用者が女性である場合よりも広い音域に相当するものとする。前記利用者が女性である場合には、前記利用者音域特定手段１１６により特定されたその利用者に対応する利用者音域の高音側に加えられる前記許容区間を、前記利用者が男性である場合よりも広い音域に相当するものとする。 The permissible section is preferably set to have a different value depending on the sex of the user who is the target of the moving image information extraction by the moving image extraction control means 128. That is, preferably, when the user is a male, the moving image extraction control means 128 is located at least on the bass side of the user sound range corresponding to the user specified by the user sound range specifying means 116. When the allowable section is added and the user is a woman, the allowable section is added to at least the high-pitched side of the user range corresponding to the user specified by the user range specifying means 116. Determine the judgment interval. Alternatively, when the user is a male, the user is a female in the permissible section added to the bass side of the user sound range corresponding to the user specified by the user sound range specifying means 116. It corresponds to a wider sound range than some cases. When the user is a woman, when the user is a male, the allowable interval added to the high frequency side of the user sound range corresponding to the user specified by the user sound range specifying means 116 It corresponds to a wider range.

前記動画抽出制御手段１２８は、好適には、前記利用者音域特定手段１１６により特定された前記利用者に対応する利用者音域には含まれないが、その利用者に対応して前記ＳＮＳデータベース１０８に記憶された前記音域情報において、規定回数以上正しく発声できている音高に対応する音域を前記許容区間として前記利用者音域に加えることで、前記判定区間を定める。ここで、前記利用者音域特定手段１１６により前記利用者に対応する安定音域及びチャレンジ音域を特定する態様において、好適には、前記安定音域が、前記利用者に対応する利用者音域に相当し、前記チャレンジ音域が、前記利用者に対応する利用者音域ではないが、その利用者に対応して前記ＳＮＳデータベース１０８に記憶された前記音域情報において、規定回数以上正しく発声できている音高に対応する音域に相当する。すなわち、前記動画抽出制御手段１２８は、好適には、前記利用者音域特定手段１１６により前記利用者に対応する安定音域及びチャレンジ音域を特定する態様において、前記安定音域には含まれないが前記チャレンジ音域には含まれる音域を前記許容区間として前記利用者音域に加えることで、前記判定区間を定める。換言すれば、前記動画抽出制御手段１２８は、好適には、前記ＳＮＳデータベース１０８に記憶された複数の動画情報のうちから、その動画情報における音域が、対象となる利用者に対応して前記ＳＮＳデータベース１０８に記憶された前記チャレンジ音域に含まれる動画情報を、前記共演動画情報の組み合わせ候補として抽出する。 The moving image extraction control means 128 is preferably not included in the user sound range corresponding to the user specified by the user sound range specifying means 116, but the SNS database 108 corresponding to the user. In the sound range information stored in the above, the determination section is determined by adding a sound range corresponding to a pitch that has been uttered correctly more than a specified number of times to the user sound range as the allowable section. Here, in the aspect of specifying the stable sound range and the challenge sound range corresponding to the user by the user sound range specifying means 116, preferably, the stable sound range corresponds to the user sound range corresponding to the user, The challenge range is not a user range corresponding to the user, but corresponds to a pitch that can be uttered correctly more than a specified number of times in the range information stored in the SNS database 108 corresponding to the user. It corresponds to the range to be played. In other words, the moving image extraction control means 128 is preferably not included in the stable sound range in the aspect in which the user sound range specifying means 116 specifies the stable sound range and the challenge sound range corresponding to the user. The determination range is determined by adding a range included in the range to the user range as the allowable range. In other words, the moving image extraction control means 128 preferably has the SNS corresponding to the target user whose sound range in the moving image information is among the plurality of moving image information stored in the SNS database 108. The moving image information included in the challenge sound range stored in the database 108 is extracted as a combination candidate of the co-starring moving image information.

前記動画抽出制御手段１２８は、好適には、前記ＳＮＳデータベース１０８に記憶された複数の動画情報のうちから、その動画情報における音域が、前記利用者音域特定制御手段１１６により特定された前記利用者に対応する利用者音域に含まれる動画情報を、前記共演動画情報の組み合わせ候補として抽出する第１の抽出制御、及び、前記ＳＮＳデータベース１０８に記憶された複数の動画情報のうちから、その動画情報における音域が、前記利用者音域特定制御手段１１６により特定された前記利用者に対応する利用者音域に規定の許容区間を加えた判定区間に含まれる動画情報を、前記共演動画情報の組み合わせ候補として抽出する第２の抽出制御の何れか一方を、前記利用者の入力操作に応じて選択的に実行する。例えば、前記電子早見本装置２８による前記利用者の入力操作（選択入力操作）に応じて、前記第１の抽出制御及び前記第２の抽出制御の何れか一方を実行する。換言すれば、各利用者に対応する利用者音域に収まる動画情報を抽出する前記第１の抽出制御、又は前記利用者音域に余裕をもたせた前記判定区間に収まる動画情報を抽出する前記第２の抽出制御の何れか一方を、前記動画情報の抽出の対象となる利用者の希望に応じて実行する。 Preferably, the moving image extraction control unit 128 is configured such that the sound range in the moving image information is specified by the user sound range specifying control unit 116 from among a plurality of pieces of moving image information stored in the SNS database 108. The first extraction control for extracting the moving image information included in the user sound range corresponding to the above as a combination candidate of the co-starring moving image information, and the moving image information from among the plurality of moving image information stored in the SNS database 108 Video information included in a determination section obtained by adding a specified allowable section to the user range corresponding to the user specified by the user range specification control means 116 as a combination candidate of the co-starring video information Either one of the second extraction controls to be extracted is selectively executed according to the input operation of the user. For example, one of the first extraction control and the second extraction control is executed in response to an input operation (selection input operation) of the user by the electronic quick sample device 28. In other words, the first extraction control for extracting the moving image information that falls within the user sound range corresponding to each user, or the second extraction for extracting the moving image information that falls within the determination section with a margin in the user sound range. Any one of the extraction controls is executed according to the desire of the user who is the target of the extraction of the moving image information.

前記動画抽出制御手段１２８により抽出された動画情報は、好適には、抽出の対象となる利用者の操作に係る前記電子早見本装置２８に選択入力可能に表示される。図６は、前記電子早見本装置２８のタッチパネルディスプレイに表示される動画情報推薦画面１４６を例示する図である。この動画情報推薦画面１４６には、前記動画抽出制御手段１２８により抽出された複数の動画情報それぞれに対応するサムネイル１４８ａ、１４８ｂ、１４８ｃ、１４８ｄ、・・・（以下、特に区別しない場合には単にサムネイル１４８という）が表示される。好適には、前記動画抽出制御手段１２８により抽出された複数の動画情報のうち、対象となる利用者に対応する利用者音域に係る適合度が高いものから優先的に、前記動画情報推薦画面１４６に前記サムネイル１４８として表示される。ここで、前記利用者に対応する利用者音域に係る適合度とは、前記動画音域判定手段１２６により判定される前記動画情報における音域と、前記利用者に対応する利用者音域との差に関する値であり、その差が小さいほど、前記利用者に対応する利用者音域との適合度が高いものとされる。前記動画情報推薦画面１４６には、好適には、前記動画抽出制御手段１２８により抽出された、曲名及び歌手名等による絞り込みを行わない複数の動画情報に対応するサムネイル１４８が表示されるが、この動画情報推薦画面１４６における表示から、前記動画情報に対応する演奏曲の曲名、歌手名、ジャンル等に基づく絞り込みが行われ、条件に適合する動画情報に対応するサムネイル１４６が前記電子早見本装置２８に表示されるものであってもよい。或いは、予め前記動画情報に対応する演奏曲の曲名、歌手名、ジャンル等に基づく絞り込みが行われ、その条件に適合する動画情報であり且つ前記動画抽出制御手段１２８により抽出された動画情報に対応するサムネイル１４６が前記電子早見本装置２８に表示されるものであってもよい。 The moving image information extracted by the moving image extraction control means 128 is preferably displayed on the electronic quick sample device 28 relating to the operation of the user to be extracted so as to be selectively input. FIG. 6 is a diagram illustrating a moving image information recommendation screen 146 displayed on the touch panel display of the electronic quick sample device 28. The moving picture information recommendation screen 146 includes thumbnails 148a, 148b, 148c, 148d,... Corresponding to each of the plurality of moving picture information extracted by the moving picture extraction control means 128 (hereinafter, simply thumbnails unless otherwise distinguished). 148) is displayed. Preferably, the moving image information recommendation screen 146 is preferentially selected from the plurality of moving image information extracted by the moving image extraction control unit 128 in the order of the high degree of fitness related to the user sound range corresponding to the target user. Is displayed as the thumbnail 148. Here, the degree of fitness related to the user sound range corresponding to the user is a value relating to a difference between the sound range in the moving image information determined by the moving image sound range determining unit 126 and the user sound range corresponding to the user. The smaller the difference is, the higher the degree of matching with the user sound range corresponding to the user is. Preferably, the video information recommendation screen 146 displays thumbnails 148 corresponding to a plurality of video information extracted by the video extraction control means 128 and not narrowed down by a song name, a singer name, etc. From the display on the moving picture information recommendation screen 146, narrowing down based on the song name, singer name, genre, etc. of the performance music corresponding to the moving picture information is performed, and the thumbnail 146 corresponding to the moving picture information meeting the conditions is displayed in the electronic quick sample device 28. May be displayed. Alternatively, narrowing based on the song name, singer name, genre, etc. of the performance song corresponding to the video information in advance is performed, and the video information that matches the conditions and corresponds to the video information extracted by the video extraction control means 128 The thumbnail 146 to be displayed may be displayed on the electronic quick sample device 28.

前記動画情報推薦画面１４６において、タッチパネルディスプレイにおける何れかのサムネイル１４８に対応する位置への接触操作が行われると、その操作に係るサムネイル１４８に対応する動画情報を前記第１の動画情報として選択する指示が前記サーバ１８へ送信される。そして、前記サーバ１８から前記第１の動画情報が前記カラオケ装置１６に配信され、そのカラオケ装置１６により前記第１の動画情報が出力され、その出力と同期して前記楽曲情報のカラオケ演奏が行われる。このカラオケ演奏に対応して、前記動画投稿制御手段１１８により動画情報の記録が行われ、カラオケ演奏の終了後、前記第２の動画情報として前記サーバ１８へ投稿される。そして、前記共演動画編集制御手段１２４により、前記第１の動画情報に対して、新たに投稿された前記第２の動画情報が組み合わされ、前記共演動画情報が編集される。 When a touch operation on a position corresponding to any thumbnail 148 on the touch panel display is performed on the video information recommendation screen 146, the video information corresponding to the thumbnail 148 related to the operation is selected as the first video information. An instruction is transmitted to the server 18. Then, the first video information is distributed from the server 18 to the karaoke device 16, and the karaoke device 16 outputs the first video information, and the karaoke performance of the music information is performed in synchronization with the output. Is called. Corresponding to this karaoke performance, moving image information is recorded by the moving image posting control means 118, and after the karaoke performance is completed, it is posted to the server 18 as the second moving image information. Then, the co-star video editing control unit 124 combines the first video information with the newly posted second video information to edit the co-star video information.

図７は、前記カラオケ装置１６のＣＰＵ４２により実行される音域判定制御の一例の要部を説明するフローチャートであり、所定の周期で繰り返し実行されるものである。 FIG. 7 is a flowchart for explaining a main part of an example of the sound range determination control executed by the CPU 42 of the karaoke apparatus 16, and is repeatedly executed at a predetermined cycle.

先ず、ステップ（以下、ステップを省略する）ＳＡ１において、前記ＲＡＭ４６等における予約曲テーブルに演奏曲（予約曲）が記憶されているか否かが判断される。このＳＡ１の判断が否定される場合には、それをもって本ルーチンが終了させられるが、ＳＡ１の判断が肯定される場合には、ＳＡ２において、前記予約曲テーブルにおける先頭（最上位）の予約曲に対応する楽曲情報が例えば前記サーバ１８における前記楽曲データベース１０６から読み出される。次に、ＳＡ３において、ＳＡ２にて読み出された楽曲情報に基づいて、その楽曲情報に対応するカラオケ演奏出力が行われる。次に、ＳＡ４において、前記楽曲情報に基づく演奏曲の出力と併行して、前記マイクロフォン７６から入力される音声情報に基づいて、その音声情報の入力主体である利用者の歌唱音声に係る音域が判定される。次に、ＳＡ５において、楽曲情報に基づく演奏曲の出力が終了させられるか否かが判断される。このＳＡ５の判断が否定される場合には、ＳＡ３以下の処理が再び実行されるが、ＳＡ５の判断が肯定される場合には、ＳＡ６において、ＳＡ４にて判定された音域に関する音域情報、前記楽曲情報の選曲番号、及び前記楽曲情報の選曲主体である利用者のユーザＩＤが対応づけられ、前記通信回線２０を介して前記サーバ１８へ送信された後、本ルーチンが終了させられる。 First, in step (hereinafter, step is omitted) SA1, it is determined whether or not a performance song (reserved song) is stored in the reserved song table in the RAM 46 or the like. If the determination of SA1 is negative, the routine is terminated accordingly, but if the determination of SA1 is affirmative, in SA2, the routine is the first (highest) reserved song in the reserved song table. Corresponding music information is read from the music database 106 in the server 18, for example. Next, in SA3, karaoke performance output corresponding to the music information is performed based on the music information read in SA2. Next, in SA4, along with the output of the performance music based on the music information, based on the audio information input from the microphone 76, the sound range related to the singing voice of the user who is the input subject of the audio information is Determined. Next, in SA5, it is determined whether or not the output of the performance music based on the music information is terminated. When the determination of SA5 is negative, the processing after SA3 is executed again. However, when the determination of SA5 is positive, in SA6, the range information regarding the range determined in SA4, the music The music selection number of the information and the user ID of the user who is the music selection subject of the music information are associated with each other and transmitted to the server 18 via the communication line 20, and then this routine is terminated.

図８は、前記カラオケ装置１６のＣＰＵ４２により実行される動画情報投稿制御の一例の要部を説明するフローチャートであり、所定の周期で繰り返し実行されるものである。 FIG. 8 is a flowchart for explaining a main part of an example of the moving picture information posting control executed by the CPU 42 of the karaoke apparatus 16, and is repeatedly executed at a predetermined cycle.

先ず、ＳＢ１において、前記カラオケ装置１６により演奏が開始される予約曲は、動画記録設定すなわちその演奏に際して動画情報を記録する設定がなされているか否かが判断される。このＳＢ１の判断が否定される場合には、それをもって本ルーチンが終了させられるが、ＳＢ１の判断が肯定される場合には、ＳＢ２において、前記カラオケ装置１６により演奏が開始される演奏曲に対応する楽曲情報が、前記サーバ１８における前記楽曲データベース１０６からダウンロードされる。次に、ＳＢ３において、ＳＢ２にてダウンロードされた楽曲情報に基づいて、その楽曲情報に対応するカラオケ演奏出力が行われる。次に、ＳＢ４において、前記カラオケ装置１６によるカラオケ演奏と併行して、前記デジタルカメラ８６により撮影された映像情報、前記マイクロフォン７６から入力される音声情報、及びその音声情報に基づいて判定される音域情報が記録される。次に、ＳＢ５において、楽曲情報に基づく演奏曲の出力が終了させられるか否かが判断される。このＳＢ５の判断が否定される場合には、ＳＢ３以下の処理が再び実行されるが、ＳＢ５の判断が肯定される場合には、ＳＢ６において、ＳＢ４にて記録された映像情報及び音声情報を含む動画情報、前記楽曲情報の選曲番号、前記楽曲情報の選曲主体である利用者のユーザＩＤ、及びＳＢ４にて記録された音域情報が対応づけられ、前記通信回線２０を介して前記サーバ１８へ送信された後、本ルーチンが終了させられる。 First, in SB1, it is determined whether or not the reserved music whose performance is started by the karaoke device 16 has a moving image recording setting, that is, a setting for recording moving image information during the performance. If the determination at SB1 is negative, the routine is terminated accordingly. If the determination at SB1 is affirmative, in SB2, the performance corresponding to the musical piece whose performance is started by the karaoke device 16 is handled. Music information to be downloaded is downloaded from the music database 106 in the server 18. Next, in SB3, karaoke performance output corresponding to the music information is performed based on the music information downloaded in SB2. Next, in SB4, in parallel with the karaoke performance by the karaoke apparatus 16, video information photographed by the digital camera 86, audio information input from the microphone 76, and a sound range determined based on the audio information Information is recorded. Next, in SB5, it is determined whether or not the output of the performance music based on the music information is terminated. When the determination at SB5 is negative, the processing after SB3 is executed again, but when the determination at SB5 is affirmative, the video information and audio information recorded at SB4 are included at SB6. The moving image information, the music selection number of the music information, the user ID of the user who is the music selection subject of the music information, and the range information recorded in SB 4 are associated with each other and transmitted to the server 18 via the communication line 20. Then, this routine is terminated.

図９は、前記サーバ１８のＣＰＵ９０により実行される利用者音域特定制御の一例の要部を説明するフローチャートであり、所定の周期で繰り返し実行されるものである。 FIG. 9 is a flowchart for explaining a main part of an example of user range specifying control executed by the CPU 90 of the server 18, and is repeatedly executed at a predetermined cycle.

先ず、ＳＣ１において、前記カラオケ装置１６から前記通信回線２０を介して受信された選曲番号及び音域情報が、それらに対応づけられたユーザＩＤに対応する利用者の選曲履歴として、前記ＳＮＳデータベース１０８に記憶される。次に、ＳＣ２において、ＳＣ１にて新たに記憶された選曲履歴に係る利用者に対応して、前記音域情報に対応付けられた選曲履歴が前記ＳＮＳデータベース１０８に３曲分以上記憶されているか否かが判断される。このＳＣ２の判断が否定される場合には、それをもって本ルーチンが終了させられるが、ＳＣ２の判断が肯定される場合には、ＳＣ３において、各利用者に対応して前記ＳＮＳデータベース１０８に記憶された複数の選曲番号（選曲履歴）に対応する音域情報が読み出され、それぞれの音域幅（最低音から最高音までの音域）が判定される。次に、ＳＣ４において、ＳＣ３にて判定された複数の音域幅に関して、所定数（例えば、３曲分）の音域幅に重複して含まれる音域が、前記利用者に対応する利用者音域として特定される。次に、ＳＣ５において、ＳＣ４にて特定された利用者音域が、前記利用者のユーザＩＤと対応づけられて前記ＳＮＳデータベース１０８に記憶された後、本ルーチンが終了させられる。 First, in SC1, the music selection number and the range information received from the karaoke apparatus 16 via the communication line 20 are stored in the SNS database 108 as the user's music selection history corresponding to the user ID associated therewith. Remembered. Next, in SC2, whether or not the music selection history associated with the range information is stored in the SNS database 108 for three or more songs corresponding to the user related to the music selection history newly stored in SC1. Is judged. If the determination at SC2 is negative, the routine is terminated accordingly. If the determination at SC2 is affirmative, the routine is stored in the SNS database 108 corresponding to each user at SC3. The range information corresponding to the plurality of music selection numbers (music selection history) is read out, and the range of each range (the range from the lowest sound to the highest sound) is determined. Next, in SC4, with respect to the plurality of sound ranges determined in SC3, a sound range that overlaps with a predetermined number (for example, three songs) of the sound range is specified as the user range corresponding to the user. Is done. Next, in SC5, the user's range specified in SC4 is stored in the SNS database 108 in association with the user ID of the user, and then this routine is terminated.

図１０は、前記サーバ１８のＣＰＵ９０により実行される動画抽出制御の一例の要部を説明するフローチャートであり、所定の周期で繰り返し実行されるものである。 FIG. 10 is a flowchart for explaining a main part of an example of the moving image extraction control executed by the CPU 90 of the server 18, and is repeatedly executed at a predetermined cycle.

先ず、ＳＤ１において、前記カラオケ装置１６から前記通信回線２０を介して動画検索指示が受信されたか否かが判断される。このＳＤ１の判断が否定される場合には、それをもって本ルーチンが終了させられるが、ＳＤ１の判断が肯定される場合には、ＳＤ２において、対象となる利用者がユーザＩＤをもって前記動画情報配信システム１０に正常にログインしているか否かが判断される。このＳＤ２の判断が否定される場合には、ＳＤ９において、対象となる利用者がユーザＩＤをもって前記動画情報配信システム１０にログインすべき旨のログイン指示が前記カラオケ装置１６に送信された後、本ルーチンが終了させられるが、ＳＤ２の判断が肯定される場合には、ＳＤ３において、前記対象となる利用者に対応して前記ＳＮＳデータベース１０８に記憶された利用者音域が読み出される。次に、ＳＤ４において、前記カラオケ装置１６から、検索対象となる動画情報に対応する演奏曲の曲名又は歌手名が受信されたか否かが判断される。このＳＤ４の判断が否定されるうちは、ＳＤ４の判断が繰り返されることにより待機させられるが、ＳＤ４の判断が肯定される場合には、ＳＤ５において、前記ＳＮＳデータベース１０８に記憶された複数の動画情報のうち、ＳＤ４にて受信された曲名又は歌手名の演奏曲に対応付けられた動画情報が抽出される。次に、ＳＤ６において、ＳＤ５にて抽出された動画情報それぞれにおける音域が判定され、それら動画情報のうちから、その動画情報における音域が、ＳＤ３にて読み出された前記利用者に対応する利用者音域に適合する動画情報が抽出される。次に、ＳＤ７において、ＳＤ６にて抽出された動画情報を昇順例えば前記利用者に対応する利用者音域との適合度が高い順に並べ替えたリストが作成される。次に、ＳＤ８において、ＳＤ７にて作成された動画情報のリストが、前記通信回線２０を介して前記動画検索指示の送信元である前記カラオケ装置１６に送信された後、本ルーチンが終了させられる。 First, in SD1, it is determined whether or not a moving image search instruction has been received from the karaoke apparatus 16 via the communication line 20. If the determination of SD1 is negative, this routine is terminated accordingly. If the determination of SD1 is affirmative, in SD2, the target user has the user ID with the user ID. 10 is determined whether or not the user has logged in normally. If the determination of SD2 is negative, after a login instruction indicating that the target user should log in to the moving picture information distribution system 10 with a user ID is transmitted to the karaoke device 16 in SD9, The routine is terminated, but if the determination in SD2 is affirmed, the user's range stored in the SNS database 108 corresponding to the target user is read in SD3. Next, in SD4, it is determined whether or not the tune name or singer name of the performance song corresponding to the moving image information to be searched is received from the karaoke apparatus 16. While the determination of SD4 is denied, the process waits by repeating the determination of SD4. However, when the determination of SD4 is affirmed, a plurality of pieces of moving image information stored in the SNS database 108 are stored in SD5. Among them, the moving image information associated with the performance song of the song name or singer name received in SD4 is extracted. Next, in SD6, the sound range in each of the moving image information extracted in SD5 is determined, and from among the moving image information, the sound range in the moving image information corresponds to the user read in SD3. Movie information suitable for the range is extracted. Next, in SD7, a list is created in which the moving picture information extracted in SD6 is rearranged in ascending order, for example, in descending order of suitability with the user's sound range corresponding to the user. Next, in SD8, after the moving image information list created in SD7 is transmitted to the karaoke apparatus 16 that is the transmission source of the moving image search instruction via the communication line 20, this routine is terminated. .

図１１は、前記サーバ１８のＣＰＵ９０により実行される動画情報管理／配信制御の一例の要部を説明するフローチャートであり、所定の周期で繰り返し実行されるものである。 FIG. 11 is a flowchart for explaining a main part of an example of the moving image information management / distribution control executed by the CPU 90 of the server 18, and is repeatedly executed at a predetermined cycle.

先ず、ＳＥ１において、前記カラオケ装置１６をはじめとする通信端末装置から前記通信回線２０を介して、動画情報のアップロード（投稿）があったか否かが判断される。このＳＥ１の判断が否定される場合には、ＳＥ３以下の処理が実行されるが、ＳＥ１の判断が肯定される場合には、ＳＥ２において、アップロードされた動画情報が投稿主体である利用者のユーザＩＤ、対応する演奏曲の選曲番号、及び音域情報等と対応付けられて前記ＳＮＳデータベース１０８に記憶される。次に、ＳＥ３において、前記カラオケ装置１６をはじめとする通信端末装置から前記通信回線２０を介して、前記ＳＮＳデータベース１０８に記憶された複数の動画情報を組み合わせて共演動画情報を作成する共演動画組み合わせ指示が受信されたか否かが判断される。このＳＥ３の判断が否定される場合には、ＳＥ５以下の処理が実行されるが、ＳＥ３の判断が肯定される場合には、ＳＥ４において、指示に係る複数の動画情報が組み合わされて共演動画情報が作成され、対応する演奏曲の選曲番号等に対応付けられて前記ＳＮＳデータベース１０８に記憶される。次に、ＳＥ５において、前記カラオケ装置１６をはじめとする通信端末装置から前記通信回線２０を介して、前記ＳＮＳデータベース１０８に記憶された共演動画情報の配信要求が受信されたか否かが判断される。このＳＥ５の判断が否定される場合には、それをもって本ルーチンが終了させられるが、ＳＥ５の判断が肯定される場合には、ＳＥ６において、要求に係る共演動画情報が前記ＳＮＳデータベース１０８から読み出され、前記通信回線２０を介して要求元である前記カラオケ装置１６等の通信端末装置へ配信された後、本ルーチンが終了させられる。 First, in SE1, it is determined whether or not video information has been uploaded (posted) from the communication terminal device including the karaoke device 16 via the communication line 20. When the determination of SE1 is negative, the processing after SE3 is executed. When the determination of SE1 is affirmative, the user's user whose uploaded video information is the posting subject in SE2 It is stored in the SNS database 108 in association with the ID, the music selection number of the corresponding musical piece, the range information, and the like. Next, in SE3, a co-star video combination that creates co-star video information by combining a plurality of video information stored in the SNS database 108 from the communication terminal device including the karaoke device 16 via the communication line 20 It is determined whether an instruction has been received. If the determination of SE3 is negative, the processing from SE5 onward is executed. If the determination of SE3 is positive, in SE4, a plurality of pieces of moving image information according to the instruction are combined and the co-starring moving image information Is created and stored in the SNS database 108 in association with the music selection number of the corresponding musical piece. Next, in SE5, it is determined whether or not a distribution request for the co-star video information stored in the SNS database 108 is received from the communication terminal device including the karaoke device 16 via the communication line 20. . If the determination at SE5 is negative, the routine is terminated accordingly. If the determination at SE5 is affirmative, the requested co-star video information is read from the SNS database 108 at SE6. After being distributed to the communication terminal device such as the karaoke device 16 that is the request source via the communication line 20, this routine is terminated.

以上の制御において、ＳＡ１〜ＳＡ３、ＳＡ５、ＳＢ２、ＳＢ３、及びＳＢ５が前記カラオケ演奏制御手段１１０の動作に、ＳＡ４及びＳＢ４が前記音域判定制御手段１１２の動作に、ＳＣ１が前記音域記憶制御手段１１４の動作に、ＳＣ２〜ＳＣ５が前記利用者音域特定手段１１６の動作に、ＳＢ４及びＳＢ６が前記動画投稿制御手段１１８の動作に、ＳＥ２が前記投稿受付制御手段１２０の動作に、ＳＥ６が前記動画配信制御手段１２２の動作に、ＳＥ４が前記共演動画編集制御手段１２４の動作に、ＳＤ６が前記動画音域判定手段１２６の動作に、ＳＤ３〜ＳＤ６が前記動画抽出制御手段１２８の動作に、それぞれ対応する。 In the above control, SA1 to SA3, SA5, SB2, SB3, and SB5 are the operations of the karaoke performance control means 110, SA4 and SB4 are the operations of the range determination control means 112, and SC1 is the range storage control means 114. SC2 to SC5 are the operations of the user sound range specifying means 116, SB4 and SB6 are the actions of the moving picture posting control means 118, SE2 is the actions of the posting acceptance control means 120, and SE6 is the moving picture distribution. For the operation of the control means 122, SE4 corresponds to the operation of the co-starring moving image editing control means 124, SD6 corresponds to the operation of the moving image sound range determination means 126, and SD3 to SD6 correspond to the operation of the moving image extraction control means 128, respectively.

本実施例によれば、前記通信回線２０を介して通信端末装置である前記カラオケ装置１６等から、予め定められた複数の演奏曲情報の何れかに対応する動画情報の投稿を受け付け、その投稿された動画情報を対応する演奏曲情報と対応付けて動画データベースとしての前記ＳＮＳデータベース１０８に記憶させる投稿受付制御手段１２０（ＳＥ２）と、その投稿受付制御手段１２０により受け付けられる、共通の演奏曲情報に対応付けられた複数の動画情報を組み合わせて共演動画情報を編集し、その編集された共演動画情報をその共通の演奏曲情報と対応付けて前記ＳＮＳデータベース１０８に記憶させる共演動画編集制御手段１２４（ＳＥ４）と、音声入力装置である前記マイクロフォン７６から入力された音声情報に基づいて、その音声情報の入力主体である利用者の歌唱音声に係る利用者音域を特定する利用者音域特定制御手段１１６（ＳＣ２〜ＳＣ５）と、前記ＳＮＳデータベース１０８に記憶された複数の動画情報のうちから、その動画情報における音域が、前記利用者音域特定制御手段１１６により特定された前記利用者に対応する利用者音域に適合する動画情報を、前記共演動画情報の組み合わせ候補として抽出する動画抽出制御手段１２８（ＳＤ３〜ＳＤ６）とを、備えたものであることから、前記ＳＮＳデータベース１０８に記憶された多数の動画情報のうちから、各利用者の音域にあった適切な動画情報を、共演動画情報の組み合わせ候補として簡便に抽出することができる。すなわち、共演動画情報の組み合わせ候補の検索に関して、各利用者の音域に合った適切な動画情報の検索を実現する動画情報配信システム１０を提供することができる。 According to the present embodiment, the posting of moving picture information corresponding to any of a plurality of pieces of predetermined performance music information is received from the karaoke apparatus 16 or the like which is a communication terminal apparatus via the communication line 20, and the posting Posting reception control means 120 (SE2) for storing the recorded video information in correspondence with the corresponding performance music information and storing it in the SNS database 108 as a video database, and common performance music information received by the posting reception control means 120 Co-starring video information is edited by combining a plurality of pieces of video information associated with each other, and the edited co-starring video information is stored in the SNS database 108 in association with the common performance music information. (SE4) and the sound based on the sound information input from the microphone 76 which is a sound input device. Among the plurality of pieces of moving picture information stored in the SNS database 108, the user range specification control means 116 (SC2 to SC5) for specifying the user range related to the singing voice of the user who is the information input subject, Moving image extraction control means 128 for extracting moving image information whose sound range in the moving image information matches the user sound range corresponding to the user specified by the user sound range specifying control means 116 as a combination candidate of the co-starring moving picture information ( SD3 to SD6), from among a large number of pieces of moving picture information stored in the SNS database 108, suitable moving picture information suitable for each user's range is combined with the co-starring moving picture information. Can be easily extracted as a candidate. That is, it is possible to provide the moving image information distribution system 10 that realizes the search for the appropriate moving image information suitable for the sound range of each user regarding the search for the combination candidate of the co-starring moving image information.

前記動画抽出制御手段１２８は、前記ＳＮＳデータベース１０８に記憶された複数の動画情報のうちから、その動画情報における音域が、前記利用者音域特定制御手段１１６により特定された前記利用者に対応する利用者音域に含まれる動画情報を、前記共演動画情報の組み合わせ候補として抽出するものであるため、各利用者の利用者音域に適合し、各利用者が危なげなく歌うことができるものと思われる演奏曲に対応する動画情報を、共演動画情報の組み合わせ候補として簡便に抽出することができる。 The moving image extraction control means 128 uses the user information whose sound range in the moving image information is specified by the user sound range specifying control means 116 from among the plurality of moving image information stored in the SNS database 108. Since the video information included in the person's range is extracted as a combination candidate of the above-mentioned co-star video information, the performance music that fits the user's range of each user and that each user can sing without harm Can be easily extracted as a combination candidate of co-starring movie information.

前記動画抽出制御手段１２８は、前記ＳＮＳデータベース１０８に記憶された複数の動画情報のうちから、その動画情報における音域が、前記利用者音域特定制御手段１１６により特定された前記利用者に対応する利用者音域に規定の許容区間を加えた判定区間に含まれる動画情報を、前記共演動画情報の組み合わせ候補として抽出するものであるため、各利用者の歌唱音声に対応する音域を基準とする判定区間に適合し、各利用者が歌唱可能であるものと思われる演奏曲に対応する動画情報を、共演動画情報の組み合わせ候補として簡便に抽出することができる。 The moving image extraction control means 128 uses the user information whose sound range in the moving image information is specified by the user sound range specifying control means 116 from among the plurality of moving image information stored in the SNS database 108. Since the moving image information included in the determination interval obtained by adding the specified allowable interval to the person's range is extracted as a combination candidate of the co-starring movie information, the determination interval based on the range corresponding to each user's singing voice It is possible to easily extract moving image information corresponding to a musical piece that is considered to be sung by each user as a combination candidate of co-starring moving image information.

前記投稿受付制御手段１２０は、前記投稿された動画情報を、投稿主体である利用者に対応して前記利用者音域特定制御手段１１６により特定された利用者音域と対応付けて前記ＳＮＳデータベース１０８に記憶させるものであり、前記動画抽出制御手段１２８は、前記ＳＮＳデータベース１０８に記憶された複数の動画情報のうちから、その動画情報に対応付けられた投稿主体である利用者に対応する利用者音域が、前記利用者音域特定制御手段１１６により特定された前記利用者に対応する利用者音域に適合する動画情報を、前記共演動画情報の組み合わせ候補として抽出するものであるため、前記ＳＮＳデータベース１０８に記憶された多数の動画情報のうちから、歌唱音声に係る利用者音域が適合する投稿者により投稿された動画情報を、共演動画情報の組み合わせ候補として簡便に抽出することができる。 The posting acceptance control means 120 associates the posted moving picture information with the user sound range specified by the user sound range specifying control means 116 corresponding to the user who is the posting subject in the SNS database 108. The moving image extraction control means 128 stores the user sound range corresponding to the user who is the posting subject associated with the moving image information from among the plurality of moving image information stored in the SNS database 108. However, since the moving image information suitable for the user sound range corresponding to the user specified by the user sound range specifying control means 116 is extracted as a combination candidate of the co-starring moving image information, it is stored in the SNS database 108. Video information posted by a contributor that matches the user's range related to the singing voice from among the many stored video information And it can be easily extracted as a combination candidate of the played video.

前記動画抽出制御手段１２８は、前記ＳＮＳデータベース１０８に記憶された複数の動画情報のうちから、対応する演奏曲情報の最低音から最高音までの音域が、前記利用者音域特定制御手段１１６により特定された前記利用者に対応する利用者音域に適合する動画情報を、前記共演動画情報の組み合わせ候補として抽出するものであるため、前記ＳＮＳデータベース１０８に記憶された多数の動画情報のうちから、検索者の歌唱音声に係る利用者音域に適合する演奏曲に対応する動画情報を、共演動画情報の組み合わせ候補として簡便に抽出することができる。 The moving image extraction control unit 128 specifies, by the user range specifying control unit 116, a range from the lowest sound to the highest sound of the corresponding performance music information among the plurality of moving image information stored in the SNS database 108. Since the moving image information suitable for the user range corresponding to the user is extracted as a combination candidate of the co-starring moving image information, a search is performed from among a large number of moving image information stored in the SNS database 108. It is possible to easily extract moving image information corresponding to a performance song that matches the user's range related to the user's singing voice as a combination candidate of co-starring moving image information.

本実施例によれば、前記通信回線２０を介して前記カラオケ装置１６等から、予め定められた複数の演奏曲情報の何れかに対応する動画情報の投稿を受け付け、その投稿された動画情報を対応する演奏曲情報と対応付けて前記ＳＮＳデータベース１０８に記憶させる投稿受付制御手段１２０と、その投稿受付制御手段１２０により受け付けられる、共通の演奏曲情報に対応付けられた複数の動画情報を組み合わせて共演動画情報を編集し、その編集された共演動画情報をその共通の演奏曲情報と対応付けて前記ＳＮＳデータベース１０８に記憶させる共演動画編集制御手段１２４と、前記マイクロフォン７６から入力された音声情報に基づいて、その音声情報の入力主体である利用者の歌唱音声に係る利用者音域を特定する利用者音域特定制御手段１１６と、前記ＳＮＳデータベース１０８に記憶された複数の動画情報のうちから、その動画情報における音域が、前記利用者音域特定制御手段１１６により特定された前記利用者に対応する利用者音域に適合する動画情報を、前記共演動画情報の組み合わせ候補として抽出する動画抽出制御手段１２８とを、備えたものであることから、前記ＳＮＳデータベース１０８に記憶された多数の動画情報のうちから、各利用者の音域にあった適切な動画情報を、共演動画情報の組み合わせ候補として簡便に抽出することができる。すなわち、共演動画情報の組み合わせ候補の検索に関して、各利用者の音域に合った適切な動画情報の検索を実現するサーバ１８を提供することができる。 According to the present embodiment, the posting of moving image information corresponding to any of a plurality of predetermined pieces of performance music information is received from the karaoke device 16 or the like via the communication line 20, and the posted moving image information is Combining post acceptance control means 120 to be stored in the SNS database 108 in association with corresponding musical piece information, and a plurality of pieces of moving picture information associated with the common musical piece information accepted by the posting acceptance control means 120. The co-starring video editing control means 124 that edits the co-star video information and stores the edited co-star video information in the SNS database 108 in association with the common performance music information, and the audio information input from the microphone 76 Based on the user's range specification system that specifies the user's range related to the singing voice of the user who is the input subject of the voice information. The sound range in the moving image information among the plurality of moving image information stored in the means 116 and the SNS database 108 is adapted to the user sound range corresponding to the user specified by the user sound range specifying control means 116. Since the moving image extraction control means 128 for extracting moving image information as a combination candidate of the co-starring moving image information is provided, each user is selected from among a large number of moving image information stored in the SNS database 108. Therefore, it is possible to easily extract appropriate moving image information suitable for the sound range as a combination candidate of co-starring moving image information. That is, it is possible to provide a server 18 that realizes a search for appropriate moving image information suitable for each user's sound range with respect to searching for a combination candidate of co-starring moving image information.

以上、本発明の好適な実施例を図面に基づいて詳細に説明したが、本発明はこれに限定されるものではなく、更に別の態様においても実施される。 The preferred embodiments of the present invention have been described in detail with reference to the drawings. However, the present invention is not limited to these embodiments, and may be implemented in other modes.

例えば、前述の実施例においては、通信端末装置としての前記カラオケ装置１６から動画情報の投稿を行う態様について説明したが、本発明はこれに限定されるものではなく、更に別の態様においても実施される。例えば、前記動画情報配信システム１０は、前記パーソナルコンピュータ３６等の他の通信端末装置から動画情報の投稿を受け付けるものであってもよい。斯かる態様において、前記動画抽出制御手段１２８は、好適には、抽出された動画情報のリストを前記通信回線２０等を介して前記パーソナルコンピュータ３６等の通信端末装置へ送信する。 For example, in the above-described embodiment, the aspect in which the moving image information is posted from the karaoke apparatus 16 as the communication terminal apparatus has been described. However, the present invention is not limited to this and may be implemented in another aspect. Is done. For example, the moving image information distribution system 10 may accept posting of moving image information from another communication terminal device such as the personal computer 36. In such an aspect, the moving image extraction control unit 128 preferably transmits the extracted moving image information list to the communication terminal device such as the personal computer 36 via the communication line 20 or the like.

前述の実施例においては、前記カラオケ演奏制御手段１１０、前記音域判定制御手段１１２、及び前記動画投稿制御手段１１８が前記カラオケ装置１６のＣＰＵ４２に、前記音域記憶制御手段１１４、前記利用者音域特定手段１１６、前記投稿受付制御手段１２０、前記動画配信制御手段１２２、前記共演動画編集制御手段１２４、前記動画音域判定手段１２６、及び前記動画抽出制御手段１２８が前記サーバ１８のＣＰＵ９０に、それぞれ機能的に備えられた構成について説明したが、本発明はこれに限定されるものではなく、更に別の態様においても実施される。例えば、図４に示す各制御手段すなわち前記カラオケ演奏制御手段１１０、前記音域判定制御手段１１２、前記音域記憶制御手段１１４、前記利用者音域特定手段１１６、前記動画投稿制御手段１１８、前記投稿受付制御手段１２０、前記動画配信制御手段１２２、前記共演動画編集制御手段１２４、前記動画音域判定手段１２６、及び前記動画抽出制御手段１２８が何れも前記カラオケ装置１６のＣＰＵ４２に機能的に備えられ、そのカラオケ装置１６内で制御が完結するものであってもよい。斯かる態様において、前記ＳＮＳデータベース１０８は、前記カラオケ装置１６のハードディスク４８等に設けられる。すなわち、本発明は、通信回線に接続されない非通信型のカラオケ装置にも好適に適用される。 In the above-described embodiment, the karaoke performance control means 110, the sound range determination control means 112, and the moving image posting control means 118 are connected to the CPU 42 of the karaoke apparatus 16 by the sound range storage control means 114, the user sound range specifying means. 116, the posting acceptance control means 120, the moving picture distribution control means 122, the co-starring moving picture editing control means 124, the moving picture sound range determination means 126, and the moving picture extraction control means 128 are functionally connected to the CPU 90 of the server 18, respectively. Although the provided structure was demonstrated, this invention is not limited to this, Furthermore, it implements in another aspect. For example, each control means shown in FIG. 4, that is, the karaoke performance control means 110, the sound range determination control means 112, the sound range storage control means 114, the user sound range specifying means 116, the moving image posting control means 118, the posting acceptance control. The means 120, the moving picture distribution control means 122, the co-starring moving picture editing control means 124, the moving picture sound range determination means 126, and the moving picture extraction control means 128 are all functionally provided in the CPU 42 of the karaoke apparatus 16, and the karaoke The control may be completed within the device 16. In such an aspect, the SNS database 108 is provided in the hard disk 48 of the karaoke apparatus 16 or the like. That is, the present invention is also suitably applied to a non-communication karaoke apparatus that is not connected to a communication line.

また、前述の実施例においては、通信端末装置として、前記カラオケ装置１６、携帯電話機３０、タブレット３２、パーソナルコンピュータ３６を備えた動画情報配信システム１０を例示したが、本発明の動画情報配信システムに備えられる通信型端末装置としては他の機器も考えられ、例えば、通信機能を備えたデジタルビデオカメラ、家庭用ゲーム機、或いは通信機能及びタッチパネルディスプレイを備えた携帯型音楽プレイヤ等を通信端末装置として備えたものであってもよい。 In the above-described embodiment, the moving image information distribution system 10 including the karaoke device 16, the mobile phone 30, the tablet 32, and the personal computer 36 is exemplified as the communication terminal device. Other devices may be considered as the communication type terminal device provided, for example, a digital video camera having a communication function, a home game machine, or a portable music player having a communication function and a touch panel display as the communication terminal device. It may be provided.

その他、一々例示はしないが、本発明はその趣旨を逸脱しない範囲内において種々の変更が加えられて実施されるものである。 In addition, although not illustrated one by one, the present invention is implemented with various modifications within a range not departing from the gist thereof.

１０：動画情報配信システム、１６：カラオケ装置（通信端末装置）、１８：サーバ、２０：通信回線、３０：携帯電話機（通信端末装置）、３２：タブレット（通信端末装置）、３６：パーソナルコンピュータ（通信端末装置）、７６：マイクロフォン（音声入力装置）、１０８：ＳＮＳデータベース（動画データベース）、１１６：利用者音域特定手段、１２０：投稿受付制御手段、１２４：共演動画編集制御手段、１２８：動画抽出制御手段、１３０：共演動画 10: Video information distribution system, 16: Karaoke device (communication terminal device), 18: Server, 20: Communication line, 30: Mobile phone (communication terminal device), 32: Tablet (communication terminal device), 36: Personal computer ( Communication terminal device), 76: microphone (speech input device), 108: SNS database (moving image database), 116: user sound range specifying means, 120: posting acceptance control means, 124: co-starring moving image editing control means, 128: moving image extraction Control means, 130: Co-star movie

Claims

A video information distribution system that distributes video information to a plurality of communication terminal devices via a communication line,
Accepting posting of moving image information corresponding to any of a plurality of predetermined pieces of performance music information from the communication terminal device via the communication line, and associating the posted video information with the corresponding performance music information Post acceptance control means to be stored in the video database;
The co-star video information is edited by combining a plurality of video information associated with the common performance music information received by the posting reception control means, and the edited co-star video information is associated with the common music performance information. Co-starring video editing control means to be stored in the video database;
Based on the voice information input from the voice input device, user range specification control means for specifying the user range related to the singing voice of the user who is the input subject of the voice information;
Among the plurality of moving image information stored in the moving image database, the moving image information in which the sound range in the moving image information matches the user sound range corresponding to the user specified by the user sound range specifying control unit, A moving picture information distribution system comprising: moving picture extraction control means for extracting as a combination candidate of co-starring moving picture information.

The moving image extraction control unit is configured such that a sound range in the moving image information is a user sound range corresponding to the user specified by the user sound range specifying control unit from among a plurality of pieces of moving image information stored in the moving image database. The moving picture information distribution system according to claim 1, wherein the moving picture information included is extracted as a combination candidate of the co-starring moving picture information.

The moving image extraction control unit is configured such that a sound range in the moving image information is a user sound range corresponding to the user specified by the user sound range specifying control unit from among a plurality of pieces of moving image information stored in the moving image database. The moving picture information distribution system according to claim 1, wherein moving picture information included in a determination section to which a predetermined allowable section is added is extracted as a combination candidate of the co-starring moving picture information.

The posting acceptance control means stores the posted moving picture information in the moving picture database in association with the user sound range specified by the user sound range specifying control means corresponding to the user who is the posting subject. And
The moving image extraction control means includes a user range corresponding to a user who is a posting subject associated with the moving image information among a plurality of pieces of moving image information stored in the moving image database. The moving image information distribution according to any one of claims 1 to 3, wherein moving image information matching a user sound range corresponding to the user specified by the means is extracted as a combination candidate of the co-starring moving image information. system.

The moving image extraction control unit is configured such that, from among a plurality of pieces of moving image information stored in the moving image database, a range from the lowest sound to the highest sound of the corresponding performance music information is specified by the user sound range specifying control unit The moving picture information distribution system according to any one of claims 1 to 3, wherein moving picture information that matches a user sound range corresponding to a user is extracted as a combination candidate of the co-starring moving picture information.

A server that distributes video information to a plurality of communication terminal devices via a communication line,
Accepting posting of moving image information corresponding to any of a plurality of predetermined pieces of performance music information from the communication terminal device via the communication line, and associating the posted video information with the corresponding performance music information Post acceptance control means to be stored in the video database;
The co-star video information is edited by combining a plurality of video information associated with the common performance music information received by the posting reception control means, and the edited co-star video information is associated with the common music performance information. Co-starring video editing control means to be stored in the video database;
Based on the voice information input from the voice input device, user range specification control means for specifying the user range related to the singing voice of the user who is the input subject of the voice information;
Among the plurality of moving image information stored in the moving image database, the moving image information in which the sound range in the moving image information matches the user sound range corresponding to the user specified by the user sound range specifying control unit, A server comprising: a moving image extraction control means for extracting as a combination candidate of co-starring moving image information.