JP2004501576A - Karaoke system - Google Patents

Karaoke system Download PDF

Info

Publication number
JP2004501576A
JP2004501576A JP2002504133A JP2002504133A JP2004501576A JP 2004501576 A JP2004501576 A JP 2004501576A JP 2002504133 A JP2002504133 A JP 2002504133A JP 2002504133 A JP2002504133 A JP 2002504133A JP 2004501576 A JP2004501576 A JP 2004501576A
Authority
JP
Japan
Prior art keywords
video
user
audio
karaoke
image
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
JP2002504133A
Other languages
Japanese (ja)
Inventor
コルセ,イザベル
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Koninklijke Philips NV
Original Assignee
Koninklijke Philips Electronics NV
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Koninklijke Philips Electronics NV filed Critical Koninklijke Philips Electronics NV
Publication of JP2004501576A publication Critical patent/JP2004501576A/en
Pending legal-status Critical Current

Links

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N5/00Details of television systems
    • H04N5/222Studio circuitry; Studio devices; Studio equipment
    • H04N5/262Studio circuits, e.g. for mixing, switching-over, change of character of image, other special effects ; Cameras specially adapted for the electronic generation of special effects
    • H04N5/272Means for inserting a foreground image in a background image, i.e. inlay, outlay
    • GPHYSICS
    • G11INFORMATION STORAGE
    • G11BINFORMATION STORAGE BASED ON RELATIVE MOVEMENT BETWEEN RECORD CARRIER AND TRANSDUCER
    • G11B31/00Arrangements for the associated working of recording or reproducing apparatus with related apparatus
    • G11B31/02Arrangements for the associated working of recording or reproducing apparatus with related apparatus with automatic musical instruments
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10HELECTROPHONIC MUSICAL INSTRUMENTS; INSTRUMENTS IN WHICH THE TONES ARE GENERATED BY ELECTROMECHANICAL MEANS OR ELECTRONIC GENERATORS, OR IN WHICH THE TONES ARE SYNTHESISED FROM A DATA STORE
    • G10H1/00Details of electrophonic musical instruments
    • G10H1/36Accompaniment arrangements
    • G10H1/361Recording/reproducing of accompaniment for use with an external source, e.g. karaoke systems
    • G10H1/368Recording/reproducing of accompaniment for use with an external source, e.g. karaoke systems displaying animated or moving pictures synchronized with the music or audio part
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10HELECTROPHONIC MUSICAL INSTRUMENTS; INSTRUMENTS IN WHICH THE TONES ARE GENERATED BY ELECTROMECHANICAL MEANS OR ELECTRONIC GENERATORS, OR IN WHICH THE TONES ARE SYNTHESISED FROM A DATA STORE
    • G10H2220/00Input/output interfacing specifically adapted for electrophonic musical tools or instruments
    • G10H2220/005Non-interactive screen display of musical or status data
    • G10H2220/011Lyrics displays, e.g. for karaoke applications
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10HELECTROPHONIC MUSICAL INSTRUMENTS; INSTRUMENTS IN WHICH THE TONES ARE GENERATED BY ELECTROMECHANICAL MEANS OR ELECTRONIC GENERATORS, OR IN WHICH THE TONES ARE SYNTHESISED FROM A DATA STORE
    • G10H2220/00Input/output interfacing specifically adapted for electrophonic musical tools or instruments
    • G10H2220/155User input interfaces for electrophonic musical instruments
    • G10H2220/441Image sensing, i.e. capturing images or optical patterns for musical purposes or musical control purposes
    • G10H2220/455Camera input, e.g. analyzing pictures from a video camera and using the analysis results as control data

Abstract

カラオケを歌う概念は、オーディオベースの技法にだけ基づく。本発明は、ビデオの中に入る概念を展開し、ユーザは、お気に入りのシンガー又はプレーヤーの代わりにビデオクリップ又は映画に自身の画像を挿入して見ることができ、従って、ビデオテープでクリップ/ソングを再生することができ、どの有名な人とも置き換えることができる。より明確には、本発明は、ユーザの画像及び音声を捕捉する手段、得られた信号を分析し処理する手段、このように分析処理されたオーディオ/ビデオ信号及び事前記録された素材をミックスする手段、及び、得られる、組み合わされた信号を表示する手段を連続して有するカラオケシステムに関わる。The concept of singing karaoke is based solely on audio-based techniques. The present invention expands the concept of falling into video, allowing users to insert and view their images in video clips or movies instead of their favorite singers or players, and thus clip / song on videotape. Can be played and replaced with any famous person. More specifically, the invention relates to a means for capturing a user's image and sound, a means for analyzing and processing the resulting signal, mixing the audio / video signal thus analyzed and pre-recorded material. Means and a karaoke system having successively means for displaying the resulting combined signal.

Description

【0001】
本発明は、ビデオクリップ又は映画のようなシーケンス中に歌うカラオケシステムに関わる。
【0002】
例えば、欧州特許出願EP0782338に開示するようなカラオケシステムでは、音楽、歌詞、又は、任意の種類のオーディオデータが送信局から分配局に送信される。システムの主モジュールの音楽制御手段は、モニタの内蔵型スピーカを通じて音楽を流し、見えないマイクロホンから音声を上記スピーカを通じて流す。画像制御手段は、背景画像(例えば、ビデオ画像又は背景画像記憶手段から抽出された静止画像)をモニタ上に表示し、歌詞制御手段は、背景画像の上に重畳して歌詞を表示する。CCDカメラのような画像捕捉装置は、シンガーの画像を捕捉し、ビデオ画像制御手段を通じてその画像をモニタのスクリーン上に重畳し重畳された画像とする。このようなシステムは、カラオケの概念において「ビデオミクシング」と定義される。
【0003】
本発明は、更なる機能性を具備する別の種類のカラオケシステムを提供することを目的とする。
【0004】
このためにシステムは、ビデオクリップ又は映画のようなシーケンス中に歌い、ユーザの画像及び音声を捕捉する捕捉装置と、ユーザの少なくとも一部分を背景から分離する分析及び処理装置と、上記分析及び処理装置の出力信号を事前記録された素材と組合すミクシング及びレンダリング装置と、組合し信号を表示する表示装置とを連続して有するカラオケシステムに関わる。
【0005】
今日まで、カラオケを歌うといった概念は、オーディオに基づく技法にだけ基づき、機能性に制限をつけ、ユーザがビデオの想像の世界に実際に入り込む可能性はなかった。ビデオミクシングの考えを導入した提案される解決策により、このカラオケの概念をビデオ、より特定的には、完全にオーディオ−ビデオに入り込む概念に発展させ、上記概念によると、歌のビデオクリップ中の音声及び顔は偶然のシンガー(以降、彼/彼女はシンガー、プレーヤー、ダンサー等でもよいためユーザと称する)の音声及び顔によって置き換えられ得る。同じ提案された技法は、他の文脈、例えば、eコマースの分野、又は、事前記録されたコンテンツのビデオ編集においても同様に適用できる。
【0006】
ここで、本発明を添付の図面を参照して例によって以下に説明する。
【0007】
図1に示すように、本発明によるカラオケシステムを実施するために必要な異なるサブシステムは、主として、分析及び処理システム11とミクシング及びレンダリング装置12である。
【0008】
分析及び処理装置11は、捕捉装置10によって捕捉されたユーザの画像及び音声(黒で示される人)を受信し、例えば、ユーザの顔を背景から分離してアルファ面を画成するセグメント化回路(このような回路は、例えば、ユーザがブルースクリーンの前のステージに上がった場合にクロムキー技法に基づき得る)を含む。ミクシング及びレンダリング装置12は、装置11において分析された形状情報を用い、媒体13から伝えられる背景の事前記録されたビデオ又はオーディオ−ビデオ(上記事前記録された素材は媒体の左側に示される)とユーザを合成する回路である。この構成により、上記ユーザの音声と歌の背景が事前記録された音楽とをミックスするために供給されるオーディオ部分が完成される。装置11によって画成されるアルファ面を用いて、次のタイプの関係:[(ビデオ1×アルファ)+(ビデオ2×(255−アルファ))]/255=最終ビデオに従って2つのソースを簡単に組合すことができる。モニタのような表示装置14は、最終結果(即ち、事前記録された素材と特にユーザに属するものとの組合せ)を表示するために最終的に使用される。
【0009】
明らかに、質を改善するためには、装置11中で実行される分析は8ビットのアルファ面を生成し得、これにより表面が覆われたオブジェクトの限界でより良くミックスすることを可能にする。システムは、ユーザの頭だけ又は彼/彼女の身体全体を置き換えることができることに注意する。
【0010】
オーディオ/ビデオの種類に関して他のケースも考えられる:
a)2つのオーディオ/ビデオソースが圧縮されていない:このオプションは、シンガーの身体全体がクリップ/映画の中で覆われているときに(事前記録されたデータはテープに記憶され、あるシンガーのビデオが分析されビデオミキサに直接的に伝送され得る)例えば、カラオケレストランにおいて使用され得、
b)1つの又は2つのソースが圧縮されている:このような場合の一つの適応された構成は、新しく展開されたMPEG−4規格であり、オブジェクトの形状及びアルファ面、ここではあるユーザの顔をエンコードすることを可能にする(MPEG−4はオーディオ及びビデオオブジェクトの構成を可能にする全体的なシステム構成を定める)。
【0011】
本発明の他のケースの適用法も考えられる:
a)ユーザがミクシング動作の結果を記録することを望む:これは、図2に示すように、図1の実行に類似するシステムであるが更なる記録装置25を含む。
【0012】
b)幾つかの場合において、カラオケシステムはオンラインで稼動する:事前記録されたクリップはデータベース(例えば、インターネット)に記憶され、ユーザが家で彼/彼女のパフォーマンスを録画し、合成カラオケクリップを生成して彼/彼女のホームページに載せることを望む(圧縮技法の使用は、このような場合、より一般的には、帯域幅が制限された環境において実行される全ての適用法において特に有用である。)、
c)更に幾つかの場合では、ユーザが彼/彼女の頭だけを元のシンガーの頭と置き換えることを望み、ユーザの頭の位置と元のシンガーの身体の向き及び姿勢が適合することが必要なため、ミクシング及びレンダリング装置12において更に処理することが必要となる。
【図面の簡単な説明】
【図1】本発明によるカラオケシステムのブロック図である。
【図2】本発明によるカラオケシステムの別の実行法を示す図である。
[0001]
The present invention relates to a karaoke system that sings during a sequence such as a video clip or movie.
[0002]
For example, in a karaoke system as disclosed in European Patent Application EP0782338, music, lyrics, or any type of audio data is transmitted from a transmitting station to a distribution station. The music control means of the main module of the system plays the music through the built-in speaker of the monitor and the sound from the invisible microphone through the speaker. The image control means displays a background image (for example, a video image or a still image extracted from the background image storage means) on a monitor, and the lyrics control means displays lyrics superimposed on the background image. An image capture device, such as a CCD camera, captures the image of the singer and superimposes that image on the screen of the monitor through the video image control means to produce a superimposed image. Such a system is defined as "video mixing" in the karaoke concept.
[0003]
It is an object of the present invention to provide another type of karaoke system with further functionality.
[0004]
To this end, the system comprises a capture device for singing in a sequence, such as a video clip or movie, capturing the image and sound of the user, an analysis and processing device for separating at least a part of the user from the background, and the analysis and processing device. The present invention relates to a karaoke system having a mixing and rendering device for combining the output signal of the above with pre-recorded material and a display device for displaying the combined signal.
[0005]
To date, the concept of singing karaoke has been based solely on audio-based techniques, with limited functionality, and it has never been possible for a user to actually get into the imaginary world of video. The proposed solution, which introduced the idea of video mixing, evolved this karaoke concept into the concept of video, and more specifically, completely into audio-video, and according to the above concept, in the video clip of a song, The voice and face may be replaced by the voice and face of a casual singer (hereafter referred to as a user as he / she may be a singer, player, dancer, etc.). The same proposed technique can be applied in other contexts as well, for example in the field of e-commerce, or in video editing of pre-recorded content.
[0006]
The invention will now be described by way of example with reference to the accompanying drawings.
[0007]
As shown in FIG. 1, the different subsystems required to implement a karaoke system according to the invention are mainly an analysis and processing system 11 and a mixing and rendering device 12.
[0008]
The analysis and processing device 11 receives the user's image and sound (the person shown in black) captured by the capture device 10 and, for example, segments the user's face from the background to define an alpha plane. (Such circuits may be based on chrome key techniques, for example, if the user steps up to a stage before the blue screen). The mixing and rendering device 12 uses the shape information analyzed in the device 11 to generate a pre-recorded video or audio-video of the background transmitted from the medium 13 (the pre-recorded material is shown on the left side of the medium). This is a circuit for combining users. This configuration completes the audio portion that is provided to mix the user's voice with the music with pre-recorded song background. Using the alpha plane defined by the device 11, the following type of relationship can be easily obtained: [(Video 1 × Alpha) + (Video 2 × (255-Alpha))] / 255 = Two sources according to the final video. Can be combined. A display device 14, such as a monitor, is ultimately used to display the final result (i.e., a combination of pre-recorded material and especially belonging to the user).
[0009]
Obviously, in order to improve the quality, the analysis performed in the device 11 can produce an 8-bit alpha plane, which allows to mix better at the limits of the object covered by the surface . Note that the system can replace only the user's head or his / her entire body.
[0010]
Other cases for audio / video types are possible:
a) Two audio / video sources are uncompressed: This option is used when the entire singer's body is covered in a clip / movie (pre-recorded data is stored on tape and some singer's Video can be analyzed and transmitted directly to a video mixer), for example, used in a karaoke restaurant;
b) One or two sources are compressed: one adapted configuration in such a case is the newly expanded MPEG-4 standard, which is the object's shape and alpha plane, here the user's Enables encoding faces (MPEG-4 defines an overall system configuration that allows for the organization of audio and video objects).
[0011]
Applications for other cases of the invention are also conceivable:
a) The user wants to record the result of the mixing operation: this is a system similar to the implementation of FIG. 1 but including a further recording device 25, as shown in FIG.
[0012]
b) In some cases, the karaoke system runs online: the pre-recorded clips are stored in a database (eg, the Internet) and the user records his / her performance at home and generates a synthetic karaoke clip (Using compression techniques is particularly useful in such applications, and more generally, in all applications performed in a bandwidth-limited environment). .),
c) In some further cases, the user wants to replace only his / her head with the original singer's head, and the user's head position must match the original singer's body orientation and posture. Therefore, further processing is required in the mixing and rendering device 12.
[Brief description of the drawings]
FIG. 1 is a block diagram of a karaoke system according to the present invention.
FIG. 2 is a diagram showing another execution method of the karaoke system according to the present invention.

Claims (1)

ビデオクリップ又は映画のようなシーケンス中に歌うカラオケシステムであって、
ユーザの画像及び音声を捕捉する捕捉装置と、上記ユーザの少なくとも一部分を背景から分離する分析及び処理装置と、上記分析及び処理装置の出力信号を事前記録された素材と組合すミクシング及びレンダリング装置と、上記組み合わせ信号を表示する表示装置とを連続して有するカラオケシステム。
A karaoke system for singing during a sequence such as a video clip or movie,
A capture device for capturing a user's image and sound; an analysis and processing device for separating at least a portion of the user from the background; and a mixing and rendering device for combining the output signal of the analysis and processing device with pre-recorded material. Karaoke system having a display device for displaying the combination signal continuously.
JP2002504133A 2000-06-20 2001-06-15 Karaoke system Pending JP2004501576A (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
EP00401758 2000-06-20
PCT/EP2001/006884 WO2001099413A2 (en) 2000-06-20 2001-06-15 Karaoke system

Publications (1)

Publication Number Publication Date
JP2004501576A true JP2004501576A (en) 2004-01-15

Family

ID=8173734

Family Applications (1)

Application Number Title Priority Date Filing Date
JP2002504133A Pending JP2004501576A (en) 2000-06-20 2001-06-15 Karaoke system

Country Status (6)

Country Link
US (1) US20020007718A1 (en)
EP (1) EP1297692A2 (en)
JP (1) JP2004501576A (en)
KR (1) KR20020026374A (en)
CN (1) CN1383543A (en)
WO (1) WO2001099413A2 (en)

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2006527529A (en) * 2003-05-02 2006-11-30 アラン ロバート ステイカー、 Interactive system and method for video composition
US8824861B2 (en) 2008-07-01 2014-09-02 Yoostar Entertainment Group, Inc. Interactive systems and methods for video compositing
US10332560B2 (en) 2013-05-06 2019-06-25 Noo Inc. Audio-video compositing and effects

Families Citing this family (25)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
GB2389221A (en) * 2002-05-15 2003-12-03 Stuart Arnold Recording to provide a rock star experience
US7053915B1 (en) * 2002-07-30 2006-05-30 Advanced Interfaces, Inc Method and system for enhancing virtual stage experience
US7734070B1 (en) * 2002-12-31 2010-06-08 Rajeev Sharma Method and system for immersing face images into a video sequence
AU2004281154A1 (en) * 2003-10-16 2005-04-28 Novartis Vaccines And Diagnostics, Inc. 2,6-disubstituted quinazolines, quinoxalines, quinolines and isoquinolines as inhibitors of Raf kinase for treatment of cancer
US7517219B2 (en) * 2004-02-20 2009-04-14 Mcdonald Michael Method of providing specialized dance videos
US20050206751A1 (en) * 2004-03-19 2005-09-22 East Kodak Company Digital video system for assembling video sequences
CN1312912C (en) * 2004-10-21 2007-04-25 上海交通大学 Entertainment system for video frequency real time synthesizing and recording
CA2633650A1 (en) * 2004-11-04 2006-05-18 Megamedia, Llc Apparatus and methods for encoding data for video compositing
KR20060127459A (en) * 2005-06-07 2006-12-13 엘지전자 주식회사 Digital broadcasting terminal with converting digital broadcasting contents and method
US8172638B2 (en) * 2005-08-06 2012-05-08 Parental Media LLC Method and apparatus for education and entertainment
US20070122786A1 (en) * 2005-11-29 2007-05-31 Broadcom Corporation Video karaoke system
GB0525789D0 (en) * 2005-12-19 2006-01-25 Landesburg Andrew Live performance entertainment apparatus and method
JP2007228343A (en) * 2006-02-24 2007-09-06 Orion Denki Kk Digital broadcast receiver
US8572642B2 (en) 2007-01-10 2013-10-29 Steven Schraga Customized program insertion system
US20080276792A1 (en) * 2007-05-07 2008-11-13 Bennetts Christopher L Lyrics superimposed on video feed
EP2141689A1 (en) 2008-07-04 2010-01-06 Koninklijke KPN N.V. Generating a stream comprising interactive content
JP5457092B2 (en) * 2009-07-03 2014-04-02 オリンパスイメージング株式会社 Digital camera and composite image display method of digital camera
US20110285878A1 (en) * 2010-05-24 2011-11-24 Yunshu Zhang Method for generating multimedia data to be displayed on display apparatus and associated multimedia player
CN102231272A (en) * 2011-01-21 2011-11-02 辜进荣 Method and device for synthesizing network videos and audios
CN102496359A (en) * 2011-11-28 2012-06-13 华为终端有限公司 Method and device for realizing multi-party remote karaoke
EP2805483A4 (en) * 2012-01-20 2016-03-02 Karaoke Reality Video Inc Interactive audio/video system and method
CN104424624B (en) * 2013-08-28 2018-04-10 中兴通讯股份有限公司 A kind of optimization method and device of image synthesis
US20190018572A1 (en) * 2015-01-13 2019-01-17 Google Inc. Content item players with voice-over on top of existing media functionality
CN104967900B (en) * 2015-05-04 2018-08-07 腾讯科技(深圳)有限公司 A kind of method and apparatus generating video
CN110164242B (en) * 2019-06-04 2020-12-08 平顶山学院 Vocal music singing simulation training platform

Family Cites Families (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPH10282975A (en) * 1997-04-04 1998-10-23 Amtex Kk Karaoke system
US6514083B1 (en) * 1998-01-07 2003-02-04 Electric Planet, Inc. Method and apparatus for providing interactive karaoke entertainment
JP2000209500A (en) * 1999-01-14 2000-07-28 Daiichikosho Co Ltd Method for synthesizing portrait image separately photographed with recorded background video image and for outputting the synthesized image for display and karaoke machine adopting this method

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2006527529A (en) * 2003-05-02 2006-11-30 アラン ロバート ステイカー、 Interactive system and method for video composition
US8824861B2 (en) 2008-07-01 2014-09-02 Yoostar Entertainment Group, Inc. Interactive systems and methods for video compositing
US9143721B2 (en) 2008-07-01 2015-09-22 Noo Inc. Content preparation systems and methods for interactive video systems
US10332560B2 (en) 2013-05-06 2019-06-25 Noo Inc. Audio-video compositing and effects

Also Published As

Publication number Publication date
CN1383543A (en) 2002-12-04
WO2001099413A3 (en) 2002-03-07
KR20020026374A (en) 2002-04-09
WO2001099413A2 (en) 2001-12-27
US20020007718A1 (en) 2002-01-24
EP1297692A2 (en) 2003-04-02

Similar Documents

Publication Publication Date Title
JP2004501576A (en) Karaoke system
JP3615195B2 (en) Content recording / playback apparatus and content editing method
TW583877B (en) Synchronization of music and images in a camera with audio capabilities
JP5225847B2 (en) Information processing terminal, music information generation method, and program
US8170239B2 (en) Virtual recording studio
JP2003076380A (en) Method of displaying videos of users' own making as karaoke sing-along background videos with karaoke sing- along machines scattered in various places
CN102197646A (en) System and method for generating multichannel audio with a portable electronic device eg using pseudo-stereo
US20110009988A1 (en) Content reproduction apparatus and content reproduction method
WO2006011399A1 (en) Information processing device and method, recording medium, and program
EP1585109A1 (en) Information transmission method and device, information recording or reproduction method and device, and recording medium
JP4030440B2 (en) Message reproducing apparatus, message recording and reproducing method, and program
WO2007071954A1 (en) Live performance entertainment apparatus and method
JP2010140278A (en) Voice information visualization device and program
JP4786225B2 (en) Karaoke device, program, and ranking summary server
JP4471640B2 (en) Music player
Cremer et al. Machine-assisted editing of user-generated content
JPH086577A (en) Karaoke device
JP3743321B2 (en) Data editing method, information processing apparatus, server, data editing program, and recording medium
JP2006221253A (en) Image processor and image processing program
KR200354858Y1 (en) Multi-functional karaoke system
JP2006119178A (en) Content processing method and content processor
JP4642685B2 (en) Online karaoke system, karaoke device, and method that can play back songs recorded at any time.
JP4422538B2 (en) Sound playback device
KR20000049304A (en) Photograph apparatus and method for music video using a karaoke system
Gomes et al. Exploring audio immersion using user-generated recordings