WO2010102495A1 - Videophone and method for realizing karaoke between videophones - Google Patents

Videophone and method for realizing karaoke between videophones Download PDF

Info

Publication number
WO2010102495A1
WO2010102495A1 PCT/CN2009/075605 CN2009075605W WO2010102495A1 WO 2010102495 A1 WO2010102495 A1 WO 2010102495A1 CN 2009075605 W CN2009075605 W CN 2009075605W WO 2010102495 A1 WO2010102495 A1 WO 2010102495A1
Authority
WO
WIPO (PCT)
Prior art keywords
videophone
karaoke
mtv
local
mode
Prior art date
Application number
PCT/CN2009/075605
Other languages
French (fr)
Chinese (zh)
Inventor
陈瑞
杨起
Original Assignee
中兴通讯股份有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 中兴通讯股份有限公司 filed Critical 中兴通讯股份有限公司
Publication of WO2010102495A1 publication Critical patent/WO2010102495A1/en

Links

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N7/00Television systems
    • H04N7/14Systems for two-way working
    • H04N7/141Systems for two-way working between two video terminals, e.g. videophone

Definitions

  • the present invention relates to the field of videophone technology, and in particular, to a videophone and a method for implementing a camera between videophones.
  • BACKGROUND OF THE INVENTION Karaoke originated in Japan in the 1960s in the 1960s. Due to its rich entertainment and extensive participation, it has been widely spread around the world. At present, singing karaoke has become a leisure way for people. From commercial KTV entertainment venues, home theaters for home use, to karaoke for the Internet, karaoke is available in a variety of forms, prompting more and more people to participate. The traditional karaoke is a gathering of family and friends, in a place, with a display terminal.
  • an object of the present invention is to provide a videophone and a method for implementing karaoke between videophones, so as to solve the problem in the prior art that only video calls and local play files can be played between videophones, but cannot be performed. Karaoke problem.
  • a videophone including a videophone protocol stack, a modem, and a camera, and a karaoke function module and an audio mixing and synthesizing module, wherein A karaoke function module for selecting a karaoke mode and a local MTV (Music television) file, and playing a local MTV file; the local MTV file includes audio and video; an audio mixing synthesis module, The user's voice (ie, the microphone tone) and the audio of the local MTV file (ie, the accompaniment) are synthesized into the audio source (synthesized sound) of the videophone, and the audio source of the synthesized videophone is passed through the videophone protocol stack and The modem is sent to the peer.
  • a karaoke function module for selecting a karaoke mode and a local MTV (Music television) file, and playing a local MTV file
  • the local MTV file includes audio and video
  • an audio mixing synthesis module The user's voice (ie, the microphone tone) and the
  • the videophone, the video of the local MTV file (ie, the image played by the local MTV file) or the image captured by the camera is a video source of the videophone, and the video source of the videophone is used by the videophone
  • the protocol stack and modem are sent to the peer.
  • the karaoke function module further has a switching function, and can switch between the karaoke and the call state.
  • the karaoke mode is an MTV mode or a self-made MTV mode.
  • the video source of the videophone is a video of a local MTV file; if the karaoke mode is a self-made MTV mode
  • the video source of the videophone is an image captured by a camera of the videophone.
  • the local MTV file is a karaoke track (such as an existing commercially available MTV, a network downloaded MTV, etc.) or an MTV recorded by the user himself.
  • a method for realizing karaoke between videophones comprising: a calling videophone initiates a videophone call, a called videophone picks up a call, and establishes a videophone call; the calling videophone passes its karaoke function module Setting a karaoke mode, selecting a local MTV file and playing, wherein the local MTV file includes audio and video; the calling videophone passes the user's voice and the audio of the local MTV file through its audio mixing synthesis module (ie The accompaniment sound is synthesized as the audio source of the videophone, and the synthesized audio source is sent to the called videophone through its video telephony stack and modem.
  • the video source of the calling videophone is the video of the local MTV file (that is, the image played by the local MTV file) or the image captured by the camera of the calling videophone, and the video source is The telephone protocol stack and modem are sent to the called videophone.
  • the karaoke function module of the calling videophone can switch between the karaoke and the call state: when the videophone is in the call state, the karaoke function module can be switched to the karaoke state; When the videophone is in the karaoke state, it can be switched to the call state through the karaoke function module.
  • the calling videophone sends The video source sent to the called videophone is the video of the local MTV file; if the karaoke mode set is the self-made MTV mode, the video source that the calling videophone sends to the called videophone is the primary videophone.
  • the local MTV file is a karaoke track (such as an existing commercially available MTV, a network downloaded MTV, etc.) or an MTV recorded by the user himself.
  • the present invention has the following beneficial effects: Since the audio source is synthesized by two channels of audio data (user sound and local MTV file audio stream), the video source can select the image captured by the camera or the video stream of the local MTV file. Therefore, as long as the karaoke function module and the audio mixing synthesis module are added to the existing videophone, the hardware does not need to be added, and the cost is low; in addition, the user can perform karaoke with family and friends anytime and anywhere, which can be very Convenient remote entertainment interaction; In addition, the cost is lower, users only need to pay for the videophone when using the karaoke function, there is no additional cost. BRIEF DESCRIPTION OF THE DRAWINGS FIG.
  • FIG. 1 is a schematic diagram of an MTV mode frame of an embodiment of a videophone according to the present invention
  • FIG. 2 is a schematic diagram of a self-made MTV mode frame of a videophone embodiment of the present invention
  • FIG. 3 is a view of a videophone according to an embodiment of the present invention
  • FIG. 4 is a schematic diagram of a karaoke mode selection interface according to an embodiment of the present invention
  • FIG. 5 is a schematic diagram of a karaoke track selection interface according to an embodiment of the present invention.
  • the videophone of the embodiment of the present invention includes a videophone protocol stack, a modem, a camera, a karaoke function module, and an audio mixing synthesis module.
  • the karaoke function module is used to select a karaoke mode and a local MTV file, and play a local MTV file, which includes audio and video.
  • the audio mixing and synthesizing module is used to synthesize the user's voice (ie, microphone sound) and the audio of the local MTV file (ie, the accompaniment) into an audio source (synthesized sound) of the videophone, and the audio source of the synthesized videophone It is sent to the peer through the videophone protocol stack and modem.
  • the video of the piece (that is, the image played by the local MTV file) or the image captured by the camera is the video source of the videophone.
  • the video source of the videophone is sent to the opposite end by the videophone protocol stack and the modem.
  • the videophone can activate the karaoke function through its karaoke function module, and set the MTV mode (MTV mode or homemade MTV mode;).
  • MTV mode MTV mode or homemade MTV mode;
  • FIG. 1 is a schematic diagram of an MTV mode frame of a videophone according to an embodiment of the present invention.
  • the videophone transmits the user's voice (ie, microphone tone) through its audio mixing and synthesizing module.
  • the videophone Synthesize with the audio of the local MTV file (ie, the accompaniment), synthesize it into the audio source of the videophone (synthesized sound), and send the synthesized audio source of the videophone to the pair via the videophone protocol stack and modem. end.
  • the videophone sends the video of the local MTV file (that is, the image played by the local MTV file) as a video source to the peer through the videophone protocol stack and the modem. Since the videophone of the embodiment of the present invention is implemented by adding a karaoke function module and an audio mixing synthesis module to the existing videophone, it is basically the same as the existing videophone, and therefore, the audio source and the video source thereof.
  • the data is synchronized by the video telephony stack.
  • FIG. 2 is a schematic diagram of a self-made MTV mode frame of a videophone according to an embodiment of the present invention.
  • the audio source (synthesized sound) of the videophone is also the audio of the user (ie, the microphone sound) and the audio of the local MTV file (ie, The accompaniment sound is synthesized, but the video source of the videophone is the image captured by the videophone through its camera.
  • FIG. 3 which is a karaoke flow chart between a videophone and a called videophone according to an embodiment of the present invention, the steps are as follows: Step 301: The calling videophone initiates a videophone call; 302: The called videophone answers the call and establishes a videophone call.
  • Step 303 During the call, the calling videophone can activate the karaoke function through its karaoke function module, and set the MTV mode, for example, the MTV mode, Homemade MTV mode, and select local MTV files, the local MTV files include audio and video; local MTV files can be karaoke tracks, for example, existing commercially available MTV, online download MTV, etc.
  • MTV mode for example, the MTV mode, Homemade MTV mode, and select local MTV files
  • the local MTV files include audio and video
  • local MTV files can be karaoke tracks, for example, existing commercially available MTV, online download MTV, etc.
  • the local MTV file may also be the MTV recorded by the user himself;
  • Step 306 The calling videophone selects a video source.
  • step 307 the calling videophone selects the image played by the local MTV file and the synthesized sound as the audio and video data source, and proceeds to step 309;
  • step 308 The calling videophone selects the image captured by the 4 image head and the synthesized sound as the audio and video data source, and proceeds to step 309;
  • Step 309 After the communication network, the calling videophone sends the audio and video data source to the called party. Facetime. During this process, the called videophone still works in the way that the ordinary videophone works.
  • the video source is the image captured by the camera, and the audio source is the sound collected by the microphone.
  • the videophone including the karaoke function module and the audio mixing synthesizing module is started.
  • Karaoke can be.
  • the user can switch between the karaoke and the call state through the karaoke function module: when the videophone is in the call state, the karaoke function module can be switched to the karaoke state; In the karaoke state, the karaoke function module can be switched to the call state.
  • the videophone interface provided by the embodiment of the invention is simple to operate, and the two videophones can select to enter or exit the karaoke function at any time during the video call, and perform karaoke interaction.
  • the user A uses the karaoke function and sings to the other user B as an example for explanation.
  • Step 1 User A and User B are in a videophone call, User A videophone (with karaoke function module) activates the karaoke function module, enters the karaoke mode selection interface, as shown in Figure 4, select Karaoke mode: MTV mode and homemade MTV mode.
  • Step 2 The user plays the local MTV file on the videophone. If user A selects the MTV mode, the user B can see the MTV image played by the user A on the videophone, and hear the user A's singing voice.
  • Step 3 After User A sings or does not want to sing, close "Karaoke" and enter normal videophone call.
  • Step 4 If any end hangs up, the karaoke function ends and the videophone call exits.
  • the embodiments of the present invention are applicable to any videophone terminal, including mobile terminals and fixed terminals, and are not limited by the communication network environment, including a packet domain and a circuit domain.
  • the implementation of the present invention does not modify the system architecture and the current processing flow, is easy to implement, facilitates promotion in the technical field, and has strong industrial applicability.

Landscapes

  • Engineering & Computer Science (AREA)
  • Multimedia (AREA)
  • Signal Processing (AREA)
  • Telephonic Communication Services (AREA)
  • Two-Way Televisions, Distribution Of Moving Picture Or The Like (AREA)

Abstract

A videophone and a method for realizing karaoke between videophones are provided. The videophone includes a videophone protocol stack, a modem, a camera, a karaoke function module and an audio mixing synthesis module. The method includes the following steps: a calling videophone starts a call, and a called videophone receives the call to establish the communication; the calling videophone sets the karaoke mode, selects a local MTV file and plays it by means of the karaoke function module, wherein the local MTV file includes audio and video; the calling videophone synthesizes the singing of the user and the audio of the local MTV file to form audio source by means of the audio mixing synthesis module, then transmits the synthesized audio source to the called videophone by means of the videophone protocol stack and the modem. The videophone and the method are easy to realize with low cost.

Description

可视电话及实现可视电话间卡拉 OK的方法 技术领域 本发明属于可视电话技术领域,具体涉及一种可视电话及实现可视电话 间卡拉 οκ的方法。 背景技术 卡拉 OK源于 20世纪的 60年代的日本,因其丰富的娱乐性和广泛的参 与性, 广泛流传至世界各地, 目前, 唱卡拉 OK已成为人们的一种休闲方式。 从商业性的 KTV 娱乐场所、 家庭使用的家庭影院、 到网络卡拉 OK, 卡拉 OK的形式多种多样, 促使越来越多的人参与其中。 而传统的卡拉 OK都是家人朋友聚在一起, 处于在一个场所, 同看一个 显示终端。 而随着当今生活节奏的不断加快, 人们工作学习越发繁忙, 且见 面相聚到同个地点的机会越来越少, 因此, 一起去 KTV 或在家里卡拉 OK 机会也越来越少了。 生活中, 家人朋友不能一起唱卡拉 OK的主要原因是时间和距离。 虽然 市面上出现了手机音乐卡拉 OK, 但这种终端只是在本地播放 MTV文件, 仅 限于个人独享, 不能与远处的朋友家人分享。 而网络卡拉 OK是将网络服务 器上的卡拉 OK文件传到本地进行播放, 也达不到与朋友分享的效果。 发明内容 针对上述缺点,本发明的目的在于提供一种可视电话及实现可视电话间 卡拉 OK的方法, 以解决现有技术中可视电话间只能进行通话、 本地播放文 件, 而不能进行卡拉 OK的问题。 为实现上述目的, 本发明是通过以下技术方案实现的: 一种可视电话, 包括可视电话协议栈、 调制解调器和 4聂像头, 还包括卡 拉 OK功能模块和音频混音合成模块, 其中, 卡拉 OK功能模块, 用于选择 卡拉 OK模式及本地 MTV ( Music television, 音乐电视 ) 文件, 并播放本地 MTV文件; 该本地 MTV文件包括音频和视频; 音频混音合成模块, 用于将 用户的声音 (即, 话筒音) 和本地 MTV文件的音频 (即伴奏音) 合成为可 视电话的音频源(合成音), 并将合成的可视电话的音频源通过可视电话协议 栈及调制解调器发送给对端。 上述可视电话, 所述本地 MTV文件的视频 (即本地 MTV文件播放出 来的图像) 或所述摄像头捕获的图像是可视电话的视频源, 该可视电话的视 频源由所述可视电话协议栈及调制解调器发送给对端。 上述可视电话, 所述卡拉 OK功能模块还具有切换功能, 能在卡拉 OK 和通话两种状态间进行切换。 上述可视电话, 所述卡拉 OK模式是 MTV模式或自制 MTV模式, 若 卡拉 OK模式为 MTV模式,则所述可视电话的视频源为本地 MTV文件的视 频; 若卡拉 OK模式为自制 MTV模式, 则所述可视电话的视频源为可视电 话的摄像头捕获的图像。 上述可视电话, 所述本地 MTV 文件是卡拉 OK 曲目 (如现有市售的 MTV、 网络下载的 MTV等) 或是用户自己录制的 MTV。 一种实现可视电话间卡拉 OK的方法, 包括: 主叫可视电话发起可视电 话呼叫, 被叫可视电话接听呼叫, 建立可视电话通话; 主叫可视电话通过其 卡拉 OK功能模块设置卡拉 OK模式, 选择本地 MTV文件并播放, 其中, 所述本地 MTV文件包括音频和视频; 主叫可视电话通过其音频混音合成模 块将用户的声音和所述本地 MTV文件的音频 (即伴奏音) 合成为主叫可视 电话的音频源, 并将合成的音频源通过其可视电话协议栈及调制解调器发送 给被叫可视电话。 上述方法中, 主叫可视电话的视频源是所述本地 MTV文件的视频(即 本地 MTV文件播放出来的图像) 或是主叫可视电话的摄像头捕获的图像, 该视频源由所述可视电话协议栈及调制解调器发送给被叫可视电话。 上述方法中,所述主叫可视电话的卡拉 OK功能模块能在卡拉 OK和通 话两种状态间进行切换: 在可视电话处于通话状态时, 可通过卡拉 OK功能 模块切换成卡拉 OK状态;在可视电话处于卡拉 OK状态时,可通过卡拉 OK 功能模块切换成通话状态。 上述方法中, 若设置的卡拉 OK模式是 MTV模式, 则主叫可视电话发 送给被叫可视电话的视频源为本地 MTV文件的视频; 若设置的卡拉 OK模 式是自制 MTV模式, 则主叫可视电话发送给被叫可视电话的视频源为主叫 可视电话的摄像头捕获的图像。 上述方法中,所述本地 MTV文件是卡拉 OK曲目(如现有市售的 MTV、 网络下载的 MTV等) 或是用户自己录制的 MTV。 本发明与现有技术相比具有以下有益效果: 由于音频源由两路音频数据 (用户声音和本地 MTV文件音频码流) 合成, 视频源可选择摄像头捕获的 图像或者本地 MTV文件的视频码流, 因此, 只要在现有可视电话基础上增 加卡拉 OK功能模块和音频混音合成模块即可实现, 不需要添加硬件, 成本 低; 另外, 用户可随时随地与家人朋友进行卡拉 OK, 可以很方便地进行远 程的娱乐互动; 此外, 费用较低, 用户在使用卡拉 OK功能时只需要负担可 视电话的费用, 没有任何其他附加费用。 附图说明 图 1是本发明可视电话实施例的 MTV模式框架示意图; 图 2是本发明可视电话实施例的自制 MTV模式框架示意图; 图 3是本发明可视电话实施例与被叫可视电话间的卡拉 OK流程图; 图 4是本发明实施例卡拉 OK模式选择界面示意图; 图 5是本发明实施例卡拉 OK曲目选择界面示意图。 具体实施方式 为了更好地理解本发明 ,下面结合附图和具体实施例对本发明作进一步 地描述。 本发明实施例的可视电话包括可视电话协议栈、 调制解调器、 摄像头、 卡拉 OK功能模块和音频混音合成模块。卡拉 OK功能模块用于选择卡拉 OK 模式及本地 MTV文件, 并播放本地 MTV文件, 该本地 MTV文件包括音频 和视频。 音频混音合成模块用于将用户的声音 (即话筒音) 和本地 MTV文 件的音频(即伴奏音)合成为可视电话的音频源(合成音), 并将合成的可视 电话的音频源通过可视电话协议栈及调制解调器发送给对端。 本地 MTV文 件的视频 (即本地 MTV文件播放出来的图像) 或摄像头捕获的图像是可视 电话的视频源, 该可视电话的视频源由可视电话协议栈及调制解调器发送给 对端。 可视电话可通过其卡拉 OK功能模块启动卡拉 OK功能, 设置 MTV模 式( MTV模式或自制 MTV模式;)。 请参阅图 1 , 该图是本发明实施例可视电 话的 MTV模式框架示意图, 卡拉 OK功能模块播放本地 MTV文件时, 可视 电话通过其音频混音合成模块将用户的声音 (即话筒音) 和本地 MTV文件 的音频(即伴奏音)进行合成, 使之合成为可视电话的音频源 (合成音), 并 将合成的可视电话的音频源通过可视电话协议栈及调制解调器发送给对端。 同时,可视电话将本地 MTV文件的视频(即本地 MTV文件播放出来的图像 ) 作为视频源通过可视电话协议栈及调制解调器发送给对端。 由于本发明实施 例的可视电话是在现有可视电话基础上增加卡拉 OK功能模块和音频混音合 成模块来实现的, 与现有可视电话基本一样, 因此, 其音频源和视频源的数 据通过可视电话协议栈处理后就能达到同步。 请参阅图 2, 该图是本发明实施例可视电话的自制 MTV模式框架示意 图, 可视电话的音频源 (合成音) 同样是用户的声音 (即话筒音) 和本地 MTV文件的音频(即伴奏音 )的合成, 但可视电话的视频源是可视电话通过 其摄像头捕获的图像。 请参阅图 3 , 该图是本发明实施例的可视电话与被叫可视电话间的卡拉 OK流程图, 其步 4聚如下: 步骤 301 : 主叫可视电话发起可视电话呼叫; 步骤 302: 被叫可视电话接听呼叫, 建立可视电话通话; 步骤 303 : 在通话过程中, 主叫可视电话可以通过其卡拉 OK功能模块 启动卡拉 OK功能, 设置 MTV模式, 例如, MTV模式、 自制 MTV模式, 并选择本地 MTV文件, 该本地 MTV文件包括音频和视频; 本地 MTV文件 可以是卡拉 OK曲目, 例如,现有市售的 MTV、 网络下载的 MTV等(如 "很 爱很爱你,,、 "爱情呼叫转移,, 等), 此外本地 MTV文件也可以是用户自己录 制的 MTV; 步骤 304: 主叫可视电话播放该本地 MTV文件; 步骤 305 : 主叫可视电话启动其音频混音合成模块, 4巴用户的声音(即 话筒音) 和本地 MTV文件的音频 (即伴奏音) 进行合成, 使之合成为可视 电话的音频源 (合成音); 步骤 306: 主叫可视电话选择视频源。 如果选择 MTV模式, 则进入步 骤 307 , 如果选择自制 MTV模式, 则进入步骤 308; 步骤 307: 主叫可视电话选取本地 MTV文件播放出来的图像和合成音 作为音视频数据源, 进入步骤 309; 步骤 308: 主叫可视电话选取 4聂像头捕获的图像和合成音作为音视频数 据源, 进入步骤 309; 步骤 309: 经过通讯网络, 主叫可视电话将音视频数据源发送给被叫可 视电话。 在此过程中, 被叫可视电话仍以普通可视电话的工作方式进行工作, 视 频源是其摄像头捕获的图像, 音频源是其话筒釆集到的声音。 当然, 上述过 程中只要主叫、 被叫可视电话两者之一含有卡拉 OK功能模块和音频混音合 成模块即可, 然后由含有卡拉 OK功能模块和音频混音合成模块的可视电话 发起卡拉 OK即可。 在此过程中,用户可通过卡拉 OK功能模块在卡拉 OK和通话两种状态 间进行切换: 在可视电话处于通话状态时, 可通过卡拉 OK功能模块切换成 卡拉 OK状态; 在可视电话处于卡拉 OK状态时, 可通过卡拉 OK功能模块 切换成通话状态。 本发明实施例提供的可视电话界面操作简单,两个可视电话在进行视频 通话中, 可以随时选择进入或退出卡拉 OK功能, 进行卡拉 OK互动。 下面 以用户 A使用卡拉 OK功能, 唱歌给对方用户 B听为例来进行说明。 第一步: 用户 A和用户 B正在进行可视电话通话中, 用户 A可视电话 (带有卡拉 OK功能模块) 启动卡拉 OK功能模块, 进入卡拉 OK模式选择 界面, 如图 4所示, 选择卡拉 OK模式: MTV模式和自制 MTV模式, 选择 某一模式后, 进入曲目选择界面, 如图 5所示, 选择将要唱的曲目, 确认之 后, 进入卡拉 OK功能; 如果用户界面显示 "关闭卡拉 OK" , 则说明已经启 动了卡拉 OK功能, 此时用户若点击 "关闭卡拉 OK" , 则退出卡拉 OK功能。 在启动了卡拉 OK功能时, 拨打 "可视电话" 菜单变为 "卡拉 OK"。 第二步:用户 Α可视电话上播放本地 MTV文件,如果用户 A选择 MTV 模式, 用户 B可视电话上就可以看到用户 A可视电话上播放的 MTV图像, 听到用户 A的唱歌声; 如果用户 A选择自制 MTV模式, 用户 B可视电话上 就可以看到用户 A, 听到用户 A的唱歌声。 第三步: 用户 A唱完或不想唱后, 关闭 "卡拉 OK" , 则进入正常的可 视电话通话。 第四步: 如果任何一端挂机, 卡拉 OK功能结束, 退出可视电话通话。 本发明实施例可适用于任何可视电话终端, 包括移动终端和固定终端 , 不受通信网络环境限制, 包括分组域和电路域。 另夕卜,本发明的实现没有对系统架构和目前的处理流程修改,易于实现, 便于在技术领域中进行推广, 具有较强的工业适用性。 以上所述仅为本发明的较佳实施例, 并不用以限制本发明, 应当指出, 对于本领域的普通技术人员来说, 凡是本发明的精神和原则之内所作的任何 修改、 等同替换或改进等, 均应包含在本发明的保护范围之内。 BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to the field of videophone technology, and in particular, to a videophone and a method for implementing a camera between videophones. BACKGROUND OF THE INVENTION Karaoke originated in Japan in the 1960s in the 1960s. Due to its rich entertainment and extensive participation, it has been widely spread around the world. At present, singing karaoke has become a leisure way for people. From commercial KTV entertainment venues, home theaters for home use, to karaoke for the Internet, karaoke is available in a variety of forms, prompting more and more people to participate. The traditional karaoke is a gathering of family and friends, in a place, with a display terminal. With the accelerating pace of life today, people are more busy working and learning, and there are fewer and fewer opportunities to meet in the same place. Therefore, there are fewer and fewer opportunities to go to KTV or karaoke at home. In life, the main reason why family and friends can't sing karaoke together is time and distance. Although mobile music karaoke has appeared on the market, this terminal only plays MTV files locally, and is limited to individual exclusives. It cannot be shared with friends and family in the distance. The network karaoke is to transfer the karaoke files on the web server to the local area for playback, and it does not achieve the effect of sharing with friends. SUMMARY OF THE INVENTION In view of the above disadvantages, an object of the present invention is to provide a videophone and a method for implementing karaoke between videophones, so as to solve the problem in the prior art that only video calls and local play files can be played between videophones, but cannot be performed. Karaoke problem. In order to achieve the above object, the present invention is achieved by the following technical solutions: a videophone, including a videophone protocol stack, a modem, and a camera, and a karaoke function module and an audio mixing and synthesizing module, wherein A karaoke function module for selecting a karaoke mode and a local MTV (Music television) file, and playing a local MTV file; the local MTV file includes audio and video; an audio mixing synthesis module, The user's voice (ie, the microphone tone) and the audio of the local MTV file (ie, the accompaniment) are synthesized into the audio source (synthesized sound) of the videophone, and the audio source of the synthesized videophone is passed through the videophone protocol stack and The modem is sent to the peer. The videophone, the video of the local MTV file (ie, the image played by the local MTV file) or the image captured by the camera is a video source of the videophone, and the video source of the videophone is used by the videophone The protocol stack and modem are sent to the peer. In the above videophone, the karaoke function module further has a switching function, and can switch between the karaoke and the call state. In the above videophone, the karaoke mode is an MTV mode or a self-made MTV mode. If the karaoke mode is the MTV mode, the video source of the videophone is a video of a local MTV file; if the karaoke mode is a self-made MTV mode The video source of the videophone is an image captured by a camera of the videophone. In the above videophone, the local MTV file is a karaoke track (such as an existing commercially available MTV, a network downloaded MTV, etc.) or an MTV recorded by the user himself. A method for realizing karaoke between videophones, comprising: a calling videophone initiates a videophone call, a called videophone picks up a call, and establishes a videophone call; the calling videophone passes its karaoke function module Setting a karaoke mode, selecting a local MTV file and playing, wherein the local MTV file includes audio and video; the calling videophone passes the user's voice and the audio of the local MTV file through its audio mixing synthesis module (ie The accompaniment sound is synthesized as the audio source of the videophone, and the synthesized audio source is sent to the called videophone through its video telephony stack and modem. In the above method, the video source of the calling videophone is the video of the local MTV file (that is, the image played by the local MTV file) or the image captured by the camera of the calling videophone, and the video source is The telephone protocol stack and modem are sent to the called videophone. In the above method, the karaoke function module of the calling videophone can switch between the karaoke and the call state: when the videophone is in the call state, the karaoke function module can be switched to the karaoke state; When the videophone is in the karaoke state, it can be switched to the call state through the karaoke function module. In the above method, if the set karaoke mode is the MTV mode, the calling videophone sends The video source sent to the called videophone is the video of the local MTV file; if the karaoke mode set is the self-made MTV mode, the video source that the calling videophone sends to the called videophone is the primary videophone. The image captured by the camera. In the above method, the local MTV file is a karaoke track (such as an existing commercially available MTV, a network downloaded MTV, etc.) or an MTV recorded by the user himself. Compared with the prior art, the present invention has the following beneficial effects: Since the audio source is synthesized by two channels of audio data (user sound and local MTV file audio stream), the video source can select the image captured by the camera or the video stream of the local MTV file. Therefore, as long as the karaoke function module and the audio mixing synthesis module are added to the existing videophone, the hardware does not need to be added, and the cost is low; in addition, the user can perform karaoke with family and friends anytime and anywhere, which can be very Convenient remote entertainment interaction; In addition, the cost is lower, users only need to pay for the videophone when using the karaoke function, there is no additional cost. BRIEF DESCRIPTION OF THE DRAWINGS FIG. 1 is a schematic diagram of an MTV mode frame of an embodiment of a videophone according to the present invention; FIG. 2 is a schematic diagram of a self-made MTV mode frame of a videophone embodiment of the present invention; FIG. 3 is a view of a videophone according to an embodiment of the present invention; FIG. 4 is a schematic diagram of a karaoke mode selection interface according to an embodiment of the present invention; and FIG. 5 is a schematic diagram of a karaoke track selection interface according to an embodiment of the present invention. DETAILED DESCRIPTION OF THE INVENTION In order to better understand the present invention, the present invention will be further described below in conjunction with the drawings and specific embodiments. The videophone of the embodiment of the present invention includes a videophone protocol stack, a modem, a camera, a karaoke function module, and an audio mixing synthesis module. The karaoke function module is used to select a karaoke mode and a local MTV file, and play a local MTV file, which includes audio and video. The audio mixing and synthesizing module is used to synthesize the user's voice (ie, microphone sound) and the audio of the local MTV file (ie, the accompaniment) into an audio source (synthesized sound) of the videophone, and the audio source of the synthesized videophone It is sent to the peer through the videophone protocol stack and modem. Local MTV The video of the piece (that is, the image played by the local MTV file) or the image captured by the camera is the video source of the videophone. The video source of the videophone is sent to the opposite end by the videophone protocol stack and the modem. The videophone can activate the karaoke function through its karaoke function module, and set the MTV mode (MTV mode or homemade MTV mode;). Please refer to FIG. 1 , which is a schematic diagram of an MTV mode frame of a videophone according to an embodiment of the present invention. When a karaoke function module plays a local MTV file, the videophone transmits the user's voice (ie, microphone tone) through its audio mixing and synthesizing module. Synthesize with the audio of the local MTV file (ie, the accompaniment), synthesize it into the audio source of the videophone (synthesized sound), and send the synthesized audio source of the videophone to the pair via the videophone protocol stack and modem. end. At the same time, the videophone sends the video of the local MTV file (that is, the image played by the local MTV file) as a video source to the peer through the videophone protocol stack and the modem. Since the videophone of the embodiment of the present invention is implemented by adding a karaoke function module and an audio mixing synthesis module to the existing videophone, it is basically the same as the existing videophone, and therefore, the audio source and the video source thereof. The data is synchronized by the video telephony stack. Please refer to FIG. 2 , which is a schematic diagram of a self-made MTV mode frame of a videophone according to an embodiment of the present invention. The audio source (synthesized sound) of the videophone is also the audio of the user (ie, the microphone sound) and the audio of the local MTV file (ie, The accompaniment sound is synthesized, but the video source of the videophone is the image captured by the videophone through its camera. Referring to FIG. 3, which is a karaoke flow chart between a videophone and a called videophone according to an embodiment of the present invention, the steps are as follows: Step 301: The calling videophone initiates a videophone call; 302: The called videophone answers the call and establishes a videophone call. Step 303: During the call, the calling videophone can activate the karaoke function through its karaoke function module, and set the MTV mode, for example, the MTV mode, Homemade MTV mode, and select local MTV files, the local MTV files include audio and video; local MTV files can be karaoke tracks, for example, existing commercially available MTV, online download MTV, etc. (such as "I love you very much." , ,, "Love Call Transfer,, etc.", in addition, the local MTV file may also be the MTV recorded by the user himself; Step 304: The calling videophone plays the local MTV file; Step 305: The calling videophone starts its audio mixing and synthesizing module, and the sound of the 4 bar user (ie, the microphone sound) and the audio of the local MTV file (ie, the accompaniment sound) are combined and synthesized into an audio source of the videophone. (Synthetic sound); Step 306: The calling videophone selects a video source. If the MTV mode is selected, proceed to step 307, if the home-made MTV mode is selected, proceed to step 308; step 307: the calling videophone selects the image played by the local MTV file and the synthesized sound as the audio and video data source, and proceeds to step 309; Step 308: The calling videophone selects the image captured by the 4 image head and the synthesized sound as the audio and video data source, and proceeds to step 309; Step 309: After the communication network, the calling videophone sends the audio and video data source to the called party. Facetime. During this process, the called videophone still works in the way that the ordinary videophone works. The video source is the image captured by the camera, and the audio source is the sound collected by the microphone. Of course, in the above process, as long as one of the calling and called videophones has a karaoke function module and an audio mixing synthesizing module, then the videophone including the karaoke function module and the audio mixing synthesizing module is started. Karaoke can be. In this process, the user can switch between the karaoke and the call state through the karaoke function module: when the videophone is in the call state, the karaoke function module can be switched to the karaoke state; In the karaoke state, the karaoke function module can be switched to the call state. The videophone interface provided by the embodiment of the invention is simple to operate, and the two videophones can select to enter or exit the karaoke function at any time during the video call, and perform karaoke interaction. In the following, the user A uses the karaoke function and sings to the other user B as an example for explanation. Step 1: User A and User B are in a videophone call, User A videophone (with karaoke function module) activates the karaoke function module, enters the karaoke mode selection interface, as shown in Figure 4, select Karaoke mode: MTV mode and homemade MTV mode. After selecting a mode, enter the track selection interface, as shown in Figure 5, select the track to be sung, after confirming, enter the karaoke function; if the user interface displays "Close Karaoke"" , then the karaoke function has been activated. At this time, if the user clicks "close karaoke", the karaoke function is exited. When the karaoke function is activated, dial the "Video Phone" menu to change to "Karaoke". Step 2: The user plays the local MTV file on the videophone. If user A selects the MTV mode, the user B can see the MTV image played by the user A on the videophone, and hear the user A's singing voice. If User A chooses the self-made MTV mode, User B can see User A on the videophone and hear User A's singing voice. Step 3: After User A sings or does not want to sing, close "Karaoke" and enter normal videophone call. Step 4: If any end hangs up, the karaoke function ends and the videophone call exits. The embodiments of the present invention are applicable to any videophone terminal, including mobile terminals and fixed terminals, and are not limited by the communication network environment, including a packet domain and a circuit domain. In addition, the implementation of the present invention does not modify the system architecture and the current processing flow, is easy to implement, facilitates promotion in the technical field, and has strong industrial applicability. The above is only the preferred embodiment of the present invention, and is not intended to limit the present invention. It should be noted that any modifications, equivalents, or substitutions made within the spirit and principles of the present invention will be apparent to those skilled in the art. Improvements and the like should be included in the scope of the present invention.

Claims

权 利 要 求 书 Claim
1. 一种可视电话, 包括可视电话协议栈、 调制解调器和 4聂像头, 其特征 在于, 还包括卡拉 OK功能模块和音频混音合成模块, 其中, A videophone, comprising a videophone protocol stack, a modem, and a 4D image header, further comprising a karaoke function module and an audio mixing synthesis module, wherein
所述卡拉 OK功能模块, 用于选择卡拉 OK模式及本地音乐电视 MTV文件, 并播放所述本地 MTV文件, 所述本地 MTV文件包括音 频和视频;  The karaoke function module is configured to select a karaoke mode and a local music TV MTV file, and play the local MTV file, where the local MTV file includes audio and video;
所述音频混音合成模块, 用于将用户的声音和所述本地 MTV文 件的音频合成为可视电话的音频源, 并将合成的可视电话的音频源通 过所述可视电话协议栈及所述调制解调器发送给对端。  The audio mixing and synthesizing module is configured to synthesize a user's voice and audio of the local MTV file into an audio source of a videophone, and pass the audio source of the synthesized videophone through the videophone protocol stack and The modem is sent to the peer.
2. 根据权利要求 1 所述的可视电话, 其特征在于, 所述本地 MTV文件 的视频或所述摄像头捕获的图像是可视电话的视频源, 该可视电话的 视频源由所述可视电话协议栈及所述调制解调器发送给对端。 2. The videophone according to claim 1, wherein the video of the local MTV file or the image captured by the camera is a video source of a videophone, and the video source of the videophone is The telephony protocol stack and the modem are sent to the peer.
3. 根据权利要求 1所述的可视电话, 其特征在于, 所述卡拉 OK功能模 块还具有切换功能, 能在卡拉 OK和通话两种状态间进行切换。 3. The videophone according to claim 1, wherein the karaoke function module further has a switching function to switch between karaoke and call.
4. 居权利要求 2所述的可视电话, 其特征在于, 所述卡拉 OK模式是 MTV模式或自制 MTV模式, 若卡拉 OK模式为 MTV模式, 则所述 可视电话的视频源为本地 MTV文件的视频; 若卡拉 OK模式为自制 MTV 模式, 则所述可视电话的视频源为可视电话的摄像头捕获的图 像。 4. The videophone of claim 2, wherein the karaoke mode is an MTV mode or a home-made MTV mode, and if the karaoke mode is an MTV mode, the video source of the videophone is a local MTV. The video of the file; if the karaoke mode is the homemade MTV mode, the video source of the videophone is the image captured by the camera of the videophone.
5. 根据权利要求 1 或 2所述的可视电话, 其特征在于, 所述本地 MTV 文件是卡拉 OK曲目或是用户自己录制的 MTV。 The videophone according to claim 1 or 2, wherein the local MTV file is a karaoke track or an MTV recorded by the user himself.
6. —种实现可视电话间卡拉 OK的方法, 包括: 6. A method for implementing karaoke between videophones, including:
主叫可视电话发起可视电话呼叫, 被叫可视电话接听呼叫, 建立 可视电话通话;  The calling videophone initiates a videophone call, and the called videophone answers the call and establishes a videophone call;
主叫可视电话通过其卡拉 OK功能模块设置卡拉 OK模式, 选择 本地 MTV文件并播放, 其中, 所述本地 MTV文件包括音频和视频; 主叫可视电话通过其音频混音合成模块将用户的声音和所述本 地 MTV 文件的音频合成为主叫可视电话的音频源, 并将合成的音频 源通过其可视电话协议栈及调制解调器发送给被叫可视电话。 The calling videophone sets the karaoke mode through its karaoke function module, selects and plays the local MTV file, wherein the local MTV file includes audio and video; the calling videophone transmits the user through its audio mixing synthesizing module. The audio of the sound and the local MTV file is synthesized as the audio source of the videophone, and the synthesized audio The source is sent to the called videophone through its video telephony stack and modem.
7. 根据权利要求 6所述的实现可视电话间卡拉 OK的方法, 其特征在于, 主叫可视电话的视频源是所述本地 MTV 文件的视频或是主叫可视电 话的摄像头捕获的图像, 所述视频源由所述可视电话协议栈及调制解 调器发送给被叫可视电话。 7. The method for implementing videophone inter-camera karaoke according to claim 6, wherein the video source of the calling videophone is captured by the video of the local MTV file or by the camera of the calling videophone. Image, the video source is sent by the video telephony protocol stack and modem to the called videophone.
8. 根据权利要求 6所述的实现可视电话间卡拉 OK的方法, 其特征在于, 所述主叫可视电话的卡拉 OK功能模块能在卡拉 OK和通话两种状态 间进行切换: 在可视电话处于通话状态时, 可通过卡拉 OK功能模块 切换成卡拉 OK状态; 在可视电话处于卡拉 OK状态时, 可通过卡拉 OK功能模块切换成通话状态。 8. The method for realizing karaoke between videophones according to claim 6, wherein the karaoke function module of the calling videophone can switch between two states: karaoke and call: When the call is in the call state, the karaoke function module can be switched to the karaoke state; when the videophone is in the karaoke state, the karaoke function module can be switched to the call state.
9. 根据权利要求 7所述的实现可视电话间卡拉 OK的方法, 其特征在于, 若设置的卡拉 OK模式是 MTV模式,则主叫可视电话发送给被叫可视 电话的视频源为本地 MTV文件的视频;若设置的卡拉 OK模式是自制 MTV模式, 则主叫可视电话发送给被叫可视电话的视频源为主叫可视 电话的摄像头捕获的图像。 9. The method for implementing videophone inter-camera karaoke according to claim 7, wherein if the set karaoke mode is the MTV mode, the video source sent by the calling videophone to the called videophone is The video of the local MTV file; if the set karaoke mode is the self-made MTV mode, the video source that the calling videophone sends to the called videophone is the image captured by the camera of the videophone.
10. 根据权利要求 6、 7或 8所述的实现可视电话间卡拉 OK的方法, 其特 征在于, 所述本地 MTV 文件是卡拉 OK 曲目或是用户自己录制的 MTV。 10. A method of implementing videophone inter-camera karaoke according to claim 6, 7 or 8, wherein the local MTV file is a karaoke track or an MTV recorded by the user himself.
PCT/CN2009/075605 2009-03-11 2009-12-15 Videophone and method for realizing karaoke between videophones WO2010102495A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN2009101061158A CN101516018B (en) 2009-03-11 2009-03-11 Visible telephone and method for realizing karaoke between visible telephones
CN200910106115.8 2009-03-11

Publications (1)

Publication Number Publication Date
WO2010102495A1 true WO2010102495A1 (en) 2010-09-16

Family

ID=41040274

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2009/075605 WO2010102495A1 (en) 2009-03-11 2009-12-15 Videophone and method for realizing karaoke between videophones

Country Status (2)

Country Link
CN (1) CN101516018B (en)
WO (1) WO2010102495A1 (en)

Families Citing this family (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101516018B (en) * 2009-03-11 2011-06-22 中兴通讯股份有限公司 Visible telephone and method for realizing karaoke between visible telephones
CN103428471A (en) * 2013-08-26 2013-12-04 苏州跨界软件科技有限公司 Communication system achieving karaoke function between terminals and method
CN104486634A (en) * 2014-12-09 2015-04-01 北京歌华有线数字媒体有限公司 Audio and video acquisition and synthesis system based on linkage of wired television and intelligent mobile equipment
KR102067692B1 (en) * 2018-09-28 2020-01-17 주식회사 앤씨앤 Method and apparatus generating mixing signal of video and audio
CN111913628B (en) * 2020-06-22 2022-05-06 维沃移动通信有限公司 Sharing method and device and electronic equipment

Citations (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2002278572A (en) * 2001-03-21 2002-09-27 Ricoh Co Ltd Karaoke service system
US20030109219A1 (en) * 2001-12-10 2003-06-12 Zak Amselem System and method for real-time simultaneous recording on playback over communication network
CN1549550A (en) * 2003-05-09 2004-11-24 陈绍华 Intelligent information server and controlling method
CN1805454A (en) * 2005-01-10 2006-07-19 乐金电子(中国)研究开发中心有限公司 Mobile telephone with Kara OK device and method of implementing Kara OK
CN1866353A (en) * 2006-03-08 2006-11-22 华为技术有限公司 Apparatus and method for realizing karaoke function on mobile terminal
CN101098523A (en) * 2006-06-29 2008-01-02 海尔集团公司 Method for realizing karaoke by mobile phone and mobile phone with karaoke function
CN101272570A (en) * 2008-04-28 2008-09-24 北京中星微电子有限公司 Method for implementing network karaoke OK and portable equipment
CN201122431Y (en) * 2007-12-03 2008-09-24 中兴通讯股份有限公司 Karaoke OK device for telephone terminal
CN101516018A (en) * 2009-03-11 2009-08-26 中兴通讯股份有限公司 Visible telephone and method for realizing karaoke between visible telephones

Family Cites Families (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP3975639B2 (en) * 2000-03-02 2007-09-12 ヤマハ株式会社 Telephone terminal device
CN1199501C (en) * 2002-04-10 2005-04-27 英华达(上海)电子有限公司 Method for realizing video and audio karaoke in mobile telephone

Patent Citations (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2002278572A (en) * 2001-03-21 2002-09-27 Ricoh Co Ltd Karaoke service system
US20030109219A1 (en) * 2001-12-10 2003-06-12 Zak Amselem System and method for real-time simultaneous recording on playback over communication network
CN1549550A (en) * 2003-05-09 2004-11-24 陈绍华 Intelligent information server and controlling method
CN1805454A (en) * 2005-01-10 2006-07-19 乐金电子(中国)研究开发中心有限公司 Mobile telephone with Kara OK device and method of implementing Kara OK
CN1866353A (en) * 2006-03-08 2006-11-22 华为技术有限公司 Apparatus and method for realizing karaoke function on mobile terminal
CN101098523A (en) * 2006-06-29 2008-01-02 海尔集团公司 Method for realizing karaoke by mobile phone and mobile phone with karaoke function
CN201122431Y (en) * 2007-12-03 2008-09-24 中兴通讯股份有限公司 Karaoke OK device for telephone terminal
CN101272570A (en) * 2008-04-28 2008-09-24 北京中星微电子有限公司 Method for implementing network karaoke OK and portable equipment
CN101516018A (en) * 2009-03-11 2009-08-26 中兴通讯股份有限公司 Visible telephone and method for realizing karaoke between visible telephones

Also Published As

Publication number Publication date
CN101516018A (en) 2009-08-26
CN101516018B (en) 2011-06-22

Similar Documents

Publication Publication Date Title
US8854414B2 (en) Method, application server and system for privacy protection in video call
CN101630507B (en) Method, device and system for realizing remote karaoke
US20080294721A1 (en) Architecture for teleconferencing with virtual representation
WO2007082433A1 (en) Apparatus, network device and method for transmitting video-audio signal
TWI492629B (en) Video conference system, video conference apparatus and method thereof
CN101534412A (en) Method for realizing video conference notification and device
WO2010102495A1 (en) Videophone and method for realizing karaoke between videophones
EP1681882B1 (en) Commmunication method and communication system to enable sending a message during a Push-To-Talk connection
JP2007201916A (en) PoC DATA TRANSMISSION METHOD AND PoC CALL SYSTEM AND DEVICE
JP4473260B2 (en) Telephone communication device
WO2012022093A1 (en) Method and wireless communication terminal for displaying calling video
JP2007274480A (en) Telephone system, and telephone terminal device
JP2010239641A (en) Communication device, communication system, control program of communication device, and recording media-recording control program of communication device
WO2006015527A1 (en) A method and system for adding the background music in the conversation
JP5436743B2 (en) Communication terminal device and communication control device
JP4572697B2 (en) Method, terminal and program for reproducing video content data during call connection based on IP telephone function
CN100464554C (en) System and method for playing background sound used for public telephone exchange net
JP2010512075A (en) Method for call session, telephone system and telephone terminal
WO2010070986A1 (en) Multimedia provision service
JP4318842B2 (en) Mobile phone system
JP4391362B2 (en) Image communication device
JP2006201272A (en) Ip telephone terminal and karaoke distribution server system
JP4207701B2 (en) Call device, call method, and call system
JP2005223403A (en) Speech unit and line connection type speech system
JP4202724B2 (en) Content playback device with telephone function

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 09841364

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 09841364

Country of ref document: EP

Kind code of ref document: A1