WO2014023042A1 - Set top box based video conversation method and system - Google Patents

Set top box based video conversation method and system Download PDF

Info

Publication number
WO2014023042A1
WO2014023042A1 PCT/CN2012/080298 CN2012080298W WO2014023042A1 WO 2014023042 A1 WO2014023042 A1 WO 2014023042A1 CN 2012080298 W CN2012080298 W CN 2012080298W WO 2014023042 A1 WO2014023042 A1 WO 2014023042A1
Authority
WO
WIPO (PCT)
Prior art keywords
top box
set top
mobile phone
message
peer
Prior art date
Application number
PCT/CN2012/080298
Other languages
French (fr)
Chinese (zh)
Inventor
王风涛
Original Assignee
青岛海信宽带多媒体技术有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 青岛海信宽带多媒体技术有限公司 filed Critical 青岛海信宽带多媒体技术有限公司
Publication of WO2014023042A1 publication Critical patent/WO2014023042A1/en

Links

Definitions

  • the present invention relates to communication technologies, and in particular, to a video call method and system. Background technique
  • Smart TVs enable a variety of application services such as network search, IP TV, video on demand, digital music, online news, and network video telephony based on set-top boxes. These application services are based on the set-top box connected to the network.
  • set-top boxes are becoming more and more popular through wireless network connections such as WIFI.
  • the video and audio are generally collected by the camera with the microphone; however, the camera with the microphone usually cannot simultaneously take care of the quality of the video and audio, because if the camera is away Too close, the video range is too small; if the camera is farther away, the quality of the collected sound is not good.
  • the patent document with the publication number 102387335A discloses a method and a system for realizing a visual call based on a mobile phone through a set top box.
  • the mobile phone collects video and audio, and the mobile phone packages the collected audio and video data to the set top box, and the set top box Send to the other party's set-top box via the network.
  • the method still has the mobile phone too close, the video range is too small; the mobile phone is far away, and the sound quality is not good.
  • the user needs to put the mobile phone in front of the user, which does not meet the user's habit of using the mobile phone.
  • the embodiment of the invention provides a video call method and system based on a set top box, which can simultaneously consider the quality of video and audio.
  • a set-top box-based video calling method including: after a local set-top box establishes a call with a peer set-top box, sending a startaudio message to a mobile phone with which a communication connection is established; in the startaudio message Carrying the IP address and port information of the peer set top box;
  • the mobile phone After receiving the startaudio message, the mobile phone starts to collect sound information; After the information is encoded into audio data, the audio data is sent to the peer set top box according to the audio IP address and port information of the peer set top box; and,
  • the local set top box After acquiring the image information collected by the camera device connected thereto, the local set top box encodes the acquired image information into video data and sends the image information to the peer set top box;
  • the peer set top box decodes the video data and the audio data, and plays the decoded sound and video through the connected TV.
  • the local set top box sends a stopaudio message to the mobile phone
  • the mobile phone stops collecting sound information according to the received stopaudio message.
  • the method further includes:
  • the local set top box establishes a communication connection with the mobile phone before calling the peer set top box; or the local set top box receives the call of the opposite set top box, and before responding to the call of the opposite set top box,
  • the mobile phone establishes a communication connection;
  • the local set top box establishes a communication connection with the mobile phone before receiving the call of the opposite set top box.
  • the local set top box After the local set top box establishes a communication connection with the mobile phone, the local set top box periodically sends a keepalive message to the mobile phone;
  • the local set top box After the local set top box sends a keepalive message, if the response message returned by the mobile phone is not received within the set time period, the other device is switched to perform sound information collection.
  • the establishing, by the local set top box, the communication connection between the local set top box and the mobile phone is:
  • the local set top box scans the mobile phone in the same local area network by sending a broadcast message; if scanning a mobile phone, establishing a communication connection with the mobile phone; if scanning a plurality of mobile phones, prompting the user to make a selection, and establishing a communication connection with the mobile phone selected by the user ;
  • the local set top box establishes a communication connection with the mobile phone according to the IP address specified by the user. Further, the peer set top box sends video data to the local set top box; and
  • the peer mobile phone sends audio data to the local set top box;
  • the peer mobile phone is a mobile phone that establishes a communication connection with the peer set top box;
  • the local set top box decodes the video data sent by the peer set top box and the audio data sent by the peer mobile phone, and plays the sound and video through the television connected thereto.
  • the method further includes: The local set top box determines that the audio collection device selected by the user is a mobile phone.
  • a set-top box-based video calling system comprising: a local set top box, and a mobile phone having a communication connection with the local set top box;
  • the local set top box is configured to send a startaudio message to the mobile phone after establishing a call with the peer set top box; and obtain image information collected by the camera device connected thereto, and then encode the acquired image information into video data and send the image information to The peer-end set-top box; wherein the startaudio message carries the IP address and port information of the peer set-top box;
  • the mobile phone is configured to start collecting sound information after receiving the startaudio message; and after encoding the sound information into audio data, send the audio data to the office according to the audio IP address and port information of the peer set top box. Said the opposite set-top box. Said mobile phone sends a stopaudio message;
  • the mobile phone is further configured to stop collecting sound information according to the received stopaudio message.
  • the local set top box is further configured to establish a communication connection with the mobile phone before calling the peer set top box;
  • the local set top box is further configured to establish a communication connection with the mobile phone after receiving the call of the opposite set top box and before responding to the call of the opposite set top box;
  • the local set top box is further configured to establish a communication connection with the mobile phone before receiving the call of the peer set top box.
  • the local set top box is further configured to periodically send a keepalive message to the mobile phone after establishing a communication connection with the mobile phone; if the response message returned by the mobile phone is not received within a set time period, Switch other devices to collect sound information.
  • the local set top box is further configured to: after receiving the video data sent by the peer set top box and the audio data sent by the peer mobile phone, and decoding the audio data transmitted by the connected mobile phone;
  • the peer mobile phone is a mobile phone that establishes a communication connection with the peer set top box.
  • the local set top box is further configured to determine that the audio collection device selected by the user is a mobile phone before sending the startaudio message to the mobile phone with which the communication connection is established.
  • a set top box including:
  • a mobile communication module for establishing a communication connection with the mobile phone
  • a video call control module configured to send a call start notification to the mobile phone communication module after establishing a call with the peer set top box;
  • the mobile phone communication module is further configured to establish communication after receiving the call start notification
  • the connected mobile phone sends a startaudio message;
  • the startaudio message carries the IP address and port information of the peer set top box;
  • a video data encoding module configured to acquire image information collected by an image capturing device connected to the set top box, and encode the acquired image information into video data;
  • the video call control module is further configured to send the video data encoded by the video data encoding module to the peer set top box.
  • the video call control module is further configured to send a call end notification to the mobile phone communication module when the call with the peer set top box ends;
  • the mobile phone communication module is further configured to send a stopaudio message to the mobile phone after receiving the call end notification.
  • the communication connection between the mobile phone communication module and the mobile phone is specifically as follows:
  • the mobile phone communication module establishes a communication connection with the mobile phone before the video call control module calls the peer set top box;
  • the mobile phone communication module establishes a communication connection with the mobile phone before the video call control module receives the call of the opposite set top box.
  • the video call control module is further configured to: after the mobile phone communication module establishes a communication connection with the mobile phone, periodically send a notification to the mobile phone communication module to detect that the connection is valid; and the mobile phone communication module detects according to the received Connecting a valid notification, sending a keepalive message to the mobile phone; if the response message returned by the mobile phone is not received within the set time period after the keepalive message is sent, the mobile phone communication module returns to the video call control module Invalid connection notification;
  • the video call control module is further configured to switch other devices to collect sound information after receiving the notification that the connection is invalid.
  • the video call control module is further configured to receive video data sent by the peer set-top box and audio data sent by the peer mobile phone, where the peer mobile phone establishes a communication connection with the peer set-top box.
  • the set top box further includes:
  • a decoding module configured to decode the audio and video data received by the video call control module, and send the audio and video data to the television connected to the set top box for playing.
  • the system further comprises:
  • An audio collection device selection module for providing a user with a selectable audio collection device and determining an audio collection device selected by the user;
  • the video call control module is further configured to determine that the selected audio collection device is a mobile phone before sending the call start notification to the mobile phone communication module.
  • a mobile phone including:
  • a local area network device connection module configured to establish a communication connection with a device in the local area network, and receive a message sent by the device that establishes a communication connection in the local area network;
  • a message parsing module configured to parse a message received by the local area network device connection module; if the message is a startaudio message, send an audio collection notification;
  • An audio collection and transmission module configured to start collecting sound information after receiving the audio collection notification sent by the message parsing module; and encoding the audio information into audio data, according to the audio IP address and port carried in the startaudio message The information transmits the audio data.
  • the message parsing module is further configured to: if the received message is parsed as a stopaudio message, send a stop audio collection notification to the audio collection and transmission module;
  • the audio collection and transmission module is further configured to stop collecting sound information according to the stopping the audio collection notification.
  • the message parsing module is further configured to: if the received message is parsed as a keepalive message, return a response message by using the local area network device connection module.
  • the set-top box in the video call system of the embodiment of the present invention can send a message to the mobile phone, notify the mobile phone to collect the sound information, and send the sound information to the set-top box of the opposite end through the mobile phone; and acquire the image information through the camera device and then send the image information to the set-top box.
  • the user can have a certain distance from the camera device, which is convenient for the camera device to obtain better quality video images; and the mobile phone can be placed at a close distance of the user to obtain better sound quality; The purpose of video quality.
  • the user can be in front of the TV, put the phone around or hold the ear (or through the phone's earphones) for video calls, and it is also in line with the user's usual habits.
  • FIG. 1 is a schematic diagram of a video call system based on a set top box according to an embodiment of the present invention
  • FIG. 2 is a flowchart of a video call method based on a set top box according to an embodiment of the present invention
  • FIG. 3 is a block diagram showing the internal structure of a set top box and a mobile phone according to an embodiment of the present invention.
  • module is intended to include a computer-related entity such as, but not limited to, hardware, firmware, hardware and software combinations, software, or software in execution.
  • a module can be, but is not limited to: a process running on a processor, a processor, an object, an executable, a thread of execution, a program, and/or a computer.
  • an application running on a computing device and this computing device can both be modules.
  • One or more modules may be located within a process and/or thread in execution.
  • the main idea of the present invention is that on the basis of the existing set top box supporting the WIFI and the network, the sound information is obtained by the mobile phone and then sent to the set top box of the opposite end, and the image information is acquired by the camera device disposed at the upper end of the television, and the set top box obtains the image information of the camera device. Sended to the other party's set-top box, the peer set-top box receives the sound information and image information and then plays the image and sound through the TV.
  • the user can be separated from the camera device at the upper end of the TV, and the video range of the camera device can be larger, which is more suitable for the conference occasion; and the mobile phone can be placed at a close distance of the user, for example, at the user's side or even at the mouth. , or by talking on the phone's mic, you can get better sound quality.
  • the user can be in front of the TV, put the phone around or hold the ear (or through the phone's earphones) for video calls, and it is also in line with the user's usual habits.
  • the set-top box-based video call system of the embodiment of the present invention includes: a local set top box 101, a mobile phone 102 that has established a communication connection with the local set top box, an image capturing device 104 connected to the local set top box 101, and a local set top box 101.
  • TV 105 and peer set top box 103 includes: a local set top box 101, a mobile phone 102 that has established a communication connection with the local set top box, an image capturing device 104 connected to the local set top box 101, and a local set top box 101.
  • TV 105 and peer set top box 103 includes: a local set top box 101, a mobile phone 102 that has established a communication connection with the local set top box, an image capturing device 104 connected to the local set top box 101, and a local set top box 101.
  • TV 105 and peer set top box 103 includes: a local set top box 101, a mobile phone 102 that has established a communication connection with the local set top box, an image capturing device
  • a method for performing a video call in a set-top box-based video call system includes the following steps:
  • S201 The local set top box 101 establishes a call with the peer set top box 103.
  • the local set top box 101 is responsible for establishing a call with the peer set top box 103: the local set top box 101 acts as the calling party after the user runs the video calling program of the local set top box 101, and the peer set top box 103 acts as the called party established by the called party; The set top box 103 acts as a calling party, and the local set top box 101 acts as a call established by the called party.
  • the local set top box 101 signaling control protocol supports SIP (Session Initiation Protocol), and can be used as an IMS (Ip Multimedia Subsystem, IP Multimedia Subsystem) network video call terminal; its voice coding supports G711.
  • the method for establishing a call between the local set top box 101 and the peer set top box 103 is a technique well known to those skilled in the art, and details are not described herein again.
  • the local set top box 101 sends a startaudio message to the handset 102 that has previously established a connection with the local set top box 101, that is, the handset 102 establishes a communication connection with the local set top box 101 in advance.
  • the startaudio message carries the audio IP address and port information of the peer set top box 103.
  • the local set top box 101 and the handset 102 can establish a connection at one of the following times: Opportunity A: The local set top box 101 establishes a communication connection with the handset 102 prior to calling the peer set top box 103.
  • Timing B The local set top box 101 establishes a communication connection with the handset 102 after receiving the call to the peer set top box 103 and before responding to the call to the peer set top box 103.
  • Timing C The local set top box 101 establishes a communication connection with the handset 102 before receiving the call to the peer set top box 103. Specifically, after the user selects the mobile phone as the collection device of the sound information after running the video call program of the local set top box 101, the local set top box 101 establishes a communication connection with the mobile phone 102.
  • the method for establishing the connection between the local set top box 101 and the mobile phone 102 can be as follows:
  • the local set top box 101 scans the mobile phone in the same local area network by sending a broadcast message; if a mobile phone is scanned, a communication connection is established with the mobile phone; if the local set top box 101 scans a plurality of mobile phones, the user is prompted to make a selection, and the user selects one of them. After the mobile phone, the local set top box 101 establishes a communication connection with the mobile phone selected by the user. Specifically, the local set top box 101 can establish a communication connection with the mobile phone through the WIFI.
  • the method for establishing a communication connection between the local set top box 101 and the mobile phone in the same local area network is a technology well known to those skilled in the art, and details are not described herein again.
  • Method B The local set top box 101 establishes a communication connection with the mobile phone of the IP address according to the IP address specified by the user.
  • the mobile phone 102 After establishing a communication connection with the local set top box 101, the mobile phone 102 can display corresponding prompt information on the mobile phone.
  • the mobile phone 102 After acquiring the sound information, the mobile phone 102 encodes the sound information into audio data, and sends the sound information to the peer set top box 103.
  • the mobile phone 102 After acquiring the voice information, the mobile phone 102 encodes and packages the voice information into an RTP data packet; and according to the audio IP (Internet Protocol) address of the peer set top box 103. And the port information, the RTP data packet is sent to the peer set top box 103.
  • RTP Internet Protocol
  • the local set top box 101 acquires image information acquired by the camera unit 104.
  • the step of acquiring, by the local set top box 101, the image information collected by the camera device 104 in the step S203 may be the same as the step of collecting and acquiring the sound information by the mobile phone 102 in step S203 after the local set top box 101 establishes a call with the opposite set top box 103. get on.
  • the local set top box 101 encodes the acquired image information into video data, and sends the image information to the peer set top box 103.
  • the camera device 104 can be installed at a position with a better imaging effect. For example, the user needs to watch the image of the other party's caller playing on the television, usually located in front of the television, and the local camera device 104 is installed at the upper end of the television. It is easy to take pictures of users located in front of the TV.
  • the connection mode of the camera device 104 with the local set top box 101 can be either a wired connection or a wireless connection. After acquiring the image information, the camera device 104 transmits the image information to the local set top box 101, and the local set top box 101 encodes the acquired image information into video data, and then sends the image information to the peer set top box 103.
  • the peer set top box 103 decodes the video data sent by the local set top box 101 and the audio data sent by the mobile phone 102, and performs video and audio playback through the television connected to the peer set top box 103.
  • the local set top box 101 decodes the video data sent by the peer set top box 103 and the audio data sent by the peer mobile phone 106, and plays the audio and video through the television 105.
  • the peer mobile phone 106 is a mobile phone that establishes a communication connection with the peer set top box 103, and the peer set top box 103 also sends a startaudio message to the opposite mobile phone 106 after establishing a call with the local set top box 101.
  • the peer mobile phone 106 collects the voice information according to the startaudio message, and after the voice information is encoded into the audio data, sends the audio data to the local set top box 101 according to the audio IP address and port information of the local set top box 101 carried in the startaudio message;
  • the set top box 103 sends the video data to the local set top box 101.
  • the local set top box 101 decodes the video data sent by the peer set top box 103 and the audio data sent by the peer mobile phone 106, and plays the audio and video through the television 105.
  • S210 The mobile phone 102 ends the call state, stops collecting, and acquires the obtained sound information.
  • the local set top box 101 After the local set top box 101 establishes a communication connection with the mobile phone 102, in order to ensure that the communication connection between the local set top box 101 and the mobile phone 102 remains normal, the local set top box 101 periodically sends a keepalive message to the mobile phone 102 (eg, every 120 seconds). (Keep valid message); After receiving the keepalive message, the mobile phone 102 returns a response message to the local set top box 101.
  • a keepalive message eg, every 120 seconds.
  • the local set top box 101 After the local set top box 101 sends the keepalive message, if the local set top box 101 does not receive the response message returned by the mobile phone 102 within the set time period, the mobile phone 102 is considered to be in an abnormal state, and the local set top box 101 automatically switches other devices to collect sound information, for example, The default device, the MIC (Mike) that comes with the camera, collects sound information.
  • the default device, the MIC (Mike) that comes with the camera, collects sound information.
  • the local set top box 101 sends a keepalive message (maintaining a valid message) to the mobile phone 102 periodically (eg, every 120 seconds) after transmitting the startaudio message to the mobile phone 102, and before transmitting the stopaudio message to the mobile phone 102 to end the call; If the local set top box 101 does not receive the response message returned by the mobile phone 102 within the set time period, the mobile phone 102 is considered to be in an abnormal state, and the local set top box 101 automatically switches other devices to collect sound information, for example, switching to the default device, that is, the MIC of the camera itself. (Mike) collects sound information.
  • the local set top box 101 may specifically be a set top box based on an Android (Android) system; the mobile phone 102 may specifically be a mobile phone of the Android system.
  • Android Android
  • the local set top box 101 can also select the audio collection device before sending the startaudio message to the mobile phone 102. After determining that the audio collection device selected by the user is the mobile phone, the above steps S202-S208 are performed. If the audio collection device selected by the user is a microphone, the local set top box 101 uses the prior art method to collect audio and video information, and details are not described herein.
  • FIG. 3 An internal structure of a set top box and a mobile phone according to an embodiment of the present invention is shown in FIG. 3.
  • the set top box includes: a mobile phone communication module 301, a video call control module 302, and a video data encoding module 303.
  • the mobile phone includes: a local area network device connection module 311, a message parsing module 312, and an audio collection and transmission module 313.
  • the mobile phone communication module 301 of the set top box is used to establish a communication connection with the mobile phone. Specifically, the mobile phone communication module 301 can notify the mobile communication module 301 to establish a communication connection with the mobile phone before the video call control module 302 calls the opposite set top box. Or; the mobile phone communication module 301 is before the video call control module 302 receives the call of the opposite set top box, The mobile phone communication module 301 is notified to establish a communication connection with the mobile phone.
  • the local area network device connection module 311 of the mobile phone establishes a communication connection with the device in the local area network, that is, the local area network device connection module 311 responds to the communication connection request of the set top box in the local area network, and establishes a communication connection with the local area network device connection module 311.
  • the set-top box mobile communication module 301 can send a message to the mobile phone after establishing a communication connection with the mobile phone.
  • the LAN device connection module of the mobile phone 311 can establish a communication connection with the set-top box, and then can receive the message sent by the set-top box (that is, the device that establishes the communication connection in the local area network).
  • the video call control module 302 of the set top box is configured to establish a call with the peer set top box, and send a call start notification to the mobile phone communication module 301 after the call is established;
  • the mobile phone communication module 301 After receiving the call start notification, the mobile phone communication module 301 sends a startaudio message to the mobile phone that establishes the communication connection; the startaudio message carries the IP address and port information of the peer set top box;
  • the local area network device connection module 311 of the mobile phone After receiving the message sent by the set top box, the local area network device connection module 311 of the mobile phone sends a message to the message parsing module 312 for parsing.
  • the message parsing module 312 of the mobile phone parses the message received by the local area network device connection module 311. If the message is parsed as a startaudio message, the audio collection and transmission module 313 sends an audio collection notification.
  • the audio collection and transmission module 313 of the mobile phone After receiving the audio collection notification sent by the message parsing module 312, the audio collection and transmission module 313 of the mobile phone starts to collect the audio information; and after encoding the audio information into the audio data, according to the audio IP address and port information carried in the startaudio message. The audio data is transmitted.
  • the video data encoding module 303 of the set top box acquires the image information collected by the camera device connected to the set top box, and encodes the obtained image information into video data;
  • the video call control module 302 of the set top box sends the video data encoded by the video data encoding module 303 to the peer set top box.
  • the video call control module 302 of the set top box also sends a call end notification to the mobile phone communication module 301 when the call with the opposite set top box ends;
  • the mobile phone communication module 301 After receiving the call end notification, the mobile phone communication module 301 sends a startaudio message to the mobile phone.
  • the local area network device connection module 311 of the mobile phone After receiving the message sent by the set top box, the local area network device connection module 311 of the mobile phone sends a message to the message parsing module 312 for parsing. If the message parsing module 312 of the mobile phone parses the message received by the local area network device connection module 311 as a stopaudio message, sends a stop audio collection notification to the audio collection and transmission module 313; and the audio collection and transmission module 313 stops collecting according to the stop audio collection notification. Sound information.
  • the video call control module 302 of the set top box periodically sends a notification to the mobile phone communication module 301 to detect that the connection is valid; the mobile phone communication module 301 sends a keepalive to the mobile phone according to the received notification that the connection is valid. If the message parsing module 312 of the mobile phone parses the message received by the local area network device connection module 311, if the message is a keepalive message, the local area network device connection module 311 returns a response message;
  • the mobile phone control module 302 If the mobile phone communication module 301 does not receive the response message returned by the mobile phone within the set time period after transmitting the keepalive message, the mobile phone control module 302 returns a notification that the connection is invalid; the video call control module 302 receives the connection invalid. After the notification, switch other devices to collect sound information.
  • the video call control module 302 of the set top box can also receive the video data sent by the peer set top box and the audio data sent by the peer mobile phone; wherein the peer mobile phone is a mobile phone that establishes a communication connection with the peer set top box; as well as
  • the set top box further includes: a decoding module 304.
  • the decoding module 304 is configured to decode the audio and video data received by the video call control module 302 and send the data to the television connected to the set top box for playing.
  • the set-top box in the video call system of the embodiment of the present invention can send a message to the mobile phone, notify the mobile phone to collect the sound information, and send the sound information to the set-top box of the opposite end through the mobile phone; and acquire the image information by the camera device and send the image information to the peer set-top box. .
  • the user can have a certain distance from the camera device, which is convenient for the camera device to obtain better quality video images; and the mobile phone can be placed at a close distance of the user to obtain better sound quality; The purpose of video quality.
  • the user can be in front of the TV, put the phone around or hold the ear (or through the phone's earphones) for video calls, and it is also in line with the user's usual habits.

Landscapes

  • Telephonic Communication Services (AREA)
  • Telephone Function (AREA)

Abstract

Disclosed are a set top box based video conversation method and system. The method comprises: a local set top box setting up a conversation with a set top box of a peer end, and sending a startaudio massage to a mobile phone; after receiving the startaudio massage, the mobile phone collecting acoustic information; coding the acoustic information into audio data, and sending the audio data to the set top box of the peer end; and meanwhile, the local set top box acquiring image information collected by a camera device, and coding the acquired image information into video data and sending the video data to the set top box of the peer end. Because the acoustic information can be acquired by the mobile phone and then sent to the set top box of the peer end, and the image information is acquired by the camera device and then sent to the set top box at the peer end, certain distance is kept from a user to the camera device, thereby facilitating photographing of the camera device, so as to obtain video images with good quality. Moreover, the mobile phone can be placed at the short distance from the user, and good acoustic quality can be obtained.

Description

基于机顶盒的视频通话方法及系统  Video call method and system based on set top box
技术领域 Technical field
本发明涉及通信技术, 尤其涉及一种视频通话方法及系统。 背景技术  The present invention relates to communication technologies, and in particular, to a video call method and system. Background technique
随着科技的发展, 数字化越来越深入生活。 当前 PC早就智能化, 手机和 平板也在大面积智能化, 而电视 TV也在近年走向智能化。 目前推出的智能电 视拥有传统电视所不具备的应用平台优势。 智能电视基于机顶盒实现了网络 搜索、 IP电视、 视频点播、 数字音乐、 网络新闻、 网络视频电话等各种应用服 务。这些应用服务的基 都是建立在机顶盒连接到网络的基 之上。随着 WIFI 技术的普及, 机顶盒通过 WIFI等无线网络连接实现联网也越来越受普遍。  With the development of technology, digitalization is getting more and more life. The PC has long been intelligent, and mobile phones and tablets are also becoming smarter in a large area, and TV TVs are becoming more intelligent in recent years. The current smart TV has the advantage of an application platform that is not available in traditional TV. Smart TVs enable a variety of application services such as network search, IP TV, video on demand, digital music, online news, and network video telephony based on set-top boxes. These application services are based on the set-top box connected to the network. With the popularity of WIFI technology, set-top boxes are becoming more and more popular through wireless network connections such as WIFI.
在现有技术中的基于机顶盒实现视频通话功能中, 一般是通过自带麦克 的摄像头进行视频、 音频的采集; 但是自带麦克的摄像头通常不能同时兼顾 视频和音频的质量, 因为如果摄像头离的太近, 视频范围太小; 如果摄像头 离的稍远, 则采集的声音的质量不好。  In the prior art based on the set-top box to implement the video call function, the video and audio are generally collected by the camera with the microphone; however, the camera with the microphone usually cannot simultaneously take care of the quality of the video and audio, because if the camera is away Too close, the video range is too small; if the camera is farther away, the quality of the collected sound is not good.
公开号为 102387335A的专利文件则公开了一种基于手机通过机顶盒实现 可视通话的方法及系统, 在该系统中利用手机采集视频和音频, 手机将采集 的音视频数据打包发送给机顶盒, 由机顶盒通过网络发送给对方机顶盒。 然 而, 该方法仍然存在手机离的太近, 视频范围太小; 手机离的稍远, 采集声 音质量不好的问题。 并且, 为了手机能够同时采集到音、 视频, 用户需要将 手机置于面前, 不符合用户平时使用手机的习惯。  The patent document with the publication number 102387335A discloses a method and a system for realizing a visual call based on a mobile phone through a set top box. In the system, the mobile phone collects video and audio, and the mobile phone packages the collected audio and video data to the set top box, and the set top box Send to the other party's set-top box via the network. However, the method still has the mobile phone too close, the video range is too small; the mobile phone is far away, and the sound quality is not good. Moreover, in order for the mobile phone to collect audio and video at the same time, the user needs to put the mobile phone in front of the user, which does not meet the user's habit of using the mobile phone.
综上所述, 现有技术的基于机顶盒进行视频通话的方法都不能同时兼顾 视频和音频的质量。 发明内容  In summary, the prior art method of making a video call based on a set top box cannot simultaneously take into account the quality of video and audio. Summary of the invention
本发明实施例提供了一种基于机顶盒的视频通话方法及系统, 可以同时 兼顾视频和音频的质量。  The embodiment of the invention provides a video call method and system based on a set top box, which can simultaneously consider the quality of video and audio.
根据本发明的一个方面, 提供了一种基于机顶盒的视频通话方法, 包括: 本地机顶盒在与对端机顶盒建立通话后, 将 startaudio消息发送给与之建 立了通信连接的手机;所述 startaudio消息中携带有所述对端机顶盒的 IP地址 和端口信息;  According to an aspect of the present invention, a set-top box-based video calling method is provided, including: after a local set-top box establishes a call with a peer set-top box, sending a startaudio message to a mobile phone with which a communication connection is established; in the startaudio message Carrying the IP address and port information of the peer set top box;
所述手机在接收到所述 startaudio消息后, 开始采集声音信息; 并将声音 信息编码为音频数据后, 根据所述对端机顶盒的音频 IP地址和端口信息将所 述音频数据发送给所述对端机顶盒; 并且, After receiving the startaudio message, the mobile phone starts to collect sound information; After the information is encoded into audio data, the audio data is sent to the peer set top box according to the audio IP address and port information of the peer set top box; and,
所述本地机顶盒获取与之连接的摄像装置采集的图像信息后, 将获取的 图像信息编码为视频数据后发送给所述对端机顶盒;  After acquiring the image information collected by the camera device connected thereto, the local set top box encodes the acquired image information into video data and sends the image information to the peer set top box;
所述对端机顶盒接收到所述视频数据和音频数据后解码, 并通过与之连 接的电视播放解码后的音、 视频。  The peer set top box decodes the video data and the audio data, and plays the decoded sound and video through the connected TV.
进一步, 在本地机顶盒与对端机顶盒通话结束时, 所述本地机顶盒向所 述手机发送 stopaudio消息;  Further, when the local set top box and the peer set top box end the call, the local set top box sends a stopaudio message to the mobile phone;
所述手机根据接收的 stopaudio消息停止采集声音信息。  The mobile phone stops collecting sound information according to the received stopaudio message.
进一步, 在所述本地机顶盒将 startaudio消息发送给与之建立了通信连接 的手机之前, 所述方法还包括:  Further, before the local set top box sends a startaudio message to the mobile phone with which the communication connection is established, the method further includes:
所述本地机顶盒在呼叫对端机顶盒之前, 与所述手机建立通信连接; 或者, 所述本地机顶盒在接收到所述对端机顶盒的呼叫后, 并在响应所 述对端机顶盒的呼叫之前, 与所述手机建立通信连接;  The local set top box establishes a communication connection with the mobile phone before calling the peer set top box; or the local set top box receives the call of the opposite set top box, and before responding to the call of the opposite set top box, The mobile phone establishes a communication connection;
或者, 所述本地机顶盒在接收到所述对端机顶盒的呼叫之前, 与所述手 机建立通信连接。  Alternatively, the local set top box establishes a communication connection with the mobile phone before receiving the call of the opposite set top box.
进一步, 在所述本地机顶盒与所述手机建立通信连接后, 所述本地机顶 盒周期性地向所述手机发送 keepalive消息;  Further, after the local set top box establishes a communication connection with the mobile phone, the local set top box periodically sends a keepalive message to the mobile phone;
在所述本地机顶盒发送 keepalive消息后, 若设定时间段内没有接收到所 述手机返回的回应消息, 则切换其它设备进行声音信息采集。  After the local set top box sends a keepalive message, if the response message returned by the mobile phone is not received within the set time period, the other device is switched to perform sound information collection.
其中, 所述本地机顶盒与所述手机建立通信连接具体为:  The establishing, by the local set top box, the communication connection between the local set top box and the mobile phone is:
所述本地机顶盒通过发送广播消息扫描同一局域网内的手机; 若扫描到 一个手机, 则与之建立通信连接; 若扫描到多个手机, 则提示用户进行选择, 并与用户选择的手机建立通信连接;  The local set top box scans the mobile phone in the same local area network by sending a broadcast message; if scanning a mobile phone, establishing a communication connection with the mobile phone; if scanning a plurality of mobile phones, prompting the user to make a selection, and establishing a communication connection with the mobile phone selected by the user ;
或者, 所述本地机顶盒根据用户指定的 IP地址与手机建立通信连接。 进一步, 所述对端机顶盒向所述本地机顶盒发送视频数据; 并  Alternatively, the local set top box establishes a communication connection with the mobile phone according to the IP address specified by the user. Further, the peer set top box sends video data to the local set top box; and
对端手机向所述本地机顶盒发送音频数据; 所述对端手机为与所述对端 机顶盒建立了通信连接的手机;  The peer mobile phone sends audio data to the local set top box; the peer mobile phone is a mobile phone that establishes a communication connection with the peer set top box;
所述本地机顶盒在接收到所述对端机顶盒发送的视频数据, 以及所述对 端手机发送的音频数据后进行解码, 并通过与之连接的电视播放音、 视频。  The local set top box decodes the video data sent by the peer set top box and the audio data sent by the peer mobile phone, and plays the sound and video through the television connected thereto.
较佳地, 在所述本地机顶盒将 startaudio消息发送给与之建立了通信连接 的手机之前, 还包括: 所述本地机顶盒确定用户选择的音频采集设备为手机。 Preferably, before the local set top box sends the startaudio message to the mobile phone with which the communication connection is established, the method further includes: The local set top box determines that the audio collection device selected by the user is a mobile phone.
根据本发明的另一个方面, 还提供了一种基于机顶盒的视频通话系统, 包括: 本地机顶盒, 以及与所述本地机顶盒建立了通信连接的手机;  According to another aspect of the present invention, a set-top box-based video calling system is further provided, comprising: a local set top box, and a mobile phone having a communication connection with the local set top box;
所述本地机顶盒用于在与对端机顶盒建立通话后, 将 startaudio消息发送 给所述手机; 并获取与之连接的摄像装置采集的图像信息后, 将获取的图像 信息编码为视频数据后发送给所述对端机顶盒; 其中, 所述 startaudio消息中 携带有所述对端机顶盒的 IP地址和端口信息;  The local set top box is configured to send a startaudio message to the mobile phone after establishing a call with the peer set top box; and obtain image information collected by the camera device connected thereto, and then encode the acquired image information into video data and send the image information to The peer-end set-top box; wherein the startaudio message carries the IP address and port information of the peer set-top box;
所述手机用于在接收到所述 startaudio消息后, 开始采集声音信息; 并将 声音信息编码为音频数据后, 根据所述对端机顶盒的音频 IP地址和端口信息 将所述音频数据发送给所述对端机顶盒。 述手机发送 stopaudio消息; 以及  The mobile phone is configured to start collecting sound information after receiving the startaudio message; and after encoding the sound information into audio data, send the audio data to the office according to the audio IP address and port information of the peer set top box. Said the opposite set-top box. Said mobile phone sends a stopaudio message;
所述手机还用于根据接收的 stopaudio消息停止采集声音信息。  The mobile phone is further configured to stop collecting sound information according to the received stopaudio message.
较佳地, 所述本地机顶盒还用于在呼叫对端机顶盒之前, 与所述手机建 立通信连接;  Preferably, the local set top box is further configured to establish a communication connection with the mobile phone before calling the peer set top box;
或者, 所述本地机顶盒还用于在接收到所述对端机顶盒的呼叫后, 并在 响应所述对端机顶盒的呼叫之前, 与所述手机建立通信连接;  Or the local set top box is further configured to establish a communication connection with the mobile phone after receiving the call of the opposite set top box and before responding to the call of the opposite set top box;
或者, 所述本地机顶盒还用于在接收到所述对端机顶盒的呼叫之前, 与 所述手机建立通信连接。  Alternatively, the local set top box is further configured to establish a communication connection with the mobile phone before receiving the call of the peer set top box.
较佳地, 所述本地机顶盒还用于在与所述手机建立通信连接后, 周期性 地向所述手机发送 keepalive消息; 若设定时间段内没有接收到所述手机返回 的回应消息, 则切换其它设备进行声音信息采集。  Preferably, the local set top box is further configured to periodically send a keepalive message to the mobile phone after establishing a communication connection with the mobile phone; if the response message returned by the mobile phone is not received within a set time period, Switch other devices to collect sound information.
较佳地, 所述本地机顶盒还用于在接收到所述对端机顶盒发送的视频数 据, 以及对端手机发送的音频数据后进行解码, 并通过与之连接的电视播放 音、 视频; 其中, 所述对端手机为与所述对端机顶盒建立了通信连接的手机。  Preferably, the local set top box is further configured to: after receiving the video data sent by the peer set top box and the audio data sent by the peer mobile phone, and decoding the audio data transmitted by the connected mobile phone; The peer mobile phone is a mobile phone that establishes a communication connection with the peer set top box.
较佳地, 所述本地机顶盒还用于在将 startaudio消息发送给与之建立了通 信连接的手机之前, 确定用户选择的音频采集设备为手机。  Preferably, the local set top box is further configured to determine that the audio collection device selected by the user is a mobile phone before sending the startaudio message to the mobile phone with which the communication connection is established.
根据本发明的另一个方面, 还提供了一种机顶盒, 包括:  According to another aspect of the present invention, a set top box is also provided, including:
手机通讯模块, 用于与手机建立通信连接;  a mobile communication module for establishing a communication connection with the mobile phone;
视频通话控制模块, 用于在与对端机顶盒建立通话后, 向所述手机通讯 模块发送通话开始通知;  a video call control module, configured to send a call start notification to the mobile phone communication module after establishing a call with the peer set top box;
所述手机通讯模块还用于在接收到所述通话开始通知后, 向建立了通信 连接的手机发送 startaudio消息;所述 startaudio消息中携带有所述对端机顶盒 的 IP地址和端口信息; The mobile phone communication module is further configured to establish communication after receiving the call start notification The connected mobile phone sends a startaudio message; the startaudio message carries the IP address and port information of the peer set top box;
视频数据编码模块, 用于获取与所述机顶盒连接的摄像装置采集的图像 信息后, 将获取的图像信息编码为视频数据;  a video data encoding module, configured to acquire image information collected by an image capturing device connected to the set top box, and encode the acquired image information into video data;
所述视频通话控制模块还用于将所述视频数据编码模块编码的视频数据 发送给所述对端机顶盒。  The video call control module is further configured to send the video data encoded by the video data encoding module to the peer set top box.
进一步, 所述视频通话控制模块还用于在与所述对端机顶盒通话结束时, 向所述手机通讯模块发送通话结束通知; 以及  Further, the video call control module is further configured to send a call end notification to the mobile phone communication module when the call with the peer set top box ends;
所述手机通讯模块还用于在接收到所述通话结束通知后, 向所述手机发 送 stopaudio消息。  The mobile phone communication module is further configured to send a stopaudio message to the mobile phone after receiving the call end notification.
其中, 所述手机通讯模块与手机建立通信连接具体为:  The communication connection between the mobile phone communication module and the mobile phone is specifically as follows:
所述手机通讯模块在所述视频通话控制模块呼叫对端机顶盒之前, 与所 述手机建立通信连接;  The mobile phone communication module establishes a communication connection with the mobile phone before the video call control module calls the peer set top box;
或者, 所述手机通讯模块在所述视频通话控制模块接收到所述对端机顶 盒的呼叫之前, 与所述手机建立通信连接。  Alternatively, the mobile phone communication module establishes a communication connection with the mobile phone before the video call control module receives the call of the opposite set top box.
较佳地, 所述视频通话控制模块还用于在所述手机通讯模块与手机建立 通信连接后, 周期性向所述手机通讯模块发送检测连接有效的通知; 以及 所述手机通讯模块根据接收的检测连接有效的通知, 向所述手机发送 keepalive消息; 若在发送 keepalive消息后, 设定时间段内没有接收到所述手 机返回的回应消息, 则所述手机通讯模块向所述视频通话控制模块返回连接 无效的通知;  Preferably, the video call control module is further configured to: after the mobile phone communication module establishes a communication connection with the mobile phone, periodically send a notification to the mobile phone communication module to detect that the connection is valid; and the mobile phone communication module detects according to the received Connecting a valid notification, sending a keepalive message to the mobile phone; if the response message returned by the mobile phone is not received within the set time period after the keepalive message is sent, the mobile phone communication module returns to the video call control module Invalid connection notification;
所述视频通话控制模块还用于在接收到连接无效的通知后, 切换其它设 备进行声音信息采集。  The video call control module is further configured to switch other devices to collect sound information after receiving the notification that the connection is invalid.
进一步, 所述视频通话控制模块还用于接收到所述对端机顶盒发送的视 频数据, 以及对端手机发送的音频数据; 其中, 所述对端手机为与所述对端 机顶盒建立了通信连接的手机; 以及  Further, the video call control module is further configured to receive video data sent by the peer set-top box and audio data sent by the peer mobile phone, where the peer mobile phone establishes a communication connection with the peer set-top box. Mobile phone;
所述机顶盒还包括:  The set top box further includes:
解码模块, 用于将所述视频通话控制模块接收的音、 视频数据解码后发 送给与所述机顶盒连接的电视进行播放。  And a decoding module, configured to decode the audio and video data received by the video call control module, and send the audio and video data to the television connected to the set top box for playing.
较佳地, 所述系统还包括:  Preferably, the system further comprises:
音频采集设备选择模块, 用于为用户提供可选择的音频采集设备, 并确 定用户所选择的音频采集设备; 以及 所述视频通话控制模块还用于在向所述手机通讯模块发送通话开始通知 之前, 确定所选择的音频采集设备为手机。 An audio collection device selection module for providing a user with a selectable audio collection device and determining an audio collection device selected by the user; The video call control module is further configured to determine that the selected audio collection device is a mobile phone before sending the call start notification to the mobile phone communication module.
根据本发明的另一个方面, 还提供了一种手机, 包括:  According to another aspect of the present invention, a mobile phone is provided, including:
局域网设备连接模块, 用于与局域网内的设备建立通信连接, 并接收所 述局域网内建立了通信连接的设备发送的消息;  a local area network device connection module, configured to establish a communication connection with a device in the local area network, and receive a message sent by the device that establishes a communication connection in the local area network;
消息解析模块, 用于对所述局域网设备连接模块接收的消息进行解析; 若解析出该消息为 startaudio消息, 则发送音频采集通知;  a message parsing module, configured to parse a message received by the local area network device connection module; if the message is a startaudio message, send an audio collection notification;
音频采集发送模块, 用于在接收到所述消息解析模块发送的音频采集通 知后, 开始采集声音信息; 并将声音信息编码为音频数据后, 根据所述 startaudio消息中携带的音频 IP地址和端口信息将所述音频数据进行发送。  An audio collection and transmission module, configured to start collecting sound information after receiving the audio collection notification sent by the message parsing module; and encoding the audio information into audio data, according to the audio IP address and port carried in the startaudio message The information transmits the audio data.
进一步,所述消息解析模块还用于若解析出接收的消息为 stopaudio消息, 则向所述音频采集发送模块发送停止音频采集通知; 以及  Further, the message parsing module is further configured to: if the received message is parsed as a stopaudio message, send a stop audio collection notification to the audio collection and transmission module;
所述音频采集发送模块还用于根据所述停止音频采集通知, 停止采集声 音信息。  The audio collection and transmission module is further configured to stop collecting sound information according to the stopping the audio collection notification.
进一步,所述消息解析模块还用于若解析出接收的消息为 keepalive消息, 则通过所述局域网设备连接模块返回回应消息。  Further, the message parsing module is further configured to: if the received message is parsed as a keepalive message, return a response message by using the local area network device connection module.
本发明实施例的视频通话系统中的机顶盒由于可以向手机发送消息, 通 知手机采集声音信息, 通过手机在获取声音信息后发送给对端的机顶盒; 并 且通过摄像装置获取图像信息后发送给对方机顶盒。 这样, 用户可以离摄像 装置有一定的距离, 便于摄像装置的较好摄像, 得到质量较佳的视频图像; 而手机可以放于用户的近距离处, 获得较好的声音质量; 从而达到兼顾音视 频质量的目的。 而且, 用户可以位于电视前, 将手机放于身边或持于耳边(或 者通过手机的耳机)进行视频通话, 也非常符合用户平时的习惯。 附图说明  The set-top box in the video call system of the embodiment of the present invention can send a message to the mobile phone, notify the mobile phone to collect the sound information, and send the sound information to the set-top box of the opposite end through the mobile phone; and acquire the image information through the camera device and then send the image information to the set-top box. In this way, the user can have a certain distance from the camera device, which is convenient for the camera device to obtain better quality video images; and the mobile phone can be placed at a close distance of the user to obtain better sound quality; The purpose of video quality. Moreover, the user can be in front of the TV, put the phone around or hold the ear (or through the phone's earphones) for video calls, and it is also in line with the user's usual habits. DRAWINGS
图 1为本发明实施例的基于机顶盒的视频通话系统示意图;  1 is a schematic diagram of a video call system based on a set top box according to an embodiment of the present invention;
图 2为本发明实施例的基于机顶盒的视频通话方法流程图;  2 is a flowchart of a video call method based on a set top box according to an embodiment of the present invention;
图 3为本发明实施例的机顶盒和手机内部结构框图。  FIG. 3 is a block diagram showing the internal structure of a set top box and a mobile phone according to an embodiment of the present invention.
具体实施方式 detailed description
为使本发明的目的、 技术方案及优点更加清楚明白, 以下参照附图并举 出优选实施例, 对本发明进一步详细说明。 然而, 需要说明的是, 说明书中 列出的许多细节仅仅是为了使读者对本发明的一个或多个方面有一个透彻的 本申请使用的 "模块"、 "系统" 等术语旨在包括与计算机相关的实体, 例如但不限于硬件、 固件、 软硬件组合、 软件或者执行中的软件。 例如, 模 块可以是, 但并不仅限于: 处理器上运行的进程、 处理器、 对象、 可执行程 序、 执行的线程、 程序和 /或计算机。 举例来说, 计算设备上运行的应用程序 和此计算设备都可以是模块。 一个或多个模块可以位于执行中的一个进程和 / 或线程内。 The present invention will be further described in detail below with reference to the accompanying drawings. However, it should be noted that many of the details listed in the specification are only intended to provide the reader with a thorough understanding of one or more aspects of the present invention. The terms "module,""system," and the like, as used herein, are intended to include a computer-related entity such as, but not limited to, hardware, firmware, hardware and software combinations, software, or software in execution. For example, a module can be, but is not limited to: a process running on a processor, a processor, an object, an executable, a thread of execution, a program, and/or a computer. For example, an application running on a computing device and this computing device can both be modules. One or more modules may be located within a process and/or thread in execution.
本发明的主要思路为在现有支持 WIFI和网络的机顶盒基础上,通过手机 获取声音信息后发送给对端的机顶盒, 通过安置于电视上端的摄像装置获取 图像信息, 机顶盒将摄像装置获取的图像信息发送给对方机顶盒, 对端机顶 盒接收到声音信息和图像信息后通过电视播放图像和声音。 这样, 用户可以 离电视上端的摄像装置有一定的距离, 摄像装置的视频范围可以较大, 更适 合于会议的场合; 而手机可以放于用户的近距离处, 比如放于用户身边甚至 嘴边, 或者通过手机的麦克进行通话, 从而可以获得较好的声音质量。 而且, 用户可以位于电视前, 将手机放于身边或持于耳边(或者通过手机的耳机) 进行视频通话, 也非常符合用户平时的习惯。  The main idea of the present invention is that on the basis of the existing set top box supporting the WIFI and the network, the sound information is obtained by the mobile phone and then sent to the set top box of the opposite end, and the image information is acquired by the camera device disposed at the upper end of the television, and the set top box obtains the image information of the camera device. Sended to the other party's set-top box, the peer set-top box receives the sound information and image information and then plays the image and sound through the TV. In this way, the user can be separated from the camera device at the upper end of the TV, and the video range of the camera device can be larger, which is more suitable for the conference occasion; and the mobile phone can be placed at a close distance of the user, for example, at the user's side or even at the mouth. , or by talking on the phone's mic, you can get better sound quality. Moreover, the user can be in front of the TV, put the phone around or hold the ear (or through the phone's earphones) for video calls, and it is also in line with the user's usual habits.
下面结合附图详细说明本发明实施例的技术方案。 本发明实施例的基于 机顶盒的视频通话系统, 如图 1所示, 包括: 本地机顶盒 101、 与本地机顶盒 已建立通信连接的手机 102、 与本地机顶盒 101连接的摄像装置 104、 与本地 机顶盒 101连接的电视 105和对端机顶盒 103。  The technical solutions of the embodiments of the present invention are described in detail below with reference to the accompanying drawings. The set-top box-based video call system of the embodiment of the present invention, as shown in FIG. 1, includes: a local set top box 101, a mobile phone 102 that has established a communication connection with the local set top box, an image capturing device 104 connected to the local set top box 101, and a local set top box 101. TV 105 and peer set top box 103.
本发明实施例的基于机顶盒的视频通话系统进行视频通话的方法流程, 如图 2所示, 包括如下步骤:  A method for performing a video call in a set-top box-based video call system according to an embodiment of the present invention, as shown in FIG. 2, includes the following steps:
S201: 本地机顶盒 101与对端机顶盒 103建立通话。  S201: The local set top box 101 establishes a call with the peer set top box 103.
本地机顶盒 101 负责与对端机顶盒 103建立通话: 既可以是用户在运行 本地机顶盒 101的视频通话程序后, 本地机顶盒 101作为主叫, 对端机顶盒 103作为被叫建立的通话; 也可以是对端机顶盒 103作为主叫, 本地机顶盒 101 作为被叫建立的通话。 具体地, 本地机顶盒 101 信令控制协议支持 SIP ( Session Initiation Protocol ,会话发起十办议)十办议,可以作为 IMS( Ip Multimedia Subsystem, IP 多媒体子系统) 网络视频通话终端; 其语音编码支持 G711 ALAW/ULAW, G722, SILK编解码; 其视频编码支持 H.264编解码; 其视频采 集使用摄像装置 104, 音视频播放使用电视 105。 由于本地机顶盒 101与对端 机顶盒 103建立通话的方法为本领域技术人员所熟知的技术, 此处不再赘述。 S202: 本地机顶盒 101在与对端机顶盒 103建立通话后, 将 startaudio消 息(开始音频通话消息)发送给手机 102。 The local set top box 101 is responsible for establishing a call with the peer set top box 103: the local set top box 101 acts as the calling party after the user runs the video calling program of the local set top box 101, and the peer set top box 103 acts as the called party established by the called party; The set top box 103 acts as a calling party, and the local set top box 101 acts as a call established by the called party. Specifically, the local set top box 101 signaling control protocol supports SIP (Session Initiation Protocol), and can be used as an IMS (Ip Multimedia Subsystem, IP Multimedia Subsystem) network video call terminal; its voice coding supports G711. ALAW/ULAW, G722, SILK codec; its video coding supports H.264 codec; its video capture uses camera device 104, and audio and video playback uses TV 105. The method for establishing a call between the local set top box 101 and the peer set top box 103 is a technique well known to those skilled in the art, and details are not described herein again. S202: After establishing a call with the peer set top box 103, the local set top box 101 sends a startaudio message (starting an audio call message) to the mobile phone 102.
具体地, 本地机顶盒 101在与对端机顶盒 103建立通话后, 将 startaudio 消息发送给之前已经与本地机顶盒 101建立连接的手机 102, 也就是说, 手机 102预先与本地机顶盒 101建立了通信连接。 所述 startaudio消息中携带有对 端机顶盒 103的音频 IP地址和端口信息。  Specifically, after establishing a call with the peer set top box 103, the local set top box 101 sends a startaudio message to the handset 102 that has previously established a connection with the local set top box 101, that is, the handset 102 establishes a communication connection with the local set top box 101 in advance. The startaudio message carries the audio IP address and port information of the peer set top box 103.
本地机顶盒 101与手机 102可以在如下时机之一的时候建立连接: 时机 A: 本地机顶盒 101在呼叫对端机顶盒 103之前与手机 102建立通 信连接。  The local set top box 101 and the handset 102 can establish a connection at one of the following times: Opportunity A: The local set top box 101 establishes a communication connection with the handset 102 prior to calling the peer set top box 103.
时机 B: 本地机顶盒 101在接收到对端机顶盒 103的呼叫后, 并在响应 对端机顶盒 103的呼叫之前, 与手机 102建立通信连接。  Timing B: The local set top box 101 establishes a communication connection with the handset 102 after receiving the call to the peer set top box 103 and before responding to the call to the peer set top box 103.
时机 C: 本地机顶盒 101在接收到对端机顶盒 103的呼叫前, 与手机 102 建立通信连接。 具体地, 用户在运行本地机顶盒 101 的视频通话程序后, 选 择手机作为声音信息的采集设备后, 本地机顶盒 101 即与手机 102建立通信 连接。  Timing C: The local set top box 101 establishes a communication connection with the handset 102 before receiving the call to the peer set top box 103. Specifically, after the user selects the mobile phone as the collection device of the sound information after running the video call program of the local set top box 101, the local set top box 101 establishes a communication connection with the mobile phone 102.
本地机顶盒 101与手机 102建立连接的方法可以是如下方法:  The method for establishing the connection between the local set top box 101 and the mobile phone 102 can be as follows:
方法 A: 本地机顶盒 101通过发送广播消息扫描同一局域网内的手机; 若扫描到一个手机, 则与之建立通信连接; 若本地机顶盒 101 扫描到多个手 机, 则提示用户进行选择, 用户从中选择一个手机后, 本地机顶盒 101 与用 户选择的手机建立通信连接。 具体地, 本地机顶盒 101可以通过 WIFI与手机 建立通信连接; 本地机顶盒 101 与同一局域网内的手机建立通信连接的方法 为本领域技术人员所熟知的技术, 此处不再赘述。  Method A: The local set top box 101 scans the mobile phone in the same local area network by sending a broadcast message; if a mobile phone is scanned, a communication connection is established with the mobile phone; if the local set top box 101 scans a plurality of mobile phones, the user is prompted to make a selection, and the user selects one of them. After the mobile phone, the local set top box 101 establishes a communication connection with the mobile phone selected by the user. Specifically, the local set top box 101 can establish a communication connection with the mobile phone through the WIFI. The method for establishing a communication connection between the local set top box 101 and the mobile phone in the same local area network is a technology well known to those skilled in the art, and details are not described herein again.
方法 B:本地机顶盒 101根据用户指定的 IP地址与该 IP地址的手机建立 通信连接。  Method B: The local set top box 101 establishes a communication connection with the mobile phone of the IP address according to the IP address specified by the user.
手机 102在与本地机顶盒 101建立通信连接后, 可以在手机上显示相应 提示信息。  After establishing a communication connection with the local set top box 101, the mobile phone 102 can display corresponding prompt information on the mobile phone.
S203: 手机 102在接收到 startaudio消息后, 转入通话状态, 并开始采集、 获取获取声音信息。  S203: After receiving the startaudio message, the mobile phone 102 transfers to the call state, and starts collecting and obtaining the obtained sound information.
S204: 手机 102在获取声音信息后, 将声音信息编码为音频数据后, 发 送给对端机顶盒 103。  S204: After acquiring the sound information, the mobile phone 102 encodes the sound information into audio data, and sends the sound information to the peer set top box 103.
具体地, 手机 102在获取声音信息后, 将声音信息编码、 打包为 RTP数 据包; 并根据对端机顶盒 103的音频 IP ( Internet Protocol, 互联网协议)地址 和端口信息, 将 RTP数据包发送给对端机顶盒 103。 Specifically, after acquiring the voice information, the mobile phone 102 encodes and packages the voice information into an RTP data packet; and according to the audio IP (Internet Protocol) address of the peer set top box 103. And the port information, the RTP data packet is sent to the peer set top box 103.
S205: 本地机顶盒 101获取摄像装置 104采集的图像信息。  S205: The local set top box 101 acquires image information acquired by the camera unit 104.
这里需要指出的是, 为了便于描述, 本文中为各步骤进行了顺序编号, 但是这些顺序编号并非意味着时间上的严格顺序;  It should be noted here that, for the convenience of description, the steps are sequentially numbered in the present paper, but these sequential numbers do not mean a strict sequence in time;
事实上, S205中的本地机顶盒 101获取摄像装置 104采集的图像信息的 步骤可以是在本地机顶盒 101在与对端机顶盒 103建立通话后, 与步骤 S203 中手机 102采集、 获取获取声音信息的步骤同时进行。  In fact, the step of acquiring, by the local set top box 101, the image information collected by the camera device 104 in the step S203 may be the same as the step of collecting and acquiring the sound information by the mobile phone 102 in step S203 after the local set top box 101 establishes a call with the opposite set top box 103. get on.
S206: 本地机顶盒 101将获取的图像信息编码为视频数据后, 发送给对 端机顶盒 103。  S206: The local set top box 101 encodes the acquired image information into video data, and sends the image information to the peer set top box 103.
具体地, 摄像装置 104可以安装于有较好摄像效果的位置, 比如, 用户 需要观看电视中播放的对方通话者的形象, 通常会位于电视的前方, 则本地 摄像装置 104安装于电视的上端, 便于对位于电视前方的用户进行摄像。  Specifically, the camera device 104 can be installed at a position with a better imaging effect. For example, the user needs to watch the image of the other party's caller playing on the television, usually located in front of the television, and the local camera device 104 is installed at the upper end of the television. It is easy to take pictures of users located in front of the TV.
摄像装置 104与本地机顶盒 101的连接方式, 既可以是有线连接, 也可 以是无线连接。 摄像装置 104获取图像信息后, 将图像信息发送给本地机顶 盒 101 , 本地机顶盒 101将获取的图像信息编码为视频数据后,发送给对端机 顶盒 103。  The connection mode of the camera device 104 with the local set top box 101 can be either a wired connection or a wireless connection. After acquiring the image information, the camera device 104 transmits the image information to the local set top box 101, and the local set top box 101 encodes the acquired image information into video data, and then sends the image information to the peer set top box 103.
S207: 对端机顶盒 103在接收到本地机顶盒 101发送的视频数据, 以及 手机 102发送的音频数据后进行解码, 并通过与对端机顶盒 103连接的电视 进行视频、 音频播放。  S207: The peer set top box 103 decodes the video data sent by the local set top box 101 and the audio data sent by the mobile phone 102, and performs video and audio playback through the television connected to the peer set top box 103.
S208: 本地机顶盒 101在接收到对端机顶盒 103发送的音视频数据后, 进行解码, 并通过电视 105播放音、 视频。  S208: After receiving the audio and video data sent by the peer set top box 103, the local set top box 101 decodes and plays the audio and video through the television 105.
或者, S208步骤中本地机顶盒 101在接收到对端机顶盒 103发送的视频 数据, 以及对端手机 106发送的音频数据后进行解码, 并通过电视 105播放 音、 视频。 与上述步骤 S201- S207相类似地, 对端手机 106为与对端机顶盒 103建立了通信连接的手机,对端机顶盒 103在与本地机顶盒 101建立通话后, 也向对端手机 106发送了 startaudio消息,对端手机 106根据 startaudio消息采 集声音信息, 在将声音信息编码为音频数据后, 根据 startaudio消息中携带的 本地机顶盒 101的音频 IP地址和端口信息向本地机顶盒 101发送音频数据; 同时, 对端机顶盒 103向本地机顶盒 101发送视频数据; 本地机顶盒 101在 接收到对端机顶盒 103发送的视频数据, 以及对端手机 106发送的音频数据 后进行解码, 并通过电视 105播放音、 视频。  Alternatively, in step S208, the local set top box 101 decodes the video data sent by the peer set top box 103 and the audio data sent by the peer mobile phone 106, and plays the audio and video through the television 105. Similar to the above steps S201-S207, the peer mobile phone 106 is a mobile phone that establishes a communication connection with the peer set top box 103, and the peer set top box 103 also sends a startaudio message to the opposite mobile phone 106 after establishing a call with the local set top box 101. The peer mobile phone 106 collects the voice information according to the startaudio message, and after the voice information is encoded into the audio data, sends the audio data to the local set top box 101 according to the audio IP address and port information of the local set top box 101 carried in the startaudio message; The set top box 103 sends the video data to the local set top box 101. The local set top box 101 decodes the video data sent by the peer set top box 103 and the audio data sent by the peer mobile phone 106, and plays the audio and video through the television 105.
S209: 当通话结束时, 本地机顶盒 101将 stopaudio (停止音频通话消息) 消息发送给手机 102。 S209: When the call ends, the local set top box 101 will stopaudio (stop the audio call message) The message is sent to the handset 102.
S210: 手机 102结束通话状态, 停止采集、 获取获取声音信息。  S210: The mobile phone 102 ends the call state, stops collecting, and acquires the obtained sound information.
较佳地, 在本地机顶盒 101在与手机 102建立通信连接后, 为了确保本 地机顶盒 101与手机 102的通信连接保持正常, 本地机顶盒 101会周期性地 (例如每隔 120s )向手机 102发送 keepalive消息(保持有效消息); 手机 102 接收到 keepalive消息后, 向本地机顶盒 101返回回应消息。 本地机顶盒 101 发送 keepalive消息后,若本地机顶盒 101在设定时间段内没有接收到手机 102 返回的回应消息, 则认为手机 102状态异常, 本地机顶盒 101 自动切换其它 设备进行声音信息采集, 例如切换为默认设备, 即摄像头自带的 MIC (麦克) 采集声音信息。  Preferably, after the local set top box 101 establishes a communication connection with the mobile phone 102, in order to ensure that the communication connection between the local set top box 101 and the mobile phone 102 remains normal, the local set top box 101 periodically sends a keepalive message to the mobile phone 102 (eg, every 120 seconds). (Keep valid message); After receiving the keepalive message, the mobile phone 102 returns a response message to the local set top box 101. After the local set top box 101 sends the keepalive message, if the local set top box 101 does not receive the response message returned by the mobile phone 102 within the set time period, the mobile phone 102 is considered to be in an abnormal state, and the local set top box 101 automatically switches other devices to collect sound information, for example, The default device, the MIC (Mike) that comes with the camera, collects sound information.
或者, 本地机顶盒 101在将 startaudio消息发送给手机 102之后, 以及在 将 stopaudio消息发送给手机 102结束通话之前, 周期性地(例如每隔 120s ) 向手机 102发送 keepalive消息(保持有效消息); 若本地机顶盒 101在设定时 间段内没有接收到手机 102返回的回应消息, 则认为手机 102状态异常, 本 地机顶盒 101 自动切换其它设备进行声音信息采集, 例如切换为默认设备, 即摄像头自带的 MIC (麦克 )采集声音信息。  Alternatively, the local set top box 101 sends a keepalive message (maintaining a valid message) to the mobile phone 102 periodically (eg, every 120 seconds) after transmitting the startaudio message to the mobile phone 102, and before transmitting the stopaudio message to the mobile phone 102 to end the call; If the local set top box 101 does not receive the response message returned by the mobile phone 102 within the set time period, the mobile phone 102 is considered to be in an abnormal state, and the local set top box 101 automatically switches other devices to collect sound information, for example, switching to the default device, that is, the MIC of the camera itself. (Mike) collects sound information.
上述的本地机顶盒 101具体可以是基于 Android (安卓 ) 系统的机顶盒; 手机 102具体可以 ^^于 Android系统的手机。  The local set top box 101 may specifically be a set top box based on an Android (Android) system; the mobile phone 102 may specifically be a mobile phone of the Android system.
较佳地, 本地机顶盒 101在将 startaudio消息发送给手机 102之前, 还可 供用户选择音频采集设备; 在确定用户选择的音频采集设备为手机后, 再执 行上述步骤 S202-S208。 若用户选择的音频采集设备为麦克, 则本地机顶盒 101采用现有技术的方法进行音、 视频信息的采集, 此处不再赘述。  Preferably, the local set top box 101 can also select the audio collection device before sending the startaudio message to the mobile phone 102. After determining that the audio collection device selected by the user is the mobile phone, the above steps S202-S208 are performed. If the audio collection device selected by the user is a microphone, the local set top box 101 uses the prior art method to collect audio and video information, and details are not described herein.
本发明实施例提供的一种机顶盒和手机的内部结构, 如图 3所示。  An internal structure of a set top box and a mobile phone according to an embodiment of the present invention is shown in FIG. 3.
其中, 机顶盒包括: 手机通讯模块 301、 视频通话控制模块 302、 视频数 据编码模块 303。  The set top box includes: a mobile phone communication module 301, a video call control module 302, and a video data encoding module 303.
手机包括: 局域网设备连接模块 311、 消息解析模块 312、 音频采集发送 模块 313。  The mobile phone includes: a local area network device connection module 311, a message parsing module 312, and an audio collection and transmission module 313.
上述模块的功能描述如下:  The functions of the above modules are described as follows:
机顶盒的手机通讯模块 301 用于与手机建立通信连接; 具体地, 手机通 讯模块 301可以是在视频通话控制模块 302呼叫对端机顶盒之前, 视频通话 控制模块 302通知手机通讯模块 301与手机建立通信连接的; 或者, 手机通 讯模块 301是在视频通话控制模块 302接收到所述对端机顶盒的呼叫之前, 通知手机通讯模块 301与手机建立通信连接的。 The mobile phone communication module 301 of the set top box is used to establish a communication connection with the mobile phone. Specifically, the mobile phone communication module 301 can notify the mobile communication module 301 to establish a communication connection with the mobile phone before the video call control module 302 calls the opposite set top box. Or; the mobile phone communication module 301 is before the video call control module 302 receives the call of the opposite set top box, The mobile phone communication module 301 is notified to establish a communication connection with the mobile phone.
相应地, 手机的局域网设备连接模块 311 则与局域网内的设备建立通信 连接, 即局域网设备连接模块 311 响应局域网内的机顶盒的通信连接请求, 并与之建立通信连接。  Correspondingly, the local area network device connection module 311 of the mobile phone establishes a communication connection with the device in the local area network, that is, the local area network device connection module 311 responds to the communication connection request of the set top box in the local area network, and establishes a communication connection with the local area network device connection module 311.
机顶盒的手机通讯模块 301 在与手机建立通信连接后, 则可以向该手机 发送消息。  The set-top box mobile communication module 301 can send a message to the mobile phone after establishing a communication connection with the mobile phone.
手机的局域网设备连接模块 311 在与机顶盒建立通信连接后, 则可以接 收机顶盒(即局域网内建立了通信连接的设备)发送的消息。  The LAN device connection module of the mobile phone 311 can establish a communication connection with the set-top box, and then can receive the message sent by the set-top box (that is, the device that establishes the communication connection in the local area network).
机顶盒的视频通话控制模块 302用于与对端机顶盒建立通话, 并在通话 建立后向手机通讯模块 301发送通话开始通知;  The video call control module 302 of the set top box is configured to establish a call with the peer set top box, and send a call start notification to the mobile phone communication module 301 after the call is established;
手机通讯模块 301 在接收到所述通话开始通知后, 向建立了通信连接的 手机发送 startaudio消息; 所述 startaudio消息中携带有所述对端机顶盒的 IP 地址和端口信息;  After receiving the call start notification, the mobile phone communication module 301 sends a startaudio message to the mobile phone that establishes the communication connection; the startaudio message carries the IP address and port information of the peer set top box;
手机的局域网设备连接模块 311 在接收到机顶盒发送的消息后, 将消息 发送给消息解析模块 312进行解析。  After receiving the message sent by the set top box, the local area network device connection module 311 of the mobile phone sends a message to the message parsing module 312 for parsing.
手机的消息解析模块 312对局域网设备连接模块 311接收的消息进行解 析; 若解析出该消息为 startaudio消息, 则向音频采集发送模块 313发送音频 采集通知。  The message parsing module 312 of the mobile phone parses the message received by the local area network device connection module 311. If the message is parsed as a startaudio message, the audio collection and transmission module 313 sends an audio collection notification.
手机的音频采集发送模块 313在接收到消息解析模块 312发送的音频采 集通知后, 开始采集声音信息; 并将声音信息编码为音频数据后, 根据所述 startaudio消息中携带的音频 IP地址和端口信息将所述音频数据进行发送。  After receiving the audio collection notification sent by the message parsing module 312, the audio collection and transmission module 313 of the mobile phone starts to collect the audio information; and after encoding the audio information into the audio data, according to the audio IP address and port information carried in the startaudio message. The audio data is transmitted.
在手机的音频采集发送模块 313采集声音信息的同时, 机顶盒的视频数 据编码模块 303获取与所述机顶盒连接的摄像装置采集的图像信息, 并将获 取的图像信息编码为视频数据;  While the audio collection and transmission module 313 of the mobile phone collects the sound information, the video data encoding module 303 of the set top box acquires the image information collected by the camera device connected to the set top box, and encodes the obtained image information into video data;
机顶盒的视频通话控制模块 302将视频数据编码模块 303编码的视频数 据发送给所述对端机顶盒。  The video call control module 302 of the set top box sends the video data encoded by the video data encoding module 303 to the peer set top box.
进一步, 机顶盒的视频通话控制模块 302在与所述对端机顶盒通话结束 时, 还向手机通讯模块 301发送通话结束通知; 以及  Further, the video call control module 302 of the set top box also sends a call end notification to the mobile phone communication module 301 when the call with the opposite set top box ends;
手机通讯模块 301 在接收到所述通话结束通知后, 向所述手机发送 startaudio消息。  After receiving the call end notification, the mobile phone communication module 301 sends a startaudio message to the mobile phone.
手机的局域网设备连接模块 311 在接收到机顶盒发送的消息后, 将消息 发送给消息解析模块 312进行解析。 手机的消息解析模块 312若解析出局域网设备连接模块 311接收的消息 为 stopaudio消息, 则向音频采集发送模块 313发送停止音频采集通知; 以及 音频采集发送模块 313根据所述停止音频采集通知, 停止采集声音信息。 进一步, 机顶盒的视频通话控制模块 302在手机通讯模块 301与手机建 立通信连接后, 周期性向手机通讯模块 301 发送检测连接有效的通知; 手机 通讯模块 301根据接收的检测连接有效的通知向手机发送 keepalive消息; 若手机的消息解析模块 312对局域网设备连接模块 311接收的消息进行 解析; 若解析出该消息为 keepalive消息, 则通过局域网设备连接模块 311返 回回应消息; After receiving the message sent by the set top box, the local area network device connection module 311 of the mobile phone sends a message to the message parsing module 312 for parsing. If the message parsing module 312 of the mobile phone parses the message received by the local area network device connection module 311 as a stopaudio message, sends a stop audio collection notification to the audio collection and transmission module 313; and the audio collection and transmission module 313 stops collecting according to the stop audio collection notification. Sound information. Further, after the mobile phone communication module 301 establishes a communication connection with the mobile phone, the video call control module 302 of the set top box periodically sends a notification to the mobile phone communication module 301 to detect that the connection is valid; the mobile phone communication module 301 sends a keepalive to the mobile phone according to the received notification that the connection is valid. If the message parsing module 312 of the mobile phone parses the message received by the local area network device connection module 311, if the message is a keepalive message, the local area network device connection module 311 returns a response message;
若手机通讯模块 301在发送 keepalive消息后, 设定时间段内没有接收到 所述手机返回的回应消息, 则向视频通话控制模块 302返回连接无效的通知; 视频通话控制模块 302在接收到连接无效的通知后, 切换其它设备进行声音 信息采集。  If the mobile phone communication module 301 does not receive the response message returned by the mobile phone within the set time period after transmitting the keepalive message, the mobile phone control module 302 returns a notification that the connection is invalid; the video call control module 302 receives the connection invalid. After the notification, switch other devices to collect sound information.
进一步, 机顶盒的视频通话控制模块 302还可以接收对端机顶盒发送的 视频数据, 以及对端手机发送的音频数据; 其中, 所述对端手机为与所述对 端机顶盒建立了通信连接的手机; 以及  Further, the video call control module 302 of the set top box can also receive the video data sent by the peer set top box and the audio data sent by the peer mobile phone; wherein the peer mobile phone is a mobile phone that establishes a communication connection with the peer set top box; as well as
所述机顶盒还包括: 解码模块 304。  The set top box further includes: a decoding module 304.
解码模块 304, 用于将所述视频通话控制模块 302接收的音、视频数据解 码后发送给与所述机顶盒连接的电视进行播放。  The decoding module 304 is configured to decode the audio and video data received by the video call control module 302 and send the data to the television connected to the set top box for playing.
本发明实施例的视频通话系统中的机顶盒由于可以向手机发送消息, 通 知手机采集声音信息, 通过手机在获取声音信息后发送给对端的机顶盒; 并 且通过摄像装置获取图像信息后发送给对端机顶盒。 这样, 用户可以离摄像 装置有一定的距离, 便于摄像装置的较好摄像, 得到质量较佳的视频图像; 而手机可以放于用户的近距离处, 获得较好的声音质量; 从而达到兼顾音视 频质量的目的。 而且, 用户可以位于电视前, 将手机放于身边或持于耳边(或 者通过手机的耳机)进行视频通话, 也非常符合用户平时的习惯。  The set-top box in the video call system of the embodiment of the present invention can send a message to the mobile phone, notify the mobile phone to collect the sound information, and send the sound information to the set-top box of the opposite end through the mobile phone; and acquire the image information by the camera device and send the image information to the peer set-top box. . In this way, the user can have a certain distance from the camera device, which is convenient for the camera device to obtain better quality video images; and the mobile phone can be placed at a close distance of the user to obtain better sound quality; The purpose of video quality. Moreover, the user can be in front of the TV, put the phone around or hold the ear (or through the phone's earphones) for video calls, and it is also in line with the user's usual habits.
本领域普通技术人员可以理解实现上述实施例方法中的全部或部分步骤 是可以通过程序来指令相关的硬件来完成, 该程序可以存储于一计算机可读 取存储介质中, 如: ROM/RAM、 磁碟、 光盘等。  A person skilled in the art can understand that all or part of the steps of implementing the above embodiments can be completed by a program to instruct related hardware, and the program can be stored in a computer readable storage medium, such as: ROM/RAM, Disk, CD, etc.
以上所述仅是本发明的优选实施方式, 应当指出, 对于本技术领域的普 通技术人员来说, 在不脱离本发明原理的前提下, 还可以作出若干改进和润 饰, 这些改进和润饰也应视为本发明的保护范围。  The above description is only a preferred embodiment of the present invention, and it should be noted that those skilled in the art can also make several improvements and retouchings without departing from the principles of the present invention. It is considered as the scope of protection of the present invention.

Claims

权 利 要 求 书 Claim
1. 一种基于机顶盒的视频通话方法, 包括:  1. A video call method based on a set top box, comprising:
本地机顶盒在与对端机顶盒建立通话后, 将 startaudio消息发送给与之建 立了通信连接的手机;所述 startaudio消息中携带有所述对端机顶盒的 IP地址 和端口信息;  After the local set-top box establishes a call with the peer set-top box, the startaudio message is sent to the mobile phone with which the communication connection is established; the startaudio message carries the IP address and port information of the peer set-top box;
所述手机在接收到所述 startaudio消息后, 开始采集声音信息; 并将声音 信息编码为音频数据后, 根据所述对端机顶盒的音频 IP地址和端口信息将所 述音频数据发送给所述对端机顶盒; 并且,  After receiving the startaudio message, the mobile phone starts to collect sound information; after encoding the sound information into audio data, the audio data is sent to the pair according to the audio IP address and port information of the peer set top box. End set box; and,
所述本地机顶盒获取与之连接的摄像装置采集的图像信息后, 将获取的 图像信息编码为视频数据后发送给所述对端机顶盒;  After acquiring the image information collected by the camera device connected thereto, the local set top box encodes the acquired image information into video data and sends the image information to the peer set top box;
所述对端机顶盒接收到所述视频数据和音频数据后解码, 并通过与之连 接的电视播放解码后的音、 视频。  The peer set top box decodes the video data and the audio data, and plays the decoded sound and video through the connected TV.
2. 如权利要求 1所述的方法, 还包括: 2. The method of claim 1 further comprising:
在本地机顶盒与对端机顶盒通话结束时, 所述本地机顶盒向所述手机发 送 stopaudio消息;  When the local set top box and the opposite set top box end the call, the local set top box sends a stopaudio message to the mobile phone;
所述手机根据接收的 stopaudio消息停止采集声音信息。  The mobile phone stops collecting sound information according to the received stopaudio message.
3. 如权利要求 1或 2所述的方法, 在所述本地机顶盒将 startaudio消息 发送给与之建立了通信连接的手机之前, 还包括: 3. The method according to claim 1 or 2, before the local set top box sends a startaudio message to the mobile phone with which the communication connection is established, the method further includes:
所述本地机顶盒在呼叫对端机顶盒之前, 与所述手机建立通信连接; 或者, 所述本地机顶盒在接收到所述对端机顶盒的呼叫后, 并在响应所 述对端机顶盒的呼叫之前, 与所述手机建立通信连接;  The local set top box establishes a communication connection with the mobile phone before calling the peer set top box; or the local set top box receives the call of the opposite set top box, and before responding to the call of the opposite set top box, The mobile phone establishes a communication connection;
或者, 所述本地机顶盒在接收到所述对端机顶盒的呼叫之前, 与所述手 机建立通信连接。  Alternatively, the local set top box establishes a communication connection with the mobile phone before receiving the call of the opposite set top box.
4. 如权利要求 3所述的方法, 在所述本地机顶盒与所述手机建立通信连 接后, 还包括: 4. The method of claim 3, after the local set top box establishes a communication connection with the mobile phone, further comprising:
所述本地机顶盒周期性地向所述手机发送 keepalive消息;  The local set top box periodically sends a keepalive message to the mobile phone;
在所述本地机顶盒发送 keepalive消息后, 若设定时间段内没有接收到所 述手机返回的回应消息, 则切换其它设备进行声音信息采集。 After the local set top box sends the keepalive message, if the response message returned by the mobile phone is not received within the set time period, the other device is switched to perform sound information collection.
5. 如权利要求 4所述的方法, 其中, 所述本地机顶盒与所述手机建立通 信连接具体为: The method of claim 4, wherein the local set top box establishes a communication connection with the mobile phone, specifically:
所述本地机顶盒通过发送广播消息扫描同一局域网内的手机; 若扫描到 一个手机, 则与之建立通信连接; 若扫描到多个手机, 则提示用户进行选择, 并与用户选择的手机建立通信连接;  The local set top box scans the mobile phone in the same local area network by sending a broadcast message; if scanning a mobile phone, establishing a communication connection with the mobile phone; if scanning a plurality of mobile phones, prompting the user to make a selection, and establishing a communication connection with the mobile phone selected by the user ;
或者, 所述本地机顶盒根据用户指定的 IP地址与手机建立通信连接。  Alternatively, the local set top box establishes a communication connection with the mobile phone according to the IP address specified by the user.
6. 如权利要求 4所述的方法, 还包括: 6. The method of claim 4, further comprising:
所述对端机顶盒向所述本地机顶盒发送视频数据; 并  The peer set top box sends video data to the local set top box; and
对端手机向所述本地机顶盒发送音频数据; 所述对端手机为与所述对端 机顶盒建立了通信连接的手机;  The peer mobile phone sends audio data to the local set top box; the peer mobile phone is a mobile phone that establishes a communication connection with the peer set top box;
所述本地机顶盒在接收到所述对端机顶盒发送的视频数据, 以及所述对 端手机发送的音频数据后进行解码, 并通过与之连接的电视播放音、 视频。  The local set top box decodes the video data sent by the peer set top box and the audio data sent by the peer mobile phone, and plays the sound and video through the television connected thereto.
7. 如权利要求 4所述的方法, 在所述本地机顶盒将 startaudio消息发送 给与之建立了通信连接的手机之前, 还包括: 7. The method according to claim 4, before the local set top box sends a startaudio message to the mobile phone with which the communication connection is established, the method further includes:
所述本地机顶盒确定用户选择的音频采集设备为手机。  The local set top box determines that the audio collection device selected by the user is a mobile phone.
8. 一种基于机顶盒的视频通话系统, 包括: 本地机顶盒, 以及与所述本 地机顶盒建立了通信连接的手机; 8. A set-top box based video calling system, comprising: a local set top box; and a handset that establishes a communication connection with the local set top box;
所述本地机顶盒用于在与对端机顶盒建立通话后, 将 startaudio消息发送 给所述手机; 并获取与之连接的摄像装置采集的图像信息后, 将获取的图像 信息编码为视频数据后发送给所述对端机顶盒; 其中, 所述 startaudio消息中 携带有所述对端机顶盒的 IP地址和端口信息;  The local set top box is configured to send a startaudio message to the mobile phone after establishing a call with the peer set top box; and obtain image information collected by the camera device connected thereto, and then encode the acquired image information into video data and send the image information to The peer-end set-top box; wherein the startaudio message carries the IP address and port information of the peer set-top box;
所述手机用于在接收到所述 startaudio消息后, 开始采集声音信息; 并将 声音信息编码为音频数据后, 根据所述对端机顶盒的音频 IP地址和端口信息 将所述音频数据发送给所述对端机顶盒。  The mobile phone is configured to start collecting sound information after receiving the startaudio message; and after encoding the sound information into audio data, send the audio data to the office according to the audio IP address and port information of the peer set top box. Said the opposite set-top box.
9. 如权利要求 8所述的系统, 其特征在于, 送 stopaudio消息; 以及 9. The system of claim 8 wherein: sending a stopaudio message;
所述手机还用于根据接收的 stopaudio消息停止采集声音信息。 The mobile phone is further configured to stop collecting sound information according to the received stopaudio message.
10. 如权利要求 8或 9所述的系统, 其特征在于, 10. The system of claim 8 or 9, wherein
所述本地机顶盒还用于在呼叫对端机顶盒之前, 与所述手机建立通信连 接;  The local set top box is further configured to establish a communication connection with the mobile phone before calling the peer set top box;
或者, 所述本地机顶盒还用于在接收到所述对端机顶盒的呼叫后, 并在 响应所述对端机顶盒的呼叫之前, 与所述手机建立通信连接;  Or the local set top box is further configured to establish a communication connection with the mobile phone after receiving the call of the opposite set top box and before responding to the call of the opposite set top box;
或者, 所述本地机顶盒还用于在接收到所述对端机顶盒的呼叫之前, 与 所述手机建立通信连接。  Alternatively, the local set top box is further configured to establish a communication connection with the mobile phone before receiving the call of the peer set top box.
11. 如权利要求 10所述的系统, 其特征在于, 11. The system of claim 10, wherein
所述本地机顶盒还用于在与所述手机建立通信连接后, 周期性地向所述 手机发送 keepalive消息; 若设定时间段内没有接收到所述手机返回的回应消 息, 则切换其它设备进行声音信息采集。  The local set top box is further configured to periodically send a keepalive message to the mobile phone after establishing a communication connection with the mobile phone; if the response message returned by the mobile phone is not received within a set time period, switch the other device to perform Sound information collection.
12. 如权利要求 11所述的系统, 其特征在于, 12. The system of claim 11 wherein:
所述本地机顶盒还用于在接收到所述对端机顶盒发送的视频数据, 以及 对端手机发送的音频数据后进行解码, 并通过与之连接的电视播放音、 视频; 其中, 所述对端手机为与所述对端机顶盒建立了通信连接的手机。  The local set top box is further configured to: after receiving the video data sent by the peer set top box and the audio data sent by the peer mobile phone, decoding, and playing the sound and video through the television connected thereto; wherein, the opposite end The mobile phone is a mobile phone that establishes a communication connection with the peer set top box.
13. 如权利要求 11所述的系统, 其特征在于, 13. The system of claim 11 wherein:
所述本地机顶盒还用于在将 startaudio消息发送给与之建立了通信连接的 手机之前, 确定用户选择的音频采集设备为手机。  The local set top box is further configured to determine that the audio collection device selected by the user is a mobile phone before sending the startaudio message to the mobile phone with which the communication connection is established.
14. 一种机顶盒, 包括: 14. A set top box comprising:
手机通讯模块, 用于与手机建立通信连接;  a mobile communication module for establishing a communication connection with the mobile phone;
视频通话控制模块, 用于在与对端机顶盒建立通话后, 向所述手机通讯 模块发送通话开始通知;  a video call control module, configured to send a call start notification to the mobile phone communication module after establishing a call with the peer set top box;
所述手机通讯模块还用于在接收到所述通话开始通知后, 向建立了通信 连接的手机发送 startaudio消息;所述 startaudio消息中携带有所述对端机顶盒 的 IP地址和端口信息;  The mobile phone communication module is further configured to: after receiving the call start notification, send a startaudio message to the mobile phone that establishes the communication connection; the startaudio message carries the IP address and port information of the peer set top box;
视频数据编码模块, 用于获取与所述机顶盒连接的摄像装置采集的图像 信息后, 将获取的图像信息编码为视频数据; 所述视频通话控制模块还用于将所述视频数据编码模块编码的视频数据 发送给所述对端机顶盒。 a video data encoding module, configured to acquire image information collected by the camera device connected to the set top box, and encode the acquired image information into video data; The video call control module is further configured to send the video data encoded by the video data encoding module to the peer set top box.
15. 如权利要求 14所述的机顶盒, 其特征在于, 15. The set top box of claim 14 wherein:
所述视频通话控制模块还用于在与所述对端机顶盒通话结束时, 向所述 手机通讯模块发送通话结束通知; 以及  The video call control module is further configured to send a call end notification to the mobile phone communication module when the call with the peer set top box ends;
所述手机通讯模块还用于在接收到所述通话结束通知后, 向所述手机发 送 stopaudio消息。  The mobile phone communication module is further configured to send a stopaudio message to the mobile phone after receiving the call end notification.
16. 如权利要求 14或 15所述的机顶盒, 其特征在于, 所述手机通讯模 块与手机建立通信连接具体为: The set top box according to claim 14 or 15, wherein the mobile communication module establishes a communication connection with the mobile phone:
所述手机通讯模块在所述视频通话控制模块呼叫对端机顶盒之前, 与所 述手机建立通信连接;  The mobile phone communication module establishes a communication connection with the mobile phone before the video call control module calls the peer set top box;
或者, 所述手机通讯模块在所述视频通话控制模块接收到所述对端机顶 盒的呼叫之前, 与所述手机建立通信连接。  Alternatively, the mobile phone communication module establishes a communication connection with the mobile phone before the video call control module receives the call of the opposite set top box.
17. 如权利要求 16所述的机顶盒, 其特征在于, 17. The set top box of claim 16 wherein:
所述视频通话控制模块还用于在所述手机通讯模块与所述手机建立通信 连接后, 周期性向所述手机通讯模块发送检测连接有效的通知; 以及  The video call control module is further configured to periodically send a notification to the mobile phone communication module to detect that the connection is valid after the mobile phone communication module establishes a communication connection with the mobile phone;
所述手机通讯模块根据接收的检测连接有效的通知, 向所述手机发送 keepalive消息; 若在发送 keepalive消息后, 设定时间段内没有接收到所述手 机返回的回应消息, 则所述手机通讯模块向所述视频通话控制模块返回连接 无效的通知;  The mobile phone communication module sends a keepalive message to the mobile phone according to the received notification that the connection is valid; if the response message returned by the mobile phone is not received within the set time period after the keepalive message is sent, the mobile phone communication The module returns a notification that the connection is invalid to the video call control module;
所述视频通话控制模块还用于在接收到连接无效的通知后, 切换其它设 备进行声音信息采集。  The video call control module is further configured to switch other devices to collect sound information after receiving the notification that the connection is invalid.
18. 如权利要求 17所述的机顶盒, 其特征在于, 18. The set top box of claim 17 wherein:
所述视频通话控制模块还用于接收到所述对端机顶盒发送的视频数据, 以及对端手机发送的音频数据; 其中, 所述对端手机为与所述对端机顶盒建 立了通信连接的手机; 以及  The video call control module is further configured to receive the video data sent by the peer set top box and the audio data sent by the peer mobile phone; wherein the peer mobile phone is a mobile phone that establishes a communication connection with the peer set top box ; as well as
所述机顶盒还包括:  The set top box further includes:
解码模块, 用于将所述视频通话控制模块接收的音、 视频数据解码后发 送给与所述机顶盒连接的电视进行播放。 a decoding module, configured to decode the audio and video data received by the video call control module Send to the TV connected to the set top box for playback.
19. 如权利要求 17所述的机顶盒, 其特征在于, 还包括:19. The set top box of claim 17, further comprising:
Figure imgf000018_0001
用于为用户提供可选择的音频采集设备, 并确 定用户所选择的音频采集设备; 以及
Figure imgf000018_0001
Used to provide a user with a selectable audio capture device and to determine the audio capture device selected by the user;
所述视频通话控制模块还用于在向所述手机通讯模块发送通话开始通知 之前, 确定所选择的音频采集设备为手机。  The video call control module is further configured to determine that the selected audio collection device is a mobile phone before sending the call start notification to the mobile phone communication module.
20. 一种手机, 包括: 20. A mobile phone, comprising:
局域网设备连接模块, 用于与局域网内的设备建立通信连接, 并接收所 述局域网内建立了通信连接的设备发送的消息;  a local area network device connection module, configured to establish a communication connection with a device in the local area network, and receive a message sent by the device that establishes a communication connection in the local area network;
消息解析模块, 用于对所述局域网设备连接模块接收的消息进行解析; 若解析出该消息为 startaudio消息, 则发送音频采集通知;  a message parsing module, configured to parse a message received by the local area network device connection module; if the message is a startaudio message, send an audio collection notification;
音频采集发送模块, 用于在接收到所述消息解析模块发送的音频采集通 知后, 开始采集声音信息; 并将声音信息编码为音频数据后, 根据所述 startaudio消息中携带的音频 IP地址和端口信息将所述音频数据进行发送。  An audio collection and transmission module, configured to start collecting sound information after receiving the audio collection notification sent by the message parsing module; and encoding the audio information into audio data, according to the audio IP address and port carried in the startaudio message The information transmits the audio data.
21. 如权利要求 20所述的手机, 其特征在于, 21. The handset of claim 20, wherein
所述消息解析模块还用于若解析出接收的消息为 stopaudio消息, 则向所 述音频采集发送模块发送停止音频采集通知; 以及  The message parsing module is further configured to: if the received message is parsed as a stopaudio message, send a stop audio collection notification to the audio collection and transmission module;
所述音频采集发送模块还用于根据所述停止音频采集通知, 停止采集声 音信息。  The audio collection and transmission module is further configured to stop collecting sound information according to the stopping the audio collection notification.
22. 如权利要求 21所述的手机, 其特征在于, 22. The handset of claim 21, wherein
所述消息解析模块还用于若解析出接收的消息为 keepalive消息, 则通过 所述局域网设备连接模块返回回应消息。  The message parsing module is further configured to: if the received message is parsed as a keepalive message, return a response message by using the local area network device connection module.
PCT/CN2012/080298 2012-08-08 2012-08-17 Set top box based video conversation method and system WO2014023042A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN201210280659.8 2012-08-08
CN201210280659.8A CN102857729B (en) 2012-08-08 2012-08-08 Set top box based video conversation method and system

Publications (1)

Publication Number Publication Date
WO2014023042A1 true WO2014023042A1 (en) 2014-02-13

Family

ID=47403872

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2012/080298 WO2014023042A1 (en) 2012-08-08 2012-08-17 Set top box based video conversation method and system

Country Status (2)

Country Link
CN (1) CN102857729B (en)
WO (1) WO2014023042A1 (en)

Families Citing this family (14)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103595949B (en) * 2013-11-07 2018-06-08 深圳创维数字技术有限公司 A kind of forwarding method of video calling, terminal and system
BR102014011263B1 (en) * 2014-05-09 2019-07-02 Tqtvd Software Ltda METHOD FOR ENCLOSURING AUDIOVISUAL CONTENT STREAMS IN MPEG2-PRIVATE-SECTIONS, DEVICE FOR ENCLOSING AUDIOVISUAL CONTENT IN MPEG2-TRANSPORT-STREAM, AUDIO / AUDIO COMMUNICATION PROTOCOL DATA FOR USER DEVICES WITHOUT RESOURCES TO TUNE A DIGITAL TV SIGNAL BROADCAST THROUGH A DIGITAL TV SIGNAL BROADCAST
CN104539870A (en) * 2014-12-19 2015-04-22 华为软件技术有限公司 Video call device and method
CN106162368A (en) * 2015-04-24 2016-11-23 中兴通讯股份有限公司 A kind of fusion device supporting mediaphone and communication means, subscriber equipment
CN104954724B (en) * 2015-05-20 2018-01-23 南京创维信息技术研究院有限公司 A kind of video call switching method, Intelligent television terminal, mobile terminal and system
CN106331568B (en) * 2015-07-03 2019-11-15 华平智慧信息技术(深圳)有限公司 A kind of instant communication method, system and mobile terminal
CN106412648A (en) * 2015-07-31 2017-02-15 腾讯科技(深圳)有限公司 Video interaction method and device
CN105120368A (en) * 2015-08-26 2015-12-02 无锡华海天和信息科技有限公司 Network video telephone system and realization method thereof capable of reminding incoming call notification
CN105120199B (en) * 2015-08-26 2019-01-29 江苏金中微智慧科技有限公司 The implementation method of acoustic processing in a kind of video calling
CN105338311A (en) * 2015-10-12 2016-02-17 北京奇虎科技有限公司 Internet protocol camera, data transmission method thereof and system
CN105744351A (en) * 2016-02-15 2016-07-06 四川长虹电器股份有限公司 Realization method of virtual microphone of Android smart television
CN105847736A (en) * 2016-04-05 2016-08-10 上海斐讯数据通信技术有限公司 Video conversation system and video conversation method
CN108123927A (en) * 2016-11-30 2018-06-05 中兴通讯股份有限公司 A kind of CDN network communication means, apparatus and system
CN108307137A (en) * 2017-12-20 2018-07-20 江苏省公用信息有限公司 A method of mobile phone is optimized into video calling sound quality as IPTV set top box source of sound input equipment

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101635820A (en) * 2004-08-18 2010-01-27 华为技术有限公司 Set-top box system with multimedia communication function
CN102387335A (en) * 2011-11-18 2012-03-21 康佳集团股份有限公司 Method and system based on mobile phone realizing video call through set-top box

Family Cites Families (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7627313B2 (en) * 2003-12-22 2009-12-01 Gigaset Communications Gmbh Method, telecommunication system and telecommunication handset for wireless communication and telecommunication in a smart home environment

Patent Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101635820A (en) * 2004-08-18 2010-01-27 华为技术有限公司 Set-top box system with multimedia communication function
CN102387335A (en) * 2011-11-18 2012-03-21 康佳集团股份有限公司 Method and system based on mobile phone realizing video call through set-top box

Also Published As

Publication number Publication date
CN102857729B (en) 2015-07-15
CN102857729A (en) 2013-01-02

Similar Documents

Publication Publication Date Title
WO2014023042A1 (en) Set top box based video conversation method and system
US8854414B2 (en) Method, application server and system for privacy protection in video call
JP2005033664A (en) Communication device and its operation control method
US8274545B2 (en) Apparatus and method for casting video data and audio data to web during video telephony in mobile communication terminal
WO2012079510A1 (en) Mute indication method and device applied to video conferencing
TWI451746B (en) Video conference system and video conference method thereof
WO2012022093A1 (en) Method and wireless communication terminal for displaying calling video
KR101701742B1 (en) Apparatus and method for live streaming between mobile communication terminals
WO2007115462A1 (en) Method for realizing remote monitoring service and video terminal device
WO2012055317A1 (en) Method and device for displaying information
CN103327380A (en) Set top box and method for achieving conversation on set top box
CN108322429B (en) Recording control method in real-time communication, real-time communication system and communication terminal
JP2006140973A (en) Home gateway, two-way video communication apparatus, and two-way video communication system
JP4939095B2 (en) Content providing system and content switching method
JP5010748B1 (en) Video display device, video processing method, and video display system
US20060135151A1 (en) Cordless IP telephone
WO2004077829A1 (en) Video conference system for mobile communication
KR100812429B1 (en) Network system and data distribution service providing method
TWI468013B (en) Video conference system and method
JP2008060752A (en) Calling method of communication terminal
KR100872076B1 (en) Method for providing substitute image service during video telephony, system and mobile communication teminal
JP3334253B2 (en) Video communication device
CN201323604Y (en) Networking telephone number digital photo frame system
JPH1132315A (en) Method for communicating video and voice and system therefor and storage medium storing video and voice communication program
CN113810331A (en) Terminal control method, system, control device and storage medium

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 12882578

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 12882578

Country of ref document: EP

Kind code of ref document: A1