WO2015131520A1 - Method and device for displaying layout in telepresence conferencing system

Info

Publication number
WO2015131520A1
Authority
WO
WIPO (PCT)
Prior art keywords
telepresence
terminal
telepresence terminal
seat
agent
Prior art date
Application number
PCT/CN2014/087606
Other languages
French (fr)
Chinese (zh)
Inventor
马铮
Original Assignee
中兴通讯股份有限公司
Priority date
Filing date
Publication date
Application filed by 中兴通讯股份有限公司
Publication of WO2015131520A1

Classifications

    • H: ELECTRICITY
    • H04: ELECTRIC COMMUNICATION TECHNIQUE
    • H04N: PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N7/00: Television systems
    • H04N7/14: Systems for two-way working
    • H04N7/15: Conference systems
    • H04N7/152: Multipoint control units therefor

Definitions

  • The present invention relates to the field of multimedia telepresence communications, and in particular to a method and apparatus for laying out the video displayed at a terminal in a telepresence system.
  • Telepresence is a teleconferencing technology that merges video communication with the experience of in-person communication. It is characterized by life-size images, ultra-high definition and low latency, and it aims to reproduce real face-to-face communication. Its implementation involves the network, communications, the conference environment, functional applications and other aspects, and what is finally presented to the conference participants is an integrated, realistic communication experience combined with transactional applications.
  • In the conventional remote video layout, the speaking terminal is usually shown as a large picture in the remote video, and the video of the other two participating terminals is shown in small pictures on one of the seat screens.
  • FIG. 1 is a diagram of the conventional layout display in a telepresence conference system according to an embodiment of the present invention.
  • FIG. 1 shows the remote video layout of one telepresence terminal: the large picture in the remote video is the video of the speaking terminal, and the small pictures on the left and right screens are the videos of the other two telepresence terminals.
  • Therefore, in a multipoint telepresence conference, some telepresence terminals (assuming here that all telepresence terminals are three-seat, that is, three-screen, terminals) do not have participants seated at every seat.
  • When such a telepresence terminal serves as the remote video source for other telepresence terminals, some of its seats appear empty at the far end. As a result, some seat positions in the remote video are empty, while the participants of other terminals have no spare position in which to be displayed.
  • As can be seen, in the prior art telepresence readily provides a realistic communication experience in point-to-point communication. In multipoint communication, however, how to show the images of as many participants as possible across the multiple display screens, while preserving the experience of real face-to-face communication as far as possible, is a key issue for improving the user experience.
  • According to an embodiment of the present invention, a method for displaying a layout in a telepresence conference system is provided, including:
  • the multipoint processing unit receives, from each telepresence terminal, local video layout information containing each seat image;
  • the multipoint processing unit analyzes the local video layout information of each telepresence terminal to obtain seating information for the seats of each telepresence terminal;
  • based on the seating information of each telepresence terminal, the multipoint processing unit determines whether each telepresence terminal has exactly one occupied seat;
  • when the multipoint processing unit determines that each telepresence terminal has exactly one occupied seat, it sends the occupied-seat images of the telepresence terminals to the corresponding telepresence terminals.
  • Preferably, face recognition is performed separately on each seat image in the local video layout information of each telepresence terminal;
  • Preferably, the occupied-seat images are extracted from the local video layout information of each telepresence terminal;
  • the remote video layout information to be sent to any given telepresence terminal includes the seat images of the telepresence terminals other than that terminal.
  • Preferably, each seat image in the remote video layout information is formed into a corresponding video code stream that contains a display position identifier, and is then sent to the corresponding telepresence terminal.
  • Preferably, the method further includes: each telepresence terminal displays images according to its corresponding video code streams.
  • Preferably, each telepresence terminal displays the seat image full-screen on the corresponding display screen according to the display position identifier in the corresponding video code stream.
  • According to another embodiment of the present invention, an apparatus for displaying a layout in a telepresence conference system is provided, including:
  • a receiving module, located in the multipoint processing unit and configured to receive, from each telepresence terminal, local video layout information containing each seat image;
  • an analysis module, located in the multipoint processing unit and configured to analyze the local video layout information of each telepresence terminal to obtain seating information for the seats of each telepresence terminal;
  • a judging module, located in the multipoint processing unit and configured to determine, based on the seating information of each telepresence terminal, whether each telepresence terminal has exactly one occupied seat;
  • a sending module, located in the multipoint processing unit and configured to send the occupied-seat images of the telepresence terminals to the corresponding telepresence terminals when it is determined that each telepresence terminal has exactly one occupied seat.
  • Preferably, the analysis module further includes:
  • an identification sub-module, configured to perform face recognition separately on each seat image in the local video layout information of each telepresence terminal;
  • a determining sub-module, configured to obtain, from the face or no-face recognition result for each seat image, seating information indicating whether each seat of each telepresence terminal is occupied.
  • Preferably, the sending module further includes:
  • an extraction sub-module, configured to extract the occupied-seat images from the local video layout information of each telepresence terminal;
  • a combination sub-module, configured to generate the remote video layout information to be sent to each telepresence terminal by combining the extracted seat images separately for each terminal.
  • Preferably, the sending module further includes:
  • a code stream sub-module, configured to form each seat image in the remote video layout information into a corresponding video code stream that contains a display position identifier, and then send it to the corresponding telepresence terminal.
  • Compared with the prior art, the present invention has the following beneficial effect: in a specific scenario, the remote video layout display method gives a reasonable layout display of the participants of all telepresence terminals other than the local telepresence terminal, enhancing the face-to-face sensory experience between each party's participants and the other participants.
  • FIG. 1 is a conventional layout display diagram of displaying a layout in a telepresence conference system according to an embodiment of the present invention
  • FIG. 2 is a flowchart of a method for displaying a layout in a telepresence conference system according to an embodiment of the present invention
  • FIG. 3 is a structural diagram of an apparatus for displaying a layout in a telepresence conference system according to an embodiment of the present invention
  • FIG. 4 is a four-party conference scene diagram for displaying a layout in a telepresence conference system according to an embodiment of the present invention
  • FIG. 5 is a video layout diagram of a first telepresence terminal TerA that displays a layout in a telepresence conference system according to an embodiment of the present invention
  • FIG. 6 is a video layout diagram of a second telepresence terminal TerB that displays a layout in a telepresence conference system according to an embodiment of the present invention
  • FIG. 7 is a video layout diagram of a third telepresence terminal TerC that displays a layout in a telepresence conference system according to an embodiment of the present invention
  • FIG. 8 is a video layout diagram of a fourth telepresence terminal TerD that displays a layout in a telepresence conference system according to an embodiment of the present invention
  • FIG. 9 is a remote video layout diagram seen by a first telepresence terminal TerA that displays a layout in a telepresence conference system according to an embodiment of the present invention
  • FIG. 10 is a remote video layout diagram seen by a second telepresence terminal TerB that displays a layout in a telepresence conference system according to an embodiment of the present invention
  • FIG. 11 is a remote video layout diagram seen by a third telepresence terminal TerC that displays a layout in a telepresence conference system according to an embodiment of the present invention
  • FIG. 12 is a remote video layout diagram seen by a fourth telepresence terminal TerD that displays a layout in a telepresence conference system according to an embodiment of the present invention
  • FIG. 13 is a flowchart of processing a telepresence terminal that displays a layout in a telepresence conference system according to an embodiment of the present invention
  • FIG. 14 is a flowchart of processing of an MCU displaying a layout in a telepresence conference system according to an embodiment of the present invention.
  • FIG. 2 is a flowchart of a method for displaying a layout in a telepresence conference system according to an embodiment of the present invention.
  • As shown in FIG. 2, the application scenario is a four-party telepresence conference in which all four participating telepresence terminals are three-screen telepresence terminals and each party's participants are seated together at a single seat. The steps are as follows:
  • Step S1: The multipoint processing unit receives, from each telepresence terminal, local video layout information containing each seat image.
  • Each telepresence terminal uses biometric identification technology, such as face recognition, to determine whether participants are seated at each of its local seats, and sends this information to the MCU. This step is optional; the participating telepresence terminals may omit it.
  • The MCU collects and saves the local video layout information of each participating telepresence terminal (that is, whether participants are seated at each seat of the terminal).
  • Step S2: The multipoint processing unit analyzes the local video layout information of each telepresence terminal to obtain seating information for the seats of each telepresence terminal.
  • In step S2, face recognition is performed separately on each seat image in the collected local video layout information of each telepresence terminal.
  • Step S3: Based on the seating information of each telepresence terminal, the multipoint processing unit determines whether each telepresence terminal has exactly one occupied seat. That is, it determines whether the conference is a four-party telepresence conference and whether, in the local video layout information of every participating telepresence terminal, participants are seated at only one seat. Alternatively, it can be determined manually at the MCU that only one seat of each participating telepresence terminal has participants seated.
  • Step S4: When the multipoint processing unit determines that each telepresence terminal has exactly one occupied seat, it sends the occupied-seat images of the telepresence terminals to the corresponding telepresence terminals.
  • In step S4, the occupied-seat images are extracted from the local video layout information of each telepresence terminal.
  • The remote video layout information to be sent to each telepresence terminal is generated by combining the extracted seat images separately for each terminal.
  • The remote video layout information to be sent to any given telepresence terminal includes the seat images of the telepresence terminals other than that terminal.
  • The MCU automatically organizes, for each participating telepresence terminal, the remote video layout that the terminal views.
  • The left, middle and right screens show the occupied seats of the other three parties, excluding the local end.
  • The MCU can also organize the remote video layout viewed by each participating telepresence terminal under manual control: video is selected manually for each seat of the telepresence terminal, and video switching is performed separately for its left, middle and right seats, with the video sources being the occupied seats of the other three telepresence terminals, excluding the local end.
  • Each seat image in the remote video layout information is formed into a corresponding video code stream that contains a display position identifier, and is then sent to the corresponding telepresence terminal.
  • The method further includes: each telepresence terminal displays images according to its corresponding video code streams.
  • Each telepresence terminal displays the seat image on the corresponding display screen according to the display position identifier in the corresponding video code stream. That is, the final remote video layout of each of the four participating telepresence terminals consists of the occupied seats of the other three telepresence terminals, displayed on the three seat screens of the local end.
  • FIG. 3 is a structural diagram of an apparatus for displaying a layout in a telepresence conference system according to an embodiment of the present invention. As shown in FIG. 3, the apparatus includes a receiving module, an analysis module, a judging module, and a sending module.
  • The receiving module is located in the multipoint processing unit and is configured to receive, from each telepresence terminal, local video layout information containing each seat image.
  • The analysis module is located in the multipoint processing unit and is configured to analyze the local video layout information of each telepresence terminal to obtain seating information for the seats of each telepresence terminal.
  • The identification sub-module of the analysis module is configured to perform face recognition separately on each seat image in the local video layout information of each telepresence terminal.
  • The determining sub-module of the analysis module is configured to obtain, from the face or no-face recognition result for each seat image, seating information indicating whether each seat of each telepresence terminal is occupied.
  • The judging module is located in the multipoint processing unit and is configured to determine, based on the seating information of each telepresence terminal, whether each telepresence terminal has exactly one occupied seat.
  • The sending module is located in the multipoint processing unit and is configured to send the occupied-seat images of the telepresence terminals to the corresponding telepresence terminals when it is determined that each telepresence terminal has exactly one occupied seat.
  • The extraction sub-module of the sending module is configured to extract the occupied-seat images from the local video layout information of each telepresence terminal.
  • The combination sub-module of the sending module is configured to generate the remote video layout information to be sent to each telepresence terminal by combining the extracted seat images separately for each terminal.
  • The code stream sub-module of the sending module is configured to form each seat image in the remote video layout information into a corresponding video code stream that contains a display position identifier, and then send it to the corresponding telepresence terminal.
  • FIG. 4 is a diagram of a four-party conference scene in which a layout is displayed in a telepresence conference system according to an embodiment of the present invention.
  • MCU: Multipoint Control Unit.
  • The four terminals are three-screen telepresence terminals, each with a left (L), middle (C) and right (R) screen.
  • FIG. 5 is a video layout diagram of a first telepresence terminal TerA that displays a layout in a telepresence conference system according to an embodiment of the present invention.
  • The left screen (L) position of TerA has two participants seated, while the other two positions, the middle screen (C) and the right screen (R), have none.
  • FIG. 6 is a video layout diagram of a second telepresence terminal TerB that displays a layout in a telepresence conference system according to an embodiment of the present invention.
  • The middle screen (C) position of TerB has two participants seated, while the other two positions, the left screen (L) and the right screen (R), have none.
  • FIG. 7 is a video layout diagram of a third telepresence terminal TerC that displays a layout in a telepresence conference system according to an embodiment of the present invention.
  • The right screen (R) position of TerC has two participants seated, while the other two positions, the left screen (L) and the middle screen (C), have none.
  • FIG. 8 is a video layout diagram of a fourth telepresence terminal TerD that displays a layout in a telepresence conference system according to an embodiment of the present invention.
  • The middle screen (C) position of TerD has one participant seated, while the other two positions, the left screen (L) and the right screen (R), have none.
  • FIG. 9 is a remote video layout diagram seen by a first telepresence terminal TerA that displays a layout in a telepresence conference system according to an embodiment of the present invention.
  • At TerA, the left screen (L) shows the video of the two participants of TerB, the middle screen (C) shows the video of the two participants of TerC, and the right screen (R) shows the video of the one participant of TerD.
  • FIG. 10 is a remote video layout diagram seen by a second telepresence terminal TerB that displays a layout in a telepresence conference system according to an embodiment of the present invention.
  • At TerB, the left screen (L) shows the video of the two participants of TerA, the middle screen (C) shows the video of the two participants of TerC, and the right screen (R) shows the video of the one participant of TerD.
  • FIG. 11 is a remote video layout diagram seen by a third telepresence terminal TerC that displays a layout in a telepresence conference system according to an embodiment of the present invention.
  • At TerC, the left screen (L) shows the video of the two participants of TerA, the middle screen (C) shows the video of the two participants of TerB, and the right screen (R) shows the video of the one participant of TerD.
  • FIG. 12 is a remote video layout diagram seen by a fourth telepresence terminal TerD that displays a layout in a telepresence conference system according to an embodiment of the present invention.
  • At TerD, the left screen (L) shows the video of the two participants of TerA, the middle screen (C) shows the video of the two participants of TerB, and the right screen (R) shows the video of the two participants of TerC.
  • FIG. 13 is a flowchart of processing at a telepresence terminal that displays a layout in a telepresence conference system according to an embodiment of the present invention. As shown in FIG. 13, the terminal uses face recognition to determine at which seats in the locally captured video participants are seated, saves this information and sends it to the MCU.
  • FIG. 14 is a flowchart of processing at an MCU displaying a layout in a telepresence conference system according to an embodiment of the present invention.
  • As shown in FIG. 14, the MCU collects and saves the local video layout information of each participating telepresence terminal. Once the local video layout information of all participating telepresence terminals has been collected, the MCU analyzes it and determines whether the current conference is a four-party conference (that is, four telepresence terminals are participating) and whether only one seat in the local video layout of each participating telepresence terminal has participants seated. When this condition is met, the MCU begins organizing, for each participating terminal, the remote video layout that it views.
  • That remote video layout is composed of the occupied seats of the other three participating telepresence terminals, excluding the local end; the participants of those three terminals are displayed on the left, middle and right seats respectively.
  • In summary, the present invention has the following technical effect: in a four-party telepresence conference in which only one of the three seats of each participating telepresence terminal is occupied, every participant can see the participants of the other three parties at the same time in the remote video of the local telepresence terminal, as if all participants were in one conference room. In this particular scenario, the participants at each telepresence site are shown on a large screen, giving the best face-to-face sensory effect with all other participants.
  • The technical solution provided by the present invention uses a remote video layout display method to give, in a specific scenario, a reasonable layout display of the participants of all telepresence terminals other than the local telepresence terminal, enhancing the sensory experience between each party's participants and the other participants.
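The MCU processing flow described for FIG. 14 (collect and save every terminal's report, then judge the four-party, one-occupied-seat condition) can be sketched in Python as follows. This is a minimal illustration under stated assumptions, not the patent's implementation: the class name `LayoutCollector`, its methods, and the representation of local video layout information as a mapping from seat label to an occupied flag are all hypothetical.

```python
from typing import Dict, Iterable

SEATS = ("L", "C", "R")  # left, middle and right seats of a three-screen terminal


class LayoutCollector:
    """Hypothetical MCU-side helper: collect reports, then judge the scenario."""

    def __init__(self, expected_terminals: Iterable[str]) -> None:
        self.expected = set(expected_terminals)
        self.reports: Dict[str, Dict[str, bool]] = {}

    def report(self, terminal: str, occupancy: Dict[str, bool]) -> None:
        """Save one terminal's local video layout information."""
        if terminal not in self.expected:
            raise ValueError(f"unknown terminal: {terminal}")
        # Seats missing from the report are treated as empty (an assumption).
        self.reports[terminal] = {s: bool(occupancy.get(s, False)) for s in SEATS}

    def all_collected(self) -> bool:
        """True once every participating terminal has reported."""
        return set(self.reports) == self.expected

    def condition_met(self) -> bool:
        """Four-party conference with exactly one occupied seat per terminal."""
        if not self.all_collected() or len(self.expected) != 4:
            return False
        return all(sum(occ.values()) == 1 for occ in self.reports.values())
```

The judgment is deliberately deferred until all reports are in, mirroring the order in the text: the MCU analyzes only after the layout information of every participating terminal has been collected.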

Abstract

The invention relates to the field of multimedia telepresence communication and discloses a method and device for displaying a layout in a telepresence conferencing system. The method comprises: a multipoint processing unit receives, from each telepresence terminal, local video layout information containing each seat image; by analyzing the local video layout information of each telepresence terminal, the multipoint processing unit obtains attendee seating information for each telepresence terminal; according to the attendee seating information for each telepresence terminal, the multipoint processing unit determines whether each telepresence terminal has only one seat with an attendee seated; and when the multipoint processing unit determines that each telepresence terminal has only one seat with an attendee seated, it sends the images of the occupied seats of the telepresence terminals to the corresponding telepresence terminals. The invention uses a layout display method to give a reasonable layout display of the attendees of all telepresence terminals except the local telepresence terminal, thus enhancing the face-to-face experience.

Description

Method and device for displaying layout in telepresence conference system
Technical field
The present invention relates to the field of multimedia telepresence communications, and in particular to a method and apparatus for laying out the video displayed at a terminal in a telepresence system.
Background art
Telepresence (网真技术, also called 智真技术) is a teleconferencing technology that merges video communication with the experience of in-person communication. It is characterized by life-size images, ultra-high definition and low latency, and it aims to reproduce real face-to-face communication. Its implementation involves the network, communications, the conference environment, functional applications and other aspects, and what is finally presented to the conference participants is an integrated, realistic communication experience combined with transactional applications.
In a four-party telepresence conference in which only one seat at each telepresence terminal has participants, only four seats in the whole conference are actually occupied. In the conventional remote video layout, the speaking terminal is usually shown as a large picture in the remote video, and the video of the other two participating terminals is shown in small pictures on one of the seat screens.
FIG. 1 is a diagram of the conventional layout display in a telepresence conference system according to an embodiment of the present invention. FIG. 1 shows the remote video layout of one telepresence terminal: the large picture in the remote video is the video of the speaking terminal, and the small pictures on the left and right screens are the videos of the other two telepresence terminals.
Therefore, in a multipoint telepresence conference, some telepresence terminals (assuming here that all telepresence terminals are three-seat, that is, three-screen, terminals) do not have participants seated at every seat. When such a telepresence terminal serves as the remote video source for other telepresence terminals, some of its seats appear empty at the far end. As a result, some seat positions in the remote video are empty, while the participants of other terminals have no spare position in which to be displayed.
As can be seen, in the prior art telepresence readily provides a realistic communication experience in point-to-point communication. In multipoint communication, however, how to show the images of as many participants as possible across the multiple display screens, while preserving the experience of real face-to-face communication as far as possible, is a key issue for improving the user experience.
Summary of the invention
An object of the present invention is to provide a method and apparatus for displaying a layout in a telepresence conference system that solve the problem of unreasonable layout display of participants in a multipoint telepresence conference system during multipoint communication.
According to an embodiment of the present invention, a method for displaying a layout in a telepresence conference system is provided, including:
the multipoint processing unit receives, from each telepresence terminal, local video layout information containing each seat image;
the multipoint processing unit analyzes the local video layout information of each telepresence terminal to obtain seating information for the seats of each telepresence terminal;
based on the seating information of each telepresence terminal, the multipoint processing unit determines whether each telepresence terminal has exactly one occupied seat;
when the multipoint processing unit determines that each telepresence terminal has exactly one occupied seat, it sends the occupied-seat images of the telepresence terminals to the corresponding telepresence terminals.
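The four steps above can be sketched in Python as a single function. This is a hedged illustration rather than the claimed implementation: the function name `occupied_seat_images`, the use of raw `bytes` for seat images, and the injected `is_occupied` predicate (standing in for whatever analysis the multipoint processing unit performs) are assumptions introduced here.

```python
from typing import Callable, Dict, Optional

SeatImages = Dict[str, bytes]  # seat label ("L", "C", "R") -> encoded seat image


def occupied_seat_images(
    local_layouts: Dict[str, SeatImages],
    is_occupied: Callable[[bytes], bool],
) -> Optional[Dict[str, bytes]]:
    """Return {terminal: occupied-seat image}, or None if the condition fails."""
    selected: Dict[str, bytes] = {}
    # Step 1: local_layouts holds the layout information received per terminal.
    for terminal, seats in local_layouts.items():
        # Step 2: analyze each seat image to obtain seating information.
        occupied = [image for image in seats.values() if is_occupied(image)]
        # Step 3: every terminal must have exactly one occupied seat.
        if len(occupied) != 1:
            return None
        selected[terminal] = occupied[0]
    # Step 4: these are the images to distribute to the other terminals.
    return selected
```

Returning `None` when any terminal fails the check reflects that the method applies only to the specific scenario in which every terminal has exactly one occupied seat; what the conference falls back to otherwise is not specified in the text.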
Preferably, face recognition is performed separately on each seat image in the local video layout information of each telepresence terminal;
seating information indicating whether each seat of each telepresence terminal is occupied is obtained from the face or no-face recognition result for each seat image.
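The mapping from recognition results to seating information can be illustrated with a short Python sketch. The face detector itself is out of scope here: `count_faces` is a hypothetical callable supplied by the caller, and the function name `seating_info` and the `bytes` image type are likewise assumptions made for illustration.

```python
from typing import Callable, Dict


def seating_info(
    seat_images: Dict[str, bytes],
    count_faces: Callable[[bytes], int],
) -> Dict[str, bool]:
    """Map each seat label to True if at least one face was recognized there."""
    return {seat: count_faces(image) > 0 for seat, image in seat_images.items()}
```

Only the presence or absence of a face matters for the seating information, so a seat with two participants still counts as a single occupied seat.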
Preferably, the occupied-seat images are extracted from the local video layout information of each telepresence terminal;
the remote video layout information to be sent to each telepresence terminal is generated by combining the extracted seat images separately for each terminal;
wherein the remote video layout information to be sent to any given telepresence terminal includes the seat images of the telepresence terminals other than that terminal.
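One way to combine the extracted seat images into per-terminal remote layouts is sketched below in Python. The ordering rule (remaining terminals assigned to the left, middle and right screens in conference order) is an assumption inferred from the four-party example with TerA, TerB, TerC and TerD, not a rule stated in the text; the function name `build_remote_layouts` is also hypothetical.

```python
from typing import Dict, Tuple

SCREENS = ("L", "C", "R")  # left, middle and right screens of the viewing terminal


def build_remote_layouts(
    occupied_seat: Dict[str, str],
) -> Dict[str, Dict[str, Tuple[str, str]]]:
    """For each viewer, map screen -> (source terminal, source's occupied seat)."""
    layouts: Dict[str, Dict[str, Tuple[str, str]]] = {}
    for viewer in occupied_seat:
        # A terminal never receives its own seat image.
        others = [terminal for terminal in occupied_seat if terminal != viewer]
        if len(others) != len(SCREENS):
            raise ValueError("expected exactly three remote terminals per viewer")
        layouts[viewer] = {
            screen: (source, occupied_seat[source])
            for screen, source in zip(SCREENS, others)
        }
    return layouts
```

With `{"TerA": "L", "TerB": "C", "TerC": "R", "TerD": "C"}` as input, TerA's layout places TerB on the left screen, TerC on the middle screen and TerD on the right screen, which matches the arrangement described for FIG. 9.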
Preferably, each seat image in the remote video layout information is formed into a corresponding video code stream that contains a display position identifier, and is then sent to the corresponding telepresence terminal.
Preferably, the method further includes: each telepresence terminal displays images according to its corresponding video code streams.
Preferably, each telepresence terminal displays the seat image full-screen on the corresponding display screen according to the display position identifier in the corresponding video code stream.
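On the terminal side, routing each received stream to a screen by its display position identifier might look like the following Python sketch. The text does not specify how the identifier is encoded, so the `SeatStream` record, its field names and the `"L"`/`"C"`/`"R"` identifier values are assumptions made purely for illustration.

```python
from dataclasses import dataclass
from typing import Dict, Iterable

SCREENS = ("L", "C", "R")  # assumed identifier values for the three screens


@dataclass(frozen=True)
class SeatStream:
    """Hypothetical stand-in for one video code stream carrying a seat image."""
    source_terminal: str
    display_position: str  # display position identifier
    payload: bytes         # encoded video data (opaque here)


def assign_to_screens(streams: Iterable[SeatStream]) -> Dict[str, SeatStream]:
    """Map each screen to the stream whose identifier targets it."""
    screens: Dict[str, SeatStream] = {}
    for stream in streams:
        if stream.display_position not in SCREENS:
            raise ValueError(f"unknown display position: {stream.display_position}")
        if stream.display_position in screens:
            raise ValueError(f"two streams target screen {stream.display_position}")
        screens[stream.display_position] = stream
    return screens
```

Carrying the identifier inside each stream means the terminal needs no knowledge of how the multipoint processing unit chose the layout; it simply shows each stream full-screen on the screen the identifier names.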
According to another embodiment of the present invention, an apparatus for displaying a layout in a telepresence conference system is provided, including:
a receiving module, located in the multipoint control unit (MCU) and configured to receive the local video layout information that is sent by each telepresence terminal and contains an image of each seat;
an analysis module, located in the multipoint control unit and configured to analyze the local video layout information of each telepresence terminal to obtain seating information for the seats of each telepresence terminal;
a judging module, located in the multipoint control unit and configured to determine, according to the seating information for the seats of each telepresence terminal, whether each telepresence terminal has only one seat at which participants are seated;
a sending module, located in the multipoint control unit and configured to send, when it is determined that each telepresence terminal has only one seat at which participants are seated, the images of the occupied seats of the telepresence terminals to the corresponding telepresence terminals.
Preferably, the analysis module further includes:
a recognition sub-module, configured to perform face recognition on each seat image in the local video layout information of each telepresence terminal;
a determining sub-module, configured to obtain, from the recognition result of each seat image (face present or no face present), seating information indicating whether participants are seated at each seat of each telepresence terminal.
Preferably, the sending module further includes:
an extraction sub-module, configured to extract, from the local video layout information of each telepresence terminal, the images of the seats at which participants are seated;
a combination sub-module, configured to combine the extracted seat images separately to generate the remote video layout information to be sent to each telepresence terminal.
Preferably, the sending module further includes:
a stream sub-module, configured to form each seat image in the remote video layout information into a corresponding video stream carrying a display position identifier, and then send the streams to the corresponding telepresence terminal.
Compared with the prior art, the beneficial effect of the present invention is as follows: by means of a remote video layout display method, the participants at all telepresence terminals other than the local one are shown in a sensible layout in a specific scenario, which strengthens each party's sense of meeting the other parties face to face.
Brief Description of the Drawings
FIG. 1 is a diagram of a conventional layout display in a telepresence conference system according to an embodiment of the present invention;
FIG. 2 is a flowchart of a method for displaying a layout in a telepresence conference system according to an embodiment of the present invention;
FIG. 3 is a structural diagram of an apparatus for displaying a layout in a telepresence conference system according to an embodiment of the present invention;
FIG. 4 is a diagram of a four-party conference scenario for displaying a layout in a telepresence conference system according to an embodiment of the present invention;
FIG. 5 is a video layout diagram of the first telepresence terminal, TerA, according to an embodiment of the present invention;
FIG. 6 is a video layout diagram of the second telepresence terminal, TerB, according to an embodiment of the present invention;
FIG. 7 is a video layout diagram of the third telepresence terminal, TerC, according to an embodiment of the present invention;
FIG. 8 is a video layout diagram of the fourth telepresence terminal, TerD, according to an embodiment of the present invention;
FIG. 9 is a diagram of the remote video layout seen by the first telepresence terminal, TerA, according to an embodiment of the present invention;
FIG. 10 is a diagram of the remote video layout seen by the second telepresence terminal, TerB, according to an embodiment of the present invention;
FIG. 11 is a diagram of the remote video layout seen by the third telepresence terminal, TerC, according to an embodiment of the present invention;
FIG. 12 is a diagram of the remote video layout seen by the fourth telepresence terminal, TerD, according to an embodiment of the present invention;
FIG. 13 is a processing flowchart of a telepresence terminal for displaying a layout in a telepresence conference system according to an embodiment of the present invention;
FIG. 14 is a processing flowchart of the MCU for displaying a layout in a telepresence conference system according to an embodiment of the present invention.
Detailed Description
Preferred embodiments of the present invention are described in detail below with reference to the accompanying drawings. It should be understood that the preferred embodiments described below serve only to illustrate and explain the present invention and are not intended to limit it.
FIG. 2 is a flowchart of a method for displaying a layout in a telepresence conference system according to an embodiment of the present invention. As shown in FIG. 2, the application scenario is limited to a four-party telepresence conference in which all four participating telepresence terminals are three-screen terminals and the participants of each party are all seated at a single seat. The steps are as follows:
Step S1: The multipoint control unit (MCU) receives the local video layout information that is sent by each telepresence terminal and contains an image of each seat. Each telepresence terminal may use a biometric recognition technology, such as face recognition, to identify whether participants are seated at each of its local seats, and send this information to the MCU. This processing is optional; a participating telepresence terminal may skip it. The MCU collects and stores the local video layout information from every participating telepresence terminal (that is, the information on whether participants are seated at each seat of the terminal).
Step S2: The multipoint control unit analyzes the local video layout information of each telepresence terminal to obtain seating information for the seats of each telepresence terminal.
In step S2, face recognition is performed on each seat image in the collected local video layout information of each telepresence terminal;
from the recognition result of each seat image (face present or no face present), seating information is obtained indicating whether participants are seated at each seat of each telepresence terminal.
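The analysis in step S2 can be sketched as follows. This is a minimal illustration, not the patent's implementation: the function name `seating_info`, the seat keys `"L"`, `"C"`, `"R"`, and the `count_faces` callable are all assumptions introduced here, and the face detector itself is deliberately left as a parameter because the text does not name one.

```python
from typing import Callable, Dict

SEATS = ("L", "C", "R")  # left, center, right seats of a three-screen terminal


def seating_info(
    seat_images: Dict[str, object],
    count_faces: Callable[[object], int],
) -> Dict[str, bool]:
    """Map each seat to True if at least one face is found in its image.

    `count_faces` is a hypothetical stand-in for whatever face
    recognition routine the MCU (or terminal) actually uses.
    """
    return {seat: count_faces(seat_images[seat]) > 0 for seat in SEATS}


if __name__ == "__main__":
    # Stub data: each "image" is just a list of detected people, so len() acts as the detector.
    images = {"L": ["person1", "person2"], "C": [], "R": []}
    print(seating_info(images, len))
```

A seat counts as occupied as soon as one face is found, so a seat shared by two participants is still a single occupied seat.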
Step S3: The multipoint control unit determines, according to the seating information for the seats of each telepresence terminal, whether each telepresence terminal has only one seat at which participants are seated. Specifically, it determines whether the conference is a four-party telepresence conference and whether the local video layout information of every participating telepresence terminal shows exactly one seat with participants seated. Alternatively, the determination that every participating telepresence terminal has only one occupied seat may be made manually at the MCU.
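The condition checked in step S3 can be written as a single predicate. The sketch below assumes that seating information is held as a dictionary from terminal name to per-seat booleans; that representation and the name `single_seat_condition` are illustrative choices, not taken from the source.

```python
from typing import Dict


def single_seat_condition(
    occupancy: Dict[str, Dict[str, bool]],
    parties: int = 4,
) -> bool:
    """Return True when the conference has `parties` terminals and
    every terminal has exactly one occupied seat."""
    if len(occupancy) != parties:
        return False
    return all(sum(seats.values()) == 1 for seats in occupancy.values())


if __name__ == "__main__":
    conference = {
        "TerA": {"L": True, "C": False, "R": False},
        "TerB": {"L": False, "C": True, "R": False},
        "TerC": {"L": False, "C": False, "R": True},
        "TerD": {"L": False, "C": True, "R": False},
    }
    print(single_seat_condition(conference))
```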
Step S4: When the multipoint control unit determines that each telepresence terminal has only one seat at which participants are seated, it sends the images of the occupied seats of the telepresence terminals to the corresponding telepresence terminals.
In step S4, the images of the seats at which participants are seated are extracted from the local video layout information of each telepresence terminal;
the extracted seat images are combined separately to generate the remote video layout information to be sent to each telepresence terminal.
The remote video layout information to be sent to any given telepresence terminal includes the seat images of the telepresence terminals other than that terminal. The MCU automatically organizes, for each participating telepresence terminal, the remote video layout it views: the left, center, and right screens show, respectively, the occupied seats of the three telepresence terminals other than the local one.
In addition, the MCU may organize the remote video layout for each participating telepresence terminal under manual control. That is, an operator selects the video to be viewed at each seat of each participating telepresence terminal and switches the video for its left, center, and right seats individually, the video sources being the occupied seats of the three telepresence terminals other than the local one.
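The automatic layout composition of step S4 can be illustrated with a short sketch. It assumes the MCU already knows which single seat is occupied at each terminal. The function name, the data shapes, and the rule that the other terminals fill the left, center, and right screens in conference order are assumptions; that ordering matches the layouts of FIGS. 9 to 12, but the text does not state it as a requirement.

```python
from typing import Dict, Tuple

SCREENS = ("L", "C", "R")


def compose_remote_layouts(
    occupied_seat: Dict[str, str],
) -> Dict[str, Dict[str, Tuple[str, str]]]:
    """For each viewing terminal, assign the occupied seats of the other
    terminals to its left, center, and right screens.

    `occupied_seat` maps terminal name -> its one occupied seat, in
    conference order (Python dicts preserve insertion order).
    """
    layouts = {}
    for viewer in occupied_seat:
        others = [(t, s) for t, s in occupied_seat.items() if t != viewer]
        layouts[viewer] = dict(zip(SCREENS, others))
    return layouts


if __name__ == "__main__":
    seats = {"TerA": "L", "TerB": "C", "TerC": "R", "TerD": "C"}
    for viewer, layout in compose_remote_layouts(seats).items():
        print(viewer, layout)
```

Each screen entry is a `(source terminal, source seat)` pair, so the same structure also covers manual control: an operator would simply supply a different assignment.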
Further, the method also includes:
forming each seat image in the remote video layout information into a corresponding video stream carrying a display position identifier, and then sending the streams to the corresponding telepresence terminal.
Further, the method also includes: each telepresence terminal displays images according to the corresponding video streams.
Further, each telepresence terminal displays each seat image full-screen on the corresponding display screen according to the display position identifier in the corresponding video stream. That is, the final remote video layout at each of the four participating telepresence terminals consists of the occupied seats of the other three telepresence terminals, shown in order on the three seat screens of the local terminal.
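One way to picture the display position identifier is as a tag attached to each outgoing stream, which the receiving terminal uses to pick a screen. The sketch below is illustrative only: the `SeatStream` record, its field names, and the use of raw bytes in place of encoded video are assumptions, and no real signaling or media format is implied.

```python
from dataclasses import dataclass
from typing import Dict, List, Tuple


@dataclass(frozen=True)
class SeatStream:
    display_position: str  # "L", "C", or "R": target screen on the receiving terminal
    source_terminal: str   # terminal the seat image comes from
    source_seat: str       # occupied seat at the source terminal
    payload: bytes         # stand-in for the encoded video data


def build_streams(
    layout: Dict[str, Tuple[str, str]],
    frames: Dict[Tuple[str, str], bytes],
) -> List[SeatStream]:
    """MCU side: wrap each seat image of one remote layout in a tagged stream."""
    return [
        SeatStream(position, terminal, seat, frames[(terminal, seat)])
        for position, (terminal, seat) in layout.items()
    ]


def route_to_screens(streams: List[SeatStream]) -> Dict[str, bytes]:
    """Terminal side: choose the screen for each stream from its identifier."""
    return {stream.display_position: stream.payload for stream in streams}


if __name__ == "__main__":
    layout_for_a = {"L": ("TerB", "C"), "C": ("TerC", "R"), "R": ("TerD", "C")}
    frames = {("TerB", "C"): b"B", ("TerC", "R"): b"C", ("TerD", "C"): b"D"}
    print(route_to_screens(build_streams(layout_for_a, frames)))
```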
FIG. 3 is a structural diagram of an apparatus for displaying a layout in a telepresence conference system according to an embodiment of the present invention. As shown in FIG. 3, the apparatus includes a receiving module, an analysis module, a judging module, and a sending module.
The receiving module is located in the multipoint control unit and is configured to receive the local video layout information that is sent by each telepresence terminal and contains an image of each seat.
The analysis module is located in the multipoint control unit and is configured to analyze the local video layout information of each telepresence terminal to obtain seating information for the seats of each telepresence terminal. The recognition sub-module of the analysis module is configured to perform face recognition on each seat image in the local video layout information of each telepresence terminal. The determining sub-module of the analysis module is configured to obtain, from the recognition result of each seat image (face present or no face present), seating information indicating whether participants are seated at each seat of each telepresence terminal.
The judging module is located in the multipoint control unit and is configured to determine, according to the seating information for the seats of each telepresence terminal, whether each telepresence terminal has only one seat at which participants are seated.
The sending module is located in the multipoint control unit and is configured to send, when it is determined that each telepresence terminal has only one seat at which participants are seated, the images of the occupied seats of the telepresence terminals to the corresponding telepresence terminals. The extraction sub-module of the sending module is configured to extract, from the local video layout information of each telepresence terminal, the images of the seats at which participants are seated. The combination sub-module of the sending module is configured to combine the extracted seat images separately to generate the remote video layout information to be sent to each telepresence terminal. The stream sub-module of the sending module is configured to form each seat image in the remote video layout information into a corresponding video stream carrying a display position identifier, and then send the streams to the corresponding telepresence terminal.
FIG. 4 is a diagram of a four-party conference scenario for displaying a layout in a telepresence conference system according to an embodiment of the present invention. As shown in FIG. 4, four telepresence terminals, TerA, TerB, TerC, and TerD, jointly take part in a telepresence conference hosted on a multipoint control unit (MCU). All four are three-screen telepresence terminals with left (L), center (C), and right (R) screens.
FIG. 5 is a video layout diagram of the first telepresence terminal, TerA, according to an embodiment of the present invention. As shown in FIG. 5, two participants are seated at the left-screen (L) position of TerA, and no participants are seated at the other two positions, the center screen (C) and the right screen (R).
FIG. 6 is a video layout diagram of the second telepresence terminal, TerB, according to an embodiment of the present invention. As shown in FIG. 6, two participants are seated at the center-screen (C) position of TerB, and no participants are seated at the other two positions, the left screen (L) and the right screen (R).
FIG. 7 is a video layout diagram of the third telepresence terminal, TerC, according to an embodiment of the present invention. As shown in FIG. 7, two participants are seated at the right-screen (R) position of TerC, and no participants are seated at the other two positions, the left screen (L) and the center screen (C).
FIG. 8 is a video layout diagram of the fourth telepresence terminal, TerD, according to an embodiment of the present invention. As shown in FIG. 8, one participant is seated at the center-screen (C) position of TerD, and no participants are seated at the other two positions, the left screen (L) and the right screen (R).
FIG. 9 is a diagram of the remote video layout seen by the first telepresence terminal, TerA, according to an embodiment of the present invention. As shown in FIG. 9, the left screen (L) of TerA shows the video of the two participants at TerB, the center screen (C) shows the video of the two participants at TerC, and the right screen (R) shows the video of the one participant at TerD.
FIG. 10 is a diagram of the remote video layout seen by the second telepresence terminal, TerB, according to an embodiment of the present invention. As shown in FIG. 10, the left screen (L) of TerB shows the video of the two participants at TerA, the center screen (C) shows the video of the two participants at TerC, and the right screen (R) shows the video of the one participant at TerD.
FIG. 11 is a diagram of the remote video layout seen by the third telepresence terminal, TerC, according to an embodiment of the present invention. As shown in FIG. 11, the left screen (L) of TerC shows the video of the two participants at TerA, the center screen (C) shows the video of the two participants at TerB, and the right screen (R) shows the video of the one participant at TerD.
FIG. 12 is a diagram of the remote video layout seen by the fourth telepresence terminal, TerD, according to an embodiment of the present invention. As shown in FIG. 12, the left screen (L) of TerD shows the video of the two participants at TerA, the center screen (C) shows the video of the two participants at TerB, and the right screen (R) shows the video of the two participants at TerC.
FIG. 13 is a processing flowchart of a telepresence terminal for displaying a layout in a telepresence conference system according to an embodiment of the present invention. As shown in FIG. 13, the terminal uses face recognition to determine at which seat in the video currently captured locally participants are seated, stores this information, and sends it to the MCU.
FIG. 14 is a processing flowchart of the MCU for displaying a layout in a telepresence conference system according to an embodiment of the present invention. As shown in FIG. 14, the MCU collects and stores the local video layout information of each participating telepresence terminal. Once the local video layout information of all participating terminals has been collected, the MCU analyzes it and checks whether the current conference is a four-party conference (that is, a conference with four participating telepresence terminals) and whether the local video layout of every participating terminal has only one seat with participants. When this condition is met, the MCU organizes, for each participating terminal, the remote video layout it views. Each remote video layout consists of the occupied seats of the three participating telepresence terminals other than the local one, with the participants of those three terminals shown at the left, center, and right seats respectively.
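The MCU flow of FIG. 14 (collect, wait until every terminal has reported, check the condition, then compose) can be sketched as a small stateful controller. The class and method names are hypothetical, seating information is modeled as per-seat booleans, and filling the screens in conference order is an assumption consistent with FIGS. 9 to 12 rather than a stated rule.

```python
from typing import Dict, List, Optional, Tuple

SCREENS = ("L", "C", "R")
Layout = Dict[str, Tuple[str, str]]  # screen -> (source terminal, source seat)


class McuLayoutController:
    """Hypothetical sketch of the MCU-side flow described for FIG. 14."""

    def __init__(self, terminals: List[str]) -> None:
        self.terminals = list(terminals)              # conference order
        self.reports: Dict[str, Dict[str, bool]] = {}  # stored seating info

    def report(self, terminal: str, seats: Dict[str, bool]) -> Optional[Dict[str, Layout]]:
        """Store one terminal's seating info; return layouts once all have
        reported and the four-party, single-seat condition holds, else None."""
        self.reports[terminal] = dict(seats)
        if set(self.reports) != set(self.terminals):
            return None  # still waiting for other terminals
        return self._compose()

    def _compose(self) -> Optional[Dict[str, Layout]]:
        if len(self.terminals) != 4:
            return None  # not a four-party conference
        occupied: Dict[str, str] = {}
        for terminal in self.terminals:
            taken = [s for s, used in self.reports[terminal].items() if used]
            if len(taken) != 1:
                return None  # condition not met; special layout is not applied
            occupied[terminal] = taken[0]
        return {
            viewer: dict(zip(
                SCREENS,
                [(t, occupied[t]) for t in self.terminals if t != viewer],
            ))
            for viewer in self.terminals
        }


if __name__ == "__main__":
    mcu = McuLayoutController(["TerA", "TerB", "TerC", "TerD"])
    mcu.report("TerA", {"L": True, "C": False, "R": False})
    mcu.report("TerB", {"L": False, "C": True, "R": False})
    mcu.report("TerC", {"L": False, "C": False, "R": True})
    print(mcu.report("TerD", {"L": False, "C": True, "R": False}))
```

Returning `None` when the condition fails leaves the choice of fallback layout to the caller, since the text only describes the case where the condition is met.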
In summary, the present invention has the following technical effect: in a four-party telepresence conference, when only one of the three seats at each participating telepresence terminal has participants, every participant can see the participants of the other three parties at the same time through the remote video of his or her own terminal. In other words, all participants can be seen from a single conference room. In this specific scenario, the participants at every telepresence site are therefore shown on a large picture, giving the best face-to-face impression of all participants.
Although the present invention has been described in detail above, it is not limited thereto, and those skilled in the art may make various modifications according to the principles of the present invention. Modifications made according to the principles of the present invention should therefore be understood as falling within its scope of protection.
Industrial Applicability
With the technical solution provided by the present invention, by means of a remote video layout display method, the participants at all telepresence terminals other than the local one are shown in a sensible layout in a specific scenario, which strengthens each party's sense of meeting the other parties face to face.

Claims (10)

  1. A method for displaying a layout in a telepresence conference system, comprising:
    receiving, by a multipoint control unit, local video layout information that is sent by each telepresence terminal and contains an image of each seat;
    analyzing, by the multipoint control unit, the local video layout information of each telepresence terminal to obtain seating information for the seats of each telepresence terminal;
    determining, by the multipoint control unit according to the seating information for the seats of each telepresence terminal, whether each telepresence terminal has only one seat at which participants are seated;
    when the multipoint control unit determines that each telepresence terminal has only one seat at which participants are seated, sending the images of the occupied seats of the telepresence terminals to the corresponding telepresence terminals.
  2. The method according to claim 1, wherein the step of analyzing, by the multipoint control unit, the local video layout information of each telepresence terminal to obtain seating information for the seats of each telepresence terminal comprises:
    performing face recognition on each seat image in the local video layout information of each telepresence terminal;
    obtaining, from the recognition result of each seat image (face present or no face present), seating information indicating whether participants are seated at each seat of each telepresence terminal.
  3. The method according to claim 1, wherein the step of sending the images of the occupied seats of the telepresence terminals to the corresponding telepresence terminals comprises:
    extracting, from the local video layout information of each telepresence terminal, the images of the seats at which participants are seated;
    combining the extracted seat images separately to generate the remote video layout information to be sent to each telepresence terminal;
    wherein the remote video layout information to be sent to any given telepresence terminal includes the seat images of the telepresence terminals other than that terminal.
  4. The method according to claim 3, wherein the step of sending the images of the occupied seats of the telepresence terminals to the corresponding telepresence terminals further comprises:
    forming each seat image in the remote video layout information into a corresponding video stream carrying a display position identifier, and then sending the streams to the corresponding telepresence terminal.
  5. The method according to claim 3 or 4, further comprising: displaying, by each telepresence terminal, images according to the corresponding video streams.
  6. The method according to claim 5, wherein each telepresence terminal displays each seat image full-screen on the corresponding display screen according to the display position identifier in the corresponding video stream.
  7. An apparatus for displaying a layout in a telepresence conference system, comprising:
    a receiving module, located in a multipoint control unit and configured to receive local video layout information that is sent by each telepresence terminal and contains an image of each seat;
    an analysis module, located in the multipoint control unit and configured to analyze the local video layout information of each telepresence terminal to obtain seating information for the seats of each telepresence terminal;
    a judging module, located in the multipoint control unit and configured to determine, according to the seating information for the seats of each telepresence terminal, whether each telepresence terminal has only one seat at which participants are seated;
    a sending module, located in the multipoint control unit and configured to send, when it is determined that each telepresence terminal has only one seat at which participants are seated, the images of the occupied seats of the telepresence terminals to the corresponding telepresence terminals.
  8. The apparatus according to claim 7, wherein the analysis module further comprises:
    a recognition sub-module, configured to perform face recognition on each seat image in the local video layout information of each telepresence terminal;
    a determining sub-module, configured to obtain, from the recognition result of each seat image (face present or no face present), seating information indicating whether participants are seated at each seat of each telepresence terminal.
  9. The apparatus according to claim 7, wherein the sending module further comprises:
    an extraction sub-module, configured to extract, from the local video layout information of each telepresence terminal, the images of the seats at which participants are seated;
    a combination sub-module, configured to combine the extracted seat images separately to generate the remote video layout information to be sent to each telepresence terminal.
  10. The apparatus according to claim 9, wherein the sending module further comprises:
    a stream sub-module, configured to form each seat image in the remote video layout information into a corresponding video stream carrying a display position identifier, and then send the streams to the corresponding telepresence terminal.
PCT/CN2014/087606 2014-03-05 2014-09-26 Method and device for displaying layout in telepresence conferencing system WO2015131520A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN201410077770.6 2014-03-05
CN201410077770.6A CN104902217B (en) 2014-03-05 2014-03-05 Method and device for displaying layout in telepresence conference system

Publications (1)

Publication Number Publication Date
WO2015131520A1 true WO2015131520A1 (en) 2015-09-11

Family

ID=54034581

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2014/087606 WO2015131520A1 (en) 2014-03-05 2014-09-26 Method and device for displaying layout in telepresence conferencing system

Country Status (2)

Country Link
CN (1) CN104902217B (en)
WO (1) WO2015131520A1 (en)

Family Cites Families (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8542266B2 (en) * 2007-05-21 2013-09-24 Polycom, Inc. Method and system for adapting a CP layout according to interaction between conferees
US8446454B2 (en) * 2007-05-21 2013-05-21 Polycom, Inc. Dynamic adaption of a continuous presence videoconferencing layout based on video content
WO2009117005A1 (en) * 2008-03-17 2009-09-24 Hewlett-Packard Development Company, L.P. Displaying panoramic video image streams
US8355040B2 (en) * 2008-10-16 2013-01-15 Teliris, Inc. Telepresence conference room layout, dynamic scenario manager, diagnostics and control system and method
US8537195B2 (en) * 2011-02-09 2013-09-17 Polycom, Inc. Automatic video layouts for multi-stream multi-site telepresence conferencing system

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN114615458A (en) * 2022-05-10 2022-06-10 全时云商务服务股份有限公司 Method and device for real-time screen closing and rapid drawing in cloud conference
CN114615458B (en) * 2022-05-10 2022-07-19 全时云商务服务股份有限公司 Method and device for real-time screen closing and rapid drawing in cloud conference, storage medium and server

Also Published As

Publication number Publication date
CN104902217B (en) 2019-07-16
CN104902217A (en) 2015-09-09

Similar Documents

Publication Publication Date Title
US10750124B2 (en) Methods and system for simulated 3D videoconferencing
US8319819B2 (en) Virtual round-table videoconference
US9641585B2 (en) Automated video editing based on activity in video conference
US9041767B2 (en) Method and system for adapting a CP layout according to interaction between conferees
CN106878658B (en) Automatic video layout for multi-stream multi-site telepresence conferencing system
CN102177711B (en) Method, device and computer program for processing images during video conferencing
US9729825B2 (en) Method for generating an immersive video of a plurality of persons
KR20160125972A (en) Displaying a presenter during a video conference
EP2838257B1 (en) A method for generating an immersive video of a plurality of persons
KR20110050595A (en) Compositing video streams
US11076127B1 (en) System and method for automatically framing conversations in a meeting or a video conference
US9088693B2 (en) Providing direct eye contact videoconferencing
WO2014177082A1 (en) Video conference video processing method and terminal
WO2016206471A1 (en) Multimedia service processing method, system and device
US20090115835A1 (en) Visually Enhancing a Conference
WO2015131520A1 (en) Method and device for displaying layout in telepresence conferencing system
US9609273B2 (en) System and method for not displaying duplicate images in a video conference
US20080043962A1 (en) Methods, systems, and computer program products for implementing enhanced conferencing services
CN114598835A (en) System and method for displaying users participating in a communication session
Kauff et al. Virtual team user environments-a step from tele-cubicles towards distributed tele-collaboration in mediated workspaces
US20230199041A1 (en) Remote collaboration platform
US10986311B1 (en) Three-way video visitation detection using frame detection
WO2019223736A1 (en) Video control method, video conference terminal and multi-point control unit (mcu)
JPH0514884A (en) Visual conference system
McNelley et al. What is Telepresence?

Legal Events

Date Code Title Description
121 EP: The EPO has been informed by WIPO that EP was designated in this application (Ref document number: 14884894; Country of ref document: EP; Kind code of ref document: A1)
NENP Non-entry into the national phase (Ref country code: DE)
122 EP: PCT application non-entry in European phase (Ref document number: 14884894; Country of ref document: EP; Kind code of ref document: A1)