WO2011153926A1 - Conference site image broadcast method and multipoint control unit - Google Patents

Conference site image broadcast method and multipoint control unit

Info

Publication number
WO2011153926A1
Authority
WO
WIPO (PCT)
Prior art keywords
site
image
agent
screen
conference
Prior art date
Application number
PCT/CN2011/075302
Other languages
English (en)
French (fr)
Inventor
吴明亮
孙波
Original Assignee
中兴通讯股份有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 中兴通讯股份有限公司
Publication of WO2011153926A1

Classifications

    • H: ELECTRICITY
    • H04: ELECTRIC COMMUNICATION TECHNIQUE
    • H04N: PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N7/00: Television systems
    • H04N7/14: Systems for two-way working
    • H04N7/15: Conference systems
    • H04N7/152: Multipoint control units therefor
    • H: ELECTRICITY
    • H04: ELECTRIC COMMUNICATION TECHNIQUE
    • H04N: PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N7/00: Television systems
    • H04N7/14: Systems for two-way working
    • H04N7/15: Conference systems

Definitions

  • a video conference system mainly comprises a multipoint control unit (MCU) and terminal systems
  • the MCU is the key device of a multipoint video conference system. It extracts audio, video, data and other information, together with signaling, from the information stream arriving from each conference site.
  • Each site's information and signaling are then sent to the multipoint control module and the media processing module respectively, which carry out audio mixing or switching, video mixing or switching, data broadcasting and routing, timing, and conference control.
  • Finally, the information each conference site requires is reassembled and sent to the corresponding terminal equipment.
  • the terminal system is divided into two types: desktop conference terminal and conference room conference terminal.
  • the desktop conference terminal is low in cost and easy to use, and is suitable for personal office use and small-scale conferences.
  • the conference room terminal is equipped with external auxiliary equipment such as a high-quality zoom lens, high-fidelity audio, and a large-screen television or projector, together with video pre-processors and post-processors, so that the picture is sharper and the meeting experience better. It is suitable for larger meetings.
  • the conference room terminal is intended for conference rooms with anywhere from a few to several dozen participants.
  • a broadcast terminal is a one-way receiving terminal that can receive images and sounds of a conference, but cannot transmit images and sounds.
  • the broadcast terminal can be set to occasions where information needs to be transmitted only in one direction, for example, when the superior level communicates policies to the lower level.
  • the mobile terminal is based on the desktop terminal, equipped with a wireless access card and a wireless transmitting device, and can be moved to join the conference within a certain area.
  • conference terminals are deployed at the local conference site and at each branch site of a video conference.
  • in a traditional video conference system, each site has a single screen and a single audio channel.
  • when someone speaks at a site and that site is the loudest, its image is automatically broadcast to the other sites. That is, the voice-activated system broadcasts the image of the loudest endpoint to the other sites without manual operation.
  • telepresence is popular with high-end users because of its realistic sense of presence. As costs gradually fall and household demand grows, telepresence will gradually reach ordinary homes.
  • applying telepresence technology to a conference television system yields a telepresence conference system. In such a system, each site has multiple screens and multiple audio channels, and audio input and output correspond in position to the screens.
  • the present invention provides a voice-activated broadcast method and a telepresence conference system to solve at least one of the above problems.
  • a conference site image broadcast method includes: determining, during a conference, the site with the highest audio stream signal strength; and broadcasting that site's image to at least one of the multiple screens at each of the other sites in the telepresence conference television system.
  • determining the site with the highest audio stream signal strength includes: determining the seat with the highest audio stream signal strength at each site of the telepresence conference television system; comparing the audio stream signal strengths of those seats to obtain the seat with the highest strength among them;
  • the site to which the obtained seat belongs is determined as the site with the highest audio stream signal strength.
  • the site image includes: a site panoramic image and a site seat image.
  • the site image is the site seat image, and the seat corresponding to the site seat image is the loudest seat.
  • the site image of the site is broadcast to at least one screen of the other sites.
  • the method includes: judging whether the seat corresponding to the site seat image is the current broadcast source; if not, broadcasting the site seat image to one screen at each of the other sites. Broadcasting the site seat image to one screen at each of the other sites includes: finding the screen at each other site on which the site seat image has been displayed most frequently; judging whether, on that screen, the time since the corresponding seat last spoke exceeds a predetermined duration; and if so, updating the speaking time of that seat on that screen.
  • broadcasting the site seat image to one screen at each of the other sites includes: for each other site, when no screen with the highest display frequency for the site seat image is found, finding a screen whose display has not changed and recording the number of times the site seat image has been displayed on it; broadcasting the site seat image to that screen; and updating the speaking time of the corresponding seat on that screen.
  • a multipoint control unit is provided.
  • the multipoint control unit includes: a determining module, configured to determine, during a conference, the site with the highest audio stream signal strength; and a broadcast module, configured to broadcast that site's image to at least one of the multiple screens at each of the other sites in the telepresence conference television system.
  • the determining module includes: a first determining sub-module, configured to determine the seat with the highest audio stream signal strength at each site of the telepresence conference television system; a comparison sub-module, configured to compare the audio stream signal strengths of those seats to obtain the seat with the highest strength among them; and a second determining sub-module, configured to determine the site to which the seat obtained by the comparison sub-module belongs as the site with the highest audio stream signal strength.
  • the broadcast module is configured to broadcast each seat image of the site to the seat screen corresponding to that seat at each other site.
  • the broadcast module includes: a judging sub-module, configured to judge whether the seat corresponding to the site seat image is the broadcast source; and a broadcast sub-module, configured to broadcast the site seat image to one screen at each of the other sites when the judging sub-module outputs no.
  • FIG. 1 is a schematic structural diagram of a telepresence conference system according to an embodiment of the present invention
  • FIG. 2 is a flowchart of a method for broadcasting a site image according to an embodiment of the present invention
  • FIG. 3 is a flowchart of a method for broadcasting a panoramic image of a site according to a preferred embodiment of the present invention
  • FIG. 4 is a flowchart of a method for broadcasting a seat image according to a preferred embodiment of the present invention
  • FIG. 5 is a structural block diagram of a multipoint control unit according to an embodiment of the present invention
  • FIG. 6 is a structural block diagram of a multipoint control unit according to a preferred embodiment of the present invention.
  • FIG. 1 is a schematic structural diagram of a telepresence conference system.
  • the system includes: a conference site 111, a conference site 112, a conference site 113, and a multipoint control unit 13.
  • site 111 is the broadcast source site, and the other sites view the site image of site 111.
  • each site has multiple screens (three are shown in the figure) and multiple audio channels.
  • FIG. 2 is a flowchart of a method for broadcasting a site image according to an embodiment of the present invention. The method is applied to a telepresence conference television system.
  • as shown in FIG. 2, the site image broadcast method includes the following processing: Step S202: during the conference, determine the site with the highest audio stream signal strength, that is, the loudest site. Step S204: broadcast that site's image to at least one of the multiple screens at each of the other sites in the telepresence conference television system.
  • in a traditional video conference system, each site has a single screen and a single audio channel.
  • in a telepresence conference system, each site has multiple screens and multiple audio channels, and audio input and output correspond in position to the screens. The traditional voice-activated method therefore cannot switch the image displayed on each site's screens to follow the sound.
  • with the above method, the image displayed on each site's screens can be switched to follow the sound, which effectively improves the user experience.
  • the above step S202 may further include the following processing:
  • the site to which the obtained seat belongs is determined as the site with the highest audio stream signal strength.
  • the loudest site among sites having multiple screens and multiple audio channels can be determined effectively, and its site image can then be broadcast, so that the images displayed on site screens switch to follow the sound.
  • the loudness of the loudest seat at each site can be compared to find the loudest seat overall, and the site of that seat is then determined to be the loudest site.
  • the site image includes but is not limited to: a site panoramic image and a site seat image.
  • step S204 may further include the following processing:
  • each seat image of the site is broadcast to the seat screen corresponding to that seat at each other site.
  • the seat images of site A are broadcast to the screens of the same seats at site B, so that the seat images of site A and site B correspond one to one.
  • as shown in FIG. 1, the site panoramic image (that is, three seat images) of the broadcast source site 111 is broadcast to conference site 112 and conference site 113.
  • each seat image of site 111 is broadcast to the seat screen corresponding to that seat at conference site 112 and conference site 113.
  • FIG. 3 is a flowchart of a method for broadcasting a panoramic image of a site according to a preferred embodiment of the present invention.
  • the method for broadcasting a site panoramic image includes: Step S302: during the conference, determine the site with the greatest loudness. Each site has multiple seats, and the loudest site is determined from the loudest seat.
  • step S304 may further include the following processing: each seat image of the loudest site is broadcast to the seat screen corresponding to that seat at each other site.
  • when the site image is a site seat image, a different broadcast strategy may be used to perform step S304.
  • step S304 may include the following processing:
  • the site seat image is broadcast to one screen at each of the other sites.
  • the site seat image may be broadcast to any screen at each other site, or to a predetermined screen at each other site, that is, the same seat image is kept on the same screen at a given site as far as possible, which improves the user experience more effectively.
  • step (2) may further include the following processing: A1, finding the screen at each other site on which the site seat image has been displayed most frequently;
  • B1, judging whether, on that screen, the time since the seat corresponding to the site seat image last spoke exceeds the predetermined duration
  • step (2) may further include the following processing:
  • the seat image broadcast method according to a preferred embodiment of the present invention includes: Step S402: during the conference, determine the site with the greatest loudness. Each site has multiple seats, and the loudest site is determined from the loudest seat.
  • Step S404: if the seat image is the current broadcast source, it does not need to be broadcast again, and step S414 is performed. Step S406: if the seat image is not the current broadcast source, look for the screen on which the seat image has appeared most frequently. Step S408: if such a screen is found and the time since the seat last spoke on that screen is greater than a predetermined time (for example, 1 minute), go to step S414; if that time is less than or equal to 1 minute, go to step S410. Step S410: if no such screen is found, find the most recently inactive screen and record the number of times the seat has appeared on it. Step S412: broadcast the seat to the inactive screen. Step S414: update the seat's most recent speaking time on the screen.
  • FIG. 5 is a structural block diagram of a multipoint control unit according to an embodiment of the present invention.
  • the multipoint control unit is applied to the telepresence conference television system.
  • the multipoint control unit includes: a determination module 50 and a broadcast module 52.
  • the determining module 50 is configured to determine a site with the highest intensity of the audio stream signal during the conference;
  • the broadcast module 52 is configured to broadcast that site's image to at least one of the multiple screens at each of the other sites in the telepresence conference television system.
  • with the multipoint control unit provided by the present invention, the image displayed on each site's screens in a telepresence conference system can be switched to follow the sound, which effectively improves the user experience.
  • the determining module 50 may further include: a first determining sub-module 500, configured to determine the seat with the highest audio stream signal strength at each site of the telepresence conference television system; a comparison sub-module 502, configured to compare the audio stream signal strengths of those seats to obtain the seat with the highest strength among them; and a second determining sub-module 504, configured to determine the site to which the seat obtained by the comparison sub-module belongs as the site with the highest audio stream signal strength.
  • the loudest site among sites having multiple screens and multiple audio channels can be determined effectively, and its site image can then be broadcast, so that the images displayed on site screens switch to follow the sound.
  • the broadcast module 52 is configured to broadcast each seat image of the site to the seat screen corresponding to that seat at each of the other sites.
  • the panoramic image of the loudest site can be broadcast to the other sites, so that the images displayed on site screens switch in real time to follow the sound.
  • for the preferred working mode of the broadcast module 52, refer to FIG. 3.
  • the broadcast module 52 may further include: a judging sub-module 520, configured to judge whether the seat corresponding to the site seat image is the broadcast source;
  • the broadcast sub-module 522 is configured to broadcast the site seat image to one screen at each of the other sites when the judging sub-module outputs no.
  • the broadcast sub-module 522 is further configured to find the screen at each other site on which the site seat image has been displayed most frequently; to judge whether, on that screen, the time since the corresponding seat last spoke exceeds the predetermined duration; and if so, to update the speaking time of that seat on that screen.
  • the broadcast sub-module 522 is further configured to: for each other site, when no screen with the highest display frequency for the site seat image is found, find a screen whose display has not changed and record the number of times the site seat image has been displayed on it; broadcast the site seat image to that screen; and update the speaking time of the corresponding seat on that screen.
  • the same seat image can be kept on the same screen at a given site as far as possible, which improves the user experience more effectively.
  • for the preferred way in which the judging sub-module 520 and the broadcast sub-module 522 work together, refer to FIG. 4; the details are not repeated here.
  • not only can the image displayed on each site's screens be switched to follow the sound, but sound localization is also ensured, that is, the sound is output at the position where the corresponding image is displayed,
  • which effectively improves the user experience.
  • the above modules or steps of the present invention can be implemented by a general-purpose computing device. They can be concentrated on a single computing device or distributed over a network of multiple computing devices. Alternatively, they may be implemented as program code executable by a computing device, so that they can be stored in a storage device and executed by the computing device, and in some cases the steps may be performed in an order different from that described here.
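
The sub-steps of step S202 listed above (find the loudest seat at each site, compare those seats, then take the site that owns the winner) can be sketched in Python. This is only an illustration under stated assumptions: the function name `loudest_site`, the nested-dictionary input, and the use of one numeric level per seat are hypothetical and do not come from the patent text.

```python
def loudest_site(levels):
    """Return (site_id, seat_id) of the loudest seat across all sites.

    `levels` maps site_id -> {seat_id: audio_level}. This layout is an
    assumption made for the example; the source does not specify one.
    """
    # Sub-step 1: find the loudest seat within each site.
    per_site = {
        site: max(seats, key=seats.get)
        for site, seats in levels.items()
        if seats  # skip sites that report no seats
    }
    if not per_site:
        raise ValueError("no audio levels supplied")
    # Sub-step 2: compare the per-site winners with each other.
    site = max(per_site, key=lambda s: levels[s][per_site[s]])
    # Sub-step 3: the site that owns the winning seat is the loudest site.
    return site, per_site[site]


# Three sites with three seats each, mirroring the layout of FIG. 1.
levels = {
    "111": {"left": 0.20, "center": 0.90, "right": 0.10},
    "112": {"left": 0.40, "center": 0.30, "right": 0.50},
    "113": {"left": 0.05, "center": 0.10, "right": 0.00},
}
print(loudest_site(levels))
```

Returning the seat along with the site keeps the result usable for both broadcast cases in the text: the panoramic case needs only the site, while the seat-image case needs the loudest seat as well.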

Landscapes

  • Engineering & Computer Science (AREA)
  • Multimedia (AREA)
  • Signal Processing (AREA)
  • Two-Way Televisions, Distribution Of Moving Picture Or The Like (AREA)
  • Telephonic Communication Services (AREA)

Description

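Conference site image broadcast method and multipoint control unit

Technical Field

The present invention relates to the field of conference television, and in particular to a conference site image broadcast method and a multipoint control unit.

Background

A video conference system mainly comprises a multipoint control unit (MCU) and terminal systems. The MCU is the key device of a multipoint video conference system. It extracts audio, video, data and other information, together with signaling, from the information stream arriving from each conference site, and sends each site's information and signaling to the multipoint control module and the media processing module respectively. These carry out audio mixing or switching, video mixing or switching, data broadcasting and routing, timing, and conference control. Finally, the information each site requires is reassembled and sent to the corresponding terminal equipment.

Terminal systems fall into two broad categories: desktop conference terminals and conference room terminals. Desktop terminals are inexpensive and easy to use, and suit personal office use and small meetings. Conference room terminals are equipped with external auxiliary equipment such as a high-quality zoom lens, high-fidelity audio, and a large-screen television or projector, together with video pre-processors and post-processors, so that the picture is sharper and the meeting experience better; they suit larger meetings. Conference room terminals are intended for conference rooms with anywhere from a few to several dozen participants. A broadcast terminal is a receive-only terminal: it can receive the images and sound of a conference but cannot send any. Broadcast terminals can be deployed where information only needs to flow one way, for example when a higher level communicates policy to lower levels. A mobile terminal is a desktop terminal fitted with a wireless access card and a wireless transmitter, and can join a conference while moving within a certain area. Conference terminals are deployed at the local conference site and at each branch site of a video conference.

In a traditional video conference system, each site has a single screen and a single audio channel. When someone speaks at a site and that site is the loudest, its image is automatically broadcast to the other sites. That is, the voice-activated system broadcasts the image of the loudest endpoint to the other sites without manual operation.

Telepresence is popular with high-end users because of its realistic sense of presence. As costs gradually fall and household demand grows, telepresence will gradually reach ordinary homes.

Applying telepresence technology to a conference television system yields a telepresence conference system. In such a system, each site has multiple screens and multiple audio channels, and audio input and output correspond in position to the screens.

In a multipoint conference, broadcasting one site's image to the other sites requires a user to manually broadcast that site so that it becomes the broadcast source. The single-channel voice-activated broadcast scheme of video conference systems is not suitable for a telepresence conference system, because a telepresence conference system must ensure not only that the image follows the sound but also that listeners can locate the speaker by sound, that is, that the sound is output at the position where the corresponding image is displayed.

Summary of the Invention

In the related art, a telepresence conference system cannot switch the image displayed on each site's screens to follow the sound, which degrades the user experience. To address this, the present invention provides a voice-activated broadcast method and a telepresence conference system that solve at least one of the above problems.

According to one aspect of the present invention, a conference site image broadcast method is provided.

The conference site image broadcast method according to the present invention includes: determining, during a conference, the site with the highest audio stream signal strength; and broadcasting that site's image to at least one of the multiple screens at each of the other sites in the telepresence conference television system.

Determining the site with the highest audio stream signal strength includes: determining the seat with the highest audio stream signal strength at each site of the telepresence conference television system; comparing the audio stream signal strengths of those seats to obtain the seat with the highest strength among them; and determining the site to which the obtained seat belongs as the site with the highest audio stream signal strength.

The site image includes: a site panoramic image and a site seat image.

When the site image is a site panoramic image, broadcasting the site image to at least one screen of the other sites includes: broadcasting each seat image of the site to the seat screen corresponding to that seat at each other site.

When the site image is a site seat image and the corresponding seat is the loudest seat, broadcasting the site image to at least one screen of the other sites includes: judging whether the seat corresponding to the site seat image is the current broadcast source; and if not, broadcasting the site seat image to one screen at each of the other sites.

Broadcasting the site seat image to one screen at each of the other sites includes: finding the screen at each other site on which the site seat image has been displayed most frequently; judging whether, on that screen, the time since the corresponding seat last spoke exceeds a predetermined duration; and if so, updating the speaking time of that seat on that screen.

Broadcasting the site seat image to one screen at each of the other sites includes: for each other site, when no screen with the highest display frequency for the site seat image is found, finding a screen whose display has not changed and recording the number of times the site seat image has been displayed on it; broadcasting the site seat image to that screen; and updating the speaking time of the corresponding seat on that screen.

According to another aspect of the present invention, a multipoint control unit is provided.

The multipoint control unit according to the present invention includes: a determining module, configured to determine, during a conference, the site with the highest audio stream signal strength; and a broadcast module, configured to broadcast that site's image to at least one of the multiple screens at each of the other sites in the telepresence conference television system.

The determining module includes: a first determining sub-module, configured to determine the seat with the highest audio stream signal strength at each site of the telepresence conference television system; a comparison sub-module, configured to compare the audio stream signal strengths of those seats to obtain the seat with the highest strength among them; and a second determining sub-module, configured to determine the site to which the seat obtained by the comparison sub-module belongs as the site with the highest audio stream signal strength.

The broadcast module is configured to broadcast each seat image of the site to the seat screen corresponding to that seat at each other site.

The broadcast module includes: a judging sub-module, configured to judge whether the seat corresponding to the site seat image is the broadcast source; and a broadcast sub-module, configured to broadcast the site seat image to one screen at each of the other sites when the judging sub-module outputs no.

With the present invention, the site with the highest audio stream signal strength is determined during the conference, and that site's image is broadcast to at least one of the multiple screens at each of the other sites in the telepresence conference television system. This solves the problem in the related art that a telepresence conference system cannot switch the image displayed on each site's screens to follow the sound, which degrades the user experience. The image displayed on each site's screens can now follow the sound, which effectively improves the user experience.

Brief Description of the Drawings

To explain the technical solutions of the embodiments of the present invention more clearly, the drawings used in the description of the embodiments are briefly introduced below.

FIG. 1 is a schematic structural diagram of a telepresence conference system according to an embodiment of the present invention;
FIG. 2 is a flowchart of a site image broadcast method according to an embodiment of the present invention;
FIG. 3 is a flowchart of a site panoramic image broadcast method according to a preferred embodiment of the present invention;
FIG. 4 is a flowchart of a seat image broadcast method according to a preferred embodiment of the present invention;
FIG. 5 is a structural block diagram of a multipoint control unit according to an embodiment of the present invention; and
FIG. 6 is a structural block diagram of a multipoint control unit according to a preferred embodiment of the present invention.

Detailed Description

The present invention is described in detail below with reference to the drawings and in conjunction with the embodiments. It should be noted that the embodiments in this application, and the features in the embodiments, may be combined with one another where they do not conflict.

The technical solutions in the embodiments of the present invention are described clearly and completely below with reference to the drawings.

FIG. 1 is a schematic structural diagram of a telepresence conference system. As shown in FIG. 1, the system includes: site 111, site 112, site 113, and a multipoint control unit 13. Site 111 is the broadcast source site, and the other sites all view the site image of site 111. Each site has multiple screens (three are shown in the figure) and multiple audio channels.

FIG. 2 is a flowchart of a site image broadcast method according to an embodiment of the present invention. The method is applied to a telepresence conference television system. As shown in FIG. 2, the method includes the following processing:

Step S202: During the conference, determine the site with the highest audio stream signal strength. The site with the highest audio stream signal strength is the loudest site.

Step S204: Broadcast that site's image to at least one of the multiple screens at each of the other sites in the telepresence conference television system.

In a traditional video conference system, each site has a single screen and a single audio channel. In a telepresence conference system, each site has multiple screens and multiple audio channels, and audio input and output correspond in position to the screens. The traditional voice-activated method therefore cannot switch the image displayed on each site's screens to follow the sound. With the above method provided by the present invention, the image displayed on each site's screens in a telepresence conference system can be switched to follow the sound, which effectively improves the user experience.

Preferably, step S202 may further include the following processing: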
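(1) Determine the seat with the highest audio stream signal strength at each site of the telepresence conference television system;
(2) Compare the audio stream signal strengths of those seats to obtain the seat with the highest strength among them;
(3) Determine the site to which the obtained seat belongs as the site with the highest audio stream signal strength.

Through the above processing, the loudest site among sites having multiple screens and multiple audio channels can be determined effectively, and its site image can then be broadcast, so that the images displayed on site screens switch to follow the sound.

In a preferred implementation, because each site has multiple seats, the loudest site can be determined by comparing the loudness of the loudest seat at each site to find the loudest seat overall, and then taking the site of that seat as the loudest site.

Preferably, the site image includes but is not limited to: a site panoramic image and a site seat image.

In a preferred implementation, when the site image is a site panoramic image, step S204 may further include the following processing: broadcasting each seat image of the site to the seat screen corresponding to that seat at each other site.

For example, the seat images of site A are broadcast to the screens of the same seats at site B, so that the seat images of site A and site B correspond one to one. As shown in FIG. 1, the site panoramic image (that is, three seat images) of the broadcast source site 111 is broadcast to site 112 and site 113. Each seat image of site 111 is broadcast to the seat screen corresponding to that seat at site 112 and site 113.

The above preferred implementation is described below with reference to FIG. 3.

FIG. 3 is a flowchart of a site panoramic image broadcast method according to a preferred embodiment of the present invention. As shown in FIG. 3, the method includes:

Step S302: During the conference, determine the site with the greatest loudness. Each site has multiple seats, and the loudest site is determined from the loudest seat.

Step S304: Broadcast the panoramic image of the loudest site to each of the other sites.

Preferably, step S304 may further include the following processing: broadcasting each seat image of the loudest site to the seat screen corresponding to that seat at each other site.

In a preferred implementation, when the site image is a site seat image, a different broadcast strategy may be used to perform step S304.

For example, when the site image is a site seat image and the corresponding seat is the loudest seat, step S304 may include the following processing: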
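(1) Judge whether the seat corresponding to the site seat image is the current broadcast source;
(2) If not, broadcast the site seat image to one screen at each of the other sites.

In a specific implementation, the site seat image may be broadcast to any screen at each other site, or to a predetermined screen at each other site, that is, the same seat image is kept on the same screen at a given site as far as possible, which improves the user experience more effectively. The latter case is described below.

Preferably, step (2) may further include the following processing:

A1. Find the screen at each other site on which the site seat image has been displayed most frequently;
B1. Judge whether, on that screen, the time since the seat corresponding to the site seat image last spoke exceeds a predetermined duration;
C1. If so, update the speaking time of that seat on that screen.

Preferably, step (2) may also further include the following processing: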
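A2. For each other site, when no screen with the highest display frequency for the site seat image is found, find a screen whose display has not changed, and record the number of times the site seat image has been displayed on that screen;
B2. Broadcast the site seat image to the screen whose display has not changed;
C2. Update the speaking time of the seat corresponding to the site seat image on that screen.

The above preferred implementation is described below with reference to FIG. 4.

FIG. 4 is a flowchart of a seat image broadcast method according to a preferred embodiment of the present invention. As shown in FIG. 4, the method includes:

Step S402: During the conference, determine the site with the greatest loudness. Each site has multiple seats, and the loudest site is determined from the loudest seat.

Step S404: If the seat image is the current broadcast source, it does not need to be broadcast again, and step S414 is performed.

Step S406: If the seat image is not the current broadcast source, look for the screen on which the seat image has appeared most frequently.

Step S408: If such a screen is found and the time since the seat last spoke on that screen is greater than a predetermined time (for example, 1 minute), go to step S414; if that time is less than or equal to 1 minute, go to step S410.

Step S410: If no screen on which the seat has appeared most frequently is found, find the most recently inactive screen and record the number of times the seat has appeared on that screen.

Step S412: Broadcast the seat to the inactive screen.

Step S414: Update the seat's most recent speaking time on the screen.

Optionally, in a specific implementation, the above method may also be used to broadcast the images of one or more other seats of the site (other than the loudest seat) to the other sites at the same time; for example, a partial panoramic image of the site may be broadcast to the other sites.

FIG. 5 is a structural block diagram of a multipoint control unit according to an embodiment of the present invention. The multipoint control unit is applied to a telepresence conference television system. As shown in FIG. 5, it includes a determining module 50 and a broadcast module 52.

The determining module 50 is configured to determine, during a conference, the site with the highest audio stream signal strength.

The broadcast module 52 is configured to broadcast that site's image to at least one of the multiple screens at each of the other sites in the telepresence conference television system.

With the multipoint control unit provided by the present invention, the image displayed on each site's screens in a telepresence conference system can be switched to follow the sound, which effectively improves the user experience.

Preferably, as shown in FIG. 6, the determining module 50 may further include: a first determining sub-module 500, configured to determine the seat with the highest audio stream signal strength at each site of the telepresence conference television system; a comparison sub-module 502, configured to compare the audio stream signal strengths of those seats to obtain the seat with the highest strength among them; and a second determining sub-module 504, configured to determine the site to which the seat obtained by the comparison sub-module belongs as the site with the highest audio stream signal strength.

Through the processing of the determining module 50, the loudest site among sites having multiple screens and multiple audio channels can be determined effectively, and its site image can then be broadcast, so that the images displayed on site screens switch to follow the sound.

Preferably, when the site image is a site panoramic image, the broadcast module 52 is configured to broadcast each seat image of the site to the seat screen corresponding to that seat at each other site. For the preferred working mode of the broadcast module 52, refer to FIG. 3; the details are not repeated here.

Through the above processing, the panoramic image of the loudest site can be broadcast to the other sites, so that the images displayed on site screens switch in real time to follow the sound.

Preferably, as shown in FIG. 6, when the site image is a site seat image, the broadcast module 52 may further include: a judging sub-module 520, configured to judge whether the seat corresponding to the site seat image is the broadcast source; and a broadcast sub-module 522, configured to broadcast the site seat image to one screen at each of the other sites when the judging sub-module outputs no.

Through the above processing, the seat image of the loudest site can be broadcast to the other sites, so that the images displayed on site screens switch in real time to follow the sound.

In a preferred implementation, the broadcast sub-module 522 is further configured to find the screen at each other site on which the site seat image has been displayed most frequently; to judge whether, on that screen, the time since the corresponding seat last spoke exceeds the predetermined duration; and if so, to update the speaking time of that seat on that screen.

In a preferred implementation, the broadcast sub-module 522 is further configured to: for each other site, when no screen with the highest display frequency for the site seat image is found, find a screen whose display has not changed and record the number of times the site seat image has been displayed on it; broadcast the site seat image to that screen; and update the speaking time of the corresponding seat on that screen.

Through the above processing, the same seat image can be kept on the same screen at a given site as far as possible, which improves the user experience more effectively.

For the preferred way in which the judging sub-module 520 and the broadcast sub-module 522 work together, refer to FIG. 4; the details are not repeated here.

In summary, with the above embodiments provided by the present invention, a telepresence conference system can ensure not only that the image displayed on each site's screens switches to follow the sound, but also that listeners can locate the speaker by sound, that is, that the sound is output at the position where the corresponding image is displayed, which effectively improves the user experience.

Those skilled in the art will understand that the modules or steps of the present invention described above can be implemented by a general-purpose computing device. They can be concentrated on a single computing device or distributed over a network of multiple computing devices. Optionally, they can be implemented as program code executable by a computing device, so that they can be stored in a storage device and executed by the computing device; in some cases the steps shown or described can be performed in an order different from that given here. They can also be fabricated as separate integrated circuit modules, or several of the modules or steps can be fabricated as a single integrated circuit module. The present invention is thus not limited to any specific combination of hardware and software.

The above describes only preferred embodiments of the present invention and is not intended to limit it. Those skilled in the art may make various modifications and changes to the present invention. Any modification, equivalent replacement, or improvement made within the spirit and principles of the present invention shall fall within its scope of protection.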
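
The seat-image flow of steps S404 to S414 can be sketched for one receiving site as follows. This is a hedged reading of the flowchart, not the patent's implementation. The `Screen` record, the function `place_seat`, and the parameter `hold` are hypothetical names introduced for the example. Two interpretations are assumptions: the "most recently inactive screen" of step S410 is taken to be the screen whose image changed longest ago, and the branches that jump straight to S414 are treated as updating the speaking time only, because the source does not say whether the image is switched in those branches.

```python
from dataclasses import dataclass, field
from typing import Optional


@dataclass
class Screen:
    shown: dict = field(default_factory=dict)       # seat -> times displayed on this screen
    last_spoke: dict = field(default_factory=dict)  # seat -> time of its last speech on this screen
    last_changed: float = 0.0                       # time the displayed image last changed
    current: Optional[str] = None                   # seat currently displayed, if any


def place_seat(screens, seat, now, hold=60.0):
    """Return (screen_index, broadcast) for one receiving site."""
    # S404: the seat is already the broadcast source on some screen,
    # so only its speaking time is refreshed (S414).
    for i, scr in enumerate(screens):
        if scr.current == seat:
            scr.last_spoke[seat] = now
            return i, False
    # S406: look for the screen on which this seat has appeared most often.
    seen = [i for i, scr in enumerate(screens) if scr.shown.get(seat, 0) > 0]
    if seen:
        best = max(seen, key=lambda i: screens[i].shown[seat])
        # S408: the gap since its last speech there exceeds `hold`,
        # so the flow goes straight to S414.
        if now - screens[best].last_spoke.get(seat, 0.0) > hold:
            screens[best].last_spoke[seat] = now
            return best, False
    # S410: otherwise pick the screen whose image changed longest ago
    # (assumed meaning of "inactive") and count the appearance.
    idle = min(range(len(screens)), key=lambda i: screens[i].last_changed)
    scr = screens[idle]
    scr.shown[seat] = scr.shown.get(seat, 0) + 1
    # S412: broadcast the seat image to that screen.
    scr.current, scr.last_changed = seat, now
    # S414: update the seat's most recent speaking time on that screen.
    scr.last_spoke[seat] = now
    return idle, True


screens = [Screen(last_changed=5.0), Screen(last_changed=1.0), Screen(last_changed=3.0)]
first = place_seat(screens, "111/center", now=100.0)
second = place_seat(screens, "111/center", now=110.0)
print(first, second)
```

The first call finds no history for the seat and sends it to the screen that has been unchanged longest; the second call finds the seat already displayed and only refreshes its speaking time. Keeping per-screen counts is what lets a returning speaker land on the same screen again, which is the stated goal of the predetermined-screen strategy.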

Claims

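1. A conference site image broadcast method, applied to a telepresence conference television system, comprising:
determining, during a conference, the site with the highest audio stream signal strength;
broadcasting the site image of said site to at least one of the multiple screens of the sites other than said site in the telepresence conference television system.
2. The method according to claim 1, wherein determining the site with the highest audio stream signal strength comprises:
determining the seat with the highest audio stream signal strength at each site of the telepresence conference television system;
comparing the audio stream signal strengths of said seats to obtain the seat with the highest audio stream signal strength among said seats;
determining the site to which the obtained seat belongs as the site with the highest audio stream signal strength.
3. The method according to claim 1, wherein the site image comprises:
a site panoramic image and a site seat image.
4. The method according to claim 3, wherein the site image is the site panoramic image, and broadcasting the site image of said site to the at least one screen of the other sites comprises:
broadcasting each seat image of said site to the seat screen corresponding to that seat at each of the other sites.
5. The method according to claim 3, wherein the site image is the site seat image, the seat corresponding to the site seat image is the loudest seat, and broadcasting the site image of said site to the at least one screen of the other sites comprises:
judging whether the seat corresponding to the site seat image is the current broadcast source;
if not, broadcasting the site seat image to one screen at each of the other sites.
6. The method according to claim 5, wherein broadcasting the site seat image to one screen at each of the other sites comprises:
finding the screen at each of the other sites on which the site seat image has been displayed most frequently;
judging whether, on the screen with the highest display frequency, the time since the seat corresponding to the site seat image last spoke exceeds a predetermined duration;
if so, updating the speaking time of the seat corresponding to the site seat image on the screen with the highest display frequency.
7. The method according to claim 5, wherein broadcasting the site seat image to one screen at each of the other sites comprises:
for each of the other sites, when no screen with the highest display frequency for the site seat image is found, finding a screen whose display has not changed, and recording the number of times the site seat image has been displayed on that screen;
broadcasting the site seat image to the screen whose display has not changed;
updating the speaking time of the seat corresponding to the site seat image on the screen whose display has not changed.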
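8. A multipoint control unit, applied to a telepresence conference television system, comprising:
a determining module, configured to determine, during a conference, the site with the highest audio stream signal strength;
a broadcast module, configured to broadcast the site image of said site to at least one of the multiple screens of the sites other than said site in the telepresence conference television system.
9. The multipoint control unit according to claim 8, wherein the determining module comprises:
a first determining sub-module, configured to determine the seat with the highest audio stream signal strength at each site of the telepresence conference television system;
a comparison sub-module, configured to compare the audio stream signal strengths of said seats to obtain the seat with the highest audio stream signal strength among said seats;
a second determining sub-module, configured to determine the site to which the seat obtained by the comparison sub-module belongs as the site with the highest audio stream signal strength.
10. The multipoint control unit according to claim 8, wherein
the broadcast module is configured to broadcast each seat image of said site to the seat screen corresponding to that seat at each of the other sites.
11. The multipoint control unit according to claim 8, wherein the broadcast module comprises:
a judging sub-module, configured to judge whether the seat corresponding to the site seat image is the broadcast source;
a broadcast sub-module, configured to broadcast the site seat image to one screen at each of the other sites when the judging sub-module outputs no.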
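
The seat-to-screen correspondence recited for the panoramic case (claims 4 and 10) can be illustrated with a small routing sketch. It assumes that seats and screens at every site share the same position labels, which is one possible reading of "corresponding seat screen"; the function name `panoramic_routes` and the label strings are hypothetical.

```python
def panoramic_routes(source, seats_by_site):
    """List (source_seat, target_site, target_screen) triples.

    `seats_by_site` maps site_id -> list of seat position labels.
    Matching screens to seats by identical label is an assumption.
    """
    routes = []
    for site, seats in seats_by_site.items():
        if site == source:
            continue  # the source site does not receive its own image
        for seat in seats_by_site[source]:
            if seat in seats:  # screen at the same seat position
                routes.append((seat, site, seat))
    return routes


# Site 111 is the broadcast source, as in FIG. 1.
sites = {
    "111": ["left", "center", "right"],
    "112": ["left", "center", "right"],
    "113": ["left", "center", "right"],
}
routes = panoramic_routes("111", sites)
print(len(routes))
```

With three seats at the source and two receiving sites, the sketch yields one route per seat per receiving site, so each receiving site shows the source's seats in the same left-to-right arrangement.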

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN201010204757.4 2010-06-11
CN 201010204757 CN102281424B (zh) 2010-06-11 2010-06-11 Conference site image broadcast method and multipoint control unit

Publications (1)

Publication Number Publication Date
WO2011153926A1 (zh) 2011-12-15

Family

ID=45097535

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2011/075302 WO2011153926A1 (zh) 2010-06-11 2011-06-03 Conference site image broadcast method and multipoint control unit

Country Status (2)

Country Link
CN (1) CN102281424B (zh)
WO (1) WO2011153926A1 (zh)

Families Citing this family (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
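CN103905780A (zh) * 2014-03-18 2014-07-02 华为技术有限公司 Data processing method, device and video conference system
CN105915837B (zh) * 2016-05-30 2019-10-25 华为技术有限公司 Video switching method, apparatus and system
CN113596349A (zh) * 2021-07-26 2021-11-02 世邦通信股份有限公司 Conference method, system, apparatus and storage medium for automatically linking video to the speaking position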

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
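CN101080000A (zh) * 2007-07-17 2007-11-28 华为技术有限公司 Method, system, server and terminal for displaying the speaker in a video conference
CN101335867A (zh) * 2007-09-27 2008-12-31 深圳市迪威新软件技术有限公司 Voice-activated control method for a conference television system
CN101395912A (zh) * 2006-03-02 2009-03-25 思科技术公司 System and method for displaying participants in a video conference between locations
CN101442654A (zh) * 2008-12-26 2009-05-27 深圳华为通信技术有限公司 Method, apparatus and system for switching video objects in video communication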

Family Cites Families (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN100418340C (zh) * 2004-12-09 2008-09-10 西安大唐电信有限公司 Method for selecting and synthesizing voice in a conference call

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101395912A (zh) * 2006-03-02 2009-03-25 思科技术公司 用于显示位置之间的视频会议中的参与者的系统和方法
CN101080000A (zh) * 2007-07-17 2007-11-28 华为技术有限公司 视频会议中显示发言人的方法、系统、服务器和终端
CN101335867A (zh) * 2007-09-27 2008-12-31 深圳市迪威新软件技术有限公司 一种会议电视系统的语音激励控制方法
CN101442654A (zh) * 2008-12-26 2009-05-27 深圳华为通信技术有限公司 视频通信中视频对象切换的方法、装置及系统

Also Published As

Publication number Publication date
CN102281424B (zh) 2013-08-07
CN102281424A (zh) 2011-12-14

Similar Documents

Publication Publication Date Title
CA2874715C (en) Dynamic video and sound adjustment in a video conference
RU2533304C2 (ru) Способ управления конференц-связью и относящиеся к нему устройство и система
US8379076B2 (en) System and method for displaying a multipoint videoconference
CN101401109B (zh) 显示在多个位置之间的可视会议中的用户的系统和方法
JP6172610B2 (ja) テレビ会議用システム
JP6179834B1 (ja) テレビ会議装置
WO2011140812A1 (zh) 多画面合成方法、系统及媒体处理装置
US8773491B2 (en) Method, apparatus, and system for implementing audio mixing
WO2009009966A1 (fr) Procédé, dispositif et système pour afficher un locuteur dans une vidéoconférence
WO2011026382A1 (zh) 视频会议虚拟会场的呈现方法、设备及系统
US8836753B2 (en) Method, apparatus, and system for processing cascade conference sites in cascade conference
EP3070876A1 (en) Method and system for improving teleconference services
WO2011085594A1 (zh) 视频画面切换的方法和装置
WO2015003532A1 (zh) 多媒体会议的建立方法、装置及系统
WO2012034329A1 (zh) 视频通话中视频录制的方法及装置
EP3813361A1 (en) Video conference server capable of providing video conference by using plurality of video conference terminals, and camera tracking method therefor
WO2011153926A1 (zh) 会场图像广播方法及多点控制单元
US20210218932A1 (en) Video conference server capable of providing video conference by using plurality of terminals for video conference, and method for removing audio echo therefor
JPH07105106A (ja) 多地点電子会議装置
WO2016206471A1 (zh) 多媒体业务处理方法、系统及装置
WO2020038494A1 (zh) 一种智能音箱及智能音箱使用的方法
JP6668828B2 (ja) 会議システム
US8717407B2 (en) Telepresence between a multi-unit location and a plurality of single unit locations
WO2014026478A1 (zh) 一种视频会议信号处理的方法、视频会议服务器及系统
JP6500366B2 (ja) 管理装置、端末装置、伝送システム、伝送方法およびプログラム

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 11791917

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 11791917

Country of ref document: EP

Kind code of ref document: A1