WO2011140812A1 - Multi-picture synthesis method and system, and media processing device - Google Patents

Multi-picture synthesis method and system, and media processing device Download PDF

Info

Publication number
WO2011140812A1
WO2011140812A1 PCT/CN2010/080320 CN2010080320W WO2011140812A1 WO 2011140812 A1 WO2011140812 A1 WO 2011140812A1 CN 2010080320 W CN2010080320 W CN 2010080320W WO 2011140812 A1 WO2011140812 A1 WO 2011140812A1
Authority
WO
WIPO (PCT)
Prior art keywords
media
picture
size
layout
code stream
Prior art date
Application number
PCT/CN2010/080320
Other languages
French (fr)
Chinese (zh)
Inventor
孙波
吴衍平
田智平
黄书平
Original Assignee
中兴通讯股份有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 中兴通讯股份有限公司 filed Critical 中兴通讯股份有限公司
Publication of WO2011140812A1 publication Critical patent/WO2011140812A1/en

Links

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N7/00Television systems
    • H04N7/14Systems for two-way working
    • H04N7/15Conference systems

Definitions

  • the present invention relates to the field of communications, and in particular to a multi-picture synthesis method, system, and media processing apparatus.
  • BACKGROUND OF THE INVENTION The emergence and development of conference television technology has gradually changed the way human beings socialize, and video communication has become an indispensable part of human social and economic life.
  • Traditional communication tools such as telephones and fax machines, cannot achieve the communication effect of face-to-face or group of people.
  • business trips have become an annoying and prohibitive business.
  • the use of conference television has not only achieved the purpose of convening a conference, but also avoided traveling to the field.
  • the video conference system mainly includes a multipoint control unit (MCU) and a terminal system.
  • MCU multipoint control unit
  • the MCU is a key device of the multipoint video conference system, and extracts audio from the information flow from each conference site. Video, data and other information and signaling, and then send the information and signaling of each conference site to the multi-point control module and the media processing module respectively, to complete the corresponding audio mixing or switching, frequency mixing or switching, data broadcasting and routing. Processes such as selection, timing, and conference control, and finally reassembling the various information required for each conference site and sending it to each terminal system device.
  • the terminal system is divided into two types: desktop conference terminal and conference room conference terminal.
  • the desktop conference terminal is low in cost and easy to use, and is suitable for personal office use and small-scale conferences.
  • the conference room type terminal is equipped with high-quality zoom lens, high-fidelity audio, large-screen color TV or projection and other external auxiliary equipment, plus video pre/post processor, which makes the picture quality clearer and achieve better conference results.
  • a broadcast terminal is a one-way receiving terminal that can receive images and sounds of a conference, but cannot transmit images and sounds.
  • the broadcast terminal can be used in situations where only one-way information needs to be transmitted, such as a policy in which the superior level conveys a policy to the lower level.
  • the mobile terminal is based on the desktop terminal, and is equipped with a wireless access card and a wireless transmitting device to move into the conference within a certain area.
  • the conference terminal is configured in the local conference site and each venue and conference site in the video conference.
  • the traditional conference TV through the close cooperation of the MCU and the terminal system, the simultaneous output of multiple pictures is realized in a split manner.
  • Symmetrical split from 4, 9, 16 to 1 + 5, 1 + 7, 3 + 4, 2 + 8, 1 + 12 and other asymmetric exhibitions, from 1 + 1, 1 + 2 and so on to 6, 2 + 4, etc. 16: 9 widescreen display, a variety of different split screen combinations, to meet the user's arbitrary Display requirements.
  • the multi-screen layout of a conventional conference television display will be described below with reference to FIG.
  • the layout is 9 screens, and each screen has the same size, which is 1/9 of the entire screen size.
  • the main object of the present invention is to provide a multi-picture synthesis method, system, and media processing apparatus to solve At least one of the above issues.
  • a multi-picture synthesis method is provided.
  • the method is applied to a video conference system, the method comprising: acquiring a multi-channel media code stream, wherein each media code stream is used to display one picture of the scene; and the multi-channel media stream is arranged according to the layout.
  • the composition is a set of multiple pictures, wherein the size of at least one of the pictures is equal to the true size of the displayed scene.
  • a multi-picture composition system is provided.
  • the multi-picture composition system includes: an access module, configured to acquire a multi-channel media code stream, wherein each media code stream is used to display one picture of the scene; and a media processing module is configured to multi-channel according to the layout
  • the media stream is synthesized into a set of multiple pictures, wherein the size of at least one of the frames is equal to the true size of the displayed scene.
  • a media processing apparatus is provided.
  • the media processing device includes: a receiving module, configured to receive a multi-channel media code stream, wherein each media code stream is used to display one picture of the scene; and a processing module, configured to receive the received multi-channel media code according to the layout Streaming into a set of multiple pictures, where at least one of the frames is in the layout
  • the size is equal to the true size of the displayed scene.
  • the multiplexed media streams are combined into a plurality of frames according to a predetermined layout, wherein the size of at least one of the layouts is equal to the real size of the displayed scene, and the related art cannot be solved as much as possible.
  • FIG. 1 is a multi-screen layout of a conventional conference television display
  • FIG. 2 is a schematic structural diagram of a video conference system
  • FIG. 3 is a flowchart of a multi-screen synthesis method according to an embodiment of the present invention
  • FIG. 5 is a schematic diagram of a multi-screen layout according to a preferred embodiment 2 of the present invention
  • FIG. 6 is a multi-group multi-screen according to a preferred embodiment 3 of the present invention
  • Figure 7 is a flow chart of a multi-picture synthesis method according to a preferred embodiment of the present invention
  • Figure 8 is a block diagram showing the structure of a multi-picture synthesis system according to an embodiment of the present invention
  • FIG. 5 is a schematic diagram of a multi-screen layout according to a preferred embodiment 2 of the present invention
  • FIG. 6 is a multi-group multi-screen according to a preferred embodiment 3 of the present invention
  • Figure 7 is a flow chart
  • FIG. 10 is a block diagram showing the structure of a media processing device in accordance with a preferred embodiment of the present invention.
  • BEST MODE FOR CARRYING OUT THE INVENTION the present invention will be described in detail with reference to the accompanying drawings. It should be noted that the embodiments in the present application and the features in the embodiments may be combined with each other without conflict.
  • 2 is a schematic structural diagram of a video conference system. As shown in FIG. 2, the video conference system includes: a single and multiple video stream conference site and a multipoint control unit (MCU for short) 13 . Among them, 111, 112, and 113 are multi-stream venues, and 121, 122, and 123 are single-stream venues.
  • MCU multipoint control unit
  • 111 venues are broadcast source venues, other venues are watching 111 venues; 111 are viewing venues 113 venues.
  • the single-stream sites 121, 122, and 123 can only select one stream of multiple streams in the 111 site, and there is a defect that the information is incomplete. Therefore, the media processing module needs to synthesize the multiple streams into one stream.
  • the 111 venue is a multi-stream venue, it can only see one multi-stream venue. It also needs a media processing module to combine other site code streams into one stream.
  • the user needs to see multiple site information as much as possible, and can also see the real size of the site scene displayed on the screen, it is necessary to provide a new multi-screen synthesis method, which is described below in conjunction with FIG.
  • the multi-screen synthesis method includes the following processes: Step S302: Acquire a multi-media media code stream, where each media code stream is used to display one picture of the scene; Step S304: multiplex media code stream according to the layout
  • the composition is a set of multiple pictures, wherein the size of at least one of the pictures is equal to the true size of the displayed scene.
  • one or more scenes of a real body size may be included, or one or more pictures having a size smaller than the real size of the displayed scene may be included, so that as many synthesized as possible can be achieved. Scenes, while maintaining the true size of the scene.
  • FIGS. 4 and 5. 4 is a schematic diagram of a multi-screen layout according to a preferred embodiment 1 of the present invention; as shown in FIG.
  • FIG. 5 is a schematic diagram of a multi-screen layout according to a preferred embodiment 2 of the present invention; as shown in FIG. 5, the layout differs from FIG.
  • FIG. 4 is a schematic diagram of a multi-group multi-screen layout according to a preferred embodiment 3 of the present invention; in this embodiment, FIG.
  • 611 is a schematic diagram of the code stream viewed from the left screen of 111, and the images are from the left left seat of 113 and the venues of 121, 122, and 123 respectively.
  • 612 is a schematic diagram of the code stream seen in the 111 exhibition, and the images are from the 113 seats and the 112 site images respectively.
  • 613 is a schematic diagram of the code stream viewed from the right screen of 111, and the image is from the 113 right-seat image. It can be seen that the image information of the large screen of 111 is from 113, its size is kept true, and 112, 121, 122, 123 venue images are superimposed.
  • the method may further include: processing, in the conference, determining to change the current layout according to the conference state; and synthesizing the multi-media media streams into a group of multiple images according to the changed layout.
  • the method may further include the following processing: determining a more according to the state of the conference in the conference?
  • the code stream source acquires a new multi-media stream from the new stream source; synthesizes the new multi-media code 3 ⁇ 43 ⁇ 4 into a set of multi-pictures according to the layout.
  • flexible changes to the current layout or code stream source can effectively improve the user's viewing experience.
  • FIG. 7 is a flow chart of a multi-picture synthesis method in accordance with a preferred embodiment of the present invention. As shown in FIG.
  • the multi-screen synthesis method includes the following processing: Step S702: During the conference holding process, according to the MCU image synthesis capability and the characteristics of the conference terminal, it is determined to synthesize several groups of multiple screens, and what layout is used for each group of multiple screens .
  • the MCU image synthesis capability refers to how many multi-pictures can be synthesized, and also includes transcoding capabilities between different protocols at different rates. When the ability is limited, first ensure the multi-screen synthesis of the chairman's venue.
  • the layout of multi-screen use can be selected according to the actual situation, the layout shown in Figure 1, Figure 4, Figure 5. Ordinary venues generally use the layout of Figure 1, and the telepresence venues generally use the layout of Figure 4 or Figure 5.
  • the layout and number of frames can be dynamically adjusted during the meeting.
  • Step S704 In the conference, according to the state of the conference, change which terminal stream is synthesized by the multi-screen at any time.
  • Step S706 Select different media code streams to be transmitted to the site according to the characteristics of each site.
  • the site can directly view the scenes of other sites through multiple streams, or multiple screens synthesized by other sites. Specifically, it can be determined according to actual conditions.
  • Fig. 8 is a block diagram showing the structure of a multi-picture synthesizing system according to an embodiment of the present invention. As shown in FIG. 8, the multi-screen synthesis system is applied to a video conference system, and includes: an access module 80 and a media processing module 82.
  • the access module 80 is configured to acquire a multi-media media stream, where each media code stream is used to display one picture of the scene, and the media processing module 82 is configured to synthesize the multiple media code streams into a set of multiple pictures according to the layout.
  • the size of at least one of the screens in the layout is equal to the true size of the displayed scene.
  • the access module 80 is further configured to output a set of multiple pictures processed by the media processing module to the terminal.
  • the system may further include: a media switching module 84, configured to forward the media code stream acquired by the access module to the access module
  • the corresponding media processing module outputs the multi-screen processed by the media processing module to the access module corresponding to the media processing module.
  • Figure 10 is a block diagram showing the structure of a media processing device in accordance with a preferred embodiment of the present invention. As shown in FIG. 10, the media processing device mainly includes: a receiving module 10, a processing module 12, and an output module 14.
  • the receiving module 10 is configured to receive a multi-media media stream, where each media code stream is used to display a picture of the scene.
  • the receiving module 10 receives the terminal code stream, where the terminal code stream passes through the network and
  • the MCU access module enters the MCU.
  • the processing module 12 is configured to synthesize the received multi-media media code streams into a set of multiple pictures according to a layout, wherein a size of at least one of the pictures is equal to a real size of the displayed scene.
  • the processing module 12 synthesizes the multi-picture according to the picture layout and the multi-picture stuffing stream information.
  • the processing module 12 is further configured to perform code stream protocol processing and rate conversion.
  • the output module 14 is configured to output the combined set of multi-pictures.
  • the media processing device may synthesize the received multi-media media code streams into a plurality of multi-pictures according to a predetermined layout, wherein at least one of the above-mentioned layouts has a size equal to a real size of the displayed scene, and thus is processed by the media processing device.
  • the latter image can maintain the true size, does not affect the eye contact, and effectively improves the user experience.
  • there may be at least one small picture in the multi-picture wherein the size of each small picture is smaller than the real size of the displayed picture.
  • the layout of the multi-picture can be specifically seen in FIG. 4 and FIG. 5.
  • the user can see the screen of multiple scenes as much as possible, and effectively improve the viewing experience of the user.
  • it is possible to synthesize a plurality of site information as much as possible, and maintain the true size of the scene displayed on the screen, thereby facilitating eye contact between users and effectively improving the user's presence.
  • Body-risk Obviously, those skilled in the art should understand that the above modules or steps of the present invention can be implemented by a general-purpose computing device, which can be concentrated on a single computing device or distributed over a network composed of multiple computing devices.
  • the computing device may be implemented by program code executable by the computing device, such that they may be stored in the storage device by the computing device and, in some cases, may be different from the order herein.
  • the steps shown or described are performed, or they are separately fabricated into individual integrated circuit modules, or a plurality of modules or steps are fabricated as a single integrated circuit module.
  • the invention is not limited to any specific combination of hardware and software.
  • the above is only the preferred embodiment of the present invention, and is not intended to limit the present invention, and various modifications and changes can be made to the present invention. Any modifications, equivalent substitutions, improvements, etc. made within the scope of the present invention are intended to be included within the scope of the present invention.

Landscapes

  • Engineering & Computer Science (AREA)
  • Multimedia (AREA)
  • Signal Processing (AREA)
  • Two-Way Televisions, Distribution Of Moving Picture Or The Like (AREA)

Abstract

The present invention discloses a multi-picture synthesis method and system, and a media processing device, which are applied to video conference systems. The multi-picture synthesis method includes the following steps: acquiring multiple paths of media code streams, wherein each path of media code streams is used for displaying one picture of a scene; synthesizing multiple paths of media code streams into a multi-picture group according to a layout, wherein the size of at least one picture in the layout is equal to the actual size of the displayed scene. According to the technical scheme provided by the invention, the actual size of the picture-displaying scene can be kept while multiple conference place information is synthesized as much as possible, which effectively improves the presence experiences of users.

Description

多画面合成方法、 系统及媒体处理装置 技术领域 本发明涉及通信领域, 具体而言, 涉及一种多画面合成方法、 系统及媒 体处理装置。 背景技术 会议电视技术的出现和发展, 逐渐改变了人类的社会活动方式, 视频通 信也成为人类社会经济生活中不可缺少的一部分。传统的通信工具,如电话、 传真机等都无法达到面对面或一群人聚集在一起的沟通效果。 对企事业单位 来说, 出差开会, 已成为令人苦恼, 望而却步的事情。 釆用会议电视的方式 既达到了召开会议的目的, 又避免了出差到外地。 视频会议系统主要包括多点控制单元 (Multipoint Control Unit, 简称为 MCU ) 和终端系统, 其中, MCU是多点视频会议系统的关键设备, 它从来 自各会议场点的信息流, 抽取出音频、 视频、 数据等信息和信令, 再将各会 议场点的信息和信令 分别送入多点控制模块和媒体处理模块, 完成相应的 音频混合或切换、 枧频混合或切换、 数据广播和路由选择、 定时和会议控制 等过程, 最后将各会议场点所需的各种信息重新组合起来 送往各相 ^的终 端系统设备。 终端系统分为桌面式会议终端和会议室型会议终端两大类。 桌 面式会议终端成本低、 使用方便等特点, 适合个人办公使用和召开小规模的 会议。 会议室型终端配备有高品质变焦镜头、 高保真音响、 大屏幕彩电或投 影等外部辅助设备、 加上视频的前处理 /后处理器, 使得画面画质更清晰、 达 到更好的会议效果 , 适合召开较大规模的会议 会议室终端适用于会议室 , 几个到几十个与会人员的环境。 广播终端是单向接收终端, 它可以接收会议 的图象与声音、 但是不能发送图像与声音。 广播终端可以用于只需要单向传 递信息的场合, 例如上级向下级传达政策等场合。 移动终端是在桌面型终端 的基础上, 配上无线接入卡和无线发射装置 可以在一定地区范围内移动加 入会议。 会议终端配置在.视频会议中的本地会议场点和各分 ·会场点。 在传统的会议电视中, 通过 MCU和终端系统的密切配合, 以分展方式 实现多画面的同时输出。 从 4、 9、 16的对称分展到 1 + 5、 1 + 7、 3 + 4、 2 + 8、 1 + 12等不对称分展, 从 1 + 1、 1 + 2等画中画到 6、 2 + 4等 16: 9宽 屏显示, 多种不同的分屏组合方式, 满足用户任意的显示要求。 以下结合图 1描述传统的会议电视显示的多画面布局。 该布局为 9个画 面, 每个画面大小相同, 为整个画面大小的 1/9。 传统会场不要求真身大小, 更期望能够看到更多会场信息。 9个画面每排 3个小画面可以显示一个多流 会场, 9个画面能够显示 3个多流会场。 由此可知, 用户虽然可以看到尽可能多的会场, 但是, 用户观看的画面 无法保持其显示场景的真身大小、 也难以有效进行眼神交流, 因此降低了用 户临场感体验。 发明内容 针对相关技术中无法在尽可能多地合成多个场景时, 还保持场景的真身 大小的问题, 本发明的主要目的在于提供一种多画面合成方法、 系统及媒体 处理装置, 以解决上述问题至少之一。 才艮据本发明的一个方面, 提供了一种多画面合成方法。 才艮据本发明的多画面合成方法, 应用于视频会议系统, 该方法包括: 获 取多路媒体码流, 其中, 每路媒体码流用于显示场景的一个画面; 按照布局 将多路媒体码流合成为一组多画面, 其中, 布局中的至少一个画面的大小等 于所显示场景的真实大小。 才艮据本发明的另一方面, 提供了一种多画面合成系统。 才艮据本发明的多画面合成系统包括: 接入模块, 用于获取多路媒体码流, 其中, 每路媒体码流用于显示场景的一个画面; 媒体处理模块, 用于按照布 局将多路媒体码流合成为一组多画面, 其中, 布局中的至少一个画面的大小 等于所显示场景的真实大小。 根据本发明的又一方面, 提供了一种媒体处理装置。 根据本发明的媒体处理装置包括: 接收模块, 用于接收多路媒体码流, 其中, 每路媒体码流用于显示场景的一个画面; 处理模块, 用于按照布局将 接收到的多路媒体码流合成为一组多画面, 其中, 布局中的至少一个画面的 大小等于所显示场景的真实大小。 输出模块, 用于将合成后的一组多画面输 出。 通过本发明,按照预定的布局将多路媒体码流合成为一组多画面, 其中 , 该布局中的至少一个画面的大小等于所显示场景的真实大小, 解决了相关技 术中无法在尽可能多地合成多个场景时, 还保持场景的真身大小的问题, 进 而可以在尽可能地合成多个会场信息外,还能保持画面显示场景的真实大小, 有效提高了用户临场感体 -险。 附图说明 此处所说明的附图用来提供对本发明的进一步理解, 构成本申请的一部 分, 本发明的示意性实施例及其说明用于解释本发明, 并不构成对本发明的 不当限定。 在附图中: 图 1是传统的会议电视显示的多画面布局; 图 2是视频会议系统的结构示意图; 图 3是才艮据本发明实施例的多画面合成方法的流程图; 图 4是才艮据本发明优选实施例一的多画面布局的示意图; 图 5是 居本发明优选实施例二的多画面布局的示意图; 图 6是才艮据本发明优选实施例三的多组多画面布局的示意图; 图 7是才艮据本发明优选实施例的多画面合成方法的流程图; 图 8是才艮据本发明实施例的多画面合成系统的结构框图; 图 9是才艮据本发明优选实施例的多画面合成系统的结构框图; 图 10是根据本发明优选实施例的媒体处理装置的结构框图。 具体实施方式 下文中将参考附图并结合实施例来详细说明本发明。 需要说明的是, 在 不冲突的情况下, 本申请中的实施例及实施例中的特征可以相互组合。 图 2是视频会议系统的结构示意图。如图 2所示,该视频会议系统包括: 单、 多视频流会场和多点控制单元 ( Multipoint Control Unit, 简称为 MCU ) 13。 其中, 111、 112、 113为多流会场, 121、 122、 123为单流会场。 其中, 111会场为广播源会场, 其他会场都看 111会场; 111所看会场为 113会场。 单流会场 121、 122、 123只能选看 111会场中多流的一路码流, 存在信息不 完备的缺陷, 因此需要媒体处理模块把多路码流合成为一路码流。 111会场 虽然是多流会场, 但是也只能看一个多流会场, 也需要媒体处理模块把其他 会场码流合成一路码流。 在上述视频会议系统中, 如果用户需要尽可能地看到多个会场信息, 还 能看到画面显示的会场场景的真实大小, 则需要提供一种新的多画面合成方 法, 以下结合图 3进行描述。 图 3是 居本发明实施例的多画面合成方法的流程图。 如图 3所示, 该 多画面合成方法包括以下处理: 步骤 S302: 获取多路媒体码流, 其中, 每路媒体码流用于显示场景的一 个画面; 步骤 S304: 按照布局将多路媒体码流合成为一组多画面, 其中, 布局中 的至少一个画面的大小等于所显示场景的真实大小。 釆用上述方法, 由于上述布局中的至少一个画面的大小等于所显示场景 的真实大小, 用户观看的画面可以保持其显示场景的真身大小、 也能够有效 进行眼神交流, 因此提高了用户临场感体验。 优选地, 上述多画面中还可以有至少一个小画面, 其中, 每个小画面的 大小小于所显示场景的真实大小。 通过上述处理, 在一组多画面中既可以包含一个或多个场景真身大小的 画面, 也可以包含一个或多个大小小于所显示场景真实大小的画面, 从而可 以达到尽可能多地合成多个场景, 同时还保持场景的真身大小的目的。 以下结合图 4和图 5描述上述优选实施过程。 图 4是才艮据本发明优选实施例一的多画面布局的示意图; 如图 4所示, 该布局由 4个画面组合而成, 其中 414对应是大画面, 并在其底部中间叠加 411、 412、 413共 3个长宽比 16: 9的小画面, 小画面的面积、在大画面的 1/36 至 1/16之间。 411、 412、 413媒体源来自同一多流会场或者多个单流会场。 可见, 414保持了真身大小, 不影响眼神交流, 411、 412、 413又能够看到其 他会场的图像信息。 图 5是 居本发明优选实施例二的多画面布局的示意图; 如图 5所示, 该布局与图 4的区别主要在于小画面叠加在顶部中间。 与图 4一样, 大画面 52保持了真身大小, 不影响眼神交流, 511、 512、 513又能够看到其他会场 的图像信息。 优选地, 根据会议状态, 可以动态确定需要合成的多画面组数, 每组多 画面用一个展幕显示。 在优选实施过程中, 上述多组多画面中,也可以釆用传统的多画面布局, 具体可以参见图 1。 以下结合图 6描述上述优选实施过程。 图 6是才艮据本发明优选实施例三的多组多画面布局的示意图; 本实施例 以图 2为例, 给出不同会场所看多画面合成后的图像信息。 611为 111左屏 所看码流示意图, 其图像分别来自 113左席和 121、 122、 123会场。 612为 111中展所看码流示意图, 其图像分别来自 113中席和 112会场图像。 613 为 111右屏所看码流示意图, 其图像来自 113右席图像。 可见 111所看会场 大画面图像信息来自 113 , 其大小保持真身大小, 此外还叠加了 112, 121、 122、 123会场图像。 621 , 622, 623为 113所看图像信息, 其大画面媒体源 为 111会场, 并在 622上合成多画面叠加 112会场信息, 因此 113能够同时 看到 111和 112会场视频。 63为 121、 122、 123单流会场所看的 9个画面图 像, 其媒体码流分别来自 111、 112、 113三个会场。 优选地, 该方法还可以包括以下处理: 在会议中根据会议状态, 确定更 改当前布局; 根据更改后的布局, 将多路媒体码流合成为一组多画面。 优选地, 该方法还可以包括以下处理: 在会议中根据会议状态, 确定更 ?丈码流源; 从新的码流源获取新的多路媒体码流; 按照布局将新的多路媒体 码¾¾合成为一组多画面。 才艮据会议状态, 灵活地更改当前布局或者码流源, 可以有效提高用户的 观看体验。 以下结合图 7描述上述优选实施过程。 图 7是才艮据本发明优选实施例的多画面合成方法的流程图。如图 7所示, 该多画面合成方法包括以下处理: 步骤 S702: 在会议召开过程中, 根据 MCU图像合成能力和与会终端特 点来决定合成几组多画面, 每组多画面釆用何种布局。 其中, MCU图像合成能力指的是能够合成多少个多画面, 还包括不同 协议不同速率间转码能力。 当能力有限时, 首先保证主席会场的多画面合成。 多画面釆用的布局, 可以才艮据实际情况选择图 1、 图 4、 图 5所示布局。 普 通会场一般釆用图 1布局, 网真会场一般釆用图 4或图 5布局。 此外, 布局 和画面数还可以在会议中动态调整。 步骤 S704: 会议中根据会议状态随时改变多画面由哪些终端码流合成TECHNICAL FIELD The present invention relates to the field of communications, and in particular to a multi-picture synthesis method, system, and media processing apparatus. BACKGROUND OF THE INVENTION The emergence and development of conference television technology has gradually changed the way human beings socialize, and video communication has become an indispensable part of human social and economic life. Traditional communication tools, such as telephones and fax machines, cannot achieve the communication effect of face-to-face or group of people. For enterprises and institutions, business trips have become an annoying and prohibitive business. The use of conference television has not only achieved the purpose of convening a conference, but also avoided traveling to the field. The video conference system mainly includes a multipoint control unit (MCU) and a terminal system. The MCU is a key device of the multipoint video conference system, and extracts audio from the information flow from each conference site. Video, data and other information and signaling, and then send the information and signaling of each conference site to the multi-point control module and the media processing module respectively, to complete the corresponding audio mixing or switching, frequency mixing or switching, data broadcasting and routing. Processes such as selection, timing, and conference control, and finally reassembling the various information required for each conference site and sending it to each terminal system device. The terminal system is divided into two types: desktop conference terminal and conference room conference terminal. The desktop conference terminal is low in cost and easy to use, and is suitable for personal office use and small-scale conferences. The conference room type terminal is equipped with high-quality zoom lens, high-fidelity audio, large-screen color TV or projection and other external auxiliary equipment, plus video pre/post processor, which makes the picture quality clearer and achieve better conference results. Suitable for holding large-scale conference room terminals suitable for conference rooms, several to dozens of participants' environment. A broadcast terminal is a one-way receiving terminal that can receive images and sounds of a conference, but cannot transmit images and sounds. The broadcast terminal can be used in situations where only one-way information needs to be transmitted, such as a policy in which the superior level conveys a policy to the lower level. The mobile terminal is based on the desktop terminal, and is equipped with a wireless access card and a wireless transmitting device to move into the conference within a certain area. The conference terminal is configured in the local conference site and each venue and conference site in the video conference. In the traditional conference TV, through the close cooperation of the MCU and the terminal system, the simultaneous output of multiple pictures is realized in a split manner. Symmetrical split from 4, 9, 16 to 1 + 5, 1 + 7, 3 + 4, 2 + 8, 1 + 12 and other asymmetric exhibitions, from 1 + 1, 1 + 2 and so on to 6, 2 + 4, etc. 16: 9 widescreen display, a variety of different split screen combinations, to meet the user's arbitrary Display requirements. The multi-screen layout of a conventional conference television display will be described below with reference to FIG. The layout is 9 screens, and each screen has the same size, which is 1/9 of the entire screen size. Traditional venues do not require real size, and they are expected to see more venue information. 9 screens Each row of 3 small screens can display a multi-stream venue, and 9 screens can display 3 multi-stream venues. It can be seen that although the user can see as many sites as possible, the user can not maintain the true size of the displayed scene, and it is difficult to effectively communicate with the eyes, thus reducing the user's presence experience. SUMMARY OF THE INVENTION In view of the problem in the related art that a plurality of scenes cannot be synthesized as much as possible, and the true size of the scene is maintained, the main object of the present invention is to provide a multi-picture synthesis method, system, and media processing apparatus to solve At least one of the above issues. According to an aspect of the present invention, a multi-picture synthesis method is provided. According to the multi-picture synthesis method of the present invention, the method is applied to a video conference system, the method comprising: acquiring a multi-channel media code stream, wherein each media code stream is used to display one picture of the scene; and the multi-channel media stream is arranged according to the layout. The composition is a set of multiple pictures, wherein the size of at least one of the pictures is equal to the true size of the displayed scene. According to another aspect of the present invention, a multi-picture composition system is provided. The multi-picture composition system according to the present invention includes: an access module, configured to acquire a multi-channel media code stream, wherein each media code stream is used to display one picture of the scene; and a media processing module is configured to multi-channel according to the layout The media stream is synthesized into a set of multiple pictures, wherein the size of at least one of the frames is equal to the true size of the displayed scene. According to still another aspect of the present invention, a media processing apparatus is provided. The media processing device according to the present invention includes: a receiving module, configured to receive a multi-channel media code stream, wherein each media code stream is used to display one picture of the scene; and a processing module, configured to receive the received multi-channel media code according to the layout Streaming into a set of multiple pictures, where at least one of the frames is in the layout The size is equal to the true size of the displayed scene. An output module for outputting a combined set of multi-pictures. According to the present invention, the multiplexed media streams are combined into a plurality of frames according to a predetermined layout, wherein the size of at least one of the layouts is equal to the real size of the displayed scene, and the related art cannot be solved as much as possible. When a plurality of scenes are synthesized, the problem of the true size of the scene is also maintained, and in addition, a plurality of venue information can be synthesized as much as possible, and the real size of the scene can be maintained, thereby effectively improving the user's presence and danger. BRIEF DESCRIPTION OF THE DRAWINGS The accompanying drawings, which are set to illustrate,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,, In the drawings: FIG. 1 is a multi-screen layout of a conventional conference television display; FIG. 2 is a schematic structural diagram of a video conference system; FIG. 3 is a flowchart of a multi-screen synthesis method according to an embodiment of the present invention; BRIEF DESCRIPTION OF THE DRAWINGS FIG. 5 is a schematic diagram of a multi-screen layout according to a preferred embodiment 2 of the present invention; FIG. 6 is a multi-group multi-screen according to a preferred embodiment 3 of the present invention; Figure 7 is a flow chart of a multi-picture synthesis method according to a preferred embodiment of the present invention; Figure 8 is a block diagram showing the structure of a multi-picture synthesis system according to an embodiment of the present invention; A block diagram of a multi-picture synthesis system of a preferred embodiment of the invention; FIG. 10 is a block diagram showing the structure of a media processing device in accordance with a preferred embodiment of the present invention. BEST MODE FOR CARRYING OUT THE INVENTION Hereinafter, the present invention will be described in detail with reference to the accompanying drawings. It should be noted that the embodiments in the present application and the features in the embodiments may be combined with each other without conflict. 2 is a schematic structural diagram of a video conference system. As shown in FIG. 2, the video conference system includes: a single and multiple video stream conference site and a multipoint control unit (MCU for short) 13 . Among them, 111, 112, and 113 are multi-stream venues, and 121, 122, and 123 are single-stream venues. Among them, 111 venues are broadcast source venues, other venues are watching 111 venues; 111 are viewing venues 113 venues. The single-stream sites 121, 122, and 123 can only select one stream of multiple streams in the 111 site, and there is a defect that the information is incomplete. Therefore, the media processing module needs to synthesize the multiple streams into one stream. Although the 111 venue is a multi-stream venue, it can only see one multi-stream venue. It also needs a media processing module to combine other site code streams into one stream. In the above video conferencing system, if the user needs to see multiple site information as much as possible, and can also see the real size of the site scene displayed on the screen, it is necessary to provide a new multi-screen synthesis method, which is described below in conjunction with FIG. description. 3 is a flow chart of a multi-screen synthesis method in accordance with an embodiment of the present invention. As shown in FIG. 3, the multi-screen synthesis method includes the following processes: Step S302: Acquire a multi-media media code stream, where each media code stream is used to display one picture of the scene; Step S304: multiplex media code stream according to the layout The composition is a set of multiple pictures, wherein the size of at least one of the pictures is equal to the true size of the displayed scene. According to the above method, since the size of at least one of the above-mentioned layouts is equal to the real size of the displayed scene, the screen viewed by the user can maintain the true size of the displayed scene, and can also effectively perform eye contact, thereby improving the user's presence. Experience. Preferably, there may be at least one small picture in the multi-picture, wherein the size of each small picture is smaller than the real size of the displayed picture. Through the above processing, in one set of multiple pictures, one or more scenes of a real body size may be included, or one or more pictures having a size smaller than the real size of the displayed scene may be included, so that as many synthesized as possible can be achieved. Scenes, while maintaining the true size of the scene. The above preferred implementation process will be described below in conjunction with FIGS. 4 and 5. 4 is a schematic diagram of a multi-screen layout according to a preferred embodiment 1 of the present invention; as shown in FIG. 4, the layout is composed of 4 screens, wherein 414 corresponds to a large screen, and 411 is superimposed in the middle of the bottom thereof. 412, 413 a total of 3 aspect ratio 16:9 small picture, the size of the small picture, 1/36 in the large picture Between 1/16. The 411, 412, and 413 media sources are from the same multi-stream site or multiple single-stream sites. It can be seen that the 414 maintains the true size and does not affect the eye contact, and the 411, 412, and 413 can see the image information of other venues. FIG. 5 is a schematic diagram of a multi-screen layout according to a preferred embodiment 2 of the present invention; as shown in FIG. 5, the layout differs from FIG. 4 mainly in that a small screen is superimposed in the middle of the top. As in Fig. 4, the large screen 52 maintains the true size, does not affect the eye contact, and 511, 512, and 513 can see the image information of other venues. Preferably, according to the state of the conference, the number of multi-screen groups that need to be synthesized can be dynamically determined, and each group of multi-screens is displayed by one screen. In a preferred implementation process, a conventional multi-screen layout may also be used in the multiple sets of multiple pictures. For details, refer to FIG. 1. The above preferred implementation process will be described below in conjunction with FIG. FIG. 6 is a schematic diagram of a multi-group multi-screen layout according to a preferred embodiment 3 of the present invention; in this embodiment, FIG. 2 is taken as an example to show image information after multi-screen synthesis in different meeting places. 611 is a schematic diagram of the code stream viewed from the left screen of 111, and the images are from the left left seat of 113 and the venues of 121, 122, and 123 respectively. 612 is a schematic diagram of the code stream seen in the 111 exhibition, and the images are from the 113 seats and the 112 site images respectively. 613 is a schematic diagram of the code stream viewed from the right screen of 111, and the image is from the 113 right-seat image. It can be seen that the image information of the large screen of 111 is from 113, its size is kept true, and 112, 121, 122, 123 venue images are superimposed. 621, 622, and 623 are the image information of 113, and the large-screen media source is the 111 site, and the multi-screen superimposed 112 site information is synthesized on 622, so 113 can simultaneously view the 111 and 112 site videos. 63 is the nine screen images of the 121, 122, and 123 single-flow meeting places, and the media code streams are from the three conference sites of 111, 112, and 113 respectively. Preferably, the method may further include: processing, in the conference, determining to change the current layout according to the conference state; and synthesizing the multi-media media streams into a group of multiple images according to the changed layout. Preferably, the method may further include the following processing: determining a more according to the state of the conference in the conference? The code stream source; acquires a new multi-media stream from the new stream source; synthesizes the new multi-media code 3⁄43⁄4 into a set of multi-pictures according to the layout. According to the state of the conference, flexible changes to the current layout or code stream source can effectively improve the user's viewing experience. The above preferred implementation process will be described below in conjunction with FIG. 7 is a flow chart of a multi-picture synthesis method in accordance with a preferred embodiment of the present invention. As shown in FIG. 7, the multi-screen synthesis method includes the following processing: Step S702: During the conference holding process, according to the MCU image synthesis capability and the characteristics of the conference terminal, it is determined to synthesize several groups of multiple screens, and what layout is used for each group of multiple screens . Among them, the MCU image synthesis capability refers to how many multi-pictures can be synthesized, and also includes transcoding capabilities between different protocols at different rates. When the ability is limited, first ensure the multi-screen synthesis of the chairman's venue. The layout of multi-screen use can be selected according to the actual situation, the layout shown in Figure 1, Figure 4, Figure 5. Ordinary venues generally use the layout of Figure 1, and the telepresence venues generally use the layout of Figure 4 or Figure 5. In addition, the layout and number of frames can be dynamically adjusted during the meeting. Step S704: In the conference, according to the state of the conference, change which terminal stream is synthesized by the multi-screen at any time.
(相当于上述在会议中根据会议状态,确定更改码流源)。会议中可能发生广 播源改变, 则每组多画面所合成的媒体码流要随之变化。 图 4、 图 5布局中 大画面选择广播源会场码流或者广播源所看会场码流, 小画面选择其他会场 码流并按次序填充。 图 1布局中媒体码流可由用户指定, 或者显示最近有发 言的多个会场。 步骤 S706: 根据每个会场特点选择不同的媒体码流传输给该会场。 该会 场通过多路码流能够直接看到其他会场的场景, 或者其他会场合成后的多画 面, 具体可以 -据实际情况而定。 在优选实施过程中, 可以实时调整以下至少之一: 多画面组数、 多画面 的布局、 媒体源。 具体地, 系统可以才艮据预定配置动态调整, 也可以响应用 户的调整指令进行动态调整。 图 8是 居本发明实施例的多画面合成系统的结构框图。 如图 8所示, 该多画面合成系统, 应用于视频会议系统, 包括: 接入模块 80、 媒体处理模 块 82。 接入模块 80 , 用于获取多路媒体码流, 其中, 每路媒体码流用于显示场 景的一个画面; 媒体处理模块 82 , 用于按照布局将多路媒体码流合成为一组多画面, 其 中, 布局中的至少一个画面的大小等于所显示场景的真实大小。 釆用上述系统, 由于上述布局中的至少一个画面的大小等于所显示场景 的真实大小, 用户观看的画面可以保持其显示场景的真身大小、 也能够有效 进行眼神交流, 因此提高了用户临场感体验。 优选地, 上述接入模块 80, 还用于将媒体处理模块处理后的一组多画面 输出至终端。 优选地, 如图 9所示, 如果接入模块和媒体处理模块均为多个, 则系统 还可以包括: 媒体交换模块 84 , 用于将接入模块获取的媒体码流转发至该接 入模块对应的媒体处理模块, 将媒体处理模块处理后的多画面输出至该媒体 处理模块对应的接入模块。 需要注意的是, 上述多画面合成系统中各个模块相互结合的优选实施方 式可以参见图 3至图 7中的描述, 此处不再赘述。 图 10是根据本发明优选实施例的媒体处理装置的结构框图。 如图 10所 示, 该媒体处理装置主要包括: 接收模块 10、 处理模块 12、 以及输出模块 14。 接收模块 10, 用于接收多路媒体码流, 其中, 每路媒体码流用于显示场 景的一个画面; 在优选实施过程中, 接收模块 10接收终端码流, 其中, 该终端码流通 过网络和 MCU接入模块进入到 MCU内部。 处理模块 12 , 用于按照布局将接收到的多路媒体码流合成为一组多画 面, 其中, 该布局中的至少一个画面的大小等于所显示场景的真实大小。 在优选实施过程中, 处理模块 12才艮据画面布局和多画面填充码流信息 合成多画面。 其中, 该处理模块 12还用于进行码流协议处理和速率转换。 输出模块 14 , 用于将合成后的一组多画面输出。 上述媒体处理装置可以按照预定的布局将接收到的多路媒体码流合成为 一组多画面, 其中, 上述布局中的至少一个画面的大小等于所显示场景的真 实大小, 因而经过媒体处理装置处理后的图像能够保持真身大小, 不影响眼 神交流, 有效提高了用户体验。 优选地, 多画面中还可以有至少一个小画面, 其中, 每个小画面的大小 小于所显示场景的真实大小。 该多画面的布局具体可以参见图 4和图 5。 通过上述处理, 可以使用户尽可能地看到多个场景的画面, 有效提高了 用户的观看体验。 综上所述, 借助本发明提供的上述实施例, 可以在尽可能地合成多个会 场信息外,还能保持画面显示场景的真实大小,便于用户之间进行眼神交流, 有效提高了用户临场感体 -险。 显然, 本领域的技术人员应该明白, 上述的本发明的各模块或各步骤可 以用通用的计算装置来实现, 它们可以集中在单个的计算装置上, 或者分布 在多个计算装置所组成的网络上, 可选地, 它们可以用计算装置可执行的程 序代码来实现, 从而, 可以将它们存储在存储装置中由计算装置来执行, 并 且在某些情况下, 可以以不同于此处的顺序执行所示出或描述的步骤, 或者 将它们分别制作成各个集成电路模块, 或者将它们中的多个模块或步骤制作 成单个集成电路模块来实现。 这样, 本发明不限制于任何特定的硬件和软件 结合。 以上所述仅为本发明的优选实施例而已, 并不用于限制本发明, 对于本 领域的技术人员来说, 本发明可以有各种更改和变化。 凡在本发明的^"神和 原则之内, 所作的任何修改、 等同替换、 改进等, 均应包含在本发明的保护 范围之内。 (Equivalent to the above in the meeting according to the state of the meeting, to determine the source of the code stream). The broadcast source change may occur in the conference, and the media stream synthesized by each group of multiple pictures will change accordingly. In the layout of Figure 4 and Figure 5, select the broadcast source site stream or the site stream viewed by the broadcast source. The small screen selects other site streams and fills them in order. The media code stream in the layout of Figure 1 can be specified by the user, or multiple sites that have recently spoken. Step S706: Select different media code streams to be transmitted to the site according to the characteristics of each site. The site can directly view the scenes of other sites through multiple streams, or multiple screens synthesized by other sites. Specifically, it can be determined according to actual conditions. In a preferred implementation, at least one of the following may be adjusted in real time: multi-picture group number, multi-picture layout, media source. Specifically, the system may dynamically adjust according to a predetermined configuration, or may dynamically adjust in response to a user's adjustment instruction. Fig. 8 is a block diagram showing the structure of a multi-picture synthesizing system according to an embodiment of the present invention. As shown in FIG. 8, the multi-screen synthesis system is applied to a video conference system, and includes: an access module 80 and a media processing module 82. The access module 80 is configured to acquire a multi-media media stream, where each media code stream is used to display one picture of the scene, and the media processing module 82 is configured to synthesize the multiple media code streams into a set of multiple pictures according to the layout. The size of at least one of the screens in the layout is equal to the true size of the displayed scene. According to the above system, since the size of at least one of the above-mentioned layouts is equal to the real size of the displayed scene, the screen viewed by the user can maintain the true size of the displayed scene, and can also effectively perform eye contact, thereby improving the user's presence. Experience. Preferably, the access module 80 is further configured to output a set of multiple pictures processed by the media processing module to the terminal. Preferably, as shown in FIG. 9, if there are multiple access modules and media processing modules, the system may further include: a media switching module 84, configured to forward the media code stream acquired by the access module to the access module The corresponding media processing module outputs the multi-screen processed by the media processing module to the access module corresponding to the media processing module. It should be noted that the preferred embodiments of the foregoing modules in the multi-screen synthesis system may be referred to the description in FIG. 3 to FIG. 7, and details are not described herein again. Figure 10 is a block diagram showing the structure of a media processing device in accordance with a preferred embodiment of the present invention. As shown in FIG. 10, the media processing device mainly includes: a receiving module 10, a processing module 12, and an output module 14. The receiving module 10 is configured to receive a multi-media media stream, where each media code stream is used to display a picture of the scene. In a preferred implementation, the receiving module 10 receives the terminal code stream, where the terminal code stream passes through the network and The MCU access module enters the MCU. The processing module 12 is configured to synthesize the received multi-media media code streams into a set of multiple pictures according to a layout, wherein a size of at least one of the pictures is equal to a real size of the displayed scene. In a preferred implementation, the processing module 12 synthesizes the multi-picture according to the picture layout and the multi-picture stuffing stream information. The processing module 12 is further configured to perform code stream protocol processing and rate conversion. The output module 14 is configured to output the combined set of multi-pictures. The media processing device may synthesize the received multi-media media code streams into a plurality of multi-pictures according to a predetermined layout, wherein at least one of the above-mentioned layouts has a size equal to a real size of the displayed scene, and thus is processed by the media processing device. The latter image can maintain the true size, does not affect the eye contact, and effectively improves the user experience. Preferably, there may be at least one small picture in the multi-picture, wherein the size of each small picture is smaller than the real size of the displayed picture. The layout of the multi-picture can be specifically seen in FIG. 4 and FIG. 5. Through the above processing, the user can see the screen of multiple scenes as much as possible, and effectively improve the viewing experience of the user. In summary, with the above embodiments provided by the present invention, it is possible to synthesize a plurality of site information as much as possible, and maintain the true size of the scene displayed on the screen, thereby facilitating eye contact between users and effectively improving the user's presence. Body-risk. Obviously, those skilled in the art should understand that the above modules or steps of the present invention can be implemented by a general-purpose computing device, which can be concentrated on a single computing device or distributed over a network composed of multiple computing devices. Alternatively, they may be implemented by program code executable by the computing device, such that they may be stored in the storage device by the computing device and, in some cases, may be different from the order herein. The steps shown or described are performed, or they are separately fabricated into individual integrated circuit modules, or a plurality of modules or steps are fabricated as a single integrated circuit module. Thus, the invention is not limited to any specific combination of hardware and software. The above is only the preferred embodiment of the present invention, and is not intended to limit the present invention, and various modifications and changes can be made to the present invention. Any modifications, equivalent substitutions, improvements, etc. made within the scope of the present invention are intended to be included within the scope of the present invention.

Claims

权 利 要 求 书 Claim
1. 一种多画面合成方法, 应用于视频会议系统, 其特征在于, 包括: 获取多路媒体码流, 其中, 每路媒体码流用于显示场景的一个画 面; A multi-screen synthesis method, which is applied to a video conference system, and includes: acquiring a multi-channel media code stream, wherein each media code stream is used to display a picture of the scene;
按照布局将所述多路媒体码流合成为一组多画面, 其中, 所述布 局中的至少一个画面的大小等于所显示场景的真实大小。  The multi-media media stream is synthesized into a set of multi-pictures according to a layout, wherein a size of at least one of the layouts is equal to a real size of the displayed scene.
2. 根据权利要求 2所述的方法, 其特征在于, 所述多画面中有至少一个 小画面, 其中, 每个小画面的大小小于所显示场景的真实大小。 The method according to claim 2, wherein the multi-picture has at least one small picture, wherein a size of each small picture is smaller than a real size of the displayed picture.
3. 根据权利要求 1或 2所述的方法, 其特征在于, 还包括: The method according to claim 1 or 2, further comprising:
根据会议状态, 确定需要合成的多画面组数, 每组所述多画面用 一个屏幕显示。  According to the state of the conference, the number of multi-screen groups that need to be synthesized is determined, and each of the plurality of screens is displayed on one screen.
4. 根据权利要求 1所述的方法, 其特征在于, 还包括: 4. The method according to claim 1, further comprising:
在会议中才艮据会议状态, 确定更改所述布局;  In the meeting, according to the state of the meeting, it is determined to change the layout;
才艮据所述更改后的布局,将所述多路媒体码流合成为一组多画面。  The multi-media media streams are combined into a set of multi-pictures according to the changed layout.
5. 根据权利要求 1所述的方法, 其特征在于, 还包括: 5. The method according to claim 1, further comprising:
在会议中才艮据会议状态, 确定更改码流源;  Determine the source of the change code stream according to the status of the conference during the conference;
从所述更改后的码流源获取新的多路媒体码流;  Obtaining a new multi-media media code stream from the changed code stream source;
按照布局将所述新的多路媒体码流合成为一组多画面。  The new multi-media stream is synthesized into a set of multi-pictures according to a layout.
6. —种多画面合成系统, 应用于视频会议系统, 其特征在于, 包括: 接入模块, 用于获取多路媒体码流, 其中, 每路媒体码流用于显 示场景的一个画面; A multi-screen synthesis system, which is applied to a video conference system, and includes: an access module, configured to acquire a multi-channel media code stream, where each media code stream is used to display a picture of the scene;
媒体处理模块, 用于按照布局将所述多路媒体码流合成为一组多 画面, 其中, 所述布局中的至少一个画面的大小等于所显示场景的真 实大小。 And a media processing module, configured to synthesize the multiple media code streams into a set of multiple pictures according to a layout, wherein a size of at least one of the pictures is equal to a real size of the displayed scene.
7. 根据权利要求 6所述的系统, 其特征在于, 7. The system of claim 6 wherein:
所述接入模块, 还用于将所述媒体处理模块处理后的所述一组多 画面输出至终端。  The access module is further configured to output the set of multiple pictures processed by the media processing module to the terminal.
8. 根据权利要求 7所述的系统, 其特征在于, 所述接入模块和所述媒体 处理模块均为多个, 所述系统还包括: The system of claim 7, wherein the access module and the media processing module are both, and the system further includes:
媒体交换模块, 用于将所述接入模块获取的媒体码流转发至该接 入模块对应的媒体处理模块, 将所述媒体处理模块处理后的所述多画 面输出至该媒体处理模块对应的接入模块。  a media switching module, configured to forward the media code stream obtained by the access module to a media processing module corresponding to the access module, and output the multi-screen processed by the media processing module to a corresponding one of the media processing module Access module.
9. 一种媒体处理装置, 应用于视频会议系统, 其特征在于, 包括: 接收模块, 用于接收多路媒体码流, 其中, 每路媒体码流用于显 示场景的一个画面; A media processing device, applied to a video conferencing system, comprising: a receiving module, configured to receive a multi-media media code stream, where each media code stream is used to display a picture of a scene;
处理模块, 用于按照布局将接收到的所述多路媒体码流合成为一 组多画面, 其中, 所述布局中的至少一个画面的大小等于所显示场景 的真实大小;  a processing module, configured to synthesize the received multi-media media stream into a plurality of multi-screens according to a layout, wherein a size of at least one of the layouts is equal to a real size of the displayed scene;
输出模块, 用于将合成后的所述一组多画面输出。  And an output module, configured to output the synthesized set of multiple pictures.
10. 根据权利要求 9所述的装置, 其特征在于, 所述多画面中有至少一个 小画面, 其中, 每个小画面的大小小于所显示场景的真实大小。 10. The apparatus according to claim 9, wherein the multi-picture has at least one small picture, wherein a size of each small picture is smaller than a real size of the displayed picture.
PCT/CN2010/080320 2010-05-14 2010-12-27 Multi-picture synthesis method and system, and media processing device WO2011140812A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN201010180447A CN101860715A (en) 2010-05-14 2010-05-14 Multi-picture synthesis method and system and media processing device
CN201010180447.3 2010-05-14

Publications (1)

Publication Number Publication Date
WO2011140812A1 true WO2011140812A1 (en) 2011-11-17

Family

ID=42946318

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2010/080320 WO2011140812A1 (en) 2010-05-14 2010-12-27 Multi-picture synthesis method and system, and media processing device

Country Status (2)

Country Link
CN (1) CN101860715A (en)
WO (1) WO2011140812A1 (en)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104935866A (en) * 2014-03-19 2015-09-23 华为技术有限公司 Method, synthesis device and system for realizing video conference
CN105451022A (en) * 2015-11-17 2016-03-30 深圳联友科技有限公司 Method of compressing multipath video streams into video stream and system thereof

Families Citing this family (17)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101860715A (en) * 2010-05-14 2010-10-13 中兴通讯股份有限公司 Multi-picture synthesis method and system and media processing device
CN102420968A (en) * 2011-12-15 2012-04-18 广东威创视讯科技股份有限公司 Method and system for displaying video windows in video conference
CN104427293B (en) * 2013-08-23 2018-10-02 南京中兴新软件有限责任公司 A kind of method, apparatus and video terminal for establishing video conference interface
CN104902217B (en) * 2014-03-05 2019-07-16 中兴通讯股份有限公司 A kind of method and device showing layout in netting true conference system
CN104539872A (en) * 2014-12-03 2015-04-22 宁波Gqy视讯股份有限公司 Conference terminal
CN104519307B (en) * 2014-12-12 2018-01-12 华为软件技术有限公司 Meeting management system, continuous presence equipment, branch's field device, video meeting implementing method and system
CN106162045A (en) * 2015-04-17 2016-11-23 中兴通讯股份有限公司 Method for displaying image and device
CN106162046A (en) * 2015-04-24 2016-11-23 中兴通讯股份有限公司 A kind of video conference image rendering method and device thereof
CN106941598A (en) * 2016-01-04 2017-07-11 中兴通讯股份有限公司 Many picture bit stream synthetic methods, many picture bit streams synthesis control method and device
CN109104613A (en) * 2017-06-21 2018-12-28 苏宁云商集团股份有限公司 A kind of VR live broadcasting method and system for realizing the switching of multimachine position
CN107450871A (en) * 2017-06-22 2017-12-08 广州视源电子科技股份有限公司 Wireless screen transmission display method and device and storage medium
CN109068166B (en) * 2018-08-17 2020-02-14 北京达佳互联信息技术有限公司 Video synthesis method, device, equipment and storage medium
JP7230394B2 (en) * 2018-09-25 2023-03-01 京セラドキュメントソリューションズ株式会社 Teleconferencing device and teleconferencing program
CN110262866B (en) * 2019-06-18 2022-06-28 深圳市拔超科技股份有限公司 Screen multi-picture layout switching method and device and readable storage medium
CN112804471A (en) * 2019-11-14 2021-05-14 中兴通讯股份有限公司 Video conference method, conference terminal, server and storage medium
CN113596349B (en) * 2021-07-26 2024-06-04 世邦通信股份有限公司 Conference method, system, device and storage medium for automatic linkage video of speaking position
CN115550599A (en) * 2022-09-22 2022-12-30 苏州科达科技股份有限公司 Audio and video output method for presence meeting place, electronic equipment and storage medium

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1290107A (en) * 1999-12-18 2001-04-04 深圳市中兴通讯股份有限公司 Method for implementing picture-in-picture mode at remote end
CN1476242A (en) * 2002-07-23 2004-02-18 ������������ʽ���� Display system, network answering display device, terminal apparatus and controlling program
CN1878260A (en) * 2006-07-14 2006-12-13 杭州国芯科技有限公司 Multi-menu co-screen playing method
US7554571B1 (en) * 2005-03-18 2009-06-30 Avaya Inc. Dynamic layout of participants in a multi-party video conference
CN101860715A (en) * 2010-05-14 2010-10-13 中兴通讯股份有限公司 Multi-picture synthesis method and system and media processing device

Family Cites Families (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7830409B2 (en) * 2005-03-25 2010-11-09 Cherng-Daw Hwang Split screen video in a multimedia communication system
CN101198008A (en) * 2008-01-03 2008-06-11 中兴通讯股份有限公司 Method and system for implementing multi-screen and multi-picture
CN101291417B (en) * 2008-06-06 2011-03-02 中兴通讯股份有限公司 Polling method and system for videoconference system
CN101640784A (en) * 2008-07-28 2010-02-03 上海领世通信技术发展有限公司 Device and method for controlling multi-image compounding in video conference system
NO333026B1 (en) * 2008-09-17 2013-02-18 Cisco Systems Int Sarl Control system for a local telepresence video conferencing system and method for establishing a video conferencing call.
CN101583011B (en) * 2009-05-27 2012-04-04 华为终端有限公司 Video conference control method and system, video conference network equipment and conference places

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1290107A (en) * 1999-12-18 2001-04-04 深圳市中兴通讯股份有限公司 Method for implementing picture-in-picture mode at remote end
CN1476242A (en) * 2002-07-23 2004-02-18 ������������ʽ���� Display system, network answering display device, terminal apparatus and controlling program
US7554571B1 (en) * 2005-03-18 2009-06-30 Avaya Inc. Dynamic layout of participants in a multi-party video conference
CN1878260A (en) * 2006-07-14 2006-12-13 杭州国芯科技有限公司 Multi-menu co-screen playing method
CN101860715A (en) * 2010-05-14 2010-10-13 中兴通讯股份有限公司 Multi-picture synthesis method and system and media processing device

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104935866A (en) * 2014-03-19 2015-09-23 华为技术有限公司 Method, synthesis device and system for realizing video conference
US9848168B2 (en) 2014-03-19 2017-12-19 Huawei Technologies Co., Ltd. Method, synthesizing device, and system for implementing video conference
CN105451022A (en) * 2015-11-17 2016-03-30 深圳联友科技有限公司 Method of compressing multipath video streams into video stream and system thereof

Also Published As

Publication number Publication date
CN101860715A (en) 2010-10-13

Similar Documents

Publication Publication Date Title
WO2011140812A1 (en) Multi-picture synthesis method and system, and media processing device
JP5198567B2 (en) Video communication method, system and apparatus
AU2011258272B2 (en) Systems and methods for scalable video communication using multiple cameras and multiple monitors
JP5508450B2 (en) Automatic video layout for multi-stream and multi-site telepresence conferencing system
EP1683356B1 (en) Distributed real-time media composer
US9154737B2 (en) User-defined content magnification and multi-point video conference system, method and logic
CN102843542B (en) The media consulation method of multithread meeting, equipment and system
CN101291417B (en) Polling method and system for videoconference system
WO2008131644A1 (en) A method, device and system for realizing picture switching in the video service
US8836753B2 (en) Method, apparatus, and system for processing cascade conference sites in cascade conference
US9961303B2 (en) Video conference virtual endpoints
WO2011116611A1 (en) Method for playing video of tv meeting
WO2015003532A1 (en) Multimedia conferencing establishment method, device and system
JP2005286972A (en) Multi-point conference connection system and multi-point conference connection method
WO2012028018A1 (en) Distributed video processing method and video conference system
WO2014177082A1 (en) Video conference video processing method and terminal
CN108156413B (en) Video conference transmission method and device and MCU
WO2014012384A1 (en) Communication data transmitting method, system and receiving device
WO2014026478A1 (en) Video conference signal processing method, video conference server and video conference system
JP2823571B2 (en) Distributed multipoint teleconferencing equipment
TWI531244B (en) Method and system for processing video data of meeting
WO2021254452A1 (en) Method for controlling video conference system, and multipoint control unit and storage medium
CN118301274A (en) Video conference access method, device, terminal and computer readable storage medium
JPS63193682A (en) Inter-multilocation video conference controller
CN115734028A (en) Media stream pushing method and system based on cascade coding

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 10851316

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 10851316

Country of ref document: EP

Kind code of ref document: A1