WO2012175025A1 - Remotely presented conference system, method for recording and playing back remotely presented conference - Google Patents

Remotely presented conference system, method for recording and playing back remotely presented conference Download PDF

Info

Publication number
WO2012175025A1
WO2012175025A1 PCT/CN2012/077266 CN2012077266W WO2012175025A1 WO 2012175025 A1 WO2012175025 A1 WO 2012175025A1 CN 2012077266 W CN2012077266 W CN 2012077266W WO 2012175025 A1 WO2012175025 A1 WO 2012175025A1
Authority
WO
WIPO (PCT)
Prior art keywords
recording server
conference
recording
video
telepresence
Prior art date
Application number
PCT/CN2012/077266
Other languages
French (fr)
Chinese (zh)
Inventor
吴衍平
Original Assignee
中兴通讯股份有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 中兴通讯股份有限公司 filed Critical 中兴通讯股份有限公司
Publication of WO2012175025A1 publication Critical patent/WO2012175025A1/en

Links

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N9/00Details of colour television systems
    • H04N9/79Processing of colour television signals in connection with recording
    • H04N9/80Transformation of the television signal for recording, e.g. modulation, frequency changing; Inverse transformation for playback
    • H04N9/82Transformation of the television signal for recording, e.g. modulation, frequency changing; Inverse transformation for playback the individual colour picture signal components being recorded simultaneously only
    • H04N9/8205Transformation of the television signal for recording, e.g. modulation, frequency changing; Inverse transformation for playback the individual colour picture signal components being recorded simultaneously only involving the multiplexing of an additional signal and the colour video signal
    • H04N9/8227Transformation of the television signal for recording, e.g. modulation, frequency changing; Inverse transformation for playback the individual colour picture signal components being recorded simultaneously only involving the multiplexing of an additional signal and the colour video signal the additional signal being at least another television signal
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N7/00Television systems
    • H04N7/14Systems for two-way working
    • H04N7/15Conference systems
    • H04N7/155Conference systems involving storage of or access to video conference sessions

Definitions

  • the present invention relates to the field of communications, and in particular to a method for recording and playing back a telepresence conference system and a telepresence conference.
  • BACKGROUND OF THE INVENTION In the related art, the following methods are generally used for recording and playback of a video conference: One is a recording and playback method based on an analog mode, and FIG. 1 can be specifically seen. As shown in FIG. 1, it is independent of the video conference system (by the Multipoint Control Unit (MCU) 101, the video terminal host 103, the camera 106, the microphone 107, the PC 108, the television 109, the speaker 110, and the display 111.
  • MCU Multipoint Control Unit
  • a recording and recording system which includes a recording server 102 deployed in the central computer room and an encoder 104 and a decoder 105 deployed in the conference room, which have acquisition, encoding, transmission, storage, and decoding. And playback features.
  • the disadvantages are: the functions of the acquisition/encoding/decoding and the functions of the video conferencing system are duplicated, resulting in functional redundancy and high system configuration cost; the main lecture site cannot be tracked in real time (the conference site is broadcasted during the conference, during the conference, each conference site has May become a broadcast venue).
  • the other is a digital recording and playback method, as shown in Figure 2. As shown in FIG.
  • a set of recording server 202 is added independently of the video conferencing system (consisting of MCU 201, video terminal host 203, camera 206, microphone 207, PC 208, television 209, speaker 210, display 211).
  • the recording server needs to be digitally connected to the MCU and the venue terminal.
  • the terminal plays back.
  • the advantages are: complementing the functions of the video conference system, and only realizing the functions of multimedia information storage not available in the video conference system; and tracking the lecture site in real time.
  • the telepresence conferencing system is an advanced video conferencing system, which is characterized by the fact that there are multiple video terminal hosts in the venue; Multiple directly or indirectly connected to these video terminal hosts Audio and video capture devices, a dedicated telepresence camera, multiple audio and video output devices, a centrally controlled touch screen, multiple desktop LCD displays, and dedicated conference tables and chairs.
  • Figure 3 shows a typical three-screen telepresence site (the terminal host is not shown in Figure 3).
  • the communication entities in the existing telepresence conference system are generally composed of several video terminals complying with the ITU-T H.323 protocol (for example, for a three-screen telepresence conference site, generally including three video terminals), these terminals and telepresence MCUs. Communicate according to the ITU-T H.323 protocol.
  • the images of the remote presentation camera output have a spatial arrangement relationship.
  • the sounds collected by multiple microphones also have positional properties, and the images and sounds have a spatial relationship.
  • the recording and playback scheme of the conventional video conference system cannot guarantee the natural stitching effect and the sound recognition effect of the image during playback.
  • the present invention provides a remote presentation conference system, a remote presentation conference recording and playback method, to at least solve the related art in a telepresence conference system, using a conventional video conference system recording and playback scheme, can not guarantee playback The natural stitching effect of the image and the problem of the sound recognition effect.
  • a telepresence conferencing system is provided.
  • the telepresence conference system according to the present invention includes: an MCU and a recording server.
  • the MCU includes: a configuration module, configured to add the recording server as a telepresence site to the conference; the first connection establishment module is configured to initiate one or more calls to the recording server to establish a communication connection, where The call setup message carries spatial information of the location of the video associated with the communication connection to be established; the recording server includes: a receiving module configured to receive the media stream from the MCU; and a storage recording module configured to be a storage medium Stream, record the relationship between the various media streams.
  • the association relationship between the media streams includes at least one of the following: a spatial relationship between the video streams; a correspondence between the data streams and the video streams; and a correspondence between the audio streams and the video streams.
  • the foregoing MCU includes: a broadcast module, configured to broadcast a media stream recorded by the recording server according to an association relationship between the media streams in a form of calling a conference.
  • the telepresence conference system further includes: a telepresence conference site; the telepresence conference site includes: a second connection establishment module, configured to establish a point-to-point connection with the recording server; and a presentation module, configured to acquire a record according to an association relationship between the media streams The media stream recorded by the broadcast server is presented.
  • the foregoing second connection establishing module includes: an initiating unit, configured to initiate one or more calls for establishing a connection to the recording server, where the setup message of the call carries a video associated with the communication connection that needs to be established Spatial information of the orientation.
  • the recording server includes: a first sending module, configured to send file list information corresponding to the recorded media stream to one or more of the video terminals; and a second sending module, configured to respond to the user according to the file list information
  • the operation sends a media stream to each video terminal according to an association relationship between the media streams.
  • the communication signaling between the MCU and the recording and recording server, and the communication signaling between the telepresence site and the recording server are signaling based on the H.323 protocol protocol or the IETF SIP protocol. According to another aspect of the present invention, a recording method of a telepresence conference is provided.
  • the recording method of the telepresence conference includes: after the MCU in the telepresence conference system adds the recording server as a telepresence conference site to the conference, the recording server receives one or more initiated from the MCU for establishing a connected call, where the call setup message carries spatial information of the location of the video associated with the communication connection that needs to be established; the recording server receives the media stream from the MCU and stores it, and records between the media streams connection relation.
  • the MCU After the MCU is connected to the recording server, the MCU also performs the following processing on the port connected to the recording server: prohibiting the media stream from the recording server from entering the conference mixer; prohibiting the recording server from joining the conference list It is forbidden to notify the remote presentation site in the telepresence conference system of the upper and lower end information of the recording server; and to transfer the main site, the sub-site or the mixed media stream to the recording server.
  • the association relationship between the media streams includes at least one of the following: a spatial relationship between the video streams; a correspondence between the data streams and the video streams; and a correspondence between the audio streams and the video streams.
  • a playback method of a telepresence conference includes: The MCU in the telepresence conference system will broadcast the media stream recorded by the recording server according to the association relationship between the respective recorded media streams by calling the conference. According to still another aspect of the present invention, a playback method of a telepresence conference is provided.
  • the playback method of the telepresence conference includes: establishing a point-to-point connection between the video terminal of the telepresence site in the telepresence conference system and the recording server; and obtaining the recording server according to the association relationship between the recorded media streams by the telepresence site The recorded media stream is presented.
  • the establishing, by the video terminal, the peer-to-peer connection with the recording server includes: the video terminal initiating one or more calls for establishing a connection to the recording server, where the call setup message carries the video associated with the communication connection to be established. Spatial information of the orientation.
  • the remote presentation site obtains the media stream from the recording server, including: the recording server sends the file list information corresponding to the recorded media stream to one or more of the video terminals; and the recording server responds to the user according to the file list information.
  • the selection operation sends a recorded media stream to each video terminal according to the association relationship between the recorded media streams.
  • FIG. 1 is a schematic diagram of an analog-based video conferencing system according to the related art
  • FIG. 2 is a schematic diagram of a digital-based video conferencing system according to the related art
  • FIG. 3 is a three-screen remote presentation according to the related art.
  • 4 is a structural block diagram of a telepresence conference system according to an embodiment of the present invention
  • FIG. 5 is a structural block diagram of a telepresence conference system according to a preferred embodiment of the present invention
  • FIG. 6 is a telepresence conference according to an example of the present invention.
  • FIG. 7 is a flowchart of a method for recording a telepresence conference according to an embodiment of the present invention
  • FIG. 8 is a flowchart of a method for recording a telepresence conference according to a preferred embodiment of the present invention
  • 9 is a flowchart of a playback method of a telepresence conference according to an embodiment of the present invention
  • FIG. 10 is a flowchart of an on-demand playback method of a telepresence conference according to a preferred embodiment of the present invention.
  • FIG. 4 is a structural block diagram of a telepresence conference system in accordance with an embodiment of the present invention. As shown in FIG.
  • the telepresence conference system mainly includes: a multipoint control unit (MCU) 10 and a recording server 20; the MCU 10 further includes: a configuration module 100, configured to add the recording server as a telepresence site
  • the first connection establishing module 102 is configured to initiate one or more calls to the recording server to establish a communication connection, where the setup message of the call carries a video associated with the communication connection that needs to be established.
  • the recording server 20 further includes: a receiving module 200 configured to receive a media stream from the MCU; and a storage recording module 202 configured to store the media stream and record an association relationship between the media streams.
  • the association relationship between the foregoing media streams includes, but is not limited to, at least one of the following: a spatial relationship between the video streams; a correspondence between the data stream and the video stream; and a correspondence between the audio stream and the video stream.
  • the images of the remote presentation cameras are spatially arranged.
  • the sounds collected by the multiple microphones also have positional properties, and the images and sounds have spatial correspondences.
  • the recording and playback scheme of the traditional video conference system cannot guarantee the natural stitching effect and the sound recognition effect of the image during playback.
  • the MCU 10 carries the spatial information of the location of the video associated with the communication connection to be established in the call setup message, and sends the information to the recording server.
  • the recording server not only needs to store the media stream. , you also need to record the relationship between the various media streams. Thereby, the natural stitching effect and the sound recognition effect of the image during playback can be ensured.
  • the MCU 10 can be considered as a collection of multiple conferences held thereon. Each conference includes multiple telepresence sites, and each telepresence site includes several video terminals. The upper end process of a telepresence site in a conference means that the MCU establishes a connection with several video terminals of the telepresence site in the conference.
  • the recording server can be considered as a collection of several independent telepresence venues.
  • the recording server is added to the conference as a telepresence venue.
  • the telepresence site package Contains several video terminals that are virtualized by the recording server.
  • the upper end process of the telepresence site means a process of establishing a connection between the MCU and a plurality of video terminals virtualized by the recording server.
  • the MCU 10 initiates multiple calls to the recording server 20, and binds the communication connections established by the multiple calls to one communication port to indicate that the connections belong to a remote presentation site.
  • the MCU can perform the following processing on the port connected to the recording server:
  • the recording server is added to the conference as a telepresence venue, it is not equivalent to telepresencing the venue.
  • media streams from the recording server should be prevented from entering the conference mixer, thus interfering with the normal operation of the conference.
  • multiple video terminals in the remote presentation site can physically correspond to one device.
  • the physical device may have many changes in the communication protocol. For example, the communication between the physical device and the MCU and the communication with the recording server are no longer represented by multiple signaling interaction processes, but only as one letter. make. Similarly, the recording and playback signaling of the solution can also adopt a signaling manner.
  • each call carries spatial information of the location of the video associated with the communication connection that needs to be established
  • the call The number should be greater than or equal to the number of screens in the telepresence site with the most screens in the conference that needs to be recorded. Only when the above conditions are met, the spatial information of the orientation in which the video associated with the communication connection to be established is located can be completely transmitted to the recording server. The natural complete stitching effect of the image can be guaranteed during playback.
  • the above calls follow the H.323 protocol procedure.
  • a telepresence site when a telepresence site needs to play back the content stored in the recording server, it also communicates with several video terminals virtualized by the recording server using a communication method conforming to the H.323 protocol.
  • H.323 protocol procedure it is not limited to the H.323 protocol procedure, but other protocol procedures can also be used, for example, the IETF SIP protocol procedure.
  • IETF SIP protocol procedure For the way of playback, there are two situations. One is to convene a conference through the MCU. The MCU broadcasts the recording server and switches the video and audio data stream sent by the recording server to other sites. It can be understood that this process is equivalent to the reverse process of the recording process; preferably, As shown in FIG.
  • the MCU 10 may further include: a broadcast module 104, configured to broadcast a media stream recorded by the recording and recording server in a form of calling a conference.
  • the other is to use the site on-demand method, and the video terminal of the telepresence site communicates with the recording server to establish a point-to-point connection to receive the recorded media and present it on a multimedia output device such as a TV/speaker/display.
  • the telepresence conference system may further include: a telepresence conference site 30; the telepresence conference site 30 includes: a second connection establishment module 300, configured to establish a point-to-point connection with the recording server; Set to get the media stream recorded by the recording server and render.
  • the second connection establishing module 300 may further include: an initiating unit 3000 (not shown in FIG. 5) configured to initiate one or more calls for establishing a connection to the recording server, wherein the call establishment message It carries spatial information about the location of the video associated with the communication connection that needs to be established.
  • the recording and recording server 20 may further include: a first sending module 204, configured to send file list information corresponding to the recorded media stream to one or more of the video terminals; The module 206 is configured to send the recorded media stream to each video terminal according to the association relationship between the recorded media streams in response to the user performing a selection operation according to the file list information.
  • the recording server can also receive and respond to instructions such as fast forward and rewind from one or more video terminals.
  • the playback process may be implemented directly by using the communication terminal device in the telepresence site, or may be implemented by other methods, for example, using PC software.
  • the communication signaling between the MCU 10 and the recording server 20, and the communication signaling between the telepresence conference site 30 and the recording server 20 may be a letter extended based on the H.323 protocol protocol or the IETF SIP protocol protocol. make. Of course, it can also be signaling that is extended based on other protocol procedures. The above preferred embodiment is further described below in conjunction with the example shown in FIG. 6.
  • each venue includes: (1) The camera group 606 includes three cameras, which are set to capture images of local venue participants;
  • the microphone group 607 which includes four microphones, is set to collect audio information of the local site
  • PC group 608 including 1 ⁇ 3 PCs, as the conference dual-stream data source
  • TV group 609 including 3 TVs, set to display remote image information
  • Speaker group 610 including 2 speakers, set to output conference stereo audio information
  • the desktop LCD panel 611 which contains 3 LCD monitors, is set to display dual stream data. It can be understood that in addition to the above devices, other devices or systems for providing a sense of presence of the conference, such as lighting, venue decoration, etc., can be deployed.
  • the remote presentation conference central office system including the MCU 601 and its management system (not shown in FIG. 6) (1) MCU 601, implements multipoint conference control, and broadcasts and exchanges media streams (video and audio) of each venue. And mixing, is the core equipment of the entire conference system;
  • the recording and broadcasting system includes a recording server 602 and a management system thereof (not shown in FIG. 6).
  • the recording server 602 interacts with the MCU 401 and the video terminal host 622 624 of the remote presentation site (interfaces 612, 613) to implement multimedia content storage and live broadcast on demand;
  • Management system Provide the man-machine interface of the recording server administrator to realize system configuration, program editing, and so on.
  • the following describes the recording and playing back of the telepresence conference in which the pure three-screen telepresence site participates, and the above preferred embodiment is specifically described with reference to FIG. 6. It should be noted that other conference networking situations that are not represented by a pure three-screen telepresence site can be recorded and played back by referring to the following methods.
  • the conference is configured.
  • the Wang administrator configures the recording server address on the management system of the MCU 601, and then defines a telepresence conference that needs to be recorded by the pure three-screen telepresence site, and then goes to the MCU 601. After sending this configuration, the start conference command is released.
  • the MCU 601 After receiving the notification from the administrator, the MCU 601 successively initiates three H.225.0 Q.931 calls conforming to the H.323 protocol procedure to the recording server 602.
  • the setup message of the call according to the message extension mechanism defined by the H.225.0 Q.931 protocol, a video related to the communication connection to be established by the call is added in the call setup message (setup.sourcelnfo.nonStandardData.Data message) Spatial information of the left, center, and right directions.
  • the MCU 601 establishes a communication connection with the recording server according to the normal procedure of H.323.
  • the MCU sends a sub-site or main venue or the mixed audio and video data media stream to the recording server according to the H.323 normal procedure or other procedures according to the conference control policy.
  • the stored procedure recording server receives and stores the media streams sent from the three connections, and records the relationship between the media streams, for example, the spatial relationship between the video streams, the correspondence between the data streams and the video streams. Relationships, correspondences between audio streams and video streams, and the like.
  • the playback process is mainly divided into two situations. One is to call the conference through the MCU.
  • the MCU broadcasts the broadcast server and switches the video and audio data stream sent by the recording server to other sites. It can be understood that this process is equivalent to The reverse process of the recording process; the other is to use the site on-demand method, and the video terminal of the telepresence site communicates with the recording server to establish a point-to-point connection to receive the recorded media and present it on a multimedia output device such as a TV/speaker/display.
  • the communication connection establishment process of the latter playback mode will be described in detail below.
  • the three video terminals (622-624) of the telepresence site respectively initiate H.225.0 Q.931 calls to the recording server in accordance with the H.323 protocol procedure or other protocol procedures. Mouth 4.
  • the setup message of the call according to the message extension mechanism defined by the H.225.0 Q.931 protocol, in the call setup message (setup.sourcelnfo.nonStandardData.Data message), the video corresponding to the communication connection to be established by the call is added. Left, center and right space information.
  • the recording server 602 After receiving all the calls, the recording server 602 sends a file list video of the stored content to one or more predetermined video terminals (for example, a three-screen telepresence venue, an agreement intermediate video terminal 623) according to a system pre-agreed, And receive user selection. After determining that the content selected by the user is also a three-screen remote presentation site conference recording content, different media streams are sent to different video terminals according to the stored relationship between the stored media streams. During the playback process, the recording server can also accept and respond to instructions such as fast forward and rewind from the particular video terminal. According to an embodiment of the present invention, based on the above-described telepresence conference system, a recording method of a telepresence conference is also provided.
  • Step S702 After the MCU in the telepresence conference system adds the recording server as a telepresence conference site to the conference, the recording server receives the origination from the MCU.
  • Step S704 The recording server receives the information from the MCU The media stream is stored and recorded, and the relationship between the media streams is recorded.
  • the association relationship between the media streams includes, but is not limited to, at least one of the following: a spatial relationship between the video streams; a correspondence between the data streams and the video streams; and a correspondence between the audio streams and the video streams.
  • the MCU carries the spatial information of the location of the video associated with the communication connection to be established in the call setup message, and sends the information to the recording server.
  • the recording server not only needs to store the media stream. , you also need to record the relationship between the various media streams. Thereby, the natural stitching effect and the sound recognition effect of the image during playback can be ensured.
  • the MCU processes the port connected to the recording server, and the specific processing is as follows:
  • each call carries spatial information of the location of the video associated with the communication connection that needs to be established
  • the call The number should be greater than or equal to the number of screens in the telepresence site with the most screens in the conference that needs to be recorded. Only when the above conditions are met, the spatial information of the orientation in which the video associated with the communication connection to be established is located can be completely transmitted to the recording server. The natural complete stitching effect of the image can be guaranteed during playback.
  • multiple video terminals in the remote presentation site can physically correspond to one device.
  • the physical device may have many changes in the communication protocol.
  • the communication between the physical device and the MCU and the communication with the recording server are no longer represented by multiple signaling interaction processes, but only as one letter. make.
  • the recording and playback signaling of the solution can also adopt a signaling manner.
  • the above calls follow the H.323 protocol procedure.
  • a remote presentation site needs to play back the content stored in the recording server, it also communicates with several video communication terminals virtualized by the recording server by using the communication method conforming to the H.323 protocol.
  • the communication signaling between the MCU and the recording server, the recording server, and the telepresence site may be extended by the message extension mechanism specified by the H.323 protocol.
  • H.323 protocol procedures it is not limited to the above H.323 protocol procedures, but other protocol procedures, such as the IETF SIP protocol procedures, may also be employed.
  • FIG. 8 is a flow chart of a method of recording a telepresence conference in accordance with a preferred embodiment of the present invention. As shown in FIG.
  • the recording method of the telepresence conference mainly includes the following processing: Step S802: The MCU adds the recording server as a telepresence conference site to the conference; Step S804: The MCU calls each site, and the recording server serves as a special telepresence site, and initiates one or more calls to the recording server to establish a communication connection. Step S806: After the call is successful, send the current broadcast site media stream to Recording the server, and indicating the association relationship between the media streams; Step S808: The recording server stores the media stream according to the foregoing indication, and records the association relationship between the media streams.
  • Step S810 The conference ends, and the recording server responds to the administrator operation instruction, edits and publishes the recorded media stream, and waits for the user to order.
  • a playback method of a telepresence conference is also provided.
  • the method mainly includes: The MCU in the remote presentation conference system broadcasts the media stream recorded by the recording server according to the association relationship between the recorded media streams by calling the conference. It should be noted that this playback process is equivalent to the reverse process of the above recording process. Since the media stream recorded by the recording server is broadcast according to the association relationship between the media streams mentioned above during playback, the natural image during playback can be guaranteed. Stitching effect and listening sound recognition effect.
  • Step S902 The video terminal of the telepresence site in the telepresence conference system establishes a point-to-point connection with the recording server;
  • Step S904 The remote presentation site is recorded according to the The association between the media streams obtains the media stream recorded by the recording server and is presented.
  • the telepresence site obtains the media stream recorded by the recording server according to the relationship between the recorded media streams, which can ensure the natural stitching effect and the sound recognition effect of the image during playback.
  • the foregoing step S902 may further include the following steps: the video terminal initiates one or more calls for establishing a connection to the recording server, where the call setup message carries the video associated with the communication connection that needs to be established. Spatial information of the orientation.
  • the above step S904 may further include the following processing:
  • the recording server sends the file list information corresponding to the recorded media stream to one or more of the video terminals;
  • the recording server responds to the user's selection operation according to the file list information, and transmits the recorded media stream to each video terminal according to the association relationship between the recorded media streams.
  • An on-demand playback method of a telepresence conference is further described below in conjunction with FIG. 10 is a flow chart of a method of on-demand playback of a telepresence conference in accordance with a preferred embodiment of the present invention. As shown in FIG. 10, the on-demand playback method of the telepresence conference mainly includes the following processes: Step 1002: The remote presentation site responds to an administrator operation instruction, and needs to initiate a call to the recording server.
  • Step 1004 Remotely present the video terminals of the conference site to initiate a call Recording the call of the server, and carrying the call setup message carrying the spatial information (for example, the location identifier) of the location of the video associated with the communication connection to be established;
  • Step 1006 The recording server receives the call and according to the call The parameter determines the type of the caller's site, and sends a corresponding media stream to each video terminal according to the association relationship between the recorded media streams.
  • Step 1008 The remote presentation site video terminal presents the media stream on the multimedia device according to the requirements of time and space.
  • the recorded content not only contains multimedia content (media stream) in the conference room, but also records the relationship between them (for example, time and space relationship), thereby ensuring the true restoration possibility of the conference scene during playback.
  • the existing video conference recording server can record telepresence conferences through simple software upgrades, which can effectively save and protect user investment.
  • the above modules or steps of the present invention can be implemented by a general-purpose computing device, which can be concentrated on a single computing device or distributed over a network composed of multiple computing devices. Alternatively, they may be implemented by program code executable by the computing device so that they may be stored in the storage device by the computing device, or they may be separately fabricated into individual integrated circuit modules, or Multiple modules or steps are made into a single integrated circuit module.

Abstract

Provided are a remotely presented conference system, and a method for recording and playing back a remotely presented conference.The remotely presented conference system includes: an MCU and a recording and broadcasting server, wherein the MCU includes: a configuration module set up to add into a conference the recording and broadcasting server as a remote conference presenting location, and a first connection establishment module set up to initiate one or more calls towards the recording and broadcasting server to establish a communication connection, wherein an establishment message of the call carries spatial information about the position in which a video associated with the communication connection to be established is located;and the recording and broadcasting server includes: a receiving module set up to receive media streams from the MCU, and a storage and recording module set up to store the media streams and record the correlation between various media streams.The technical solution provided by the present invention can ensure natural splicing effects and sound location effects when playing back images.

Description

远程呈现会议系统、 远程呈现会议的录制与回放方法 技术领域 本发明涉及通信领域, 具体而言, 涉及一种远程呈现会议系统、 远程呈现会议的 录制与回放方法。 背景技术 相关技术中, 传统的视频会议录制与回放主要有以下方法: 一种是基于模拟方式的录制与回放方法, 具体可以参见图 1。 如图 1所示, 独立 于视频会议系统(由多点控制单元(Multipoint Control Unit, 简称为 MCU) 101, 视频 终端主机 103, 摄像机 106、 麦克风 107、 PC108、 电视机 109、 音箱 110、 显示器 111 组成) 之外, 增加一套录播系统, 该录播系统包括部署在中心机房的录播服务器 102 和部署于会议室的编码器 104和解码器 105, 具有采集、 编码、 传输、 存储、 解码和 回放功能。其缺点是: 采集 /编码 /解码等功能与视频会议系统具有的功能重复, 造成功 能冗余, 系统配置成本高; 无法实时跟踪主讲会场 (会议中被广播会场, 会议期间, 每个会场都有可能成为被广播会场)。 另一种是基于数字方式的录制与回放方法, 具体可以参见图 2。 如图 2所示, 独 立于视频会议系统之外 (由 MCU201 , 视频终端主机 203, 摄像机 206、 麦克风 207、 PC208、 电视机 209、 音箱 210、 显示器 211组成), 增加一套录播服务器 202, 该录播 服务器需要和 MCU以及会场终端进行数字对接。录制时,根据和多点控制单元 (MCU) 的交互信令, 接收 MCU送过来的多媒体信息, 并将其储存; 回放时, 根据和终端的 交互信令, 送出存储的多媒体信息给终端, 由终端进行回放。 其优点是: 与视频会议 系统进行功能互补, 仅实现视频会议系统没有的多媒体信息储存等功能; 能实时跟踪 主讲会场。 由于应用和技术上的局限, 传统视频会议系统一般仅需要记录视频、 音频以及数 据内容, 对除此外的其他信息不关注。 下面详细说明远程呈现会议系统相比传统视频会议系统的特点: 远程呈现会议系统是一种高级视频会议系统, 相比于传统视频会议系统而言, 其 特点在于: 会场存在多个视频终端主机; 与这些视频终端主机直接或间接相连的多个 音频和视频采集设备、 一个专用的远程呈现摄像头、 多个音视频输出设备、 一个中控 触摸屏、 多个桌面液晶显示屏以及专用的会议桌椅。 图 3显示了一个典型三屏远程呈 现会场 (图 3中未显示终端主机)。 现有远程呈现会议系统中的通讯实体一般由若干个遵从 ITU-T H.323协议的视频 终端组成(例如, 对于三屏远程呈现会场, 一般包括三个视频终端), 这些终端和远程 呈现 MCU间根据 ITU-T H.323协议通讯。 远程呈现摄像机输出的各路图像存在空间 排列关系, 同时, 多个麦克风采集的声音也具有位置属性, 且图像和声音在空间上有 对应关系。 但是, 在远程呈现会议系统中, 采用传统视频会议系统的录制与回放方案, 无法 保证回放时图像的自然拼接效果和听声辨位效果。 发明内容 本发明提供了一种远程呈现会议系统、 远程呈现会议的录制与回放方法, 以至少 解决相关技术中在远程呈现会议系统中, 采用传统视频会议系统的录制与回放方案, 无法保证回放时图像的自然拼接效果和听声辨位效果的问题。 根据本发明的一个方面, 提供了一种远程呈现会议系统。 根据本发明的远程呈现会议系统包括: MCU以及录播服务器。其中, MCU包括: 配置模块, 设置为将录播服务器作为一个远程呈现会场添加进会议中; 第一连接建立 模块, 设置为向录播服务器发起一个或多个呼叫以建立通信连接, 其中, 该呼叫的建 立消息中携带有与需要建立的通信连接相关联的视频所在的方位的空间信息; 录播服 务器包括: 接收模块, 设置为接收来自于 MCU的媒体流; 存储记录模块, 设置为存 储媒体流, 记录各个媒体流之间的关联关系。 各个媒体流之间的关联关系包括以下至少之一: 视频流之间的空间关系; 数据流 和视频流的对应关系; 音频流和视频流的对应关系。 上述 MCU包括: 广播模块, 设置为通过召集会议的形式, 根据各个媒体流之间 的关联关系广播录播服务器录制的媒体流。 上述远程呈现会议系统还包括: 远程呈现会场; 远程呈现会场包括: 第二连接建 立模块, 设置为与录播服务器建立点对点连接; 获取呈现模块, 设置为根据各个媒体 流之间的关联关系获取录播服务器录制的媒体流并呈现。 上述第二连接建立模块包括: 发起单元, 设置为向录播服务器发起一个或多个用 于建立连接的呼叫, 其中, 该呼叫的建立消息中携带有与需要建立的通信连接相关联 的视频所在的方位的空间信息。 上述录播服务器包括: 第一发送模块, 设置为向视频终端中的一个或多个发送录 制的媒体流所对应的文件列表信息; 第二发送模块, 设置为响应用户根据文件列表信 息进行的选择操作, 根据各个媒体流之间的关联关系, 向各个视频终端发送媒体流。 上述 MCU与录播服务器之间的通信信令、 以及远程呈现会场与录播服务器之间 的通信信令是基于 H.323协议规程或者 IETF SIP协议规程进行扩展的信令。 根据本发明的另一方面, 提供了一种远程呈现会议的录制方法。 根据本发明的远程呈现会议的录制方法包括: 在远程呈现会议系统中的 MCU将 录播服务器作为一个远程呈现会场添加进会议之后, 录播服务器接收来自于 MCU发 起的一个或多个用于建立连接的呼叫, 其中, 呼叫的建立消息中携带有与需要建立的 通信连接相关联的视频所在的方位的空间信息; 录播服务器接收来自于 MCU的媒体 流并存储, 记录各个媒体流之间的关联关系。 上述 MCU与录播服务器建立连接后, 还包括: MCU对与录播服务器相连接的端 口进行以下处理: 禁止来自于录播服务器的媒体流进入会议混音器; 禁止将录播服务 器加入会场列表中; 禁止将录播服务器的上端和下端信息通知远程呈现会议系统中的 远程呈现会场; 使主会场、 分会场或经过混合处理后的媒体流传送至录播服务器。 各个媒体流之间的关联关系包括以下至少之一: 视频流之间的空间关系; 数据流 和视频流的对应关系; 音频流和视频流的对应关系。 当呼叫为多个时, 呼叫的个数大于等于需要录制的会议中具有最多屏幕的远程呈 现会场的屏幕数。 根据本发明的又一方面, 提供了一种远程呈现会议的回放方法。 根据本发明的远程呈现会议的回放方法包括: 远程呈现会议系统中的 MCU将通 过召集会议的形式, 根据各个录制的媒体流之间的关联关系广播录播服务器录制的媒 体流。 根据本发明的又一方面, 提供了一种远程呈现会议的回放方法。 根据本发明的远程呈现会议的回放方法包括: 远程呈现会议系统中的远程呈现会 场的视频终端与录播服务器建立点对点连接; 远程呈现会场根据录制的媒体流之间的 关联关系, 获取录播服务器录制的媒体流并呈现。 上述视频终端与录播服务器建立点对点连接包括: 视频终端向录播服务器发起一 个或多个用于建立连接的呼叫, 其中, 呼叫的建立消息中携带有与需要建立的通信连 接相关联的视频所在的方位的空间信息。 上述远程呈现会场获取来自于录播服务器的媒体流包括: 录播服务器向视频终端 中的一个或多个发送录制的媒体流所对应的文件列表信息; 录播服务器响应用户根据 文件列表信息进行的选择操作, 根据录制的媒体流之间的关联关系, 向各个视频终端 发送录制的媒体流。 通过本发明, 录播服务器录制媒体流时, 记录各个媒体流之间的关联关系, 解决 了相关技术中在远程呈现会议系统中, 采用传统视频会议系统的录制与回放方案, 无 法保证回放时图像的自然拼接效果和听声辨位效果的问题, 进而可以保证回放时图像 的自然拼接效果和听声辨位效果。 附图说明 此处所说明的附图用来提供对本发明的进一步理解, 构成本申请的一部分, 本发 明的示意性实施例及其说明用于解释本发明, 并不构成对本发明的不当限定。 在附图 中: 图 1是根据相关技术的基于模拟方式的视频会议系统的示意图; 图 2是根据相关技术的基于数字方式的视频会议系统的示意图; 图 3是根据相关技术的三屏远程呈现会场的示意图; 图 4是根据本发明实施例的远程呈现会议系统的结构框图; 图 5是根据本发明优选实施例的远程呈现会议系统的结构框图; 图 6是根据本发明实例的远程呈现会议系统的示意图; 图 7是根据本发明实施例的远程呈现会议的录制方法的流程图; 图 8是根据本发明优选实施例的远程呈现会议的录制方法的流程图; 图 9是根据本发明实施例的远程呈现会议的回放方法的流程图; 以及 图 10是根据本发明优选实施例的远程呈现会议的点播回放方法的流程图。 具体实施方式 下文中将参考附图并结合实施例来详细说明本发明。 需要说明的是, 在不冲突的 情况下, 本申请中的实施例及实施例中的特征可以相互组合。 图 4是根据本发明实施例的远程呈现会议系统的结构框图。 如图 4所示, 该远程 呈现会议系统主要包括: 多点控制单元 (MCU) 10以及录播服务器 20; 上述 MCU 10进一步包括: 配置模块 100, 设置为将录播服务器作为一个远程呈 现会场添加进会议中; 第一连接建立模块 102, 设置为向录播服务器发起一个或多个 呼叫以建立通信连接, 其中, 该呼叫的建立消息中携带有与需要建立的通信连接相关 联的视频所在的方位的空间信息; 录播服务器 20进一步包括: 接收模块 200, 设置为接收来自于 MCU的媒体流; 存储记录模块 202, 设置为存储媒体流, 记录各个媒体流之间的关联关系。 其中, 上述各个媒体流之间的关联关系包括但不限于以下至少之一: 视频流之间 的空间关系; 数据流和视频流的对应关系; 音频流和视频流的对应关系。 在远程呈现会议系统中, 远程呈现摄像机输出的各路图像存在空间排列关系, 同 时, 多个麦克风采集的声音也具有位置属性, 且图像和声音在空间上有对应关系。 在 远程呈现会议系统中, 采用传统视频会议系统的录制与回放方案, 无法保证回放时图 像的自然拼接效果和听声辨位效果。图 4所示的远程呈现会议系统中, MCU10在呼叫 的建立消息中携带与需要建立的通信连接相关联的视频所在的方位的空间信息, 发送 给录播服务器, 录播服务器不仅需要存储媒体流, 还需要记录各个媒体流之间的关联 关系。 从而可以保证回放时图像的自然拼接效果和听声辨位效果。 需要注意的是, 可以将 MCU 10看作是其上所开多个会议的集合。 其中, 每个会 议包含多个远程呈现会场, 每个远程呈现会场包含若干个视频终端。 会议中某个远程 呈现会场的上端过程, 意味着在该会议里 MCU和该远程呈现会场的若干个视频终端 建立连接的过程。 同时, 可以将录播服务器看作是若干个独立的远程呈现会场的集合。 在每个需要 录制的会议中, 录播服务器被当作一个远程呈现会场添加进会议。 该远程呈现会场包 含若干个由录播服务器虚拟出的视频终端。该远程呈现会场的上端过程,意味着 MCU 和录播服务器虚拟出的若干个视频终端之间建立连接的过程。 会议开始后, MCU 10发起对录播服务器 20的多个呼叫, 并将此多个呼叫所建立 的通信连接绑定为一个通讯端口, 用以表征这些连接同属于一个远程呈现会场。 录播服务器以及各会场正常上端后, MCU可以对与录播服务器相连接的端口进行 以下处理: The present invention relates to the field of communications, and in particular to a method for recording and playing back a telepresence conference system and a telepresence conference. BACKGROUND OF THE INVENTION In the related art, the following methods are generally used for recording and playback of a video conference: One is a recording and playback method based on an analog mode, and FIG. 1 can be specifically seen. As shown in FIG. 1, it is independent of the video conference system (by the Multipoint Control Unit (MCU) 101, the video terminal host 103, the camera 106, the microphone 107, the PC 108, the television 109, the speaker 110, and the display 111. In addition to the composition, a recording and recording system is added, which includes a recording server 102 deployed in the central computer room and an encoder 104 and a decoder 105 deployed in the conference room, which have acquisition, encoding, transmission, storage, and decoding. And playback features. The disadvantages are: the functions of the acquisition/encoding/decoding and the functions of the video conferencing system are duplicated, resulting in functional redundancy and high system configuration cost; the main lecture site cannot be tracked in real time (the conference site is broadcasted during the conference, during the conference, each conference site has May become a broadcast venue). The other is a digital recording and playback method, as shown in Figure 2. As shown in FIG. 2, a set of recording server 202 is added independently of the video conferencing system (consisting of MCU 201, video terminal host 203, camera 206, microphone 207, PC 208, television 209, speaker 210, display 211). The recording server needs to be digitally connected to the MCU and the venue terminal. During recording, according to the interaction signaling with the multipoint control unit (MCU), receiving the multimedia information sent by the MCU and storing it; during playback, according to the interaction signaling with the terminal, sending the stored multimedia information to the terminal, The terminal plays back. The advantages are: complementing the functions of the video conference system, and only realizing the functions of multimedia information storage not available in the video conference system; and tracking the lecture site in real time. Due to application and technical limitations, traditional video conferencing systems generally only need to record video, audio, and data content, and are not concerned with other information. The following is a detailed description of the features of the telepresence conferencing system compared to the conventional video conferencing system: The telepresence conferencing system is an advanced video conferencing system, which is characterized by the fact that there are multiple video terminal hosts in the venue; Multiple directly or indirectly connected to these video terminal hosts Audio and video capture devices, a dedicated telepresence camera, multiple audio and video output devices, a centrally controlled touch screen, multiple desktop LCD displays, and dedicated conference tables and chairs. Figure 3 shows a typical three-screen telepresence site (the terminal host is not shown in Figure 3). The communication entities in the existing telepresence conference system are generally composed of several video terminals complying with the ITU-T H.323 protocol (for example, for a three-screen telepresence conference site, generally including three video terminals), these terminals and telepresence MCUs. Communicate according to the ITU-T H.323 protocol. The images of the remote presentation camera output have a spatial arrangement relationship. At the same time, the sounds collected by multiple microphones also have positional properties, and the images and sounds have a spatial relationship. However, in the telepresence conference system, the recording and playback scheme of the conventional video conference system cannot guarantee the natural stitching effect and the sound recognition effect of the image during playback. SUMMARY OF THE INVENTION The present invention provides a remote presentation conference system, a remote presentation conference recording and playback method, to at least solve the related art in a telepresence conference system, using a conventional video conference system recording and playback scheme, can not guarantee playback The natural stitching effect of the image and the problem of the sound recognition effect. According to one aspect of the invention, a telepresence conferencing system is provided. The telepresence conference system according to the present invention includes: an MCU and a recording server. The MCU includes: a configuration module, configured to add the recording server as a telepresence site to the conference; the first connection establishment module is configured to initiate one or more calls to the recording server to establish a communication connection, where The call setup message carries spatial information of the location of the video associated with the communication connection to be established; the recording server includes: a receiving module configured to receive the media stream from the MCU; and a storage recording module configured to be a storage medium Stream, record the relationship between the various media streams. The association relationship between the media streams includes at least one of the following: a spatial relationship between the video streams; a correspondence between the data streams and the video streams; and a correspondence between the audio streams and the video streams. The foregoing MCU includes: a broadcast module, configured to broadcast a media stream recorded by the recording server according to an association relationship between the media streams in a form of calling a conference. The telepresence conference system further includes: a telepresence conference site; the telepresence conference site includes: a second connection establishment module, configured to establish a point-to-point connection with the recording server; and a presentation module, configured to acquire a record according to an association relationship between the media streams The media stream recorded by the broadcast server is presented. The foregoing second connection establishing module includes: an initiating unit, configured to initiate one or more calls for establishing a connection to the recording server, where the setup message of the call carries a video associated with the communication connection that needs to be established Spatial information of the orientation. The recording server includes: a first sending module, configured to send file list information corresponding to the recorded media stream to one or more of the video terminals; and a second sending module, configured to respond to the user according to the file list information The operation sends a media stream to each video terminal according to an association relationship between the media streams. The communication signaling between the MCU and the recording and recording server, and the communication signaling between the telepresence site and the recording server are signaling based on the H.323 protocol protocol or the IETF SIP protocol. According to another aspect of the present invention, a recording method of a telepresence conference is provided. The recording method of the telepresence conference according to the present invention includes: after the MCU in the telepresence conference system adds the recording server as a telepresence conference site to the conference, the recording server receives one or more initiated from the MCU for establishing a connected call, where the call setup message carries spatial information of the location of the video associated with the communication connection that needs to be established; the recording server receives the media stream from the MCU and stores it, and records between the media streams connection relation. After the MCU is connected to the recording server, the MCU also performs the following processing on the port connected to the recording server: prohibiting the media stream from the recording server from entering the conference mixer; prohibiting the recording server from joining the conference list It is forbidden to notify the remote presentation site in the telepresence conference system of the upper and lower end information of the recording server; and to transfer the main site, the sub-site or the mixed media stream to the recording server. The association relationship between the media streams includes at least one of the following: a spatial relationship between the video streams; a correspondence between the data streams and the video streams; and a correspondence between the audio streams and the video streams. When there are multiple calls, the number of calls is greater than or equal to the number of screens of the telepresence site with the most screens in the conference that needs to be recorded. According to still another aspect of the present invention, a playback method of a telepresence conference is provided. The playback method of the telepresence conference according to the present invention includes: The MCU in the telepresence conference system will broadcast the media stream recorded by the recording server according to the association relationship between the respective recorded media streams by calling the conference. According to still another aspect of the present invention, a playback method of a telepresence conference is provided. The playback method of the telepresence conference according to the present invention includes: establishing a point-to-point connection between the video terminal of the telepresence site in the telepresence conference system and the recording server; and obtaining the recording server according to the association relationship between the recorded media streams by the telepresence site The recorded media stream is presented. The establishing, by the video terminal, the peer-to-peer connection with the recording server includes: the video terminal initiating one or more calls for establishing a connection to the recording server, where the call setup message carries the video associated with the communication connection to be established. Spatial information of the orientation. The remote presentation site obtains the media stream from the recording server, including: the recording server sends the file list information corresponding to the recorded media stream to one or more of the video terminals; and the recording server responds to the user according to the file list information. The selection operation sends a recorded media stream to each video terminal according to the association relationship between the recorded media streams. Through the invention, when the recording and recording server records the media stream, the association relationship between the media streams is recorded, and the recording and playback scheme of the traditional video conference system in the telepresence conference system in the related art is solved, and the image during playback cannot be guaranteed. The natural stitching effect and the problem of the sound recognition effect can ensure the natural stitching effect and the sound recognition effect of the image during playback. BRIEF DESCRIPTION OF THE DRAWINGS The accompanying drawings, which are set to illustrate,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,, In the drawings: FIG. 1 is a schematic diagram of an analog-based video conferencing system according to the related art; FIG. 2 is a schematic diagram of a digital-based video conferencing system according to the related art; FIG. 3 is a three-screen remote presentation according to the related art. 4 is a structural block diagram of a telepresence conference system according to an embodiment of the present invention; FIG. 5 is a structural block diagram of a telepresence conference system according to a preferred embodiment of the present invention; FIG. 6 is a telepresence conference according to an example of the present invention. FIG. 7 is a flowchart of a method for recording a telepresence conference according to an embodiment of the present invention; FIG. 8 is a flowchart of a method for recording a telepresence conference according to a preferred embodiment of the present invention; 9 is a flowchart of a playback method of a telepresence conference according to an embodiment of the present invention; and FIG. 10 is a flowchart of an on-demand playback method of a telepresence conference according to a preferred embodiment of the present invention. BEST MODE FOR CARRYING OUT THE INVENTION Hereinafter, the present invention will be described in detail with reference to the accompanying drawings. It should be noted that the embodiments in the present application and the features in the embodiments may be combined with each other without conflict. 4 is a structural block diagram of a telepresence conference system in accordance with an embodiment of the present invention. As shown in FIG. 4, the telepresence conference system mainly includes: a multipoint control unit (MCU) 10 and a recording server 20; the MCU 10 further includes: a configuration module 100, configured to add the recording server as a telepresence site The first connection establishing module 102 is configured to initiate one or more calls to the recording server to establish a communication connection, where the setup message of the call carries a video associated with the communication connection that needs to be established. The recording server 20 further includes: a receiving module 200 configured to receive a media stream from the MCU; and a storage recording module 202 configured to store the media stream and record an association relationship between the media streams. The association relationship between the foregoing media streams includes, but is not limited to, at least one of the following: a spatial relationship between the video streams; a correspondence between the data stream and the video stream; and a correspondence between the audio stream and the video stream. In the telepresence conference system, the images of the remote presentation cameras are spatially arranged. At the same time, the sounds collected by the multiple microphones also have positional properties, and the images and sounds have spatial correspondences. In the telepresence conference system, the recording and playback scheme of the traditional video conference system cannot guarantee the natural stitching effect and the sound recognition effect of the image during playback. In the telepresence conference system shown in FIG. 4, the MCU 10 carries the spatial information of the location of the video associated with the communication connection to be established in the call setup message, and sends the information to the recording server. The recording server not only needs to store the media stream. , you also need to record the relationship between the various media streams. Thereby, the natural stitching effect and the sound recognition effect of the image during playback can be ensured. It should be noted that the MCU 10 can be considered as a collection of multiple conferences held thereon. Each conference includes multiple telepresence sites, and each telepresence site includes several video terminals. The upper end process of a telepresence site in a conference means that the MCU establishes a connection with several video terminals of the telepresence site in the conference. At the same time, the recording server can be considered as a collection of several independent telepresence venues. In each conference that needs to be recorded, the recording server is added to the conference as a telepresence venue. The telepresence site package Contains several video terminals that are virtualized by the recording server. The upper end process of the telepresence site means a process of establishing a connection between the MCU and a plurality of video terminals virtualized by the recording server. After the conference starts, the MCU 10 initiates multiple calls to the recording server 20, and binds the communication connections established by the multiple calls to one communication port to indicate that the connections belong to a remote presentation site. After the recording server and the normal upper end of each site, the MCU can perform the following processing on the port connected to the recording server:
( 1 ) 禁止来自于录播服务器的媒体流进入会议混音器; (1) prohibiting the media stream from the recording server from entering the conference mixer;
(2) 禁止将录播服务器加入会场列表中; (2) It is forbidden to add the recording server to the site list;
(3 )禁止将录播服务器的上端和下端信息通知远程呈现会议系统中的远程呈现会 场; (3) It is forbidden to notify the remote presentation site in the telepresence conference system of the upper and lower end information of the recording server;
(4) 使主会场、 分会场或经过混合处理后的媒体流传送至录播服务器。 录播服务器虽然作为一个远程呈现会场添加进会议中, 但其并不等同于远程呈现 会场。 在会议召开过程中, 应当避免来自于录播服务器的媒体流进入会议混音器, 从 而干扰会议的正常进行。 当然, 远程呈现会议系统中, 远程呈现会场内的多个视频终端, 物理上也可对应 一个设备。 进一步说, 此物理设备可以在通讯协议上有诸多变化, 例如, 此物理设备 和 MCU间的通信以及和录播服务器的通信, 不再表现为多个信令交互过程, 而仅表 现为一个信令。 同样的, 本方案录制与回放信令也可以采用一个信令的方式。 当发起的呼叫为多个时 (即录制与回放信令并非采用一个信令的方式, 各个呼叫 均携带有与需要建立的通信连接相关联的视频所在的方位的空间信息),则呼叫的个数 应当大于等于需要录制的会议中具有最多屏幕的远程呈现会场的屏幕数。 只有满足上述条件, 才可以将与需要建立的通信连接相关联的视频所在的方位的 空间信息完整地发送至录播服务器。 在回放时可以保证图像的自然完整拼接效果。 对于采用基于 H.323协议的远程呈现会议系统而言, 上述呼叫遵循 H.323协议规 程。同理,某远程呈现会场在需要回放录播服务器所存储内容时, 也是采用符合 H.323 协议规程的通信方法与录播服务器虚拟出的若干个视频终端进行通信。 当然, 不仅限 于 H.323协议规程, 也可以采用其他协议规程, 例如, IETF SIP协议规程。 对于回放的方式, 可以分两种情况。 一种是通过 MCU召集会议的形式, 由 MCU广播录播服务器,将录播服务器所发 送上的视音频数据流切换给其他会场,可以理解,此过程相当于是录制过程的逆过程; 优选地, 如图 5所示, 上述 MCU 10还可以包括: 广播模块 104, 设置为通过召 集会议的形式, 广播录播服务器录制的媒体流。 另一种是采用会场点播的方式, 由远程呈现会场的视频终端与录播服务器进行通 讯建立点对点连接接收所录制媒体并在电视 /音箱 /显示器等多媒体输出设备上呈现。 优选地, 如图 5所示, 远程呈现会议系统还可以包括: 远程呈现会场 30; 远程呈 现会场 30包括: 第二连接建立模块 300, 设置为与录播服务器建立点对点连接; 获取 呈现模块 302, 设置为获取录播服务器录制的媒体流并呈现。 优选地,第二连接建立模块 300可以进一步包括:发起单元 3000 (图 5中未示出), 设置为向录播服务器发起一个或多个用于建立连接的呼叫, 其中, 该呼叫的建立消息 中携带有与需要建立的通信连接相关联的视频所在的方位的空间信息。 优选地, 如图 5所示, 上述录播服务器 20可以进一步包括: 第一发送模块 204, 设置为向视频终端中的一个或多个发送录制的媒体流所对应的文件列表信息; 第二发 送模块 206, 设置为响应用户根据文件列表信息进行的选择操作, 根据录制的媒体流 之间的关联关系, 向各个视频终端发送录制的媒体流。 回放进程中, 录播服务器还可接收并响应从一个或多个视频终端传过来的快进快 退等指令。 在具体实施过程中, 回放过程可以是直接利用远程呈现会场中通信终端设 备实现, 也可以是采用其他方式, 例如, 采用 PC软件实现。 优选地, MCU 10与录播服务器 20之间的通信信令、 以及远程呈现会场 30与录 播服务器 20之间的通信信令可以是基于 H.323协议规程或者 IETF SIP协议规程进行 扩展的信令。 当然, 也可以是基于其他协议规程进行扩展的信令。 以下结合图 6所示的示例进一步描述上述优选实施方式。 图 6是根据本发明实例的远程呈现会议系统的示意图。如图 6所示, 该系统包括: 多个远程呈现会议会场、 远程呈现会议局端系统、 以及录播系统。 其中, 每个会场均包括: ( 1 ) 摄像机组 606, 包含 3个摄像头, 设置为拍摄本地会场与会者图像; (4) Transfer the main site, the breakout site, or the mixed media stream to the recording server. Although the recording server is added to the conference as a telepresence venue, it is not equivalent to telepresencing the venue. During the conference, media streams from the recording server should be prevented from entering the conference mixer, thus interfering with the normal operation of the conference. Of course, in the telepresence conference system, multiple video terminals in the remote presentation site can physically correspond to one device. Further, the physical device may have many changes in the communication protocol. For example, the communication between the physical device and the MCU and the communication with the recording server are no longer represented by multiple signaling interaction processes, but only as one letter. make. Similarly, the recording and playback signaling of the solution can also adopt a signaling manner. When there are multiple calls (ie, recording and playback signaling is not in a signaling manner, each call carries spatial information of the location of the video associated with the communication connection that needs to be established), then the call The number should be greater than or equal to the number of screens in the telepresence site with the most screens in the conference that needs to be recorded. Only when the above conditions are met, the spatial information of the orientation in which the video associated with the communication connection to be established is located can be completely transmitted to the recording server. The natural complete stitching effect of the image can be guaranteed during playback. For telepresence conferencing systems based on the H.323 protocol, the above calls follow the H.323 protocol procedure. Similarly, when a telepresence site needs to play back the content stored in the recording server, it also communicates with several video terminals virtualized by the recording server using a communication method conforming to the H.323 protocol. Of course, it is not limited to the H.323 protocol procedure, but other protocol procedures can also be used, for example, the IETF SIP protocol procedure. For the way of playback, there are two situations. One is to convene a conference through the MCU. The MCU broadcasts the recording server and switches the video and audio data stream sent by the recording server to other sites. It can be understood that this process is equivalent to the reverse process of the recording process; preferably, As shown in FIG. 5, the MCU 10 may further include: a broadcast module 104, configured to broadcast a media stream recorded by the recording and recording server in a form of calling a conference. The other is to use the site on-demand method, and the video terminal of the telepresence site communicates with the recording server to establish a point-to-point connection to receive the recorded media and present it on a multimedia output device such as a TV/speaker/display. Preferably, as shown in FIG. 5, the telepresence conference system may further include: a telepresence conference site 30; the telepresence conference site 30 includes: a second connection establishment module 300, configured to establish a point-to-point connection with the recording server; Set to get the media stream recorded by the recording server and render. Preferably, the second connection establishing module 300 may further include: an initiating unit 3000 (not shown in FIG. 5) configured to initiate one or more calls for establishing a connection to the recording server, wherein the call establishment message It carries spatial information about the location of the video associated with the communication connection that needs to be established. Preferably, as shown in FIG. 5, the recording and recording server 20 may further include: a first sending module 204, configured to send file list information corresponding to the recorded media stream to one or more of the video terminals; The module 206 is configured to send the recorded media stream to each video terminal according to the association relationship between the recorded media streams in response to the user performing a selection operation according to the file list information. During the playback process, the recording server can also receive and respond to instructions such as fast forward and rewind from one or more video terminals. In the specific implementation process, the playback process may be implemented directly by using the communication terminal device in the telepresence site, or may be implemented by other methods, for example, using PC software. Preferably, the communication signaling between the MCU 10 and the recording server 20, and the communication signaling between the telepresence conference site 30 and the recording server 20 may be a letter extended based on the H.323 protocol protocol or the IETF SIP protocol protocol. make. Of course, it can also be signaling that is extended based on other protocol procedures. The above preferred embodiment is further described below in conjunction with the example shown in FIG. 6. 6 is a schematic diagram of a telepresence conference system in accordance with an example of the present invention. As shown in FIG. 6, the system includes: a plurality of telepresence conference venues, a telepresence conference office system, and a recording system. Among them, each venue includes: (1) The camera group 606 includes three cameras, which are set to capture images of local venue participants;
(2) 麦克风组 607, 包含 4个麦克风, 设置为采集本地会场的音频信息; (2) The microphone group 607, which includes four microphones, is set to collect audio information of the local site;
(3 ) PC组 608, 包含 1~3台 PC, 作为会议双流数据源; (3) PC group 608, including 1~3 PCs, as the conference dual-stream data source;
(4) 电视机组 609, 包含 3个电视, 设置为显示远端图像信息; ( 5 ) 音箱组 610, 包含 2个音箱, 设置为输出会议立体声音频信息; (4) TV group 609, including 3 TVs, set to display remote image information; (5) Speaker group 610, including 2 speakers, set to output conference stereo audio information;
(6) 桌面液晶显示器组 611, 包含 3个液晶显示器, 设置为显示双流数据。 可以理解, 除上述设备外, 还可以部署其他用以提供会议临场感的其他设备或系 统, 如灯光、 会场装修装饰等。 其中, 远程呈现会议局端系统, 包括 MCU 601及其管理系统 (图 6中未示出) ( 1 ) MCU 601 , 实现多点会议控制, 各会场媒体流 (视音频以及数据) 的广播、 交换以及混合, 是整套会议系统的核心设备; (6) The desktop LCD panel 611, which contains 3 LCD monitors, is set to display dual stream data. It can be understood that in addition to the above devices, other devices or systems for providing a sense of presence of the conference, such as lighting, venue decoration, etc., can be deployed. The remote presentation conference central office system, including the MCU 601 and its management system (not shown in FIG. 6) (1) MCU 601, implements multipoint conference control, and broadcasts and exchanges media streams (video and audio) of each venue. And mixing, is the core equipment of the entire conference system;
(2) 管理系统, 提供局端管理人员的人机界面。 其中, 录播系统, 包括录播服务器 602及其管理系统 (图 6中未示出)。 (2) Management system, providing the human-machine interface of the central management personnel. The recording and broadcasting system includes a recording server 602 and a management system thereof (not shown in FIG. 6).
( 1 ) 录播服务器 602, 与 MCU401 以及远程呈现会场的视频终端主机 622 624 的数字交互 (接口 612、 613 ) , 实现多媒体内容存储与直播点播; (1) The recording server 602 interacts with the MCU 401 and the video terminal host 622 624 of the remote presentation site (interfaces 612, 613) to implement multimedia content storage and live broadcast on demand;
(2)管理系统: 提供录播服务器管理人员的人机界面, 实现系统配置、 节目编辑 等。 下面以纯三屏远程呈现会场参与的远程呈现会议的录制与回放为例, 结合图 6对 上述优选实施方式进行具体说明。 需要注意的是, 其他非由纯三屏远程呈现会场参会 的会议组网情况可以参考下述方法实现录制和回放。 (2) Management system: Provide the man-machine interface of the recording server administrator to realize system configuration, program editing, and so on. The following describes the recording and playing back of the telepresence conference in which the pure three-screen telepresence site participates, and the above preferred embodiment is specifically described with reference to FIG. 6. It should be noted that other conference networking situations that are not represented by a pure three-screen telepresence site can be recorded and played back by referring to the following methods.
1、 会议配置过禾 '王 管理员在 MCU 601的管理系统上先配置好录播服务器地址,接着定义一个需要录 制的由纯三屏远程呈现会场参与的远程呈现会议, 接着在向 MCU 601下发这个配置 后, 下达开始会议命令。 1. The conference is configured. The Wang administrator configures the recording server address on the management system of the MCU 601, and then defines a telepresence conference that needs to be recorded by the pure three-screen telepresence site, and then goes to the MCU 601. After sending this configuration, the start conference command is released.
2、 录制过禾 ( 1 ) 通信连接建立过程 2, recorded over Wo (1) Communication connection establishment process
MCU 601在接收到管理者发出的开会通知后, 向录播服务器 602先后发起三个符 合 H.323协议规程定义的 H.225.0 Q.931呼叫。 在呼叫的 setup消息中, 根据 H.225.0 Q.931 协 议 定 义 的 消 息 扩 充 机 制 , 在 呼 叫 的 建 立 消 息 ( setup. sourcelnfo.nonStandardData.Data 消息)中增加反映该呼叫所要建立的通信连接 所相关的视频所在的左、 中、 右方位空间信息。 其后 MCU 601按 H.323正常规程与录播服务器建立通信连接。 After receiving the notification from the administrator, the MCU 601 successively initiates three H.225.0 Q.931 calls conforming to the H.323 protocol procedure to the recording server 602. In the setup message of the call, according to the message extension mechanism defined by the H.225.0 Q.931 protocol, a video related to the communication connection to be established by the call is added in the call setup message (setup.sourcelnfo.nonStandardData.Data message) Spatial information of the left, center, and right directions. Thereafter, the MCU 601 establishes a communication connection with the recording server according to the normal procedure of H.323.
(2) 会议控制过程 录播服务器以及各会场正常上端后, MCU 401需要对连接的录播服务器 602的端 口做如下处理: (2) Conference control process After the recording server and the normal upper end of each site, the MCU 401 needs to process the port of the connected recording server 602 as follows:
(2.1 ) 不允许录播服务器传过来的音频信号进入会议混音器; (2.1) The audio signal transmitted from the recording server is not allowed to enter the conference mixer;
(2.2)不接收录播服务器传过来的视频信号, 也不允许其他会场或者管理员广播 /选看 /切换录播服务器所发送过来的视频信号; (2.2) Do not receive the video signal transmitted from the recording server, nor allow other sites or administrators to broadcast/select/switch the video signal sent by the recording server;
(2.3 )不将录播服务器所代表的远程呈现会场放入会场列表, 也不通知其他会场 此会场的上端和下端信息; (2.3) Do not put the telepresence site represented by the recording server into the site list, and do not notify other sites of the upper and lower end of the site;
(2.4) MCU根据会议控制策略将某分会场或主会场或经过混合处理后的音视频 数据媒体流按 H.323正常规程或其他规程发送给录播服务器。 (2.4) The MCU sends a sub-site or main venue or the mixed audio and video data media stream to the recording server according to the H.323 normal procedure or other procedures according to the conference control policy.
(3 ) 存储过程 录播服务器接收从三个连接上发送过来的媒体流并储存, 同时记录各媒体流之间 的相互关联关系, 例如, 视频流间的空间关系、 数据流和视频流的对应关系以及音频 流和视频流的对应关系等。 (3) The stored procedure recording server receives and stores the media streams sent from the three connections, and records the relationship between the media streams, for example, the spatial relationship between the video streams, the correspondence between the data streams and the video streams. Relationships, correspondences between audio streams and video streams, and the like.
3、 回放过程 主要分两种情况, 一种是通过 MCU召集会议的形式, 由 MCU广播录播服务器, 将录播服务器所发送的视音频数据流切换给其他会场, 可以理解, 此过程相当于是录 制过程的逆过程; 另一种是采用会场点播的方式, 由远程呈现会场的视频终端与录播 服务器进行通讯建立点对点连接接收所录制媒体并在电视 /音箱 /显示器等多媒体输出 设备上呈现。 下面详细介绍后一种回放方式的通信连接建立过程。 远程呈现会场的三个视频终端(622-624 )在接收到管理者发出的开会通知后, 分 别向录播服务器先后发起符合 H.323协议规程或其他协议规程定义的 H.225.0 Q.931呼 口 4。 在呼叫的 setup消息中, 根据 H.225.0 Q.931协议定义的消息扩充机制, 在呼叫的 建立消息 ( setup. sourcelnfo.nonStandardData.Data消息) 中增加反映该呼叫所要建立的 通信连接所相关视频所在的左、 中和右空间信息。 录播服务器 602在接收到所有呼叫后, 根据系统预先约定, 向一个或多个预定视 频终端 (例如, 对三屏远程呈现会场, 约定中间的视频终端 623)发送所储存内容的文件 列表视频, 并接收用户选择。 在判定用户所选文件内容同样是一个三屏远程呈现会场 会议录制内容后, 根据所存储的媒体流相互间的关联关系记录, 向不同视频终端送出 不同媒体流。 回放进程中, 录播服务器还可接受并响应从该特定视频终端传过来的快进快退等 指令。 根据本发明的实施例, 基于上述远程呈现会议系统, 还提供了一种远程呈现会议 的录制方法。 图 7是根据本发明实施例的远程呈现会议的录制方法的流程图。 如图 7所示, 该 远程呈现会议的录制方法包括以下处理: 步骤 S702: 在远程呈现会议系统中的 MCU将录播服务器作为一个远程呈现会场 添加进会议之后, 录播服务器接收来自于 MCU发起的一个或多个用于建立连接的呼 口 , 其中, 呼叫的建立消息中携带有与需要建立的通信连接相关联的视频所在的方位 的空间信息; 步骤 S704: 录播服务器接收来自于 MCU的媒体流并存储, 记录各个媒体流之间 的关联关系。 其中, 各个媒体流之间的关联关系包括但不限于以下至少之一: 视频流之间的空 间关系; 数据流和视频流的对应关系; 音频流和视频流的对应关系。 图 7所示的远程呈现会议方法中, MCU在呼叫的建立消息中携带与需要建立的通 信连接相关联的视频所在的方位的空间信息, 发送给录播服务器, 录播服务器不仅需 要存储媒体流, 还需要记录各个媒体流之间的关联关系。 从而可以保证回放时图像的 自然拼接效果和听声辨位效果。 优选地, MCU与录播服务器建立连接后, MCU对与录播服务器相连接的端口进 行处理, 具体处理如下: 3. The playback process is mainly divided into two situations. One is to call the conference through the MCU. The MCU broadcasts the broadcast server and switches the video and audio data stream sent by the recording server to other sites. It can be understood that this process is equivalent to The reverse process of the recording process; the other is to use the site on-demand method, and the video terminal of the telepresence site communicates with the recording server to establish a point-to-point connection to receive the recorded media and present it on a multimedia output device such as a TV/speaker/display. The communication connection establishment process of the latter playback mode will be described in detail below. After receiving the notification from the administrator, the three video terminals (622-624) of the telepresence site respectively initiate H.225.0 Q.931 calls to the recording server in accordance with the H.323 protocol procedure or other protocol procedures. Mouth 4. In the setup message of the call, according to the message extension mechanism defined by the H.225.0 Q.931 protocol, in the call setup message (setup.sourcelnfo.nonStandardData.Data message), the video corresponding to the communication connection to be established by the call is added. Left, center and right space information. After receiving all the calls, the recording server 602 sends a file list video of the stored content to one or more predetermined video terminals (for example, a three-screen telepresence venue, an agreement intermediate video terminal 623) according to a system pre-agreed, And receive user selection. After determining that the content selected by the user is also a three-screen remote presentation site conference recording content, different media streams are sent to different video terminals according to the stored relationship between the stored media streams. During the playback process, the recording server can also accept and respond to instructions such as fast forward and rewind from the particular video terminal. According to an embodiment of the present invention, based on the above-described telepresence conference system, a recording method of a telepresence conference is also provided. 7 is a flow chart of a method of recording a telepresence conference in accordance with an embodiment of the present invention. As shown in FIG. 7, the recording method of the telepresence conference includes the following processing: Step S702: After the MCU in the telepresence conference system adds the recording server as a telepresence conference site to the conference, the recording server receives the origination from the MCU. One or more call ports for establishing a connection, wherein the setup message of the call carries spatial information of the location of the video associated with the communication connection that needs to be established; Step S704: The recording server receives the information from the MCU The media stream is stored and recorded, and the relationship between the media streams is recorded. The association relationship between the media streams includes, but is not limited to, at least one of the following: a spatial relationship between the video streams; a correspondence between the data streams and the video streams; and a correspondence between the audio streams and the video streams. In the telepresence conference method shown in FIG. 7, the MCU carries the spatial information of the location of the video associated with the communication connection to be established in the call setup message, and sends the information to the recording server. The recording server not only needs to store the media stream. , you also need to record the relationship between the various media streams. Thereby, the natural stitching effect and the sound recognition effect of the image during playback can be ensured. Preferably, after the MCU establishes a connection with the recording server, the MCU processes the port connected to the recording server, and the specific processing is as follows:
( 1 ) 禁止来自于录播服务器的媒体流进入会议混音器; (1) prohibiting the media stream from the recording server from entering the conference mixer;
(2) 禁止将录播服务器加入会场列表中; (3 )禁止将录播服务器的上端和下端信息通知远程呈现会议系统中的远程呈现会 场; (2) It is forbidden to add the recording server to the site list; (3) It is forbidden to notify the remote presentation site in the telepresence conference system of the upper and lower end information of the recording server;
(4) 使主会场、 分会场或经过混合处理后的媒体流传送至录播服务器。 当发起的呼叫为多个时 (即录制与回放信令并非采用一个信令的方式, 各个呼叫 均携带有与需要建立的通信连接相关联的视频所在的方位的空间信息),则呼叫的个数 应当大于等于需要录制的会议中具有最多屏幕的远程呈现会场的屏幕数。 只有满足上述条件, 才可以将与需要建立的通信连接相关联的视频所在的方位的 空间信息完整地发送至录播服务器。 在回放时可以保证图像的自然完整拼接效果。 当然, 远程呈现会议系统中, 远程呈现会场内的多个视频终端, 物理上也可对应 一个设备。 进一步说, 此物理设备可以在通讯协议上有诸多变化, 例如, 此物理设备 和 MCU间的通信以及和录播服务器的通信, 不再表现为多个信令交互过程, 而仅表 现为一个信令。 同样的, 本方案录制与回放信令也可以采用一个信令的方式。 对于采用基于 H.323协议的远程呈现会议系统而言, 上述呼叫遵循 H.323协议规 程。同理,某远程呈现会场在需要回放录播服务器所存储内容时, 也是采用符合 H.323 协议规程的通信方法与录播服务器虚拟出的若干个视频通讯终端进行通信。 上述录制 和回放中, MCU和录播服务器、录播服务器和远程呈现会场间的通信信令, 可以采用 H.323 协议规程规定的消息扩充机制对标准交互信令进行扩展。 当然, 也不仅限于上 述 H.323协议规程, 还可以采用其他协议规程, 例如, IETF SIP协议规程。 以下结合图 8进一步描述上述优选实施过程。 图 8是根据本发明优选实施例的远程呈现会议的录制方法的流程图。如图 8所示, 该远程呈现会议的录制方法主要包括以下处理: 步骤 S802: MCU将录播服务器作为一个远程呈现会场添加进会议中; 步骤 S804: MCU呼叫各个会场, 并录播服务器作为一个特殊的远程呈现会场, 向录播服务器发起一个或多个呼叫以建立通信连接; 步骤 S806: 呼叫成功后, 将当前广播会场媒体流发送至录播服务器, 并指示各个 媒体流之间的关联关系; 步骤 S808: 录播服务器根据上述指示存储媒体流, 并记录各个媒体流之间的关联 关系。 步骤 S810: 会议结束, 录播服务器响应管理员操作指令, 对录制的媒体流进行编 辑和发布, 等待用户点播。 根据本发明的实施例, 基于上述远程呈现会议系统, 还提供了一种远程呈现会议 的回放方法。 该方法主要包括: 上述远程呈现会议系统中的 MCU将通过召集会议的 形式, 根据录制的媒体流之间的关联关系广播录播服务器录制的媒体流。 需要注意的是, 此回放过程相当于是上述录制过程的逆过程, 由于回放时, 根据 上面提到的各个媒体流之间的关联关系广播录播服务器录制的媒体流, 可以保证回放 时图像的自然拼接效果和听声辨位效果。 根据本发明的实施例, 基于上述远程呈现会议系统, 还提供了另一种远程呈现会 议的回放方法。 图 9是根据本发明实施例的远程呈现会议的回放方法的流程图。 如图 9所示, 该 远程呈现会议的回放方法主要包括以下处理: 步骤 S902:远程呈现会议系统中的远程呈现会场的视频终端与录播服务器建立点 对点连接; 步骤 S904:远程呈现会场根据录制的媒体流之间的关联关系获取录播服务器录制 的媒体流并呈现。 由于回放时, 远程呈现会场根据录制的媒体流之间的关联关系获取录播服务器录 制的媒体流, 可以保证回放时图像的自然拼接效果和听声辨位效果。 优选地,上述步骤 S902可以进一步包括以下处理:视频终端向录播服务器发起一 个或多个用于建立连接的呼叫, 其中, 呼叫的建立消息中携带有与需要建立的通信连 接相关联的视频所在的方位的空间信息。 优选地, 上述步骤 S904可以进一步包括以下处理: (4) Transfer the main site, the breakout site, or the mixed media stream to the recording server. When there are multiple calls (ie, recording and playback signaling is not in a signaling manner, each call carries spatial information of the location of the video associated with the communication connection that needs to be established), then the call The number should be greater than or equal to the number of screens in the telepresence site with the most screens in the conference that needs to be recorded. Only when the above conditions are met, the spatial information of the orientation in which the video associated with the communication connection to be established is located can be completely transmitted to the recording server. The natural complete stitching effect of the image can be guaranteed during playback. Of course, in the telepresence conference system, multiple video terminals in the remote presentation site can physically correspond to one device. Further, the physical device may have many changes in the communication protocol. For example, the communication between the physical device and the MCU and the communication with the recording server are no longer represented by multiple signaling interaction processes, but only as one letter. make. Similarly, the recording and playback signaling of the solution can also adopt a signaling manner. For telepresence conferencing systems based on the H.323 protocol, the above calls follow the H.323 protocol procedure. In the same way, when a remote presentation site needs to play back the content stored in the recording server, it also communicates with several video communication terminals virtualized by the recording server by using the communication method conforming to the H.323 protocol. In the above recording and playback, the communication signaling between the MCU and the recording server, the recording server, and the telepresence site may be extended by the message extension mechanism specified by the H.323 protocol. Of course, it is not limited to the above H.323 protocol procedures, but other protocol procedures, such as the IETF SIP protocol procedures, may also be employed. The above preferred implementation process is further described below in conjunction with FIG. 8 is a flow chart of a method of recording a telepresence conference in accordance with a preferred embodiment of the present invention. As shown in FIG. 8, the recording method of the telepresence conference mainly includes the following processing: Step S802: The MCU adds the recording server as a telepresence conference site to the conference; Step S804: The MCU calls each site, and the recording server serves as a special telepresence site, and initiates one or more calls to the recording server to establish a communication connection. Step S806: After the call is successful, send the current broadcast site media stream to Recording the server, and indicating the association relationship between the media streams; Step S808: The recording server stores the media stream according to the foregoing indication, and records the association relationship between the media streams. Step S810: The conference ends, and the recording server responds to the administrator operation instruction, edits and publishes the recorded media stream, and waits for the user to order. According to an embodiment of the present invention, based on the telepresence conference system described above, a playback method of a telepresence conference is also provided. The method mainly includes: The MCU in the remote presentation conference system broadcasts the media stream recorded by the recording server according to the association relationship between the recorded media streams by calling the conference. It should be noted that this playback process is equivalent to the reverse process of the above recording process. Since the media stream recorded by the recording server is broadcast according to the association relationship between the media streams mentioned above during playback, the natural image during playback can be guaranteed. Stitching effect and listening sound recognition effect. According to an embodiment of the present invention, based on the above-described telepresence conference system, another playback method of the telepresence conference is also provided. 9 is a flow chart of a method of playing back a telepresence conference in accordance with an embodiment of the present invention. As shown in FIG. 9, the playback method of the telepresence conference mainly includes the following processes: Step S902: The video terminal of the telepresence site in the telepresence conference system establishes a point-to-point connection with the recording server; Step S904: The remote presentation site is recorded according to the The association between the media streams obtains the media stream recorded by the recording server and is presented. During playback, the telepresence site obtains the media stream recorded by the recording server according to the relationship between the recorded media streams, which can ensure the natural stitching effect and the sound recognition effect of the image during playback. Preferably, the foregoing step S902 may further include the following steps: the video terminal initiates one or more calls for establishing a connection to the recording server, where the call setup message carries the video associated with the communication connection that needs to be established. Spatial information of the orientation. Preferably, the above step S904 may further include the following processing:
( 1 )录播服务器向视频终端中的一个或多个发送录制的媒体流所对应的文件列表 信息; (1) the recording server sends the file list information corresponding to the recorded media stream to one or more of the video terminals;
(2)录播服务器响应用户根据文件列表信息进行的选择操作,根据录制的媒体流 之间的关联关系, 向各个视频终端发送录制的媒体流。 以下结合图 10进一步描述远程呈现会议的点播回放方法。 图 10 是根据本发明优选实施例的远程呈现会议的点播回放方法的流程图。 如图 10所示, 该远程呈现会议的点播回放方法主要包括以下处理: 步骤 1002: 远程呈现会场响应管理员操作指令, 需要向录播服务器发起呼叫; 步骤 1004: 远程呈现会场各个视频终端发起对录播服务器的呼叫, 并在呼叫的建 立消息中携带有与需要建立的通信连接相关联的视频所在的方位的空间信息 (例如, 方位标识); 步骤 1006: 录播服务器接收呼叫, 并根据呼叫参数判断呼叫方会场类型, 根据录 制的媒体流之间的关联关系向各个视频终端发出相应的媒体流; 步骤 1008: 远程呈现会场视频终端将媒体流按照时间与空间的要求呈现在多媒体 设备上。 综上所述, 借助本发明提供的上述实施例, 可以实现对远程呈现会议的录制和回 放。录制内容不仅包含会议室内的多媒体内容(媒体流), 而且还能记录它们之间的关 联关系 (例如, 时空关系), 从而保证了回放时会议场景的真实还原可能性。 此外, 现 有视频会议录播服务器仅通过简单的软件升级, 即可录制远程呈现会议, 可以有效节 省和保护用户投资。 显然, 本领域的技术人员应该明白, 上述的本发明的各模块或各步骤可以用通用 的计算装置来实现, 它们可以集中在单个的计算装置上, 或者分布在多个计算装置所 组成的网络上, 可选地, 它们可以用计算装置可执行的程序代码来实现, 从而可以将 它们存储在存储装置中由计算装置来执行,或者将它们分别制作成各个集成电路模块, 或者将它们中的多个模块或步骤制作成单个集成电路模块来实现。 这样, 本发明不限 制于任何特定的硬件和软件结合。 以上仅为本发明的优选实施例而已, 并不用于限制本发明, 对于本领域的技术人 员来说, 本发明可以有各种更改和变化。 凡在本发明的精神和原则之内, 所作的任何 修改、 等同替换、 改进等, 均应包含在本发明的保护范围之内。 (2) The recording server responds to the user's selection operation according to the file list information, and transmits the recorded media stream to each video terminal according to the association relationship between the recorded media streams. An on-demand playback method of a telepresence conference is further described below in conjunction with FIG. 10 is a flow chart of a method of on-demand playback of a telepresence conference in accordance with a preferred embodiment of the present invention. As shown in FIG. 10, the on-demand playback method of the telepresence conference mainly includes the following processes: Step 1002: The remote presentation site responds to an administrator operation instruction, and needs to initiate a call to the recording server. Step 1004: Remotely present the video terminals of the conference site to initiate a call Recording the call of the server, and carrying the call setup message carrying the spatial information (for example, the location identifier) of the location of the video associated with the communication connection to be established; Step 1006: The recording server receives the call and according to the call The parameter determines the type of the caller's site, and sends a corresponding media stream to each video terminal according to the association relationship between the recorded media streams. Step 1008: The remote presentation site video terminal presents the media stream on the multimedia device according to the requirements of time and space. In summary, with the above embodiments provided by the present invention, recording and playback of a telepresence conference can be realized. The recorded content not only contains multimedia content (media stream) in the conference room, but also records the relationship between them (for example, time and space relationship), thereby ensuring the true restoration possibility of the conference scene during playback. In addition, the existing video conference recording server can record telepresence conferences through simple software upgrades, which can effectively save and protect user investment. Obviously, those skilled in the art should understand that the above modules or steps of the present invention can be implemented by a general-purpose computing device, which can be concentrated on a single computing device or distributed over a network composed of multiple computing devices. Alternatively, they may be implemented by program code executable by the computing device so that they may be stored in the storage device by the computing device, or they may be separately fabricated into individual integrated circuit modules, or Multiple modules or steps are made into a single integrated circuit module. Thus, the invention is not limited to any specific combination of hardware and software. The above are only the preferred embodiments of the present invention, and are not intended to limit the present invention, and various modifications and changes can be made to the present invention. Any modifications, equivalent substitutions, improvements, etc. made within the spirit and scope of the present invention are intended to be included within the scope of the present invention.

Claims

权 利 要 求 书 Claims
1. 一种远程呈现会议系统, 包括: 多点控制单元 MCU以及录播服务器; A telepresence conference system, comprising: a multipoint control unit MCU and a recording server;
所述 MCU包括:  The MCU includes:
配置模块,设置为将所述录播服务器作为一个远程呈现会场添加进会议中; 第一连接建立模块, 设置为向所述录播服务器发起一个或多个呼叫以建立 通信连接, 其中, 该呼叫的建立消息中携带有与需要建立的通信连接相关联的 视频所在的方位的空间信息;  a configuration module, configured to add the recording server as a telepresence site to the conference; a first connection establishment module, configured to initiate one or more calls to the recording server to establish a communication connection, where the call is The setup message carries spatial information of the location of the video associated with the communication connection that needs to be established;
所述录播服务器包括:  The recording server includes:
接收模块, 设置为接收来自于所述 MCU的媒体流;  a receiving module, configured to receive a media stream from the MCU;
存储记录模块, 设置为存储所述媒体流, 记录各个所述媒体流之间的关联 关系。  And a storage recording module, configured to store the media stream, and record an association relationship between the media streams.
2. 根据权利要求 1所述的系统, 其中, 各个所述媒体流之间的关联关系包括以下 至少之一: 2. The system according to claim 1, wherein the association relationship between each of the media streams comprises at least one of the following:
视频流之间的空间关系;  The spatial relationship between video streams;
数据流和视频流的对应关系;  Correspondence between data stream and video stream;
音频流和视频流的对应关系。  Correspondence between audio stream and video stream.
3. 根据权利要求 1所述的系统, 其中, 所述 MCU包括: 3. The system according to claim 1, wherein the MCU comprises:
广播模块, 设置为通过召集会议的形式, 根据各个所述媒体流之间的关联 关系广播所述录播服务器录制的媒体流。  The broadcast module is configured to broadcast the media stream recorded by the recording server according to an association relationship between the media streams in a form of calling a conference.
4. 根据权利要求 1所述的系统, 其中, 所述远程呈现会议系统还包括: 远程呈现 会场; 所述远程呈现会场包括: The system of claim 1, wherein the telepresence conference system further comprises: a remote presentation venue; the telepresence venue includes:
第二连接建立模块, 设置为与所述录播服务器建立点对点连接; 获取呈现模块, 设置为根据各个所述媒体流之间的关联关系获取所述录播 服务器录制的媒体流并呈现。  And a second connection establishing module, configured to establish a point-to-point connection with the recording server, and obtain a presentation module, configured to acquire and display the media stream recorded by the recording server according to the association relationship between the media streams.
5. 根据权利要求 4所述的系统, 其中, 所述第二连接建立模块包括: 发起单元,设置为向所述录播服务器发起一个或多个用于建立连接的呼叫, 其中, 该呼叫的建立消息中携带有与需要建立的通信连接相关联的视频所在的 方位的空间信息。 根据权利要求 4所述的系统, 其中, 所述录播服务器包括: The system according to claim 4, wherein the second connection establishing module comprises: The initiating unit is configured to initiate one or more calls for establishing a connection to the recording server, where the setup message of the call carries spatial information of the location of the video associated with the communication connection that needs to be established. The system according to claim 4, wherein the recording server comprises:
第一发送模块, 设置为向所述视频终端中的一个或多个发送所述录制的媒 体流所对应的文件列表信息;  a first sending module, configured to send, to one or more of the video terminals, file list information corresponding to the recorded media stream;
第二发送模块, 设置为响应用户根据所述文件列表信息进行的选择操作, 根据各个所述媒体流之间的关联关系, 向各个所述视频终端发送所述媒体流。 根据权利要求 4所述的系统, 其中, 所述 MCU与所述录播服务器之间的通信 信令、 以及所述远程呈现会场与所述录播服务器之间的通信信令是基于 H.323 协议规程或者 IETF SIP协议规程进行扩展的信令。 一种远程呈现会议的录制方法, 应用于如权利要求 1至 7中任一项所述的远程 呈现会议系统, 包括:  And a second sending module, configured to send the media stream to each of the video terminals according to an association relationship between the media streams in response to a selection operation performed by the user according to the file list information. The system according to claim 4, wherein communication signaling between the MCU and the recording server, and communication signaling between the remote presentation site and the recording server are based on H.323 Extended protocol signaling by protocol procedures or IETF SIP protocol procedures. A remote presentation conference recording method, the remote presentation conference system according to any one of claims 1 to 7, comprising:
在所述远程呈现会议系统中的多点控制单元 MCU将录播服务器作为一个 远程呈现会场添加进会议之后, 所述录播服务器接收来自于所述 MCU发起的 一个或多个用于建立连接的呼叫, 其中, 所述呼叫的建立消息中携带有与需要 建立的通信连接相关联的视频所在的方位的空间信息;  After the multipoint control unit MCU in the telepresence conference system adds the recording server as a telepresence conference site to the conference, the recording server receives one or more initiated connections from the MCU for establishing a connection. a call, where the setup message of the call carries spatial information of an orientation of a video associated with a communication connection that needs to be established;
所述录播服务器接收来自于所述 MCU的媒体流并存储, 记录各个所述媒 体流之间的关联关系。 根据权利要求 8所述的方法, 其中, 所述 MCU与所述录播服务器建立连接后, 还包括: 所述 MCU对与所述录播服务器相连接的端口进行以下处理:  The recording server receives and stores the media stream from the MCU, and records an association relationship between the respective media streams. The method according to claim 8, wherein, after the MCU establishes a connection with the recording server, the method further comprises: the MCU performing the following processing on a port connected to the recording server:
禁止来自于所述录播服务器的媒体流进入会议混音器;  Prohibiting the media stream from the recording server from entering the conference mixer;
禁止将所述录播服务器加入会场列表中;  It is forbidden to add the recording server to the site list.
禁止将所述录播服务器的上端和下端信息通知所述远程呈现会议系统中的 远程呈现会场;  Disabling the upper and lower end information of the recording server to notify the telepresence conference site in the telepresence conference system;
使主会场、 分会场或经过混合处理后的媒体流传送至所述录播服务器。 根据权利要求 8所述的方法, 其中, 各个所述媒体流之间的关联关系包括以下 至少之一:  The main site, the sub-site, or the mixed media stream is transmitted to the recording server. The method according to claim 8, wherein the association relationship between each of the media streams comprises at least one of the following:
视频流之间的空间关系; 数据流和视频流的对应关系; The spatial relationship between video streams; Correspondence between data stream and video stream;
音频流和视频流的对应关系。  Correspondence between audio stream and video stream.
11. 根据权利要求 8所述的方法, 其中, 当所述呼叫为多个时, 所述呼叫的个数大 于等于需要录制的会议中具有最多屏幕的远程呈现会场的屏幕数。 The method according to claim 8, wherein, when the number of the calls is plural, the number of the calls is greater than a number of screens of the telepresence site having the most screens among the conferences to be recorded.
12. 一种远程呈现会议的回放方法, 应用于如权利要求 1至 7中任一项所述的远程 呈现会议系统, 包括: A method of playing back a remote presentation conference, the remote presentation conference system according to any one of claims 1 to 7, comprising:
所述远程呈现会议系统中的多点控制单元 MCU将通过召集会议的形式, 根据各个录制的媒体流之间的关联关系广播录播服务器录制的媒体流。  The multipoint control unit MCU in the telepresence conference system broadcasts the media stream recorded by the recording server according to the association relationship between the respective recorded media streams in the form of a conference.
13. 一种远程呈现会议的回放方法, 应用于如权利要求 1至 7中任一项所述的远程 呈现会议系统, 包括: A method of playing back a remote presentation conference, the remote presentation conference system according to any one of claims 1 to 7, comprising:
所述远程呈现会议系统中的远程呈现会场的视频终端与录播服务器建立点 对点连接;  The video terminal of the telepresence site in the telepresence conference system establishes a point-to-point connection with the recording server;
所述远程呈现会场根据录制的媒体流之间的关联关系, 获取所述录播服务 器录制的媒体流并呈现。  The telepresence site acquires and presents the media stream recorded by the recording server according to the association relationship between the recorded media streams.
14. 根据权利要求 13所述的方法,其中,所述视频终端与录播服务器建立点对点连 接包括: 14. The method of claim 13, wherein establishing the point-to-point connection between the video terminal and the recording server comprises:
所述视频终端向所述录播服务器发起一个或多个用于建立连接的呼叫, 其 中, 所述呼叫的建立消息中携带有与需要建立的通信连接相关联的视频所在的 方位的空间信息。  The video terminal initiates one or more calls for establishing a connection to the recording server, wherein the setup message of the call carries spatial information of the location of the video associated with the communication connection that needs to be established.
15. 根据权利要求 13所述的方法,其中,所述远程呈现会场获取来自于所述录播服 务器的媒体流包括: The method of claim 13, wherein the remotely presenting the venue to obtain the media stream from the recording server comprises:
所述录播服务器向所述视频终端中的一个或多个发送所述录制的媒体流所 对应的文件列表信息;  Transmitting, by the recording and playing server, file list information corresponding to the recorded media stream to one or more of the video terminals;
所述录播服务器响应用户根据所述文件列表信息进行的选择操作, 根据所 述录制的媒体流之间的关联关系,向各个所述视频终端发送所述录制的媒体流。  And the recording and recording server sends the recorded media stream to each of the video terminals according to the association relationship between the recorded media streams in response to a selection operation performed by the user according to the file list information.
PCT/CN2012/077266 2011-06-21 2012-06-20 Remotely presented conference system, method for recording and playing back remotely presented conference WO2012175025A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN201110168147.8A CN102223515B (en) 2011-06-21 2011-06-21 Remote presentation conference system, the recording of remote presentation conference and back method
CN201110168147.8 2011-06-21

Publications (1)

Publication Number Publication Date
WO2012175025A1 true WO2012175025A1 (en) 2012-12-27

Family

ID=44779923

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2012/077266 WO2012175025A1 (en) 2011-06-21 2012-06-20 Remotely presented conference system, method for recording and playing back remotely presented conference

Country Status (2)

Country Link
CN (1) CN102223515B (en)
WO (1) WO2012175025A1 (en)

Families Citing this family (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102223515B (en) * 2011-06-21 2017-12-05 中兴通讯股份有限公司 Remote presentation conference system, the recording of remote presentation conference and back method
CN103686219B (en) * 2012-09-24 2017-09-29 华为技术有限公司 A kind of method, equipment and the system of video conference recorded broadcast
CN103888709B (en) * 2012-12-21 2017-02-08 深圳市捷视飞通科技股份有限公司 Terminal integrated apparatus of video conference and recording system
CN105007448B (en) * 2015-07-03 2018-11-06 苏州科达科技股份有限公司 A kind of video recording system and method for video conference
GB2540226A (en) * 2015-07-08 2017-01-11 Nokia Technologies Oy Distributed audio microphone array and locator configuration
CN109547728B (en) * 2018-10-23 2021-01-01 视联动力信息技术股份有限公司 Recorded broadcast source conference entering and conference recorded broadcast method and system
CN112153321B (en) * 2019-06-28 2022-04-05 华为技术有限公司 Conference recording method, device and system
CN112463283B (en) * 2020-12-25 2022-03-08 创想空间信息技术(苏州)有限公司 Method and system for reviewing historical content of application program and electronic equipment

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1929593A (en) * 2005-09-07 2007-03-14 宝利通公司 Spatially correlated audio in multipoint videoconferencing
CN101132516A (en) * 2007-09-28 2008-02-27 深圳华为通信技术有限公司 Method, system for video communication and device used for the same
CN101179695A (en) * 2007-12-04 2008-05-14 中兴通讯股份有限公司 Method for implementing recorded broadcast of session, session television system and terminal
US20100171807A1 (en) * 2008-10-08 2010-07-08 Tandberg Telecom As System and associated methodology for multi-layered site video conferencing
CN102055949A (en) * 2009-11-02 2011-05-11 华为终端有限公司 Recording method, device and system of multimedia conference and rewinding method and device
CN102223515A (en) * 2011-06-21 2011-10-19 中兴通讯股份有限公司 Remote presentation meeting system and method for recording and replaying remote presentation meeting

Family Cites Families (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1725851A (en) * 2004-07-20 2006-01-25 赵亮 Control system and method of video conference
US7830409B2 (en) * 2005-03-25 2010-11-09 Cherng-Daw Hwang Split screen video in a multimedia communication system
US7653705B2 (en) * 2006-06-26 2010-01-26 Microsoft Corp. Interactive recording and playback for network conferencing
EP1885111B1 (en) * 2006-08-01 2011-03-02 Alcatel Lucent Conference server
CN101039409A (en) * 2007-04-04 2007-09-19 中兴通讯股份有限公司 System and method for recording/replaying audio and video of multimedia conference
CN101141617A (en) * 2007-10-24 2008-03-12 中兴通讯股份有限公司 Session television on-demand system and method
CN101741829B (en) * 2009-08-07 2012-07-25 株洲华通科技有限责任公司 Domain control server for integrated control of sound images

Patent Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1929593A (en) * 2005-09-07 2007-03-14 宝利通公司 Spatially correlated audio in multipoint videoconferencing
CN101132516A (en) * 2007-09-28 2008-02-27 深圳华为通信技术有限公司 Method, system for video communication and device used for the same
CN101179695A (en) * 2007-12-04 2008-05-14 中兴通讯股份有限公司 Method for implementing recorded broadcast of session, session television system and terminal
US20100171807A1 (en) * 2008-10-08 2010-07-08 Tandberg Telecom As System and associated methodology for multi-layered site video conferencing
CN102055949A (en) * 2009-11-02 2011-05-11 华为终端有限公司 Recording method, device and system of multimedia conference and rewinding method and device
CN102223515A (en) * 2011-06-21 2011-10-19 中兴通讯股份有限公司 Remote presentation meeting system and method for recording and replaying remote presentation meeting

Also Published As

Publication number Publication date
CN102223515B (en) 2017-12-05
CN102223515A (en) 2011-10-19

Similar Documents

Publication Publication Date Title
WO2012175025A1 (en) Remotely presented conference system, method for recording and playing back remotely presented conference
US9172912B2 (en) Telepresence method, terminal and system
RU2533304C2 (en) Conference call management method and related device and system
WO2011026382A1 (en) Method, device and system for presenting virtual conference site of video conference
WO2011050690A1 (en) Method and system for recording and replaying multimedia conference
WO2010034254A1 (en) Video and audio processing method, multi-point control unit and video conference system
RU2610451C2 (en) Method, apparatus and system for recording video conference
JP6172610B2 (en) Video conferencing system
WO2007082433A1 (en) Apparatus, network device and method for transmitting video-audio signal
WO2010135979A1 (en) Method and system for video conference control, network equipment and meeting places for video conference
WO2011063763A1 (en) Method, device and system for conference control including remote display conference places
WO2011140812A1 (en) Multi-picture synthesis method and system, and media processing device
US9113037B2 (en) Video conference virtual endpoints
WO2011057511A1 (en) Method, apparatus and system for implementing audio mixing
WO2010097044A1 (en) Remote user signals identification method, remote conference processing method, apparatus and system
WO2015127799A1 (en) Method and device for negotiating on media capability
CN111601068A (en) Method for realizing multi-MCU cascade centerless video conference
WO2011147182A1 (en) Method and remote presentation system for obtaining remote seat position information
WO2016082577A1 (en) Video conference processing method and device
WO2015003532A1 (en) Multimedia conferencing establishment method, device and system
US20130113873A1 (en) Video conference system
CN113194278A (en) Conference control method and device and computer readable storage medium
US20070211138A1 (en) System and method for configuring devices to facilitate video telephony
US9013537B2 (en) Method, device, and network systems for controlling multiple auxiliary streams
WO2016206471A1 (en) Multimedia service processing method, system and device

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 12802471

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 12802471

Country of ref document: EP

Kind code of ref document: A1