WO2013060295A1 - Method and system for video processing - Google Patents

Method and system for video processing

Info

Publication number
WO2013060295A1
Authority
WO
WIPO (PCT)
Prior art keywords
remote
local
display
video information
observation points
Prior art date
Application number
PCT/CN2012/083637
Other languages
French (fr)
Chinese (zh)
Inventor
赵嵩
王静
刘源
Original Assignee
华为技术有限公司
Priority date
Filing date
Publication date
Application filed by 华为技术有限公司
Publication of WO2013060295A1

Classifications

    • H: ELECTRICITY
    • H04: ELECTRIC COMMUNICATION TECHNIQUE
    • H04N: PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N7/00: Television systems
    • H04N7/14: Systems for two-way working
    • H04N7/15: Conference systems

Definitions

  • The present invention relates to telepresence technology, and more particularly to a video processing method and system.
  • Telepresence technology can be used in a video conferencing system in which the two parties are in different geographical locations and rely on video communication to achieve an effect similar to meeting in the same place.
  • In video communication, local video information must be captured on the one hand, and video information from the far end must be displayed on the other. Because the position of the camera differs from the position of the remote image, a participant who looks at the far-end image is not looking at the camera, so the conversation lacks an eye-to-eye effect.
  • A technical problem to be solved by embodiments of the present invention is to provide a video processing method and system that can achieve an eye-to-eye effect in a remote conference system.
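  • An embodiment of the present invention provides a video processing method for use in a remote conference system. The video sending part of the method includes: acquiring local video information with different viewing angles, where the number of viewing angles of the local video information is not less than the number of remote observation points; and sending the local video information with different viewing angles to the far end.
  • The video display part of the method includes: receiving remote video information with different viewing angles, where the number of viewing angles of the remote video information is not less than the number of local observation points; and
  • displaying, by a multi-view display device, the remote video information of the corresponding viewing angle to different local observation points to achieve an eye-to-eye display effect, where the number of display viewing angles of the multi-view display device is not less than the number of local observation points.
  • The number of remote observation points and the number of local observation points are both natural numbers, and at least one of the two numbers is not less than 2.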
  • An embodiment of the present invention further provides a video processing device for use in a remote conference system, where the sending module of the device includes:
  • a local video obtaining unit configured to acquire local video information with different viewing angles, where the number of viewing angles of the local video information is not less than the number of remote observation points;
  • a local video sending unit configured to send the local video information with different viewing angles to the remote end;
  • the display module of the device includes:
  • a remote video receiving unit configured to receive remote video information with different viewing angles, where the number of viewing angles of the remote video information is not less than the number of local observation points;
  • a remote video display unit configured to display, by using a multi-view display device, the remote video information of the corresponding viewing angle to different local observation points to achieve an eye-to-eye display effect, where the number of display viewing angles of the multi-view display device is not less than the number of local observation points.
  • The number of remote observation points and the number of local observation points are both natural numbers, and at least one of the two numbers is not less than 2.
  • An embodiment of the present invention further provides a remote conference system, where the local end of the system includes:
  • a plurality of image capturing devices having different camera viewing angles, each configured to acquire local video information with a different viewing angle, where the number of viewing angles of the local video information is not less than the number of remote observation points;
  • a communication device configured to send, to the remote end, the local video information with different viewing angles obtained by the image capturing devices, and to receive remote video information with different viewing angles from the remote end, where the number of viewing angles of the remote video information is not less than the number of local observation points;
  • a multi-view display device configured to display the remote video information of the corresponding viewing angle to different local observation points to achieve an eye-to-eye display effect, where the number of display viewing angles of the multi-view display device is not less than the number of local observation points.
  • The number of remote observation points and the number of local observation points are both natural numbers, and at least one of the two numbers is not less than 2.
  • In the embodiments, multi-view local video information is obtained and sent to the remote end for display, and multi-view remote video information from the far end is displayed at the local end; an eye-to-eye meeting effect can be achieved as long as the viewing angles are properly matched during display.
  • FIG. 1 is a layout diagram of one end of an existing remote conference system.
  • FIG. 2 is a specific flowchart of the sending part of a video processing method according to an embodiment of the present invention.
  • FIG. 3 is a specific flowchart of the display part of a video processing method according to an embodiment of the present invention.
  • FIG. 4 is a specific composition diagram of the video processing device in an embodiment of the present invention.
  • FIG. 5 is a specific composition diagram of a remote conference system in an embodiment of the present invention.
  • FIG. 6 is a schematic diagram of a layout 1 according to an embodiment of the present invention.
  • FIG. 7 is a schematic diagram of a layout 1 according to an embodiment of the present invention.
  • FIG. 8 is a schematic diagram of a layout 1 according to an embodiment of the present invention.
  • FIG. 9 is a schematic diagram of a layout 1 including an auxiliary stream display according to an embodiment of the present invention.
  • FIG. 10 is a schematic diagram of a layout 1 including an auxiliary stream display according to an embodiment of the present invention.
  • FIG. 11 is a schematic diagram of a layout 1 according to an embodiment of the present invention.
  • FIG. 12 is a schematic diagram showing the display viewing angles of a multi-view display according to an embodiment of the present invention.
  • FIG. 13 is a schematic diagram showing the display principle of a multi-view display according to an embodiment of the present invention.
  • FIG. 14 is a schematic diagram of a layout 1a according to an embodiment of the present invention.
  • FIG. 15 is a schematic diagram of a layout 1b according to an embodiment of the present invention.
  • FIG. 16 is a schematic diagram of a layout 2 according to an embodiment of the present invention.
  • FIG. 17 is a schematic diagram of a layout 3 according to an embodiment of the present invention.
  • FIG. 18 is a schematic diagram of a layout 4 according to an embodiment of the present invention.
  • FIG. 19 is a schematic diagram of a layout 5 according to an embodiment of the present invention.
  • FIG. 20 is a schematic diagram of a layout 6 according to an embodiment of the present invention.
  • FIG. 21 is a schematic diagram of a layout 7 according to an embodiment of the present invention.
  • FIG. 22 is a schematic diagram of a layout 8 according to an embodiment of the present invention.
  • FIG. 23 is a schematic diagram of a layout 9 according to an embodiment of the present invention.
  • FIG. 24 is a schematic diagram of a layout 10 according to an embodiment of the present invention.
  • The basis of the eye-to-eye effect is that the two parties can observe each other from different viewing angles: when a person turns toward a certain viewing angle, only the observer at that viewing angle perceives the person as facing them. Based on this principle, the embodiments of the present invention fully consider the number of possible viewing angles of both parties and obtain a corresponding number of multi-view video streams according to the number of observers, so as to achieve a realistic eye-to-eye effect.
  • FIG. 2 and FIG. 3 are specific flowcharts of a video processing method in an embodiment of the present invention; the method can be used in a remote conference system.
  • The video sending part of the method is: 201, acquiring local video information with different viewing angles, where the number of viewing angles of the local video information is not less than the number of remote observation points; 202, sending the local video information with different viewing angles to the far end.
  • The local video information can be acquired by imaging devices that are located at the multi-view display device described below and have different camera viewing angles.
  • The video display part of the method is: 301, receiving remote video information with different viewing angles, where the number of viewing angles of the remote video information is not less than the number of local observation points; 302, displaying, by a multi-view display device, the remote video information of the corresponding viewing angle to different local observation points to achieve an eye-to-eye display effect, where the number of display viewing angles of the multi-view display device is not less than the number of local observation points.
  • the number of the remote observation points and the number of local observation points are all natural numbers, and at least one of the number of the remote observation points and the number of local observation points is not less than 2.
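  • The counting constraints stated for steps 201-202 and 301-302 can be checked mechanically. The following is a minimal sketch, not part of the patent: the function name `check_view_counts` and its parameter names are hypothetical, and "natural number" is assumed to mean an integer of at least 1.

```python
def check_view_counts(local_views, remote_points,
                      remote_views, local_points, display_views):
    """Return the list of violated constraints (empty if all hold).

    All names are illustrative; "natural number" is assumed to mean >= 1.
    """
    problems = []
    if min(local_points, remote_points) < 1:
        problems.append("observation point counts must be natural numbers")
    if max(local_points, remote_points) < 2:
        problems.append("at least one side needs 2 or more observation points")
    if local_views < remote_points:
        problems.append("local video views < remote observation points")
    if remote_views < local_points:
        problems.append("remote video views < local observation points")
    if display_views < local_points:
        problems.append("display viewing angles < local observation points")
    return problems


# Three cameras and a three-view display on each side, three seats per side.
print(check_view_counts(3, 3, 3, 3, 3))
# One participant on each side: no side has two observation points.
print(check_view_counts(1, 1, 1, 1, 1))
```

  • An asymmetric setup also passes: a site with one observer and a single-view display can still qualify if it captures three views for a three-observer far end.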
  • The remote conference system in the foregoing embodiment may include multiple remote ends, in which case step 301 may be: selecting one of the plurality of remote ends, and receiving the remote video information with different viewing angles sent from the selected remote end. In addition, a plurality of multi-view display devices may be used for display; that is, in step 302, a plurality of multi-view display devices display the remote video information of the corresponding viewing angles to different local observation points.
  • A remote or local observation point may refer to the position of a single participant attending the conference, or to a position group of participants (that is, two or more participants may share one observation point without being distinguished).
  • As shown in FIG. 4, an embodiment of the present invention further provides a video processing device 1 for use in a remote conference system.
  • The sending module 10 of the device 1 includes: a local video acquiring unit 100, configured to acquire local video information with different viewing angles, where the number of viewing angles of the local video information is not less than the number of remote observation points; and a local video sending unit 102, configured to send the local video information with different viewing angles to the remote end.
  • the local video obtaining unit 100 is further configured to acquire local video information by using an imaging device having different camera viewing angles at the multi-view display device.
  • The display module 12 of the device 1 includes: a remote video receiving unit 120, configured to receive remote video information with different viewing angles, where the number of viewing angles of the remote video information is not less than the number of local observation points; and a remote video display unit 122, configured to display, by using a multi-view display device, the remote video information of the corresponding viewing angle to different local observation points to achieve an eye-to-eye display effect, where the number of display viewing angles of the multi-view display device is not less than the number of local observation points.
  • the number of the remote observation points and the number of local observation points are both natural numbers, and at least one of the number of the remote observation points and the number of local observation points is not less than 2.
  • The remote video display unit 122 can also be used to display the remote video information of the corresponding viewing angle to different local observation points by using multiple multi-view display devices. If the system includes multiple remote ends, the remote video receiving unit 120 is further configured to select one of the plurality of remote ends and receive the remote video information with different viewing angles sent from the selected remote end.
  • As shown in FIG. 5, an embodiment of the present invention further provides a remote conference system in which physical devices having the above functions implement the entire system. The figure shows only the connection relationship and does not represent the positional relationship in the actual system.
  • The local end of the system includes: a plurality of camera devices 2 having different camera viewing angles, each configured to acquire local video information with a different viewing angle, where the number of viewing angles of the local video information is not less than the number of remote observation points;
  • a communication device 3, configured to send the local video information with different viewing angles obtained by the camera devices to the remote end, and to receive remote video information with different viewing angles from the remote end, where the number of viewing angles of the remote video information is not less than the number of local observation points;
  • a multi-view display device 4, configured to display the remote video information of the corresponding viewing angle to different local observation points to achieve an eye-to-eye display effect, where the number of display viewing angles of the multi-view display device is not less than the number of local observation points.
  • The number of remote observation points and the number of local observation points are both natural numbers, and at least one of the two numbers is not less than 2.
  • Multiple multi-view display devices can be set at the local end, so that the number of multi-view display devices in the system is not less than the number of remote observation points, which achieves a better eye-to-eye effect.
  • the multi-view display device may be a multi-view display, or the multi-view display device may be a combination of a plurality of projectors and a projection screen having a multi-view display function.
  • the local communication device is further configured to select one of the plurality of remote terminals for receiving and transmitting multi-view video information.
  • FIG. 6 to FIG. 11 show layout 1 of an embodiment of the present invention.
  • Layout 1 gives an example of the specific positional relationship of the component devices in the system shown in FIG. 5, together with the corresponding viewing angle of each device, and so on.
  • The overall situation of the local end and the far end in the system is shown in Fig. 6 to Fig. 10.
  • Fig. 11 shows the specific layout of one end in the system (the two ends are symmetrically distributed).
  • The system includes two sites, A and B.
  • Sites A and B are directly connected through the network.
  • Each site contains three large-size flat panel displays as the main display devices, such as 65-inch or 70-inch flat panel displays, which present high-definition images close to life-size.
  • Flat panel display technologies such as PDP TV, LCD TV or DLP rear projection TV can be used.
  • The three displays are placed in a folded plane, with the middle display adjoining the displays on both sides, so that the images of the three displays together form a complete presentation of the conference room scene; the auxiliary display at the side of the conference table can display shared data and other information.
  • The display devices used in the system are displays with multiple viewing angles. As shown in FIG. 11, each display has three viewing angles, and each viewing angle can present different content; the viewing angles of each display are shown in FIG. 12.
  • The display shown in FIG. 12 has the following feature: different content can be displayed at different viewing angles. Assume that the object to be presented, identified by a broken line in the figure, has three faces; the multi-view display can present the content of face 1, face 2 and face 3 at three different viewing angles, as if the object were actually placed at the display position.
  • The display device is implemented using the parallax barrier principle, as shown in Fig. 13, so that the image content of the corresponding viewing angle is observed from each viewing angle.
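  • The source states only that the display relies on the parallax barrier principle. As an illustration, the sketch below shows column interleaving, a common way to prepare a frame for such a display; it is an assumption here, not a detail taken from the patent. The function name `interleave_views` is hypothetical, and images are modelled as plain nested lists of pixel values.

```python
def interleave_views(views):
    """Interleave the pixel columns of N equally sized view images.

    Each view is a list of rows and each row is a list of pixel values.
    Output column c is taken from view (c % N), so neighbouring columns
    belong to different views; a barrier in front of the panel would then
    expose a different column set to each viewing direction.
    """
    n = len(views)
    height = len(views[0])
    width = len(views[0][0])
    return [
        [views[c % n][r][c] for c in range(width)]
        for r in range(height)
    ]


# Three flat test images (values 1, 2, 3) standing in for views V1..V3.
v1 = [[1] * 6 for _ in range(2)]
v2 = [[2] * 6 for _ in range(2)]
v3 = [[3] * 6 for _ in range(2)]
frame = interleave_views([v1, v2, v3])
print(frame[0])
```

  • With three views, each view keeps one third of the horizontal resolution, a trade-off that real multi-view panels handle in hardware-specific ways.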
  • The conference tables of each conference room are D1, D2, D3 from left to right, and the displays are T1, T2, T3.
  • The three cameras on each display are C1, C2, C3 from left to right, and the three viewing angles of each display are V1, V2, V3 from left to right.
  • At the top of the left display, the shooting range of camera C1 covers conference table D1, that of camera C2 covers conference table D2, and that of camera C3 covers conference table D3; at the top of the middle display, the shooting range of camera C1 covers conference table D1, that of camera C2 covers conference table D2, and that of camera C3 covers conference table D3;
  • at the top of the right display, the shooting range of camera C1 covers conference table D1, that of camera C2 covers conference table D2, and that of camera C3 covers conference table D3.
  • the viewing angle V1 of T1 corresponds to the conference table D1
  • the viewing angle V2 of T1 corresponds to the conference table D2
  • the viewing angle V3 of T1 corresponds to the conference table D3
  • the viewing angle V1 of T2 corresponds to the conference table D1
  • the viewing angle V2 of T2 corresponds to the conference table D2
  • the viewing angle V3 of T2 corresponds to the conference table D3
  • the viewing angle V1 of T3 corresponds to the conference table D1
  • the viewing angle V2 of T3 corresponds to the conference table D2
  • the viewing angle V3 of T3 corresponds to the conference table D3.
  • The two sites shown in FIG. 6 are site A and site B. If the video streams of the cameras of site A are sent to site B, then for the middle seat area D2 there are the following transmission and reception correspondences of video streams:
  • A T1 C2 > B T2 V3
  • A T2 C2 > B T2 V2
  • A T3 C2 > B T2 V1
  • A T2 C1 > B T3 V2
  • A T1 C3 > B T1 V3
  • A T2 C3 > B T1 V2
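  • Read as (site A display, camera) mapped to (site B display, viewing angle), the six listed entries are all consistent with a mirrored rule: the camera index selects the mirrored display at site B, and the display index selects the mirrored viewing angle. The sketch below encodes that rule. The rule is inferred from the listed entries only and is an assumption, as are the names `route` and `LISTED`.

```python
NUM_DISPLAYS = 3  # displays T1..T3, cameras C1..C3, viewing angles V1..V3


def route(display_a, camera_a, n=NUM_DISPLAYS):
    """Map a site-A stream (display Tj, camera Ck) to site B.

    Assumed mirrored rule inferred from the listed entries:
    B display index = n + 1 - k, B viewing angle index = n + 1 - j.
    Returns (display index at B, viewing angle index at B).
    """
    return (n + 1 - camera_a, n + 1 - display_a)


# The six correspondences listed in the text, as (Tj, Ck) -> (T, V).
LISTED = {
    (1, 2): (2, 3),
    (2, 2): (2, 2),
    (3, 2): (2, 1),
    (2, 1): (3, 2),
    (1, 3): (1, 3),
    (2, 3): (1, 2),
}

for (tj, ck), expected in LISTED.items():
    assert route(tj, ck) == expected
print(route(1, 2))
```

  • Under this reading a full three-display site would produce nine such pairs; the pairs not listed in the text are extrapolations and should be checked against the figures.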
  • The foregoing video transmission and reception correspondence may be implemented in two ways. To describe the display devices of a site and the orientation of the participants, the following conventions are adopted for the whole participant area.
  • Facing all the main display devices from the middle position, the leftmost display device is the 0th display device, the second from the left is the 1st display device, and so on;
  • the leftmost conference area is the 0th (camera) coverage area, the second from the left is the 1st (camera) coverage area, and so on.
  • An auxiliary display T4 may be present at sites A and B to display the auxiliary stream video.
  • In that case the transmission and reception correspondence of the video streams can take two forms. One is consistent with the manner described for Figures 6-8, as shown in Figure 9.
  • The other is the mirroring mode.
  • The mode is negotiated before the two parties in the system start the video conference.
  • The negotiation covers the above video transmission and reception correspondence information; that is, the following processes take place.
  • Sending video stream T1 C3 can be described as:
  • The sender sends all the video streams of its own site to the other party. After receiving the video streams, the other party displays each video on the specific display according to the above correspondence.
  • This requires the other party to identify the position information carried with each received video, so the following structure is added to the user data section:
  • Auxiliary stream flag: 1 identifies an auxiliary stream, 0 a non-auxiliary stream (i.e. a main stream).
  • Position of the sender of the video stream (the position of the display): for a main stream, all the bits identify the position; for an auxiliary stream, the highest 2 bits indicate the vertical position of the auxiliary stream, where 11 means above the main display device, 10 means at the same horizontal position as the main display device, and 00 means below the main display device.
  • Coverage area of the video stream [one area or all areas]: the highest bit indicates whether all areas are covered. 1 means all areas are covered, in which case the following 8 bits are meaningless; 0 means only a certain area is covered, in which case the following bits 0-7 give the covered area or orientation.
  • Recver_pos: 8 bits, the video receiver position, used for capability negotiation.
  • Displayer_pos: 8 bits, the video display position, used for capability negotiation.
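  • A minimal sketch of how such a per-stream tag could be serialized is given below. Only the 8-bit widths of Recver_pos and Displayer_pos and the "one flag bit plus 8 bits" shape of the coverage field come from the text. The class name `StreamTag`, the other field names, the 8-bit sender position, the one-byte auxiliary flag and the big-endian byte layout are all assumptions made for illustration.

```python
import struct
from dataclasses import dataclass


@dataclass
class StreamTag:
    """Hypothetical per-stream user data; the layout is an assumption."""
    is_aux: bool        # auxiliary stream flag (1 = auxiliary, 0 = main)
    sender_pos: int     # position of the sending display (assumed 8 bits)
    cover_all: bool     # highest coverage bit: stream covers all areas
    cover_area: int     # covered area index, ignored when cover_all is set
    recver_pos: int     # Recver_pos, 8 bits
    displayer_pos: int  # Displayer_pos, 8 bits

    _FMT = ">BBHBB"     # assumed big-endian layout, 6 bytes in total

    def pack(self):
        # Coverage uses 9 bits: bit 8 is the "all areas" flag, bits 0-7 the area.
        coverage = (1 << 8) if self.cover_all else (self.cover_area & 0xFF)
        return struct.pack(self._FMT, int(self.is_aux), self.sender_pos & 0xFF,
                           coverage, self.recver_pos & 0xFF,
                           self.displayer_pos & 0xFF)

    @classmethod
    def unpack(cls, data):
        aux, sender, coverage, recver, displayer = struct.unpack(cls._FMT, data)
        cover_all = bool((coverage >> 8) & 1)
        area = 0 if cover_all else coverage & 0xFF
        return cls(bool(aux), sender, cover_all, area, recver, displayer)


# Main stream from the leftmost display (index 0) covering area index 2.
tag = StreamTag(False, 0, False, 2, 1, 1)
assert StreamTag.unpack(tag.pack()) == tag
print(len(tag.pack()))
```

  • The example values follow the zero-based convention stated earlier (leftmost display is the 0th display, leftmost conference area is the 0th coverage area); how a real implementation embeds the tag in the video bitstream is not specified in the text.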
  • FIG. 14 shows layout 1a in an embodiment of the present invention, which is similar to layout 1.
  • The settings of the cameras and the conference tables are the same as those of layout 1.
  • The display device uses projection: nine high-resolution, high-brightness projectors are placed at nine different positions behind the projection screen.
  • Cylindrical gratings are arranged on the projection screen so that different content is seen at different angles.
  • FIG. 15 shows layout 1b in an embodiment of the present invention, which is similar to layout 1.
  • The settings of the cameras and the conference tables are the same as those of layout 1.
  • The display device uses projection: nine high-resolution, high-brightness projectors are placed at nine different positions in front of the projection screen.
  • Cylindrical gratings are arranged on the projection screen so that different content is seen at different angles.
  • FIG. 16 shows layout 2 in an embodiment of the present invention.
  • The system consists of two sites, A and B, that are directly connected through the network.
  • Sites A and B have the same configuration.
  • Each site uses two large-size multi-view flat panel displays as display devices, such as 65-inch or 70-inch flat panel displays, which can display high-definition images close to life-size; flat panel display technologies such as PDP TV, LCD TV or DLP rear projection TV can be used.
  • The display devices have multiple viewing angles, each of which can display different content.
  • Each site has three participant areas, configured as shown in the figure, and the participants in each participant area see the content of exactly one viewing angle of the multi-view display device;
  • two HD cameras are arranged in convergence mode on both sides and in the middle of the display, and each camera can capture all the participants from a different angle; the auxiliary display at the side of the conference table can display shared data and other information.
  • The C1 camera on the display at the T1 position captures conference area D1, and the C2 camera captures conference area D2;
  • the C1 camera on the display at the T2 position captures conference area D1, and the C2 camera captures conference area D2;
  • conference area D1 sees viewing angle V1 of display T1; conference area D1 can view
  • A T2 C1 > B T1 V2
  • FIG. 17 shows layout 3 in an embodiment of the present invention.
  • The system consists of two sites, A and B, that are directly connected through the network.
  • Sites A and B have the same configuration.
  • Each site uses a large-size multi-view flat panel display as a display device, such as a 65-inch or 70-inch flat panel display, to present high-definition images close to life-size.
  • Flat panel display technologies such as PDP TV, LCD TV or DLP rear projection TV can be used.
  • The display device has three viewing angles, each of which can display different content.
  • Each site has three participants, configured as shown in the figure, and each participant sees the content of exactly one viewing angle of the multi-view display device. Three HD cameras are set up on the sides and in the middle of the display, and each camera can capture all the participants from a different angle.
  • The auxiliary display at the side of the conference table can display information such as shared data.
  • The three viewing angles of the multi-view display device are V1, V2, and V3 from left to right.
  • Cameras C1, C2 and C3 capture all the participants P1, P2 and P3 from three angles;
  • the three participants are distributed over the three viewing angles of the multi-view display:
  • viewing angle V1 corresponds to P1,
  • viewing angle V2 corresponds to P2,
  • and viewing angle V3 corresponds to P3.
  • The video stream transmission and reception correspondence between the two sites is:
  • FIG. 18 shows layout 4 in an embodiment of the present invention.
  • The system consists of two sites, A and B, that are directly connected through the network.
  • Sites A and B have the same configuration.
  • Each site uses a large-size multi-view flat panel display as a display device, such as a 65-inch or 70-inch flat panel display, to present high-definition images close to life-size.
  • Flat panel display technologies such as PDP TV, LCD TV or DLP rear projection TV can be used.
  • The display device has two viewing angles, each of which can display different content.
  • Each site has two participants, configured as shown in the figure, and each participant sees the content of exactly one viewing angle of the multi-view display device;
  • Two HD cameras are arranged in convergence mode on both sides and in the middle of the display, and each camera can capture all the participants from a different angle; the auxiliary display at the side of the conference table can display information such as shared data.
  • Cameras C1 and C2 capture all the participants P1 and P2 from two angles.
  • The two participants are distributed over the two viewing angles of the multi-view display.
  • Viewing angle V1 corresponds to P1 and viewing angle V2 corresponds to P2.
  • FIG. 19 shows layout 5 in an embodiment of the present invention.
  • The system consists of two sites, A and B, that are directly connected through the network.
  • Site A contains a large-size multi-view flat panel display as a display device, such as a 65-inch or 70-inch flat panel display, which can be used to render high-definition images close to life-size.
  • The display device is a conventional display device with only one viewing angle; three high-definition cameras are arranged at the top of the display and placed in convergence mode, so that participant P1 is shot from three different angles.
  • Site B contains a large-size multi-view flat panel display as a display device, such as a 65-inch or 70-inch flat panel display, for displaying high-definition images close to life-size; flat panel display technologies such as PDP TV, LCD TV or DLP rear projection TV can be used.
  • The display device has three viewing angles, each of which can display different content; the site has three participants, configured as shown in the figure, and each participant sees the content of exactly one viewing angle of the multi-view display device;
  • An HD camera is placed in the middle of the top of the display device and supports high-definition image capture at 720p and 1080p resolutions. This camera can cover all the participants in the site.
  • An auxiliary display located at the side of the conference table displays information such as shared data.
  • The video stream transmission and reception correspondence between the two sites is:
  • FIG. 20 shows layout 6 in an embodiment of the present invention.
  • The system consists of two sites, A and B, that are directly connected through the network.
  • Site A contains three large-size multi-view flat panel displays as display devices, such as 65-inch or 70-inch flat panel displays, which can be used to render high-definition images close to life-size.
  • The display device is a conventional display device with only one viewing angle; three HD cameras are set on the top of the display, so that all the participants can be photographed from three different angles.
  • Site B includes a large-size multi-view flat panel display as a display device, such as a 65-inch or 70-inch flat panel display, for displaying high-definition images close to life-size; flat panel display technologies such as PDP TV, LCD TV or DLP rear projection TV can be used.
  • The display device has three viewing angles.
  • The site has three participants, configured as shown in the figure, and each participant sees the content of exactly one viewing angle of the multi-view display device; three HD cameras are placed in the middle and at both ends of the display device to capture the participant images from three angles, and the cameras support HD image capture at 720p and 1080p resolutions.
  • An auxiliary display located at the side of the conference table displays information such as shared data.
  • Participants in site A can choose which display device in the site shows the video stream of site B. Normally, the video stream is displayed on the T2 display.
  • Layout 7 in an embodiment of the present invention (FIG. 21) consists of three conference sites PA, PB, and PC. Each site is configured in the same way.
  • A large-size multi-view flat panel display is used as a display device, such as a 65-inch or 70-inch flat panel display, for rendering high-definition images close to life-size; flat panel display technologies such as PDP TV, LCD TV or DLP rear projection TV can be used.
  • The display device has 3 viewing angles, each of which can display different content.
  • Each site has 3 participants, configured as shown in the figure, and each participant sees the content of exactly one viewing angle of the multi-view display device; three HD cameras are set up in convergence mode on both sides and in the middle of the display, and each camera can capture all the participants from a different angle; the auxiliary display located at the side of the conference table displays information such as shared data.
  • The three sites PA, PB, and PC are connected through the MCU.
  • Media capability negotiation is performed before the conference starts.
  • Each site uploads all of its video streams to the MCU.
  • Each site can choose which remote site to view.
  • For example, site PA can choose to watch the content of site PB or site PC. Assuming that site PA views site PC at a given time, the MCU sends the three video streams of site PC to PA. In the eye-to-eye case, the video streams sent and received by the two sites correspond to each other.
  • MCU forwarding is:
  • Sites PB and PC have similar transmission and reception correspondences.
  • The notation "PC C1 + PB C1" indicates that the MCU combines the C1 video of site PC and the C1 video of site PB into a new video stream.
  • Layout 8 in the embodiment of the present invention consists of four sites PA, PB, PC, and PD.
  • All sites are configured in the same way.
  • The four sites are controlled by the MCU.
  • Each site contains three large flat panel displays as the main display devices, for example 65-inch or 70-inch panels showing high-definition images close to life size; flat panel display technologies such as PDP TV, LCD TV, or DLP rear projection TV can be used.
  • The three displays are arranged along a folded plane, with the middle display adjoining the two side displays, so that the images of the three displays together form a complete representation of the conference room scene.
  • The displays used in the system are multi-view displays. Each display in the figure above has three viewing angles, each viewing angle can present different content, and each viewing angle corresponds to one participant at the site.
  • An HD camera is mounted on top of each display, so that the participants are captured from three angles.
  • The cameras support HD capture at 720p and 1080p resolution. An auxiliary display at the side of the conference table can show shared data and other information.
  • The conference tables of each conference room are D1, D2, D3 from left to right;
  • the displays are T1, T2, T3, and the three viewing angles of each display are V1, V2, V3 from left to right;
  • the cameras are T1-C1, T2-C1, and T3-C1, each of which can independently cover the areas where all conference tables D1, D2, and D3 are located;
  • the viewing angle V1 of T1 corresponds to the conference table D1;
  • the viewing angle V2 of T1 corresponds to the conference table D2;
  • the viewing angle V3 of T1 corresponds to the conference table D3;
  • the viewing angle V1 of T2 corresponds to the conference table D1;
  • the viewing angle V2 of T2 corresponds to the conference table D2;
  • the viewing angle V3 of T2 corresponds to the conference table D3;
  • the viewing angle V1 of T3 corresponds to the conference table D1;
  • the viewing angle V2 of T3 corresponds to the conference table D2;
  • the viewing angle V3 of T3 corresponds to the conference table D3.
  • Each participant at each site can view the participants of all the sites.
  • All videos of each site are uploaded to the MCU, which stitches the corresponding videos and sends the result to the destination site. Now assume that site PA needs to view the content of all sites; in the eye-to-eye case, the correspondence between transmitted and received video streams is:
  • The notation "PB T1 C1 + PC T1 C1 + PD T1 C1" indicates that the MCU splices the T1 C1 videos of sites PB, PC, and PD into a new video stream.
  • The three display devices at the site respectively display the three remote sites.
  • Layout 9 in the embodiment of the present invention consists of four sites PA, PB, PC, and PD.
  • All sites are configured in the same way.
  • The four sites are controlled by the MCU.
  • Each site contains three large flat panel displays as the main display devices, for example 65-inch or 70-inch panels showing high-definition images close to life size; flat panel display technologies such as PDP TV, LCD TV, or DLP rear projection TV can be used.
  • The three displays are arranged along a folded plane, with the middle display adjoining the two side displays, so that the images of the three displays together form a complete representation of the meeting room scene.
  • The displays used in the system are multi-view displays. Each display in the figure above has three viewing angles, each viewing angle can present different content, and each viewing angle corresponds to one participant at the site.
  • An HD camera is mounted on top of each display, so that the participants are captured from three angles.
  • The cameras support HD capture at 720p and 1080p resolution.
  • An auxiliary display at the side of the conference table can show information such as shared data.
  • The conference tables of each conference room are D1, D2, D3 from left to right;
  • the displays are T1, T2, T3, and the three viewing angles of each display are V1, V2, V3 from left to right;
  • the cameras are T1-C1, T2-C1, and T3-C1.
  • Each of the cameras T1-C1, T2-C1, and T3-C1 can independently cover the areas where all conference tables D1, D2, and D3 are located.
  • The viewing angle V1 of T1 corresponds to the conference table D1;
  • the viewing angle V2 of T1 corresponds to the conference table D2;
  • the viewing angle V3 of T1 corresponds to the conference table D3;
  • the viewing angle V1 of T2 corresponds to the conference table D1;
  • the viewing angle V2 of T2 corresponds to the conference table D2;
  • the viewing angle V3 of T2 corresponds to the conference table D3;
  • the viewing angle V1 of T3 corresponds to the conference table D1;
  • the viewing angle V2 of T3 corresponds to the conference table D2;
  • the viewing angle V3 of T3 corresponds to the conference table D3.
  • Each participant at each site can view the participants of all the sites.
  • All videos of each site are uploaded to the MCU, which stitches the corresponding videos and sends the result to the destination site. Now assume that site PA needs to view the content of all sites; in the eye-to-eye case, the correspondence between transmitted and received video streams is:
  • The notation "PB T1 C1 + PC T1 C1 + PD T1 C1" indicates that the MCU splices the T1 C1 videos of sites PB, PC, and PD into a new video stream.
  • The three display devices at the site respectively display the three remote sites.
  • The video stream correspondence is as follows, assuming that site PB is shown on display T1, site PC on display T2, and site PD on display T3:
  • Layout 10 in the embodiment of the present invention consists of three sites, PA, PB, and PC.
  • The sites are configured differently.
  • The three sites are controlled by the MCU.
  • Site A contains three large flat panel displays as the main display devices, for example 65-inch or 70-inch panels showing high-definition images close to life size; flat panel display technologies such as PDP TV, LCD TV, or DLP rear projection TV can be used.
  • The three displays are arranged along a folded plane, with the middle display adjoining the two side displays, so that the images of the three displays together form a complete representation of the meeting room scene.
  • The displays used in the system are multi-view displays. Each display in the figure above has three viewing angles, each viewing angle can present different content, and each viewing angle corresponds to one participant at the site.
  • An HD camera is mounted on top of each display, so that the participants are captured from three angles.
  • The cameras support HD capture at 720p and 1080p resolution.
  • An auxiliary display at the side of the conference table can show information such as shared data. For the specific device orientation and coverage relationship of Site A, refer to Layout 1.
  • Site B contains one large multi-view flat panel display as its display device, for example a 65-inch or 70-inch panel showing high-definition images close to life size; flat panel display technologies such as PDP TV, LCD TV, or DLP rear projection TV can be used.
  • The display device has one viewing angle and can display different content. The site has one participant, arranged as shown in the figure above. Three HD cameras are set up in convergence mode at both sides and in the middle of the display, and each camera captures the participant from a different angle. An auxiliary display at the side of the conference table can show information such as shared data.
  • For the specific device orientation and coverage relationship of Site B, refer to Layout 3.
  • Site C uses two large multi-view flat panel displays as display devices, for example 65-inch or 70-inch panels showing high-definition images close to life size; flat panel display technologies such as PDP TV, LCD TV, or DLP rear projection TV can be used.
  • The display devices have multiple viewing angles and can show different content.
  • The site has two participant areas, arranged as shown in the figure above.
  • Each participant in a participant area can watch the multi-view display devices.
  • Each camera can capture all participants from a different angle. An auxiliary display at the side of the conference table can show information such as shared data.
  • For the specific device orientation and coverage relationship of Site C, refer to Layout 2.
  • Camera C1 on display T1 captures the conference area D1;
  • camera C2 on display T1 captures the conference area D1;
  • camera C1 on display T2 captures the conference area D1;
  • participant area D1 can see the viewing angle V2 of display T1;
  • participant area D1 can see the viewing angle V1 of display T2;
  • participant area D2 can see the viewing angle V3 of display T1;
  • participant area D2 can see the viewing angle V2 of display T2.
  • The three displays show the content of the two sites B and C:
  • T1 shows Site B;
  • T2 shows the D1 content of Site C;
  • T3 shows the D2 content of Site C.
  • Site C can choose to view the content of the two sites A and B:
  • T1 shows the content of Site B;
  • T2 shows the content of the D2 participant at the site.
  • a T3 C2 >C T2 V2
  • Multi-view local video information is acquired and sent to the remote end for display, while the local end displays the multi-view remote video information received from the remote end; as long as the viewing angles are matched properly during display, an eye-to-eye conference effect can be achieved.
  • the storage medium may be a magnetic disk, an optical disk, a read-only memory (ROM), or a random access memory (RAM).
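
The view-to-table correspondence listed for Layout 8 and the composite stream notation used for the MCU can be sketched as follows. This is only an illustration under stated assumptions: the names `VIEW_TO_TABLE` and `composite_label` are hypothetical and do not appear in the patent, and the code only models the labels, not any actual video stitching.

```python
# Labels taken from the Layout 8 description above.
DISPLAYS = ("T1", "T2", "T3")
VIEWS = ("V1", "V2", "V3")
TABLES = ("D1", "D2", "D3")

# On every display, viewing angle Vk is directed at conference table Dk.
# (Hypothetical data structure; the patent only lists the nine pairs.)
VIEW_TO_TABLE = {
    (display, view): table
    for display in DISPLAYS
    for view, table in zip(VIEWS, TABLES)
}


def composite_label(sites, display, camera):
    """Build the notation for a stream the MCU splices from several sites,
    e.g. "PB T1 C1 + PC T1 C1 + PD T1 C1" (hypothetical helper)."""
    return " + ".join(f"{site} {display} {camera}" for site in sites)


if __name__ == "__main__":
    print(VIEW_TO_TABLE[("T2", "V3")])
    print(composite_label(["PB", "PC", "PD"], "T1", "C1"))
```

A lookup table of this kind makes the matching rule explicit: the content sent to a given viewing angle of a display is the content intended for the table that angle faces.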

Landscapes

  • Engineering & Computer Science (AREA)
  • Multimedia (AREA)
  • Signal Processing (AREA)
  • Two-Way Televisions, Distribution Of Moving Picture Or The Like (AREA)

Abstract

Disclosed are a method and system for video processing. A video transmission portion of the method comprises: acquiring local video information having different viewing angles, where the number of viewing angles of the local video information is no less than the number of remote observation points; and transmitting the local video information having different viewing angles to a remote end. A video display portion of the method comprises: receiving remote video information having different viewing angles, where the number of viewing angles of the remote video information is no less than the number of local observation points; using a multi-viewing angle display device to display the remote video information of corresponding viewing angles to the different local observation points, thereby implementing an eye-to-eye display effect, where the number of display viewing angles of the multi-viewing angle display device is no less than the number of local observation points; where the number of remote observation points and the number of local observation points are both natural numbers, and where at least one number between the number of remote observation points and the number of local observation points is no less than two. Employment of the present invention allows for the implementation of the eye-to-eye effect in a remote conference system.

Description

Video processing method and system
Technical field
The present invention relates to telepresence technology, and more particularly to a video processing method and system.
Background
Telepresence technology can be used in a video conferencing system that includes two participating parties at different geographical locations; the two parties use video communication to achieve an effect similar to holding the conference in the same place. During video communication, local video information has to be captured on the one hand, and the video information of the remote end has to be displayed on the other. Because the camera and the remote image are at different positions, a participant who looks at the remote image while talking cannot look into the camera, so the "eye-to-eye" effect of a face-to-face conversation cannot be produced.
Figure 1 shows one layout of an existing remote conference system, in which three cameras evenly cover all participants without overlap and the three received video streams are shown on three displays respectively. Clearly, the eye-to-eye effect is obtained on the middle screen only when all participants look straight into the camera; in all other cases the "eye-to-eye" effect cannot be obtained.
Summary of the invention
The technical problem to be solved by the embodiments of the present invention is to provide a video processing method and system that can achieve an eye-to-eye effect in a remote conference system.
To solve the above technical problem, an embodiment of the present invention provides a video processing method for use in a remote conference system. The video sending part of the method includes:
acquiring local video information having different viewing angles, where the number of viewing angles of the local video information is not less than the number of remote observation points;
sending the local video information having different viewing angles to the remote end.
The video display part of the method includes:
receiving remote video information having different viewing angles, where the number of viewing angles of the remote video information is not less than the number of local observation points;
using a multi-view display device to display the remote video information of the corresponding viewing angle to each of the different local observation points, so as to achieve an eye-to-eye display effect, where the number of display viewing angles of the multi-view display device is not less than the number of local observation points;
where the number of remote observation points and the number of local observation points are both natural numbers, and at least one of the two numbers is not less than 2.
An embodiment of the present invention also provides a video processing device for use in a remote conference system. The sending module of the device includes:
a local video acquiring unit, configured to acquire local video information having different viewing angles, where the number of viewing angles of the local video information is not less than the number of remote observation points;
a local video sending unit, configured to send the local video information having different viewing angles to the remote end.
The display module of the device includes:
a remote video receiving unit, configured to receive remote video information having different viewing angles, where the number of viewing angles of the remote video information is not less than the number of local observation points;
a remote video display unit, configured to use a multi-view display device to display the remote video information of the corresponding viewing angle to each of the different local observation points, so as to achieve an eye-to-eye display effect, where the number of display viewing angles of the multi-view display device is not less than the number of local observation points;
where the number of remote observation points and the number of local observation points are both natural numbers, and at least one of the two numbers is not less than 2.
Correspondingly, an embodiment of the present invention also provides a remote conference system. The local end of the system includes:
multiple camera devices with different camera viewing angles, configured to respectively acquire local video information having different viewing angles, where the number of viewing angles of the local video information is not less than the number of remote observation points;
a communication device, configured to send to the remote end the local video information having different viewing angles acquired by the camera devices, and to receive from the remote end remote video information having different viewing angles, where the number of viewing angles of the remote video information is not less than the number of local observation points;
a multi-view display device, configured to display the remote video information of the corresponding viewing angle to each of the different local observation points, so as to achieve an eye-to-eye display effect, where the number of display viewing angles of the multi-view display device is not less than the number of local observation points;
where the number of remote observation points and the number of local observation points are both natural numbers, and at least one of the two numbers is not less than 2.
In the embodiments of the present invention, multi-view local video information is acquired and sent to the remote end for display, while the local end displays the multi-view remote video information received from the remote end; as long as the viewing angles are matched properly during display, an eye-to-eye conference effect can be achieved.
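
The counting conditions stated in this summary can be collected into a small check. The following is a minimal sketch rather than anything defined in the patent: the function name `check_view_counts` and its parameters are assumptions for illustration, and "natural number" is taken here to mean an integer of at least 1.

```python
def check_view_counts(local_views: int, remote_views: int, display_views: int,
                      local_points: int, remote_points: int) -> list[str]:
    """Return a list of violated conditions; an empty list means the setup is valid."""
    problems = []
    # Observation point counts are natural numbers (assumed here to start at 1).
    if local_points < 1 or remote_points < 1:
        problems.append("observation point counts must be at least 1")
    # At least one side must have two or more observation points.
    if max(local_points, remote_points) < 2:
        problems.append("one side needs at least 2 observation points")
    # Sending part: captured local views must cover every remote observation point.
    if local_views < remote_points:
        problems.append("too few local camera views for the remote observation points")
    # Display part: received remote views must cover every local observation point.
    if remote_views < local_points:
        problems.append("too few remote camera views for the local observation points")
    # The multi-view display must offer one viewing angle per local observation point.
    if display_views < local_points:
        problems.append("too few display viewing angles for the local observation points")
    return problems


if __name__ == "__main__":
    # Three participants per side, three cameras per side, a three-view display.
    print(check_view_counts(3, 3, 3, 3, 3))
    # A two-view display cannot serve three local observation points.
    print(check_view_counts(3, 3, 2, 3, 3))
```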
Brief description of the drawings
To describe the technical solutions in the embodiments of the present invention or in the prior art more clearly, the drawings needed for describing the embodiments or the prior art are briefly introduced below. Evidently, the drawings described below show only some embodiments of the present invention, and a person of ordinary skill in the art can derive other drawings from them without creative effort.
Figure 1 is a schematic layout of one end of an existing remote conference system;
Figure 2 is a flowchart of the sending part of the video processing method in an embodiment of the present invention;
Figure 3 is a flowchart of the display part of the video processing method in an embodiment of the present invention;
Figure 4 is a structural diagram of the video processing device in an embodiment of the present invention;
Figure 5 is a structural diagram of the remote conference system in an embodiment of the present invention;
Figure 6 is a schematic diagram of layout 1 according to an embodiment of the present invention;
Figure 7 is a schematic diagram of layout 1 according to an embodiment of the present invention;
Figure 8 is a schematic diagram of layout 1 according to an embodiment of the present invention;
Figure 9 is a schematic diagram of layout 1 including an auxiliary stream display according to an embodiment of the present invention;
Figure 10 is a schematic diagram of layout 1 including an auxiliary stream display according to an embodiment of the present invention;
Figure 11 is a schematic diagram of layout 1 according to an embodiment of the present invention;
Figure 12 is a schematic diagram of the display viewing angles of a multi-view display in an embodiment of the present invention;
Figure 13 is a schematic diagram of the display principle of a multi-view display in an embodiment of the present invention;
Figure 14 is a schematic diagram of layout 1a according to an embodiment of the present invention;
Figure 15 is a schematic diagram of layout 1b according to an embodiment of the present invention;
Figure 16 is a schematic diagram of layout 2 according to an embodiment of the present invention;
Figure 17 is a schematic diagram of layout 3 according to an embodiment of the present invention;
Figure 18 is a schematic diagram of layout 4 according to an embodiment of the present invention;
Figure 19 is a schematic diagram of layout 5 according to an embodiment of the present invention;
Figure 20 is a schematic diagram of layout 6 according to an embodiment of the present invention;
Figure 21 is a schematic diagram of layout 7 according to an embodiment of the present invention;
Figure 22 is a schematic diagram of layout 8 according to an embodiment of the present invention;
Figure 23 is a schematic diagram of layout 9 according to an embodiment of the present invention;
Figure 24 is a schematic diagram of layout 10 according to an embodiment of the present invention.
Detailed description
The technical solutions in the embodiments of the present invention are described clearly and completely below with reference to the drawings in the embodiments. Evidently, the described embodiments are only some, not all, of the embodiments of the present invention. All other embodiments obtained by a person of ordinary skill in the art based on the embodiments of the present invention without creative effort fall within the scope of protection of the present invention.
The eye-to-eye effect rests on both parties being able to observe information from different viewing angles: when one party turns toward a given viewing angle, only the observer at that viewing angle perceives being looked at directly. Based on this principle, the embodiments of the present invention take full account of the number of possible viewing angles on both sides of the conference and acquire a corresponding number of multi-view video streams according to the number of observers, so as to achieve a realistic eye-to-eye effect.
Figures 2 and 3 show a flowchart of the video processing method in an embodiment of the present invention; the method can be used in a remote conference system.
Figure 2 shows the video sending part of the method: 201, acquire local video information having different viewing angles, where the number of viewing angles of the local video information is not less than the number of remote observation points; 202, send the local video information having different viewing angles to the remote end.
When the video information is captured in step 201, camera devices with different camera viewing angles, located at the multi-view display device described below, may be used to acquire the local video information.
Figure 3 shows the video display part of the method: 301, receive remote video information having different viewing angles, where the number of viewing angles of the remote video information is not less than the number of local observation points; 302, use a multi-view display device to display the remote video information of the corresponding viewing angle to each of the different local observation points, so as to achieve an eye-to-eye display effect, where the number of display viewing angles of the multi-view display device is not less than the number of local observation points. The number of remote observation points and the number of local observation points are both natural numbers, and at least one of the two numbers is not less than 2.
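
Steps 201-202 and 301-302 can be sketched as two small functions. This is an illustrative sketch only: the names `ViewStream`, `capture_local_views`, and `assign_views` are hypothetical, no real video is handled, and the rule that stream i is shown to local observation point i is an assumption, since the actual matching depends on the conference layout.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ViewStream:
    site: str  # site that captured the stream
    view: int  # index of the camera viewing angle at that site


def capture_local_views(site: str, num_cameras: int, num_remote_points: int) -> list[ViewStream]:
    """Steps 201-202: produce one stream per camera view, ready to send to the remote end."""
    if num_cameras < num_remote_points:
        raise ValueError("need at least one camera view per remote observation point")
    return [ViewStream(site, view) for view in range(num_cameras)]


def assign_views(remote_streams: list[ViewStream], num_local_points: int,
                 num_display_views: int) -> dict[int, ViewStream]:
    """Steps 301-302: choose which received stream each local observation point sees."""
    if len(remote_streams) < num_local_points:
        raise ValueError("need at least one remote view per local observation point")
    if num_display_views < num_local_points:
        raise ValueError("display needs one viewing angle per local observation point")
    # Assumption: streams are ordered so that stream i matches local observation point i.
    return {point: remote_streams[point] for point in range(num_local_points)}


if __name__ == "__main__":
    sent_by_b = capture_local_views("B", num_cameras=3, num_remote_points=3)
    shown_at_a = assign_views(sent_by_b, num_local_points=3, num_display_views=3)
    for point, stream in shown_at_a.items():
        print(f"observation point {point} sees view {stream.view} of site {stream.site}")
```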
The remote conference system in the above embodiment may include multiple remote ends. In that case, step 301 may consist of selecting one of the multiple remote ends and receiving the remote video information having different viewing angles sent from the selected remote end. In addition, multiple multi-view display devices may be used for display; that is, in step 302 multiple multi-view display devices are used to display the remote video information of the corresponding viewing angles to the different local observation points.
It can be understood that, within one video conference, the flows shown in Figures 2 and 3 run synchronously or substantially synchronously. A remote or local observation point may be the position of a single participant during the conference, or the position of a group of participants (that is, two or more participants may be treated as one observation point without being distinguished).
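
Treating a group of participants as a single observation point can be illustrated as follows. This is a sketch under an assumption that is not in the patent, namely that neighbouring seats are grouped in fixed-size blocks; the function name `group_into_observation_points` is hypothetical.

```python
def group_into_observation_points(seats: list[str], group_size: int) -> list[list[str]]:
    """Group neighbouring seats so that each group counts as one observation point."""
    if group_size < 1:
        raise ValueError("group_size must be at least 1")
    return [seats[i:i + group_size] for i in range(0, len(seats), group_size)]


if __name__ == "__main__":
    seats = ["s1", "s2", "s3", "s4", "s5", "s6"]
    points = group_into_observation_points(seats, 2)
    # The number of views to capture and display follows the number of
    # observation points, not the number of individual participants.
    print(len(points), points)
```

Grouping in this way lowers the number of viewing angles that cameras and displays must provide, at the price of a less exact eye-to-eye effect within each group.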
The above flow does not specify which viewing angles are captured, or at which viewing angles information is displayed, for a given arrangement of observation points. This is because these details depend on the specific conference layout; the above embodiment can only give the overall scheme, and the conference layouts in the later embodiments show how the technical solution achieves the eye-to-eye effect in a remote conference. Moreover, the realism of the eye-to-eye effect is related to the cost of the system. Not every embodiment of the present invention has to achieve the most realistic eye-to-eye effect; an embodiment that achieves only a partial eye-to-eye effect should also be regarded as one of the embodiments of the present invention.
Corresponding to the above method embodiment, an embodiment of the present invention further provides a video processing device 1 for use in a remote conference system. As shown in Figure 4, the sending module 10 of device 1 includes: a local video acquiring unit 100, configured to acquire local video information having different viewing angles, where the number of viewing angles of the local video information is not less than the number of remote observation points; and a local video sending unit 102, configured to send the local video information having different viewing angles to the remote end.
The local video acquiring unit 100 is further configured to acquire the local video information by using camera devices with different camera viewing angles located at the multi-view display device.
The display module 12 of device 1 includes: a remote video receiving unit 120, configured to receive remote video information having different viewing angles, where the number of viewing angles of the remote video information is not less than the number of local observation points; and a remote video display unit 122, configured to use a multi-view display device to display the remote video information of the corresponding viewing angle to each of the different local observation points, so as to achieve an eye-to-eye display effect, where the number of display viewing angles of the multi-view display device is not less than the number of local observation points. The number of remote observation points and the number of local observation points are both natural numbers, and at least one of the two numbers is not less than 2.
The remote video display unit 122 may also be configured to use multiple multi-view display devices to display the remote video information of the corresponding viewing angles to the different local observation points. If the system includes multiple remote ends, the remote video receiving unit 120 is further configured to select one of the multiple remote ends and receive the remote video information having different viewing angles sent from the selected remote end.
Corresponding to the foregoing method and device embodiments, an embodiment of the present invention further provides a remote conference system, which shows how physical devices having the above functions make up the whole system. The figure shows only the connection relationships and does not represent the positional relationships in an actual system. As shown in FIG. 5, the local end of the system includes: a plurality of camera devices 2 having different camera viewing angles, configured to respectively acquire local video information having different viewing angles, where the number of viewing angles of the local video information is not less than the number of remote observation points; a communication device 3, configured to send to the remote end the local video information having different viewing angles obtained by the camera devices, and to receive from the remote end remote video information having different viewing angles, where the number of viewing angles of the remote video information is not less than the number of local observation points; and a multi-view display device 4, configured to display the remote video information of the corresponding viewing angle to different local observation points, so as to achieve an eye-to-eye display effect, where the number of display viewing angles of the multi-view display device is not less than the number of local observation points. The number of remote observation points and the number of local observation points are both natural numbers, and at least one of the two is not less than 2.
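The counting constraints stated above can be checked mechanically. The following Python sketch is illustrative only: the function name and its parameters are hypothetical and do not come from the embodiment, and it assumes that "natural number" means an integer of at least 1.

```python
def check_view_counts(local_views, remote_views, display_views,
                      local_points, remote_points):
    """Return the list of violated constraints (empty if the setup is valid).

    All names are hypothetical; "natural number" is assumed to mean >= 1.
    """
    problems = []
    if local_points < 1 or remote_points < 1:
        problems.append("observation point counts must be at least 1")
    if local_views < remote_points:
        problems.append("local video has fewer viewing angles than remote observation points")
    if remote_views < local_points:
        problems.append("remote video has fewer viewing angles than local observation points")
    if display_views < local_points:
        problems.append("display has fewer viewing angles than local observation points")
    if max(local_points, remote_points) < 2:
        problems.append("at least one side needs two or more observation points")
    return problems
```

For example, an end with three cameras, a three-view display, and three observation points on each side passes every check, whereas a conventional one-camera, one-view, one-observer arrangement on both sides fails only the last one.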
A plurality of multi-view display devices may be provided at the local end, so that the number of multi-view display devices in the system is not less than the number of remote observation points, thereby achieving a better eye-to-eye effect.
The multi-view display device may be a multi-view display, or a combination of a plurality of projectors and a projection screen having a multi-view display function.
If the system further includes a plurality of remote ends, the communication device at the local end is further configured to select one of the remote ends with which to receive and send multi-view video information.
To explain more clearly how the eye-to-eye effect is achieved in the foregoing embodiments, several specific system layouts are described below, together with the correspondence between the capture and the display of the different video information.
FIG. 6 to FIG. 11 show layout 1 of an embodiment of the present invention. Layout 1 gives an example of a specific positional relationship among the component devices of the system shown in FIG. 5, together with the corresponding viewing angles of each device. FIG. 6 to FIG. 10 show the overall arrangement of the local end and the remote end of the system, and FIG. 11 shows the specific layout of one end (the two ends are symmetric).
The system includes two sites, A and B, which are directly connected through a network. Each site contains three large flat panel displays as the main display devices, for example 65-inch or 70-inch flat panel displays, for presenting high-definition pictures at close to life size; flat panel display technologies such as PDP TV, LCD TV, or DLP rear-projection TV may be used. The three displays are arranged along a folded surface, with the middle display abutting the displays on both sides, so that the images of the three displays form one complete presentation of the conference room scene. An auxiliary display at the side of the conference table can display shared data and similar information. The display devices used in the system are displays with multiple viewing angles. As shown in FIG. 11, each display has three viewing angles, each of which can present different content; the viewing angles of each display are shown in FIG. 12.
The display shown in FIG. 12 can display different content at different viewing angles. In the figure, the object to be presented is drawn with a dashed line and has three surfaces; the multi-view display can present the content of surface 1, surface 2, and surface 3 at three different viewing angles respectively, as if the object were actually placed at the position of the display. The display device is implemented on the parallax barrier principle. As shown in FIG. 13, an observer at each viewing angle sees the image content intended for that viewing angle.
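One common way to drive a parallax-barrier panel is to interleave the columns of the per-view images, so that the barrier exposes a different subset of columns to each viewing direction. The Python sketch below illustrates only that interleaving step. It is an assumption made for illustration, not a description of the display used in the embodiment: the function name is hypothetical, images are plain nested lists, and a real panel would typically interleave at sub-pixel level according to its barrier geometry.

```python
def interleave_views(views):
    """Build a composite frame in which column x is taken from views[x % N].

    views: list of N equally sized images, each a list of rows of pixels.
    Hypothetical helper; whole-column interleaving is assumed for simplicity.
    """
    n = len(views)
    height = len(views[0])
    width = len(views[0][0])
    return [[views[x % n][y][x] for x in range(width)]
            for y in range(height)]
```

With three one-row views filled with "a", "b", and "c", the composite row cycles through a, b, c, so each third of the columns carries one view.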
The layout of one end of the system is described in detail below. As shown in FIG. 11, three high-definition cameras are placed at the middle of each display. The cameras support high-definition image capture at 720p and 1080p resolution, and the three cameras of a display together cover all participants without overlap.
In each conference room, the conference tables from left to right are D1, D2, and D3, and the displays are T1, T2, and T3. The three cameras on each display are C1, C2, and C3 from left to right, and the three viewing angles of each display are V1, V2, and V3 from left to right.
Accordingly, on top of the left display, camera C1 covers conference table D1, camera C2 covers conference table D2, and camera C3 covers conference table D3. The same applies to the cameras on top of the middle display and on top of the right display: C1 covers D1, C2 covers D2, and C3 covers D3.
For each of the displays T1, T2, and T3, viewing angle V1 corresponds to conference table D1, viewing angle V2 corresponds to conference table D2, and viewing angle V3 corresponds to conference table D3.
To achieve the eye-to-eye effect, assume that the two sites shown in FIG. 6 are site A and site B. When the video streams of the cameras at site A are sent to site B for display, the following sending and receiving correspondence applies to the middle seating area D2:
A T1 C2 => B T2 V3
A T2 C2 => B T2 V2
A T3 C2 => B T2 V1
For the left seating area D1, as shown in FIG. 7, the correspondence is as follows:
A T1 C1 => B T3 V3
A T2 C1 => B T3 V2
A T3 C1 => B T3 V1
For the right seating area D3, as shown in FIG. 8, the correspondence is as follows:
A T1 C3 => B T1 V3
A T2 C3 => B T1 V2
A T3 C3 => B T1 V1
When the video streams of the cameras at site B are sent to site A for display, the correspondence is analogous:
B T1 C1 => A T3 V3
B T2 C1 => A T3 V2
B T3 C1 => A T3 V1
B T1 C2 => A T2 V3
B T2 C2 => A T2 V2
B T3 C2 => A T2 V1
B T1 C3 => A T1 V3
B T2 C3 => A T1 V2
B T3 C3 => A T1 V1
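The eighteen rows listed for layout 1 follow one rule: the stream of camera Ck on display Tj at the sending site is shown at the receiving site on display T(4-k) at viewing angle V(4-j). The Python sketch below restates that rule; the function names are hypothetical, and the rule itself is inferred from the rows above rather than stated in this form by the embodiment.

```python
def route_layout1(display, camera, n=3):
    """Map a sender (display j, camera k) to the receiver (display, view).

    Indices are 1-based as in the text (T1..T3, C1..C3, V1..V3).
    The camera index selects the mirrored display; the display index
    selects the mirrored viewing angle.
    """
    return (n + 1 - camera, n + 1 - display)


def format_route(src_site, dst_site, display, camera, n=3):
    """Render one routing row in the notation used in the text."""
    t, v = route_layout1(display, camera, n)
    return f"{src_site} T{display} C{camera} => {dst_site} T{t} V{v}"
```

For instance, format_route("A", "B", 1, 2) reproduces the first row given for seating area D2.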
In a specific embodiment, the above video sending and receiving correspondence may be implemented in two ways. The position information of the display devices and participants at a site is described using the following convention. Taking the middle of all participant areas as the center and facing the middle of all main display devices, the leftmost display device is display device 0, the second display device from the left is display device 1, and so on. Likewise, the leftmost participant area is (camera) coverage area 0, the second participant area from the left is coverage area 1, and so on.
Sites A and B may also have an auxiliary display T4 for displaying the auxiliary-stream video. In this scenario, the sending and receiving correspondence of the video streams can take two forms: one is consistent with the scheme of FIG. 6 to FIG. 8, as shown in FIG. 9; the other is a mirrored scheme, as shown in FIG. 10.
Before the two parties in the system hold a video conference, they negotiate, and the negotiated content includes the above video sending and receiving correspondence information. The process is as follows.
1. The sending and receiving relationship of each video stream is expressed in the capability negotiation phase; the Session Description Protocol can be used to describe this correspondence.
For example, sending the video stream T1 C3 can be described as:
"AuxStream:OFF SndPos:0 CovPos:2"
2. The sender sends all of its local video streams to the other party. The other party receives the video streams, decodes them, and displays each video on a specific display according to the above correspondence. This requires the receiver to be able to identify the position associated with each received video, so the following structure is added to the user data part of the RTP header:
au_flg  sender_pos  cover_pos  recver_pos  displayer_pos
au_flg: 1 bit
Auxiliary stream flag: 1 indicates an auxiliary stream, and 0 indicates a non-auxiliary stream (that is, a main stream).
sender_pos: 8 bits
Position of the video stream sender (the position of the display). For a main stream, all bits identify the position. For an auxiliary stream, the top 2 bits give the vertical position of the auxiliary stream: 11 means above the main display devices, 10 means level with the main display devices, and 00 means below the main display devices. [one area, all areas]
cover_pos: 9 bits
Coverage area of the video stream. The top bit indicates whether the whole area is covered: 1 means the whole area is covered, in which case the following 8 bits are meaningless; 0 means only one area is covered, in which case the following bits 0-7 give the position of the covered area.
recver_pos: 8 bits
Position of the video receiver, used for capability negotiation.
displayer_pos: 8 bits
Position of the video display, used for capability negotiation.
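The two signalling steps above can be sketched in Python as follows. This is a minimal illustration under stated assumptions, not the embodiment's wire format: the text gives only the field names and widths, so the big-endian packing order, the padding of the 34 field bits to 5 bytes, the "ON" value for the auxiliary flag, and all function names are hypothetical.

```python
# Field names and bit widths as listed in the text (1 + 8 + 9 + 8 + 8 = 34 bits).
FIELDS = (("au_flg", 1), ("sender_pos", 8), ("cover_pos", 9),
          ("recver_pos", 8), ("displayer_pos", 8))
TOTAL_BITS = sum(width for _, width in FIELDS)
PADDED_BYTES = (TOTAL_BITS + 7) // 8           # assumed: round up to 5 bytes
PAD_BITS = PADDED_BYTES * 8 - TOTAL_BITS       # assumed: 6 trailing zero bits


def describe_stream(display_index, coverage_index, aux=False):
    """Step 1: description string in the style of the example in the text.

    "ON" for an auxiliary stream is an assumption; the text only shows "OFF".
    """
    flag = "ON" if aux else "OFF"
    return f"AuxStream:{flag} SndPos:{display_index} CovPos:{coverage_index}"


def cover_pos(region=None):
    """Encode cover_pos: top bit set means the whole area is covered."""
    if region is None:
        return 0x100
    if not 0 <= region <= 0xFF:
        raise ValueError("region index must fit in 8 bits")
    return region


def pack_position(**values):
    """Step 2: pack the fields, most significant first, into 5 bytes."""
    word = 0
    for name, width in FIELDS:
        value = values.get(name, 0)
        if not 0 <= value < (1 << width):
            raise ValueError(f"{name} does not fit in {width} bits")
        word = (word << width) | value
    return (word << PAD_BITS).to_bytes(PADDED_BYTES, "big")


def unpack_position(data):
    """Inverse of pack_position; returns a dict keyed by field name."""
    word = int.from_bytes(data[:PADDED_BYTES], "big") >> PAD_BITS
    result = {}
    for name, width in reversed(FIELDS):
        result[name] = word & ((1 << width) - 1)
        word >>= width
    return result
```

Using the zero-based convention described earlier, the stream T1 C3 is display 0 with coverage area 2, so describe_stream(0, 2) yields the example string, and pack_position(sender_pos=0, cover_pos=cover_pos(2)) yields the matching binary structure.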
FIG. 14 shows layout 1a of an embodiment of the present invention, which is similar to layout 1. The cameras and conference tables are arranged as in layout 1; the difference is that the display devices use projection. Nine high-resolution, high-brightness projectors, such as Optoma 1080p projectors, are placed at nine different positions behind the projection screen, and a lenticular grating is arranged on the projection screen so that different content is seen from different angles.
FIG. 15 shows layout 1b of an embodiment of the present invention, which is also similar to layout 1. The cameras and conference tables are arranged as in layout 1; the difference is that the display devices use projection. Nine high-resolution, high-brightness projectors, such as Optoma 1080p projectors, are placed at nine different positions in front of the projection screen, and a lenticular grating is arranged on the projection screen so that different content is seen from different angles.
It can be understood that the video capture and display correspondence in layouts 1a and 1b is the same as in layout 1, as is the negotiation process.
FIG. 16 shows layout 2 of an embodiment of the present invention. The system includes two sites, A and B, which are directly connected through a network and configured identically. Each site uses two large multi-view flat panel displays as display devices, for example 65-inch or 70-inch flat panel displays, for presenting high-definition pictures at close to life size; flat panel display technologies such as PDP TV, LCD TV, or DLP rear-projection TV may be used. Each display device has multiple viewing angles and can display different content at each of them. Each site has 3 participant areas, arranged as shown in the figure, and a participant in a participant area sees exactly the content of one viewing angle of a multi-view display device. Two high-definition cameras are arranged on top of each display, at the sides and the middle, in a converging manner, and each camera can shoot all participants from a different angle. An auxiliary display at the side of the conference table can display shared data and similar information.
On top of the display at position T1, camera C1 shoots participant area D1 and camera C2 shoots participant area D2; on top of the display at position T2, camera C1 shoots participant area D1 and camera C2 shoots participant area D2.
Participant area D1 sees viewing angle V1 of display T1 and viewing angle V1 of display T2; participant area D2 sees viewing angle V2 of display T1 and viewing angle V2 of display T2.
To achieve the "eye-to-eye" effect, the sending and receiving relationship of the video streams in this system is:
A T1 C1 => B T2 V2
A T1 C2 => B T2 V1
A T2 C1 => B T1 V2
A T2 C2 => B T1 V1
B T1 C1 => A T2 V2
B T1 C2 => A T2 V1
B T2 C1 => A T1 V2
B T2 C2 => A T1 V1
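In the rows listed for layout 2, the display index and the camera index are each mirrored independently: display Tj maps to display T(3-j), and camera Ck maps to viewing angle V(3-k). Note that this differs from layout 1, where the camera index selects the receiving display. The Python sketch below generates the table from that rule; the function names are hypothetical, and the rule is inferred from the rows above.

```python
def route_layout2(display, camera, displays=2, views=2):
    """Map a sender (display j, camera k) to the receiver (display, view)."""
    return (displays + 1 - display, views + 1 - camera)


def routing_table(src_site, dst_site, displays=2, cameras=2):
    """List every routing row from src_site to dst_site in text notation."""
    rows = []
    for d in range(1, displays + 1):
        for c in range(1, cameras + 1):
            t, v = route_layout2(d, c, displays, cameras)
            rows.append(f"{src_site} T{d} C{c} => {dst_site} T{t} V{v}")
    return rows
```

Calling routing_table("A", "B") reproduces the four rows listed for the A-to-B direction, and routing_table("B", "A") the four rows for the opposite direction.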
For other details of this embodiment, refer to the description of layout 1; they are not repeated here.
FIG. 17 shows layout 3 of an embodiment of the present invention. The system includes two sites, A and B, which are directly connected through a network and configured identically. Each site uses one large multi-view flat panel display as the display device, for example a 65-inch or 70-inch flat panel display, for presenting high-definition pictures at close to life size; flat panel display technologies such as PDP TV, LCD TV, or DLP rear-projection TV may be used. The display device has 3 viewing angles and can display different content at each of them. Each site has 3 participants, arranged as shown in the figure, and each participant sees exactly the content of one viewing angle of the multi-view display device. Three high-definition cameras are arranged on top of the display, at the two sides and the middle, in a converging manner, and each camera can shoot all participants from a different angle. An auxiliary display at the side of the conference table can display shared data and similar information.
Assume that the three cameras at a site are C1, C2, and C3 from left to right, and that the three viewing angles of the multi-view display device are V1, V2, and V3 from left to right. The configuration is then as follows:
C1, C2, and C3 each shoot all participants P1, P2, and P3, from three different angles;
The three participants are distributed over the three viewing angles of the multi-view display: viewing angle V1 corresponds to P1, viewing angle V2 corresponds to P2, and viewing angle V3 corresponds to P3.
To achieve the eye-to-eye effect, the sending and receiving correspondence of the video streams of the two sites is:
A C1 => B V3
A C2 => B V2
A C3 => B V1
B C1 => A V3
B C2 => A V2
B C3 => A V1
For other details of this embodiment, refer to the description of layout 1; they are not repeated here.
FIG. 18 shows layout 4 of an embodiment of the present invention. The system includes two sites, A and B, which are directly connected through a network and configured identically. Each site uses one large multi-view flat panel display as the display device, for example a 65-inch or 70-inch flat panel display, for presenting high-definition pictures at close to life size; flat panel display technologies such as PDP TV, LCD TV, or DLP rear-projection TV may be used. The display device has 2 viewing angles and can display different content at each of them. Each site has 2 participants, arranged as shown in the figure, and each participant sees exactly the content of one viewing angle of the multi-view display device. Two high-definition cameras are arranged on top of the display, at the sides and the middle, in a converging manner, and each camera can shoot all participants from a different angle. An auxiliary display at the side of the conference table can display shared data and similar information.
Assume that the two cameras at a site are C1 and C2 from left to right, and that the two viewing angles of the multi-view display device are V1 and V2 from left to right. The configuration is then as follows:
C1 and C2 each shoot all participants P1 and P2, from two different angles;
The two participants are distributed over the two viewing angles of the multi-view display: viewing angle V1 corresponds to P1, and viewing angle V2 corresponds to P2.
To achieve the eye-to-eye effect, the sending and receiving correspondence of the video streams of the two sites is:
A C1 => B V2
A C2 => B V1
B C1 => A V2
B C2 => A V1
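When each site has a single multi-view display, as in layouts 3 and 4, the listed rows reduce to mirroring the camera index: with n cameras and n viewing angles, camera Ck is shown at viewing angle V(n+1-k). A minimal Python sketch of that rule follows; the function name is hypothetical, and the rule is inferred from the rows listed for these two layouts.

```python
def mirror_view(camera, views):
    """Return the 1-based viewing angle that shows the stream of a camera.

    camera: 1-based camera index at the sending site (left to right).
    views:  number of viewing angles of the receiving display.
    """
    if not 1 <= camera <= views:
        raise ValueError("camera index out of range")
    return views + 1 - camera
```

With views=3 the cameras C1, C2, C3 map to V3, V2, V1, and with views=2 the cameras C1, C2 map to V2, V1, matching the two tables.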
For other details of this embodiment, refer to the description of layout 1; they are not repeated here.
FIG. 19 shows layout 5 of an embodiment of the present invention. The system includes two sites, A and B, which are directly connected through a network. Site A contains one large flat panel display as the display device, for example a 65-inch or 70-inch flat panel display, for presenting high-definition pictures at close to life size; flat panel display technologies such as PDP TV, LCD TV, or DLP rear-projection TV may be used. This display device is a conventional display device with only one viewing angle. Three high-definition cameras are arranged at the two ends and the middle of the top of the display in a converging manner, and can shoot participant P1 from 3 different angles. Site B contains one large multi-view flat panel display as the display device, for example a 65-inch or 70-inch flat panel display, for presenting high-definition pictures at close to life size; flat panel display technologies such as PDP TV, LCD TV, or DLP rear-projection TV may be used. This display device has 3 viewing angles and can display different content at each of them. Site B has 3 participants, arranged as shown in the figure, and each participant sees exactly the content of one viewing angle of the multi-view display device. One high-definition camera, which supports high-definition image capture at 720p and 1080p resolution, is placed at the middle of the top of the display device and can cover all participants at the site. An auxiliary display at the side of the conference table can display shared data and similar information.
To achieve the eye-to-eye effect, the sending and receiving relationship of the video streams of the two sites is:
A C1 => B V3
A C2 => B V2
A C3 => B V1
B C1 => A V1
B C1 => A V3
B C2 => A V2
B C3 => A V1
B C1 => A V1
For other details of this embodiment, refer to the description of layout 1; they are not repeated here.
FIG. 20 shows layout 6 of an embodiment of the present invention. The system includes two sites, A and B, which are directly connected through a network. Site A contains three large flat panel displays as display devices, for example 65-inch or 70-inch flat panel displays, for presenting high-definition pictures at close to life size; flat panel display technologies such as PDP TV, LCD TV, or DLP rear-projection TV may be used. These display devices are conventional display devices, each with only one viewing angle. Three high-definition cameras are arranged on top of the displays and can shoot all participants P1, P2, and P3 from 3 different angles respectively. Site B contains one large multi-view flat panel display as the display device, for example a 65-inch or 70-inch flat panel display, for presenting high-definition pictures at close to life size; flat panel display technologies such as PDP TV, LCD TV, or DLP rear-projection TV may be used. This display device has 3 viewing angles. Site B has 3 participants, arranged as shown in the figure, and each participant sees exactly the content of one viewing angle of the multi-view display device. Three high-definition cameras, which support high-definition image capture at 720p and 1080p resolution, are placed at the middle and the two ends of the top of the display device and capture participant images from 3 angles respectively. An auxiliary display at the side of the conference table can display shared data and similar information.
Participants at site A can choose which of the display devices at their site shows the video stream from site B; typically, the video stream is shown on display T2.
For other details of this embodiment, refer to the description of layout 1; they are not repeated here.
FIG. 21 shows layout 7 of an embodiment of the present invention. The system includes 3 sites, PA, PB, and PC, which are configured identically. Each site has one large multi-view flat panel display as the display device, for example a 65-inch or 70-inch flat panel display, for presenting high-definition pictures at close to life size; flat panel display technologies such as PDP TV, LCD TV, or DLP rear-projection TV may be used. The display device has 3 viewing angles and can display different content at each of them. Each site has 3 participants, arranged as shown in the figure, and each participant sees exactly the content of one viewing angle of the multi-view display device. Three high-definition cameras are arranged on top of the display, at the two sides and the middle, in a converging manner, and each camera can shoot all participants from a different angle. An auxiliary display at the side of the conference table can display shared data and similar information.
The 3 sites PA, PB, and PC are connected through an MCU, and media capability negotiation is performed before the conference starts. Each site uploads all of its video to the MCU. Typically, each site can choose which remote site to watch; for example, site PA can choose to watch the content of site PB or of site PC. Assuming that at some moment site PA is watching the content of site PC, the MCU needs to send the 3 video streams of site PC to PA. With eye-to-eye display, the sending and receiving correspondence of the video streams of the two sites (forwarded by the MCU) is:
PC C1 => PA V3
PC C2 => PA V2
PC C3 => PA V1;
Similar sending and receiving correspondences apply to sites PB and PC.
If a site needs to see the other two sites at the same time, the MCU must stitch the images of the other two sites together to form new video streams, and the stitching must respect the correspondence of viewpoints. Assuming that site PA needs to see the content of sites PB and PC at the same time, then with the eye-to-eye function the sending and receiving correspondence of the video streams is:
PC C1 + PB C1 => PA V3
PC C2 + PB C2 => PA V2
PC C3 + PB C3 => PA V1;
Here, PC C1 + PB C1 denotes that the MCU stitches the C1 video of site PC and the C1 video of site PB together to form a new video stream.
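The MCU behaviour described for layout 7 can be summarized as a plan that, for each viewing angle of the watching site, names the streams to be forwarded or stitched. The Python sketch below builds such a plan. It covers only the bookkeeping, not the image stitching itself, and the function name and the dictionary representation are hypothetical.

```python
def mcu_plan(viewer, sources, views=3):
    """Return {viewing angle label: [stream labels to stitch in order]}.

    viewer:  name of the watching site, for example "PA".
    sources: names of the watched sites, in left-to-right stitching order.
    A single source corresponds to plain forwarding without stitching.
    """
    plan = {}
    for camera in range(1, views + 1):
        view = views + 1 - camera  # camera index is mirrored, as in the text
        plan[f"{viewer} V{view}"] = [f"{site} C{camera}" for site in sources]
    return plan
```

Here mcu_plan("PA", ["PC"]) corresponds to forwarding the three streams of site PC, while mcu_plan("PA", ["PC", "PB"]) corresponds to the stitched case in which PA watches both other sites.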
For other details of this embodiment, refer to the description of layout 1; they are not repeated here.
FIG. 22 shows layout 8 of an embodiment of the present invention. The system includes 4 sites, PA, PB, PC, and PD, which are configured identically; the 4 sites are controlled through an MCU.
Each site contains 3 large flat panel displays as the main display devices, for example 65-inch or 70-inch flat panel displays, for presenting high-definition pictures at close to life size; flat panel display technologies such as PDP TV, LCD TV, or DLP rear-projection TV may be used. The 3 displays are arranged along a folded surface, with the middle display abutting the displays on both sides, so that the images of the 3 displays form one complete presentation of the conference room scene. The display devices used in the system are displays with multiple viewing angles. As shown in the figure, each display has 3 viewing angles, each of which can present different content, and each viewing angle corresponds to one participant at the site. One high-definition camera is arranged on top of each display, so that participant images are captured from 3 angles; the cameras support high-definition image capture at 720p and 1080p resolution. An auxiliary display at the side of the conference table can display shared data and similar information.
假设每个会议室的会议桌按照从左至右分别为 Dl、 D2、 D3 , 显示器为 Tl、 Τ2、 Τ3,, 每台显示器的 3个视角从左至右分别为 VI、 V2、 V3; 3 台摄 像机从左至右分别为 Tl— Cl、 T2— Cl、 T3— CI,则摄像机 Tl— Cl、 T2— Cl、 T3 C1 均可以独立覆盖所有会议桌 Dl、 D2、 D3所在区域;  Assume that the conference table of each conference room is Dl, D2, D3 from left to right, and the display is Tl, Τ2, Τ3, and the three viewing angles of each display are VI, V2, V3 from left to right; From left to right, the cameras are Tl-Cl, T2—Cl, T3—CI, respectively, and the cameras Tl—Cl, T2—Cl, and T3 C1 can independently cover the areas where all conference tables D1, D2, and D3 are located;
T1的视角 VI对应于会议桌 Dl, T1的视角 V2对应于会议桌 D2, T1的视 角 V3对应于会议桌 D3; T2的视角 VI对应于会议桌 Dl, T2的视角 V2对应 于会议桌 D2, T2的视角 V3对应于会议桌 D3; T3的视角 VI对应于会议桌 D1, T3的视角 V2对应于会议桌 D2, T3的视角 V3对应于会议桌 D3。  The viewing angle VI of T1 corresponds to the conference table D1, the viewing angle V2 of T1 corresponds to the conference table D2, the viewing angle V3 of T1 corresponds to the conference table D3; the viewing angle VI of T2 corresponds to the conference table D1, and the viewing angle V2 of T2 corresponds to the conference table D2, The viewing angle V3 of T2 corresponds to the conference table D3; the viewing angle VI of T3 corresponds to the conference table D1, the viewing angle V2 of T3 corresponds to the conference table D2, and the viewing angle V3 of T3 corresponds to the conference table D3.
该系统中要求各个会场的每个与会者均可以看到其余所有会场的与会者, 则在会议进行中各个会场将本会场所有视频上传至 MCU, 在 MCU中完成对 应视频的拼接, 后发送到目的会场; 现假设 PA会场需要观看到所有会场的内 容, 则在满足眼对眼情况下, 视频流的发送接收对应关系为:  In this system, each participant in each site can view the participants of all the sites. In the conference, all the videos of the site are uploaded to the MCU, and the corresponding video is stitched in the MCU. Destination site; Now assume that the PA site needs to view the content of all sites, then in the case of eye-to-eye, the correspondence between the transmission and reception of the video stream is:
PB T1 C1 + PC T1 C1 + PD T1 C1 => PA T3 V3
PB T1 C2 + PC T1 C2 + PD T1 C2 => PA T2 V3
PB T1 C3 + PC T1 C3 + PD T1 C3 => PA T1 V3
PB T2 C1 + PC T2 C1 + PD T2 C1 => PA T3 V2
PB T2 C2 + PC T2 C2 + PD T2 C2 => PA T2 V2
PB T2 C3 + PC T2 C3 + PD T2 C3 => PA T1 V2
PB T3 C1 + PC T3 C1 + PD T3 C1 => PA T3 V1
PB T3 C2 + PC T3 C2 + PD T3 C2 => PA T2 V1
PB T3 C3 + PC T3 C3 + PD T3 C3 => PA T1 V1
Here, PB T1 C1 + PC T1 C1 + PD T1 C1 indicates that the MCU splices the T1 C1 videos of sites PB, PC, and PD to form a new video stream.
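The nine correspondences listed above follow a regular index pattern, which the following Python sketch reproduces. The rule used here (remote display Ti with camera Cj goes to local display T(4-j), viewing angle V(4-i)) is read off the listed correspondences and is not stated in the text; the function names `layout8_destination` and `layout8_table` are hypothetical.

```python
def layout8_destination(display: int, camera: int, n: int = 3) -> tuple[int, int]:
    """Map a remote (display, camera) pair to the local (display, view) pair.

    Assumption: this index-reversal rule is inferred from the nine
    correspondences listed for site PA; the text does not state it.
    """
    if not (1 <= display <= n and 1 <= camera <= n):
        raise ValueError("display and camera indices must lie in 1..n")
    # The camera index selects the local display, the remote display index
    # selects the local viewing angle, and both are mirrored left to right.
    return n + 1 - camera, n + 1 - display


def layout8_table(remote_sites=("PB", "PC", "PD"), local_site="PA", n=3):
    """Return the correspondence lines in the notation used in the text."""
    lines = []
    for display in range(1, n + 1):
        for camera in range(1, n + 1):
            dst_display, dst_view = layout8_destination(display, camera, n)
            sources = " + ".join(f"{s} T{display} C{camera}" for s in remote_sites)
            lines.append(f"{sources} => {local_site} T{dst_display} V{dst_view}")
    return lines


if __name__ == "__main__":
    for line in layout8_table():
        print(line)
```

Running the script prints nine lines in the same order as the list for site PA, which makes it easy to compare the inferred rule against the listed correspondences.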
Another form of this embodiment is as follows: the three display devices at a site each display one of the three remote sites. Assuming that site PB is displayed on display T1, site PC on display T2, and site PD on display T3, the eye-to-eye relationship gives the following video stream correspondence:
PB T3 C1 => PA T1 V1
PB T2 C1 => PA T1 V2
PB T1 C1 => PA T1 V3
PC T3 C1 => PA T2 V1
PC T2 C1 => PA T2 V2
PC T1 C1 => PA T2 V3
PD T3 C1 => PA T3 V1
PD T2 C1 => PA T3 V2
PD T1 C1 => PA T3 V3
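The per-site form above can be sketched the same way. In this minimal Python sketch, the local display is chosen by the site assignment given in the text (PB on T1, PC on T2, PD on T3), and the viewing angle is the mirrored index of the remote display; the mirroring rule is inferred from the listed correspondences, and the names `ASSIGNMENT` and `per_site_destination` are hypothetical.

```python
# Site-to-display assignment taken from the text: PB on T1, PC on T2, PD on T3.
ASSIGNMENT = {"PB": 1, "PC": 2, "PD": 3}


def per_site_destination(site: str, display: int,
                         site_to_display: dict[str, int] = ASSIGNMENT,
                         n: int = 3) -> tuple[int, int]:
    """Return the local (display, view) for camera C1 above a remote display.

    Assumption: the viewing angle is the mirrored remote display index,
    as read off the nine listed correspondences.
    """
    if not 1 <= display <= n:
        raise ValueError("display index must lie in 1..n")
    return site_to_display[site], n + 1 - display


if __name__ == "__main__":
    for site in ("PB", "PC", "PD"):
        for display in (3, 2, 1):
            dst_display, dst_view = per_site_destination(site, display)
            print(f"{site} T{display} C1 => PA T{dst_display} V{dst_view}")
```

Keeping the assignment in a dictionary means a different choice of which remote site appears on which local display only changes the data, not the rule.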
For other details of this embodiment, refer to the description of layout 1; they are not repeated here.
Fig. 23 shows layout 9 in an embodiment of the present invention. The system includes four sites, PA, PB, PC, and PD, all configured identically; the four sites are controlled through an MCU.
Each site contains three large flat-panel displays as the main display devices, for example 65-inch or 70-inch flat-panel displays, which present high-definition images close to life size; flat-panel display technologies such as PDP TVs, LCD TVs, or DLP rear-projection TVs may be used. The three displays are arranged along a folded surface, with the middle display abutting the displays on either side, so that the images of the three displays together form a complete presentation of the conference-room scene. The display devices used in the system are displays with multiple viewing angles: as shown in the figure, each display has three viewing angles, each viewing angle can present different content, and each viewing angle corresponds to one participant at the site. A high-definition camera is mounted on top of each display, and participant images are captured from three angles; the camera supports high-definition image capture at 720p and 1080p resolution. An auxiliary display at the side of the conference table can show shared data and similar information.
Assume that in each conference room the conference tables are D1, D2, and D3 from left to right, the displays are T1, T2, and T3, and the three viewing angles of each display are V1, V2, and V3 from left to right. The three cameras, from left to right, are T1_C1, T2_C1, and T3_C1, and each of the cameras T1_C1, T2_C1, and T3_C1 can independently cover the area occupied by all of the conference tables D1, D2, and D3.
Viewing angle V1 of T1 corresponds to conference table D1, viewing angle V2 of T1 corresponds to conference table D2, and viewing angle V3 of T1 corresponds to conference table D3; viewing angle V1 of T2 corresponds to conference table D1, viewing angle V2 of T2 corresponds to conference table D2, and viewing angle V3 of T2 corresponds to conference table D3; viewing angle V1 of T3 corresponds to conference table D1, viewing angle V2 of T3 corresponds to conference table D2, and viewing angle V3 of T3 corresponds to conference table D3.
The system requires that every participant at each site be able to see the participants at all the other sites. During the conference, each site therefore uploads all of its video to the MCU, which splices the corresponding videos and then sends the result to the destination site. Assuming that site PA needs to view the content of all sites, the send/receive correspondence of the video streams under the eye-to-eye condition is:
PB T1 C1 + PC T1 C1 + PD T1 C1 => PA T3 V3
PB T1 C2 + PC T1 C2 + PD T1 C2 => PA T2 V3
PB T1 C3 + PC T1 C3 + PD T1 C3 => PA T1 V3
PB T2 C1 + PC T2 C1 + PD T2 C1 => PA T3 V2
PB T2 C2 + PC T2 C2 + PD T2 C2 => PA T2 V2
PB T2 C3 + PC T2 C3 + PD T2 C3 => PA T1 V2
PB T3 C1 + PC T3 C1 + PD T3 C1 => PA T3 V1
PB T3 C2 + PC T3 C2 + PD T3 C2 => PA T2 V1
PB T3 C3 + PC T3 C3 + PD T3 C3 => PA T1 V1
Here, PB T1 C1 + PC T1 C1 + PD T1 C1 indicates that the MCU splices the T1 C1 videos of sites PB, PC, and PD to form a new video stream.
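As an illustration of the splicing step, the following Python sketch joins one frame from each of three sites into a single wider frame. The text does not say how the MCU arranges the spliced pictures, so the side-by-side arrangement, the list-of-rows frame model, and the function name `splice_side_by_side` are all assumptions made for this example.

```python
def splice_side_by_side(frames):
    """Concatenate equally tall frames left to right into one frame.

    Each frame is modeled as a list of rows, and each row as a list of
    pixel values. This is a toy stand-in for the MCU splicing step; the
    side-by-side arrangement is an assumption, not taken from the text.
    """
    if not frames:
        raise ValueError("at least one frame is required")
    height = len(frames[0])
    if any(len(frame) != height for frame in frames):
        raise ValueError("all frames must have the same number of rows")
    # For every row index, join that row from each frame in order.
    return [sum((frame[r] for frame in frames), []) for r in range(height)]


if __name__ == "__main__":
    pb = [[1, 1], [1, 1]]  # 2x2 frame standing in for PB T1 C1
    pc = [[2, 2], [2, 2]]  # 2x2 frame standing in for PC T1 C1
    pd = [[3, 3], [3, 3]]  # 2x2 frame standing in for PD T1 C1
    for row in splice_side_by_side([pb, pc, pd]):
        print(row)
```

A real MCU would operate on decoded video frames and re-encode the result; the sketch only shows the composition of one output frame from several inputs.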
Another form of this embodiment is as follows: the three display devices at a site each display one of the three remote sites. Assuming that site PB is displayed on display T1, site PC on display T2, and site PD on display T3, the eye-to-eye relationship gives the following video stream correspondence:
PB T3 C1 => PA T1 V1
PB T2 C1 => PA T1 V2
PB T1 C1 => PA T1 V3
PC T3 C1 => PA T2 V1
PC T2 C1 => PA T2 V2
PC T1 C1 => PA T2 V3
PD T3 C1 => PA T3 V1
PD T2 C1 => PA T3 V2
PD T1 C1 => PA T3 V3
For other details of this embodiment, refer to the description of layout 1; they are not repeated here.
Fig. 24 shows layout 10 in an embodiment of the present invention. The system includes three sites, PA, PB, and PC, whose configurations differ; the three sites are controlled through an MCU.
Site A contains three large flat-panel displays as the main display devices, for example 65-inch or 70-inch flat-panel displays, which present high-definition images close to life size; flat-panel display technologies such as PDP TVs, LCD TVs, or DLP rear-projection TVs may be used. The three displays are arranged along a folded surface, with the middle display abutting the displays on either side, so that the images of the three displays together form a complete presentation of the conference-room scene. The display devices used in the system are displays with multiple viewing angles: as shown in the figure, each display has three viewing angles, each viewing angle can present different content, and each viewing angle corresponds to one participant at the site. A high-definition camera is mounted on top of each display, and participant images are captured from three angles; the camera supports high-definition image capture at 720p and 1080p resolution. An auxiliary display at the side of the conference table can show shared data and similar information. For the specific device placement and coverage relationships of site A, refer to layout 1.
Site B contains one large multi-view flat-panel display as its display device, for example a 65-inch or 70-inch flat-panel display, which presents high-definition images close to life size; flat-panel display technologies such as PDP TVs, LCD TVs, or DLP rear-projection TVs may be used. The display device has one viewing angle and can display different content. The site has one participant and is configured as shown in the figure: three high-definition cameras are mounted in a converging arrangement at the two sides and the middle above the display, and each camera captures the participant from a different angle. An auxiliary display at the side of the conference table can show shared data and similar information. For the specific device placement and coverage relationships of site B, refer to layout 3.
Site C uses two large multi-view flat-panel displays as its display devices, for example 65-inch or 70-inch flat-panel displays, which present high-definition images close to life size; flat-panel display technologies such as PDP TVs, LCD TVs, or DLP rear-projection TVs may be used. The display devices have multiple viewing angles and can display different content. Each site has two participant areas, configured as shown in the figure, and each participant in a participant area can see exactly the content of one viewing angle of a multi-view display device. Three high-definition cameras are mounted in a converging arrangement at the two sides and the middle above each display, and each camera can capture all participants from a different angle. An auxiliary display at the side of the conference table can show shared data and similar information. For the specific device placement and coverage relationships of site C, refer to layout 2.
Camera C1 above display T1 captures participant area D1;
Camera C2 above display T1 captures participant area D1;
Camera C3 above display T1 captures participant area D2;
Camera C1 above display T2 captures participant area D1;
Camera C2 above display T2 captures participant area D2;
Camera C3 above display T2 captures participant area D2;
Participant area D1 can see viewing angle V2 of display T1;
Participant area D1 can see viewing angle V1 of display T2;
Participant area D2 can see viewing angle V3 of display T1;
Participant area D2 can see viewing angle V2 of display T2;
For site A, its three screens display the content of sites B and C: screen T1 displays site B, T2 displays D1 of site C, and T3 displays D2 of site C;
Site B can selectively view the content of one viewing angle of one participant at one of sites A and C; assume that it displays D2 of site A;
Site C can choose to view the content of sites A and B; for example, T1 displays the content of site B, and T2 displays the content of participant D2 of site A;
Each site sends all of its video streams to the MCU, and the MCU matches the send/receive correspondence of the video streams. To achieve the eye-to-eye effect, the correspondence is as follows:
Site A:
B T1 C3 => A T1 V1
B T1 C2 => A T1 V2
B T1 C1 => A T1 V3
C T2 C1 => A T2 V1
C T1 C2 => A T2 V2
C T1 C1 => A T2 V3
C T2 C3 => A T3 V1
C T2 C2 => A T3 V2
C T1 C3 => A T3 V3
Site B:
A T1 C2 => B T1 V1
A T2 C2 => B T1 V1
A T3 C2 => B T1 V1
Site C:
B T1 C2 => C T1 V2
B T1 C1 => C T1 V3
A T2 C2 => C T2 V1
A T3 C2 => C T2 V2
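Because the three sites of layout 10 are configured differently, the correspondence is easier to handle as a lookup table than as an index rule. The following Python sketch parses the correspondences listed above (copied here as data) into a table keyed by destination; the names `LAYOUT10_LINES` and `parse_routes` are hypothetical, and how an MCU would actually store such a table is not described in the text.

```python
from collections import defaultdict

# The correspondences of layout 10, copied from the lists for sites A, B, and C.
LAYOUT10_LINES = """
B T1 C3 => A T1 V1
B T1 C2 => A T1 V2
B T1 C1 => A T1 V3
C T2 C1 => A T2 V1
C T1 C2 => A T2 V2
C T1 C1 => A T2 V3
C T2 C3 => A T3 V1
C T2 C2 => A T3 V2
C T1 C3 => A T3 V3
A T1 C2 => B T1 V1
A T2 C2 => B T1 V1
A T3 C2 => B T1 V1
B T1 C2 => C T1 V2
B T1 C1 => C T1 V3
A T2 C2 => C T2 V1
A T3 C2 => C T2 V2
"""


def parse_routes(text):
    """Parse 'SITE Tn Cn => SITE Tn Vn' lines into {destination: [sources]}."""
    routes = defaultdict(list)
    for line in text.strip().splitlines():
        source, destination = (part.strip() for part in line.split("=>"))
        routes[tuple(destination.split())].append(tuple(source.split()))
    return dict(routes)


if __name__ == "__main__":
    routes = parse_routes(LAYOUT10_LINES)
    for destination, sources in routes.items():
        print(" ".join(destination), "<=", [" ".join(s) for s in sources])
```

In this table, viewing angle V1 of display T1 at site B has three candidate sources, which matches the statement that site B selects one viewing angle of one participant to watch.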
For other details of this embodiment, refer to the description of layout 1; they are not repeated here.
In the embodiments of the present invention, multi-view local video information is acquired and sent to the remote end for display, while the local end displays multi-view remote video information from the remote end; as long as the viewing angles are properly matched at display time, an eye-to-eye conference effect can be achieved.
A person of ordinary skill in the art will understand that all or part of the processes of the methods in the foregoing embodiments may be implemented by a computer program instructing relevant hardware. The program may be stored in a computer-readable storage medium and, when executed, may include the processes of the embodiments of the methods described above. The storage medium may be a magnetic disk, an optical disc, a read-only memory (ROM), a random access memory (RAM), or the like.
The foregoing discloses only a preferred embodiment of the present invention and certainly cannot be used to limit the scope of rights of the present invention; equivalent changes made in accordance with the claims of the present invention therefore still fall within the scope covered by the present invention.

Claims

1. A video processing method for use in a remote conference system, characterized in that
the video sending part of the method comprises:
acquiring local video information having different viewing angles, wherein the number of viewing angles of the local video information is not less than the number of remote observation points;
sending the local video information having different viewing angles to a remote end;
the video display part of the method comprises:
receiving remote video information having different viewing angles, wherein the number of viewing angles of the remote video information is not less than the number of local observation points;
displaying, by using a multi-view display device, the remote video information of the corresponding viewing angle to different local observation points, so as to achieve an eye-to-eye display effect, wherein the number of display viewing angles of the multi-view display device is not less than the number of local observation points;
wherein the number of remote observation points and the number of local observation points are both natural numbers, and at least one of the number of remote observation points and the number of local observation points is not less than 2.
2. The method according to claim 1, characterized in that the displaying, by using a multi-view display device, the remote video information of the corresponding viewing angle to different local observation points comprises:
displaying, by using a plurality of multi-view display devices, the remote video information of the corresponding viewing angle to different local observation points.
3. The method according to claim 2, characterized in that the acquiring local video information having different viewing angles comprises:
acquiring the local video information at the multi-view display devices by using camera devices having different camera viewing angles.
4. The method according to any one of claims 1 to 3, characterized in that the system comprises a plurality of remote ends, and the receiving remote video information having different viewing angles comprises:
selecting one remote end from the plurality of remote ends, and receiving the remote video information having different viewing angles sent by the selected remote end.
5. A video processing apparatus for use in a remote conference system, characterized in that the sending module of the apparatus comprises:
a local video acquiring unit, configured to acquire local video information having different viewing angles, wherein the number of viewing angles of the local video information is not less than the number of remote observation points;
a local video sending unit, configured to send the local video information having different viewing angles to a remote end;
the display module of the apparatus comprises:
a remote video receiving unit, configured to receive remote video information having different viewing angles, wherein the number of viewing angles of the remote video information is not less than the number of local observation points;
a remote video display unit, configured to display, by using a multi-view display device, the remote video information of the corresponding viewing angle to different local observation points, so as to achieve an eye-to-eye display effect, wherein the number of display viewing angles of the multi-view display device is not less than the number of local observation points;
wherein the number of remote observation points and the number of local observation points are both natural numbers, and at least one of the number of remote observation points and the number of local observation points is not less than 2.
6. The apparatus according to claim 5, characterized in that the remote video display unit is further configured to display, by using a plurality of multi-view display devices, the remote video information of the corresponding viewing angle to different local observation points.
7. The apparatus according to claim 6, characterized in that the local video acquiring unit is further configured to acquire the local video information at the multi-view display devices by using camera devices having different camera viewing angles.
8. The apparatus according to any one of claims 5 to 7, characterized in that the system comprises a plurality of remote ends, and the remote video receiving unit is further configured to select one remote end from the plurality of remote ends and receive the remote video information having different viewing angles sent by the selected remote end.
9. A remote conference system, characterized in that the local end of the system comprises:
a plurality of camera devices having different camera viewing angles, configured to respectively acquire local video information having different viewing angles, wherein the number of viewing angles of the local video information is not less than the number of remote observation points;
a communication device, configured to send to a remote end the local video information having different viewing angles obtained by the camera devices, and to receive remote video information having different viewing angles from the remote end, wherein the number of viewing angles of the remote video information is not less than the number of local observation points;
a multi-view display device, configured to display the remote video information of the corresponding viewing angle to different local observation points, so as to achieve an eye-to-eye display effect, wherein the number of display viewing angles of the multi-view display device is not less than the number of local observation points;
wherein the number of remote observation points and the number of local observation points are both natural numbers, and at least one of the number of remote observation points and the number of local observation points is not less than 2.
10. The system according to claim 9, characterized in that the number of multi-view display devices in the system is not less than the number of remote observation points.
11. The system according to claim 9, characterized in that the multi-view display device is a multi-view display, or the multi-view display device is a combination of a plurality of projectors and a projection screen having a multi-view display function.
12. The system according to claims 9 to 11, characterized in that the system further comprises a plurality of remote ends, and the communication device of the local end is further configured to select one remote end from the plurality of remote ends for receiving and sending multi-view video information.
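The count conditions recited in claims 1, 5, and 9 can be checked mechanically. The following Python sketch lists which of those conditions a given configuration fails. The class `EndpointConfig`, its field names, and the function `violated_conditions` are hypothetical, and "natural number" is assumed here to mean an integer of at least 1.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class EndpointConfig:
    """Counts for one endpoint; all field names are illustrative."""
    local_video_views: int          # viewing angles of the local video information
    remote_video_views: int         # viewing angles of the received remote video
    display_views: int              # display viewing angles of the multi-view display
    local_observation_points: int
    remote_observation_points: int


def violated_conditions(cfg: EndpointConfig) -> list[str]:
    """Return a description of every count condition that cfg fails."""
    problems = []
    # Assumption: "natural number" is taken to mean an integer >= 1.
    if cfg.local_observation_points < 1 or cfg.remote_observation_points < 1:
        problems.append("observation point counts must be natural numbers")
    if cfg.local_video_views < cfg.remote_observation_points:
        problems.append("local video views < remote observation points")
    if cfg.remote_video_views < cfg.local_observation_points:
        problems.append("remote video views < local observation points")
    if cfg.display_views < cfg.local_observation_points:
        problems.append("display views < local observation points")
    if max(cfg.local_observation_points, cfg.remote_observation_points) < 2:
        problems.append("at least one observation point count must be >= 2")
    return problems


if __name__ == "__main__":
    three_by_three = EndpointConfig(3, 3, 3, 3, 3)
    print(violated_conditions(three_by_three))
```

An empty list means that all of the count conditions hold for the configuration; the check says nothing about whether the viewing angles are geometrically matched.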
PCT/CN2012/083637 2011-10-28 2012-10-27 Method and system for video processing WO2013060295A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN201110335141.5 2011-10-28
CN201110335141.5A CN103096015B (en) 2011-10-28 2011-10-28 Video processing method and video processing system

Publications (1)

Publication Number Publication Date
WO2013060295A1 true WO2013060295A1 (en) 2013-05-02

Family

ID=48167129

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2012/083637 WO2013060295A1 (en) 2011-10-28 2012-10-27 Method and system for video processing

Country Status (2)

Country Link
CN (1) CN103096015B (en)
WO (1) WO2013060295A1 (en)

Families Citing this family (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103310233B (en) * 2013-06-28 2016-03-23 青岛科技大学 With similarity method for digging between class behavior multi views and Activity recognition method
CN104639518B (en) * 2013-11-14 2018-12-21 中兴通讯股份有限公司 The method, apparatus of session establishment and the delivering method of session content and device
CN106488170B (en) * 2015-08-28 2020-01-10 华为技术有限公司 Method and system for video communication

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2009055094A (en) * 2007-08-23 2009-03-12 Sharp Corp Video system
US20090146915A1 (en) * 2007-12-05 2009-06-11 Marathe Madhav V Multiple view display device
CN101668160A (en) * 2009-09-10 2010-03-10 深圳华为通信技术有限公司 Video image data processing method, device, video conference system and terminal
CN102047657A (en) * 2008-05-30 2011-05-04 坦德伯格电信公司 Method for displaying an image on a display

Also Published As

Publication number Publication date
CN103096015B (en) 2015-03-11
CN103096015A (en) 2013-05-08

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 12844206

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 12844206

Country of ref document: EP

Kind code of ref document: A1