WO2013060295A1 - Procédé et système de traitement de vidéo - Google Patents
Procédé et système de traitement de vidéo Download PDFInfo
- Publication number
- WO2013060295A1 WO2013060295A1 PCT/CN2012/083637 CN2012083637W WO2013060295A1 WO 2013060295 A1 WO2013060295 A1 WO 2013060295A1 CN 2012083637 W CN2012083637 W CN 2012083637W WO 2013060295 A1 WO2013060295 A1 WO 2013060295A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- remote
- local
- display
- video information
- observation points
- Prior art date
Links
Classifications
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N7/00—Television systems
- H04N7/14—Systems for two-way working
- H04N7/15—Conference systems
Definitions
- the present invention relates to telepresence technology, and more particularly to a video processing method and system.
- the telepresence technology can be used in a video conferencing system, in which both parties in different geographical locations are included, and both parties need to realize the effect similar to the conference in the same place through video communication.
- video communication on the one hand, local video information needs to be collected, and on the other hand, video information of the far end needs to be displayed. Due to the difference between the position of the camera and the remote image, when the participant looks at the far-end image and can't see the camera, it can't produce the effect of "visually witnessing" the conversation.
- a technical problem to be solved by embodiments of the present invention is to provide a video processing method and system. Eye-to-eye effects can be achieved in a teleconferencing system.
- the embodiment of the present invention provides a video processing method, which is used in a remote conference system, and the video sending part of the method includes:
- the video display portion of the method includes:
- the multi-view display device displays the far-end video information of the corresponding view to different local observation points to achieve an eye-to-eye display effect, and the display view angle of the multi-view display device is not less than the number of the local observation points;
- the number of the remote observation points and the number of local observation points are all natural numbers, and at least one of the number of the remote observation points and the number of local observation points is not less than 2.
- the embodiment of the present invention further provides a video processing device, which is used in a remote conference system, where the sending module of the device includes:
- a local video obtaining unit configured to acquire local video information with different viewing angles, where the number of viewing angles of the local video information is not less than the number of remote viewing points;
- a local video sending unit configured to send the local video information with different viewing angles to the remote end
- the display module of the device includes:
- the remote video receiving unit is configured to receive remote video information with different viewing angles, where the number of viewing angles of the remote video information is not less than the number of local viewing points;
- a remote video display unit configured to display, by using a multi-view display device, remote video information of a corresponding perspective to different local observation points, to achieve an eye-to-eye display effect, where the display angle of the multi-view display device is not less than Number of local observation points;
- the number of the remote observation points and the number of local observation points are all natural numbers, and at least one of the number of the remote observation points and the number of local observation points is not less than 2.
- the embodiment of the present invention further provides a remote conference system, where the local end of the system includes:
- a plurality of image capturing devices having different camera viewing angles for respectively acquiring local video information having different viewing angles, wherein the number of viewing angles of the local video information is not less than the number of remote viewing points;
- a communication device configured to send, to the remote end, the local video information with different views obtained by the camera device, and receive remote video information with different perspectives from the remote end, the perspective of the remote video information
- the number is not less than the number of local observation points
- a multi-view display device configured to display remote video information of a corresponding view to different local observation points, to achieve an eye-to-eye display effect, where the display angle of view of the multi-view display device is not less than the number of the local observation points;
- the number of the remote observation points and the number of local observation points are all natural numbers, and at least one of the number of the remote observation points and the number of local observation points is not less than 2.
- the multi-view local video information is obtained and sent to the remote display, and the multi-view remote video information from the far end is displayed on the local end, and the eye-to-eye can be realized as long as the viewing angle is properly matched during display. Meeting effect.
- 1 is a layout diagram of one end of an existing remote conference system
- FIG. 2 is a specific flowchart of a transmitting part in a video processing method according to an embodiment of the present invention
- FIG. 3 is a specific flowchart of a display part in a video processing method according to an embodiment of the present invention
- FIG. 4 is an implementation of the present invention
- a specific composition diagram of the video processing device in the example
- FIG. 5 is a specific composition diagram of a remote conference system in an embodiment of the present invention.
- FIG. 6 is a schematic diagram of a layout 1 according to an embodiment of the present invention.
- FIG. 7 is a schematic diagram of a layout 1 according to an embodiment of the present invention.
- FIG. 8 is a schematic diagram of a layout 1 according to an embodiment of the present invention.
- FIG. 9 is a schematic diagram of a layout 1 including an auxiliary stream display according to an embodiment of the present invention.
- FIG. 10 is a schematic diagram of a layout 1 including a secondary stream display according to an embodiment of the present invention.
- FIG. 11 is a schematic diagram of a layout 1 according to an embodiment of the present invention.
- FIG. 12 is a schematic view showing a display angle of a multi-view display according to an embodiment of the present invention.
- FIG. 13 is a schematic diagram showing a display principle of a multi-view display according to an embodiment of the present invention.
- FIG. 14 is a schematic diagram of a layout la according to an embodiment of the present invention.
- Figure 15 is a schematic diagram of a layout lb according to an embodiment of the present invention.
- 16 is a schematic diagram of a layout 2 according to an embodiment of the present invention.
- FIG. 17 is a schematic diagram of a layout 3 according to an embodiment of the present invention.
- FIG. 18 is a schematic diagram of a layout 4 according to an embodiment of the present invention.
- Figure 19 is a schematic diagram of a layout 5 according to an embodiment of the present invention.
- 20 is a schematic diagram of a layout 6 according to an embodiment of the present invention
- 21 is a schematic diagram of a layout 7 according to an embodiment of the present invention
- Figure 22 is a schematic diagram of a layout 8 in accordance with an embodiment of the present invention.
- FIG. 23 is a schematic diagram of a layout 9 according to an embodiment of the present invention.
- Figure 24 is a schematic illustration of a layout 10 in accordance with an embodiment of the present invention.
- the basis for witnessing the eye effect is that both parties can observe different perspectives. When one turns to a certain perspective, only the observer at that perspective can feel the effect of "frontal". Based on the principle, in the embodiment of the present invention, the number of possible viewing angles of both parties is fully considered, and a corresponding number of multi-view video information is obtained according to the number of observers to achieve a realistic eye-to-eye effect.
- FIG. 2 and FIG. 3 it is a specific flowchart of a video processing method in an embodiment of the present invention, and the method can be used in a remote conference system.
- a part of the video transmission process of the method is: 201. Acquiring local video information with different viewing angles, where the number of viewing angles of the local video information is not less than the number of remote viewing points; 202, the different viewing angles are The local video information is sent to the far end.
- the local video information can be acquired by the imaging device having different camera viewing angles at the multi-view display device described below.
- the video display part of the method is: 301: receiving remote video information with different viewing angles, where the number of viewing angles of the remote video information is not less than the number of local viewing points; 302, using multi-view display
- the device displays the remote video information of the corresponding perspective to different local observation points to achieve an eye-to-eye display effect, and the display angle of view of the multi-view display device is not less than the number of the local observation points.
- the number of the remote observation points and the number of local observation points are all natural numbers, and at least one of the number of the remote observation points and the number of local observation points is not less than 2.
- the remote conference system in the foregoing embodiment may include multiple remote ends, and step 301 may be: selecting one of the plurality of remote ends, and receiving the different perspectives sent from the selected remote end.
- Remote video information On the other hand, a plurality of multi-view display devices may be selected for display, that is, in step 302, a plurality of multi-view display devices are used to display remote video information of corresponding views to different local observation points.
- the remote or local observation point may refer to the location of the participant when attending the conference, or may refer to the location group of the participant (ie, there may be two or more participants in the conference when they participate in the conference. An observation point, without distinction).
- the embodiment of the present invention further provides a video processing device 1 for use in a remote conference system.
- the sending module 10 of the device 1 includes: a local video acquiring unit 100. And for acquiring local video information having different viewing angles, where the number of viewing angles of the local video information is not less than the number of remote viewing points; the local video sending unit 102 is configured to send the local video information with different viewing angles to the remote end. .
- the local video obtaining unit 100 is further configured to acquire local video information by using an imaging device having different camera viewing angles at the multi-view display device.
- the display module 12 of the device 1 includes: a remote video receiving unit 120, configured to receive remote video information having different viewing angles, where the number of viewing angles of the remote video information is not less than the number of local viewing points; the remote video display unit 122,
- the multi-view display device is configured to display the remote video information of the corresponding view to different local observation points, so as to achieve an eye-to-eye display effect, the display view angle of the multi-view display device is not less than the number of the local observation points;
- the number of the remote observation points and the number of local observation points are both natural numbers, and at least one of the number of the remote observation points and the number of local observation points is not less than 2.
- the remote video display unit 120 can also be used to display remote video information of a corresponding perspective to different local viewing points by using multiple multi-view display devices. If the system includes multiple remote ends, the remote video receiving unit 120 is further configured to select one of the plurality of remote ends, and receive remote video information with different perspectives sent from the selected remote end.
- the embodiment of the present invention further provides a remote conference system, in which the physical device having the above functions is implemented to implement the entire system, of course, only the connection relationship is shown in the figure, Represents the positional relationship in the actual system.
- the local end of the system includes: a plurality of camera devices 2 having different camera viewing angles, respectively, for acquiring local video information having different viewing angles, where the number of viewing angles of the local video information is not less than the number of remote viewing points
- the communication device 3 is configured to send the local video information with different perspectives obtained by the camera device to the remote end, and receive remote video information with different perspectives from the remote end, the remote video information.
- the number of the viewing angles is not less than the number of the local viewing points; the multi-view display device 4 is configured to display the far-end video information of the corresponding viewing angles to different local viewing points to achieve an eye-to-eye display effect, and the display viewing angle of the multi-view display device
- the number of the local observation points and the number of local observation points are both natural numbers, and at least one of the number of the remote observation points and the number of local observation points is not less than 2.
- multiple multi-view display devices can be set at the local end, so that the number of multi-view display devices in the system is not less than the number of remote view points. To achieve a better eye-to-eye effect.
- the multi-view display device may be a multi-view display, or the multi-view display device may be a combination of a plurality of projectors and a projection screen having a multi-view display function.
- the local communication device is further configured to select one of the plurality of remote terminals for receiving and transmitting multi-view video information.
- a layout 1 of an embodiment of the present invention is shown.
- the layout 1 an example of a specific positional relationship of each component device in the system shown in FIG. 5 is displayed, and a corresponding perspective of each device is shown. Wait.
- the overall situation of the local end and the far end in the system is shown in Fig. 6 to Fig. 10.
- Fig. 11 shows the specific layout of one end in the system (the two ends are symmetrically distributed).
- the system includes two sites A and B.
- the site AB is directly connected through the network.
- Each site contains three large-size flat panel displays as the main display device, such as a 65-inch or 70-inch flat panel display, which is similar to the size of a real person.
- High-definition screens can be used with flat panel display technology such as PDP TV, LCD TV or DLP rear projection TV.
- the three displays are placed in a folded plane, the middle display and the two sides of the display are close together, the images of the three displays form a complete presentation of the conference room scene, and the auxiliary display on the side of the conference table can display the shared Data and other information.
- the display device used in the system is a display with multiple viewing angles. As shown in FIG. 11, each display has three viewing angles, and each viewing angle can present different contents, and the viewing angle of each display is as shown in FIG.
- the display shown in FIG. 12 has the following features: different contents can be displayed at different viewing angles; as shown in the above figure, it is assumed that the object to be presented is identified by a broken line, having three surfaces, and the multi-view display can be in three different The face 1 content, the face 2 content and the face 3 content are respectively presented in the perspective, if the object is actually placed at the display position.
- the display device is implemented using the parallax barrier principle, as shown in Fig. 13, and the image content of the viewing angle is observed from different viewing angles.
- the conference table of each conference room is Dl, D2, D3 from left to right, and the display is Tl, ⁇ 2, ⁇ 3.
- the three cameras on each display are Cl, C2, C3 from left to right, each The three viewing angles of the display are VI, V2, and V3 from left to right;
- the camera C1 at the top of the left display covers the conference table D1, the camera C2 covers the conference table D2, and the camera C3 covers the conference table D3; the camera C1 at the top of the middle display covers the conference table D1, camera C2
- the shooting range covers the conference table D2, the camera C3 shooting range covers the conference table D3; the camera C1 shooting range at the top of the right display screen covers the conference table D1, the camera C2 shooting range covers the conference table D2, and the camera C3 shooting range covers the conference table D.
- the viewing angle VI of T1 corresponds to the conference table D1
- the viewing angle V2 of T1 corresponds to the conference table D2
- the viewing angle V3 of T1 corresponds to the conference table D3
- the viewing angle VI of T2 corresponds to the conference table D1
- the viewing angle V2 of T2 corresponds to the conference table D2
- the viewing angle V3 of T2 corresponds to the conference table D3
- the viewing angle VI of T3 corresponds to the conference table D1
- the angle of view V2 of T3 corresponds to the conference table D2
- the angle of view V3 of T3 corresponds to the conference table D3.
- the two sites shown in FIG. 6 are the site A and the site B respectively. If the video stream of the camera of the site A is sent to the site B, for the middle seat area D2, There are the following transmission and reception correspondences of video streams:
- a Tl C2 > B T2 V3
- a T2 C2 > B T2 V2
- a T3 C2 > B T2 VI
- a T2 CI > B T3 V2
- a Tl C3 > B Tl V3
- a T2 C3 > B Tl V2
- the foregoing video transmission and reception correspondence may be implemented in two ways: in the description of the display device of the site and the orientation information of the participant, the following provisions are made to all the participating regions.
- the middle position is centered, the leftmost display device facing the middle position of all the main display devices is the 0th display device area, the second left display device is the first display device, and so on;
- the leftmost conference area is the 0th (camera) coverage area, the second left conference area is the 1st (camera) coverage area, and so on.
- a secondary display T4 may be present at the venues A and B to display the secondary stream video.
- the transmission and reception correspondence of the video stream includes two ways: One is consistent with the manner described in Figures 6-8, as shown in Figure 9.
- the other mode is the mirroring mode.
- this mode is negotiated before the videoconferencing of the two parties in the system.
- the content of the negotiation includes the above-mentioned video transmission and reception correspondence information. That is, there are the following processes.
- sending a video stream Tl C3 can be described as:
- the sender sends all the video streams of this method to the other party, and the other party receives the video stream.
- the video is displayed on the specific display according to the above correspondence, which requires the other party to recognize the position of the video in the received video, so
- the user data section is added with the following structure:
- Auxiliary flow tag, 1 identifies the auxiliary stream, 0 means non-auxiliary stream (ie mainstream)
- the position of the sender of the video stream (the position of the display), in the mainstream case, all the bits identify the position; in the case of the auxiliary stream, the highest 2 bits indicate the vertical position of the auxiliary stream, 11 indicates above the main display device, and 10 indicates The main display device is in the same horizontal position, 00 is below the main display device; [1 zone i or all zone i or]
- the coverage area of the video stream the highest bit indicates whether to cover all areas, 1 means to cover all areas, then the last 8 bits are meaningless; 0 means only cover a certain area, followed by 0-7 bits to cover Area i or orientation;
- Recver— pos 8bits video receiver location for capability negotiation
- Displayer_pos The position of the 8bits video display for capability negotiation.
- the layout la in the embodiment of the present invention similar to the layout 1 is shown.
- the settings of the camera and the conference table are the same as those of the layout 1.
- the display device uses the projection method, and nine high-resolution and high-brightness projectors are placed at nine different positions behind the projection screen.
- cylindrical gratings are arranged on the projection screen to allow different content to be seen at different angles.
- the layout lb in the embodiment of the present invention similar to the layout 1 is shown.
- the settings of the camera and the conference table are the same as those of the layout 1.
- the display device uses the projection method, and nine high-resolution and high-brightness projectors are placed at nine different positions in front of the projection screen.
- cylindrical gratings are arranged on the projection screen to allow different content to be seen at different angles.
- FIG. 16 it is a layout 2 in the embodiment of the present invention.
- the system consists of two sites in AB that are directly connected through the network.
- the configuration of the AB site is the same.
- Each site uses two large-size multi-view flat panel displays as display devices, such as 65-inch or 70-inch flat-panel displays. It can display high-definition images close to life-size, and can use flat panel display technology such as PDP TV, LCD TV or DLP rear projection TV.
- the display device has multiple viewing angles and can display different content.
- Each participant has three participant areas, which are configured as shown in the above figure. Each participant in the participant area can watch the multi-view display device.
- the content of one view; two HD cameras are arranged in the convergence mode on both sides and the middle of the display, each camera can capture all the participants at different angles; the auxiliary display on the side of the conference table can display the shared data And other information.
- the C1 camera on the display at the T1 position captures the conference area Dl
- the C2 camera captures the conference area D2
- the C1 camera on the display at the T2 position captures the conference area D1, C2 the camera captures the conference area D2;
- the meeting tends to D1 to see the viewing angle VI of the display T1; the meeting tends to D1 can be viewed
- a T2 CI > B Tl V2
- FIG. 17 it is a layout 3 in the embodiment of the present invention.
- the system consists of two sites in AB that are directly connected through the network.
- the configuration of the AB site is the same.
- Each site uses a large-size multi-view flat panel display as a display device, such as a 65-inch or 70-inch flat panel display.
- Real-life high-definition screens can be used with flat panel display technology such as PDP TV, LCD TV or DLP rear projection TV.
- the display device has three viewing angles and can display different contents.
- Each participant has three participants, which are configured as shown in the above figure, and each participant can just watch the content of one perspective of the multi-view display device; Three HD cameras are set up on the sides and in the middle of the display. Each camera can capture all participants at different angles.
- the auxiliary display on the side of the conference table can display information such as shared data.
- the three viewing angles of the multi-view display device are VI, V2, and V3 from the right to the right.
- C1C2C3 captures all participants Pl, P2, P3 from three angles;
- the three participants are distributed on three perspectives of the multi-view display.
- the view VI corresponds to P1
- the view V2 corresponds to P2
- the view V3 corresponds to P3.
- the corresponding relationship between the video stream transmission and reception of the two sites is:
- FIG. 18 it is a layout 4 in the embodiment of the present invention.
- the system consists of two sites in AB that are directly connected through the network.
- the configuration of the AB site is the same.
- Each site uses a large-size multi-view flat panel display as a display device, such as a 65-inch or 70-inch flat panel display.
- Real-life high-definition screens can be used with flat panel display technology such as PDP TV, LCD TV or DLP rear projection TV.
- the display device has two viewing angles and can display different contents.
- Each participant has two participants, which are configured as shown in the above figure, and each participant can just watch the content of one perspective of the multi-view display device;
- Two HD cameras are arranged in the convergence mode on both sides and in the middle of the display. Each camera can capture all participants at different angles; the auxiliary display on the side of the conference table can display information such as shared data.
- C1C2 captures all participants Pl, P2 from two angles
- the two participants are distributed on two viewing angles of the multi-view display.
- the viewing angle VI corresponds to P1 and the viewing angle V2 corresponds to P2.
- FIG. 19 it is a layout 5 in the embodiment of the present invention.
- the system consists of two sites in AB that are directly connected through the network.
- a site contains a large-size multi-view flat panel display as a display device, such as a 65-inch or 70-inch flat panel display, which can be used to render high-definition images close to life-size.
- the display device is a conventional display device with only one viewing angle; three high-definition cameras are arranged at the top and the top of the display, and placed according to the convergence mode, Participants Pl are shot at 3 different angles.
- the B site consists of a large-size multi-view flat panel display as a display device, such as a 65-inch or 70-inch flat panel display, for displaying high-definition images close to life-size, and can use PDP TV, LCD TV or DLP rear projection TV.
- a display device such as a 65-inch or 70-inch flat panel display, for displaying high-definition images close to life-size, and can use PDP TV, LCD TV or DLP rear projection TV.
- Flat panel display technology The display device has three viewing angles, and can display different contents; each participant has three participants, and is configured as shown in the above figure, each participant can just watch the content of one perspective of the multi-view display device;
- An HD camera is placed in the middle of the top of the display device to support high-definition image collection of 720p and 1080p resolutions. This camera can cover all participants in the venue.
- a secondary display located on the side of the conference table displays information such as shared data.
- the video stream transmission and reception relationship between the two sites is:
- FIG. 20 it is a layout 6 in the embodiment of the present invention.
- the system consists of two sites in AB that are directly connected through the network.
- a site consists of three large-size multi-view flat panel displays as display devices, such as 65-inch or 70-inch flat-panel displays, which can be used to render high-definition images close to life-size.
- the display device is a conventional display device with only one viewing angle; three HD cameras are set on the top of the display, and all the participants can be photographed from three different angles.
- the B site includes a large-size multi-view flat panel display as a display device, such as a 65-inch or 70-inch flat panel display for displaying high-definition images close to life-size, and can use PDP TV, LCD TV or DLP rear projection TV.
- Flat panel display technology The display device has three viewing angles.
- the venue has three participants, which are configured as shown in the figure above. The participant can just watch the content of one view of the multi-view display device; place 3 HD cameras in the middle and both ends of the display device to collect the participant images from 3 angles, the camera can support 720p and HD image collection with 1080p resolution.
- a secondary display located on the side of the conference table displays information such as shared data.
- Participants in Site A can choose to display the video stream of Site B on different display devices in the site. Normally, the video stream is displayed on the T2 display.
- the layout 7 in the embodiment of the present invention consists of three conference sites PA, PB, and PC. Each site is configured in the same way.
- a large-size multi-view flat panel display is used as a display device, such as a 65-inch or 70-inch flat panel display, for rendering close to the size of a real person.
- the high-definition screen can use flat panel display technology such as PDP TV, LCD TV or DLP rear projection TV.
- the display device has 3 viewing angles and can display different contents.
- Each venue has 3 participants, according to the above figure.
- each participant can watch the content of one view of the multi-view display device; set up three HD cameras in the convergence mode on both sides and the middle of the display, each camera can shoot all at different angles Participants; the secondary display located on the side of the conference table displays information such as shared data.
- the three sites PA, PB, and PC are connected through the MCU.
- the media capability negotiation is performed before the conference starts.
- Each site uploads all the videos of the site to the MCU.
- each site can choose to view the remote site.
- Conference site PA you can choose to watch the site PB or the site PC.
- Contents assuming that the content of the site PC is viewed at a certain time, the MCU needs to send the three video streams of the site PC to the PA. In the case of eye-to-eye, the video stream transmission and reception of the two sites correspond to each other.
- MCU forwarding is:
- the PC For the site PB, the PC has a similar transmission and reception correspondence.
- the PC Cl + PB Cl identifies the MCU to combine the Cl video of the PC site and the Cl video of the PB site to form a new video stream.
- the layout 8 in the embodiment of the present invention consists of four sites PA, PB, PC, and PD.
- the configuration of each site is the same.
- the four sites are controlled by the MCU.
- Each venue contains 3 large-size flat panel displays as the main display device, such as a 65-inch or 70-inch flat panel display for high-definition images that are close to life-size, and can be used with PDP TVs, LCD TVs or DLP rear-projection TVs.
- Flat panel display technology The three displays are placed in a folded plane, the middle display and the two sides of the display are close together, and the images of the three displays form a complete representation of the conference room scene.
- the display device used in the system is a display with multiple viewing angles. Each display in the above figure has 3 viewing angles, each viewing angle can present different content, and each viewing angle corresponds to one participant of the venue; in each display
- the top is equipped with a high-definition camera that collects the participants' images from three angles.
- the camera can support 720p and 1080p resolution HD image collection; the auxiliary display on the side of the conference table can display shared data and other information.
- Interest 3 large-size flat panel displays as the main display device, such as a 65
- the conference table of each conference room is Dl, D2, D3 from left to right
- the display is Tl, ⁇ 2, ⁇ 3, and the three viewing angles of each display are VI, V2, V3 from left to right
- the cameras are Tl-Cl, T2—Cl, T3—CI, respectively, and the cameras Tl—Cl, T2—Cl, and T3 C1 can independently cover the areas where all conference tables D1, D2, and D3 are located;
- the viewing angle VI of T1 corresponds to the conference table D1
- the viewing angle V2 of T1 corresponds to the conference table D2
- the viewing angle V3 of T1 corresponds to the conference table D3
- the viewing angle VI of T2 corresponds to the conference table D1
- the viewing angle V2 of T2 corresponds to the conference table D2
- the viewing angle V3 of T2 corresponds to the conference table D3
- the viewing angle VI of T3 corresponds to the conference table D1
- the viewing angle V2 of T3 corresponds to the conference table D2
- the viewing angle V3 of T3 corresponds to the conference table D3.
- each participant in each site can view the participants of all the sites.
- all the videos of the site are uploaded to the MCU, and the corresponding video is stitched in the MCU. Destination site; Now assume that the PA site needs to view the content of all sites, then in the case of eye-to-eye, the correspondence between the transmission and reception of the video stream is:
- the PB Tl C1+ PC Tl C1+ PD Tl CI identifies the MCU to splicing the CI video of the PB site and the CI video of the PB site to form a new video stream.
- the three display devices in the site respectively display the three sites at the remote end.
- the layout 9 in the embodiment of the present invention consists of four sites PA, PB, PC, and PD.
- the configuration of each site is the same.
- the four sites are controlled by the MCU.
- Each venue contains 3 large-size flat panel displays as the main display device, such as a 65-inch or 70-inch flat panel display for high-definition images that are close to life-size, and can be used with PDP TVs, LCD TVs or DLP rear-projection TVs.
- Flat panel display technology The three displays are placed in a folded plane, the middle display and the two sides of the display are close together, and the images of the three displays form a complete representation of the meeting room scene.
- the display device used in the system is a display with multiple viewing angles. Each display in the above figure has 3 viewing angles, each viewing angle can present different content, and each viewing angle corresponds to one participant of the venue; in each display
- the top is equipped with an HD camera that collects the participants' images from three angles.
- the camera can support HD image collection with 720p and 1080p resolutions.
- the auxiliary display on the side of the conference table can display information such as shared data.
- the conference table of each conference room is Dl, D2, D3 from left to right
- the display is Tl, ⁇ 2, ⁇ 3, and the three viewing angles of each display are VI, V2, V3 from left to right;
- the cameras are Tl—Cl, T2—Cl, and T3—CI.
- the cameras Tl—Cl, T2—Cl, and T3 C1 can independently cover the areas where all conference tables D1, D2, and D3 are located.
- the viewing angle VI of T1 corresponds to the conference table D1
- the viewing angle V2 of T1 corresponds to the conference table D2
- the viewing angle V3 of T1 corresponds to the conference table D3
- the viewing angle VI of T2 corresponds to the conference table D1
- the viewing angle V2 of T2 corresponds to the conference table D1
- the viewing angle V2 of T2 corresponds.
- the angle of view V3 of the T2 corresponds to the conference table D3
- the perspective VI of the T3 corresponds to the conference table D1
- the perspective V2 of the T3 corresponds to the conference table D2
- the perspective V3 of the T3 corresponds to the conference table D3.
- each participant in each site can view the participants of all the sites.
- all the videos of the site are uploaded to the MCU, and the corresponding video is stitched in the MCU. Destination site; Now assume that the PA site needs to view the content of all sites, then in the case of eye-to-eye, the correspondence between the transmission and reception of the video stream is:
- the PB Tl C1+ PC Tl C1+ PD Tl CI identifies the MCU to splicing the CI video of the PB site and the CI video of the PB site to form a new video stream.
- the three display devices in the site respectively display the three sites at the remote end.
- the following video stream correspondence is as follows: Suppose the PB site is displayed on the T1 display. The PC venue is on the T2 display, and the PD venue is displayed on the T3 display:
- the layout 10 in the embodiment of the present invention consists of three sites, PA, PB, and PC.
- the configuration of each site is different.
- the three sites are controlled by the MCU.
- Venue A contains 3 large-size flat panel displays as the main display device, such as 65-inch or 70-inch flat panel display, for displaying high-definition images close to life-size, and can use flat panel displays such as PDP TV, LCD TV or DLP rear projection TV. technology.
- the three displays are placed in a folded plane, the middle display and the two sides of the display are close together, and the images of the three displays form a complete representation of the meeting room scene.
- the display device used in the system is a display with multiple viewing angles. Each display in the above figure has 3 viewing angles, each viewing angle can present different content, and each viewing angle corresponds to one participant of the venue; in each display
- the top is equipped with an HD camera that collects the participants' images from three angles.
- the camera can support HD image collection with 720p and 1080p resolution.
- the auxiliary display on the side of the conference table can display information such as shared data. Refer to Layout 1 for the specific device orientation and coverage relationship of Site A.
- Venue B contains a large-size multi-view flat panel display as a display device, such as a 65-inch or 70-inch flat panel display for high-definition images that are close to life-size, and can be used with PDP TVs, LCD TVs or DLP rear-projection TVs.
- the display device has one viewing angle, which can display different contents; one participant in the venue, configured as shown in the above figure, three HD cameras are set in the convergence mode on both sides and in the middle of the display Each camera can capture participants at different angles; the auxiliary display on the side of the conference table can display information such as shared data.
- the specific device orientation and coverage relationship of Site B refer to Layout 3.
- Venue C uses two large-size multi-view flat panel displays as display devices, such as 65-inch or 70-inch flat panel displays for high-definition images that are close to life-size, and can be used with PDP TVs, LCD TVs or DLP backs.
- Invest in flat panel display technology such as television.
- the display device has multiple viewing angles and can display different content.
- Each participant has two participant areas, which are configured as shown in the figure above.
- Each participant in the participant area can watch the multi-view display device.
- Each camera can capture all participants at different angles; the auxiliary display on the side of the conference table can display information such as shared data.
- Layout 2 For the specific device orientation and coverage relationship of Site C, refer to Layout 2.
- the camera C1 on the display T1 captures the conference area D1;
- Camera C2 on the display T1 captures the conference area D1;
- the camera C1 on the display T2 captures the conference area D1;
- the participant area D1 can see the viewing angle V2 of the display T1;
- the participant area D1 can see the viewing angle VI of the display T2;
- Participant area D2 can see the viewing angle V3 of the display T1;
- the participant area D2 can see the viewing angle V2 of the display T2;
- the three displays display the contents of the two sites in BC.
- the T1 screen displays the B site
- the T2 displays the D site of the C site
- the T3 displays the D2 content of the C site.
- the site of the C site can choose to view the contents of the two sites of the AB.
- T1 displays the content of the site B
- T2 displays the content of the D2 participant in the site.
- a T3 C2 >C T2 V2
- the multi-view local video information is obtained and sent to the remote display, and the multi-view remote video information from the far end is displayed on the local end, and the eye-to-eye can be realized as long as the viewing angle is properly matched during display. Meeting effect.
- the storage medium may be a magnetic disk, an optical disk, a read-only memory (ROM), or a random access memory (RAM).
Landscapes
- Engineering & Computer Science (AREA)
- Multimedia (AREA)
- Signal Processing (AREA)
- Two-Way Televisions, Distribution Of Moving Picture Or The Like (AREA)
Abstract
L'invention concerne un procédé et un système de traitement de vidéo. Une partie de transmission de vidéo du procédé consiste à : acquérir des informations de vidéo locale ayant différents angles de visualisation, le nombre d'angles de visualisation des informations de vidéo locale n'étant pas inférieur au nombre de points d'observation à distance ; et transmettre les informations de vidéo locale ayant différents angles de visualisation à une extrémité à distance. Une partie d'affichage de vidéo du procédé consiste à : recevoir des informations de vidéo à distance ayant différents angles de visualisation, le nombre d'angles de visualisation des informations de vidéo à distance n'étant pas inférieur au nombre de points d'observation locaux ; utiliser un dispositif d'affichage à angles de visualisation multiples pour afficher les informations de vidéo à distance d'angles de visualisation correspondants aux différents points d'observation locaux, permettant ainsi de mettre en œuvre un effet d'affichage visuel, le nombre d'angles de visualisation d'affichage du dispositif d'affichage à angles de visualisation multiples n'étant pas inférieur au nombre de points d'observation locaux ; le nombre de points d'observation à distance et le nombre de points d'observation locaux étant tous les deux des nombres naturels, et au moins un nombre entre le nombre de points d'observation à distance et le nombre de points d'observation locaux n'étant pas inférieur à deux. L'emploi de la présente invention permet la mise en œuvre de l'effet visuel dans un système de téléconférence.
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201110335141.5A CN103096015B (zh) | 2011-10-28 | 2011-10-28 | 一种视频处理方法和系统 |
CN201110335141.5 | 2011-10-28 |
Publications (1)
Publication Number | Publication Date |
---|---|
WO2013060295A1 true WO2013060295A1 (fr) | 2013-05-02 |
Family
ID=48167129
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/CN2012/083637 WO2013060295A1 (fr) | 2011-10-28 | 2012-10-27 | Procédé et système de traitement de vidéo |
Country Status (2)
Country | Link |
---|---|
CN (1) | CN103096015B (fr) |
WO (1) | WO2013060295A1 (fr) |
Families Citing this family (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN103310233B (zh) * | 2013-06-28 | 2016-03-23 | 青岛科技大学 | 同类行为多视图间相似度挖掘方法及行为识别方法 |
CN104639518B (zh) * | 2013-11-14 | 2018-12-21 | 中兴通讯股份有限公司 | 会话建立的方法、装置及会话内容的递送方法和装置 |
CN106488170B (zh) * | 2015-08-28 | 2020-01-10 | 华为技术有限公司 | 视频通讯的方法和系统 |
Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JP2009055094A (ja) * | 2007-08-23 | 2009-03-12 | Sharp Corp | 映像システム |
US20090146915A1 (en) * | 2007-12-05 | 2009-06-11 | Marathe Madhav V | Multiple view display device |
CN101668160A (zh) * | 2009-09-10 | 2010-03-10 | 深圳华为通信技术有限公司 | 视频图像数据处理方法、装置及视频会议系统及终端 |
CN102047657A (zh) * | 2008-05-30 | 2011-05-04 | 坦德伯格电信公司 | 在显示器上显示图像的方法 |
-
2011
- 2011-10-28 CN CN201110335141.5A patent/CN103096015B/zh active Active
-
2012
- 2012-10-27 WO PCT/CN2012/083637 patent/WO2013060295A1/fr active Application Filing
Patent Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JP2009055094A (ja) * | 2007-08-23 | 2009-03-12 | Sharp Corp | 映像システム |
US20090146915A1 (en) * | 2007-12-05 | 2009-06-11 | Marathe Madhav V | Multiple view display device |
CN102047657A (zh) * | 2008-05-30 | 2011-05-04 | 坦德伯格电信公司 | 在显示器上显示图像的方法 |
CN101668160A (zh) * | 2009-09-10 | 2010-03-10 | 深圳华为通信技术有限公司 | 视频图像数据处理方法、装置及视频会议系统及终端 |
Also Published As
Publication number | Publication date |
---|---|
CN103096015A (zh) | 2013-05-08 |
CN103096015B (zh) | 2015-03-11 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN102342100B (zh) | 用于在网络环境中提供三维成像的系统和方法 | |
US8259155B2 (en) | Providing perspective-dependent views to video conference participants | |
US8638354B2 (en) | Immersive video conference system | |
CN102638672B (zh) | 用于多流多站点远程呈现会议系统的自动视频布局 | |
US8896655B2 (en) | System and method for providing depth adaptive video conferencing | |
US20070171275A1 (en) | Three Dimensional Videoconferencing | |
US20070182812A1 (en) | Panoramic image-based virtual reality/telepresence audio-visual system and method | |
CN102843542B (zh) | 多流会议的媒体协商方法、设备和系统 | |
WO2010074582A1 (fr) | Procédé, dispositif et programme informatique pour traiter des images dans une conférence entre une pluralité de terminaux de visioconférence | |
WO2010130084A1 (fr) | Système de téléprésence, procédé et dispositif de capture vidéo | |
WO2018214746A1 (fr) | Procédé, dispositif et système de réalisation de conférence vidéo, et support de stockage informatique | |
EP2335415A1 (fr) | Procédé, dispositif et programme d'ordinateur pour traiter des images durant une visioconférence | |
KR20100085188A (ko) | 3차원 비디오 통신 단말기, 시스템 및 방법 | |
US20090146915A1 (en) | Multiple view display device | |
WO2011140812A1 (fr) | Procédé et système de synthèse à plusieurs images et dispositif de traitement multimédia | |
US9253442B1 (en) | Holopresence system | |
WO2013159515A1 (fr) | Procédé et dispositif pour le transfert d'une image vidéo de téléprésence, et système de téléprésence correspondant | |
JP3587106B2 (ja) | 視線一致テレビ会議装置 | |
JP2002300602A (ja) | 窓状撮像表示装置及びそれを使う双方向通信方法 | |
CN214959711U (zh) | 一种轻量化多平台互动视频直播云播控系统 | |
US20120038738A1 (en) | Gaze correcting apparatus, a method of videoconferencing and a videoconferencing system | |
US20210367985A1 (en) | Immersive telepresence video conference system | |
WO2013060295A1 (fr) | Procédé et système de traitement de vidéo | |
WO2011011917A1 (fr) | Procédé, dispositif et système de communication vidéo | |
JP2017184162A (ja) | 映像表示システム及び映像表示方法 |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
121 | Ep: the epo has been informed by wipo that ep was designated in this application |
Ref document number: 12844206 Country of ref document: EP Kind code of ref document: A1 |
|
NENP | Non-entry into the national phase |
Ref country code: DE |
|
122 | Ep: pct application non-entry in european phase |
Ref document number: 12844206 Country of ref document: EP Kind code of ref document: A1 |