WO2023040616A1 - Terminal device and video call method - Google Patents

Terminal device and video call method Download PDF

Info

Publication number
WO2023040616A1
WO2023040616A1 PCT/CN2022/114748 CN2022114748W WO2023040616A1 WO 2023040616 A1 WO2023040616 A1 WO 2023040616A1 CN 2022114748 W CN2022114748 W CN 2022114748W WO 2023040616 A1 WO2023040616 A1 WO 2023040616A1
Authority
WO
WIPO (PCT)
Prior art keywords
image
terminal device
display panel
unit
image acquisition
Prior art date
Application number
PCT/CN2022/114748
Other languages
French (fr)
Chinese (zh)
Inventor
梁震
Original Assignee
中兴通讯股份有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 中兴通讯股份有限公司 filed Critical 中兴通讯股份有限公司
Publication of WO2023040616A1 publication Critical patent/WO2023040616A1/en

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F3/00Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F3/01Input arrangements or combined input and output arrangements for interaction between user and computer
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N23/00Cameras or camera modules comprising electronic image sensors; Control thereof
    • H04N23/60Control of cameras or camera modules
    • H04N23/695Control of camera direction for changing a field of view, e.g. pan, tilt or based on tracking of objects
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N7/00Television systems
    • H04N7/14Systems for two-way working

Definitions

  • the present disclosure relates to, but is not limited to, the field of display technology.
  • Video calls can be made on mobile terminals (such as mobile phones, laptops, etc.), that is, during the call, the mobile terminals of both users collect images of their own users and send them to the mobile terminal of the other party.
  • the users on both sides can also see the image (or video) of the other user, which has an effect similar to "face-to-face” communication; among them, the image can also be displayed on a larger screen such as a TV. display on the display device of the screen, so as to achieve the effect of video conferencing.
  • the present disclosure provides a terminal device and a method for video calling.
  • the present disclosure provides a terminal device, which includes: a transceiver unit configured to receive a peer image and send a local image; a display panel configured to display the peer image; an analysis unit configured to determine the The viewpoint position in the peer image displayed on the display panel; the driving unit configured to drive the image acquisition unit to move to the viewpoint position; the image acquisition unit configured to acquire the local image along the light emitting direction of the display panel.
  • the present disclosure provides a video call method, which is used in any one of the terminal devices described in the present disclosure, the method comprising: the transceiver unit receives an image of the opposite end; the display panel displays the image of the opposite end The analysis unit determines the viewpoint position in the peer image displayed on the display panel; the drive unit drives the image acquisition unit to move to the viewpoint position; the image acquisition unit acquires the local image at the viewpoint position along the light emitting direction of the display panel; The transceiver unit sends the local image.
  • FIG. 1 is a schematic block diagram of a terminal device provided by the present disclosure.
  • Fig. 2 is a schematic workflow diagram of an analysis unit in a terminal device provided by the present disclosure.
  • FIG. 3 is a schematic diagram of a peer image displayed on a display panel in a terminal device provided in the present disclosure.
  • FIG. 4 is a schematic structural diagram of a drive unit and an image acquisition unit located inside a display panel in a terminal device provided by the present disclosure.
  • Fig. 5 is a schematic diagram of a peer image displayed on a display panel in another terminal device provided by the present disclosure.
  • Fig. 6 is a schematic flowchart of a video calling method provided by the present disclosure.
  • the present disclosure may be described with reference to plan views and/or cross-sectional views by way of idealized schematic views of the present disclosure. Accordingly, the example illustrations may be modified according to manufacturing techniques and/or tolerances.
  • the terms used in the present disclosure are for describing specific embodiments only, and are not intended to limit the present disclosure.
  • the term “and/or” includes any and all combinations of one or more of the associated listed items.
  • the singular forms “a” and “the” are intended to include the plural forms as well, unless the context clearly dictates otherwise.
  • the terms “comprising”, “made up of” designate the presence of said features, integers, steps, operations, elements and/or components, but do not exclude the presence or addition of one or more other features, Integrals, steps, operations, elements, components and/or groups thereof.
  • the present disclosure is not limited to the embodiments shown in the drawings, but includes modifications of configurations formed based on manufacturing processes. Accordingly, the regions illustrated in the figures have schematic properties, and the shapes of the regions shown in the figures illustrate the specific shapes of the regions of the elements, but are not intended to be limiting.
  • the present disclosure provides a terminal device.
  • the terminal device provided by the present disclosure has a display function, an image collection function, an information transmission function, etc., and also has a voice collection function and a voice playback function, so that video calls can be realized.
  • the functions of the terminal device are not limited to video calls, and it can also implement other functions such as voice calls and local program running.
  • the terminal device is a mobile terminal.
  • the terminal device may be a mobile terminal, such as a mobile phone, a tablet computer, etc., because a mobile terminal is a commonly used device for performing video calls.
  • the type of terminal equipment is not limited to this, it also can be other types such as notebook computer, desktop computer, dedicated video conferencing equipment.
  • a terminal device provided by the present disclosure includes: a display panel 1 , a drive unit 2 , an image acquisition unit 3 , a transceiver unit 4 , and an analysis unit 5 .
  • the transceiver unit 4 is configured to receive the image of the opposite end and send the image of the local end.
  • Display panel 1 is configured to display the image of the peer end.
  • the analysis unit 5 is configured to determine the viewpoint position 92 in the peer image displayed by the display panel 1 .
  • the driving unit 2 is configured to drive the image acquisition unit 3 to move to the viewpoint position 92 .
  • the image acquisition unit 3 is configured to acquire a local image along the light emitting direction of the display panel 1 .
  • the terminal device in the embodiment of the present disclosure includes a transceiver unit 4 (such as a wireless communication unit, a wireless communication circuit, etc.) capable of realizing a remote information interaction function.
  • the transceiver unit 4 can receive an image from a peer terminal device in a video call.
  • the peer image is the image of the peer user collected by the image acquisition unit 3 of the peer terminal device (so it is also the local image of the peer terminal device).
  • the display panel 1 (such as a liquid crystal display panel 1, an organic light-emitting diode display panel 1, etc.) displays the above peer image, so that the local user can see the counterpart user's image.
  • the analysis unit 5 can determine the viewpoint position 92 of the counterpart user in the counterpart image according to the counterpart image displayed on the display panel 1 (that is, the position where the counterpart user's line of sight is emitted in the counterpart image) and display it on the local display panel.
  • the analysis unit can be an analysis circuit or the like.
  • the drive unit 2 drives the image acquisition unit 3 (such as a camera) to move to the above viewpoint position 92, so that the image acquisition unit 3 moves along the light emitting direction of the display panel 1 at the viewpoint position 92 (that is, the image captured by the image acquisition unit 3 is The image of the person facing the display surface of the display panel 1, that is, the image of the local user) collects the local image, and the transceiver unit 4 sends the local image to the opposite terminal device (so the local image is also the opposite terminal device).
  • the opposite terminal image of the terminal terminal device for the display panel 1 of the opposite terminal device to display, so that the other party user can see the image of the local user.
  • the image acquisition unit 3 therein can also move according to the viewpoint position 92 in the image of the local end, and collect the image of the opposite user and send it to the local end Terminal Equipment.
  • the transceiver unit 4 should continue to receive the image of the opposite end, so that each frame of the image of the opposite end can be processed in the above way, that is, the image acquisition unit 3 can "track" the opposite end in real time The viewpoint position 92 of the counterpart user in the image.
  • the position of the image acquisition unit 3 should also remain unchanged.
  • the display panel 1 should still display the opposite user. end image, the image acquisition unit 3 should still acquire the local end image, but the position of the image acquisition unit 3 can remain unchanged or be moved to a default position.
  • the peer image may also include other scenes, and these scenes can be displayed by the display panel 1, but the analysis unit 5 may not analyze them.
  • the terminal device may also include other units such as a voice playback unit (such as a speaker), a voice receiving unit (such as a microphone), and its transceiver unit 4 may also send and receive other information (such as audio information), It will not be described in detail here.
  • a voice playback unit such as a speaker
  • a voice receiving unit such as a microphone
  • its transceiver unit 4 may also send and receive other information (such as audio information), It will not be described in detail here.
  • the image acquisition unit 3 can continuously collect the local image (the image of the user) at the viewpoint position 92 of the image of the opposite end (that is, the image of the other user) and send it to the other user, so that the collected image of the local end It is similar to the image directly seen by the other user's eyes, that is, the image seen by the other user is similar to the effect of "seeing directly” by oneself.
  • the above local image can better convey information such as subtle eyes and body language (for example, the other user can easily see whether the local user is looking directly at him or looking away from the local image) ), thereby increasing the amount of information transmitted, making the effect of video calls more similar to "face-to-face” communication.
  • the other user can feel more detailed information, and the video call needs to be carried out between the two users, so if both users want to make a video call, they can To feel more detailed information, both users need to use the terminal equipment provided by the embodiments of the present disclosure.
  • both users need to use the terminal equipment provided by the embodiments of the present disclosure.
  • only one user uses the terminal equipment provided by the embodiment of the present disclosure, and the other user uses other conventional terminal equipment to make a video call, it is of course also feasible.
  • determining the viewpoint position 92 in the peer image displayed on the display panel 1 includes step S101 and step S102 .
  • step S101 image analysis is performed on the peer image to determine the position of the pupil 91 on the human face.
  • step S102 the viewpoint position 92 is determined based on the position of the pupil 91 .
  • the image of the opposite end through image analysis (such as target recognition) technology to determine whether there is a human face in the image of the opposite end, and whether there is a pupil 91 (or eyes) on the human face , and the position of the pupil 91; furthermore, the above viewpoint position 92 can be determined according to the above position of the pupil 91.
  • image analysis such as target recognition
  • determining the viewpoint position 92 according to the position of the pupil 91 ( S102 ) includes: S1021 , determining the position of the pupil 91 as the viewpoint position 92 .
  • the position of the pupil 91 determined above may be used as the viewpoint position 92 .
  • the position of one of the pupils 91 can be selected is the viewpoint position 92; with reference to Fig. 5, if there are two image acquisition units 3 (such as binocular cameras) in the local terminal device, then the positions of the two pupils 91 can be respectively the corresponding viewpoints of the two image acquisition units 3
  • the position 92 is the position where the two image acquisition units 3 move to the two pupils 91 respectively.
  • determining the viewpoint position 92 according to the position of the pupil 91 may also include: S1022 , determining the position of the midpoint of a line connecting two pupils 91 of a human face as the viewpoint position 92 .
  • the middle position of the two pupils 91 (the midpoint of the line between the two pupils 91 ) can also be used as the above viewpoint position 92 .
  • the image acquisition unit 3 and the driving unit are arranged inside the display panel 1 .
  • the image acquisition unit 3 needs to move to the viewpoint position 92 of the opposite end image, and in order to prevent the image acquisition unit 3 from blocking the viewpoint position 92 of the opposite end image (such as the eyes of the opposite user) and affecting the viewing effect of the own user, so as
  • the image acquisition unit 3 and the drive unit 2 above can be arranged "inside” the display panel 1, that is, the image acquisition unit 3 can be located “inside” the display surface of the display panel 1, so that " Under-screen camera” and so on. Therefore, for the local user watching the display panel 1, the image acquisition unit 3 and the driving unit are "invisible".
  • the driving unit 2 includes a first track 21 and a second track 22, the second track 22 is movably arranged on the first track 21, and the image acquisition unit 3 is movably arranged on the second track 22;
  • the extending direction of the first rail 21 intersects the extending direction of the second rail 22 .
  • the extending direction of the first track 21 is perpendicular to the extending direction of the second track 22 ; the extending direction of the first track 21 and the extending direction of the second track 22 are both parallel to the display surface of the display panel 1 .
  • the driving unit 2 may include two intersecting tracks (the first track 21 and the second track 22), wherein the second track 22 can move on the first track 21 (see the arrow in Figure 4), and the image acquisition unit 3 can move on the second track 22 (see the arrow in Figure 4 Arrow), thus, the movement of the image acquisition unit 3 in two different directions can be realized.
  • the above two tracks may be perpendicular to each other, and both are parallel to the display surface of the display panel 1, so that the image acquisition unit 3 moves rapidly, and the distance relative to the display surface of the display panel 1 remains unchanged during the movement, so The distance to the local user remains unchanged, so there is no need to refocus due to movement, and the operation is simple.
  • the driving unit 2 is configured to drive the image acquisition unit 3 to move within a preset range 99 , and the preset range 99 corresponds to a partial area of the display surface of the display panel 1 .
  • the image acquisition unit 3 can only move within a partial area of the display surface of the display panel 1 , but cannot move to all positions of the display surface of the display panel 1 . This is because, generally speaking, during most voice calls, the face of the opposite user in the image of the opposite end (of course corresponding to the viewpoint position 92) is located in a partial area of the display surface of the display panel 1 of the end (such as upper-middle region) without deviating too much. Therefore, the driving unit 2 only needs to be able to drive the image acquisition unit 3 to move within the preset range 99 corresponding to the above partial area, thereby simplifying the structure of the product and reducing the impact of the driving unit on the display effect.
  • the number of image acquisition units 3 is multiple, and each image acquisition unit 3 has a corresponding drive unit, and each drive unit has a corresponding preset range 99, and different preset ranges 99 have at least some non-overlapping
  • Driving the image acquisition unit 3 to move to the viewpoint position 92 includes: determining at least one preset range 99 where the viewpoint position 92 is located, and making the drive unit corresponding to the preset range 99 move the corresponding image acquisition unit 3 to the viewpoint position 92.
  • each image acquisition unit 3 can have a corresponding drive unit and a preset range 99, and different preset ranges 99 are not completely the same, that is, different image acquisition units 3
  • the possible motion ranges are different, so that when the viewpoint position 92 is determined, the corresponding drive unit and image acquisition unit 3 can be selected to move according to the preset range 99 where the viewpoint position 92 is located ( Figure 5 shows two preset The ranges 99 respectively correspond to the possible areas of the two pupils 91 and the viewpoint positions 92 , of course, multiple preset ranges 99 can also correspond to the possible regions of one viewpoint position 92 ).
  • the present disclosure provides a video call method, which is used for a terminal device in any one of the embodiments of the present disclosure.
  • the video calling method of the present disclosure implements video calling through the above terminal equipment.
  • a video call needs to be carried out between two users, and the method of the present disclosure describes the process of one of the users, that is, at least one user who has a video call uses the terminal device of the embodiment of the present disclosure to conduct a video call.
  • the terminal device provided by the embodiments of the present disclosure, and the other user uses other conventional terminal devices to make a video call, it is of course also feasible.
  • the video call method of the present disclosure includes steps S201 to S206.
  • step S201 the transceiver unit receives the image of the opposite end.
  • step S202 the display panel displays the peer image.
  • step S203 the analysis unit determines the position of the viewpoint in the peer image displayed on the display panel.
  • step S204 the driving unit drives the image acquisition unit to move to the viewpoint position.
  • step S205 the image acquisition unit acquires the local image at the viewpoint along the light emitting direction of the display panel.
  • step S206 the transceiver unit sends the local image.
  • the transceiver unit can also transmit the audio information of both users, etc., which will not be described in detail here.
  • the position of the image acquisition unit is continuously adjusted according to the received image of the opposite end (the image of the other party user), so that the image acquisition unit can collect the local image similar to that directly seen by the eyes of the other end user and concurrently
  • the image seen by the other party user is similar to the effect of "seeing directly” by oneself. Therefore, the above local image can better convey subtle information such as eye contact and body language (for example, the other party user can easily It can be seen whether the user on the other side is looking directly at him or looking away, that is, to achieve eye contact), increase the amount of information transmitted, and make the effect of video calling more similar to face-to-face communication.
  • the transceiver unit will continuously receive the image of the opposite end, and also need to continuously send the image of the local end to the opposite end, so the method of the present disclosure is actually a continuous loop after the video call is started. ongoing until the end of the video call.
  • the transceiver unit receiving the image of the opposite end includes: the transceiver unit receives the video stream of the opposite end, and obtains the image of the opposite end from the video stream of the opposite end; The local image forms the local video stream, and sends the local video stream.
  • the transceiver unit Because it is a "video call” process, the transceiver unit usually sends and receives video streams, so it needs to obtain (such as video decoding) the opposite-end image from the received opposite-end video stream, and obtain (such as video encoding) and send the local video stream.

Landscapes

  • Engineering & Computer Science (AREA)
  • Multimedia (AREA)
  • Signal Processing (AREA)
  • General Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Two-Way Televisions, Distribution Of Moving Picture Or The Like (AREA)

Abstract

Provided in the present disclosure are a terminal device and a video call method. The terminal device comprises: a transmitting-receiving unit, which is configured to receive and transmit an opposite-end image; a display panel, which is configured to display the opposite-end image; an analysis unit, which is configured to determine a viewpoint location in the opposite-end image displayed on the display panel; a driving unit, which is configured to drive an image collection unit to move to the viewpoint location; and the image collection unit, which is configured to collect the local-end image in a light emergence direction of the display panel.

Description

终端设备、视频通话的方法Terminal device, method of video call
相关申请的交叉引用Cross References to Related Applications
本申请要求2021年9月15日提交给中国专利局的第202111077903.6号专利申请的优先权,其全部内容通过引用合并于此。This application claims priority to Patent Application No. 202111077903.6 filed with the China Patent Office on September 15, 2021, the entire contents of which are hereby incorporated by reference.
技术领域technical field
本公开涉及但不限于显示技术领域。The present disclosure relates to, but is not limited to, the field of display technology.
背景技术Background technique
在移动终端(如手机、笔记本电脑等)上可进行视频通话,即在通话过程中,双方用户的移动终端均采集本方用户的图像,并发给对方用户的移动终端且在对方用户的移动终端上显示,从而双方用户除了听到对方用户的声音,还可看到对方用户的图像(或者说视频),起到类似“面对面”交流的效果;其中,图像还可在电视机等具有更大的屏幕的显示装置上进行显示,以实现视频会议的效果。Video calls can be made on mobile terminals (such as mobile phones, laptops, etc.), that is, during the call, the mobile terminals of both users collect images of their own users and send them to the mobile terminal of the other party. In addition to hearing the voice of the other user, the users on both sides can also see the image (or video) of the other user, which has an effect similar to "face-to-face" communication; among them, the image can also be displayed on a larger screen such as a TV. display on the display device of the screen, so as to achieve the effect of video conferencing.
但在实际的面对面交流中,很多信息(如70%左右的信息)可能是通过细微的眼神、肢体语言等“非语言”的方式传递的,而视频通话技术虽然可以传递图像,但仍不能有效传递这些信息。However, in actual face-to-face communication, a lot of information (such as about 70% of the information) may be transmitted through "non-verbal" means such as subtle eye contact and body language. Although video call technology can transmit images, it is still not effective. pass on this information.
发明内容Contents of the invention
本公开提供一种终端设备、视频通话的方法。The present disclosure provides a terminal device and a method for video calling.
第一方面,本公开提供一种终端设备,其包括:收发单元,配置为接收对端图像和发送本端图像;显示面板,配置为显示所述对端图像;分析单元,配置为确定所述显示面板显示的对端图像中的视点位置;驱动单元,配置为驱动图像采集单元移动至视点位置;图像采集单元,配置为沿所述显示面板的出光方向采集本端图像。In a first aspect, the present disclosure provides a terminal device, which includes: a transceiver unit configured to receive a peer image and send a local image; a display panel configured to display the peer image; an analysis unit configured to determine the The viewpoint position in the peer image displayed on the display panel; the driving unit configured to drive the image acquisition unit to move to the viewpoint position; the image acquisition unit configured to acquire the local image along the light emitting direction of the display panel.
第二方面,本公开提供一种视频通话的方法,其用于本公开任意 一项所述的终端设备,所述方法包括:所述收发单元接收对端图像;所述显示面板显示对端图像;所述分析单元确定显示面板显示的对端图像中的视点位置;所述驱动单元驱动图像采集单元移动至视点位置;所述图像采集单元在视点位置沿显示面板的出光方向采集本端图像;所述收发单元发送本端图像。In a second aspect, the present disclosure provides a video call method, which is used in any one of the terminal devices described in the present disclosure, the method comprising: the transceiver unit receives an image of the opposite end; the display panel displays the image of the opposite end The analysis unit determines the viewpoint position in the peer image displayed on the display panel; the drive unit drives the image acquisition unit to move to the viewpoint position; the image acquisition unit acquires the local image at the viewpoint position along the light emitting direction of the display panel; The transceiver unit sends the local image.
附图说明Description of drawings
图1为本公开提供的一种终端设备的组成示意框图。FIG. 1 is a schematic block diagram of a terminal device provided by the present disclosure.
图2为本公开提供的一种终端设备中的分析单元的工作流程示意图。Fig. 2 is a schematic workflow diagram of an analysis unit in a terminal device provided by the present disclosure.
图3为本公开提供的一种终端设备中的显示面板显示的对端图像的示意图。FIG. 3 is a schematic diagram of a peer image displayed on a display panel in a terminal device provided in the present disclosure.
图4为本公开提供的一种终端设备中的位于显示面板内部的驱动单元和图像采集单元的结构示意图。FIG. 4 is a schematic structural diagram of a drive unit and an image acquisition unit located inside a display panel in a terminal device provided by the present disclosure.
图5为本公开提供的另一种终端设备中的显示面板显示的对端图像的示意图。Fig. 5 is a schematic diagram of a peer image displayed on a display panel in another terminal device provided by the present disclosure.
图6为本公开提供的一种视频通话方法的流程示意图。Fig. 6 is a schematic flowchart of a video calling method provided by the present disclosure.
具体实施方式Detailed ways
为使本领域的技术人员更好地理解本公开的技术方案,下面结合附图对本公开实施方式提供的终端设备、视频通话方法进行详细描述。In order for those skilled in the art to better understand the technical solution of the present disclosure, the terminal device and the video call method provided by the embodiments of the present disclosure will be described in detail below with reference to the accompanying drawings.
在下文中将参考附图更充分地描述本公开,但是所示的实施方式可以以不同形式来体现,且本公开不应当被解释为限于以下阐述的实施方式。反之,提供这些实施方式的目的在于使本公开透彻和完整,并将使本领域技术人员充分理解本公开的范围。The present disclosure will be described more fully hereinafter with reference to the accompanying drawings, but the illustrated embodiments may be embodied in different forms, and the present disclosure should not be construed as limited to the embodiments set forth below. Rather, these embodiments are provided so that this disclosure will be thorough and complete, and will fully convey the scope of the disclosure to those skilled in the art.
本公开实施方式的附图用来提供对本公开实施方式的进一步理解,并且构成说明书的一部分,与详细实施方式一起用于解释本公开,并不构成对本公开的限制。通过参考附图对详细实施方式进行描述,以上和其它特征和优点对本领域技术人员将变得更加显而易见。The drawings of the embodiments of the present disclosure are used to provide a further understanding of the embodiments of the present disclosure, and constitute a part of the description, and are used together with the detailed embodiments to explain the present disclosure, and do not constitute limitations to the present disclosure. The above and other features and advantages will become more apparent to those skilled in the art by describing detailed embodiments with reference to the accompanying drawings.
本公开可借助本公开的理想示意图而参考平面图和/或截面图进行描述。因此,可根据制造技术和/或容限来修改示例图示。The present disclosure may be described with reference to plan views and/or cross-sectional views by way of idealized schematic views of the present disclosure. Accordingly, the example illustrations may be modified according to manufacturing techniques and/or tolerances.
在不冲突的情况下,本公开各实施方式及实施方式中的各特征可相互组合。In the case of no conflict, each embodiment and each feature in the embodiment of the present disclosure can be combined with each other.
本公开所使用的术语仅用于描述特定实施方式,且不意欲限制本公开。如本公开所使用的术语“和/或”包括一个或多个相关列举条目的任何和所有组合。如本公开所使用的单数形式“一个”和“该”也意欲包括复数形式,除非上下文另外清楚指出。如本公开所使用的术语“包括”、“由……制成”,指定存在所述特征、整体、步骤、操作、元件和/或组件,但不排除存在或添加一个或多个其它特征、整体、步骤、操作、元件、组件和/或其群组。The terms used in the present disclosure are for describing specific embodiments only, and are not intended to limit the present disclosure. As used in this disclosure, the term "and/or" includes any and all combinations of one or more of the associated listed items. As used in this disclosure, the singular forms "a" and "the" are intended to include the plural forms as well, unless the context clearly dictates otherwise. As used in the present disclosure, the terms "comprising", "made up of" designate the presence of said features, integers, steps, operations, elements and/or components, but do not exclude the presence or addition of one or more other features, Integrals, steps, operations, elements, components and/or groups thereof.
除非另外限定,否则本公开所用的所有术语(包括技术和科学术语)的含义与本领域普通技术人员通常理解的含义相同。还将理解,诸如那些在常用字典中限定的那些术语应当被解释为具有与其在相关技术以及本公开的背景下的含义一致的含义,且将不解释为具有理想化或过度形式上的含义,除非本公开明确如此限定。Unless otherwise defined, all terms (including technical and scientific terms) used in this disclosure have the same meaning as commonly understood by one of ordinary skill in the art. It will also be understood that terms such as those defined in commonly used dictionaries should be interpreted as having meanings consistent with their meanings in the context of the relevant art and the present disclosure, and will not be interpreted as having idealized or excessive formal meanings, Unless the disclosure expressly so limited.
本公开不限于附图中所示的实施方式,而是包括基于制造工艺而形成的配置的修改。因此,附图中例示的区具有示意性属性,并且图中所示区的形状例示了元件的区的具体形状,但并不是旨在限制性的。The present disclosure is not limited to the embodiments shown in the drawings, but includes modifications of configurations formed based on manufacturing processes. Accordingly, the regions illustrated in the figures have schematic properties, and the shapes of the regions shown in the figures illustrate the specific shapes of the regions of the elements, but are not intended to be limiting.
第一方面,参照图1至图5,本公开提供一种终端设备。In a first aspect, referring to FIG. 1 to FIG. 5 , the present disclosure provides a terminal device.
本公开提供的终端设备具有显示功能、图像采集功能、信息传输功能等,且也具有语音采集功能和语音播放功能,从而可实现视频通话。The terminal device provided by the present disclosure has a display function, an image collection function, an information transmission function, etc., and also has a voice collection function and a voice playback function, so that video calls can be realized.
当然,终端设备的功能不限于视频通话,其也可实现语音通话、本地程序运行等其它功能。Of course, the functions of the terminal device are not limited to video calls, and it can also implement other functions such as voice calls and local program running.
在一些实施方式中,终端设备为移动终端。In some embodiments, the terminal device is a mobile terminal.
作为本公开实施方式的一种方式,终端设备可以是移动终端,如手机、平板电脑等,因为移动终端是较常用的进行视频通话的设备。As a mode of implementation of the present disclosure, the terminal device may be a mobile terminal, such as a mobile phone, a tablet computer, etc., because a mobile terminal is a commonly used device for performing video calls.
当然,终端设备的类型也不限于此,其也可为笔记本电脑、台式 电脑、专用视频会议设备等其它类型。Certainly, the type of terminal equipment is not limited to this, it also can be other types such as notebook computer, desktop computer, dedicated video conferencing equipment.
参照图1,在一个实施方式中,本公开提供的终端设备包括:显示面板1、驱动单元2、图像采集单元3、收发单元4、分析单元5。Referring to FIG. 1 , in one embodiment, a terminal device provided by the present disclosure includes: a display panel 1 , a drive unit 2 , an image acquisition unit 3 , a transceiver unit 4 , and an analysis unit 5 .
收发单元4,配置为接收对端图像和发送本端图像。The transceiver unit 4 is configured to receive the image of the opposite end and send the image of the local end.
显示面板1,配置为显示对端图像。 Display panel 1 is configured to display the image of the peer end.
分析单元5,配置为确定显示面板1显示的对端图像中的视点位置92。The analysis unit 5 is configured to determine the viewpoint position 92 in the peer image displayed by the display panel 1 .
驱动单元2,配置为驱动图像采集单元3移动至视点位置92。The driving unit 2 is configured to drive the image acquisition unit 3 to move to the viewpoint position 92 .
图像采集单元3,配置为沿显示面板1的出光方向采集本端图像。The image acquisition unit 3 is configured to acquire a local image along the light emitting direction of the display panel 1 .
本公开实施方式的终端设备中包括能实现远程信息交互功能的收发单元4(如无线通信单元、无线通信电路等),收发单元4可接收来自视频通话的对端终端设备的对端图像,该对端图像是对端终端设备的图像采集单元3采集的对方用户的图像(故其也就是对端终端设备的本端图像)。The terminal device in the embodiment of the present disclosure includes a transceiver unit 4 (such as a wireless communication unit, a wireless communication circuit, etc.) capable of realizing a remote information interaction function. The transceiver unit 4 can receive an image from a peer terminal device in a video call. The peer image is the image of the peer user collected by the image acquisition unit 3 of the peer terminal device (so it is also the local image of the peer terminal device).
显示面板1(如液晶显示面板1、有机发光二极管显示面板1等)则显示以上对端图像,从而本方用户可看到对方用户的图像。The display panel 1 (such as a liquid crystal display panel 1, an organic light-emitting diode display panel 1, etc.) displays the above peer image, so that the local user can see the counterpart user's image.
由此,分析单元5可根据显示面板1显示的对端图像,确定出在该对端图像中的对方用户的视点位置92(即对端图像中对方用户视线发出的位置)在本端显示面板1的显示面中具体所在的物理位置。分析单元可以为分析电路等。Thus, the analysis unit 5 can determine the viewpoint position 92 of the counterpart user in the counterpart image according to the counterpart image displayed on the display panel 1 (that is, the position where the counterpart user's line of sight is emitted in the counterpart image) and display it on the local display panel. The specific physical location on the display surface of 1. The analysis unit can be an analysis circuit or the like.
进而,驱动单元2则带动图像采集单元3(如摄像头等)移动到以上视点位置92,从而图像采集单元3在该视点位置92沿显示面板1的出光方向(即图像采集单元3拍摄到的是面对显示面板1的显示面的人的图像,也就是本方用户的图像)采集本端图像,收发单元4再将该本端图像发给对端终端设备(故该本端图像也就是对端终端设备的对端图像),供对端终端设备的显示面板1进行显示,使对方用户能看到本方用户的图像。其中,若视频通话的对端终端设备也是本公开实施方式的端终端设备,则其中的图像采集单元3也可根据本端图像中的视点位置92移动,并采集对方用户的图像发给本端终端 设备。Furthermore, the drive unit 2 drives the image acquisition unit 3 (such as a camera) to move to the above viewpoint position 92, so that the image acquisition unit 3 moves along the light emitting direction of the display panel 1 at the viewpoint position 92 (that is, the image captured by the image acquisition unit 3 is The image of the person facing the display surface of the display panel 1, that is, the image of the local user) collects the local image, and the transceiver unit 4 sends the local image to the opposite terminal device (so the local image is also the opposite terminal device). The opposite terminal image of the terminal terminal device), for the display panel 1 of the opposite terminal device to display, so that the other party user can see the image of the local user. Wherein, if the peer terminal device of the video call is also the terminal device of the embodiment of the present disclosure, the image acquisition unit 3 therein can also move according to the viewpoint position 92 in the image of the local end, and collect the image of the opposite user and send it to the local end Terminal Equipment.
当然,在视频通话过程中,收发单元4应当是持续接收到对端图像的,从而对每帧对端图像,都可通过以上方式进行处理,即图像采集单元3可实时的“追踪”对端图像中对方用户的视点位置92。Of course, during the video call, the transceiver unit 4 should continue to receive the image of the opposite end, so that each frame of the image of the opposite end can be processed in the above way, that is, the image acquisition unit 3 can "track" the opposite end in real time The viewpoint position 92 of the counterpart user in the image.
当然,当接收到的多帧对端图像中对方用户的视点位置92保持不变时,则图像采集单元3的位置也应保持不变。Certainly, when the viewpoint position 92 of the opposite user in the received multi-frame opposite end images remains unchanged, the position of the image acquisition unit 3 should also remain unchanged.
当然,接收到的对端图像中可能不存在对方用户(如对方用户临时离开),或者是不存在对方用户的视点位置92(如对方用户转头),此时,显示面板1仍然应显示对端图像,图像采集单元3也仍然应采集本端图像,但图像采集单元3的位置可保持不变,或者移动至默认位置。Of course, there may be no other user in the received image of the other end (such as the other user leaving temporarily), or there may be no viewpoint position 92 of the other user (such as the other user turning his head), at this time, the display panel 1 should still display the opposite user. end image, the image acquisition unit 3 should still acquire the local end image, but the position of the image acquisition unit 3 can remain unchanged or be moved to a default position.
当然,接收到的对端图像中是否有对方用户,对端图像中都还可包括其它的景物,而这些景物都可由显示面板1进行显示,但分析单元5可不对其进行分析。Of course, whether there is the opposite user in the received peer image, the peer image may also include other scenes, and these scenes can be displayed by the display panel 1, but the analysis unit 5 may not analyze them.
当然,本公开实施方式提供的终端设备中还可包括语音播放单元(如扬声器)、语音接收单元(如麦克风)等其它单元,且其收发单元4还可收发其它的信息(如音频信息),在此不再详细描述。Of course, the terminal device provided by the embodiments of the present disclosure may also include other units such as a voice playback unit (such as a speaker), a voice receiving unit (such as a microphone), and its transceiver unit 4 may also send and receive other information (such as audio information), It will not be described in detail here.
本公开实施方式中,图像采集单元3可持续在对端图像(即对方用户的图像)的视点位置92采集本端图像(本方用户的图像)并发给对方用户,从而其采集的本端图像类似于对方用户眼睛直接看到的图像,即,对方用户看到的图像类似自己“直接看”的效果。由此,以上本端图像可更好的传递细微的眼神、肢体语言等信息(例如对方用户根据本端图像可很容易的看出本方用户是在目光直视他,还是将视线移开了),从而增加传递的信息量,使视频通话的效果更加类似于“面对面”的交流。In the embodiment of the present disclosure, the image acquisition unit 3 can continuously collect the local image (the image of the user) at the viewpoint position 92 of the image of the opposite end (that is, the image of the other user) and send it to the other user, so that the collected image of the local end It is similar to the image directly seen by the other user's eyes, that is, the image seen by the other user is similar to the effect of "seeing directly" by oneself. As a result, the above local image can better convey information such as subtle eyes and body language (for example, the other user can easily see whether the local user is looking directly at him or looking away from the local image) ), thereby increasing the amount of information transmitted, making the effect of video calls more similar to "face-to-face" communication.
当然,在本方用户使用本公开实施方式提供的终端设备时,是对方用户可感受到更多的细节信息,而视频通话需在双方用户之间进行,故若要视频通话的双方用户都能感受到更多的细节信息,则需要双方用户都使用本公开实施方式提供的终端设备。但如果只有一方用户使用本公开实施方式提供的终端设备,而另一方用户使用其它的常规的 终端设备进行视频通话,当然也是可行的。Of course, when the local user uses the terminal device provided by the embodiment of the present disclosure, the other user can feel more detailed information, and the video call needs to be carried out between the two users, so if both users want to make a video call, they can To feel more detailed information, both users need to use the terminal equipment provided by the embodiments of the present disclosure. However, if only one user uses the terminal equipment provided by the embodiment of the present disclosure, and the other user uses other conventional terminal equipment to make a video call, it is of course also feasible.
在一些实施方式中,参照图2,确定显示面板1显示的对端图像中的视点位置92包括步骤S101和步骤S102。In some implementations, referring to FIG. 2 , determining the viewpoint position 92 in the peer image displayed on the display panel 1 includes step S101 and step S102 .
在步骤S101,对对端图像进行图像分析,确定其中的人脸上的瞳孔91的位置。In step S101, image analysis is performed on the peer image to determine the position of the pupil 91 on the human face.
在步骤S102,根据瞳孔91的位置确定视点位置92。In step S102 , the viewpoint position 92 is determined based on the position of the pupil 91 .
作为本公开实施方式的一种方式,可以是通过图像分析(如目标识别)技术对对端图像进行分析,以确定出对端图像中是否有人脸,人脸上是否有瞳孔91(或眼睛),以及其中瞳孔91所在的位置;进而,可根据以上瞳孔91的位置,确定出以上视点位置92。As a way of implementing the present disclosure, it is possible to analyze the image of the opposite end through image analysis (such as target recognition) technology to determine whether there is a human face in the image of the opposite end, and whether there is a pupil 91 (or eyes) on the human face , and the position of the pupil 91; furthermore, the above viewpoint position 92 can be determined according to the above position of the pupil 91.
在一些实施方式中,根据瞳孔91的位置确定视点位置92(S102)包括:S1021、确定瞳孔91的位置为视点位置92。In some embodiments, determining the viewpoint position 92 according to the position of the pupil 91 ( S102 ) includes: S1021 , determining the position of the pupil 91 as the viewpoint position 92 .
作为本公开实施方式的一种方式,具体可以是就用以上确定出的瞳孔91的位置为视点位置92。As a manner of implementing the present disclosure, specifically, the position of the pupil 91 determined above may be used as the viewpoint position 92 .
例如,当对端图像中有两个瞳孔91(即有对方用户的双眼)时,若本端终端设备中只有一个图像采集单元3(如单目摄像头),则可选择其中一个瞳孔91的位置为视点位置92;参照图5,而若本端终端设备中有两个图像采集单元3(如双目摄像头),则可分别以两个瞳孔91的位置为两个图像采集单元3对应的视点位置92,即两个图像采集单元3分别移动至两个瞳孔91的位置。For example, when there are two pupils 91 (that is, the eyes of the opposite user) in the opposite end image, if there is only one image acquisition unit 3 (such as a monocular camera) in the local terminal device, the position of one of the pupils 91 can be selected is the viewpoint position 92; with reference to Fig. 5, if there are two image acquisition units 3 (such as binocular cameras) in the local terminal device, then the positions of the two pupils 91 can be respectively the corresponding viewpoints of the two image acquisition units 3 The position 92 is the position where the two image acquisition units 3 move to the two pupils 91 respectively.
或者,根据瞳孔91的位置确定视点位置92(S102)也可包括:S1022、确定一个人脸的两个瞳孔91间连线中点的位置为视点位置92。Alternatively, determining the viewpoint position 92 according to the position of the pupil 91 ( S102 ) may also include: S1022 , determining the position of the midpoint of a line connecting two pupils 91 of a human face as the viewpoint position 92 .
参照图3,作为本公开实施方式的另一种方式,也可用两个瞳孔91的中间位置(两个瞳孔91间连线中点)作为以上的视点位置92。Referring to FIG. 3 , as another embodiment of the present disclosure, the middle position of the two pupils 91 (the midpoint of the line between the two pupils 91 ) can also be used as the above viewpoint position 92 .
在一些实施方式中,图像采集单元3和驱动单元设于显示面板1的内部。In some embodiments, the image acquisition unit 3 and the driving unit are arranged inside the display panel 1 .
显然,图像采集单元3需要移动到对端图像的视点位置92,而为了避免图像采集单元3挡住对端图像的视点位置92(如对方用户的双眼)而影响本方用户的观看效果,故作为本公开实施方式的一种 方式,以上图像采集单元3和驱动单元2可设于显示面板1的“内部”,即图像采集单元3可位于显示面板1的显示面的“内侧”,从而为“屏下摄像头”等的形式。因此,对观看显示面板1的本方用户而言,图像采集单元3和驱动单元是“不可见”的。Obviously, the image acquisition unit 3 needs to move to the viewpoint position 92 of the opposite end image, and in order to prevent the image acquisition unit 3 from blocking the viewpoint position 92 of the opposite end image (such as the eyes of the opposite user) and affecting the viewing effect of the own user, so as In one way of the embodiment of the present disclosure, the image acquisition unit 3 and the drive unit 2 above can be arranged "inside" the display panel 1, that is, the image acquisition unit 3 can be located "inside" the display surface of the display panel 1, so that " Under-screen camera" and so on. Therefore, for the local user watching the display panel 1, the image acquisition unit 3 and the driving unit are "invisible".
在一些实施方式中,驱动单元2包括第一轨道21和第二轨道22,第二轨道22可移动的设于第一轨道21上,图像采集单元3可移动的设于第二轨道22上;第一轨道21的延伸方向与第二轨道22的延伸方向交叉。In some embodiments, the driving unit 2 includes a first track 21 and a second track 22, the second track 22 is movably arranged on the first track 21, and the image acquisition unit 3 is movably arranged on the second track 22; The extending direction of the first rail 21 intersects the extending direction of the second rail 22 .
在一些实施方式中,第一轨道21的延伸方向与第二轨道22的延伸方向垂直;第一轨道21的延伸方向和第二轨道22的延伸方向均平行于显示面板1的显示面。In some embodiments, the extending direction of the first track 21 is perpendicular to the extending direction of the second track 22 ; the extending direction of the first track 21 and the extending direction of the second track 22 are both parallel to the display surface of the display panel 1 .
示例性地,参照图4,作为本公开实施方式的一种更具体的方式,驱动单元2可包括两个相互交叉的、“埋设”在显示面板1的显示面之内的轨道(第一轨道21和第二轨道22),其中,第二轨道22可在第一轨道21上移动(见图4中的箭头),而图像采集单元3可在第二轨道22上移动(见图4中的箭头),由此,可实现图像采集单元3在两个不同方向上的移动。For example, referring to FIG. 4 , as a more specific way of implementing the present disclosure, the driving unit 2 may include two intersecting tracks (the first track 21 and the second track 22), wherein the second track 22 can move on the first track 21 (see the arrow in Figure 4), and the image acquisition unit 3 can move on the second track 22 (see the arrow in Figure 4 Arrow), thus, the movement of the image acquisition unit 3 in two different directions can be realized.
进一步的,以上两个轨道可以是相互垂直的,且均平行于显示面板1的显示面,从而图像采集单元3的移动迅速,且在移动时相对显示面板1的显示面的距离不变,故相对本方用户的距离也不变,也就不会因移动而需要重新对焦等,操作简便。Further, the above two tracks may be perpendicular to each other, and both are parallel to the display surface of the display panel 1, so that the image acquisition unit 3 moves rapidly, and the distance relative to the display surface of the display panel 1 remains unchanged during the movement, so The distance to the local user remains unchanged, so there is no need to refocus due to movement, and the operation is simple.
当然,以上驱动单元2和图像采集单元3若为其它的形式,或设于其它位置,也都是可行的,只要能实现图像采集单元3对视点位置92的“追踪”即可。Of course, if the above drive unit 2 and image acquisition unit 3 are in other forms or located in other locations, it is also feasible, as long as the image acquisition unit 3 can realize the "tracking" of the viewpoint position 92.
在一些实施方式中,驱动单元2配置为驱动图像采集单元3在预设范围99内运动,预设范围99对应显示面板1的显示面的部分区域。In some embodiments, the driving unit 2 is configured to drive the image acquisition unit 3 to move within a preset range 99 , and the preset range 99 corresponds to a partial area of the display surface of the display panel 1 .
参照图3至图5,图像采集单元3可只能在显示面板1的显示面的部分区域内活动,而不能运动到显示面板1的显示面的所有位置。这是因为,通常而言,在大多数的语音通话过程中,对端图像中 对方用户的脸部(当然对应的也是视点位置92)都位于本端的显示面板1的显示面的部分区域(如中部偏上的区域)中,而不会偏离太多。因此,驱动单元2只要能驱动图像采集单元3在对应以上部分区域的预设范围99内运动即可,从而可简化产品的结构,并降低驱动单元对显示效果的影响。Referring to FIG. 3 to FIG. 5 , the image acquisition unit 3 can only move within a partial area of the display surface of the display panel 1 , but cannot move to all positions of the display surface of the display panel 1 . This is because, generally speaking, during most voice calls, the face of the opposite user in the image of the opposite end (of course corresponding to the viewpoint position 92) is located in a partial area of the display surface of the display panel 1 of the end (such as upper-middle region) without deviating too much. Therefore, the driving unit 2 only needs to be able to drive the image acquisition unit 3 to move within the preset range 99 corresponding to the above partial area, thereby simplifying the structure of the product and reducing the impact of the driving unit on the display effect.
在一些实施方式中,图像采集单元3的数量为多个,每个图像采集单元3具有对应的驱动单元,每个驱动单元具有对应的预设范围99,不同预设范围99有至少部分不重合;驱动图像采集单元3移动至视点位置92包括:确定视点位置92所在的至少一个预设范围99,使该预设范围99对应的驱动单元将对应的图像采集单元3移动至视点位置92。In some embodiments, the number of image acquisition units 3 is multiple, and each image acquisition unit 3 has a corresponding drive unit, and each drive unit has a corresponding preset range 99, and different preset ranges 99 have at least some non-overlapping Driving the image acquisition unit 3 to move to the viewpoint position 92 includes: determining at least one preset range 99 where the viewpoint position 92 is located, and making the drive unit corresponding to the preset range 99 move the corresponding image acquisition unit 3 to the viewpoint position 92.
参照图5,当图像采集单元3为多个时,每个图像采集单元3可都有对应的驱动单元和预设范围99,且不同预设范围99不完全相同,即,不同图像采集单元3可能的运动范围不同,从而当确定出视点位置92时,可根据视点位置92所在的预设范围99,选择对应的驱动单元和图像采集单元3进行移动(图5示出的是两个预设范围99分别对应两个瞳孔91/视点位置92的可能区域,当然也可以是多个预设范围99均对应一个视点位置92的可能区域)。Referring to Fig. 5, when there are multiple image acquisition units 3, each image acquisition unit 3 can have a corresponding drive unit and a preset range 99, and different preset ranges 99 are not completely the same, that is, different image acquisition units 3 The possible motion ranges are different, so that when the viewpoint position 92 is determined, the corresponding drive unit and image acquisition unit 3 can be selected to move according to the preset range 99 where the viewpoint position 92 is located (Figure 5 shows two preset The ranges 99 respectively correspond to the possible areas of the two pupils 91 and the viewpoint positions 92 , of course, multiple preset ranges 99 can also correspond to the possible regions of one viewpoint position 92 ).
第二方面,参照图6,本公开提供一种视频通话方法,其用于本公开实施方式的任意一项的终端设备。In a second aspect, referring to FIG. 6 , the present disclosure provides a video call method, which is used for a terminal device in any one of the embodiments of the present disclosure.
本公开的视频通话方法通过以上的终端设备,实现视频通话。The video calling method of the present disclosure implements video calling through the above terminal equipment.
应当理解,视频通话需在两方用户之间进行,而本公开的方法描述的是其中一方用户的过程,即至少有视频通话的一方用户是采用本公开实施方式的终端设备进行视频通话的。但如果只有一方用户使用本公开实施方式提供的终端设备,而另一方用户使用其它的常规的终端设备进行视频通话,当然也是可行的。It should be understood that a video call needs to be carried out between two users, and the method of the present disclosure describes the process of one of the users, that is, at least one user who has a video call uses the terminal device of the embodiment of the present disclosure to conduct a video call. However, if only one user uses the terminal device provided by the embodiments of the present disclosure, and the other user uses other conventional terminal devices to make a video call, it is of course also feasible.
参照图6,在一个实施方式中,本公开的视频通话方法包括步骤S201至S206。Referring to FIG. 6 , in one embodiment, the video call method of the present disclosure includes steps S201 to S206.
在步骤S201,收发单元接收对端图像。In step S201, the transceiver unit receives the image of the opposite end.
在步骤S202,显示面板显示对端图像。In step S202, the display panel displays the peer image.
在步骤S203,分析单元确定显示面板显示的对端图像中的视点位置。In step S203, the analysis unit determines the position of the viewpoint in the peer image displayed on the display panel.
在步骤S204,驱动单元驱动图像采集单元移动至视点位置。In step S204, the driving unit drives the image acquisition unit to move to the viewpoint position.
在步骤S205,图像采集单元在视点位置沿显示面板的出光方向采集本端图像。In step S205, the image acquisition unit acquires the local image at the viewpoint along the light emitting direction of the display panel.
在步骤S206,收发单元发送本端图像。In step S206, the transceiver unit sends the local image.
当然,在视频通话过程中,除了以上过程外,收发单元还可传递双方用户的音频信息等,在此不再详细描述。Of course, during the video call, in addition to the above process, the transceiver unit can also transmit the audio information of both users, etc., which will not be described in detail here.
本公开的视频通话方法中,根据接收到的对端图像(对方用户的图像)不断调整图像采集单元的位置,以使图像采集单元可以采集到类似于对方用户眼睛直接看到的本端图像并发给对方用户的终端设备,即对方用户看到的图像类似自己“直接看”的效果,由此,以上本端图像可更好的传递细微的眼神、肢体语言等信息(例如对方用户可很容易的看出本方用户是在目光直视他,还是将视线移开了,即实现眼神交流),增加传递的信息量,使视频通话的效果更加类似于面对面的交流。In the video call method of the present disclosure, the position of the image acquisition unit is continuously adjusted according to the received image of the opposite end (the image of the other party user), so that the image acquisition unit can collect the local image similar to that directly seen by the eyes of the other end user and concurrently For the terminal device of the other party user, that is, the image seen by the other party user is similar to the effect of "seeing directly" by oneself. Therefore, the above local image can better convey subtle information such as eye contact and body language (for example, the other party user can easily It can be seen whether the user on the other side is looking directly at him or looking away, that is, to achieve eye contact), increase the amount of information transmitted, and make the effect of video calling more similar to face-to-face communication.
其中,应当理解,在视频通话的过程中,收发单元会不断接收到对端图像,且也需要不断将本端图像发送给对端,故本公开的方法在开始视频通话后,实际是不断循环进行的,直到视频通话结束为止。Among them, it should be understood that during the video call process, the transceiver unit will continuously receive the image of the opposite end, and also need to continuously send the image of the local end to the opposite end, so the method of the present disclosure is actually a continuous loop after the video call is started. ongoing until the end of the video call.
在一些实施方式中,收发单元接收对端图像(S201)包括:收发单元接收对端视频流,从对端视频流中获取对端图像;收发单元发送本端图像(S206)包括:收发单元用本端图像形成本端视频流,并发送本端视频流。In some embodiments, the transceiver unit receiving the image of the opposite end (S201) includes: the transceiver unit receives the video stream of the opposite end, and obtains the image of the opposite end from the video stream of the opposite end; The local image forms the local video stream, and sends the local video stream.
由于是“视频通话”过程,因此,收发单元通常实际发送和接收的都是视频流,从而其需要从接收的对端视频流中得到(如视频解码)对端图像,并根据本端图像得到(如视频编码)本端视频流并发送。Because it is a "video call" process, the transceiver unit usually sends and receives video streams, so it needs to obtain (such as video decoding) the opposite-end image from the received opposite-end video stream, and obtain (such as video encoding) and send the local video stream.
本公开已经公开了示例实施方式,并且虽然采用了具体术语,但它们仅用于并仅应当被解释为一般说明性含义,并且不用于限制的目的。在一些实例中,对本领域技术人员显而易见的是,除非另外明确指出,否则可单独使用与特定实施方式相结合描述的特征、特性和/ 或元素,或可与其它实施方式相结合描述的特征、特性和/或元件组合使用。因此,本领域技术人员将理解,在不脱离由所附的权利要求阐明的本公开的范围的情况下,可进行各种形式和细节上的改变。This disclosure has disclosed example embodiments and, although specific terms have been employed, they are used and should be construed in a generic descriptive sense only and not for purposes of limitation. In some instances, it will be apparent to those skilled in the art that features, characteristics and/or elements described in connection with a particular embodiment may be used alone, or may be described in combination with other embodiments, unless explicitly stated otherwise. Combinations of features and/or elements. Accordingly, it will be understood by those of ordinary skill in the art that various changes in form and details may be made without departing from the scope of the present disclosure as set forth in the appended claims.

Claims (11)

  1. 一种终端设备,其包括:A terminal device comprising:
    收发单元,配置为接收对端图像和发送本端图像;A transceiver unit configured to receive the image of the opposite end and send the image of the local end;
    显示面板,配置为显示所述对端图像;a display panel configured to display the peer image;
    分析单元,配置为确定所述显示面板显示的对端图像中的视点位置;An analysis unit configured to determine a viewpoint position in the peer image displayed on the display panel;
    驱动单元,配置为驱动图像采集单元移动至视点位置;a driving unit configured to drive the image acquisition unit to move to the position of the viewpoint;
    图像采集单元,配置为沿所述显示面板的出光方向采集本端图像。The image acquisition unit is configured to acquire the local image along the light emitting direction of the display panel.
  2. 根据权利要求1所述的终端设备,其中,所述确定所述显示面板显示的对端图像中的视点位置包括:The terminal device according to claim 1, wherein said determining the viewpoint position in the peer image displayed on the display panel comprises:
    对所述对端图像进行图像分析,确定其中的人脸上的瞳孔的位置;Performing image analysis on the peer image to determine the position of the pupil on the human face;
    根据所述瞳孔的位置确定视点位置。A viewpoint position is determined according to the position of the pupil.
  3. 根据权利要求2所述的终端设备,其中,所述根据所述瞳孔的位置确定视点位置包括:The terminal device according to claim 2, wherein said determining the position of the viewpoint according to the position of the pupil comprises:
    确定瞳孔的位置为所述视点位置。The position of the pupil is determined as the position of the viewpoint.
  4. 根据权利要求2所述的终端设备,其中,所述根据所述瞳孔的位置确定视点位置包括:The terminal device according to claim 2, wherein said determining the position of the viewpoint according to the position of the pupil comprises:
    确定一个人脸的两个瞳孔间连线中点的位置为所述视点位置。The position of the midpoint of the line between two pupils of a human face is determined as the position of the viewpoint.
  5. 根据权利要求1所述的终端设备,其中,The terminal device according to claim 1, wherein,
    所述图像采集单元和所述驱动单元设于显示面板的内部。The image acquisition unit and the driving unit are arranged inside the display panel.
  6. 根据权利要求5所述的终端设备,其中,The terminal device according to claim 5, wherein,
    所述驱动单元包括第一轨道和第二轨道,所述第二轨道可移动 的设于第一轨道上,所述图像采集单元可移动的设于第二轨道上;所述第一轨道的延伸方向与第二轨道的延伸方向交叉。The drive unit includes a first track and a second track, the second track is movably arranged on the first track, and the image acquisition unit is movably arranged on the second track; the extension of the first track The direction intersects the extending direction of the second rail.
  7. 根据权利要求6所述的终端设备,其中,The terminal device according to claim 6, wherein,
    所述第一轨道的延伸方向与第二轨道的延伸方向垂直;The extending direction of the first track is perpendicular to the extending direction of the second track;
    所述第一轨道的延伸方向和第二轨道的延伸方向均平行于所述显示面板的显示面。Both the extending direction of the first track and the extending direction of the second track are parallel to the display surface of the display panel.
  8. 根据权利要求1所述的终端设备,其中,The terminal device according to claim 1, wherein,
    所述驱动单元配置为驱动图像采集单元在预设范围内运动,所述预设范围对应显示面板的显示面的部分区域。The driving unit is configured to drive the image acquisition unit to move within a preset range, and the preset range corresponds to a partial area of the display surface of the display panel.
  9. 根据权利要求8所述的终端设备,其中,The terminal device according to claim 8, wherein,
    所述图像采集单元的数量为多个,每个所述图像采集单元具有对应的驱动单元,每个所述驱动单元具有对应的预设范围,不同预设范围有至少部分不重合;The number of the image acquisition units is multiple, each of the image acquisition units has a corresponding drive unit, each of the drive units has a corresponding preset range, and different preset ranges are at least partially non-overlapping;
    所述驱动图像采集单元移动至视点位置包括:确定所述视点位置所在的至少一个预设范围,使该预设范围对应的驱动单元将对应的图像采集单元移动至视点位置。The driving the image acquisition unit to move to the viewpoint position includes: determining at least one preset range where the viewpoint position is located, and making the drive unit corresponding to the preset range move the corresponding image acquisition unit to the viewpoint position.
  10. 一种视频通话的方法,其用于权利要求1至9中任意一项所述的终端设备,所述方法包括:A method for video calling, which is used for the terminal device described in any one of claims 1 to 9, the method comprising:
    所述收发单元接收对端图像;The transceiver unit receives the image of the opposite end;
    所述显示面板显示对端图像;The display panel displays the image of the opposite end;
    所述分析单元确定显示面板显示的对端图像中的视点位置;The analysis unit determines the viewpoint position in the peer image displayed on the display panel;
    所述驱动单元驱动图像采集单元移动至视点位置;The drive unit drives the image acquisition unit to move to the viewpoint position;
    所述图像采集单元在视点位置沿显示面板的出光方向采集本端图像;The image acquisition unit acquires the local image at the viewpoint position along the light emitting direction of the display panel;
    所述收发单元发送本端图像。The transceiver unit sends the local image.
  11. 根据权利要求10所述的方法,其中,The method of claim 10, wherein,
    所述收发单元接收对端图像包括:所述收发单元接收对端视频流,从所述对端视频流中获取对端图像;The receiving the image of the opposite end by the transceiver unit includes: receiving the video stream of the opposite end by the transceiver unit, and obtaining the image of the opposite end from the video stream of the opposite end;
    所述收发单元发送本端图像包括:所述收发单元用本端图像形成本端视频流,并发送所述本端视频流。Sending the local image by the transceiver unit includes: forming a local video stream by the transceiver unit using the local image, and sending the local video stream.
PCT/CN2022/114748 2021-09-15 2022-08-25 Terminal device and video call method WO2023040616A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN202111077903.6 2021-09-15
CN202111077903.6A CN115834813A (en) 2021-09-15 2021-09-15 Terminal device and video call method

Publications (1)

Publication Number Publication Date
WO2023040616A1 true WO2023040616A1 (en) 2023-03-23

Family

ID=85514932

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2022/114748 WO2023040616A1 (en) 2021-09-15 2022-08-25 Terminal device and video call method

Country Status (2)

Country Link
CN (1) CN115834813A (en)
WO (1) WO2023040616A1 (en)

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20090179984A1 (en) * 2008-01-10 2009-07-16 Liang-Gee Chen Image Rectification Method and Related Device for a Video Device
CN105094307A (en) * 2014-05-23 2015-11-25 宇龙计算机通信科技(深圳)有限公司 Mobile equipment with front-facing camera
CN205378040U (en) * 2016-02-26 2016-07-06 彭昌兰 Mobile device with dollying head
US20190110023A1 (en) * 2016-05-18 2019-04-11 Sony Corporation Information processing apparatus, information processing method, and program
US20210104063A1 (en) * 2019-10-03 2021-04-08 Facebook Technologies, Llc Systems and methods for video communication using a virtual camera
CN113038111A (en) * 2016-07-18 2021-06-25 苹果公司 Methods, systems, and media for image capture and processing

Patent Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20090179984A1 (en) * 2008-01-10 2009-07-16 Liang-Gee Chen Image Rectification Method and Related Device for a Video Device
CN105094307A (en) * 2014-05-23 2015-11-25 宇龙计算机通信科技(深圳)有限公司 Mobile equipment with front-facing camera
CN205378040U (en) * 2016-02-26 2016-07-06 彭昌兰 Mobile device with dollying head
US20190110023A1 (en) * 2016-05-18 2019-04-11 Sony Corporation Information processing apparatus, information processing method, and program
CN113038111A (en) * 2016-07-18 2021-06-25 苹果公司 Methods, systems, and media for image capture and processing
US20210104063A1 (en) * 2019-10-03 2021-04-08 Facebook Technologies, Llc Systems and methods for video communication using a virtual camera

Also Published As

Publication number Publication date
CN115834813A (en) 2023-03-21

Similar Documents

Publication Publication Date Title
US7227567B1 (en) Customizable background for video communications
US9445045B2 (en) Video conferencing device for a communications device and method of manufacturing and using the same
US8253770B2 (en) Residential video communication system
US8154583B2 (en) Eye gazing imaging for video communications
US8154578B2 (en) Multi-camera residential communication system
US8159519B2 (en) Personal controls for personal video communications
JP5836768B2 (en) Display device with imaging device
US20100118112A1 (en) Group table top videoconferencing device
US20080297588A1 (en) Managing scene transitions for video communication
US20020027597A1 (en) System for mobile videoconferencing
US20100103244A1 (en) device for and method of processing image data representative of an object
TW200307460A (en) Data processing device, data processing system and method for displaying conversation parties
WO2018014534A1 (en) Intelligent glasses, and photographing and display apparatus
US9088693B2 (en) Providing direct eye contact videoconferencing
JP4475579B2 (en) Video communication apparatus and video communication apparatus control method
JP2014049797A (en) Display device with camera
JP2012213013A (en) Tv conference system
US7986336B2 (en) Image capture apparatus with indicator
WO2023040616A1 (en) Terminal device and video call method
JP2006054830A (en) Image compression communication method and device
JP2009147792A (en) Communication apparatus with image, communication display method with image, program and communication system with image
JPH1075432A (en) Stereoscopic video telephone set
US11310465B1 (en) Video conference teminal and system there of
JP2007251778A (en) Image input-output apparatus
JP2006246079A (en) Communication device

Legal Events

Date Code Title Description
NENP Non-entry into the national phase

Ref country code: DE