WO2019227905A1 - Method and equipment for performing remote assistance on the basis of augmented reality - Google Patents

Method and equipment for performing remote assistance on the basis of augmented reality

Info

Publication number
WO2019227905A1
Authority
WO
WIPO (PCT)
Prior art keywords
information
target object
user equipment
video
video information
Application number
PCT/CN2018/121729
Other languages
French (fr)
Chinese (zh)
Inventor
廖春元
唐荣兴
Original Assignee
亮风台(上海)信息科技有限公司
Application filed by 亮风台(上海)信息科技有限公司
Publication of WO2019227905A1

Classifications

    • H: ELECTRICITY
    • H04: ELECTRIC COMMUNICATION TECHNIQUE
    • H04N: PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N23/00: Cameras or camera modules comprising electronic image sensors; Control thereof
    • H04N23/60: Control of cameras or camera modules
    • H: ELECTRICITY
    • H04: ELECTRIC COMMUNICATION TECHNIQUE
    • H04L: TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L67/00: Network arrangements or protocols for supporting network services or applications
    • H04L67/01: Protocols
    • H04L67/131: Protocols for games, networked simulations or virtual reality
    • H: ELECTRICITY
    • H04: ELECTRIC COMMUNICATION TECHNIQUE
    • H04N: PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N5/00: Details of television systems
    • H04N5/222: Studio circuitry; Studio devices; Studio equipment
    • H04N5/262: Studio circuits, e.g. for mixing, switching-over, change of character of image, other special effects; Cameras specially adapted for the electronic generation of special effects

Abstract

The present application provides a method for remote assistance based on augmented reality. The method comprises: capturing video information about a target object in real time by means of a camera device in a first user equipment; determining, by performing a target tracking operation on the target object in the video information, transfer matrix information corresponding to the target object in each video frame of the video information; and superimposing and displaying corresponding tag information on the target object according to the transfer matrix information, wherein the tag information comprises operation instruction information of a second user for the target object, sent by a corresponding second user equipment. Using augmented reality technology, the first user equipment superimposes the tag information and similar content sent by the second user equipment on the current video information, so that the second user can guide the first user remotely in real time. The method is applicable to a wide range of fields, such as family supervision and guidance in everyday life, industry, medical treatment and education.

Description

Method and equipment for remote assistance based on augmented reality
This application claims priority to CN 201810533512.2.
Technical Field
The present application relates to the field of computers, and in particular to a technology for remote assistance based on augmented reality.
Background
Augmented reality (AR) is a new kind of human-computer interaction technology. Using cameras, gyroscopes, acceleration sensors and similar components, it matches three-dimensional points in space with two-dimensional points in images in real time, uses the matched point pairs for tracking, and computes the position and orientation of the camera in space. It then uses this information to superimpose the real environment and virtual objects onto the same picture or space in real time, so that the virtual and the real coexist. Through an augmented reality system, users can perceive augmented information that does not originally exist in the objective physical world, such as virtual navigation arrows and virtual game characters. AR can also overcome limits of time, space and other objective constraints, using virtual information to greatly enhance users' understanding of, and interaction with, the real world.
Summary of the Invention
An object of the present application is to provide a method and equipment for remote assistance based on augmented reality.
According to one aspect of the present application, a method for remote assistance based on augmented reality at a first user equipment is provided, the method comprising:
capturing video information about a target object in real time by means of a camera device in the first user equipment;
determining, by performing a target tracking operation on the target object in the video information, transfer matrix information corresponding to the target object in each video frame of the video information;
superimposing and displaying corresponding tag information on the target object according to the transfer matrix information, wherein the tag information comprises operation instruction information of a second user for the target object, sent by a corresponding second user equipment.
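The overlay step above can be illustrated with a minimal sketch. The text here does not specify the form of the transfer matrix information, so the sketch assumes a 3x3 planar homography that maps coordinates from the frame in which the second user drew the tag into the current frame. The names `apply_transfer_matrix` and `place_tag` and the sample values are hypothetical and are not taken from the application.

```python
from typing import List, Sequence, Tuple

Point = Tuple[float, float]
# Assumption: the transfer matrix is a row-major 3x3 planar homography.
Matrix3 = Sequence[Sequence[float]]


def apply_transfer_matrix(h: Matrix3, p: Point) -> Point:
    """Map a point from the reference frame into the current frame."""
    x, y = p
    u = h[0][0] * x + h[0][1] * y + h[0][2]
    v = h[1][0] * x + h[1][1] * y + h[1][2]
    w = h[2][0] * x + h[2][1] * y + h[2][2]
    if abs(w) < 1e-12:
        raise ValueError("degenerate transfer matrix for this point")
    # Divide by the homogeneous coordinate to get pixel coordinates.
    return (u / w, v / w)


def place_tag(h: Matrix3, tag_outline: List[Point]) -> List[Point]:
    """Re-position every vertex of a tag so it stays on the tracked object."""
    return [apply_transfer_matrix(h, p) for p in tag_outline]


# Hypothetical per-frame matrix: the target moved 10 px right and 5 px down.
translation = [[1.0, 0.0, 10.0], [0.0, 1.0, 5.0], [0.0, 0.0, 1.0]]
arrow = [(100.0, 50.0), (120.0, 50.0)]
print(place_tag(translation, arrow))  # [(110.0, 55.0), (130.0, 55.0)]
```

Because the tag is stored once in reference coordinates and re-projected for every frame, it follows the target object as the camera or the object moves.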
According to another aspect of the present application, a method for remote assistance based on augmented reality at a second user equipment is provided, the method comprising:
receiving video information about a target object, sent by a corresponding first user equipment and captured in real time by a camera device in the first user equipment;
presenting the video information, and keeping corresponding tag information superimposed and displayed on the target object in each video frame of the video information, wherein the tag information comprises operation instruction information of a second user for the target object, provided through the second user equipment.
According to yet another aspect of the present application, a method for remote assistance based on augmented reality at a first user equipment is provided, the method comprising:
capturing video information about a first target object in real time by means of a camera device in the first user equipment;
sending the video information to a corresponding network device;
receiving first transfer matrix information, sent by the network device, corresponding to the first target object in each video frame of the video information;
superimposing and displaying corresponding first tag information on the first target object according to the first transfer matrix information, wherein the first tag information comprises operation instruction information of a second user for the first target object, sent by a corresponding second user equipment.
According to yet another aspect of the present application, a method for remote assistance based on augmented reality at a network device is provided, the method comprising:
receiving video information about a first target object sent by a first user equipment, wherein the video information is captured in real time by a camera device in the first user equipment;
determining, by performing a target tracking operation on the first target object in the video information, first transfer matrix information corresponding to the first target object in each video frame of the video information;
sending the first transfer matrix information to the first user equipment;
sending the video information and the first transfer matrix information to a second user equipment that belongs to the same remote assistance task as the first user equipment.
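The network-device steps above amount to tracking once and fanning the result out to two recipients with different payloads. The following is a minimal sketch under stated assumptions: the `Frame` and `Message` types, the `route_frame` function and the pluggable `track` callback are all hypothetical, and the actual tracking algorithm and transport are not described in this passage.

```python
from dataclasses import dataclass
from typing import Callable, List, Optional, Sequence

Matrix3 = Sequence[Sequence[float]]  # assumed 3x3 transfer matrix


@dataclass
class Frame:
    index: int
    pixels: bytes


@dataclass
class Message:
    recipient: str
    frame_index: int
    frame: Optional[Frame]  # None when the recipient already has the video
    transfer_matrix: Matrix3


def route_frame(
    frame: Frame,
    track: Callable[[Frame], Matrix3],
    first_ue: str,
    second_ue: str,
) -> List[Message]:
    """Track the target once, then build one message per recipient."""
    matrix = track(frame)
    return [
        # The first user equipment captured the video itself, so it only
        # needs the matrix.
        Message(first_ue, frame.index, None, matrix),
        # The second user equipment needs both the video and the matrix.
        Message(second_ue, frame.index, frame, matrix),
    ]


IDENTITY = [[1.0, 0.0, 0.0], [0.0, 1.0, 0.0], [0.0, 0.0, 1.0]]
messages = route_frame(Frame(0, b""), lambda f: IDENTITY, "ue-1", "ue-2")
print([(m.recipient, m.frame is not None) for m in messages])
```

Running the tracker on the network device keeps the per-frame computation off the first user equipment, at the cost of one extra network round trip before the tag can be drawn.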
According to yet another aspect of the present application, a method for remote assistance based on augmented reality at a third user equipment is provided, the method comprising:
receiving, from a corresponding network device, video information about a third target object and third transfer matrix information corresponding to the third target object in each video frame of the video information;
presenting the video information and, according to the third transfer matrix information, superimposing and displaying corresponding third tag information on the third target object in each video frame of the video information, wherein the third tag information comprises operation instruction information of a second user for the third target object, provided through a second user equipment;
wherein the video information is captured in real time by a camera device in a first user equipment, the first user equipment, the third user equipment and the second user equipment belong to the same remote assistance task, and the first user equipment and the third user equipment each receive remote assistance from the second user equipment.
According to yet another aspect of the present application, a method for remote assistance based on augmented reality at a second user equipment is provided, the method comprising:
receiving, from a corresponding network device, video information about a first target object and first transfer matrix information corresponding to the first target object in each video frame of the video information;
presenting the video information and, according to the first transfer matrix information, superimposing and displaying corresponding first tag information on the first target object in each video frame of the video information, wherein the first tag information comprises operation instruction information of a second user for the first target object, provided through the second user equipment;
wherein the video information is captured in real time by a camera device in a first user equipment that belongs to the same remote assistance task as the second user equipment, or is reconstructed from real-time video information about the first target object captured by the camera device together with other video information of the first target object.
According to yet another aspect of the present application, a method for remote assistance based on augmented reality at a network device is provided, the method comprising:
receiving video information about a target object sent by a first user equipment, wherein the video information comprises video captured by a camera device in the first user equipment;
determining, by performing a target tracking operation on the target object in the video information, transfer matrix information corresponding to the target object in each video frame of the video information;
adding corresponding tag information to each video frame of the video information according to the transfer matrix information, wherein the tag information remains superimposed on the target object in each video frame of the video information, and the tag information comprises operation instruction information of a second user for the target object, sent by a corresponding second user equipment;
sending the edited video information to the first user equipment and to a second user equipment that belongs to the same remote assistance task as the first user equipment.
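In this variant the network device writes the tag into the frames itself, so both user equipments receive video that already contains the tag. A minimal sketch, assuming a grayscale frame stored as a list of pixel rows and a 3x3 homography as the transfer matrix; the function `add_tag_to_frame` and the way a tag is drawn (one pixel per tag point) are illustrative simplifications, not the method of the application.

```python
from typing import List, Sequence, Tuple

Matrix3 = Sequence[Sequence[float]]  # assumed 3x3 transfer matrix
GrayFrame = List[List[int]]          # rows of pixel intensities, 0..255


def add_tag_to_frame(
    frame: GrayFrame,
    h: Matrix3,
    tag_points: Sequence[Tuple[float, float]],
    value: int = 255,
) -> GrayFrame:
    """Return a copy of the frame with the tag points drawn on the target."""
    edited = [row[:] for row in frame]  # leave the original frame untouched
    height = len(edited)
    width = len(edited[0]) if edited else 0
    for x, y in tag_points:
        w = h[2][0] * x + h[2][1] * y + h[2][2]
        if abs(w) < 1e-12:
            continue  # point cannot be projected into this frame
        u = round((h[0][0] * x + h[0][1] * y + h[0][2]) / w)
        v = round((h[1][0] * x + h[1][1] * y + h[1][2]) / w)
        if 0 <= u < width and 0 <= v < height:
            edited[v][u] = value  # skip points that fall outside the frame
    return edited


blank = [[0] * 4 for _ in range(4)]
shift = [[1.0, 0.0, 1.0], [0.0, 1.0, 2.0], [0.0, 0.0, 1.0]]  # +1 px x, +2 px y
edited_frame = add_tag_to_frame(blank, shift, [(0.0, 0.0)])
print(edited_frame[2][1], blank[2][1])
```

Editing on the network device means the receiving devices need no overlay logic of their own, but the tag can no longer be toggled or restyled on the client.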
According to one aspect of the present application, a method for remote assistance based on augmented reality is provided, the method comprising:
a first user equipment captures video information about a target object in real time by means of a camera device in the first user equipment, determines, by performing a target tracking operation on the target object in the video information, transfer matrix information corresponding to the target object in each video frame of the video information, and superimposes and displays corresponding tag information on the target object according to the transfer matrix information, wherein the tag information comprises operation instruction information of a second user for the target object, sent by a corresponding second user equipment;
the first user equipment sends the video information to the second user equipment;
the second user equipment receives and presents the video information, and keeps corresponding tag information superimposed and displayed on the target object in each video frame of the video information, wherein the tag information comprises operation instruction information of the second user for the target object, provided through the second user equipment.
According to another aspect of the present application, a method for remote assistance based on augmented reality is provided, the method comprising:
a first user equipment captures video information about a first target object in real time by means of a camera device in the first user equipment, and sends the video information to a corresponding network device;
the network device receives the video information, determines, by performing a target tracking operation on the first target object in the video information, first transfer matrix information corresponding to the first target object in each video frame of the video information, sends the first transfer matrix information to the first user equipment, and sends the video information and the first transfer matrix information to a second user equipment that belongs to the same remote assistance task as the first user equipment;
the first user equipment receives the first transfer matrix information and, according to the first transfer matrix information, superimposes and displays corresponding first tag information on the first target object, wherein the first tag information comprises operation instruction information of a second user for the first target object, sent by the corresponding second user equipment;
the second user equipment receives the video information and the first transfer matrix information, presents the video information and, according to the first transfer matrix information, superimposes and displays the corresponding first tag information on the first target object in each video frame of the video information, wherein the video information is captured in real time by the camera device in the first user equipment that belongs to the same remote assistance task as the second user equipment, or is reconstructed from real-time video information about the first target object captured by the camera device together with other video information of the first target object.
According to yet another aspect of the present application, a method for remote assistance based on augmented reality is provided, the method comprising:
a first user equipment captures video information about a first target object in real time by means of a camera device in the first user equipment, and sends the video information to a corresponding network device;
the network device receives the video information, determines, by performing a target tracking operation on the first target object in the video information, first transfer matrix information corresponding to the first target object in each video frame of the video information, and sends the first transfer matrix information to the first user equipment;
the first user equipment receives the first transfer matrix information and, according to the first transfer matrix information, superimposes and displays corresponding first tag information on the first target object, wherein the first tag information comprises operation instruction information of a second user for the first target object, sent by a corresponding second user equipment;
the network device determines, by performing a target tracking operation on a third target object in the video information, third transfer matrix information corresponding to the third target object in each video frame of the video information, wherein the third target object and the first target object belong to the same remote assistance task;
the network device sends the video information and the third transfer matrix information to a third user equipment that corresponds to the third target object in the remote assistance task, and sends the video information, the first transfer matrix information and the third transfer matrix information to the second user equipment that belongs to the same remote assistance task as the first user equipment;
the third user equipment receives the video information and the third transfer matrix information;
the third user equipment presents the video information and, according to the third transfer matrix information, superimposes and displays corresponding third tag information on the third target object in each video frame of the video information;
the second user equipment receives the video information, the first transfer matrix information and the third transfer matrix information and, while presenting the video information, superimposes and displays the corresponding first tag information on the first target object in each video frame of the video information according to the first transfer matrix information, and superimposes and displays the corresponding third tag information on the third target object in each video frame of the video information according to the third transfer matrix information.
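The multi-target flow above differs from the single-target one mainly in which device receives which matrices. A minimal sketch of that delivery plan follows; the function `plan_deliveries`, the role keys and the dictionary layout are hypothetical and serve only to make the routing explicit.

```python
from typing import Dict, Sequence

Matrix3 = Sequence[Sequence[float]]  # assumed 3x3 transfer matrix


def plan_deliveries(matrices: Dict[str, Matrix3]) -> Dict[str, dict]:
    """Decide, per device, whether to send video and which matrices to send.

    `matrices` maps a target name ("first", "third") to its per-frame
    transfer matrix as computed by the network device.
    """
    return {
        # Captured the video itself; only needs the matrix of its own target.
        "first_ue": {"video": False, "matrices": {"first": matrices["first"]}},
        # Assisted on the third target; needs the video and that one matrix.
        "third_ue": {"video": True, "matrices": {"third": matrices["third"]}},
        # The assisting side sees the video and every tracked target.
        "second_ue": {"video": True, "matrices": dict(matrices)},
    }


IDENTITY = [[1.0, 0.0, 0.0], [0.0, 1.0, 0.0], [0.0, 0.0, 1.0]]
plan = plan_deliveries({"first": IDENTITY, "third": IDENTITY})
print(sorted(plan["second_ue"]["matrices"]))  # ['first', 'third']
```

Sending each assisted device only the matrix for its own target keeps its display limited to the instructions that concern it, while the second user equipment can annotate both targets in one view.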
According to one aspect of the present application, a first user equipment for remote assistance based on augmented reality is provided, the equipment comprising:
a real-time capture module, configured to capture video information about a target object in real time by means of a camera device in the first user equipment;
a target tracking module, configured to determine, by performing a target tracking operation on the target object in the video information, transfer matrix information corresponding to the target object in each video frame of the video information;
an overlay display module, configured to superimpose and display corresponding tag information on the target object according to the transfer matrix information, wherein the tag information comprises operation instruction information of a second user for the target object, sent by a corresponding second user equipment.
According to another aspect of the present application, a second user equipment for remote assistance based on augmented reality is provided, the equipment comprising:
a video receiving module, configured to receive video information about a target object, sent by a corresponding first user equipment and captured in real time by a camera device in the first user equipment;
a video presentation module, configured to present the video information and keep corresponding tag information superimposed and displayed on the target object in each video frame of the video information, wherein the tag information comprises operation instruction information of a second user for the target object, provided through the second user equipment.
According to yet another aspect of the present application, a first user equipment for remote assistance based on augmented reality is provided, the equipment comprising:
a real-time capture module, configured to capture video information about a first target object in real time by means of a camera device in the first user equipment;
a video sending module, configured to send the video information to a corresponding network device;
a transfer matrix receiving module, configured to receive first transfer matrix information, sent by the network device, corresponding to the first target object in each video frame of the video information;
an overlay display module, configured to superimpose and display corresponding first tag information on the first target object according to the first transfer matrix information, wherein the first tag information comprises operation instruction information of a second user for the first target object, sent by a corresponding second user equipment.
According to yet another aspect of the present application, a network device for remote assistance based on augmented reality is provided, the device comprising:
a video receiving module, configured to receive video information about a first target object sent by a first user equipment, wherein the video information is captured in real time by a camera device in the first user equipment;
a target tracking module, configured to determine, by performing a target tracking operation on the first target object in the video information, first transfer matrix information corresponding to the first target object in each video frame of the video information;
a first sending module, configured to send the first transfer matrix information to the first user equipment;
a second sending module, configured to send the video information and the first transfer matrix information to a second user equipment that belongs to the same remote assistance task as the first user equipment.
According to yet another aspect of the present application, a third user equipment for remote assistance based on augmented reality is provided, the equipment comprising:
a receiving module, configured to receive, from a corresponding network device, video information about a third target object and third transfer matrix information corresponding to the third target object in each video frame of the video information;
a presentation module, configured to present the video information and, according to the third transfer matrix information, superimpose and display corresponding third tag information on the third target object in each video frame of the video information, wherein the third tag information comprises operation instruction information of a second user for the third target object, provided through a second user equipment;
wherein the video information is captured in real time by a camera device in a first user equipment, the first user equipment, the third user equipment and the second user equipment belong to the same remote assistance task, and the first user equipment and the third user equipment each receive remote assistance from the second user equipment.
According to yet another aspect of the present application, a second user equipment for remote assistance based on augmented reality is provided, the equipment comprising:
a receiving module, configured to receive, from a corresponding network device, video information about a first target object and first transfer matrix information corresponding to the first target object in each video frame of the video information;
a presentation module, configured to present the video information and, according to the first transfer matrix information, superimpose and display corresponding first tag information on the first target object in each video frame of the video information, wherein the first tag information comprises operation instruction information of a second user for the first target object, provided through the second user equipment;
wherein the video information is captured in real time by a camera device in a first user equipment that belongs to the same remote assistance task as the second user equipment, or is reconstructed from real-time video information about the first target object captured by the camera device together with other video information of the first target object.
According to yet another aspect of the present application, a network device for remote assistance based on augmented reality is provided, the device comprising:
a video receiving module, configured to receive video information about a target object sent by a first user equipment, wherein the video information comprises video captured by a camera device in the first user equipment;
a target tracking module, configured to determine, by performing a target tracking operation on the target object in the video information, transfer matrix information corresponding to the target object in each video frame of the video information;
a tag adding module, configured to add corresponding tag information to each video frame of the video information according to the transfer matrix information, wherein the tag information remains superimposed on the target object in each video frame of the video information, and the tag information comprises operation instruction information of a second user for the target object, sent by a corresponding second user equipment;
a video sending module, configured to send the edited video information to the first user equipment and to a second user equipment that belongs to the same remote assistance task as the first user equipment.
According to one aspect of the present application, a system for remote assistance based on augmented reality is provided, the system comprising the first user equipment described above that comprises the real-time capture module, the target tracking module and the overlay display module, and the second user equipment described above that comprises the video receiving module and the video presentation module.
According to one aspect of the present application, a system for remote assistance based on augmented reality is provided, the system comprising the first user equipment described above that comprises the real-time capture module, the video sending module, the transfer matrix receiving module and the overlay display module, the second user equipment described above that comprises the receiving module and the presentation module, and the network device described above that comprises the video receiving module, the target tracking module, the first sending module and the second sending module.
According to one aspect of the present application, a system for remote assistance based on augmented reality is provided, the system comprising the first user equipment described above that comprises the real-time capture module, the video sending module, the transfer matrix receiving module and the overlay display module, the second user equipment described above that comprises the receiving module and the presentation module, the third user equipment described above that comprises the receiving module and the presentation module, and the network device described above that comprises the video receiving module, the target tracking module, the first sending module and the second sending module.
According to one aspect of the present application, a first user equipment for remote assistance based on augmented reality is provided, wherein the equipment comprises:
a processor; and
a memory arranged to store computer-executable instructions that, when executed, cause the processor to:
capture video information about a target object in real time through a camera device in the first user equipment;
determine transfer matrix information corresponding to the target object in each video frame of the video information by performing a target tracking operation on the target object in the video information;
superimpose and display corresponding marker information on the target object according to the transfer matrix information, wherein the marker information includes operation instruction information of a second user on the target object, sent by a corresponding second user equipment.
According to another aspect of the present application, a second user equipment for remote assistance based on augmented reality is provided, wherein the equipment comprises:
a processor; and
a memory arranged to store computer-executable instructions that, when executed, cause the processor to:
receive video information about a target object, sent by a corresponding first user equipment and captured in real time through a camera device in the first user equipment;
present the video information, and keep corresponding marker information superimposed on the target object in each video frame of the video information, wherein the marker information includes operation instruction information of a second user on the target object, given through the second user equipment.
According to yet another aspect of the present application, a first user equipment for remote assistance based on augmented reality is provided, wherein the equipment comprises:
a processor; and
a memory arranged to store computer-executable instructions that, when executed, cause the processor to:
capture video information about a first target object in real time through a camera device in the first user equipment;
send the video information to a corresponding network device;
receive first transfer matrix information, sent by the network device, corresponding to the first target object in each video frame of the video information;
superimpose and display corresponding first marker information on the first target object according to the first transfer matrix information, wherein the first marker information includes operation instruction information of a second user on the first target object, sent by a corresponding second user equipment.
According to yet another aspect of the present application, a network device for remote assistance based on augmented reality is provided, wherein the device comprises:
a processor; and
a memory arranged to store computer-executable instructions that, when executed, cause the processor to:
receive video information about a first target object sent by a first user equipment, wherein the video information is captured in real time through a camera device in the first user equipment;
determine first transfer matrix information corresponding to the first target object in each video frame of the video information by performing a target tracking operation on the first target object in the video information;
send the first transfer matrix information to the first user equipment;
send the video information and the first transfer matrix information to a second user equipment that belongs to the same remote assistance task as the first user equipment.
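The network device's four operations above can be illustrated with a minimal sketch. Everything named here is hypothetical and not taken from the application: the `Frame` and `Outbox` types, the `relay_frame` function, the payload keys, and the representation of the transfer matrix information as a 3x3 list are assumptions made only to show the routing (matrix to the first user equipment, video plus matrix to the second user equipment).

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, List, Tuple

Matrix = List[List[float]]  # assumed representation of transfer matrix information


@dataclass
class Frame:
    index: int     # position of the frame in the video stream
    pixels: bytes  # encoded image data, treated as opaque here


@dataclass
class Outbox:
    """Hypothetical stand-in for the network layer; records what would be sent."""
    sent: List[Tuple[str, Dict]] = field(default_factory=list)

    def send(self, device_id: str, payload: Dict) -> None:
        self.sent.append((device_id, payload))


def relay_frame(frame: Frame, track: Callable[[Frame], Matrix],
                first_ue: str, second_ue: str, outbox: Outbox) -> Matrix:
    """Track the target in one received frame and route the results."""
    matrix = track(frame)  # target tracking operation (any tracker can be plugged in)
    # The first user equipment already has the frame, so it only needs the matrix.
    outbox.send(first_ue, {"frame_index": frame.index, "transfer_matrix": matrix})
    # The second user equipment needs both the frame and the matrix.
    outbox.send(second_ue, {"frame_index": frame.index, "pixels": frame.pixels,
                            "transfer_matrix": matrix})
    return matrix


def identity_tracker(frame: Frame) -> Matrix:
    """Placeholder tracker: reports that the target has not moved."""
    return [[1.0, 0.0, 0.0], [0.0, 1.0, 0.0], [0.0, 0.0, 1.0]]


outbox = Outbox()
relay_frame(Frame(0, b"\x00"), identity_tracker, "ue-1", "ue-2", outbox)
```

Passing the tracker in as a callable keeps the routing independent of whichever target tracking algorithm is chosen.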
According to yet another aspect of the present application, a third user equipment for remote assistance based on augmented reality is provided, wherein the equipment comprises:
a processor; and
a memory arranged to store computer-executable instructions that, when executed, cause the processor to:
receive, from a corresponding network device, video information about a third target object and third transfer matrix information corresponding to the third target object in each video frame of the video information;
present the video information and, according to the third transfer matrix information, superimpose and display corresponding third marker information on the third target object in each video frame of the video information, wherein the third marker information includes operation instruction information of a second user on the third target object, given through a second user equipment;
wherein the video information is captured in real time through a camera device in a first user equipment, the first user equipment, the third user equipment, and the second user equipment belong to the same remote assistance task, and the first user equipment and the third user equipment each receive remote assistance from the second user equipment.
According to yet another aspect of the present application, a second user equipment for remote assistance based on augmented reality is provided, wherein the equipment comprises:
a processor; and
a memory arranged to store computer-executable instructions that, when executed, cause the processor to:
receive, from a corresponding network device, video information about a first target object and first transfer matrix information corresponding to the first target object in each video frame of the video information;
present the video information and, according to the first transfer matrix information, superimpose and display corresponding first marker information on the first target object in each video frame of the video information, wherein the first marker information includes operation instruction information of a second user on the first target object, given through the second user equipment;
wherein the video information is captured in real time through a camera device in a first user equipment that belongs to the same remote assistance task as the second user equipment, or is reconstructed from real-time video information about the first target object captured by the camera device and other video information of the first target object.
According to yet another aspect of the present application, a network device for remote assistance based on augmented reality is provided, wherein the device comprises:
a processor; and
a memory arranged to store computer-executable instructions that, when executed, cause the processor to:
receive video information about a target object sent by a first user equipment, wherein the video information includes video information captured through a camera device in the first user equipment;
determine transfer matrix information corresponding to the target object in each video frame of the video information by performing a target tracking operation on the target object in the video information;
add corresponding marker information to each video frame in the video information according to the transfer matrix information, wherein the marker information remains superimposed on the target object in each video frame of the video information, and the marker information includes operation instruction information of a second user on the target object, sent by a corresponding second user equipment;
send the edited video information to the first user equipment and to a second user equipment that belongs to the same remote assistance task as the first user equipment.
According to one aspect of the present application, a computer-readable medium including instructions is provided, the instructions, when executed, causing a system to:
capture video information about a target object in real time through a camera device in the first user equipment;
determine transfer matrix information corresponding to the target object in each video frame of the video information by performing a target tracking operation on the target object in the video information;
superimpose and display corresponding marker information on the target object according to the transfer matrix information, wherein the marker information includes operation instruction information of a second user on the target object, sent by a corresponding second user equipment.
According to another aspect of the present application, a computer-readable medium including instructions is provided, the instructions, when executed, causing a system to:
receive video information about a target object, sent by a corresponding first user equipment and captured in real time through a camera device in the first user equipment;
present the video information, and keep corresponding marker information superimposed on the target object in each video frame of the video information, wherein the marker information includes operation instruction information of a second user on the target object, given through the second user equipment.
According to yet another aspect of the present application, a computer-readable medium including instructions is provided, the instructions, when executed, causing a system to:
capture video information about a first target object in real time through a camera device in the first user equipment;
send the video information to a corresponding network device;
receive first transfer matrix information, sent by the network device, corresponding to the first target object in each video frame of the video information;
superimpose and display corresponding first marker information on the first target object according to the first transfer matrix information, wherein the first marker information includes operation instruction information of a second user on the first target object, sent by a corresponding second user equipment.
According to yet another aspect of the present application, a computer-readable medium including instructions is provided, the instructions, when executed, causing a system to:
receive video information about a first target object sent by a first user equipment, wherein the video information is captured in real time through a camera device in the first user equipment;
determine first transfer matrix information corresponding to the first target object in each video frame of the video information by performing a target tracking operation on the first target object in the video information;
send the first transfer matrix information to the first user equipment;
send the video information and the first transfer matrix information to a second user equipment that belongs to the same remote assistance task as the first user equipment.
According to yet another aspect of the present application, a computer-readable medium including instructions is provided, the instructions, when executed, causing a system to:
receive, from a corresponding network device, video information about a third target object and third transfer matrix information corresponding to the third target object in each video frame of the video information;
present the video information and, according to the third transfer matrix information, superimpose and display corresponding third marker information on the third target object in each video frame of the video information, wherein the third marker information includes operation instruction information of a second user on the third target object, given through a second user equipment;
wherein the video information is captured in real time through a camera device in a first user equipment, the first user equipment, the third user equipment, and the second user equipment belong to the same remote assistance task, and the first user equipment and the third user equipment each receive remote assistance from the second user equipment.
According to yet another aspect of the present application, a computer-readable medium including instructions is provided, the instructions, when executed, causing a system to:
receive, from a corresponding network device, video information about a first target object and first transfer matrix information corresponding to the first target object in each video frame of the video information;
present the video information and, according to the first transfer matrix information, superimpose and display corresponding first marker information on the first target object in each video frame of the video information, wherein the first marker information includes operation instruction information of a second user on the first target object, given through the second user equipment;
wherein the video information is captured in real time through a camera device in a first user equipment that belongs to the same remote assistance task as the second user equipment, or is reconstructed from real-time video information about the first target object captured by the camera device and other video information of the first target object.
According to yet another aspect of the present application, a computer-readable medium including instructions is provided, the instructions, when executed, causing a system to:
receive video information about a target object sent by a first user equipment, wherein the video information includes video information captured through a camera device in the first user equipment;
determine transfer matrix information corresponding to the target object in each video frame of the video information by performing a target tracking operation on the target object in the video information;
add corresponding marker information to each video frame in the video information according to the transfer matrix information, wherein the marker information remains superimposed on the target object in each video frame of the video information, and the marker information includes operation instruction information of a second user on the target object, sent by a corresponding second user equipment;
send the edited video information to the first user equipment and to a second user equipment that belongs to the same remote assistance task as the first user equipment.
Compared with the prior art, the present application is based on augmented reality technology: once a communication connection has been established between the first user equipment and the second user equipment, the first user equipment superimposes the marker information and the like sent by the second user equipment on the current video information, enabling the second user to direct the first user remotely in real time. The approach can be applied widely, in household supervision and guidance in daily life as well as in industry, medicine, education, and other fields; it improves the efficiency of communication between people and greatly improves the user experience.
BRIEF DESCRIPTION OF THE DRAWINGS
Other features, objects, and advantages of the present application will become more apparent upon reading the detailed description of non-limiting embodiments given with reference to the following drawings:
FIG. 1 shows a system topology diagram for remote assistance based on augmented reality according to one aspect of the present application;
FIG. 2 shows a flowchart of a method for remote assistance based on augmented reality at a first user equipment according to one embodiment of the present application;
FIG. 3 shows an example of camera control during remote assistance based on augmented reality according to one embodiment of the present application;
FIG. 4 shows a flowchart of a method for remote assistance based on augmented reality at a second user equipment according to another embodiment of the present application;
FIG. 5 shows a flowchart of a method for remote assistance based on augmented reality at a first user equipment according to yet another embodiment of the present application;
FIG. 6 shows a flowchart of a method for remote assistance based on augmented reality at a network device according to yet another embodiment of the present application;
FIG. 7 shows a flowchart of a method for remote assistance based on augmented reality at a third user equipment according to yet another embodiment of the present application;
FIG. 8 shows a flowchart of a method for remote assistance based on augmented reality at a second user equipment according to yet another embodiment of the present application;
FIG. 9 shows a flowchart of a method for remote assistance based on augmented reality at a network device according to yet another embodiment of the present application;
FIG. 10 shows a system method diagram for remote assistance based on augmented reality according to one aspect of the present application;
FIG. 11 shows a system method diagram for remote assistance based on augmented reality according to another aspect of the present application;
FIG. 12 shows a system method diagram for remote assistance based on augmented reality according to yet another aspect of the present application;
FIG. 13 shows a first user equipment for remote assistance based on augmented reality according to one embodiment of the present application;
FIG. 14 shows a second user equipment for remote assistance based on augmented reality according to another embodiment of the present application;
FIG. 15 shows a first user equipment for remote assistance based on augmented reality according to yet another embodiment of the present application;
FIG. 16 shows a network device for remote assistance based on augmented reality according to yet another embodiment of the present application;
FIG. 17 shows a third user equipment for remote assistance based on augmented reality according to yet another embodiment of the present application;
FIG. 18 shows a second user equipment for remote assistance based on augmented reality according to yet another embodiment of the present application;
FIG. 19 shows a network device for remote assistance based on augmented reality according to yet another embodiment of the present application;
FIG. 20 shows a schematic diagram of a system for remote assistance based on augmented reality according to one aspect of the present application;
FIG. 21 shows a schematic diagram of a system for remote assistance based on augmented reality according to another aspect of the present application;
FIG. 22 shows a schematic diagram of a system for remote assistance based on augmented reality according to yet another aspect of the present application;
FIG. 23 shows an exemplary system that can be used to implement the embodiments described in the present application.
The same or similar reference numerals in the drawings denote the same or similar components.
DETAILED DESCRIPTION
The present application is described in further detail below with reference to the drawings.
In a typical configuration of the present application, the terminal, the devices of the service network, and the trusted party each include one or more processors (CPUs), input/output interfaces, network interfaces, and memory.
Memory may include non-permanent memory in computer-readable media, in the form of random access memory (RAM) and/or non-volatile memory such as read-only memory (ROM) or flash memory (flash RAM). Memory is an example of a computer-readable medium.
Computer-readable media include permanent and non-permanent, removable and non-removable media, which can store information by any method or technology. The information may be computer-readable instructions, data structures, program modules, or other data. Examples of computer storage media include, but are not limited to, phase-change memory (PRAM), static random access memory (SRAM), dynamic random access memory (DRAM), other types of random access memory (RAM), read-only memory (ROM), electrically erasable programmable read-only memory (EEPROM), flash memory or other memory technologies, compact disc read-only memory (CD-ROM), digital versatile discs (DVD) or other optical storage, magnetic cassettes, magnetic tape or disk storage or other magnetic storage devices, or any other non-transmission medium that can be used to store information accessible to a computing device.
The devices referred to in the present application include, but are not limited to, user equipment, network devices, or devices formed by integrating user equipment and network devices through a network. The user equipment includes, but is not limited to, any mobile electronic product capable of human-computer interaction with a user, such as a smartphone or a tablet computer; the mobile electronic product may run any operating system, such as Android, iOS, or Windows. The network device includes an electronic device capable of automatically performing numerical calculation and information processing according to instructions set or stored in advance; its hardware includes, but is not limited to, microprocessors, application-specific integrated circuits (ASICs), programmable logic devices (PLDs), field-programmable gate arrays (FPGAs), digital signal processors (DSPs), embedded devices, and the like. The network device includes, but is not limited to, a computer, a network host, a single network server, a set of multiple network servers, or a cloud formed by multiple servers; here, the cloud consists of a large number of computers or network servers based on cloud computing, where cloud computing is a form of distributed computing: a virtual supercomputer made up of a group of loosely coupled computers. The network includes, but is not limited to, the Internet, a wide area network, a metropolitan area network, a local area network, a VPN, a wireless ad hoc network, and the like. Preferably, the device may also be a program running on the user equipment, the network device, or a touch terminal, or on a device formed by integrating user equipment and a network device, or a network device and a touch terminal, through a network.
Of course, those skilled in the art will understand that the above devices are only examples; other existing or future devices, if applicable to the present application, should also fall within its scope of protection and are incorporated herein by reference.
FIG. 1 shows a typical scenario of the present application. A first user (such as a worker) holds a first user equipment and a second user (such as an expert) holds a second user equipment, and a communication connection has been established between the two devices. The first user equipment receives marker information sent by the second user equipment and superimposes it on the video information captured in real time, helping the first user complete the task more accurately and quickly. The marker information may be position marker information such as a drawn circle, or operation guidance information obtained through gesture recognition and matched against preset operation information. The first user equipment and the second user equipment may interact one-to-one directly, one-to-one through the cloud, or many-to-many through the cloud.
The first user equipment includes, but is not limited to, augmented reality glasses, tablet computers, mobile terminals, PCs, and similar devices. The following embodiments are described using augmented reality glasses as an example; those skilled in the art will understand that the embodiments apply equally to other first user equipment such as tablet computers, mobile terminals, and PCs. The second user equipment includes, but is not limited to, augmented reality glasses, tablet computers, mobile terminals, PCs, and similar devices. The following embodiments are described using a tablet computer as an example; those skilled in the art will understand that the embodiments apply equally to other second user equipment such as augmented reality glasses, mobile terminals, and PCs.
FIG. 2 shows a method for remote assistance based on augmented reality at the first user equipment according to one aspect of the present application, wherein the method includes step S11, step S12, and step S13. In step S11, the first user equipment acquires video information about a target object in real time through a camera device in the first user equipment. In step S12, the first user equipment determines transfer matrix information corresponding to the target object in each video frame of the video information by performing a target tracking operation on the target object in the video information. In step S13, the first user equipment superimposes and displays corresponding marker information on the target object according to the transfer matrix information, wherein the marker information includes operation instruction information of a second user on the target object, sent by a corresponding second user equipment.
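As a minimal sketch of the overlay in step S13: the application does not specify the form of the transfer matrix information, so the code below assumes it is a 3x3 homogeneous matrix that maps marker coordinates from the frame in which the marker was drawn into the current frame. The names `apply_transfer` and `overlay_marker` are hypothetical and used only for illustration.

```python
from typing import List, Tuple

Matrix = List[List[float]]  # assumed 3x3 homogeneous transfer matrix
Point = Tuple[float, float]


def apply_transfer(m: Matrix, p: Point) -> Point:
    """Map one 2D point through a 3x3 homogeneous matrix."""
    x, y = p
    u = m[0][0] * x + m[0][1] * y + m[0][2]
    v = m[1][0] * x + m[1][1] * y + m[1][2]
    w = m[2][0] * x + m[2][1] * y + m[2][2]
    return (u / w, v / w)


def overlay_marker(marker: List[Point], transfer: Matrix) -> List[Point]:
    """Return the marker's points at the target's position in the current frame."""
    return [apply_transfer(transfer, p) for p in marker]


# Example: the tracker reports that the target moved 12 px right and 5 px up.
shift = [[1.0, 0.0, 12.0],
         [0.0, 1.0, -5.0],
         [0.0, 0.0, 1.0]]
marker_points = [(100.0, 80.0), (140.0, 80.0)]  # two points of a drawn marker
moved = overlay_marker(marker_points, shift)
```

Under this assumption the marker stays attached to the target object as long as each frame's matrix is applied to the original marker coordinates before drawing.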
Specifically, in step S11, the first user equipment acquires video information about the target object in real time through the camera device in the first user equipment. For example, the target object may be the object corresponding to image information in a video frame marked by the first user, the object corresponding to image information in a video frame marked by the second user and received by the first user, or an object that the first user equipment determines from image information input by the first user. The first user equipment includes a camera device, through which it captures video information about the target object in real time.
In step S12, the first user equipment performs a target tracking operation on the target object in the video information to determine the transfer matrix information corresponding to the target object in each video frame of the video information. The transfer matrix information includes the correspondence, obtained by the first user equipment through a target tracking algorithm, between the target object in the current video frame and in previous video frames. Target tracking algorithms include, but are not limited to, the kernelized correlation filter (KCF) tracking algorithm, dense optical flow tracking, sparse optical flow tracking, Kalman filtering tracking, and multiple instance learning tracking. Taking KCF as an example, the KCF algorithm solves the tracking problem by learning a kernelized regularized least squares (KRLS) linear classifier. The movement of the target in the scene can be regarded as the vector sum of its horizontal and vertical movements. KCF introduces the concept of dense sampling and treats all samples as cyclic shifts of a base sample. The Gaussian kernel function is then highly structured: the kernel matrix is a circulant matrix, and by the circular convolution principle every product with a circulant matrix can be converted into a convolution with the first row vector of that matrix. With the discrete Fourier transform (DFT), this spatial-domain convolution can then be computed quickly as an element-wise product in the frequency domain.
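As a non-limiting illustration of the circulant-matrix property relied on above, the following Python sketch compares a direct product with a circulant matrix against the same result obtained through the DFT. It is not the KCF tracker itself. The function names are hypothetical, and the circulant matrix is built here from its first column (an assumption made so that the product is exactly a circular convolution; with the first-row convention the same identity holds after an index reversal).

```python
import cmath


def dft(values, inverse=False):
    """Naive O(n^2) discrete Fourier transform (illustrative, not optimized)."""
    n = len(values)
    sign = 1 if inverse else -1
    result = []
    for k in range(n):
        total = sum(
            values[t] * cmath.exp(sign * 2j * cmath.pi * k * t / n)
            for t in range(n)
        )
        result.append(total / n if inverse else total)
    return result


def circulant_matvec(first_col, x):
    """Direct product C @ x, where C[i][j] = first_col[(i - j) mod n]."""
    n = len(first_col)
    return [
        sum(first_col[(i - j) % n] * x[j] for j in range(n))
        for i in range(n)
    ]


def circulant_matvec_dft(first_col, x):
    """Same product computed as an element-wise product in the frequency domain."""
    spectrum = [a * b for a, b in zip(dft(first_col), dft(x))]
    return [value.real for value in dft(spectrum, inverse=True)]


direct = circulant_matvec([1, 2, 3, 4], [1, 0, 2, 0])
via_dft = circulant_matvec_dft([1, 2, 3, 4], [1, 0, 2, 0])
```

In practice a fast Fourier transform would replace the naive transform shown here, which is what makes the frequency-domain route cheaper than the direct product.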
Of course, those skilled in the art will understand that the tracking algorithms above are only examples; other existing or future tracking algorithms, where applicable to the present application, also fall within its scope of protection and are incorporated herein by reference.
In step S13, the first user equipment superimposes the corresponding marker information on the target object according to the transfer matrix information, wherein the marker information includes operation instruction information of the second user on the target object, sent by the corresponding second user equipment. The marker information includes the operation instruction information about the target object that the first user equipment receives from the second user equipment, such as virtual operation information on the target object. For example, the first user equipment receives operation instruction information about the target object from the second user equipment; while tracking the target according to the transfer matrix information, the first user equipment superimposes the marker information at the position corresponding to the target object according to that same information. For augmented reality glasses, the marker information is superimposed at the corresponding position on the lenses of the glasses, and that position is calculated by the augmented reality glasses or a network device according to the target tracking algorithm; for a PC, tablet computer, mobile terminal or the like, the marker information is superimposed at the position corresponding to the target object in the current video frame. The first user equipment and the second user equipment may establish a communication connection directly or through a network device. The following embodiments are described with a direct connection between the first and second user equipment as the example; those skilled in the art will understand that they apply equally to other connection modes, such as a connection established through a network device.
For example, the first user holds augmented reality glasses, the second user holds a tablet computer, and the glasses and the tablet have established a communication connection. The glasses and the tablet have already exchanged a video stream or images of the target object, and the glasses have received from the tablet operation instruction information about the target object in a previous video frame. Suppose the target object is a part on a workbench. The target object may be determined by the first user equipment based on a selection operation of the first user (such as circling it), may be determined by the second user equipment based on a selection operation of the second user and received by the first user equipment, or may be determined by the first user equipment by recognizing initial image information of the target object. The corresponding operation instruction information includes, for example, virtual operation information that the second user equipment obtains by recognizing the second user's gestures for operating the part. The glasses capture current video information about the target object in real time through the camera and then use the target tracking algorithm to calculate the transfer matrix information of the target object in the current video frame relative to the target object in the previous video frame. The glasses then determine the position of the target object in the current video frame from the transfer matrix information and superimpose the corresponding marker information at that position, for example displaying the operation instruction information corresponding to the second user's gesture at the position of the part on the workbench in the current video frame.
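As a non-limiting illustration of how per-frame transfer matrix information could place a marker, the following Python sketch assumes that each transfer matrix is a 3x3 homogeneous 2D transform mapping pixel coordinates in the previous frame to the current frame. The present application does not fix the form of the matrix, so this representation and all function names are assumptions made for the example.

```python
def matmul3(a, b):
    """Product of two 3x3 matrices given as nested lists."""
    return [
        [sum(a[i][k] * b[k][j] for k in range(3)) for j in range(3)]
        for i in range(3)
    ]


def apply_transform(m, point):
    """Map a 2D point through a 3x3 homogeneous transform."""
    x, y = point
    w = m[2][0] * x + m[2][1] * y + m[2][2]
    return (
        (m[0][0] * x + m[0][1] * y + m[0][2]) / w,
        (m[1][0] * x + m[1][1] * y + m[1][2]) / w,
    )


def translation(dx, dy):
    """Transfer matrix for a pure shift of the target by (dx, dy) pixels."""
    return [[1.0, 0.0, dx], [0.0, 1.0, dy], [0.0, 0.0, 1.0]]


def track_marker(anchor, frame_transforms):
    """Return the marker position in each frame.

    `anchor` is the marker position in the frame where it was drawn;
    `frame_transforms` holds one previous-to-current matrix per later frame.
    """
    accumulated = translation(0.0, 0.0)  # identity
    positions = []
    for m in frame_transforms:
        accumulated = matmul3(m, accumulated)
        positions.append(apply_transform(accumulated, anchor))
    return positions


positions = track_marker((100.0, 50.0), [translation(5.0, 0.0), translation(0.0, -3.0)])
```

Accumulating the matrices keeps the marker attached to the target even though each matrix only describes the change since the previous frame.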
Of course, those skilled in the art will understand that the marker information and/or operation instruction information above are only examples; other existing or future marker information and/or operation instruction information, where applicable to the present application, also fall within its scope of protection and are incorporated herein by reference.
In some embodiments, the method further includes step S14 (not shown). In step S14, the first user equipment sends the video information to the second user equipment. For example, the first user equipment captures current video information about the target object in real time and sends it to the second user equipment, either directly or through a network device. The video information includes image information collected by the first user equipment through the camera device and may also include audio information collected through a microphone device; the audio and video are multiplexed and compressed into a video/audio stream by a compression algorithm. The first user equipment transmits the compressed stream to the second user equipment over a network transport protocol such as the User Datagram Protocol (UDP), the Transmission Control Protocol (TCP) or the Real-time Transport Protocol (RTP).
For example, the augmented reality glasses capture video information about the current target object in real time and send it to the tablet computer, either directly or via the cloud, which forwards it to the tablet. The tablet receives and presents the video information, helping the second user continue to guide the first user through operations such as machining the part on the workbench.
In some embodiments, in step S14, the first user equipment sends both the video information and the transfer matrix information to the second user equipment. For example, when sending the video information to the second user equipment, the first user equipment also sends the transfer matrix information obtained from the target tracking operation, so that the second user equipment can track the target object while presenting the video information.
For example, the augmented reality glasses capture video information about the current target object in real time, perform a target tracking operation on the target object in that video information with reference to earlier video frames, and determine the transfer matrix information of the target object in each video frame relative to the preceding video frame. The glasses then send the video information, together with the transfer matrix information corresponding to each of its video frames, to the tablet computer, either directly or via the cloud.
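As a non-limiting illustration of sending each video frame together with its transfer matrix information, the following Python sketch packs one frame and one matrix into a single length-prefixed message. The message layout, the field names `frame` and `matrix`, and the function names are hypothetical; the present application does not prescribe a wire format, and the encoded frame bytes are assumed to come from whatever video encoder is in use.

```python
import json
import struct


def pack_frame_message(frame_index, encoded_frame, transfer_matrix):
    """Bundle an encoded frame with its transfer matrix.

    Layout (an assumption for this sketch): two big-endian uint32 lengths,
    then a JSON header, then the raw encoded frame bytes.
    """
    header = json.dumps(
        {"frame": frame_index, "matrix": transfer_matrix}
    ).encode("utf-8")
    return struct.pack("!II", len(header), len(encoded_frame)) + header + encoded_frame


def unpack_frame_message(message):
    """Inverse of pack_frame_message; returns (index, frame bytes, matrix)."""
    header_len, frame_len = struct.unpack("!II", message[:8])
    header = json.loads(message[8:8 + header_len].decode("utf-8"))
    frame_start = 8 + header_len
    encoded_frame = message[frame_start:frame_start + frame_len]
    return header["frame"], encoded_frame, header["matrix"]


matrix = [[1.0, 0.0, 5.0], [0.0, 1.0, -3.0], [0.0, 0.0, 1.0]]
message = pack_frame_message(7, b"\x00\x01frame-bytes", matrix)
received = unpack_frame_message(message)
```

Keeping the matrix in the same message as its frame means the receiving side never has to match matrices to frames by arrival order.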
In some embodiments, the method further includes step S15 (not shown). In step S15, the first user equipment receives continued operation instruction information, sent by the second user equipment, that the second user gives on the target object based on the video information. For example, the second user equipment generates the continued operation instruction information from the second user's further operations on the target object (such as drawing line segments or circles) or from gestures of the second user identified through gesture recognition. The second user equipment then sends the continued operation instruction information to the first user equipment to help the first user continue operating on the target object.
For example, the augmented reality glasses send video information about the target object, captured in real time, to the tablet computer, which receives and presents it. The tablet then performs target tracking on each video frame of the received video stream to obtain the position of the target object in the frame; in some embodiments, the tablet highlights the target object in the frame with line segments, circles, locally increased brightness or the like. The second user guides the first user in machining the part by drawing marks on the tablet or by making gestures within the field of view of the tablet's camera. The tablet takes the collected marks as continued operation instruction information, or performs gesture recognition on the captured gestures and takes the recognized gestures as continued operation instruction information. The tablet then sends the continued operation instruction information to the glasses, which receive it and superimpose it at the corresponding position.
As another example, the augmented reality glasses send video information about the target object, captured in real time, to the tablet computer together with the transfer matrix information corresponding to each video frame, and the tablet receives and presents the video information. The tablet then determines the position of the target object in each video frame from the received transfer matrix information; in some embodiments, the tablet highlights the target object in the frame with line segments, circles, locally increased brightness or the like. The second user guides the first user in machining the part by drawing marks on the tablet or by making gestures within the field of view of the tablet's camera. The tablet takes the collected marks as continued operation instruction information, or performs gesture recognition on the captured gestures and takes the recognized gestures as continued operation instruction information. The tablet then sends the continued operation instruction information to the glasses, which receive it and superimpose it at the corresponding position.
Of course, those skilled in the art will understand that the continued operation instruction information above is only an example; other existing or future continued operation instruction information, where applicable to the present application, also falls within its scope of protection and is incorporated herein by reference.
In some embodiments, the method further includes step S16 (not shown). In step S16, the first user equipment receives camera control instruction information, sent by the second user equipment, that the second user issues for the camera device; adjusts the camera parameter information of the camera device according to the camera control instruction information; captures video information about the target object in real time through the adjusted camera device; and sends that video information to the second user equipment. The camera control instruction information includes instruction information for adjusting hardware parameters of the camera device of the first user equipment. The camera parameter information includes, but is not limited to, resolution, pixel depth, maximum frame rate, exposure mode and shutter speed, pixel size, and spectral response characteristics. For example, the first user equipment receives camera control instruction information with which the second user adjusts the first user's camera device, adjusts the camera parameter information of the camera device accordingly, captures video information of the current target object in real time through the adjusted camera device, and sends the video information to the second user equipment.
For example, as shown in FIG. 3, panel A is the real-time video information received by the second user, in which the target object is the mouse pad on the desk. Wishing to examine the target object more closely, the second user either operates the settings icon in the upper right corner of the video or zooms in directly by spreading two fingers apart on the screen. Based on this operation, the tablet computer generates camera control instruction information for focusing on the target object and sends it to the augmented reality glasses. The glasses receive the camera control instruction information, adjust the relevant camera parameters of the camera device, such as resolution and focal length, capture adjusted video information about the target object, and send it to the tablet. Panel B shows the magnified video information about the target object as received and presented by the tablet.
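As a non-limiting illustration of applying camera control instruction information on the first user equipment, the following Python sketch merges a received instruction into the current camera parameters and rejects parameters it does not recognize. The `CameraParams` fields (including `zoom`), their default values, and the dictionary form of the instruction are all hypothetical; a real device would forward the validated values to its own camera interface.

```python
from dataclasses import dataclass, replace


@dataclass(frozen=True)
class CameraParams:
    """Illustrative subset of camera parameter information."""
    width: int = 1280
    height: int = 720
    max_frame_rate: int = 30
    zoom: float = 1.0


def apply_camera_control(params, instruction):
    """Return new parameters with the instructed fields overridden.

    `instruction` is assumed to be a dict of parameter name to new value,
    as it might arrive from the second user equipment.
    """
    allowed = {"width", "height", "max_frame_rate", "zoom"}
    unknown = set(instruction) - allowed
    if unknown:
        raise ValueError(f"unsupported camera parameters: {sorted(unknown)}")
    return replace(params, **instruction)


current = CameraParams()
adjusted = apply_camera_control(current, {"zoom": 2.0, "width": 1920, "height": 1080})
```

Returning a new parameter object rather than mutating the old one leaves the previous settings available if the adjustment has to be rolled back.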
Of course, those skilled in the art will understand that the camera control instruction information and/or camera parameter information above are only examples; other existing or future camera control instruction information and/or camera parameter information, where applicable to the present application, also fall within its scope of protection and are incorporated herein by reference.
In some embodiments, the marker information further includes auxiliary marking information with which the first user marks the target object through the first user equipment. The auxiliary marking information includes marks on the target object (such as drawn line segments or circles) that the first user equipment collects from the first user's operations, or feedback on the marker information sent by the second user equipment, such as a question posed within the marker information or a circle drawn around text. For example, the first user equipment generates auxiliary marking information about the target object from the first user's operation and sends it to the second user equipment for further remote interaction.
For example, while capturing video information about the target object, the first user circles the specific position of the target object, and the first user equipment generates corresponding auxiliary marking information from that operation. The first user equipment sends the auxiliary marking information to the second user equipment together with the video information. The second user equipment receives both, calculates the position of the auxiliary marking information in each video frame from its initial position in the video frame and the target tracking algorithm, and superimposes the auxiliary marking information at the corresponding position in each video frame while presenting the video information. As another example, the first user equipment calculates, by the target tracking algorithm, the transfer matrix information of the auxiliary marking information for each video frame of the video information and sends the video information, the auxiliary marking information and the corresponding transfer matrix information to the second user equipment, which then superimposes the auxiliary marking information at the corresponding position according to the transfer matrix information while presenting the video information.
As another example, after the augmented reality glasses superimpose the second user's operation instruction information for the target object at the corresponding position, the first user may have a question about that operation instruction information and circle the part in question within it; or the first user may have completed the instructed operation and, wanting further instructions, tap a next-step prompt at the position of the target object. Based on the first user's operation, the glasses generate auxiliary marking information, such as question information about the operation instruction information or a request for the next operation instruction, and send it to the tablet computer. The tablet receives the auxiliary marking information, superimposes it at the corresponding position, and produces corresponding continued operation instruction information based on it, such as an answer to the question or the next operation instruction. The tablet sends the continued operation instruction information to the glasses, which superimpose it on the video information. The continued operation instruction information includes the auxiliary marking information, such as what the earlier question was or the next-step prompt.
Of course, those skilled in the art will understand that the auxiliary marking information above is only an example; other existing or future auxiliary marking information, where applicable to the present application, also falls within its scope of protection and is incorporated herein by reference.
In some embodiments, the target object includes a paper document under discussion, and the second user's operation instruction information on the target object includes one or more pieces of annotation position information of the second user in video frames of the paper document under discussion. For example, the target object may be a paper document under discussion, and the corresponding operation instruction information includes one or more pieces of annotation position information of the second user in video frames of that document, such as an underline or circle marking a position in the document, or an annotation for a piece of text (for example its pinyin, an explanation, or associated content).
For example, the first user wears augmented reality glasses and reads a paper document through them, the second user holds a tablet computer, and the tablet and the glasses have established a communication connection. The glasses capture video information of the paper document under discussion through the camera device and send it to the tablet. The tablet receives the video information and generates operation instruction information from one or more annotation operations of the second user on the document under discussion, for example operation instruction information containing an error position that indicates where the document contains a mistake. The tablet sends the operation instruction information to the glasses. The glasses calculate the position of the paper document under discussion in the video frames of the current video information by the target tracking algorithm, for example as the corresponding transfer matrix information, and, using the transfer matrix information and the error position in the operation instruction information, superimpose the corresponding annotation or annotations in real time at the corresponding position on the paper document, indicating to the first user that the document contains a mistake at that position.
Of course, those skilled in the art will understand that the target object and/or operation instruction information above are only examples; other existing or future target objects and/or operation instruction information, where applicable to the present application, also fall within its scope of protection and are incorporated herein by reference.
In some embodiments, in step S13, the first user equipment generates rendered marker information from the one or more pieces of annotation position information and superimposes the rendered marker information on the target object according to the transfer matrix information. The rendered marker information includes highlighting, underlining, circling or similar marks at the one or more annotation positions. For example, the first user equipment generates rendered marker information from the annotation positions, within the paper document under discussion, of the one or more annotations in the operation instruction information; determines the position of the paper document in each video frame of the video information from the transfer matrix information; thereby determines the position of the rendered marker in each video frame; and superimposes the rendered marker information at that position.
For example, the augmented reality glasses receive operation instruction information that contains an annotation for the fifth character in the second row of the page currently being read in the paper document under discussion. From this operation instruction information, the glasses generate rendered marker information consisting of an underline directly beneath the fifth character in the second row of that page. The glasses calculate the position of the paper document in each video frame of the current video information by the target tracking algorithm and, using the position of the rendered marker relative to the paper document, superimpose the underline beneath the fifth character in the second row of the page in each video frame.
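As a non-limiting illustration of positioning such an underline, the following Python sketch first computes the underline segment in page coordinates and then maps it into the video frame through a 3x3 matrix. The uniform character grid (`char_w`, `line_h`), the page coordinate system, and the use of a homogeneous page-to-frame matrix are assumptions made for the example and are not specified by the present application.

```python
def underline_in_page(row, col, char_w, line_h, margin_x=0.0, margin_y=0.0):
    """Underline segment for a character, in page coordinates.

    `row` and `col` are 1-based; the page is assumed to use a uniform grid
    with characters `char_w` wide and text lines `line_h` tall.
    """
    x_start = margin_x + (col - 1) * char_w
    x_end = x_start + char_w
    y = margin_y + row * line_h  # bottom edge of the text line
    return (x_start, y), (x_end, y)


def project(m, point):
    """Map a page point into the video frame through a 3x3 homogeneous matrix."""
    x, y = point
    w = m[2][0] * x + m[2][1] * y + m[2][2]
    return (
        (m[0][0] * x + m[0][1] * y + m[0][2]) / w,
        (m[1][0] * x + m[1][1] * y + m[1][2]) / w,
    )


def underline_in_frame(page_to_frame, row, col, char_w, line_h):
    """Endpoints of the underline in frame pixels for the current frame."""
    start, end = underline_in_page(row, col, char_w, line_h)
    return project(page_to_frame, start), project(page_to_frame, end)


# Example: the page appears scaled by 2 and shifted by (10, 20) in the frame.
page_to_frame = [[2.0, 0.0, 10.0], [0.0, 2.0, 20.0], [0.0, 0.0, 1.0]]
segment = underline_in_frame(page_to_frame, row=2, col=5, char_w=10.0, line_h=20.0)
```

Because the underline is stored relative to the page, only the page-to-frame matrix has to be updated from frame to frame.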
Of course, those skilled in the art will understand that the rendered marker information above is only an example; other existing or future rendered marker information, where applicable to the present application, also falls within its scope of protection and is incorporated herein by reference.
In some embodiments, the method further includes step S17 (not shown). In step S17, the first user equipment captures image information about the target object in real time through the camera device in the first user equipment, sends the image information to the corresponding second user equipment, receives marker information about the target object, wherein the marker information includes operation instruction information of the second user on the target object in the image information, sent by the second user equipment, and superimposes the marker information on the target object. In step S11, the first user equipment then captures video information about the target object in real time through the camera device. For example, the first user equipment captures image information about the target object through the camera device and sends it to the second user equipment, which receives and presents it so that the second user can operate on the target object. Based on the second user's operation, the second user equipment generates marker information corresponding to the operation instruction information and sends it to the first user equipment. The first user equipment receives the marker information and superimposes it at the position corresponding to the target object in the image. The first user equipment then collects a video stream of the target object through the camera device and, using the target tracking algorithm, superimposes the marker information in each video frame of the stream.
For example, the augmented reality glasses capture image information of the current target object and send it to the tablet computer, which receives and presents the image. The second user gives operation instructions for the target object based on the presented image; the tablet computer collects the second user's operation instruction information, generates the corresponding marker information, and sends it to the augmented reality glasses. The augmented reality glasses receive the marker information and, using the target tracking algorithm, superimpose it on the captured image. Afterwards, the augmented reality glasses continue to collect video information of the target object and, using the target tracking algorithm, superimpose the marker information at the corresponding position in real time.
FIG. 4 shows a method for remote assistance based on augmented reality at a second user equipment according to another aspect of this application, where the method includes step S21 and step S22. In step S21, the second user equipment receives video information about a target object that is sent by the corresponding first user equipment and captured in real time through the camera device in the first user equipment. In step S22, the second user equipment presents the video information and keeps the corresponding marker information superimposed on the target object in each video frame of the video information, where the marker information includes operation instruction information of the second user on the target object, given through the second user equipment. For example, the second user equipment receives and presents image information or video information about the target object sent by the first user equipment, and collects the second user's operation to generate the corresponding marker information. The second user equipment then continues to receive video information about the target object from the first user equipment and, while presenting that video, superimposes the previously determined marker information on it.
For example, a first user holds augmented reality glasses, a second user holds a tablet computer, and the augmented reality glasses have established a communication connection with the tablet computer. The augmented reality glasses and the tablet computer have already exchanged a video stream or image of the target object, and the augmented reality glasses have received from the tablet computer operation instruction information about the target object in an earlier video frame. Suppose the target object is a part on a workbench. The target object may be determined by the first user equipment based on a selection operation by the first user (such as circling it), may be determined by the second user equipment based on a selection operation by the second user and then received by the first user equipment, or may be determined by the first user equipment by recognizing initial image information of the target object. The corresponding operation instruction information includes, for example, virtual operation information that the second user equipment obtains by recognizing the second user's gestures for operating the part. The augmented reality glasses collect current video information about the target object in real time through the camera, and then use a target tracking algorithm to compute transfer matrix information of the target object in the current video frame relative to the target object in the previous video frame. The augmented reality glasses then determine the position of the target object in the current video frame from the transfer matrix information and superimpose the corresponding marker information at that position, for example by superimposing the operation instruction information corresponding to the second user's gesture at the position of the part on the workbench in the current video frame. At the same time, the augmented reality glasses send the video information to the tablet computer, which receives and presents it and, while doing so, superimposes the earlier marker information at the corresponding position in the video. In other embodiments, the augmented reality glasses also send auxiliary marking information to the tablet computer. The auxiliary marking information includes marks on the target object (such as line segments or circles) that the augmented reality glasses collect from the first user's operations, or feedback on the marker information sent by the tablet computer, such as a question added to the marker information or text that has been circled. The tablet computer receives the auxiliary marking information and, while presenting the video information, superimposes it at the position corresponding to the target object.
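The position update in this example can be illustrated with a minimal sketch. It assumes that the transfer matrix information is a 3x3 matrix in homogeneous image coordinates mapping the previous frame to the current one; the application does not fix the form of the matrix, and the names `matmul3`, `apply_transfer`, and `track_marker` are hypothetical.

```python
from typing import List, Tuple

Matrix = List[List[float]]
Point = Tuple[float, float]


def matmul3(a: Matrix, b: Matrix) -> Matrix:
    """Multiply two 3x3 matrices."""
    return [[sum(a[i][k] * b[k][j] for k in range(3)) for j in range(3)]
            for i in range(3)]


def apply_transfer(h: Matrix, p: Point) -> Point:
    """Map a 2D image point through a 3x3 matrix using homogeneous coordinates."""
    x, y = p
    u = h[0][0] * x + h[0][1] * y + h[0][2]
    v = h[1][0] * x + h[1][1] * y + h[1][2]
    w = h[2][0] * x + h[2][1] * y + h[2][2]
    if abs(w) < 1e-12:
        raise ValueError("degenerate transfer matrix")
    return (u / w, v / w)


def track_marker(marker: List[Point], per_frame: List[Matrix]) -> List[List[Point]]:
    """Return the marker's points in every frame.

    `marker` is given in the coordinates of the frame where it was drawn.
    `per_frame[i]` is assumed to map frame i to frame i + 1.
    """
    cumulative: Matrix = [[1.0, 0.0, 0.0], [0.0, 1.0, 0.0], [0.0, 0.0, 1.0]]
    positions = []
    for h in per_frame:
        # Compose the newest matrix on the left so that `cumulative`
        # maps the original frame straight to the current frame.
        cumulative = matmul3(h, cumulative)
        positions.append([apply_transfer(cumulative, p) for p in marker])
    return positions
```

Composing the per-frame matrices keeps the marker attached to the target object even though each matrix only describes motion relative to the previous frame.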
Of course, those skilled in the art will understand that the marker information described above is only an example; other existing or future marker information, where applicable to this application, also falls within the scope of protection of this application and is incorporated herein by reference.
In some embodiments, the method further includes step S23 (not shown). In step S23, the second user equipment performs a target tracking operation on the target object in the video information. In this case, in step S22, the second user equipment presents the video information and, according to the result information of the target tracking operation, superimposes the corresponding marker information on the target object in each video frame of the video information, where the marker information includes operation instruction information of the second user on the target object, given through the second user equipment. For example, the second user equipment receives video information about the target object from the first user equipment and, according to template information of the target object, performs target tracking on the target object in the video to determine its position in each video frame. The template information may be sent by the first user equipment to the second user equipment, selected in an initial video frame based on the second user's operation, or obtained by importing template information. Then, when presenting the video information, the second user equipment superimposes the marker information at the position of the target object according to the tracking result. The marker information may be generated by the second user equipment from the second user's guidance on the target object in the initial video frame or image information, or from operation guidance that the second user gives on the target object in video information sent later.
For example, the tablet computer receives the video information sent by the augmented reality glasses and, according to template information of the part on the workbench, performs target tracking on the part in each video frame to obtain its position in each frame. The template of the part may be imported by the second user, selected in an initialization frame, or sent by the augmented reality glasses. The tablet computer receives and presents the video information, and generates the corresponding marker information from the second user's installation guidance for the part (for example, circling the installation position or pointing an arrow at it, or a preset installation operation matched through gesture recognition). While presenting the video information, the second user equipment superimposes the marker information in subsequent video frames in real time according to the position of the part in each frame.
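As a rough illustration of locating a part from its template, the sketch below uses an exhaustive sum-of-absolute-differences search over a grayscale frame. The application does not name a tracking algorithm, so this is a simple stand-in rather than the method described; `locate_template` and the list-of-lists image format are assumptions.

```python
from typing import List, Optional, Tuple

Image = List[List[int]]  # grayscale pixels, indexed as image[row][column]


def locate_template(frame: Image, template: Image) -> Optional[Tuple[int, int]]:
    """Return (left, top) of the best template match, or None if it cannot fit."""
    frame_h, frame_w = len(frame), len(frame[0])
    tmpl_h, tmpl_w = len(template), len(template[0])
    best_pos: Optional[Tuple[int, int]] = None
    best_cost = float("inf")
    for top in range(frame_h - tmpl_h + 1):
        for left in range(frame_w - tmpl_w + 1):
            # Sum of absolute differences between the template and this window.
            cost = sum(
                abs(frame[top + r][left + c] - template[r][c])
                for r in range(tmpl_h)
                for c in range(tmpl_w)
            )
            if cost < best_cost:
                best_cost, best_pos = cost, (left, top)
    return best_pos
```

The returned position could then serve as the anchor at which the marker information is drawn in that frame; a production tracker would normally be more robust to scale, rotation, and lighting changes.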
Of course, those skilled in the art will understand that the marker information described above is only an example; other existing or future marker information, where applicable to this application, also falls within the scope of protection of this application and is incorporated herein by reference.
In some embodiments, in step S21, the second user equipment receives, from the corresponding first user equipment, video information about the target object acquired in real time through the camera device in the first user equipment, together with the transfer matrix information corresponding to the target object in each video frame of the video information. In this case, in step S22, the second user equipment presents the video information and, according to that transfer matrix information, superimposes the corresponding marker information on the target object in each video frame, where the marker information includes operation instruction information of the second user on the target object, given through the second user equipment. For example, when the first user equipment sends the video information to the second user equipment, it also sends the transfer matrix information obtained from its target tracking operation, so that the second user equipment can follow the target object while presenting the video. The second user equipment receives the video information and the transfer matrix information and, while presenting the video, superimposes the marker information at the corresponding position according to the transfer matrix information.
For example, the augmented reality glasses capture video information about the current target object in real time, perform target tracking on the target object using the preceding video frames, and determine, for each video frame, the transfer matrix information of the target object relative to the previous frame. The augmented reality glasses then send the video information and the transfer matrix information for each video frame to the tablet computer, either directly or through the cloud. The tablet computer receives the video information and the corresponding transfer matrix information and, while presenting the video, superimposes the marker information at the corresponding position according to the transfer matrix information.
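One way to picture sending the transfer matrix information alongside each video frame is a small per-frame metadata record. The sketch below is an assumption for illustration only: the application does not define a message format, and `FrameMeta`, its field names, and the JSON encoding are hypothetical.

```python
import json
from dataclasses import asdict, dataclass
from typing import List


@dataclass
class FrameMeta:
    """Hypothetical side-channel record accompanying one video frame."""
    frame_index: int
    transfer_matrix: List[List[float]]  # assumed 3x3, relative to the previous frame


def encode_meta(meta: FrameMeta) -> bytes:
    """Serialize the record so it can travel with the encoded frame."""
    return json.dumps(asdict(meta)).encode("utf-8")


def decode_meta(payload: bytes) -> FrameMeta:
    """Parse and validate a record on the receiving device."""
    data = json.loads(payload.decode("utf-8"))
    matrix = data["transfer_matrix"]
    if len(matrix) != 3 or any(len(row) != 3 for row in matrix):
        raise ValueError("transfer matrix must be 3x3")
    return FrameMeta(frame_index=int(data["frame_index"]), transfer_matrix=matrix)
```

Keying each record by frame index lets the receiving device pair a matrix with the frame it describes even if the video and the metadata arrive over different channels.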
In some embodiments, the method further includes step S24 (not shown). In step S24, the second user equipment obtains continued operation instruction information of the second user on the target object based on the video information, and sends it to the first user equipment. For example, the second user equipment generates the continued operation instruction information from the second user's further operations on the target object (such as drawing line segments or circles), or by recognizing the second user's gestures through gesture recognition. The second user equipment then sends the continued operation instruction information to the first user equipment to help the first user continue operating on the target object.
For example, the augmented reality glasses send video information about the target object, captured in real time, to the tablet computer, which receives and presents it. The tablet computer then performs target tracking in each video frame of the received stream to obtain the position of the target object in the frame. In some embodiments, the tablet computer highlights the target object in the video frame using line segments, circles, a local increase in brightness, or similar means. The second user guides the first user in machining the part by making marks on the tablet computer or by making gestures within the field of view of the tablet computer's camera. The tablet computer takes the collected marks as continued operation instruction information, or performs gesture recognition on the captured gestures and takes the recognized gesture as continued operation instruction information. The tablet computer then sends the continued operation instruction information to the augmented reality glasses, which receive it and superimpose it at the corresponding position.
As another example, the augmented reality glasses send video information about the target object, captured in real time, to the tablet computer, together with the transfer matrix information corresponding to each video frame, and the tablet computer receives and presents the video. The tablet computer then determines the position of the target object in the video frame from the received transfer matrix information. In some embodiments, the tablet computer highlights the target object in the video frame using line segments, circles, a local increase in brightness, or similar means. The second user guides the first user in machining the part by making marks on the tablet computer or by making gestures within the field of view of the tablet computer's camera. The tablet computer takes the collected marks as continued operation instruction information, or performs gesture recognition on the captured gestures and takes the recognized gesture as continued operation instruction information. The tablet computer then sends the continued operation instruction information to the augmented reality glasses, which receive it and superimpose it at the corresponding position.
Of course, those skilled in the art will understand that the continued operation instruction information described above is only an example; other existing or future continued operation instruction information, where applicable to this application, also falls within the scope of protection of this application and is incorporated herein by reference.
In some embodiments, the method further includes step S25 (not shown). In step S25, the second user equipment generates camera control instruction information of the second user for the camera device according to a camera control operation that the second user performs through the second user equipment, where the camera control instruction information is used to adjust camera parameter information of the camera device; the second user equipment sends the camera control instruction information to the first user equipment and receives the video information that the first user equipment captures with the adjusted camera device and sends back. For example, the second user equipment receives the video information, and the video is adjusted on it, such as by enlarging the area around the target object. Based on the user's operation, the second user equipment determines the corresponding camera control instruction information, which includes camera parameter information for adjusting the camera device of the first user equipment, and then sends the camera control instruction information to the first user equipment. The camera control instruction information includes instructions for adjusting hardware parameters of the camera device of the first user equipment, and the camera parameter information includes, but is not limited to, resolution, pixel depth, maximum frame rate, exposure mode and shutter speed, pixel size, and spectral response characteristics.
For example, as shown in FIG. 3, panel A is the real-time video information received by the second user, in which the target object is the mouse pad on the desk. The second user wants to examine the target object in more detail and either uses the settings icon in the upper right corner of the video or performs a two-finger spread (zoom-in) gesture directly on the screen. Based on this operation, the tablet computer generates camera control instruction information for focusing on the target object and sends it to the augmented reality glasses. The augmented reality glasses receive the camera control instruction information, adjust the relevant camera parameters of the camera device, such as resolution and focal length, capture adjusted video information about the target object, and send it to the tablet computer. Panel B shows the enlarged video information about the target object as received and presented by the tablet computer.
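The step from a two-finger spread gesture to a camera control instruction might look like the following sketch. Everything here is an assumption for illustration: the application does not define the instruction format, and `CameraControl`, `pinch_to_control`, `apply_control`, and the 1x to 4x zoom range are hypothetical.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class CameraControl:
    """Hypothetical camera control instruction sent to the first user equipment."""
    zoom: float  # requested zoom factor; 1.0 means no zoom


def pinch_to_control(start_distance: float, end_distance: float,
                     current_zoom: float = 1.0) -> CameraControl:
    """Turn the change in distance between two fingers into a zoom request."""
    if start_distance <= 0:
        raise ValueError("start_distance must be positive")
    return CameraControl(zoom=current_zoom * end_distance / start_distance)


def apply_control(control: CameraControl, min_zoom: float = 1.0,
                  max_zoom: float = 4.0) -> float:
    """On the capturing device, clamp the request to what the camera supports."""
    return max(min_zoom, min(max_zoom, control.zoom))
```

Clamping on the capturing device rather than on the tablet keeps the hardware limits in one place, since only the device with the camera knows its actual supported range.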
Of course, those skilled in the art will understand that the camera control instruction information and/or camera parameter information described above are only examples; other existing or future camera control instruction information and/or camera parameter information, where applicable to this application, also falls within the scope of protection of this application and is incorporated herein by reference.
In some embodiments, the method further includes step S26 (not shown). In step S26, the second user equipment receives and presents image information about the target object that is sent by the corresponding first user equipment and captured in real time through the camera device in the first user equipment, obtains operation instruction information of the second user on the target object in the image information, sends the operation instruction information to the first user equipment, and superimposes the operation instruction information on the target object in the image information. In this case, in step S21, the second user equipment receives video information about the target object that is sent by the first user equipment and captured in real time through the camera device. For example, the first user equipment captures image information about the target object through the camera device and sends it to the second user equipment, which receives and presents the image so that the second user can operate on the target object. Based on the second user's operation, the second user equipment generates marker information corresponding to the operation instruction information and sends it to the first user equipment. The first user equipment receives the marker information and superimposes it at the position corresponding to the target object in the image. Subsequently, the first user equipment collects a video stream of the target object through the camera device and, using a target tracking algorithm, superimposes the marker information in each video frame of that stream.
For example, the augmented reality glasses capture image information of the current target object and send it to the tablet computer, which receives and presents the image. The second user gives operation instructions for the target object based on the presented image; the tablet computer collects the second user's operation instruction information, generates the corresponding marker information, and sends it to the augmented reality glasses. The augmented reality glasses receive the marker information and, using the target tracking algorithm, superimpose it on the captured image. Afterwards, the augmented reality glasses continue to collect video information of the target object and, using the target tracking algorithm, superimpose the marker information at the corresponding position in real time.
FIG. 5 shows a method for remote assistance based on augmented reality at a first user equipment according to yet another aspect of this application, where the method includes step S31, step S32, step S33, and step S34. In step S31, the first user equipment captures video information about a first target object in real time through the camera device in the first user equipment. In step S32, the first user equipment sends the video information to the corresponding network device. In step S33, the first user equipment receives, from the network device, first transfer matrix information corresponding to the first target object in each video frame of the video information. In step S34, the first user equipment superimposes the corresponding first marker information on the first target object according to the first transfer matrix information, where the first marker information includes operation instruction information of the second user on the first target object, sent by the corresponding second user equipment. For example, the first user equipment and the second user equipment have established a communication connection through the network device. The first user equipment sends the captured video information about the first target object to the network device, which performs target tracking on the first target object in the video, determines the transfer matrix information of the first target object in each video frame, and sends the transfer matrix information to the first user equipment and the second user equipment. The first user equipment and the second user equipment then superimpose the first marker information based on the transfer matrix information sent by the network device, where the first marker information includes the operation instruction information that the second user equipment generates from the second user's operations on the first target object.
For example, a first user holds augmented reality glasses, a second user holds a tablet computer, and the augmented reality glasses and the tablet computer have established a communication connection through a network device (the cloud). The first user films the first target object (such as part A on a workbench) in real time, obtains video information about part A, and sends the video information to the network device. The network device receives the video information about part A and, using a target tracking algorithm, determines the transfer matrix information of part A in each video frame; it then returns the transfer matrix information to the augmented reality glasses. The augmented reality glasses receive the transfer matrix information and, while presenting the video information, superimpose the corresponding marker information in real time at the corresponding position in the video according to the transfer matrix information. The marker information includes operation instruction information such as the second user's installation guidance for part A; the operation instruction information may be generated on the tablet computer, or generated by the network device from the second user's operations uploaded by the tablet computer.
Of course, those skilled in the art will understand that the operation instruction information described above is only an example; other existing or future operation instruction information, where applicable to this application, also falls within the scope of protection of this application and is incorporated herein by reference.
FIG. 6 shows a method for remote assistance based on augmented reality at a network device according to yet another aspect of this application, where the method includes step S41, step S42, step S43, and step S44. In step S41, the network device receives video information about a first target object sent by the first user equipment, where the video information is captured in real time through the camera device in the first user equipment. In step S42, the network device performs a target tracking operation on the first target object in the video information to determine the first transfer matrix information corresponding to the first target object in each video frame of the video information. In step S43, the network device sends the first transfer matrix information to the first user equipment. In step S44, the network device sends the video information and the first transfer matrix information to a second user equipment that belongs to the same remote assistance task as the first user equipment. The network device is a server with sufficient computing power that is mainly responsible for forwarding video, audio, and marker information data. It also runs computer vision and image processing algorithms: for example, when video or audio information reaches the network device, the network device tracks the target object (such as the first target object) with a tracking algorithm and then returns the tracking result information to the user equipment.
For example, a first user holds augmented reality glasses and a second user holds a tablet computer, and the augmented reality glasses and the tablet computer have established a communication connection through a network device (the cloud). The augmented reality glasses capture the first target object (such as part A on the operating platform) in real time, obtain video information related to part A, and send the video information to the network device. The network device receives the video information related to part A and, according to a target tracking algorithm, determines the transfer matrix information of part A in each video frame of the video information. The network device then returns the transfer matrix information to the augmented reality glasses and sends the transfer matrix information together with the video information to the tablet computer, wherein the augmented reality glasses and the tablet computer communicate through the network device to perform the same remote assistance task (for example, installation guidance for part A). The augmented reality glasses receive the transfer matrix information and, while presenting the video information, superimpose the corresponding marker information in real time at the corresponding position in the video according to the transfer matrix information. The marker information includes operation instruction information such as the second user's installation guidance for part A; the operation instruction information may be generated on the tablet computer, or may be generated by the network device according to the second user's operations uploaded by the tablet computer. The tablet computer receives the transfer matrix information and the video information sent by the network device; when presenting the video information, it determines the position of part A in each video frame according to the transfer matrix information and superimposes marker information about part A at that position, such as installation guidance for part A or other operation instruction information.
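How a receiving device might turn per-frame transfer matrix information into a marker position can be sketched as follows. The sketch assumes that each transfer matrix is a 3×3 planar homography mapping a marker's anchor point in a reference frame to the current frame; the description does not state the matrix form, and the function names are hypothetical.

```python
def apply_transfer_matrix(matrix, point):
    """Map an (x, y) point through a 3x3 matrix in homogeneous coordinates."""
    x, y = point
    u = matrix[0][0] * x + matrix[0][1] * y + matrix[0][2]
    v = matrix[1][0] * x + matrix[1][1] * y + matrix[1][2]
    w = matrix[2][0] * x + matrix[2][1] * y + matrix[2][2]
    if abs(w) < 1e-12:
        raise ValueError("point maps to infinity under this matrix")
    return (u / w, v / w)


def marker_positions(anchor, matrices):
    """Return where the marker should be drawn in each video frame."""
    return [apply_transfer_matrix(m, anchor) for m in matrices]
```

With this convention, a marker placed once by the second user keeps following part A as long as a matrix arrives for every frame.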
In some embodiments, in step S42, the network device reconstructs video information of the first target object according to the video information and other video information of the first target object, and determines the first transfer matrix information corresponding to the first target object in each video frame of the video information by performing a target tracking operation on the first target object in the reconstructed video information. Here, the network device is mainly responsible for forwarding data such as video, audio, and marker information, and also has some computer vision and image processing capabilities: when video/audio information is sent to the network device, the network device processes the video information by means of target tracking, target recognition, reconstruction, pose estimation, and computer graphics algorithms (such as virtual object rendering and point cloud processing (stitching, down-/super-sampling, matching, meshing, etc.)) and returns the processing result information to the user equipment. For example, the network device reconstructs the video information uploaded by the first user together with videos uploaded by other users to generate overall video information for the first target object, and then performs target tracking on the first target object in the reconstructed video information.
For example, a first user holds augmented reality glasses, a second user holds a tablet computer, and other users (such as a third user) hold a third user equipment (such as augmented reality glasses or a tablet computer). The augmented reality glasses, the third user equipment, and the tablet computer have established a communication connection through a network device (the cloud) and are performing the same remote assistance task (for example, installation guidance for part A). The augmented reality glasses and the third user equipment are both capturing video information related to part A: the augmented reality glasses mainly capture the left half of part A, the third user equipment mainly captures the right half of part A, and the two views overlap to some degree. The augmented reality glasses capture the first target object (such as part A on the operating platform) in real time, obtain first video information related to the left half of part A, and send the first video information to the network device; the third user captures part A in real time, obtains third video information related to the right half of part A, and sends the third video information to the network device. The network device receives the first video information and the third video information related to part A, obtains reconstructed video information containing the whole of part A from the first video information and the third video information by means of a computer vision algorithm, and determines the transfer matrix information of part A in each video frame of the reconstructed video information according to a target tracking algorithm. The network device then returns the transfer matrix information and the reconstructed video information to the augmented reality glasses, the third user equipment, and the tablet computer. The third user equipment receives the transfer matrix information and the reconstructed video information and, while presenting the reconstructed video information, superimposes the corresponding marker information in real time at the corresponding position in the video according to the transfer matrix information. The marker information includes operation instruction information such as the second user's installation guidance for part A; the operation instruction information may be generated on the tablet computer, or may be generated by the network device according to the second user's operations uploaded by the tablet computer. In other embodiments, the third user equipment calculates, according to a computer vision algorithm, transfer matrix information of the position of the right half of part A in the reconstructed video information relative to the third video information; the third user equipment then presents the third video information while superimposing the corresponding marker information at the corresponding position.
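The alternative embodiment, in which the third user equipment re-expresses positions from the reconstructed video in the coordinates of its own third video, can be sketched as a composition of matrices. This is only an illustration under strong assumptions: all matrices are taken to be 3×3 homogeneous transforms, the mapping from reconstructed-frame coordinates to local-frame coordinates (`recon_to_local`) is taken to be known and constant, and every name is hypothetical. The description leaves the actual computer vision algorithm open.

```python
def matmul3(a, b):
    """Multiply two 3x3 matrices given as nested lists."""
    return [[sum(a[i][k] * b[k][j] for k in range(3)) for j in range(3)]
            for i in range(3)]


def translation(dx, dy):
    """Build a 3x3 matrix that shifts points by (dx, dy)."""
    return [[1.0, 0.0, dx], [0.0, 1.0, dy], [0.0, 0.0, 1.0]]


def to_local_view(recon_matrices, recon_to_local):
    """Re-express per-frame transfer matrices in local video coordinates.

    Each input matrix maps the marker anchor into the reconstructed frame;
    left-multiplying by `recon_to_local` maps it on into the local frame.
    """
    return [matmul3(recon_to_local, m) for m in recon_matrices]
```

For instance, if the right-half view starts 100 pixels into the reconstructed frame, `translation(-100, 0)` would serve as `recon_to_local` in this simplified model.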
In some embodiments, the method further includes step S45 (not shown). In step S45, the network device determines third transfer matrix information corresponding to a third target object in each video frame of the video information by performing a target tracking operation on the third target object in the video information, wherein the third target object and the first target object belong to the same remote assistance task, and sends the video information and the third transfer matrix information to a third user equipment corresponding to the third target object in the remote assistance task; wherein, in step S44, the network device sends the video information, the first transfer matrix information, and the third transfer matrix information to the second user equipment that belongs to the same remote assistance task as the first user equipment. Here, the third user holds the third user equipment, which includes but is not limited to an augmented reality device, a tablet computer, a PC, a mobile terminal, and the like. The following embodiments are described using a mobile terminal as an example; those skilled in the art should understand that these embodiments apply equally to other third user equipment such as augmented reality devices, tablet computers, and PCs.
For example, a first user holds augmented reality glasses, a second user holds a tablet computer, and a third user holds a mobile terminal. The augmented reality glasses, the tablet computer, and the mobile terminal have established a communication connection through a network device (the cloud) and are performing the same remote assistance task (for example, installation guidance for part A and part B on a workbench), with the augmented reality glasses responsible for capturing video information related to the workbench. The augmented reality glasses capture the first target object (such as part A on the operating platform) in real time and obtain video information related to part A on the workbench, the video frames of which also contain part B; the augmented reality glasses then send the video information to the network device. The network device receives the video information, obtains the initial positions of part A and part B through image recognition, and, according to a target tracking algorithm, calculates the first transfer matrix information of part A and the third transfer matrix information of part B in each video frame of the video information. The network device then returns the first transfer matrix information to the augmented reality glasses, sends the third transfer matrix information and the video information to the mobile terminal, and sends the first transfer matrix information, the third transfer matrix information, and the video information to the tablet computer. The augmented reality glasses receive the first transfer matrix information and, while presenting the video information, superimpose the corresponding marker information in real time at the corresponding position in the video according to the first transfer matrix information. The marker information includes operation instruction information such as the second user's installation guidance for part A; the operation instruction information may be generated on the tablet computer, or may be generated by the network device according to the second user's operations uploaded by the tablet computer. The mobile terminal receives the third transfer matrix information and the video information sent by the network device; when presenting the video information, it determines the position of part B in each video frame according to the third transfer matrix information and superimposes marker information about part B at that position, such as installation guidance for part B or other operation instruction information. The tablet computer receives the first transfer matrix information, the third transfer matrix information, and the video information sent by the network device; when presenting the video information, it determines the position of part A in each video frame according to the first transfer matrix information and superimposes marker information about part A at that position, such as installation guidance for part A, and it determines the position of part B in each video frame according to the third transfer matrix information and superimposes marker information about part B at that position, such as installation guidance for part B. The second user equipment may determine which object its current marker information applies to according to a selection operation of the second user.
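The routing in this multi-target example can be summarized in a short sketch: each assisted device receives only the matrices of its own target, the capturing device is not sent back its own video, and the assisting device receives everything. The function and parameter names are hypothetical, and the payload layout is an assumption made for illustration.

```python
def route_tracking_results(frames, matrices_by_target, capturing_device,
                           assisted_devices, helper_device):
    """Decide which payload each device in the task receives.

    matrices_by_target: e.g. {"part_A": [...], "part_B": [...]}
    assisted_devices:   e.g. {"glasses": "part_A", "phone": "part_B"}
    """
    outgoing = {}
    for device, target in assisted_devices.items():
        payload = {"matrices": {target: matrices_by_target[target]}}
        if device != capturing_device:
            # Devices that did not capture the video need the frames too.
            payload["frames"] = frames
        outgoing[device] = payload
    # The assisting (second) user equipment sees every target and the video.
    outgoing[helper_device] = {
        "frames": frames,
        "matrices": dict(matrices_by_target),
    }
    return outgoing
```

Keeping the routing table keyed by target object makes it straightforward to add further assisted devices to the same remote assistance task.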
FIG. 7 shows a method for performing remote assistance based on augmented reality at a third user equipment according to yet another aspect of the present application, wherein the method includes step S51 and step S52. In step S51, the third user equipment receives video information about a third target object, and third transfer matrix information corresponding to the third target object in each video frame of the video information, sent by a corresponding network device; in step S52, the third user equipment presents the video information and, according to the third transfer matrix information, superimposes corresponding third marker information on the third target object in each video frame of the video information, wherein the third marker information includes operation instruction information of a second user on the third target object through a second user equipment; wherein the video information is captured in real time by a camera device in a first user equipment, the first user equipment, the third user equipment, and the second user equipment belong to the same remote assistance task, and the first user equipment and the third user equipment each receive remote assistance from the second user equipment.
For example, a first user holds augmented reality glasses, a second user holds a tablet computer, and a third user holds a mobile terminal. The augmented reality glasses, the tablet computer, and the mobile terminal have established a communication connection through a network device (the cloud) and are performing the same remote assistance task (for example, installation guidance for part A and part B on a workbench), with the augmented reality glasses responsible for capturing video information related to the workbench. The augmented reality glasses capture the first target object (such as part A on the operating platform) in real time and obtain video information related to part A on the workbench, the video frames of which also contain part B; the augmented reality glasses then send the video information to the network device. The network device receives the video information, obtains the initial positions of part A and part B through image recognition, and, according to a target tracking algorithm, calculates the first transfer matrix information of part A and the third transfer matrix information of part B in each video frame of the video information. The network device then sends the third transfer matrix information and the video information to the mobile terminal. The mobile terminal receives the third transfer matrix information and the video information sent by the network device; when presenting the video information, it determines the position of part B in each video frame according to the third transfer matrix information and superimposes marker information about part B at that position, such as installation guidance for part B or other operation instruction information. The marker information includes operation instruction information such as the second user's installation guidance for part B; the operation instruction information may be generated on the tablet computer, or may be generated by the network device according to the second user's operations uploaded by the tablet computer. In other embodiments, the marker information further includes markings on the target object (such as drawn line segments or circles) based on the third user's operations collected by the mobile terminal, or feedback on the marker information sent by the tablet computer, such as asking a question within the marker information or circling text; while presenting the video information, the mobile terminal superimposes this auxiliary marker information at the position corresponding to the target object.
FIG. 8 shows a method for performing remote assistance based on augmented reality at a second user equipment according to yet another aspect of the present application, wherein the method includes step S61 and step S62. In step S61, the second user equipment receives video information about a first target object, and first transfer matrix information corresponding to the first target object in each video frame of the video information, sent by a corresponding network device; in step S62, the second user equipment presents the video information and, according to the first transfer matrix information, superimposes corresponding first marker information on the first target object in each video frame of the video information, wherein the first marker information includes operation instruction information of a second user on the first target object through the second user equipment; wherein the video information is captured in real time by a camera device in a first user equipment that belongs to the same remote assistance task as the second user equipment, or is reconstructed based on real-time video information about the first target object captured by the camera device and other video information of the first target object.
For example, a first user holds augmented reality glasses and a second user holds a tablet computer, and the augmented reality glasses and the tablet computer have established a communication connection through a network device (the cloud). The augmented reality glasses capture the first target object (such as part A on the operating platform) in real time, obtain video information related to part A, and send the video information to the network device. The network device receives the video information related to part A and, according to a target tracking algorithm, determines the transfer matrix information of part A in each video frame of the video information. The network device then returns the transfer matrix information to the augmented reality glasses and sends the transfer matrix information together with the video information to the tablet computer, wherein the augmented reality glasses and the tablet computer communicate through the network device to perform the same remote assistance task (for example, installation guidance for part A). The tablet computer receives the transfer matrix information and the video information sent by the network device; when presenting the video information, it determines the position of part A in each video frame according to the transfer matrix information and superimposes marker information about part A at that position, such as installation guidance for part A or other operation instruction information.
In some embodiments, the method further includes step S63 (not shown). In step S63, the second user equipment receives third transfer matrix information, sent by the network device, corresponding to a third target object in each video frame of the video information and, while presenting the video information, superimposes corresponding third marker information on the third target object in each video frame of the video information according to the third transfer matrix information, wherein the third marker information includes operation instruction information of the second user on the third target object through the second user equipment.
For example, a first user holds augmented reality glasses, a second user holds a tablet computer, and a third user holds a mobile terminal. The augmented reality glasses, the tablet computer, and the mobile terminal have established a communication connection through a network device (the cloud) and are performing the same remote assistance task (for example, installation guidance for part A and part B on a workbench), with the augmented reality glasses responsible for capturing video information related to the workbench. The augmented reality glasses capture the first target object (such as part A on the operating platform) in real time and obtain video information related to part A on the workbench, the video frames of which also contain part B; the augmented reality glasses then send the video information to the network device. The network device receives the video information, obtains the initial positions of part A and part B through image recognition, and, according to a target tracking algorithm, calculates the first transfer matrix information of part A and the third transfer matrix information of part B in each video frame of the video information. The network device then sends the first transfer matrix information, the third transfer matrix information, and the video information to the tablet computer. The tablet computer receives the first transfer matrix information, the third transfer matrix information, and the video information sent by the network device; when presenting the video information, it determines the position of part A in each video frame according to the first transfer matrix information and superimposes marker information about part A at that position, such as installation guidance for part A, and it determines the position of part B in each video frame according to the third transfer matrix information and superimposes marker information about part B at that position, such as installation guidance for part B. The marker information includes operation instruction information such as the second user's installation guidance for each part; the operation instruction information may be generated on the tablet computer, or may be generated by the network device according to the second user's operations uploaded by the tablet computer. The second user equipment may determine which object its current marker information applies to according to a selection operation of the second user.
FIG. 9 shows a method for performing remote assistance based on augmented reality at a network device according to yet another aspect of the present application, wherein the method includes step S71, step S72, step S73, and step S74. In step S71, the network device receives video information about a target object sent by a first user equipment, wherein the video information includes video information captured by a camera device in the first user equipment; in step S72, the network device determines transfer matrix information corresponding to the target object in each video frame of the video information by performing a target tracking operation on the target object in the video information; in step S73, the network device adds corresponding marker information to each video frame of the video information according to the transfer matrix information, wherein the marker information remains superimposed on the target object in each video frame of the video information, and the marker information includes operation instruction information of a second user on the target object sent by a corresponding second user equipment; in step S74, the network device sends the edited video information to the first user equipment and to the second user equipment that belongs to the same remote assistance task as the first user equipment.
For example, a first user holds augmented reality glasses and a second user holds a tablet computer, and the augmented reality glasses and the tablet computer have established a communication connection through a network device (the cloud). The augmented reality glasses capture the first target object (such as part A on the operating platform) in real time, obtain video information related to part A, and send the video information to the network device. The network device receives the video information related to part A and, according to a target tracking algorithm, determines the transfer matrix information of part A in each video frame of the video information. The network device then adds the marker information corresponding to part A (such as operating guidance for part A) at the corresponding position in each video frame according to the transfer matrix information and sends the edited video frames to the augmented reality glasses and the tablet computer, wherein the augmented reality glasses and the tablet computer communicate through the network device to perform the same remote assistance task (for example, installation guidance for part A). The augmented reality glasses receive and present the video information, in which the corresponding marker information is already superimposed in real time at the corresponding position. The marker information includes operation instruction information such as the second user's installation guidance for part A; the operation instruction information may be generated on the tablet computer, or may be generated by the network device according to the second user's operations uploaded by the tablet computer. Similarly, the tablet computer receives and presents the video information, in which marker information about part A, such as installation guidance for part A or other operation instruction information, is superimposed at the corresponding position.
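Step S73, in which the network device itself writes the marker into each video frame, can be sketched with frames modeled as small grids of pixel values. This is a toy model under stated assumptions: the transfer matrix is taken to be a 3×3 homography, the marker is a single pixel, and the names `project` and `burn_marker` are hypothetical.

```python
def project(matrix, point):
    """Map an (x, y) point through a 3x3 matrix in homogeneous coordinates."""
    x, y = point
    u = matrix[0][0] * x + matrix[0][1] * y + matrix[0][2]
    v = matrix[1][0] * x + matrix[1][1] * y + matrix[1][2]
    w = matrix[2][0] * x + matrix[2][1] * y + matrix[2][2]
    return (u / w, v / w)


def burn_marker(frame, matrix, anchor, value=255):
    """Return a copy of `frame` with the marker pixel written in.

    `frame` is a list of rows of pixel values; the input is not modified.
    Markers that project outside the frame are silently skipped.
    """
    height, width = len(frame), len(frame[0])
    x, y = project(matrix, anchor)
    col, row = int(round(x)), int(round(y))
    edited = [list(r) for r in frame]
    if 0 <= row < height and 0 <= col < width:
        edited[row][col] = value
    return edited


def edit_video(frames, matrices, anchor):
    """Apply the marker to every frame using its own transfer matrix."""
    return [burn_marker(f, m, anchor) for f, m in zip(frames, matrices)]
```

Because the marker is part of the edited frames, the receiving devices in this variant only need to play the video and do not need the transfer matrix information themselves.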
FIG. 10 shows a method for performing remote assistance based on augmented reality according to one aspect of the present application, wherein the method includes:
a first user equipment captures video information about a target object in real time through a camera device in the first user equipment, determines transfer matrix information corresponding to the target object in each video frame of the video information by performing a target tracking operation on the target object in the video information, and superimposes corresponding marker information on the target object according to the transfer matrix information, wherein the marker information includes operation instruction information of a second user on the target object sent by a corresponding second user equipment;
the first user equipment sends the video information to the second user equipment;
the second user equipment receives and presents the video information and keeps the corresponding marker information superimposed on the target object in each video frame of the video information, wherein the marker information includes operation instruction information of the second user on the target object through the second user equipment.
FIG. 11 illustrates a method for remote assistance based on augmented reality according to another aspect of the present application, wherein the method includes:
the first user equipment captures video information about a first target object in real time through a camera device in the first user equipment, and sends the video information to a corresponding network device;
the network device receives the video information, determines the first transfer matrix information corresponding to the first target object in each video frame of the video information by performing a target tracking operation on the first target object in the video information, sends the first transfer matrix information to the first user equipment, and sends the video information and the first transfer matrix information to a second user equipment that belongs to the same remote assistance task as the first user equipment;
the first user equipment receives the first transfer matrix information and, according to the first transfer matrix information, superimposes corresponding first marker information on the first target object, wherein the first marker information includes operation instruction information of a second user on the first target object, sent by the corresponding second user equipment;
the second user equipment receives the video information and the first transfer matrix information, presents the video information, and, according to the first transfer matrix information, superimposes the corresponding first marker information on the first target object in each video frame of the video information, wherein the video information is captured in real time by the camera device in the first user equipment that belongs to the same remote assistance task as the second user equipment, or is reconstructed from the real-time video information about the first target object captured by the camera device together with other video information of the first target object.
FIG. 12 illustrates a method for remote assistance based on augmented reality according to yet another aspect of the present application, wherein the method includes:
the first user equipment captures video information about a first target object in real time through a camera device in the first user equipment, and sends the video information to a corresponding network device;
the network device receives the video information, determines the first transfer matrix information corresponding to the first target object in each video frame of the video information by performing a target tracking operation on the first target object in the video information, and sends the first transfer matrix information to the first user equipment;
the first user equipment receives the first transfer matrix information and, according to the first transfer matrix information, superimposes corresponding first marker information on the first target object, wherein the first marker information includes operation instruction information of a second user on the first target object, sent by the corresponding second user equipment;
the network device determines the third transfer matrix information corresponding to a third target object in each video frame of the video information by performing a target tracking operation on the third target object in the video information, wherein the third target object and the first target object belong to the same remote assistance task;
the network device sends the video information and the third transfer matrix information to a third user equipment that corresponds to the third target object in the remote assistance task, and sends the video information, the first transfer matrix information and the third transfer matrix information to a second user equipment that belongs to the same remote assistance task as the first user equipment;
the third user equipment receives the video information and the third transfer matrix information;
the third user equipment presents the video information and, according to the third transfer matrix information, superimposes corresponding third marker information on the third target object in each video frame of the video information;
the second user equipment receives the video information, the first transfer matrix information and the third transfer matrix information and, while presenting the video information, superimposes the corresponding first marker information on the first target object in each video frame according to the first transfer matrix information, and superimposes the corresponding third marker information on the third target object in each video frame according to the third transfer matrix information.
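The distribution of data in the FIG. 12 method can be summarized in a small sketch. This is only an illustration of which device receives what according to the steps above; the function name `build_dispatch` and the dictionary keys are hypothetical and do not appear in the application.

```python
from typing import Any, Dict


def build_dispatch(video: Any, first_matrix: Any, third_matrix: Any) -> Dict[str, Dict[str, Any]]:
    """Return, per device, the payload the network device sends in the FIG. 12 flow.

    All names here are illustrative assumptions, not part of the application.
    """
    return {
        # The first user equipment captured the video itself, so it only
        # needs the tracking result for the first target object.
        "first_user_equipment": {"first_matrix": first_matrix},
        # The third user equipment handles only the third target object.
        "third_user_equipment": {"video": video, "third_matrix": third_matrix},
        # The second user equipment overlays markers on both target objects.
        "second_user_equipment": {
            "video": video,
            "first_matrix": first_matrix,
            "third_matrix": third_matrix,
        },
    }
```

In this sketch the second user equipment is the only participant that receives both sets of transfer matrix information, which matches its role of displaying markers on both target objects.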
FIG. 13 shows a first user equipment for remote assistance based on augmented reality according to one aspect of the present application, wherein the equipment includes a real-time shooting module 11, a target tracking module 12 and an overlay display module 13. The real-time shooting module 11 is configured to capture video information about a target object in real time through a camera device in the first user equipment; the target tracking module 12 is configured to determine the transfer matrix information corresponding to the target object in each video frame of the video information by performing a target tracking operation on the target object in the video information; the overlay display module 13 is configured to superimpose corresponding marker information on the target object according to the transfer matrix information, wherein the marker information includes operation instruction information of a second user on the target object, sent by the corresponding second user equipment.
Specifically, the real-time shooting module 11 is configured to capture video information about a target object in real time through the camera device in the first user equipment. For example, the target object may be a target object corresponding to image information in a video frame marked by the first user, a target object corresponding to image information in a video frame that was marked by the second user and received by the first user, or a target object that the first user equipment determines from image information input by the first user. The first user equipment includes a camera device, through which it captures video information about the target object in real time.
The target tracking module 12 is configured to determine the transfer matrix information corresponding to the target object in each video frame of the video information by performing a target tracking operation on the target object in the video information. The transfer matrix information includes the correspondence, obtained by the first user equipment according to a target tracking algorithm, between the target object in the current video frame and in previous video frames. Target tracking algorithms include, but are not limited to, the kernelized correlation filter (KCF) tracking algorithm, dense optical flow tracking, sparse optical flow tracking, Kalman filtering tracking, and multiple instance learning tracking. Taking the KCF algorithm as an example, KCF solves the tracking problem by learning a kernelized regularized least squares (KRLS) linear classifier. The movement of the target in the scene can be regarded as the vector sum of its horizontal movement and its vertical movement. The KCF algorithm introduces the concept of dense sampling and treats all samples as cyclic shifts of a base sample. In that case the Gaussian kernel is highly structured, that is, the kernel matrix is a circulant matrix. By the circular convolution principle, every product with a circulant matrix can be converted into a convolution with the first row vector of that matrix. With the help of the DFT (discrete Fourier transform), the spatial-domain convolution can then be computed quickly as an element-wise product in the frequency domain.
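The circulant-matrix property mentioned above can be checked numerically. The following is a minimal NumPy sketch, not a KCF tracker: it only shows that multiplying by a circulant matrix built from cyclic shifts of its first row gives the same result as an element-wise product in the frequency domain. The function names are illustrative assumptions. With the row-shift convention used here, the product is a circular correlation with the first row (a convolution with its index-reversed copy), which is why the complex conjugate appears.

```python
import numpy as np


def circulant_from_first_row(c: np.ndarray) -> np.ndarray:
    """Build the n x n circulant matrix whose row i is c cyclically shifted right by i."""
    return np.stack([np.roll(c, i) for i in range(len(c))])


def circulant_matvec_fft(c: np.ndarray, x: np.ndarray) -> np.ndarray:
    """Compute circulant_from_first_row(c) @ x using only FFTs (O(n log n)).

    For real c, the product equals ifft(conj(fft(c)) * fft(x)).
    """
    return np.real(np.fft.ifft(np.conj(np.fft.fft(c)) * np.fft.fft(x)))


rng = np.random.default_rng(0)
c = rng.standard_normal(8)
x = rng.standard_normal(8)

dense = circulant_from_first_row(c) @ x   # O(n^2) reference computation
fast = circulant_matvec_fft(c, x)         # frequency-domain computation
print(np.allclose(dense, fast))
```

The dense matrix is never needed in practice; only the first row is stored and transformed, which is the source of the speed-up the paragraph refers to.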
Of course, those skilled in the art should understand that the above tracking algorithms are only examples; other existing or future tracking algorithms, if applicable to the present application, should also fall within the scope of protection of the present application and are incorporated herein by reference.
The overlay display module 13 is configured to superimpose corresponding marker information on the target object according to the transfer matrix information, wherein the marker information includes operation instruction information of the second user on the target object, sent by the corresponding second user equipment. Here, the marker information includes operation instruction information about the target object that the first user equipment receives from the second user equipment, such as virtual operation information on the target object. For example, the first user equipment receives operation instruction information about the target object sent by the second user equipment; while tracking the target according to the transfer matrix information, the first user equipment superimposes the marker information at the position corresponding to the target object according to that same transfer matrix information. For augmented reality glasses, the marker information is superimposed at the corresponding position on the lens of the glasses, and that position is calculated by the augmented reality glasses or the network device according to the target tracking algorithm; for a PC, tablet computer, mobile terminal or the like, the marker information is superimposed at the position corresponding to the target object in the current video frame. The first user equipment and the second user equipment may establish a communication connection directly or through a network device. The following embodiments are described using a direct connection between the first and second user equipment as an example; those skilled in the art should understand that these embodiments apply equally to other connection modes, such as a connection established through a network device.
For example, a first user holds augmented reality glasses, a second user holds a tablet computer, and the glasses and the tablet have established a communication connection. The glasses and the tablet have already exchanged a video stream or images of the target object, and the glasses have received from the tablet operation instruction information about the target object in a previous video frame. Suppose the target object is a part on a workbench. The target object may be determined by the first user equipment based on a selection operation of the first user (such as circling it), may be determined by the second user equipment based on a selection operation of the second user and then received by the first user equipment, or may be determined by the first user equipment by recognizing initial image information of the target object. The corresponding operation instruction information includes virtual operation information that the second user equipment obtains by, for example, recognizing the second user's gestures for operating the part. The augmented reality glasses capture current video information about the target object in real time through the camera, and then use the target tracking algorithm to calculate the transfer matrix information of the target object in the current video frame relative to the target object in the previous video frame. The glasses then determine the position of the target object in the current video frame from the transfer matrix information and superimpose the corresponding marker information at that position, for example by displaying the operation instruction information corresponding to the second user's gesture at the position of the part on the workbench in the current video frame.
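The step of locating the marker in the current frame can be sketched as follows. The application does not specify the form of the transfer matrix; this sketch assumes it is a 3x3 homogeneous 2D transform that maps pixel coordinates in the previous frame to pixel coordinates in the current frame. The function name `apply_transfer_matrix` is hypothetical.

```python
import numpy as np


def apply_transfer_matrix(H: np.ndarray, points) -> np.ndarray:
    """Map (x, y) marker anchor points from the previous frame into the current frame.

    H is assumed (not stated in the application) to be a 3x3 homogeneous transform.
    """
    pts = np.asarray(points, dtype=float)
    homogeneous = np.hstack([pts, np.ones((len(pts), 1))])   # (x, y) -> (x, y, 1)
    mapped = homogeneous @ H.T
    return mapped[:, :2] / mapped[:, 2:3]                    # back to (x, y)


# Example: the tracked part moved 5 px right and 3 px up between frames.
H = np.array([[1.0, 0.0, 5.0],
              [0.0, 1.0, -3.0],
              [0.0, 0.0, 1.0]])
marker_anchor_prev = [[10.0, 20.0]]
print(apply_transfer_matrix(H, marker_anchor_prev))  # [[15. 17.]]
```

The marker is then drawn at the mapped coordinates, so it stays attached to the target object as the camera or the object moves.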
Of course, those skilled in the art should understand that the above marker information and/or operation instruction information are only examples; other existing or future marker information and/or operation instruction information, if applicable to the present application, should also fall within the scope of protection of the present application and are incorporated herein by reference.
In some embodiments, the equipment further includes a video sending module 14 (not shown). The video sending module 14 is configured to send the video information to the second user equipment. For example, the first user equipment captures current video information about the target object in real time and sends it to the second user equipment, either directly or through a network device. The video information includes image information captured by the first user equipment through the camera device and may also include audio information captured through a microphone device; the image and audio information are multiplexed and compressed into a video/audio stream by a compression algorithm. The first user equipment transmits the compressed video/audio stream to the second user equipment over a network transport protocol such as the User Datagram Protocol (UDP), the Transmission Control Protocol (TCP) or the Real-time Transport Protocol (RTP).
For example, the augmented reality glasses capture video information about the current target object in real time and send it directly to the tablet computer, or send it to the cloud, which forwards it to the tablet computer. The tablet computer receives and presents the video information, helping the second user continue to guide the first user in operations such as machining the part on the workbench.
In some embodiments, the video sending module 14 is configured to send the video information and the transfer matrix information to the second user equipment. For example, when the first user equipment sends the video information to the second user equipment, it also sends the transfer matrix information obtained from the target tracking operation, so that the second user equipment can track the target object while presenting the video information.
For example, the augmented reality glasses capture video information about the current target object in real time, perform a target tracking operation on the target object in that video information using the preceding video frames, and determine, for each video frame, the transfer matrix information of the target object relative to the previous video frame. The glasses then send the video information, together with the transfer matrix information corresponding to each video frame, to the tablet computer either directly or through the cloud.
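Because each transfer matrix in this example is relative to the previous frame, a receiver that wants the position of a marker placed in an earlier frame has to chain the per-frame matrices. The sketch below illustrates this under the assumption (not stated in the application) that each transfer matrix is a 3x3 homogeneous transform from frame k-1 to frame k; the function name is hypothetical.

```python
import numpy as np


def accumulate_transfer_matrices(per_frame):
    """Given matrices H_1, H_2, ... (frame k-1 -> frame k), return the cumulative
    transforms from the reference frame 0 to each frame k.
    """
    total = np.eye(3)
    cumulative = []
    for H in per_frame:
        total = H @ total          # apply the newest motion after all earlier ones
        cumulative.append(total.copy())
    return cumulative


def translation(dx: float, dy: float) -> np.ndarray:
    """Helper: homogeneous matrix for a pure translation (illustrative only)."""
    return np.array([[1.0, 0.0, dx], [0.0, 1.0, dy], [0.0, 0.0, 1.0]])


steps = [translation(2.0, 0.0), translation(0.0, 3.0)]
print(accumulate_transfer_matrices(steps)[-1][:2, 2])  # [2. 3.]
```

Sending the matrices alongside the video, as described above, lets the tablet compute these cumulative transforms without running the tracker itself.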
In some embodiments, the equipment further includes an operation receiving module 15 (not shown). The operation receiving module 15 is configured to receive continued operation instruction information on the target object, which the second user provides based on the video information and the second user equipment sends. For example, the second user equipment generates the continued operation instruction information from the second user's further operations on the target object (such as drawing line segments, circles or other marks), or by recognizing the second user's gestures through gesture recognition. The second user equipment then sends the continued operation instruction information to the first user equipment to help the first user continue operating on the target object.
For example, the augmented reality glasses send video information about the target object captured in real time to the tablet computer, which receives and presents it. The tablet computer then performs target tracking in each video frame of the received video stream to obtain the position of the target object in the frame. In some embodiments, the tablet computer highlights the target object in the video frame by means of line segments, circles, locally increased brightness or the like. The second user guides the first user in machining the part by making marks on the tablet computer or by making gestures within the field of view of the tablet's camera. The tablet computer takes the captured marks as the continued operation instruction information, or performs gesture recognition on the captured gestures and takes the recognized gestures as the continued operation instruction information. The tablet computer then sends the continued operation instruction information to the augmented reality glasses, which receive it and superimpose it at the corresponding position.
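One of the highlighting options mentioned above, locally increasing brightness, can be sketched in a few lines of NumPy. This is an illustration only: it assumes the frame is an 8-bit image array and that the tracker has already produced an axis-aligned bounding box; the function name and the `gain` parameter are hypothetical.

```python
import numpy as np


def highlight_region(frame: np.ndarray, box, gain: float = 1.5) -> np.ndarray:
    """Return a copy of an 8-bit frame with the box (x0, y0, x1, y1) brightened.

    Pixels outside the box are unchanged; brightened values are clipped to 255.
    """
    x0, y0, x1, y1 = box
    out = frame.astype(np.float32)
    out[y0:y1, x0:x1] *= gain                      # brighten only the target region
    return np.clip(out, 0, 255).astype(np.uint8)


frame = np.full((4, 4, 3), 100, dtype=np.uint8)    # uniform grey test frame
result = highlight_region(frame, (1, 1, 3, 3))
print(result[1, 1, 0], result[0, 0, 0])            # 150 100
```

The original frame is left untouched, so the same frame can still be forwarded or re-rendered with a different highlight.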
As another example, the augmented reality glasses send video information about the target object captured in real time to the tablet computer, together with the transfer matrix information corresponding to each video frame, and the tablet computer receives and presents the video information. The tablet computer then determines the position of the target object in each video frame from the received transfer matrix information. In some embodiments, the tablet computer highlights the target object in the video frame by means of line segments, circles, locally increased brightness or the like. The second user guides the first user in machining the part by making marks on the tablet computer or by making gestures within the field of view of the tablet's camera. The tablet computer takes the captured marks as the continued operation instruction information, or performs gesture recognition on the captured gestures and takes the recognized gestures as the continued operation instruction information. The tablet computer then sends the continued operation instruction information to the augmented reality glasses, which receive it and superimpose it at the corresponding position.
Of course, those skilled in the art should understand that the above continued operation instruction information is only an example; other existing or future continued operation instruction information, if applicable to the present application, should also fall within the scope of protection of the present application and is incorporated herein by reference.
In some embodiments, the equipment further includes a camera control module 16 (not shown). The camera control module 16 is configured to receive camera control instruction information of the second user for the camera device, sent by the second user equipment; to adjust the camera parameter information of the camera device according to the camera control instruction information; to capture video information about the target object in real time through the adjusted camera device; and to send the video information captured by the adjusted camera device to the second user equipment. For example, the camera control instruction information includes instructions for adjusting hardware parameters of the camera device of the first user equipment, and the camera parameter information includes, but is not limited to, resolution, pixel depth, maximum frame rate, exposure mode and shutter speed, pixel size, and spectral response characteristics. For example, the first user equipment receives camera control instruction information, sent by the second user equipment, with which the second user adjusts the camera device of the first user; it adjusts the camera parameter information of the camera device accordingly, captures video information of the current target object in real time through the adjusted camera device, and sends that video information to the second user equipment.
For example, as shown in FIG. 3, Figure A shows the real-time video information received by the second user, in which the target object is the mouse pad on the desk. The second user wants to observe the target object in more detail and either uses the settings icon in the upper right corner of the video or zooms in directly by spreading two fingers on the screen. Based on the second user's operation, the tablet computer generates camera control instruction information for focusing on the target object and sends it to the augmented reality glasses. The glasses receive the camera control instruction information, adjust the relevant camera parameters of the camera device, such as resolution and focal length, capture adjusted video information about the target object, and send that video information to the tablet computer. Figure B shows the enlarged video information about the target object as received and presented by the tablet computer.
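The exchange above can be pictured as a small message carrying only the parameters the second user wants to change. The application does not define a message format, so everything in this sketch (class names, field names, default values) is a hypothetical illustration of applying a partial camera control instruction to the current camera parameters.

```python
from dataclasses import dataclass, replace
from typing import Optional, Tuple


@dataclass(frozen=True)
class CameraParams:
    """Current camera settings on the first user equipment (illustrative fields)."""
    resolution: Tuple[int, int] = (1280, 720)
    focal_length_mm: float = 4.0
    max_frame_rate: int = 30


@dataclass(frozen=True)
class CameraControlInstruction:
    """Instruction sent by the second user equipment; None means 'leave unchanged'."""
    resolution: Optional[Tuple[int, int]] = None
    focal_length_mm: Optional[float] = None
    max_frame_rate: Optional[int] = None


def apply_instruction(params: CameraParams, instruction: CameraControlInstruction) -> CameraParams:
    """Return new camera parameters with only the requested fields changed."""
    changes = {name: value for name, value in vars(instruction).items() if value is not None}
    return replace(params, **changes)


# The second user zooms in: only the focal length is requested to change.
zoom_in = CameraControlInstruction(focal_length_mm=8.0)
print(apply_instruction(CameraParams(), zoom_in))
```

Keeping unspecified fields as `None` means a zoom request does not accidentally reset resolution or frame rate on the glasses.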
Of course, those skilled in the art should understand that the above camera control instruction information and/or camera parameter information are only examples; other existing or future camera control instruction information and/or camera parameter information, if applicable to the present application, should also fall within the scope of protection of the present application and are incorporated herein by reference.
In some embodiments, the marker information further includes auxiliary marking information with which the first user marks the target object through the first user equipment. The auxiliary marking information includes marks on the target object (such as drawn line segments or circles) that the first user equipment captures from the first user's operations, or feedback on the marker information sent by the second user equipment, such as asking a question within the marker information or circling some text. For example, the first user equipment generates auxiliary marking information about the target object according to the first user's operation and sends it to the second user equipment for further remote interaction.
For example, while capturing video information about the target object, the first user circles the specific position of the target object, and the first user equipment generates the corresponding auxiliary marking information from that operation. The first user equipment sends the auxiliary marking information to the second user equipment together with the video information. The second user equipment receives both, calculates the position of the auxiliary marking information from its initial position in the video frame and a target tracking algorithm, and superimposes the auxiliary marking information at the corresponding position in each video frame while presenting the video information. As another example, the first user equipment calculates the transfer matrix information of the auxiliary marking information in each video frame according to the target tracking algorithm, and sends the video information, the auxiliary marking information and the corresponding transfer matrix information to the second user equipment; after receiving them, the second user equipment superimposes the auxiliary marking information at the corresponding position according to the transfer matrix information while presenting the video information.
As another example, after the augmented reality glasses superimpose the second user's operation instruction information on the target object at the corresponding position, the first user may have a question about that operation instruction information and circle the part in question, or may have completed the instructed operation and, wanting further instructions, tap a next-step prompt at the position of the target object. Based on the first user's operation, the augmented reality glasses generate question information about the operation instruction information, or next-step request information, as the auxiliary marking information and send it to the tablet computer. The tablet computer receives the auxiliary marking information and superimposes it at the corresponding position, and the second user responds to it with continued operation instruction information, such as an answer to the question or the next operation instruction. The tablet computer sends the continued operation instruction information to the augmented reality glasses, which superimpose it on the video information. The continued operation instruction information may include the auxiliary marking information, such as what the earlier question was or the next-step prompt.
Of course, those skilled in the art will understand that the above auxiliary mark information is merely an example; other existing or future auxiliary mark information, if applicable to this application, should also fall within the scope of protection of this application and is incorporated herein by reference.
In some embodiments, the target object includes a paper document under discussion, and the second user's operation instruction information for the target object includes one or more pieces of annotation position information made by the second user in a video frame of the paper document under discussion. For example, an annotation may be a mark such as an underline or a circle at some position in the document, or an annotation attached to a piece of text (such as its pinyin, an explanation, or related content).
For example, the first user wears augmented reality glasses and reads a paper document through them, the second user holds a tablet computer, and a communication connection has been established between the tablet computer and the augmented reality glasses. The augmented reality glasses capture video information of the paper document under discussion through a camera device and send it to the tablet computer. The tablet computer receives the video information and, based on one or more annotation operations by the second user on the document under discussion, generates corresponding operation instruction information, for example operation instruction information containing an error prompt position indicating that the corresponding position in the document contains an error. The tablet computer sends the operation instruction information to the augmented reality glasses. The augmented reality glasses calculate, according to a target tracking algorithm, the position of the paper document under discussion in the video frames of the current video information (for example, its corresponding transfer matrix information) and, according to the transfer matrix information and the error prompt position in the operation instruction information, superimpose the corresponding one or more annotations in real time at the corresponding position on the paper document under discussion, prompting the first user that the current document contains an error at that position.
Of course, those skilled in the art will understand that the above target object and/or operation instruction information is merely an example; other existing or future target objects and/or operation instruction information, if applicable to this application, should also fall within the scope of protection of this application and are incorporated herein by reference.
In some embodiments, the overlay display module 13 is configured to generate rendering mark information according to the one or more pieces of annotation position information and, according to the transfer matrix information, superimpose the rendering mark information on the target object. The rendering mark information includes marks at the one or more annotation positions, such as highlight projections, underlines, or circles. For example, the first user equipment generates the corresponding rendering mark information from the annotation position information, in the paper document under discussion, of the one or more annotations in the operation instruction information; determines, according to the transfer matrix information, the position of the paper document under discussion in each video frame of the video information, and hence the position of the rendering mark in each video frame; and superimposes the rendering mark information at the corresponding position.
For example, the augmented reality glasses receive operation instruction information containing annotation information for the fifth character in the second line of the page currently being read in the paper document under discussion. According to the operation instruction information, the augmented reality glasses generate rendering mark information consisting of an underline directly beneath the fifth character in the second line of that page. The augmented reality glasses calculate, according to a target tracking algorithm, the position of the paper document under discussion in each video frame of the current video information and, according to the position of the rendering mark relative to the paper document, superimpose the underline beneath the fifth character in the second line of the page in each video frame.
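As a rough illustration of how "the fifth character in the second line" could be turned into an underline in page coordinates, the sketch below assumes a uniform character grid with a known origin, character width, and line height. All of these layout parameters, and the function name `underline_segment`, are hypothetical; the application does not describe how character positions are obtained. The resulting page coordinates would still have to be mapped into each video frame using the tracked position of the document.

```python
from typing import Tuple

Point = Tuple[float, float]


def underline_segment(row: int, col: int, origin: Point,
                      char_w: float, line_h: float) -> Tuple[Point, Point]:
    """Return the endpoints of an underline beneath one character.

    row and col are 1-based; origin is the top-left corner of the text
    block in page coordinates (y grows downward). Assumes every character
    occupies a char_w x line_h cell, which is a simplification.
    """
    if row < 1 or col < 1:
        raise ValueError("row and col are 1-based")
    left = origin[0] + (col - 1) * char_w
    bottom = origin[1] + row * line_h  # bottom edge of the character cell
    return (left, bottom), (left + char_w, bottom)


# Second line, fifth character, with hypothetical layout values.
start, end = underline_segment(2, 5, origin=(20.0, 30.0), char_w=12.0, line_h=18.0)
```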
Of course, those skilled in the art will understand that the above rendering mark information is merely an example; other existing or future rendering mark information, if applicable to this application, should also fall within the scope of protection of this application and is incorporated herein by reference.
In some embodiments, the device further includes a mark acquisition module 17 (not shown). The mark acquisition module 17 is configured to capture image information about the target object in real time through the camera device in the first user equipment, send the image information to the corresponding second user equipment, receive mark information about the target object, and superimpose the mark information on the target object, wherein the mark information includes operation instruction information, sent by the second user equipment, of the second user on the target object in the image information; wherein, in step S11, the first user equipment captures video information about the target object in real time through the camera device. For example, the first user equipment captures image information about the target object through the camera device and sends it to the second user equipment, which receives and presents it so that the second user can operate on the target object. Based on the second user's operation, the second user equipment generates mark information corresponding to the operation instruction information and sends it to the first user equipment. The first user equipment receives the mark information and superimposes it at the position corresponding to the target object in the image. Subsequently, the first user equipment captures a video stream of the target object through the camera device and, using a target tracking algorithm, superimposes the mark information in each video frame of the video stream.
For example, the augmented reality glasses capture image information of the current target object and send it to the tablet computer, which receives and presents it. The second user gives operation instructions for the target object based on the presented image information; the tablet computer collects the second user's operation instruction information, generates corresponding mark information, and sends it to the augmented reality glasses. The augmented reality glasses receive the mark information and superimpose it on the captured image information according to a target tracking algorithm. Subsequently, the augmented reality glasses continue to capture video information of the target object and superimpose the mark information in real time at the corresponding position according to the target tracking algorithm.
FIG. 14 shows second user equipment for performing remote assistance based on augmented reality according to another aspect of this application, the equipment including a video receiving module 21 and a video presentation module 22. The video receiving module 21 is configured to receive video information about a target object that is sent by the corresponding first user equipment and captured in real time by the camera device in the first user equipment. The video presentation module 22 is configured to present the video information and keep the corresponding mark information superimposed on the target object in each video frame of the video information, wherein the mark information includes operation instruction information of the second user on the target object through the second user equipment. For example, the second user equipment receives and presents image information or video information about the target object sent by the first user equipment, and collects the second user's operations to generate corresponding mark information. Subsequently, the second user equipment continues to receive video information about the target object from the first user equipment and, while presenting it, superimposes the previously determined mark information on the presented video.
For example, the first user holds augmented reality glasses, the second user holds a tablet computer, and a communication connection has been established between them. The augmented reality glasses and the tablet computer have already exchanged a video stream or images of the target object, and the augmented reality glasses have received from the tablet computer operation instruction information about the target object in an earlier video frame. Suppose the target object is a part on a workbench. The target object may be determined by the first user equipment based on a selection operation of the first user (such as circling it), may be determined by the second user equipment based on a selection operation of the second user and received by the first user equipment, or may be determined by the first user equipment by recognizing initial image information of the target object. The corresponding operation instruction information includes, for example, virtual operation information obtained by the second user equipment by recognizing the second user's gestures for operating on the part. The augmented reality glasses capture the current video information about the target object in real time through the camera and then use a target tracking algorithm to calculate the transfer matrix information of the target object in the current video frame relative to the target object in the previous video frame. The augmented reality glasses then determine the position of the target object in the current video frame according to the transfer matrix information and superimpose the corresponding mark information at that position, for example superimposing the operation instruction information corresponding to the second user's gesture at the position of the part on the workbench in the current video frame. At the same time, the augmented reality glasses send the video information to the tablet computer, which receives and presents it and, while presenting it, superimposes the earlier mark information at the corresponding position in the video information. In other embodiments, the augmented reality glasses also send auxiliary mark information to the tablet computer, wherein the auxiliary mark information includes marks on the target object (such as line segments or circles) collected by the augmented reality glasses from the first user's operations, or feedback on the mark information sent by the tablet computer, such as a question about the mark information or circled text; the tablet computer receives the auxiliary mark information and, while presenting the video information, superimposes it at the position corresponding to the target object.
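Because the transfer matrix in this example is described as relative to the previous video frame, placing a mark that was drawn on an earlier frame requires chaining the per-frame matrices. The sketch below shows one way to do that, assuming 3x3 matrices that act on column vectors; the representation and the names `matmul3` and `accumulate` are assumptions made for illustration.

```python
from typing import List

Matrix = List[List[float]]

IDENTITY: Matrix = [[1.0, 0.0, 0.0], [0.0, 1.0, 0.0], [0.0, 0.0, 1.0]]


def matmul3(a: Matrix, b: Matrix) -> Matrix:
    """Multiply two 3x3 matrices."""
    return [[sum(a[i][k] * b[k][j] for k in range(3)) for j in range(3)]
            for i in range(3)]


def accumulate(step_matrices: List[Matrix]) -> List[Matrix]:
    """For each frame, return the matrix from the initial frame to that frame.

    step_matrices[i] maps frame i coordinates to frame i+1 coordinates.
    """
    total = IDENTITY
    totals = []
    for step in step_matrices:
        total = matmul3(step, total)  # the newest step is applied last
        totals.append(total)
    return totals


# Hypothetical motion: shift 3 px in x, then scale by 2 about the origin.
shift_x = [[1.0, 0.0, 3.0], [0.0, 1.0, 0.0], [0.0, 0.0, 1.0]]
scale_2 = [[2.0, 0.0, 0.0], [0.0, 2.0, 0.0], [0.0, 0.0, 1.0]]
totals = accumulate([shift_x, scale_2])
```

Chaining this way accumulates tracking error over time, so a real system would presumably re-anchor against a reference frame periodically; the application does not say how drift is handled.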
Of course, those skilled in the art will understand that the above mark information is merely an example; other existing or future mark information, if applicable to this application, should also fall within the scope of protection of this application and is incorporated herein by reference.
In some embodiments, the device further includes a tracking execution module 23 (not shown). The tracking execution module 23 is configured to perform a target tracking operation on the target object in the video information; the video presentation module 22 is configured to present the video information and, according to the result information of the target tracking operation, superimpose the corresponding mark information on the target object in each video frame of the video information, wherein the mark information includes operation instruction information of the second user on the target object through the second user equipment. For example, the second user equipment receives video information about the target object sent by the first user equipment, performs a target tracking operation on the target object in the video information according to template information of the target object, and determines the position information of the target object in each video frame. The template information may be sent by the first user equipment to the second user equipment, selected in an initial video frame based on the second user's operation, or obtained by importing template information. Subsequently, when presenting the video information, the second user equipment superimposes the mark information at the position corresponding to the target object according to the result information of the target tracking. The mark information may be generated by the second user equipment from the second user's guidance on the target object in the initial video frame or image information, or from operation guidance that the second user gives on the target object in video information sent later.
For example, the tablet computer receives the video information sent by the augmented reality glasses and, according to template information of the part on the workbench, performs target tracking on the part in each video frame to obtain the position information of the part in each video frame. The template of the part may be imported by the second user, selected in an initialization frame, or sent by the augmented reality glasses. The tablet computer receives and presents the video information and generates corresponding mark information according to the second user's installation guidance for the part (for example, circling or pointing an arrow at the installation position, or a preset installation operation identified by gesture recognition). While presenting the video information, the tablet computer superimposes the mark information in real time in subsequent video frames according to the position information of the part in each video frame.
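The application does not name a particular tracking algorithm. As a stand-in, the sketch below locates a template in a grayscale frame by exhaustive sum-of-absolute-differences matching, which is one of the simplest template-based approaches; it is shown only to make "tracking according to template information" concrete, and the function name `find_template` and the tiny test frame are hypothetical.

```python
from typing import List, Tuple

Image = List[List[int]]  # grayscale pixels, row-major


def find_template(frame: Image, template: Image) -> Tuple[int, int]:
    """Return (row, col) of the top-left corner where the template fits best.

    Uses the sum of absolute differences (SAD); lower cost is a better match.
    """
    fh, fw = len(frame), len(frame[0])
    th, tw = len(template), len(template[0])
    if th > fh or tw > fw:
        raise ValueError("template is larger than the frame")
    best_pos, best_cost = (0, 0), None
    for r in range(fh - th + 1):
        for c in range(fw - tw + 1):
            cost = sum(abs(frame[r + i][c + j] - template[i][j])
                       for i in range(th) for j in range(tw))
            if best_cost is None or cost < best_cost:
                best_pos, best_cost = (r, c), cost
    return best_pos


# Hypothetical 4x5 frame with a 2x2 "part" at row 1, column 2.
frame = [
    [0, 0, 0, 0, 0],
    [0, 0, 9, 8, 0],
    [0, 0, 7, 6, 0],
    [0, 0, 0, 0, 0],
]
part_template = [[9, 8], [7, 6]]
position = find_template(frame, part_template)
```

Exhaustive matching like this does not cope with rotation, scale change, or lighting change, so it should be read as a placeholder for whatever tracker an implementation actually uses.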
Of course, those skilled in the art will understand that the above mark information is merely an example; other existing or future mark information, if applicable to this application, should also fall within the scope of protection of this application and is incorporated herein by reference.
In some embodiments, the video receiving module 21 is configured to receive video information about a target object that is sent by the corresponding first user equipment and captured in real time by the camera device in the first user equipment, together with the transfer matrix information corresponding to the target object in each video frame of the video information; the video presentation module 22 is configured to present the video information and, according to the transfer matrix information corresponding to the target object in each video frame, superimpose the corresponding mark information on the target object in each video frame of the video information, wherein the mark information includes operation instruction information of the second user on the target object through the second user equipment. For example, when sending the video information to the second user equipment, the first user equipment also sends the transfer matrix information obtained from the target tracking operation, so that the second user equipment can track the target object while presenting the video information. The second user equipment receives the video information and the transfer matrix information and, while presenting the video information, superimposes the mark information at the corresponding position in the video information according to the transfer matrix information.
For example, the augmented reality glasses capture video information about the current target object in real time and perform a target tracking operation on the target object in the video information with reference to earlier video frames, determining the transfer matrix information of the target object in each video frame relative to the previous video frame. The augmented reality glasses then send the video information and the transfer matrix information corresponding to each video frame to the tablet computer, either directly or via the cloud. The tablet computer receives the video information and the corresponding transfer matrix information and, while presenting the video information, superimposes the mark information at the corresponding position in the video information according to the transfer matrix information.
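Sending a transfer matrix alongside each video frame implies some side-channel message that ties a matrix to a frame. The application does not define a wire format, so the sketch below simply assumes a JSON object with hypothetical field names `frame` and `transfer_matrix`, and validates the matrix shape on receipt.

```python
import json
from typing import List, Tuple

Matrix = List[List[float]]


def encode_frame_meta(frame_index: int, matrix: Matrix) -> str:
    """Serialize the transfer matrix for one video frame (assumed format)."""
    return json.dumps({"frame": frame_index, "transfer_matrix": matrix})


def decode_frame_meta(payload: str) -> Tuple[int, Matrix]:
    """Parse a message and check that it carries a 3x3 matrix."""
    data = json.loads(payload)
    matrix = data["transfer_matrix"]
    if len(matrix) != 3 or any(len(row) != 3 for row in matrix):
        raise ValueError("expected a 3x3 transfer matrix")
    return data["frame"], matrix


# Round trip for a hypothetical frame 7.
sent = [[1.0, 0.0, 4.5], [0.0, 1.0, -2.0], [0.0, 0.0, 1.0]]
frame_index, received = decode_frame_meta(encode_frame_meta(7, sent))
```

Keying each matrix to a frame index lets the receiver pair it with the right frame even if video and metadata arrive over different paths, for example when one of them is relayed via the cloud.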
In some embodiments, the device further includes an operation acquisition module 24 (not shown). The operation acquisition module 24 is configured to acquire continued operation instruction information of the second user on the target object based on the video information, and send the continued operation instruction information to the first user equipment. For example, the second user equipment generates corresponding continued operation instruction information from the second user's continued operations on the target object (such as drawing marks like line segments or circles) or by recognizing the second user's gestures through gesture recognition. The second user equipment then sends the continued operation instruction information to the first user equipment to assist the first user in continuing to operate on the target object.
For example, the augmented reality glasses send video information about the target object, captured in real time, to the tablet computer, which receives and presents it. The tablet computer then performs target tracking in each video frame of the received video stream to obtain the position of the target object in the video frame; in some embodiments, the tablet computer highlights the target object in the video frame by means such as line segments, circles, or locally increased brightness. The second user guides the first user in machining the part by making marks on the tablet computer or by gesturing within the field of view of the tablet computer's camera. The tablet computer takes the collected marks of the second user as the continued operation instruction information, or performs gesture recognition on the captured gestures and takes the recognized gesture as the continued operation instruction information. The tablet computer then sends the continued operation instruction information to the augmented reality glasses, which receive it and superimpose it at the corresponding position.
As another example, the augmented reality glasses send video information about the target object, captured in real time, to the tablet computer, and also send the transfer matrix information corresponding to each video frame of the video information; the tablet computer receives and presents the video information. The tablet computer then determines the position of the target object in the video frame according to the received transfer matrix information; in some embodiments, the tablet computer highlights the target object in the video frame by means such as line segments, circles, or locally increased brightness. The second user guides the first user in machining the part by making marks on the tablet computer or by gesturing within the field of view of the tablet computer's camera. The tablet computer takes the collected marks of the second user as the continued operation instruction information, or performs gesture recognition on the captured gestures and takes the recognized gesture as the continued operation instruction information. The tablet computer then sends the continued operation instruction information to the augmented reality glasses, which receive it and superimpose it at the corresponding position.
Of course, those skilled in the art will understand that the above continued operation instruction information is merely an example; other existing or future continued operation instruction information, if applicable to this application, should also fall within the scope of protection of this application and is incorporated herein by reference.
In some embodiments, the device further includes a camera control module 25 (not shown). The camera control module 25 is configured to generate camera control instruction information of the second user for the camera device according to a camera control operation performed by the second user through the second user equipment, wherein the camera control instruction information is used to adjust camera parameter information of the camera device; to send the camera control instruction information to the first user equipment; and to receive the video information sent by the first user equipment and captured by the adjusted camera device. For example, the second user equipment receives the video information, and the second user adjusts it, for example by enlarging the area near the target object. The second user equipment determines the corresponding camera control instruction information based on the user's operation, wherein the camera control instruction information includes camera parameter information for adjusting the camera device of the first user equipment, and then sends the camera control instruction information to the first user equipment. The camera control instruction information includes instruction information for adjusting hardware parameters of the camera device of the first user equipment; the camera parameter information includes, but is not limited to, resolution, pixel depth, maximum frame rate, exposure mode and shutter speed, pixel size, and spectral response characteristics.
For example, as shown in FIG. 3, panel A shows the real-time video information received by the second user, in which the target object is the mouse pad on the desk. Wishing to observe the target object in more detail, the second user operates the settings icon in the upper right corner of the video or directly performs a two-finger spread (zoom-in) gesture on the screen. Based on the second user's operation, the tablet computer generates corresponding camera control instruction information for focusing on the target object and sends it to the augmented reality glasses. The augmented reality glasses receive the camera control instruction information, adjust the relevant camera parameters of the camera device (such as resolution and focal length), capture adjusted video information about the target object, and send it to the tablet computer. Panel B shows the enlarged video information about the target object as received and presented by the tablet computer.
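The pinch-to-zoom flow above can be sketched as two small functions: one on the tablet that turns a two-finger gesture into a camera control instruction, and one on the glasses that applies it. The message layout, the single `zoom` parameter, the clamping range, and the function names are all assumptions for illustration; the application lists parameters such as resolution and focal length but does not define an instruction format.

```python
import json
from typing import Any, Dict


def pinch_to_instruction(start_dist: float, end_dist: float,
                         current_zoom: float, max_zoom: float = 8.0) -> str:
    """Build a camera control instruction from a two-finger gesture.

    start_dist and end_dist are the finger separations in pixels at the
    start and end of the gesture; spreading the fingers zooms in.
    """
    if start_dist <= 0:
        raise ValueError("start_dist must be positive")
    zoom = current_zoom * (end_dist / start_dist)
    zoom = max(1.0, min(max_zoom, zoom))  # keep within the assumed range
    return json.dumps({"type": "camera_control", "params": {"zoom": zoom}})


def apply_instruction(payload: str, camera_state: Dict[str, Any]) -> Dict[str, Any]:
    """Return a new camera state with the instructed parameters applied."""
    message = json.loads(payload)
    if message.get("type") != "camera_control":
        return camera_state
    new_state = dict(camera_state)
    new_state.update(message["params"])
    return new_state


# Tablet side: fingers spread from 100 px to 250 px apart.
instruction = pinch_to_instruction(100.0, 250.0, current_zoom=1.0)
# Glasses side: apply the instruction to a hypothetical camera state.
state = apply_instruction(instruction, {"zoom": 1.0, "resolution": [1280, 720]})
```

Adjusting the remote camera rather than scaling the received frames locally means the enlarged view is captured at the camera's own quality, which appears to be the point of sending an instruction back to the first user equipment.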
Of course, those skilled in the art will understand that the above camera control instruction information and/or camera parameter information is merely an example; other existing or future camera control instruction information and/or camera parameter information, if applicable to this application, should also fall within the scope of protection of this application and is incorporated herein by reference.
In some embodiments, the device further includes a mark acquisition module 26 (not shown). The mark acquisition module 26 is configured to receive and present image information about the target object that is sent by the corresponding first user equipment and captured in real time by the camera device in the first user equipment, acquire the second user's operation instruction information on the target object in the image information, send the operation instruction information to the first user equipment, and superimpose the operation instruction information on the target object in the image information; wherein, in step S21, the second user equipment receives video information about the target object that is sent by the first user equipment and captured in real time by the camera device. For example, the first user equipment captures image information about the target object through the camera device and sends it to the second user equipment, which receives and presents it so that the second user can operate on the target object. Based on the second user's operation, the second user equipment generates mark information corresponding to the operation instruction information and sends it to the first user equipment. The first user equipment receives the mark information and superimposes it at the position corresponding to the target object in the image. Subsequently, the first user equipment captures a video stream of the target object through the camera device and, using a target tracking algorithm, superimposes the mark information in each video frame of the video stream.
For example, the augmented reality glasses capture image information of the current target object and send the image information to the tablet computer, which receives and presents it. The second user gives operation instructions on the target object based on the presented image information; the tablet computer collects the second user's operation instruction information, generates the corresponding marker information, and sends the marker information to the augmented reality glasses. The augmented reality glasses receive the marker information and superimpose it on the captured image information according to the target tracking algorithm. The augmented reality glasses then continue to capture video information of the target object and, according to the target tracking algorithm, superimpose the marker information at the corresponding position in real time.
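The overlay step described above can be sketched as follows. This is only an illustration under stated assumptions: the application does not specify the form of the tracking result, so the sketch assumes that tracking yields a 3x3 planar homography per frame, and the names `apply_transfer_matrix` and `place_marker` are hypothetical.

```python
from typing import List, Sequence, Tuple

Point = Tuple[float, float]
Matrix3 = Sequence[Sequence[float]]  # assumed 3x3 homography, row-major


def apply_transfer_matrix(h: Matrix3, point: Point) -> Point:
    """Map a 2D point through a 3x3 matrix using homogeneous coordinates."""
    x, y = point
    xh = h[0][0] * x + h[0][1] * y + h[0][2]
    yh = h[1][0] * x + h[1][1] * y + h[1][2]
    w = h[2][0] * x + h[2][1] * y + h[2][2]
    if abs(w) < 1e-12:
        raise ValueError("point maps to infinity under this matrix")
    return (xh / w, yh / w)


def place_marker(h: Matrix3, marker_outline: List[Point]) -> List[Point]:
    """Move every vertex of a marker drawn on the reference image into the
    coordinates of the current video frame."""
    return [apply_transfer_matrix(h, p) for p in marker_outline]


# Hypothetical frame in which the target moved 12 px right and 5 px up.
shift = [[1.0, 0.0, 12.0], [0.0, 1.0, -5.0], [0.0, 0.0, 1.0]]
outline = [(100.0, 100.0), (140.0, 100.0), (140.0, 130.0)]
moved = place_marker(shift, outline)
```

The marker is stored once, in the coordinates of the image on which the second user drew it, and only the per-frame matrix changes; this is one plausible reason the marker itself does not need to be resent for every frame.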
FIG. 15 shows a first user equipment for remote assistance based on augmented reality according to yet another aspect of the present application, where the device includes a real-time shooting module 31, a video sending module 32, a transfer matrix receiving module 33, and an overlay display module 34. The real-time shooting module 31 is configured to capture video information about a first target object in real time through the camera device in the first user equipment; the video sending module 32 is configured to send the video information to the corresponding network device; the transfer matrix receiving module 33 is configured to receive first transfer matrix information, sent by the network device, corresponding to the first target object in each video frame of the video information; the overlay display module 34 is configured to superimpose and display the corresponding first marker information on the first target object according to the first transfer matrix information, where the first marker information includes the second user's operation instruction information for the first target object, sent by the corresponding second user equipment. For example, the first user equipment and the second user equipment have established a communication connection through the network device. The first user equipment sends the captured video information about the first target object to the network device; the network device performs target tracking on the first target object according to the video information, determines the transfer matrix information of the first target object in each video frame of the video information, and sends the transfer matrix information to the first user equipment and the second user equipment. Subsequently, the first user equipment and the second user equipment superimpose and display the first marker information and the like based on the transfer matrix information sent by the network device, where the first marker information includes the operation instruction information that the second user equipment obtains from the second user for the first target object.
For example, the first user holds augmented reality glasses and the second user holds a tablet computer; the augmented reality glasses and the tablet computer have established a communication connection through a network device (the cloud). The first user films the first target object (such as part A on the operating platform) in real time, obtains video information about part A, and sends that video information to the network device. The network device receives the video information about part A and uses a target tracking algorithm to determine the transfer matrix information of part A in each video frame of the video information; the network device then returns the transfer matrix information to the augmented reality glasses. The augmented reality glasses receive the transfer matrix information and, while presenting the video information, superimpose the corresponding marker information in real time at the corresponding position in the video according to the transfer matrix information. The marker information includes operation instruction information such as the second user's installation guidance for part A; this operation instruction information may be generated on the tablet computer, or may be generated by the network device from the second user's operations as uploaded by the tablet computer.
Of course, those skilled in the art should understand that the foregoing operation instruction information is merely an example; other existing or future operation instruction information, if applicable to this application, should also fall within the protection scope of this application and is hereby incorporated by reference.
FIG. 16 shows a network device for remote assistance based on augmented reality according to yet another aspect of the present application, where the device includes a video receiving module 41, a target tracking module 42, a first sending module 43, and a second sending module 44. The video receiving module 41 is configured to receive video information about a first target object sent by the first user equipment, where the video information is captured in real time by the camera device in the first user equipment; the target tracking module 42 is configured to determine first transfer matrix information corresponding to the first target object in each video frame of the video information by performing a target tracking operation on the first target object in the video information; the first sending module 43 is configured to send the first transfer matrix information to the first user equipment; the second sending module 44 is configured to send the video information and the first transfer matrix information to a second user equipment that belongs to the same remote assistance task as the first user equipment. Here, the network device is a server with sufficient computing power that is mainly responsible for forwarding video, audio, and marker information data. The network device also runs some computer vision and image processing algorithms: for example, when video/audio information reaches the network device, the network device tracks the target object (such as the first target object) with a tracking algorithm and then returns the tracking result information to the user equipment.
For example, the first user holds augmented reality glasses and the second user holds a tablet computer; the augmented reality glasses and the tablet computer have established a communication connection through a network device (the cloud). The augmented reality glasses film the first target object (such as part A on the operating platform) in real time, obtain video information about part A, and send that video information to the network device. The network device receives the video information about part A and uses a target tracking algorithm to determine the transfer matrix information of part A in each video frame of the video information. The network device then returns the transfer matrix information to the augmented reality glasses and sends the transfer matrix information together with the video information to the tablet computer, where the augmented reality glasses and the tablet computer communicate through the network device to carry out the same remote assistance task (for example, installation guidance for part A). The augmented reality glasses receive the transfer matrix information and, while presenting the video information, superimpose the corresponding marker information in real time at the corresponding position in the video according to the transfer matrix information. The marker information includes operation instruction information such as the second user's installation guidance for part A; this operation instruction information may be generated on the tablet computer, or may be generated by the network device from the second user's operations as uploaded by the tablet computer. The tablet computer receives the transfer matrix information and the video information sent by the network device; when presenting the video information, it determines the position of part A in each video frame according to the transfer matrix information and superimposes marker information about part A at that position, such as installation guidance or other operation instruction information for part A.
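The forwarding behaviour of the network device in this example can be sketched as below. This is a minimal illustration, not the applicant's implementation: the `Frame` and `Outbound` types, the `relay_frame` function, and the device keys are hypothetical, and the tracker is passed in as a plain callable because the application does not name a specific tracking algorithm.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List, Optional

Matrix3 = List[List[float]]


@dataclass
class Frame:
    index: int
    pixels: bytes  # encoded frame payload (hypothetical representation)


@dataclass
class Outbound:
    frame_index: int
    transfer_matrix: Matrix3
    pixels: Optional[bytes]  # None when the receiver already holds the video


def relay_frame(frame: Frame, track: Callable[[Frame], Matrix3]) -> Dict[str, Outbound]:
    """Track the target in one frame and build one message per receiver.

    The first user equipment filmed the video itself, so it only needs the
    matrix; the second user equipment needs the video and the matrix.
    """
    matrix = track(frame)
    return {
        "first_user_equipment": Outbound(frame.index, matrix, None),
        "second_user_equipment": Outbound(frame.index, matrix, frame.pixels),
    }


def identity_tracker(_frame: Frame) -> Matrix3:
    """Stand-in tracker that reports no motion; a real tracker goes here."""
    return [[1.0, 0.0, 0.0], [0.0, 1.0, 0.0], [0.0, 0.0, 1.0]]


messages = relay_frame(Frame(index=7, pixels=b"\x00\x01"), identity_tracker)
```

Sending only the matrix back to the device that captured the video keeps the return path small, which matches the asymmetry between the first and second sending modules described above.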
In some embodiments, the target tracking module 42 is configured to reconstruct the video information of the first target object according to the video information and other video information of the first target object, and to determine the first transfer matrix information corresponding to the first target object in each video frame of the video information by performing a target tracking operation on the first target object in the reconstructed video information. Here, the network device is mainly responsible for forwarding data such as video, audio, and marker information, and it also has some computer vision and image processing capabilities. When video/audio information is sent to the network device, the network device processes the video information with target tracking, target recognition, reconstruction, pose estimation, and computer graphics algorithms (such as virtual object rendering and point cloud processing, including stitching, downsampling/upsampling, matching, and meshing) and returns the processing result information to the user equipment. For example, the network device reconstructs the video information uploaded by the first user together with videos uploaded by other users to generate overall video information for the first target object, and then performs target tracking on the first target object in the reconstructed video information.
For example, the first user holds augmented reality glasses, the second user holds a tablet computer, and another user (such as a third user) holds a third user equipment (such as augmented reality glasses or a tablet computer). The augmented reality glasses, the third user equipment, and the tablet computer have established a communication connection through a network device (the cloud) and are carrying out the same remote assistance task (for example, installation guidance for part A). The augmented reality glasses and the third user equipment are both filming video information about part A: the augmented reality glasses mainly film the left half of part A and the third user equipment mainly films the right half, with a certain degree of overlap between the two. The augmented reality glasses film the first target object (such as part A on the operating platform) in real time, obtain first video information about the left half of part A, and send the first video information to the network device; the third user films part A in real time, obtains third video information about the right half of part A, and sends the third video information to the network device. The network device receives the first video information and the third video information about part A, uses a computer vision algorithm to obtain reconstructed video information containing the whole of part A from the first and third video information, and uses a target tracking algorithm to determine the transfer matrix information of part A in each video frame of the reconstructed video information. The network device then returns the transfer matrix information and the reconstructed video information to the augmented reality glasses, the third user equipment, and the tablet computer. The third user equipment receives the transfer matrix information and the reconstructed video information and, while presenting the reconstructed video information, superimposes the corresponding marker information in real time at the corresponding position in the video according to the transfer matrix information. The marker information includes operation instruction information such as the second user's installation guidance for part A; this operation instruction information may be generated on the tablet computer, or may be generated by the network device from the second user's operations as uploaded by the tablet computer. In other embodiments, the third user equipment uses a computer vision algorithm to compute transfer matrix information that relates the position of the right half of part A in the reconstructed video information to the third video information; the third user equipment then presents the third video information while superimposing the corresponding marker information at the corresponding position.
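The alternative embodiment at the end of this example, in which the third user equipment shows its own third video rather than the reconstructed video, amounts to chaining two mappings. The sketch below assumes both mappings are 3x3 homographies and uses pure translations for the numbers; `compose`, `map_point`, and the sample offsets are hypothetical and not taken from the application.

```python
from typing import List, Tuple

Matrix3 = List[List[float]]
Point = Tuple[float, float]


def compose(a: Matrix3, b: Matrix3) -> Matrix3:
    """Return the matrix product a*b, i.e. apply b first and then a."""
    return [[sum(a[i][k] * b[k][j] for k in range(3)) for j in range(3)]
            for i in range(3)]


def map_point(h: Matrix3, p: Point) -> Point:
    """Apply a 3x3 matrix to a 2D point in homogeneous coordinates."""
    x, y = p
    w = h[2][0] * x + h[2][1] * y + h[2][2]
    return ((h[0][0] * x + h[0][1] * y + h[0][2]) / w,
            (h[1][0] * x + h[1][1] * y + h[1][2]) / w)


# Assumed tracking result in the reconstructed video: part A moved 10 px right.
track_in_reconstructed = [[1.0, 0.0, 10.0], [0.0, 1.0, 0.0], [0.0, 0.0, 1.0]]
# Assumed mapping from reconstructed-video coordinates to third-video
# coordinates: the third view starts 200 px into the reconstructed frame.
reconstructed_to_third = [[1.0, 0.0, -200.0], [0.0, 1.0, 0.0], [0.0, 0.0, 1.0]]

marker_to_third = compose(reconstructed_to_third, track_in_reconstructed)
marker_in_third_view = map_point(marker_to_third, (250.0, 80.0))
```

Composing the matrices once per frame lets every marker vertex be mapped with a single multiplication, instead of being mapped into the reconstructed frame and then again into the third view.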
In some embodiments, the device further includes a third sending module 45 (not shown). The third sending module 45 is configured to determine third transfer matrix information corresponding to a third target object in each video frame of the video information by performing a target tracking operation on the third target object in the video information, where the third target object and the first target object belong to the same remote assistance task, and to send the video information and the third transfer matrix information to the third user equipment that corresponds to the third target object in the remote assistance task; wherein the second sending module 44 is configured to send the video information, the first transfer matrix information, and the third transfer matrix information to the second user equipment that belongs to the same remote assistance task as the first user equipment. Here, the third user holds the third user equipment, which includes but is not limited to an augmented reality device, a tablet computer, a PC, or a mobile terminal. The following embodiments are described using a mobile terminal as an example; those skilled in the art should understand that these embodiments apply equally to other third user equipment such as augmented reality devices, tablet computers, and PCs.
For example, the first user holds augmented reality glasses, the second user holds a tablet computer, and the third user holds a mobile terminal. The augmented reality glasses, the tablet computer, and the mobile terminal have established a communication connection through a network device (the cloud) and are carrying out the same remote assistance task (for example, installation guidance for part A and part B on the workbench); the augmented reality glasses are responsible for filming video information of the workbench. The augmented reality glasses film the first target object (such as part A on the operating platform) in real time and obtain video information about part A on the workbench; the video frames of this video information also contain part B. The augmented reality glasses then send the video information to the network device. The network device receives the video information, obtains the initial positions of part A and part B through image recognition, and uses a target tracking algorithm to compute, for each video frame of the video information, the first transfer matrix information for part A and the third transfer matrix information for part B. The network device then returns the first transfer matrix information to the augmented reality glasses, sends the third transfer matrix information and the video information to the mobile terminal, and sends the first transfer matrix information, the third transfer matrix information, and the video information to the tablet computer. The augmented reality glasses receive the first transfer matrix information and, while presenting the video information, superimpose the corresponding marker information in real time at the corresponding position in the video according to the first transfer matrix information. The marker information includes operation instruction information such as the second user's installation guidance for part A; this operation instruction information may be generated on the tablet computer, or may be generated by the network device from the second user's operations as uploaded by the tablet computer. The mobile terminal receives the third transfer matrix information and the video information sent by the network device; when presenting the video information, it determines the position of part B in each video frame according to the third transfer matrix information and superimposes marker information about part B at that position, such as installation guidance or other operation instruction information for part B. The tablet computer receives the first transfer matrix information, the third transfer matrix information, and the video information sent by the network device. When presenting the video information, it determines the position of part A in each video frame according to the first transfer matrix information and superimposes marker information about part A at that position, such as installation guidance for part A; it likewise determines the position of part B in each video frame according to the third transfer matrix information and superimposes marker information about part B at that position, such as installation guidance for part B. The second user equipment may determine, according to a selection operation by the second user, which object the marker information currently being entered on the second user equipment applies to.
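The distribution rule in this multi-target example (the glasses receive only the matrix for part A, the mobile terminal receives the matrix for part B plus the video, and the tablet receives both matrices plus the video) can be sketched as a small routing function. The device names, the `subscriptions` mapping, and `build_payloads` are hypothetical and serve only to make the rule concrete.

```python
from typing import Dict, List, Set

Matrix3 = List[List[float]]


def build_payloads(matrices: Dict[str, Matrix3],
                   subscriptions: Dict[str, Set[str]],
                   video_source: str) -> Dict[str, dict]:
    """Decide, for one video frame, what each device should receive.

    matrices: per-target tracking result for this frame.
    subscriptions: which targets each device overlays markers on.
    video_source: the device that filmed the video; it is not sent the video.
    """
    payloads = {}
    for device, targets in subscriptions.items():
        payloads[device] = {
            "matrices": {t: matrices[t] for t in sorted(targets)},
            "include_video": device != video_source,
        }
    return payloads


identity = [[1.0, 0.0, 0.0], [0.0, 1.0, 0.0], [0.0, 0.0, 1.0]]
frame_matrices = {"part_a": identity, "part_b": identity}
subscriptions = {
    "ar_glasses": {"part_a"},
    "mobile_terminal": {"part_b"},
    "tablet": {"part_a", "part_b"},
}
payloads = build_payloads(frame_matrices, subscriptions, video_source="ar_glasses")
```

Under this scheme, adding a fourth participant would only require a new entry in `subscriptions`; whether the actual system is organised this way is not stated in the application.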
FIG. 17 shows a third user equipment for remote assistance based on augmented reality according to yet another aspect of the present application, where the device includes a receiving module 51 and a presentation module 52. The receiving module 51 is configured to receive, from the corresponding network device, video information about a third target object and third transfer matrix information corresponding to the third target object in each video frame of the video information; the presentation module 52 is configured to present the video information and, according to the third transfer matrix information, to superimpose and display the corresponding third marker information on the third target object in each video frame of the video information, where the third marker information includes the second user's operation instruction information for the third target object given through the second user equipment; wherein the video information is captured in real time by the camera device in the first user equipment, the first user equipment, the third user equipment, and the second user equipment belong to the same remote assistance task, and the first user equipment and the third user equipment each receive remote assistance from the second user equipment.
For example, the first user holds augmented reality glasses, the second user holds a tablet computer, and the third user holds a mobile terminal. The augmented reality glasses, the tablet computer, and the mobile terminal have established a communication connection through a network device (the cloud) and are carrying out the same remote assistance task (for example, installation guidance for part A and part B on the workbench); the augmented reality glasses are responsible for filming video information of the workbench. The augmented reality glasses film the first target object (such as part A on the operating platform) in real time and obtain video information about part A on the workbench; the video frames of this video information also contain part B. The augmented reality glasses then send the video information to the network device. The network device receives the video information, obtains the initial positions of part A and part B through image recognition, and uses a target tracking algorithm to compute, for each video frame of the video information, the first transfer matrix information for part A and the third transfer matrix information for part B; the network device then sends the third transfer matrix information and the video information to the mobile terminal. The mobile terminal receives the third transfer matrix information and the video information sent by the network device; when presenting the video information, it determines the position of part B in each video frame according to the third transfer matrix information and superimposes marker information about part B at that position. The marker information includes operation instruction information such as the second user's installation guidance for part B; this operation instruction information may be generated on the tablet computer, or may be generated by the network device from the second user's operations as uploaded by the tablet computer. In other embodiments, the marker information also includes markers on the target object (such as drawn line segments or circles) that the mobile terminal collects from the third user's operations, or feedback on the marker information sent by the tablet computer, such as questions written into the marker information or text circled within it; while presenting the video information, the mobile terminal superimposes this auxiliary marker information at the position corresponding to the target object.
FIG. 18 shows a second user equipment for remote assistance based on augmented reality according to yet another aspect of the present application, where the device includes a receiving module 61 and a presentation module 62. The receiving module 61 is configured to receive, from the corresponding network device, video information about a first target object and first transfer matrix information corresponding to the first target object in each video frame of the video information; the presentation module 62 is configured to present the video information and, according to the first transfer matrix information, to superimpose and display the corresponding first marker information on the first target object in each video frame of the video information, where the first marker information includes the second user's operation instruction information for the first target object given through the second user equipment; wherein the video information is captured in real time by the camera device in a first user equipment that belongs to the same remote assistance task as the second user equipment, or is reconstructed from the real-time video information about the first target object captured by the camera device and other video information of the first target object.
For example, the first user holds augmented reality glasses and the second user holds a tablet computer; the augmented reality glasses and the tablet computer have established a communication connection through a network device (the cloud). The augmented reality glasses film the first target object (such as part A on the operating platform) in real time, obtain video information about part A, and send that video information to the network device. The network device receives the video information about part A and uses a target tracking algorithm to determine the transfer matrix information of part A in each video frame of the video information. The network device then returns the transfer matrix information to the augmented reality glasses and sends the transfer matrix information together with the video information to the tablet computer, where the augmented reality glasses and the tablet computer communicate through the network device to carry out the same remote assistance task (for example, installation guidance for part A). The tablet computer receives the transfer matrix information and the video information sent by the network device; when presenting the video information, it determines the position of part A in each video frame according to the transfer matrix information and superimposes marker information about part A at that position, such as installation guidance or other operation instruction information for part A.
In some embodiments, the device further includes a third marker overlay module 63 (not shown). The third marker overlay module 63 is configured to receive third transfer matrix information, sent by the network device, corresponding to the third target object in each video frame of the video information, and, while the video information is being presented, to superimpose and display the corresponding third marker information on the third target object in each video frame of the video information according to the third transfer matrix information, where the third marker information includes the second user's operation instruction information for the third target object given through the second user equipment.
For example, a first user holds augmented reality glasses, a second user holds a tablet computer, and a third user holds a mobile terminal. The augmented reality glasses, the tablet computer, and the mobile terminal have established a communication connection through a network device (cloud) and are performing the same remote assistance task (e.g., installation guidance for part A and part B on the workbench), with the augmented reality glasses responsible for capturing video information of the workbench. The glasses capture the first target object (e.g., part A on the operating platform) in real time and obtain video information about part A on the workbench; the video frames of this video information also contain part B. The glasses then send the video information to the network device. The network device receives the video information, obtains the initial positions of part A and part B through image recognition, and uses a target tracking algorithm to compute, for part A and part B respectively, the first transfer matrix information and the third transfer matrix information in each video frame of the video information. The network device then sends the first transfer matrix information, the third transfer matrix information, and the video information to the tablet computer. The tablet computer receives them and, when presenting the video information, determines the position of part A in each video frame from the first transfer matrix information and superimposes mark information about part A at that position, such as installation guidance or other operation instruction information for part A. It likewise determines the position of part B in each video frame from the third transfer matrix information and superimposes mark information about part B at that position, such as installation guidance or other operation instruction information for part B. Here, the mark information includes operation instruction information such as the second user's installation guidance for each part; this operation instruction information may be generated on the tablet computer, or generated by the network device from the second user's operations uploaded by the tablet computer. The second user equipment may determine, according to a selection operation by the second user, which object its current mark information applies to.
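The two-part case above can be sketched as a small bookkeeping structure on the second user equipment. This is an illustrative sketch only: `Target`, `MarkBoard`, and `project` are hypothetical names, and the transfer matrices are assumed to be 3x3 planar transforms applied to each part's initial position.

```python
from dataclasses import dataclass, field
from typing import Dict, List, Sequence, Tuple

Matrix3 = Sequence[Sequence[float]]  # assumed 3x3 transfer matrix
Point = Tuple[float, float]


def project(h: Matrix3, p: Point) -> Point:
    """Map a point through a 3x3 matrix in homogeneous coordinates."""
    x, y = p
    w = h[2][0] * x + h[2][1] * y + h[2][2]
    return ((h[0][0] * x + h[0][1] * y + h[0][2]) / w,
            (h[1][0] * x + h[1][1] * y + h[1][2]) / w)


@dataclass
class Target:
    name: str                  # e.g. "part A"
    anchor: Point              # initial position, e.g. from image recognition
    marks: List[str] = field(default_factory=list)


class MarkBoard:
    """Keeps marks per target; the user's selection decides where new marks go."""

    def __init__(self, targets: List[Target]) -> None:
        self.targets: Dict[str, Target] = {t.name: t for t in targets}
        self.selected: str = targets[0].name

    def select(self, name: str) -> None:
        if name not in self.targets:
            raise KeyError(name)
        self.selected = name

    def add_mark(self, text: str) -> None:
        self.targets[self.selected].marks.append(text)

    def overlays(self, matrices: Dict[str, Matrix3]) -> List[Tuple[str, Point]]:
        """Return (mark text, position) pairs for one frame, one matrix per target."""
        out: List[Tuple[str, Point]] = []
        for name, target in self.targets.items():
            pos = project(matrices[name], target.anchor)
            out.extend((text, pos) for text in target.marks)
        return out
```

Keeping one matrix stream per target means that part A and part B can move independently in the frame while each mark stays attached to its own part.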
FIG. 19 illustrates a network device for remote assistance based on augmented reality according to yet another aspect of the present application, where the device includes a video receiving module 71, a target tracking module 72, a mark adding module 73, and a video sending module 74. The video receiving module 71 is configured to receive video information about a target object sent by a first user equipment, wherein the video information includes video captured by a camera device in the first user equipment. The target tracking module 72 is configured to determine transfer matrix information corresponding to the target object in each video frame of the video information by performing a target tracking operation on the target object in the video information. The mark adding module 73 is configured to add corresponding mark information to each video frame of the video information according to the transfer matrix information, wherein the mark information remains superimposed on the target object in each video frame of the video information, and the mark information includes operation instruction information of a second user on the target object, sent by a corresponding second user equipment. The video sending module 74 is configured to send the edited video information to the first user equipment and to the second user equipment that belongs to the same remote assistance task as the first user equipment.
For example, a first user holds augmented reality glasses and a second user holds a tablet computer, and the glasses and the tablet have established a communication connection through a network device (cloud). The augmented reality glasses capture the first target object (e.g., part A on the operating platform) in real time, obtain video information about part A, and send that video information to the network device. The network device receives the video information about part A and determines, according to a target tracking algorithm, the transfer matrix information of part A in each video frame of the video information. The network device then uses the transfer matrix information to add the mark information corresponding to part A (e.g., operating guidance for part A) at the corresponding position in each video frame, and sends the edited video frames to the augmented reality glasses and the tablet computer; the glasses and the tablet communicate through the network device to perform the same remote assistance task (e.g., installation guidance for part A). The augmented reality glasses receive and present the video information, in which the corresponding mark information is superimposed in real time at the corresponding position. The mark information includes operation instruction information such as the second user's installation guidance for part A; this operation instruction information may be generated on the tablet computer, or generated by the network device from the second user's operations uploaded by the tablet computer. Similarly, the tablet computer receives and presents the video information, in which mark information about part A, such as installation guidance or other operation instruction information for part A, is superimposed at the corresponding position.
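In this variant the network device edits the frames itself and delivers the same edited frame to every device in the task. The sketch below models that flow with plain data records rather than real pixels or network calls. `Frame`, `AssistanceTask`, `edit_frame`, and `fan_out` are hypothetical names, and the transfer matrix is again assumed to be a 3x3 planar transform.

```python
from dataclasses import dataclass, field
from typing import Dict, List, Sequence, Tuple

Matrix3 = Sequence[Sequence[float]]  # assumed 3x3 transfer matrix
Point = Tuple[float, float]


@dataclass
class Frame:
    index: int
    # Marks already placed in this frame, as (text, position) pairs.
    overlays: List[Tuple[str, Point]] = field(default_factory=list)


@dataclass
class AssistanceTask:
    task_id: str
    device_ids: List[str]      # every device taking part in the same task


def edit_frame(frame: Frame, text: str, anchor: Point, matrix: Matrix3) -> Frame:
    """Return a copy of `frame` with the mark placed at the tracked position."""
    x, y = anchor
    w = matrix[2][0] * x + matrix[2][1] * y + matrix[2][2]
    px = (matrix[0][0] * x + matrix[0][1] * y + matrix[0][2]) / w
    py = (matrix[1][0] * x + matrix[1][1] * y + matrix[1][2]) / w
    return Frame(frame.index, frame.overlays + [(text, (px, py))])


def fan_out(task: AssistanceTask, frame: Frame) -> Dict[str, Frame]:
    """Address the same edited frame to every device in the task."""
    return {device_id: frame for device_id in task.device_ids}
```

Because the marks are already part of the frame when it leaves the network device, neither the glasses nor the tablet needs the transfer matrices in this arrangement.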
FIG. 20 shows a system for remote assistance based on augmented reality, wherein the system includes the first user equipment described above, which includes a real-time shooting module, a target tracking module, and a superimposed display module, and the second user equipment described above, which includes a video receiving module and a video presentation module.
FIG. 21 shows a system for remote assistance based on augmented reality, wherein the system includes the first user equipment described above, which includes a real-time shooting module, a video sending module, a transfer matrix receiving module, and a superimposed display module; the second user equipment described above, which includes a receiving module and a presentation module; and the network device described above, which includes a video receiving module, a target tracking module, a first sending module, and a second sending module.
FIG. 22 shows a system for remote assistance based on augmented reality, wherein the system includes the first user equipment described above, which includes a real-time shooting module, a video sending module, a transfer matrix receiving module, and a superimposed display module; the second user equipment described above, which includes a receiving module and a presentation module; the third user equipment described above, which includes a receiving module and a presentation module; and the network device described above, which includes a video receiving module, a target tracking module, a first sending module, and a second sending module.
The present application also provides a computer-readable storage medium storing computer code; when the computer code is executed, the method according to any one of the foregoing is performed.
The present application also provides a computer program product; when the computer program product is executed by a computer device, the method according to any one of the foregoing is performed.
The present application also provides a computer device, the computer device including:
one or more processors;
a memory for storing one or more computer programs;
wherein the one or more computer programs, when executed by the one or more processors, cause the one or more processors to implement the method according to any one of the foregoing.
FIG. 23 illustrates an exemplary system that can be used to implement the various embodiments described in this application.
As shown in FIG. 23, in some embodiments, the system 300 can serve as any of the devices for remote assistance based on augmented reality in the described embodiments. In some embodiments, the system 300 may include one or more computer-readable media having instructions (e.g., system memory or NVM/storage device 320), and one or more processors (e.g., processor(s) 305) coupled to the one or more computer-readable media and configured to execute the instructions to implement modules that perform the actions described in this application.
For one embodiment, the system control module 310 may include any suitable interface controller to provide any suitable interface to at least one of the processor(s) 305 and/or to any suitable device or component in communication with the system control module 310.
The system control module 310 may include a memory controller module 330 to provide an interface to the system memory 315. The memory controller module 330 may be a hardware module, a software module, and/or a firmware module.
The system memory 315 may be used, for example, to load and store data and/or instructions for the system 300. For one embodiment, the system memory 315 may include any suitable volatile memory, such as a suitable DRAM. In some embodiments, the system memory 315 may include double data rate type four synchronous dynamic random access memory (DDR4 SDRAM).
For one embodiment, the system control module 310 may include one or more input/output (I/O) controllers to provide an interface to the NVM/storage device 320 and the communication interface(s) 325.
For example, the NVM/storage device 320 may be used to store data and/or instructions. The NVM/storage device 320 may include any suitable non-volatile memory (e.g., flash memory) and/or any suitable non-volatile storage device(s) (e.g., one or more hard disk drives (HDD), one or more compact disc (CD) drives, and/or one or more digital versatile disc (DVD) drives).
The NVM/storage device 320 may include storage resources that are physically part of the device on which the system 300 is installed, or it may be accessible by that device without being part of it. For example, the NVM/storage device 320 may be accessed over a network via the communication interface(s) 325.
The communication interface(s) 325 may provide an interface for the system 300 to communicate over one or more networks and/or with any other suitable device. The system 300 may communicate wirelessly with one or more components of a wireless network in accordance with any of one or more wireless network standards and/or protocols.
For one embodiment, at least one of the processor(s) 305 may be packaged together with the logic of one or more controllers (e.g., the memory controller module 330) of the system control module 310. For one embodiment, at least one of the processor(s) 305 may be packaged together with the logic of one or more controllers of the system control module 310 to form a system-in-package (SiP). For one embodiment, at least one of the processor(s) 305 may be integrated on the same die with the logic of one or more controllers of the system control module 310. For one embodiment, at least one of the processor(s) 305 may be integrated on the same die with the logic of one or more controllers of the system control module 310 to form a system-on-chip (SoC).
In various embodiments, the system 300 may be, but is not limited to, a server, a workstation, a desktop computing device, or a mobile computing device (e.g., a laptop computing device, a handheld computing device, a tablet computer, a netbook, etc.). In various embodiments, the system 300 may have more or fewer components and/or a different architecture. For example, in some embodiments, the system 300 includes one or more cameras, a keyboard, a liquid crystal display (LCD) screen (including a touch screen display), a non-volatile memory port, multiple antennas, a graphics chip, an application specific integrated circuit (ASIC), and a speaker.
It should be noted that the present application may be implemented in software and/or in a combination of software and hardware; for example, it may be implemented using an application specific integrated circuit (ASIC), a general-purpose computer, or any other similar hardware device. In one embodiment, the software program of the present application may be executed by a processor to implement the steps or functions described above. Likewise, the software program of the present application (including related data structures) may be stored in a computer-readable recording medium, for example, RAM, a magnetic or optical drive, a floppy disk, or a similar device. In addition, some steps or functions of the present application may be implemented in hardware, for example, as a circuit that cooperates with a processor to perform the various steps or functions.
In addition, part of the present application may be applied as a computer program product, such as computer program instructions which, when executed by a computer, can invoke or provide the method and/or technical solution according to the present application through the operation of that computer. Those skilled in the art should understand that the forms in which computer program instructions exist in a computer-readable medium include, but are not limited to, source files, executable files, and installation package files. Accordingly, the ways in which computer program instructions are executed by a computer include, but are not limited to: the computer directly executes the instructions; the computer compiles the instructions and then executes the corresponding compiled program; the computer reads and executes the instructions; or the computer reads and installs the instructions and then executes the corresponding installed program. Here, the computer-readable medium may be any available computer-readable storage medium or communication medium that can be accessed by a computer.
Communication media include media by which communication signals containing, for example, computer-readable instructions, data structures, program modules, or other data are transmitted from one system to another. Communication media may include guided transmission media (such as cables and wires, e.g., optical fiber or coaxial cable) and wireless (unguided transmission) media capable of propagating energy waves, such as acoustic, electromagnetic, RF, microwave, and infrared media. Computer-readable instructions, data structures, program modules, or other data may be embodied, for example, as a modulated data signal in a wireless medium, such as a carrier wave or a similar mechanism such as one embodied as part of spread spectrum technology. The term "modulated data signal" refers to a signal that has one or more of its characteristics changed or set in such a manner as to encode information in the signal. The modulation may be analog, digital, or a hybrid modulation technique.
By way of example and not limitation, computer-readable storage media may include volatile and non-volatile, removable and non-removable media implemented in any method or technology for storing information such as computer-readable instructions, data structures, program modules, or other data. For example, computer-readable storage media include, but are not limited to: volatile memory, such as random access memory (RAM, DRAM, SRAM); non-volatile memory, such as flash memory, various read-only memories (ROM, PROM, EPROM, EEPROM), and magnetic and ferromagnetic/ferroelectric memories (MRAM, FeRAM); magnetic and optical storage devices (hard disk, magnetic tape, CD, DVD); and other media, now known or developed in the future, capable of storing computer-readable information/data for use by a computer system.
It is apparent to those skilled in the art that the present application is not limited to the details of the above exemplary embodiments, and that the present application can be implemented in other specific forms without departing from its spirit or essential characteristics. The embodiments should therefore be regarded in all respects as exemplary and non-limiting. The scope of the present application is defined by the appended claims rather than by the above description, and all changes that fall within the meaning and range of equivalents of the claims are therefore intended to be embraced in the present application. No reference sign in the claims should be construed as limiting the claim concerned. Furthermore, the word "comprising" does not exclude other units or steps, and the singular does not exclude the plural. Words such as "first" and "second" are used to denote names and do not denote any particular order.

Claims (42)

  1. A method for remote assistance based on augmented reality at a first user equipment, wherein the method comprises:
    capturing video information about a target object in real time through a camera device in the first user equipment;
    determining transfer matrix information corresponding to the target object in each video frame of the video information by performing a target tracking operation on the target object in the video information;
    superimposing corresponding mark information on the target object according to the transfer matrix information, wherein the mark information includes operation instruction information of a second user on the target object, sent by a corresponding second user equipment.
  2. The method according to claim 1, wherein the method further comprises:
    sending the video information to the second user equipment.
  3. The method according to claim 1, wherein the sending of the video information to the second user equipment comprises:
    sending the video information and the transfer matrix information to the second user equipment.
  4. The method according to claim 2 or 3, wherein the method further comprises:
    receiving continued operation instruction information of the second user on the target object, sent by the second user equipment and based on the video information.
  5. The method according to any one of claims 2 to 4, wherein the method further comprises:
    receiving imaging control instruction information of the second user for the camera device, sent by the second user equipment;
    adjusting imaging parameter information of the camera device according to the imaging control instruction information;
    capturing video information about the target object in real time through the adjusted camera device;
    sending the video information captured by the adjusted camera device to the second user equipment.
  6. The method according to any one of claims 1 to 5, wherein the mark information further includes auxiliary marking information with which a first user marks the target object through the first user equipment.
  7. The method according to any one of claims 1 to 6, wherein the target object includes a paper document under discussion; and the operation instruction information of the second user on the target object includes one or more pieces of annotation position information of the second user in the video frames of the paper document under discussion.
  8. The method according to claim 7, wherein the superimposing of corresponding mark information on the target object according to the transfer matrix information, wherein the mark information includes operation instruction information of a second user on the target object, sent by a corresponding second user equipment, comprises:
    generating rendering mark information according to the one or more pieces of annotation position information;
    superimposing the rendering mark information on the target object according to the transfer matrix information.
  9. The method according to any one of claims 1 to 8, wherein the method further comprises:
    capturing image information about a target object in real time through the camera device in the first user equipment;
    sending the image information to a corresponding second user equipment;
    receiving mark information about the target object, wherein the mark information includes operation instruction information of a second user on the target object in the image information, sent by the second user equipment;
    superimposing the mark information on the target object;
    wherein the capturing of video information about a target object in real time through a camera device in the first user equipment comprises:
    capturing video information about the target object in real time through the camera device.
  10. A method for remote assistance based on augmented reality at a second user equipment, wherein the method comprises:
    receiving video information about a target object, sent by a corresponding first user equipment and captured in real time through a camera device in the first user equipment;
    presenting the video information, and keeping corresponding mark information superimposed on the target object in each video frame of the video information, wherein the mark information includes operation instruction information of a second user on the target object through the second user equipment.
  11. The method according to claim 10, wherein the method further comprises:
    performing a target tracking operation on the target object in the video information;
    wherein the presenting of the video information and keeping of corresponding mark information superimposed on the target object in each video frame of the video information, wherein the mark information includes operation instruction information of a second user on the target object through the second user equipment, comprises:
    presenting the video information, and superimposing corresponding mark information on the target object in each video frame of the video information according to result information of the target tracking operation, wherein the mark information includes operation instruction information of the second user on the target object through the second user equipment.
  12. The method according to claim 10, wherein the receiving of video information about a target object, sent by a corresponding first user equipment and captured in real time through a camera device in the first user equipment, comprises:
    receiving video information about a target object, sent by a corresponding first user equipment and acquired in real time through a camera device in the first user equipment, together with transfer matrix information corresponding to the target object in each video frame of the video information;
    wherein the presenting of the video information and keeping of corresponding mark information superimposed on the target object in each video frame of the video information, wherein the mark information includes operation instruction information of a second user on the target object through the second user equipment, comprises:
    presenting the video information, and superimposing corresponding mark information on the target object in each video frame of the video information according to the transfer matrix information corresponding to the target object in each video frame of the video information, wherein the mark information includes operation instruction information of the second user on the target object through the second user equipment.
  13. The method according to any one of claims 10 to 12, wherein the method further comprises:
    acquiring continued operation instruction information of the second user on the target object based on the video information;
    sending the continued operation instruction information to the first user equipment.
  14. The method according to any one of claims 10 to 13, wherein the method further comprises:
    generating imaging control instruction information of the second user for the camera device according to an imaging control operation performed by the second user through the second user equipment, wherein the imaging control instruction information is used to adjust imaging parameter information of the camera device;
    sending the imaging control instruction information to the first user equipment;
    receiving the video information sent by the first user equipment and captured by the adjusted camera device.
  15. The method according to any one of claims 10 to 14, wherein the method further comprises:
    receiving and presenting image information about a target object, sent by a corresponding first user equipment and captured in real time through the camera device in the first user equipment;
    acquiring operation instruction information of the second user on the target object in the image information;
    sending the operation instruction information to the first user equipment;
    superimposing the operation instruction information on the target object in the image information;
    wherein the receiving of video information about a target object, sent by a corresponding first user equipment and captured in real time through a camera device in the first user equipment, comprises:
    receiving video information about the target object, sent by the first user equipment and captured in real time through the camera device.
  16. A method for performing remote assistance based on augmented reality at a first user equipment, wherein the method comprises:
    Capturing video information about a first target object in real time through a camera device in the first user equipment;
    Sending the video information to a corresponding network device;
    Receiving first transfer matrix information, sent by the network device, corresponding to the first target object in each video frame of the video information;
    Superimposing corresponding first marker information on the first target object according to the first transfer matrix information, wherein the first marker information includes operation instruction information of a second user on the first target object sent by a corresponding second user equipment.
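The overlay step of claim 16 can be sketched as follows. The claims do not say what form the transfer matrix takes; this sketch assumes a 3×3 planar homography that maps marker coordinates from a reference frame into each later frame. The names `apply_transfer_matrix` and `overlay_marker` and the sample data are hypothetical.

```python
from typing import List, Sequence, Tuple

Point = Tuple[float, float]
Matrix3 = Sequence[Sequence[float]]


def apply_transfer_matrix(h: Matrix3, p: Point) -> Point:
    """Map a 2D point through a 3x3 homography (assumed matrix form)."""
    x, y = p
    xh = h[0][0] * x + h[0][1] * y + h[0][2]
    yh = h[1][0] * x + h[1][1] * y + h[1][2]
    w = h[2][0] * x + h[2][1] * y + h[2][2]
    if w == 0:
        raise ValueError("point maps to infinity under this matrix")
    return (xh / w, yh / w)


def overlay_marker(marker: List[Point],
                   per_frame: List[Matrix3]) -> List[List[Point]]:
    """Return the marker's point positions in every frame, one list per frame."""
    return [[apply_transfer_matrix(h, p) for p in marker] for h in per_frame]


# Hypothetical data: frame 0 is the reference frame; in frame 1 the tracked
# object has shifted by (+5, -2) pixels.
identity = [[1, 0, 0], [0, 1, 0], [0, 0, 1]]
shifted = [[1, 0, 5], [0, 1, -2], [0, 0, 1]]
marker = [(10.0, 10.0), (20.0, 10.0)]
positions = overlay_marker(marker, [identity, shifted])
print(positions[1])  # [(15.0, 8.0), (25.0, 8.0)]
```

Because the marker is re-projected for every frame, it stays attached to the target object as the camera or the object moves, which is the effect the claim describes.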
  17. A method for performing remote assistance based on augmented reality at a network device, wherein the method comprises:
    Receiving video information about a first target object sent by a first user equipment, wherein the video information is captured in real time by a camera device in the first user equipment;
    Determining first transfer matrix information corresponding to the first target object in each video frame of the video information by performing a target tracking operation on the first target object in the video information;
    Sending the first transfer matrix information to the first user equipment;
    Sending the video information and the first transfer matrix information to a second user equipment that belongs to the same remote assistance task as the first user equipment.
  18. The method according to claim 17, wherein the determining of the first transfer matrix information corresponding to the first target object in each video frame of the video information by performing a target tracking operation on the first target object in the video information comprises:
    Reconstructing video information of the first target object according to the video information and other video information of the first target object;
    Determining the first transfer matrix information corresponding to the first target object in each video frame of the video information by performing a target tracking operation on the first target object in the reconstructed video information.
  19. The method according to claim 17, wherein the method further comprises:
    Determining third transfer matrix information corresponding to a third target object in each video frame of the video information by performing a target tracking operation on the third target object in the video information, wherein the third target object and the first target object belong to the same remote assistance task;
    Sending the video information and the third transfer matrix information to a third user equipment corresponding to the third target object in the remote assistance task;
    Wherein the sending of the video information and the first transfer matrix information to a second user equipment that belongs to the same remote assistance task as the first user equipment comprises:
    Sending the video information, the first transfer matrix information, and the third transfer matrix information to the second user equipment that belongs to the same remote assistance task as the first user equipment.
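The distribution rule of claims 17 and 19 (which device receives the video and which transfer matrices) can be sketched as a small routing table. The device keys, the `SUBSCRIPTIONS` table, and `build_payloads` are hypothetical names introduced only for this illustration; the claims do not prescribe any message format.

```python
from typing import Dict

# Hypothetical subscription table: which targets' matrices each device gets,
# and whether the network device must also forward the video to it.
SUBSCRIPTIONS = {
    # The first user equipment captured the video itself, so it only
    # receives the first transfer matrix information.
    "first_ue": {"targets": ["first"], "needs_video": False},
    # The assisting (second) user equipment sees both tracked targets.
    "second_ue": {"targets": ["first", "third"], "needs_video": True},
    # The third user equipment only follows the third target object.
    "third_ue": {"targets": ["third"], "needs_video": True},
}


def build_payloads(video_ref: str, matrices: Dict[str, list]) -> Dict[str, dict]:
    """Assemble one outgoing payload per device in the remote assistance task."""
    payloads = {}
    for device, sub in SUBSCRIPTIONS.items():
        payload = {"matrices": {t: matrices[t] for t in sub["targets"]}}
        if sub["needs_video"]:
            payload["video"] = video_ref
        payloads[device] = payload
    return payloads


# Placeholder strings stand in for per-frame transfer matrices.
payloads = build_payloads(
    "stream-1",
    {"first": ["M1_f0", "M1_f1"], "third": ["M3_f0", "M3_f1"]},
)
print(sorted(payloads["second_ue"]["matrices"]))  # ['first', 'third']
```

Keeping the routing in a table makes it straightforward to add further target objects and devices to the same remote assistance task without touching the dispatch logic.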
  20. A method for performing remote assistance based on augmented reality at a third user equipment, wherein the method comprises:
    Receiving video information about a third target object, sent by a corresponding network device, and third transfer matrix information corresponding to the third target object in each video frame of the video information;
    Presenting the video information and, according to the third transfer matrix information, superimposing corresponding third marker information on the third target object in each video frame of the video information, wherein the third marker information includes operation instruction information of a second user on the third target object through a second user equipment;
    Wherein the video information is captured in real time by a camera device in a first user equipment, and the first user equipment, the third user equipment, and the second user equipment belong to the same remote assistance task, with the first user equipment and the third user equipment each receiving remote assistance from the second user equipment.
  21. A method for performing remote assistance based on augmented reality at a second user equipment, wherein the method comprises:
    Receiving video information about a first target object, sent by a corresponding network device, and first transfer matrix information corresponding to the first target object in each video frame of the video information;
    Presenting the video information and, according to the first transfer matrix information, superimposing corresponding first marker information on the first target object in each video frame of the video information, wherein the first marker information includes operation instruction information of a second user on the first target object through the second user equipment;
    Wherein the video information is captured in real time by a camera device in a first user equipment that belongs to the same remote assistance task as the second user equipment, or is reconstructed based on real-time video information about the first target object captured by the camera device and other video information of the first target object.
  22. The method according to claim 21, wherein the method further comprises:
    Receiving third transfer matrix information, sent by the network device, corresponding to the third target object in each video frame of the video information;
    While presenting the video information, superimposing corresponding third marker information on the third target object in each video frame of the video information according to the third transfer matrix information, wherein the third marker information includes operation instruction information of the second user on the third target object through the second user equipment.
  23. A method for performing remote assistance based on augmented reality at a network device, wherein the method comprises:
    Receiving video information about a target object sent by a first user equipment, wherein the video information includes video captured by a camera device in the first user equipment;
    Determining transfer matrix information corresponding to the target object in each video frame of the video information by performing a target tracking operation on the target object in the video information;
    Adding corresponding marker information to each video frame of the video information according to the transfer matrix information, wherein the marker information remains superimposed on the target object in each video frame of the video information, and the marker information includes operation instruction information of a second user on the target object sent by a corresponding second user equipment;
    Sending the edited video information to the first user equipment and to the second user equipment that belongs to the same remote assistance task as the first user equipment.
  24. A method for performing remote assistance based on augmented reality, wherein the method comprises:
    A first user equipment captures video information about a target object in real time through a camera device in the first user equipment, determines transfer matrix information corresponding to the target object in each video frame of the video information by performing a target tracking operation on the target object in the video information, and superimposes corresponding marker information on the target object according to the transfer matrix information, wherein the marker information includes operation instruction information of a second user on the target object sent by a corresponding second user equipment;
    The first user equipment sends the video information to the second user equipment;
    The second user equipment receives and presents the video information, and keeps the corresponding marker information superimposed on the target object in each video frame of the video information, wherein the marker information includes operation instruction information of the second user on the target object through the second user equipment.
  25. A method for performing remote assistance based on augmented reality, wherein the method comprises:
    A first user equipment captures video information about a first target object in real time through a camera device in the first user equipment, and sends the video information to a corresponding network device;
    The network device receives the video information, determines first transfer matrix information corresponding to the first target object in each video frame of the video information by performing a target tracking operation on the first target object in the video information, sends the first transfer matrix information to the first user equipment, and sends the video information and the first transfer matrix information to a second user equipment that belongs to the same remote assistance task as the first user equipment;
    The first user equipment receives the first transfer matrix information and, according to the first transfer matrix information, superimposes corresponding first marker information on the first target object, wherein the first marker information includes operation instruction information of a second user on the first target object sent by the corresponding second user equipment;
    The second user equipment receives the video information and the first transfer matrix information, presents the video information, and superimposes the corresponding first marker information on the first target object in each video frame of the video information according to the first transfer matrix information, wherein the video information is captured in real time by the camera device in the first user equipment that belongs to the same remote assistance task as the second user equipment, or is reconstructed based on real-time video information about the first target object captured by the camera device and other video information of the first target object.
  26. A method for performing remote assistance based on augmented reality, wherein the method comprises:
    A first user equipment captures video information about a first target object in real time through a camera device in the first user equipment, and sends the video information to a corresponding network device;
    The network device receives the video information, determines first transfer matrix information corresponding to the first target object in each video frame of the video information by performing a target tracking operation on the first target object in the video information, and sends the first transfer matrix information to the first user equipment;
    The first user equipment receives the first transfer matrix information and, according to the first transfer matrix information, superimposes corresponding first marker information on the first target object, wherein the first marker information includes operation instruction information of a second user on the first target object sent by a corresponding second user equipment;
    The network device determines third transfer matrix information corresponding to a third target object in each video frame of the video information by performing a target tracking operation on the third target object in the video information, wherein the third target object and the first target object belong to the same remote assistance task;
    The network device sends the video information and the third transfer matrix information to a third user equipment corresponding to the third target object in the remote assistance task, and sends the video information, the first transfer matrix information, and the third transfer matrix information to the second user equipment that belongs to the same remote assistance task as the first user equipment;
    The third user equipment receives the video information and the third transfer matrix information;
    The third user equipment presents the video information and, according to the third transfer matrix information, superimposes corresponding third marker information on the third target object in each video frame of the video information;
    The second user equipment receives the video information, the first transfer matrix information, and the third transfer matrix information and, while presenting the video information, superimposes the corresponding first marker information on the first target object in each video frame of the video information according to the first transfer matrix information, and superimposes the corresponding third marker information on the third target object in each video frame of the video information according to the third transfer matrix information.
  27. A first user equipment for performing remote assistance based on augmented reality, wherein the equipment comprises:
    A real-time capture module, configured to capture video information about a target object in real time through a camera device in the first user equipment;
    A target tracking module, configured to determine transfer matrix information corresponding to the target object in each video frame of the video information by performing a target tracking operation on the target object in the video information;
    An overlay display module, configured to superimpose corresponding marker information on the target object according to the transfer matrix information, wherein the marker information includes operation instruction information of a second user on the target object sent by a corresponding second user equipment.
  28. The equipment according to claim 27, wherein the equipment further comprises a camera control module configured to:
    Receive camera control instruction information of the second user for the camera device, sent by the second user equipment;
    Adjust camera parameter information of the camera device according to the camera control instruction information;
    Capture video information about the target object in real time through the adjusted camera device;
    Send the video information captured by the adjusted camera device to the second user equipment.
  29. The equipment according to claim 27 or 28, wherein the equipment further comprises a marker acquisition module configured to:
    Capture image information about the target object in real time through the camera device in the first user equipment;
    Send the image information to the corresponding second user equipment;
    Receive marker information about the target object, wherein the marker information includes operation instruction information of the second user on the target object in the image information, sent by the second user equipment;
    Superimpose the marker information on the target object;
    Wherein the real-time capture module is configured to:
    Capture video information about the target object in real time through the camera device.
  30. A second user equipment for performing remote assistance based on augmented reality, wherein the equipment comprises:
    A video receiving module, configured to receive video information about a target object that is sent by a corresponding first user equipment and captured in real time by a camera device in the first user equipment;
    A video presentation module, configured to present the video information and keep corresponding marker information superimposed on the target object in each video frame of the video information, wherein the marker information includes operation instruction information of a second user on the target object through the second user equipment.
  31. A first user equipment for performing remote assistance based on augmented reality, wherein the equipment comprises:
    A real-time capture module, configured to capture video information about a first target object in real time through a camera device in the first user equipment;
    A video sending module, configured to send the video information to a corresponding network device;
    A transfer matrix receiving module, configured to receive first transfer matrix information, sent by the network device, corresponding to the first target object in each video frame of the video information;
    An overlay display module, configured to superimpose corresponding first marker information on the first target object according to the first transfer matrix information, wherein the first marker information includes operation instruction information of a second user on the first target object sent by a corresponding second user equipment.
  32. A network device for performing remote assistance based on augmented reality, wherein the device comprises:
    A video receiving module, configured to receive video information about a first target object sent by a first user equipment, wherein the video information is captured in real time by a camera device in the first user equipment;
    A target tracking module, configured to determine first transfer matrix information corresponding to the first target object in each video frame of the video information by performing a target tracking operation on the first target object in the video information;
    A first sending module, configured to send the first transfer matrix information to the first user equipment;
    A second sending module, configured to send the video information and the first transfer matrix information to a second user equipment that belongs to the same remote assistance task as the first user equipment.
  33. The device according to claim 32, wherein the target tracking module is configured to:
    Reconstruct video information of the first target object according to the video information and other video information of the first target object;
    Determine the first transfer matrix information corresponding to the first target object in each video frame of the video information by performing a target tracking operation on the first target object in the reconstructed video information.
  34. The device according to claim 32, wherein the device further comprises a third sending module configured to:
    Determine third transfer matrix information corresponding to a third target object in each video frame of the video information by performing a target tracking operation on the third target object in the video information, wherein the third target object and the first target object belong to the same remote assistance task;
    Send the video information and the third transfer matrix information to a third user equipment corresponding to the third target object in the remote assistance task;
    Wherein the second sending module is configured to:
    Send the video information, the first transfer matrix information, and the third transfer matrix information to the second user equipment that belongs to the same remote assistance task as the first user equipment.
  35. A third user equipment for performing remote assistance based on augmented reality, wherein the equipment comprises:
    A receiving module, configured to receive video information about a third target object, sent by a corresponding network device, and third transfer matrix information corresponding to the third target object in each video frame of the video information;
    A presentation module, configured to present the video information and, according to the third transfer matrix information, superimpose corresponding third marker information on the third target object in each video frame of the video information, wherein the third marker information includes operation instruction information of a second user on the third target object through a second user equipment;
    Wherein the video information is captured in real time by a camera device in a first user equipment, and the first user equipment, the third user equipment, and the second user equipment belong to the same remote assistance task, with the first user equipment and the third user equipment each receiving remote assistance from the second user equipment.
  36. A second user equipment for performing remote assistance based on augmented reality, wherein the equipment comprises:
    A receiving module, configured to receive video information about a first target object, sent by a corresponding network device, and first transfer matrix information corresponding to the first target object in each video frame of the video information;
    A presentation module, configured to present the video information and, according to the first transfer matrix information, superimpose corresponding first marker information on the first target object in each video frame of the video information, wherein the first marker information includes operation instruction information of a second user on the first target object through the second user equipment;
    Wherein the video information is captured in real time by a camera device in a first user equipment that belongs to the same remote assistance task as the second user equipment, or is reconstructed based on real-time video information about the first target object captured by the camera device and other video information of the first target object.
  37. A network device for performing remote assistance based on augmented reality, wherein the device comprises:
    A video receiving module, configured to receive video information about a target object sent by a first user equipment, wherein the video information includes video captured by a camera device in the first user equipment;
    A target tracking module, configured to determine transfer matrix information corresponding to the target object in each video frame of the video information by performing a target tracking operation on the target object in the video information;
    A marker adding module, configured to add corresponding marker information to each video frame of the video information according to the transfer matrix information, wherein the marker information remains superimposed on the target object in each video frame of the video information, and the marker information includes operation instruction information of a second user on the target object sent by a corresponding second user equipment;
    A video sending module, configured to send the edited video information to the first user equipment and to the second user equipment that belongs to the same remote assistance task as the first user equipment.
  38. A system for performing remote assistance based on augmented reality, wherein the system comprises the first user equipment according to any one of claims 27 to 29 and the second user equipment according to claim 30.
  39. A system for performing remote assistance based on augmented reality, wherein the system comprises the first user equipment according to claim 31, the second user equipment according to claim 36, and the network device according to any one of claims 32 to 34.
  40. A system for performing remote assistance based on augmented reality, wherein the system comprises the first user equipment according to claim 31, the second user equipment according to claim 36, the third user equipment according to claim 35, and the network device according to claims 32 to 34.
  41. A first user equipment for performing remote assistance based on augmented reality, wherein the equipment comprises:
    A processor; and
    A memory arranged to store computer-executable instructions which, when executed, cause the processor to perform the operations of the method according to any one of claims 1 to 23.
  42. A computer-readable medium comprising instructions which, when executed, cause a system to perform the operations of the method according to any one of claims 1 to 23.
PCT/CN2018/121729 2018-05-29 2018-12-18 Method and equipment for performing remote assistance on the basis of augmented reality WO2019227905A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN201810533512.2 2018-05-29
CN201810533512.2A CN108769517B (en) 2018-05-29 2018-05-29 Method and equipment for remote assistance based on augmented reality

Publications (1)

Publication Number Publication Date
WO2019227905A1 (en) 2019-12-05

Family

ID=64003881

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2018/121729 WO2019227905A1 (en) 2018-05-29 2018-12-18 Method and equipment for performing remote assistance on the basis of augmented reality

Country Status (2)

Country Link
CN (1) CN108769517B (en)
WO (1) WO2019227905A1 (en)

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112689151A (en) * 2020-12-07 2021-04-20 深圳盈天下视觉科技有限公司 Live broadcast method and device, computer equipment and storage medium
US11010975B1 (en) 2018-03-06 2021-05-18 Velan Studios, Inc. Remote camera augmented reality system
CN114187509A (en) * 2021-11-30 2022-03-15 北京百度网讯科技有限公司 Object positioning method and device, electronic equipment and storage medium

Families Citing this family (13)

Publication number Priority date Publication date Assignee Title
CN108769517B (en) * 2018-05-29 2021-04-16 亮风台(上海)信息科技有限公司 Method and equipment for remote assistance based on augmented reality
CN109459029B (en) * 2018-11-22 2021-06-29 亮风台(上海)信息科技有限公司 Method and equipment for determining navigation route information of target object
CN109656259A (en) * 2018-11-22 2019-04-19 亮风台(上海)信息科技有限公司 Method and equipment for determining image position information of target object
CN109669657B (en) * 2018-12-26 2023-06-02 亮风台(上海)信息科技有限公司 Method and equipment for conducting remote document collaboration
CN116866336A (en) * 2019-03-29 2023-10-10 亮风台(上海)信息科技有限公司 Method and equipment for performing remote assistance
CN110136268B (en) * 2019-04-26 2023-12-05 广东电网有限责任公司广州供电局 Cable accessory manufacturing guiding system and method
CN110266992A (en) * 2019-06-24 2019-09-20 苏芯物联技术(南京)有限公司 Remote video interaction system and method based on augmented reality
CN110728756B (en) * 2019-09-30 2024-02-09 亮风台(上海)信息科技有限公司 Remote guidance method and device based on augmented reality
CN110751735B (en) * 2019-09-30 2024-02-09 亮风台(上海)信息科技有限公司 Remote guidance method and device based on augmented reality
CN110944139B (en) * 2019-11-29 2022-04-22 维沃移动通信有限公司 Display control method and electronic equipment
CN111050112A (en) * 2020-01-10 2020-04-21 北京首翼弘泰科技有限公司 Method for remote operation command or guidance by displaying mark on screen
CN113885700A (en) * 2021-09-03 2022-01-04 广东虚拟现实科技有限公司 Remote assistance method and device
CN115439635B (en) * 2022-06-30 2024-04-26 亮风台(上海)信息科技有限公司 Method and equipment for presenting marking information of target object

Citations (5)

Publication number Priority date Publication date Assignee Title
CN107247510A (en) * 2017-04-27 2017-10-13 成都理想境界科技有限公司 Social method, terminal, server and system based on augmented reality
CN107493228A (en) * 2017-08-29 2017-12-19 北京易讯理想科技有限公司 Social interaction method and system based on augmented reality
CN107590453A (en) * 2017-09-04 2018-01-16 腾讯科技(深圳)有限公司 Processing method, apparatus and device for augmented reality scene, and computer-readable storage medium
CN107765842A (en) * 2016-08-23 2018-03-06 深圳市掌网科技股份有限公司 Augmented reality method and system
CN108769517A (en) * 2018-05-29 2018-11-06 亮风台(上海)信息科技有限公司 Method and equipment for remote assistance based on augmented reality

Family Cites Families (4)

Publication number Priority date Publication date Assignee Title
JP5776201B2 (en) * 2011-02-10 2015-09-09 ソニー株式会社 Information processing apparatus, information sharing method, program, and terminal apparatus
EP2818948B1 (en) * 2013-06-27 2016-11-16 ABB Schweiz AG Method and data presenting device for assisting a remote user to provide instructions
CN106339094B (en) * 2016-09-05 2019-02-26 山东万腾电子科技有限公司 Interactive remote expert cooperation examination and repair system and method based on augmented reality
CN107172390A (en) * 2017-05-12 2017-09-15 广州市和佳电子科技有限公司 Visualization system with smart glasses as the terminal platform, and implementation method

Patent Citations (5)

Publication number Priority date Publication date Assignee Title
CN107765842A (en) * 2016-08-23 2018-03-06 深圳市掌网科技股份有限公司 Augmented reality method and system
CN107247510A (en) * 2017-04-27 2017-10-13 成都理想境界科技有限公司 Social method, terminal, server and system based on augmented reality
CN107493228A (en) * 2017-08-29 2017-12-19 北京易讯理想科技有限公司 Social interaction method and system based on augmented reality
CN107590453A (en) * 2017-09-04 2018-01-16 腾讯科技(深圳)有限公司 Processing method, apparatus and device for augmented reality scene, and computer-readable storage medium
CN108769517A (en) * 2018-05-29 2018-11-06 亮风台(上海)信息科技有限公司 Method and equipment for remote assistance based on augmented reality

Cited By (4)

Publication number Priority date Publication date Assignee Title
US11010975B1 (en) 2018-03-06 2021-05-18 Velan Studios, Inc. Remote camera augmented reality system
CN112689151A (en) * 2020-12-07 2021-04-20 深圳盈天下视觉科技有限公司 Live broadcast method and device, computer equipment and storage medium
CN112689151B (en) * 2020-12-07 2023-04-18 深圳盈天下视觉科技有限公司 Live broadcast method and device, computer equipment and storage medium
CN114187509A (en) * 2021-11-30 2022-03-15 北京百度网讯科技有限公司 Object positioning method and device, electronic equipment and storage medium

Also Published As

Publication number Publication date
CN108769517A (en) 2018-11-06
CN108769517B (en) 2021-04-16

Similar Documents

Publication Publication Date Title
WO2019227905A1 (en) Method and equipment for performing remote assistance on the basis of augmented reality
CN107491174B (en) Method, device and system for remote assistance and electronic equipment
JP6165846B2 (en) Selective enhancement of parts of the display based on eye tracking
CN108304075B (en) Method and device for performing man-machine interaction on augmented reality device
US20160358383A1 (en) Systems and methods for augmented reality-based remote collaboration
US8917908B2 (en) Distributed object tracking for augmented reality application
WO2022166872A1 (en) Special-effect display method and apparatus, and device and medium
US11288871B2 (en) Web-based remote assistance system with context and content-aware 3D hand gesture visualization
CN110751735B (en) Remote guidance method and device based on augmented reality
US11222409B2 (en) Image/video deblurring using convolutional neural networks with applications to SFM/SLAM with blurred images/videos
CN113741698A (en) Method and equipment for determining and presenting target mark information
CN110728756B (en) Remote guidance method and device based on augmented reality
CN109656363B (en) Method and equipment for setting enhanced interactive content
JP7422876B2 (en) Display method and device based on augmented reality, and storage medium
US20200125833A1 (en) Method and apparatus for positioning face feature points
CN110111241B (en) Method and apparatus for generating dynamic image
US10026509B2 (en) Low bandwidth media stream transmission
WO2020253716A1 (en) Image generation method and device
US20210158490A1 (en) Joint rolling shutter correction and image deblurring
CN113918070A (en) Synchronous display method and device, readable storage medium and electronic equipment
US20190058861A1 (en) Apparatus and associated methods
CN109636922B (en) Method and device for presenting augmented reality content
CN109816791B (en) Method and apparatus for generating information
CN110619615A (en) Method and apparatus for processing image
CN114143568A (en) Method and equipment for determining augmented reality live image

Legal Events

Date Code Title Description
121 EP: the EPO has been informed by WIPO that EP was designated in this application (Ref document number: 18920779; Country of ref document: EP; Kind code of ref document: A1)
NENP Non-entry into the national phase (Ref country code: DE)
122 EP: PCT application non-entry in European phase (Ref document number: 18920779; Country of ref document: EP; Kind code of ref document: A1)