WO2014075237A1

WO2014075237A1 - Method for achieving augmented reality, and user equipment

Info

Publication number: WO2014075237A1
Application number: PCT/CN2012/084581
Authority: WO
Inventors: 刘峥
Original assignee: 华为技术有限公司
Priority date: 2012-11-14
Filing date: 2012-11-14
Publication date: 2014-05-22
Also published as: CN103959220A; CN103959220B

Abstract

The present invention relates to the field of information technology, and particularly to a method for achieving augmented reality, and a user equipment. The method for achieving augmented reality is provided. When a user undergoes an augmented reality experience, a UE stores virtual content information and a captured video stream via the context of the augmented reality; and after the augmented reality experience has ended, when the user needs to undergo the augmented reality experience once again, the UE acquires virtual reality information according to the stored virtual content information, and superposes the acquired virtual reality information onto each video frame in the video stream for display, so that the user can also still experience the same augmented reality experience once again anytime after experiencing the augmented reality experience.

Description

TECHNICAL FIELD The present invention relates to the field of information technology (Information Technology, IT: IT), and in particular to a method and user equipment for implementing augmented reality.

Background technique

Augmented Reality (AR) technology is an emerging human-computer interaction technology developed on the basis of virtual reality technology. It uses visual technology to apply virtual reality information to the real world. The virtual reality information acquired in the real world is superimposed on the real world image, and allows users to interact with the augmented reality application, which expands the user's perception of the real world. With the popularity of intelligent user equipment (User Equipment, UE), AR technology has developed rapidly in recent years.

In the existing AR application, the user equipment can capture the video stream through the camera, use the captured video stream as real world information, and obtain virtual reality information related to the real world information from the server side, and superimpose the acquired virtual reality information. On the captured video stream, and display the superimposed video stream.

Specifically, the UE may send a request for acquiring virtual reality information to the server side after the video stream is captured, where the request for acquiring the virtual reality information includes information about a key frame captured by the UE or a location of the UE, where the key frame is The gesture image of the tracked object is included; after the virtual reality information is obtained according to the key frame captured by the UE or the location of the UE, the virtual reality information is sent to the UE, and the UE superimposes the received virtual reality information. Displayed on each frame of the captured video stream. The virtual reality information received by the UE is related to the tracked object in the real world or to the location where the UE is located. The AR experience begins when the UE overlays the received virtual reality information onto the captured video stream.

Through analysis of the prior art, the inventors believe that the prior art has at least the following problems: The virtual reality information received by the UE is related to the real world. Specifically, the virtual reality information received by the UE is related to the tracked object in the real world or the location where the UE is located. After the end of the AR experience, if the user needs After experiencing the same AR experience again, the user needs to go back to the original real world. For example, the user is located at location A. When the user queries the restaurant near location A with the UE, the server side will return to near location A. The information of the restaurant, the UE superimposes the obtained restaurant information onto the captured video frame, and if the user later wants to experience the same AR experience, the user is required to return to the location A again and capture the same video frame.

Summary of the invention

To overcome the deficiencies of the prior art, an object of embodiments of the present invention is to provide a method and user equipment for implementing augmented reality, so that after the end of the AR experience, the user can also experience the same AR experience again at any time.

In a first aspect, an embodiment of the present invention provides a method for implementing augmented reality, including:

The user equipment stores an augmented reality context when the user experiences an augmented reality experience, the augmented reality context including virtual content information received by the user equipment from the server side and a video stream captured by the user equipment;

When the user needs to experience the augmented reality experience again, the user equipment acquires virtual reality information according to the stored virtual content information;

The user equipment sequentially acquires the stored video frames in the video stream according to the sequence in which the video frames are captured, and superimposes the acquired virtual reality information on the acquired video frames, and displays the overlay force P. Video frame.

In a first possible implementation manner of the first aspect, the user sets a video frame to capture a correspondence between a timestamp of the captured video frame and the tracked object information, and a posture of the tracked object. And removing an image from the captured video frame, updating the panoramic image according to the video frame after removing the posture image, and storing a correspondence between the time stamp and the background information;

The user equipment stores a standard image of the tracked object when capturing a video frame, and in the When the user equipment stops capturing video frames, storing the panorama;

The tracked object information includes location information of the gesture image in the captured video frame, and the background information includes location information of the captured video frame in the panoramic image.

In conjunction with the first possible implementation of the first aspect, in a second possible implementation manner of the first aspect, the tracked object information further includes a single image of the gesture image on the captured video frame And the background information further includes a deflection angle of the captured video frame relative to the panoramic image deflection.

In conjunction with the second possible implementation of the first aspect, in a third possible implementation manner of the first aspect, the user equipment acquires the stored standard image and the panoramic image;

The user equipment sequentially acquires the timestamp of the video frame to be displayed in the order in which the video frames are captured, and obtains the tracked object information and the background information corresponding to the acquired timestamp according to the obtained timestamp. And performing affine transformation on the obtained standard image according to the obtained homography matrix included in the tracked object information, obtaining a posture image of the tracked object, and acquiring location information according to the obtained background information. And the deflection angle, the acquired panoramic image is obtained according to the displayed resolution, and the background image is obtained, and the obtained posture image is superimposed on the cut background image according to the obtained position information of the tracked object information. The video frame currently to be displayed is generated.

With reference to the third possible implementation manner of the first aspect, in a fourth possible implementation manner of the first aspect, the virtual content information includes an identifier of the tracked object corresponding to the virtual reality information, The superimposing the acquired virtual reality information on the acquired video frame includes: when the virtual content information includes the identifier of the tracked object, the user equipment is according to the posture of the tracked object And superimposing the acquired virtual reality information on the currently displayed video frame in a position in the video frame to be displayed currently.

In a fifth possible implementation manner of the first aspect, the user equipment sequentially captures a video frame, updates a panoramic image according to the captured video frame, and stores a timestamp and background information of the captured video frame. Correspondence between them;

And when the user equipment stops capturing a video frame, the user equipment stores the panoramic image; wherein the background information includes location information of the captured video frame in the panoramic image. In conjunction with the fifth possible implementation of the first aspect, in a sixth possible implementation of the first aspect, the background information further includes a deflection angle of the captured video frame relative to the panoramic image deflection.

In conjunction with the sixth possible implementation of the first aspect, in a seventh possible implementation manner of the first aspect, the user equipment acquires the stored panoramic image;

The user equipment obtains timestamps of the current video frame to be displayed in sequence according to the sequence in which the video frames are captured, and obtains background information corresponding to the obtained timestamp according to the obtained timestamp, according to the obtained information. The position information and the deflection angle included in the background information are intercepted, and the acquired panoramic image is intercepted according to the displayed resolution to generate the current video frame to be displayed.

With reference to the seventh possible implementation of the first aspect, in an eighth possible implementation manner of the first aspect, the virtual content information includes location information corresponding to the virtual reality information, where the background information further includes And the information about the location of the user equipment, the superimposing the acquired virtual reality information on the acquired video frame, including:

And the user equipment superimposes the acquired virtual reality information on the currently displayed video frame according to the information about the location of the user equipment included in the background information and the location information included in the virtual content information.

In a second aspect, an embodiment of the present invention provides a user equipment, including:

a receiving unit, configured to receive virtual content information returned from the server side;

a video stream capturing unit, configured to capture a video stream;

a storage unit, configured to store an augmented reality context when the user experiences an augmented reality experience, where the augmented reality context includes the virtual content information received by the receiving unit and the video stream captured by the video stream capturing unit; a virtual reality information acquiring unit, configured to acquire virtual reality information according to the virtual content information stored by the storage unit when the user needs to experience the augmented reality experience again;

a video frame acquiring unit, configured to sequentially acquire video frames in the video stream stored by the storage unit according to a sequence in which the video frames are captured;

a superimposing unit, configured to superimpose the virtual reality information acquired by the virtual reality information acquiring unit on the video frame acquired by the video frame acquiring unit;

a display unit, configured to display a video frame superimposed by the superposition unit.

In a first possible implementation manner of the second aspect, the video stream capturing unit is specifically configured to obtain a video frame according to a second time;

The storage unit is specifically configured to store a correspondence between a timestamp of the video frame captured by the video stream capturing unit and the tracked object information, and remove the posture image of the tracked object from the captured video frame, according to the Removing the video frame after the gesture image to update the panorama, and storing a correspondence between the time stamp and the background information;

And storing a standard image of the tracked object when the video stream capturing unit captures a video frame, and storing the panoramic image when the video stream capturing unit stops capturing a video frame;

With reference to the first possible implementation of the second aspect, in a second possible implementation manner of the second aspect, the tracked object information further includes a single image of the gesture image on the captured video frame And the background information further includes a deflection angle of the captured video frame relative to the panoramic image deflection.

In conjunction with a second possible implementation of the second aspect, a third possible implementation in the second aspect, and the panoramic view;

For sequentially obtaining the video frame to be displayed according to the order in which the video frames are captured. And obtaining, according to the obtained timestamp, the tracked object information and the background information stored by the storage unit corresponding to the obtained timestamp, and the homography matrix included according to the obtained tracked object information Performing affine transformation on the obtained standard image to obtain a posture image of the tracked object, and intercepting the acquired panoramic image according to the displayed resolution according to the obtained position information and the deflection angle of the background information. Obtaining a background image, and superimposing the obtained posture image on the truncated background image according to the obtained position information included in the tracked object information, and generating the current video frame to be displayed.

With reference to the third possible implementation manner of the second aspect, in a fourth possible implementation manner of the second aspect, the virtual content information that is received by the receiving unit includes the Tracking the identifier of the object, the superimposing unit is specifically configured to: according to the position of the image of the tracked object in the current video frame to be displayed, when the virtual content information includes the identifier of the tracked object And superimposing the virtual reality information acquired by the virtual reality information acquiring unit on the current video frame to be displayed generated by the video frame acquiring unit.

In a fifth possible implementation manner of the second aspect, the video stream capturing unit is specifically configured to obtain a video frame according to a second time;

The storage unit is specifically configured to update a panoramic image according to a video frame captured by the video stream capturing unit, and store a correspondence between a timestamp of the captured video frame and background information;

And storing the panoramic image when the video stream capturing unit stops capturing video frames; wherein the background information includes location information of the captured video frame in the panoramic image. In conjunction with the fifth possible implementation of the second aspect, in a sixth possible implementation of the second aspect, the background information further includes a deflection angle of the captured video frame relative to the panoramic image deflection.

In conjunction with the sixth possible implementation of the second aspect, the seventh possible implementation in the second aspect

as well as And obtaining, according to the sequence of the video frames, the timestamps of the currently displayed video frames, and obtaining the background information corresponding to the acquired timestamps according to the acquired timestamps, according to the obtained background. The position information included in the information and the deflection angle are intercepted according to the displayed resolution to generate the currently displayed video frame.

With reference to the seventh possible implementation manner of the second aspect, in the eighth possible implementation manner of the second aspect, the virtual content information received by the receiving unit includes location information corresponding to the virtual reality information, The background information further includes information about a location of the user equipment, where the superimposing unit is specifically configured to: according to information about a location of the user equipment included in the background information, and location information included in the virtual content information, The virtual reality information acquired by the virtual reality information acquiring unit is superimposed on the current video frame to be displayed generated by the video frame acquiring unit.

A method and a user equipment for implementing an augmented reality experience are provided by an embodiment of the present invention. When a user experiences an augmented reality experience, the UE stores the virtual content information and the captured video stream through the augmented reality context, after the augmented reality experience ends. The UE acquires virtual reality information according to the stored virtual content information, and superimposes the acquired virtual reality information on each video frame in the video stream for display. , enabling the user to experience the same augmented reality experience again at any time after experiencing the augmented reality experience.

DRAWINGS

In order to more clearly illustrate the technical solutions of the embodiments of the present invention, the drawings to be used in the embodiments or the description of the prior art will be briefly described below. Obviously, the drawings in the following description are only the present invention. For some embodiments, other drawings may be obtained from those skilled in the art without any inventive effort.

FIG. 1 is a schematic structural diagram of a system for implementing augmented reality according to an embodiment of the present invention;

2 is a flowchart of a method for implementing augmented reality according to an embodiment of the present invention;

FIG. 3 is a flowchart of another method for implementing augmented reality according to an embodiment of the present invention; FIG. 4 is a flowchart of still another method for implementing augmented reality according to an embodiment of the present invention; FIG. 5 is a structural diagram of a user equipment according to an embodiment of the present invention;

FIG. 6 is a structural diagram of another user equipment according to an embodiment of the present invention. detailed description

BRIEF DESCRIPTION OF THE DRAWINGS The technical solutions in the embodiments of the present invention are clearly and completely described in the following description of the embodiments of the present invention. It is obvious that the described embodiments are only a part of the embodiments of the present invention, and not all embodiments. All other embodiments obtained by a person of ordinary skill in the art based on the embodiments of the present invention without creative work are within the scope of the present invention.

As shown in FIG. 1 , it is a system architecture diagram for implementing augmented reality according to an embodiment of the present invention. When the user determines that the user needs to experience the augmented reality experience, the UE sends a request for acquiring the virtual content information to the server side, where the request for acquiring the virtual content information includes information identifying the tracked object or information about the location of the UE, The information identifying the tracked object includes the gesture image of the tracked object or the feature data of the gesture image of the tracked object, and the server side sends the virtual to the UE according to the request for acquiring the virtual content information. Content information, after receiving the virtual content information, the UE stores the virtual content information and the video stream captured by the UE. After the augmented reality experience ends, if the user determines that the augmented reality experience needs to be experienced again, the UE acquires virtual reality information according to the stored virtual content information, and sequentially according to the sequence in which the video frames are captured. Obtaining the stored video frame in the video stream, superimposing the acquired virtual reality information on the acquired video frame, and displaying the superimposed video frame.

The embodiment of the present invention does not limit the type of the UE. By way of example and not limitation, the UE may include a smart phone, a personal computer, a tablet, glasses with augmented reality function, or other terminal with augmented reality function.

It should be noted that, the embodiment of the present invention does not limit the composition of the server side. By way of example and not limitation, the server side is composed of at least one server, and the server in the server side may include a presentation layer server. , application layer server and database server.

Based on the system architecture diagram shown in FIG. 1 , an embodiment of the present invention provides a method for implementing augmented reality. As shown in FIG. 2, the method includes:

S201: The UE stores an augmented reality context when the user experiences an augmented reality experience, where the enhanced real context includes the virtual content information received by the UE from the server side and the video stream captured by the UE;

It should be noted that, the stored video stream is a series of consecutive video frames,

The UE uses the video stream as real world information when the user experiences the augmented reality experience, the virtual content information includes virtual reality information or storage location information of virtual reality information; and virtual reality information that the UE will acquire The augmented reality experience begins when superimposed onto the captured video frame for display;

Wherein, when the tracked object needs to be enhanced, that is, when the video stream captured by the UE includes the attitude image of the tracked object, the UE may image the tracked object and remove the pose image. The background image is stored separately; when the current location in the real environment needs to be enhanced, that is, when the video stream captured by the UE does not include the gesture image of the tracked object, the video frame captured by the UE may be directly directly For the background image in the video frame captured by the UE, the UE may merge the background images in the captured video frame to generate a panorama, and the UE may be configured according to the background image. The position in the panorama, restoring the background image;

Specifically, the UE may store the captured video stream in any of the following manners: In a first mode, the video stream captured by the UE includes a posture image of the tracked object: the UE sequentially captures a video frame, and stores the captured video. Corresponding relationship between the time stamp of the video frame and the tracked object information, removing the posture image of the tracked object from the captured video frame, updating the panorama according to the video frame after removing the posture image, and storing the image Corresponding relationship between the timestamp and the background information; the UE storing the standard image of the tracked object when capturing the video frame, and storing the panorama when the UE stops capturing the video frame;

The timestamp is used to indicate the time at which the video frame is captured, by way of example and not limitation. The timestamp may be a time when the video frame is captured with respect to the start of the augmented reality experience; the tracked object information includes location information of the pose image in the captured video frame, the background information including the Position information of the captured video frame in the panorama and;

The tracked object information may further include a homography matrix of the gesture image on the captured video frame, and the background information may further include the captured video frame being deflected relative to the panoramic image. Deflection angle

It should be noted that the tracked object refers to an object to be tracked in the real world, such as a toy car in the current real world; the attitude image of the tracked object refers to the captured video frame. An image of the tracked object, such as a toy car in the current real world, when capturing a video frame, an image of the toy car in the captured video frame is a pose image of the toy car; The standard image refers to an image captured when the tracked object is horizontally placed on a horizontal plane, when the field of view is perpendicular to the horizontal plane;

In the second mode, the video stream captured by the UE does not include the posture image of the tracked object: the UE sequentially captures the video frame, updates the panorama according to the captured video frame, and stores the timestamp and background of the captured video frame. Corresponding relationship between the information; when the UE stops capturing video frames, the UE stores the panorama;

S202: When the user needs to experience the augmented reality experience again, the UE acquires virtual reality information according to the stored virtual content information.

The UE may obtain virtual reality information in the following manner:

If the virtual content information includes the virtual reality information, the user equipment may directly obtain the virtual reality information; or

If the virtual content information includes the storage location information of the virtual reality information, the user equipment may acquire the virtual reality information according to the storage location information; for example, by way of example and not limitation, the virtual content information a URI (Uniform Resource Identifier) including the virtual reality information, where the UE may be based on a URI of the virtual reality information. Obtaining the virtual reality information;

S203: The UE sequentially acquires the stored video frames in the video stream according to the sequence in which the video frames are captured, and superimposes the acquired virtual reality information on the acquired video frames, and displays the overlay force. After the video frame;

The UE may determine the sequence in which the video frames are captured according to the timestamp of the video frame. When the user needs to experience the augmented reality experience that has been experienced before, the UE needs to acquire. The virtual reality information and the video stream when the augmented reality experience is experienced, and the acquired virtual reality information is superimposed on each frame in the acquired video stream for display; and the first method corresponds to step S201. The method of storing the captured video stream is as follows: the UE acquires the stored standard image and the panoramic image, and sequentially acquires the timestamp of the currently displayed video frame according to the sequence in which the video frames are captured, according to the acquired Obtaining, by the timestamp, the tracked object information and the background information corresponding to the acquired timestamp, and performing affine transformation on the obtained standard image according to the obtained homography matrix included in the tracked object information. Obtaining a posture image of the tracked object, and receiving the bit according to the obtained background information Information and a deflection angle are obtained by intercepting the acquired panoramic image according to the displayed resolution to obtain a background image, and superimposing the obtained posture image on the cut background image according to the obtained position information included in the tracked object information. , generating a video frame to be displayed currently;

Manner 2, corresponding to the method 2 of storing the captured video stream in step S201: the UE acquires the stored panorama, and sequentially acquires the timestamp of the current video frame to be displayed according to the sequence in which the video frames are captured. Obtaining the background information corresponding to the obtained timestamp according to the obtained timestamp, and extracting the acquired panoramic image according to the displayed resolution according to the obtained position information and the deflection angle of the background information, and generating The video frame currently to be displayed.

In this embodiment, when the user starts to experience the augmented reality experience, the UE may also The user operation information is used to describe the interaction between the user and the UE by using the augmented reality context, where the user operation information may include an operation type, an operation parameter, and a timestamp. The time stamp included in the user operation information is used to indicate the moment when the interaction occurs. As an example and not by way of limitation, the time stamp included in the user operation information may be a time when the interaction occurs relative to the start of the augmented reality experience. When the user experiences the augmented reality experience again, the UE may simulate the operation of the user according to the operation type and the operation parameter at a time corresponding to the time stamp included in the user operation information.

It should be noted that, after the UE stores the augmented reality context, the UE may further send the augmented reality context to other UEs, so that other users may also experience the augmented reality experience, thereby enabling the The user can share the augmented reality experience with other users.

A method for implementing augmented reality according to an embodiment of the present invention, when a user experiences an augmented reality experience, the UE stores the virtual content information and the captured video stream through the augmented reality context, after the augmented reality experience ends, when the user When the augmented reality experience needs to be experienced again, the UE acquires virtual reality information according to the stored virtual content information, and superimposes the acquired virtual reality information on each video frame in the video stream for display, so that the After experiencing the augmented reality experience, the user can again experience the same augmented reality experience at any time. Secondly, when the UE captures a video frame that includes the gesture image of the tracked object, the UE will be the tracked object. The gesture image is stored separately from the background image, and by storing location information of the gesture image of the tracked object in the captured video frame and a homography matrix, storing the gesture image of the tracked object, and storing the Position information of the captured video frame in the panorama, storing the background image, from And saving the storage resource of the UE; again, when the video frame captured by the UE does not include the gesture image of the tracked object, the UE uses the captured video frame as a background image, and uses a storage The location information of the captured video frame in the panorama stores the background image, thereby saving storage resources of the UE. As shown in FIG. 3, it is a flowchart of a method for implementing augmented reality according to an embodiment of the present invention. The method is applied to a scene of a captured video stream that includes a gesture image of the tracked object, and the method includes:

S301: When the user determines that the user needs to experience the augmented reality experience, the UE sends information identifying the tracked object to the server side, where the information of the tracked object includes the posture image of the tracked object or the posture image of the tracked object. Characteristic data;

Wherein, by way of example and not limitation, the feature data of the gesture image may be an outline of the gesture image, and the gesture image may be obtained by capturing a video frame;

S302: The UE receives virtual content information sent by the server, where the virtual content information includes virtual reality information or storage location information of virtual reality information.

The virtual content information is obtained by the server side according to the information of the identified object to be tracked. Specifically, the server side stores the feature data of the posture image of the tracked object and the identifier of the tracked object (Identifier) a correspondence between the identifier of the tracked object and the virtual content information, and the server side obtains the posture of the tracked object after obtaining the information of the identified object to be tracked And obtaining, according to the feature data, the identifier of the tracked object, and obtaining virtual content information corresponding to the identifier of the tracked object according to the identifier of the tracked object;

Optionally, the server side stores a correspondence between the feature data of the gesture image of the tracked object and the virtual content information, and the server side obtains the information that identifies the tracked object, and obtains the Tracking feature data of the gesture image of the object, and obtaining virtual content information corresponding to the feature data according to the feature data;

It should be noted that, when the information of the identified object includes the posture image of the tracked object, the server side may use a feature extraction algorithm to process the posture image of the tracked object. Obtaining feature data;

S303: The UE stores the virtual content information.

The UE may store the virtual content information in an augmented reality context;

S304: The UE captures a video frame. The UE may sequentially capture a video frame according to a frame rate of the captured video stream, where the video frame captured by the UE includes a posture image of the tracked object;

It should be noted that, when the UE superimposes the virtual reality information acquired according to the virtual content information onto the captured video frame for display, the augmented reality experience starts;

S305: The UE stores a correspondence between a timestamp of the captured video frame and the tracked object information.

The tracked object information includes location information of the tracked object's pose image in the captured video frame, and the location information of the tracked object's pose image in the captured video frame may be a coordinate of a center point of the gesture image of the tracked object in the captured video frame, the coordinate being determined when the UE tracks the tracked object;

The tracked object information may further include a homography matrix of the gesture image of the tracked object on the captured video frame, and the gesture image of the tracked object is on the captured video frame. The homography matrix may be determined when the UE tracks the tracked object, and the UE may perform affine transformation on the standard image of the tracked object according to the homography matrix to obtain the tracked object. The affine transformation of the standard image of the tracked object means that the standard image of the tracked object is multiplied by the homography matrix;

It should be noted that, after selecting a key point of the tracked object, the UE matches a key point on the captured video frame with a corresponding key point on the standard image to obtain a key point on the captured video frame. The location information and the location information on the standard image, according to the position information of the key point on the captured video frame and the position information on the standard image, the RANSAC (RANdom S Ample Consensus) algorithm can be used to obtain the single Qualitative matrix

The UE may store a correspondence between a timestamp of the captured video frame and the tracked object information in the augmented reality context;

S306: The UE removes the posture image of the tracked object from the captured video frame, updates a video frame with the posture image removed as a background image, and stores the timestamp and Correspondence between background information;

It should be noted that, after the UE removes the posture image of the tracked object from the captured video frame, a background image is obtained, and the panoramic image is updated according to the obtained background image; After the figure, the UE has not created a panorama, the UE may initialize the panorama with the obtained background image. At this time, “update the panorama according to the obtained background image” means “according to the obtained The background image initializes the panorama";

The background information includes location information of the captured video frame in the panoramic view and a deflection angle of the captured video frame relative to the panoramic view deflection;

The location information of the captured video frame in the panorama may be coordinates of a center point of the captured video frame in the panorama, and a center point of the captured video frame is in the panorama The coordinates in the figure may be determined when the UE updates the panorama;

The UE may store a correspondence between a timestamp of the captured video frame and the background information in the augmented reality context;

The UE may determine a deflection angle of the captured video frame relative to the panorama deflection when updating the panorama, and specifically, determining a horizontal line of the captured video frame relative to the panorama The angle of the horizontal rotation of the graph, for example, when the panorama is updated by using a video frame, the video frame is rotated counterclockwise by 30°, and the rotation angle of the video relative to the panorama rotation is 30° counterclockwise;

It should be noted that the operation of updating the panorama may include the following three steps:

1) image registration: determining a portion of the captured video frame that is repeated with the panorama;

Wherein, there is no overlapping portion in the background image, which may be used to expand the panoramic image; by using the repeated portion, location information of the captured video frame in the panoramic image and the captured a deflection angle of the video frame relative to the panorama deflection;

2) image warping: map the panorama to a spherical cluster or a columnar cluster, And splicing the background image on the panoramic image according to a portion of the captured video frame that is overlapped with the panoramic image;

3) image blending: smoothing, chrominance processing and de-ghosting processing of the stitched panorama to improve the rendering quality of the panorama;

S307: The UE determines whether the augmented reality experience is over, and if so, step S308 is performed, otherwise, step S304 is performed;

The UE may store a standard image of the tracked object when capturing a video frame. Specifically, the tracked object may be stored before, after, or simultaneously with any of the steps S304 to S306. a standard image; the UE may generate a pose image of the tracked object according to a homography matrix of the image of the tracked object on a video frame captured by the UE and a standard image of the tracked object;

Wherein, by way of example and not limitation, the server side stores a standard image of the tracked object, and the UE may obtain a standard image of the tracked object from the server side;

It should be noted that, when the augmented reality experience ends, the UE stops capturing video frames.

S308: The UE stores the panorama view.

It should be noted that, when the augmented reality experience ends, the panoramic image stored by the UE is processed according to the background image in the video frame captured by the UE, and the UE may be according to the panoramic image. Restoring a background image of the captured video frame;

S309: After the augmented reality experience ends, when the user needs to experience the enhanced real-life experience again, the UE acquires virtual reality information according to the stored virtual content information.

The UE may obtain the virtual reality information in the following manner:

If the virtual content information includes the virtual reality information, the user equipment directly obtains the virtual reality information; or

If the virtual content information includes storage location information of the virtual reality information, the user Obtaining, by the device, the virtual reality information according to the storage location information;

S310: The UE acquires the stored standard image and the panoramic image.

S311: The UE acquires a timestamp of a video frame to be displayed, and obtains a posture image of the tracked object in the currently displayed video frame according to the obtained time stamp.

Specifically, after acquiring the timestamp of the video frame to be displayed, the UE obtains the tracked object information and the background information corresponding to the acquired timestamp, and according to the obtained homography of the tracked object information. a matrix, performing affine transformation on the obtained standard image to obtain a posture image of the tracked object;

The UE may sequentially acquire timestamps of the video frames to be displayed in sequence according to the sequence in which the video frames are captured;

S312: The UE obtains a background image of the video frame to be currently displayed.

Specifically, the UE intercepts the acquired panoramic image according to the obtained resolution of the background information and the deflection angle, and obtains a background image in the currently displayed video frame.

For example, the UE may generate a horizontal rectangular frame according to the resolution to be displayed. If the angle of the current video frame to be displayed is 30° in the counterclockwise direction with respect to the panorama, the UE will rotate the horizontal rectangular frame counterclockwise. Rotate 30. And according to the position of the current video frame to be displayed in the panorama, the panoramic image is captured by using the rotated rectangular frame to generate a background image in the current video frame to be displayed;

As an example and not by way of limitation, the resolution of the display may be determined by the screen resolution of the UE. For example, if the screen resolution of the UE is 480×320, the UE may intercept the acquired location according to the resolution of 480×320. Panoramic view

S313: The UE generates the video frame to be displayed currently;

Specifically, the UE superimposes the obtained posture image of the tracked object to the cut according to the obtained position information of the posture image of the tracked object included in the tracked object information in the video frame. On the obtained background image, generate a video frame to be displayed currently;

S314: The UE superimposes the acquired virtual reality information on the generated video frame to be displayed, and displays the superimposed video frame.

The virtual content information may further include the identifier of the tracked object corresponding to the virtual reality information, and the UE may superimpose the acquired virtual reality information to the generated current desired display manner. On the video frame:

When the virtual content information includes the identifier of the tracked object, the UE superimposes the acquired virtual reality information according to the position of the gesture image of the tracked object in the current video frame to be displayed. Going to the video frame currently to be displayed;

S315: The UE determines whether the video frame in the stored video stream has been acquired. If yes, the augmented reality experience ends. Otherwise, step S311 is performed.

In the embodiment of the present invention, if the frame rate of the captured video stream is greater than the expected frame rate of the captured video stream, only a part of the video frames in the video stream may be stored. For example, the UE may sample the timestamp of the video frame. The UE stores a video frame corresponding to the timestamp obtained by sampling;

If the frame rate of the video playback is greater than the expected frame rate, the UE may perform an interpolation process, specifically, the timestamp of the video frame to be currently displayed by the UE, and the current video to be displayed. The tracked object information and the background information corresponding to the time stamp of the frame are subjected to interpolation processing.

In the embodiment of the present invention, when the user starts to experience the augmented reality experience, the UE may further store user operation information, where the user operation information is used to describe an interaction between the user and the UE. The user operation information includes an operation type, an operation parameter, and a timestamp, and the time information included in the user operation information is used to indicate a moment when the interaction occurs; when the user experiences the augmented reality experience again, the UE may Simulating the operation of the user according to the operation type and the operation parameter at a time corresponding to the time stamp included in the user operation information;

Wherein, by way of example and not limitation, the interaction between the user and the UE may include any of the following types of operations: Click: For a click operation, the UE needs to store the coordinates of the clicked location and the timestamp when the click operation occurs;

Press and hold: for the hold operation, the UE needs to store the coordinates of the pressed position, the time stamp when the hold operation occurs, and the time during which the hold operation is continued;

Drag: For a drag operation, the UE needs to store the coordinates of the point on the drag path at a certain frequency, and the time stamp dragged to the point.

It should be noted that, after the UE stores the augmented reality context, the UE may send the augmented reality context to other UEs, so that other users may also experience the augmented reality experience, thereby causing the user to The augmented reality experience can be shared with other users.

A method for implementing augmented reality according to an embodiment of the present invention, when a user experiences an augmented reality experience, the UE stores the virtual content information and the captured video stream through the augmented reality context, after the augmented reality experience ends, when the user When the augmented reality experience needs to be experienced again, the UE acquires virtual reality information according to the stored virtual content information, and superimposes the acquired virtual reality information on each video frame in the video stream for display, so that the After experiencing the augmented reality experience, the user can again experience the same augmented reality experience at any time. Secondly, when the UE captures a video frame that includes the gesture image of the tracked object, the UE will be the tracked object. The gesture image is stored separately from the background image, and by storing location information of the gesture image of the tracked object in the captured video frame and a homography matrix, storing the gesture image of the tracked object, and storing the Position information of the captured video frame in the panorama, storing the background image, from And saving the storage resource of the UE; the UE may further add the acquired virtual reality information to the location according to the location of the gesture image of the tracked object in the current video frame to be displayed. Currently on the video frame to be displayed, so that the user can have a better augmented reality experience. FIG. 4 is a flowchart of another method for implementing augmented reality according to an embodiment of the present invention. The method is applied to a scene of a captured video stream that does not include a pose image of a tracked object. In the method, The video frame in the video stream captured by the UE is used as a background image, and the method includes: S401: When the user determines that the augmented reality experience needs to be experienced, the UE sends the information about the location of the UE to the server side.

For example, the UE may obtain information about the location of the UE by using a positioning device, for example, the information of the location of the UE may be obtained by using a GPS (Global Position System) device;

S402: The UE receives virtual content information sent by the server, where the virtual content information includes virtual reality information or storage location information of virtual reality information.

The virtual content information is obtained by the server side according to the information of the location of the UE. Specifically, the server side stores a correspondence between the location information and the virtual content information, where the server side obtains After the information about the location of the UE, the virtual content information is obtained according to the information about the location of the UE;

S403: The UE stores the virtual content information.

S404: The UE captures a video frame.

The UE may sequentially capture video frames according to a frame rate of the captured video stream;

S405: The UE updates the panoramic image as the background image by using the captured video frame, and stores a correspondence between the timestamp of the captured video frame and the background information.

It should be noted that, in this embodiment, the video frame captured by the UE is directly regarded as a background image. For detailed description of this step, refer to step S306, and details are not described herein again.

S406: The UE determines whether the augmented reality experience is over, and if so, step S407 is performed, otherwise, step S404 is performed; It should be noted that, when the augmented reality experience ends, the UE stops capturing video frames;

S407: The UE stores the panorama view.

The UE may store the panorama in the augmented reality context. For detailed description of this step, refer to step S308, and details are not described herein.

S408: After the augmented reality experience ends, when the user needs to experience the enhanced real-life experience again, the UE acquires virtual reality information according to the stored virtual content information.

For detailed description of this step, refer to step S309, and details are not described herein again.

S409: The UE acquires the stored panorama view.

S410: The UE acquires a timestamp of a video frame to be displayed, and obtains the current video frame to be displayed according to the obtained time stamp.

Specifically, after acquiring the timestamp of the video frame to be displayed, the UE obtains background information corresponding to the acquired timestamp, and according to the obtained location information and the deflection angle of the background information, according to the resolution of the display. Rate capturing the acquired panorama to generate a video frame to be currently displayed;

It should be noted that, the UE may sequentially obtain the timestamp of the video frame to be displayed in sequence according to the sequence in which the video frames are captured;

S411: The UE superimposes the acquired virtual reality information on the generated video frame to be displayed, and displays the superimposed video frame.

The virtual content information may further include location information corresponding to the virtual reality information, where the background information further includes information about a location of the UE, and the UE may acquire the virtual reality in the following manner. The information is superimposed on the generated video frame to be displayed: the UE superimposes the acquired virtual reality information according to the information about the location of the UE and the location information included in the virtual content information included in the background information. To the generated video frame currently to be displayed; S412: The UE determines whether the video frame in the stored video stream has been acquired. If yes, the augmented reality experience ends. Otherwise, step S410 is performed.

If the frame rate of the video playback is greater than the expected frame rate, the UE may perform an interpolation process, specifically, the timestamp of the video frame to be currently displayed by the UE, and the current video to be displayed. The background information corresponding to the timestamp of the frame is interpolated.

In the embodiment of the present invention, when the user starts to experience the augmented reality experience, the UE may further store user operation information, where the user operation information is used to describe an interaction between the user and the UE. The user operation information includes an operation type, an operation parameter, and a timestamp, and the time information included in the user operation information is used to indicate a moment when the interaction occurs; when the user experiences the augmented reality experience again, the UE may At the time corresponding to the time stamp included in the user operation information, the operation of the user is simulated according to the operation type and the operation parameter. For a detailed description of the user operation information, reference may be made to the embodiment shown in FIG. 3, and details are not described herein again.

A method for implementing augmented reality according to an embodiment of the present invention, when a user experiences an augmented reality experience, the UE stores the virtual content information and the captured video stream through the augmented reality context, after the augmented reality experience ends, when the user When the augmented reality experience needs to be experienced again, the UE acquires virtual reality information according to the stored virtual content information, and superimposes the acquired virtual reality information on each video frame in the video stream for display, so that the After experiencing the augmented reality experience, the user can also experience the same augmented reality experience again at any time. Secondly, when the video frame captured by the UE does not include the gesture image of the tracked object, the UE will Captured video The frame is used as a background image, and the background image is stored by storing location information of the captured video frame in the panorama, thereby saving storage resources of the UE; again, the UE may be included according to the background information. The information about the location of the UE and the location information corresponding to the virtual reality information included in the virtual content information are superimposed on the currently displayed video frame, so that the user can have a better augmented reality experience. . As shown in FIG. 5, it is a structural diagram of a user equipment according to an embodiment of the present invention, where the user equipment includes:

The receiving unit 501 is configured to receive virtual content information returned from the server side;

a video stream capturing unit 502, configured to capture a video stream;

The storage unit 503 is configured to store an augmented reality context when the user experiences an augmented reality experience, where the augmented reality context includes the virtual content information received by the receiving unit 501 and the video stream captured by the video stream capturing unit 502 ;

The virtual reality information acquiring unit 504 is configured to acquire virtual reality information according to the virtual content information stored by the storage unit 503 when the user needs to experience the augmented reality experience again; the video frame acquiring unit 505 is configured to follow the video. And acquiring, in sequence, the video frames in the video stream stored by the storage unit 503;

The superimposing unit 506 is configured to superimpose the virtual reality information acquired by the virtual reality information acquiring unit 504 on the video frame acquired by the video frame acquiring unit 505;

The display unit 507 is configured to display the superimposed video frame of the superimposing unit 506.

It should be noted that the video frame acquiring unit 505 can sequentially acquire video frames in the video stream according to the frame rate of the video playing.

The user equipment provided by the embodiment of the present invention, when the user experiences the augmented reality experience, the storage unit receives the virtual content information received by the receiving unit and the video stream captured by the video stream capturing unit, and ends the augmented reality experience. After the user needs to experience the augmented reality experience again, the superimposing unit superimposes the virtual reality information acquired by the virtual reality information acquiring unit. On the video frame acquired by the video frame acquiring unit, the display unit displays the video frame superimposed by the superimposing unit, so that the user can experience the same augmented reality experience again at any time after experiencing the augmented reality experience.

In an implementation manner of the embodiment of the present invention, when the object to be tracked needs to be enhanced, the tracked object exists in the real world where the user is located, and the video stream captured by the video stream capturing unit includes the tracked object. The video stream capturing unit 502 may be specifically configured to sequentially capture video frames;

The storage unit 503 may be specifically configured to store a correspondence between a timestamp of the video frame captured by the video stream capturing unit 502 and the tracked object information, and use the posture image of the tracked object from the captured video frame. Removing, updating the panoramic image according to the video frame after removing the posture image, and storing a correspondence between the time stamp and the background information;

And storing a standard image of the tracked object when the video stream capturing unit 502 captures a video frame, and storing the panorama when the video stream capturing unit 502 stops capturing a video frame; wherein, the time a stamp indicating a time at which the video frame is captured, the tracked object information including location information of the gesture image in the captured video frame, the background information including the captured video frame in the panorama Location information;

Wherein, when the user needs to experience the augmented reality experience again, the video frame acquires a single image;

And the timestamp of the video frame to be displayed is obtained in sequence according to the sequence in which the video frames are captured, and the tracked by the storage unit 503 corresponding to the acquired timestamp is obtained according to the acquired timestamp. Object information and background information, according to the obtained tracked object information packet The inclusion of the homography matrix, performing affine transformation on the obtained standard image, obtaining a posture image of the tracked object, and intercepting according to the displayed resolution according to the obtained position information and the deflection angle of the background information. Obtaining the panoramic image to obtain a background image, and superimposing the obtained posture image on the truncated background image according to the obtained position information included in the tracked object information, to generate a video frame to be currently displayed;

The virtual content information received by the receiving unit 501 may include the identifier of the tracked object corresponding to the virtual reality information, and the superimposing unit 506 may be specifically configured to include the When the identifier of the tracked object is located, the virtual reality information acquired by the virtual reality information acquiring unit 504 is superimposed on the video according to the position of the image of the tracked object in the current video frame to be displayed. The frame to be displayed by the frame acquiring unit 505 is currently displayed on the video frame;

It should be noted that, the user equipment may further include a sending unit, where the sending unit may be configured to send to the server side before the receiving unit 501 receives the virtual content information returned from the server side. Sending information identifying the tracked object, the information identifying the tracked object includes feature image of the tracked object or a feature image of the tracked object, so that the receiving unit 501 receives the The virtual content information is obtained by the server side according to the information of the tracking and the object, and the virtual content information may further include the virtual reality information or the virtual reality information. The virtual reality information acquiring unit 504 may be specifically configured to directly acquire the virtual reality information when the virtual content information received by the receiving unit 501 includes the virtual reality information; The virtual content information received by the receiving unit 501 includes the storage of the virtual reality information. When the location information, according to the storage location information, access the Virtual Reality information. In another implementation manner of the embodiment of the present invention, when it is required to enhance the current location in the real environment, the tracked object does not exist in the real world where the user is located, and the video stream capture unit captures The video stream does not include a pose image of the tracked object, and the video stream capture unit 502 It can be specifically used to sequentially capture video frames;

The storage unit 503 may be specifically configured to update a panoramic image according to the video frame captured by the video stream capturing unit 502, and store a correspondence between a timestamp of the captured video frame and background information;

And storing the panorama when the video stream capture 502 unit stops capturing a video frame; wherein the timestamp is used to indicate a moment of capturing a video frame, and the background information includes the captured video frame in the Position information in the panorama;

The background information may also include a deflection angle of the captured video frame relative to the panoramic view deflection;

When the user needs to experience the augmented reality experience again, the video frame acquisition unit is configured to sequentially acquire the timestamp of the current video frame to be displayed according to the sequence in which the video frames are captured, according to the obtained a timestamp, obtaining background information corresponding to the acquired timestamp, and according to the obtained position information and the deflection angle of the background information, intercepting the acquired panoramic image according to the displayed resolution, and generating a current video to be displayed frame;

The virtual content information received by the receiving unit 501 may include location information corresponding to the virtual reality information, and the background information may further include information about a location of the user equipment, where the superimposing unit 506 may Specifically, the virtual reality information acquired by the virtual reality information acquiring unit 504 is superimposed on the video frame according to the information about the location of the user equipment included in the background information and the location information included in the virtual content information. The current video frame to be displayed generated by the unit 505;

The user equipment may further include a sending unit, where the sending unit may be configured to send the user equipment to the server side before the receiving unit 501 receives the virtual content information returned from the server side. The location information, so that the receiving unit 501 receives the virtual content information, where the virtual content information is determined by the server side according to the location of the user equipment The set information is obtained by searching, and the virtual content information may further include the virtual reality information or storage location information of the virtual reality information;

The virtual reality information acquiring unit 504 may be specifically configured to directly acquire the virtual reality information when the virtual content information received by the receiving unit 501 includes the virtual reality information; or received by the receiving unit 501. When the virtual content information includes the storage location information of the virtual reality information, the virtual reality information is acquired according to the storage location information. The augmented reality context stored by the storage unit 503 may further include user operation information, where the user operation information includes an operation type and an operation parameter, and the presence of the tracked object in the current real world. Timestamp

Shellfish ¹ J, the user equipment may further comprise:

The user operation simulation unit is configured to simulate the operation of the user according to the operation type and the operation parameter at a time corresponding to the time stamp included in the user operation information. FIG. 6 is a structural diagram of another user equipment according to an embodiment of the present invention. As shown in FIG. 6, the user equipment includes at least one processor 601, a communication bus 602, a memory 603, and at least one communication interface. 604.

The communication bus 602 is configured to implement a connection and communication between the components, and the communication interface 604 is configured to connect and communicate with an external device.

The memory 603 is configured to store program code that needs to be executed. The program code may include: a receiving unit 6031, a video stream capturing unit 6032, a storage unit 6033, a virtual reality information acquiring unit 6034, a video frame acquiring unit 6035, and an overlay. The unit 6036 and the display unit 6037 are configured to execute the unit stored in the memory 603. When the unit is executed by the processor 601, the following functions are implemented:

The receiving unit 6031 is configured to receive virtual content information returned from the server side;

The video stream capturing unit 6032 is configured to capture a video stream. The storage unit 6033 is configured to store an augmented reality context when the user experiences an augmented reality experience, where the augmented reality context includes the virtual content information received by the receiving unit 6031 and the captured by the video stream capturing unit 6032 Video stream

The virtual reality information acquiring unit 6034 is configured to acquire, according to the virtual content information stored by the storage unit 6033, the virtual reality information, the video frame acquiring unit 6035, when the user needs to experience the augmented reality experience again. Obtaining video frames in the video stream stored by the storage unit 6033 in sequence according to a sequence in which video frames are captured;

The superimposing unit 6036 is configured to superimpose the virtual reality information acquired by the virtual reality information acquiring unit 6034 on the video frame acquired by the video frame acquiring unit 6035;

The display unit 6037 is configured to display the superimposed video frame of the superimposing unit 6036.

It should be noted that the video frame acquiring unit 6035 may sequentially acquire video frames in the video stream according to the frame rate of the video playing.

The user equipment provided by the embodiment of the present invention, when the user experiences the augmented reality experience, the storage unit receives the virtual content information received by the receiving unit and the video stream captured by the video stream capturing unit, and ends the augmented reality experience. After the user needs to experience the augmented reality experience again, the superimposing unit superimposes the virtual reality information acquired by the virtual reality information acquiring unit on the video frame acquired by the video frame acquiring unit, and the display unit displays the superimposed video of the superimposing unit. The frame enables the user to experience the same augmented reality experience again at any time after experiencing the augmented reality experience.

In an implementation manner of the embodiment of the present invention, when the object to be tracked needs to be enhanced, the tracked object exists in the real world where the user is located, and the video stream captured by the video stream capturing unit includes the video stream. Tracking the pose image of the object, the video stream capturing unit 6032 may be specifically configured to sequentially obtain a video frame;

The storage unit 6033 may be specifically configured to store the view captured by the video stream capturing unit 6032. Corresponding relationship between the time stamp of the frequency frame and the tracked object information, removing the posture image of the tracked object from the captured video frame, updating the panorama according to the video frame after removing the posture image, and storing the image Corresponding relationship between the timestamp and the background information;

And storing a standard image of the tracked object when the video stream capturing unit 6032 captures a video frame, and storing the panoramic image when the video stream capturing unit 6032 stops capturing a video frame; wherein, the time a stamp indicating a time at which the video frame is captured, the tracked object information including location information of the gesture image in the captured video frame, the background information including the captured video frame in the panorama Location information;

The video frame acquires a single scene view when the user needs to experience the augmented reality experience again;

And the timestamp of the video frame to be displayed is obtained in sequence according to the sequence in which the video frames are captured, and the tracked by the storage unit 6033 corresponding to the acquired timestamp is obtained according to the acquired timestamp. The object information and the background information are affine-transformed to the acquired standard image according to the obtained homography matrix included in the tracked object information, to obtain a posture image of the tracked object, according to the obtained background. The position information included in the information and the deflection angle are obtained by intercepting the acquired panoramic image according to the displayed resolution, and obtaining the background image according to the obtained position information included in the tracked object information, and superimposing the obtained posture image on the interception to obtain On the background image, generate the current video frame to be displayed;

The virtual content information received by the receiving unit 6031 may include the identifier of the tracked object corresponding to the virtual reality information, and the superimposing unit 6036 may be specifically configured to include the virtual content information in the When the identifier of the tracked object is described, according to the posture of the tracked object The position of the state image in the current video frame to be displayed, the virtual reality information acquiring unit

The virtual reality information acquired by 6034 is superimposed on the video frame to be displayed generated by the video frame obtaining unit 6035.

It should be noted that the memory 603 may further include a sending unit. When the processor 601 executes the sending unit, the following functions may be implemented:

The sending unit may be configured to send, after the receiving unit 6031 receives the virtual content information returned from the server side, information indicating the tracked object to the server side, where the identifier is tracked The information of the object includes the attitude image of the tracked object or the feature data of the gesture image of the tracked object, so that the receiving unit 6031 receives the virtual content information, wherein the virtual content information is from the server side Obtaining, according to the information processing of the tracking and the object, the virtual content information may further include the virtual reality information or storage location information of the virtual reality information;

The virtual reality information acquiring unit 6034 may be specifically configured to directly acquire the virtual reality information when the virtual content information received by the receiving unit 6031 includes the virtual reality information; or received by the receiving unit 6031. When the virtual content information includes the storage location information of the virtual reality information, the virtual reality information is acquired according to the storage location information. In another implementation manner of the embodiment of the present invention, when the current location in the real environment needs to be enhanced, the tracked object does not exist in the real world where the user is located, and the video stream capturing unit 6032 may Specifically used to sequentially capture video frames;

The storage unit 6033 may be specifically configured to update a panoramic image according to the video frame captured by the video stream capturing unit 6032, and store a correspondence between a timestamp of the captured video frame and background information;

And storing the panorama when the video stream capture 6032 unit stops capturing a video frame; wherein the timestamp is used to indicate a moment of capturing a video frame, and the background information includes the captured video frame at the location Location information in the panorama The background information may also include a deflection angle of the captured video frame relative to the panoramic view deflection;

The virtual content information received by the receiving unit 6031 may include location information corresponding to the virtual reality information, and the background information further includes information about a location of the user equipment, and the superimposing unit 6036 may be specific. And superimposing the virtual reality information acquired by the virtual reality information acquiring unit 6034 on the video frame acquiring unit according to the information about the location of the user equipment included in the background information and the location information included in the virtual content information. 6035 is generated on the current video frame to be displayed.

The memory 603 may further include a sending unit. When the processor 601 executes the sending unit, the following functions may be implemented:

The sending unit may be configured to send information about a location of the user equipment to the server side before the receiving unit 6031 receives the virtual content information returned from the server side, so that the receiving unit 6031 receives The virtual content information, wherein the virtual content information is obtained by the server side according to the information of the location of the user equipment, and the virtual content information may further include the virtual reality information or the virtual reality information. The virtual reality information acquiring unit 6034 may be specifically configured to directly acquire the virtual reality information when the virtual content information received by the receiving unit 6031 includes the virtual reality information; When the virtual content information received by the receiving unit 6031 includes the storage location information of the virtual reality information, the virtual reality information is acquired according to the storage location information. The augmented reality context stored by the storage unit 6033 may further include user operation information, where the user operation information includes an operation type, an operation parameter, and Timestamp

The memory 603 may further include a user operation simulation unit, and when the processor 601 executes the user operation simulation ticket, the following functions may be implemented:

The user operation simulation unit is configured to simulate a user operation according to the operation type and the operation parameter at a time when the time stamp included in the user operation information corresponds.

The method for implementing the augmented reality and the user equipment provided by the embodiment of the present invention, when the user experiences the augmented reality experience, the UE stores the virtual content information and the captured video stream through the augmented reality context, after the augmented reality experience ends, When the user needs to experience the augmented reality experience again, the UE acquires virtual reality information according to the stored virtual content information, and superimposes the acquired virtual reality information on each video frame in the video stream for display. After the user experiences the augmented reality experience, the user can again experience the same augmented reality experience at any time. Secondly, when the UE captures a video frame that includes the gesture image of the tracked object, the UE will The posture image of the tracking object is stored separately from the background image, and the position information of the posture image of the tracked object in the captured video frame and the homography matrix are stored, and the posture image of the tracked object is stored and passed Storing location information of the captured video frame in the panorama, storing the The scene view, thereby saving the storage resource of the UE; in addition, the UE may superimpose the acquired virtual reality information according to the position of the gesture image of the tracked object in the current video frame to be displayed. Going to the current video frame to be displayed, so that the user can have a better augmented reality experience; again, when the video frame captured by the UE does not include the gesture image of the tracked object, the UE will The captured video frame is used as a background image, and the background image is stored by storing location information of the captured video frame in the panoramic image, thereby saving storage resources of the UE, and the UE may be based on background information. The information about the location of the UE included in the location and the location information corresponding to the virtual reality information included in the virtual content information, and superimposing the acquired virtual reality information The video frame to be displayed before, so that the user can have a better augmented reality experience. It will be apparent to those skilled in the art that the present invention can be implemented in hardware, software implementation, or a combination thereof, as will be apparent to those skilled in the art. When implemented in software, the functions described above may be stored in or transmitted as one or more instructions or code on a computer readable medium. Computer readable media includes both computer storage media and communication media including any medium that facilitates transfer of a computer program from one location to another. A storage medium may be any available media that can be accessed by a computer. By way of example and not limitation, computer readable media may comprise RAM, ROM, EEPROM, CD-ROM or other optical disk storage, disk storage media or other magnetic storage device, or can be used for carrying or storing in the form of an instruction or data structure. The desired program code and any other medium that can be accessed by the computer. Also. Any connection may suitably be a computer readable medium. For example, if the software is transmitted from a website, server, or other remote source using coaxial cable, fiber optic cable, twisted pair, digital subscriber line (DSL), or wireless technologies such as infrared, radio, and microwave, then the coaxial cable , fiber optic cable, twisted pair, DSL, or wireless technologies such as infrared, wireless, and microwaves are included in the fixing of the associated media. As used in the present invention, a disk and a disc include a compact disc (CD), a laser disc, a compact disc, a digital versatile disc (DVD), a floppy disc, and a Blu-ray disc, wherein the disc is usually magnetically copied, and the disc is The laser is used to optically replicate the data. Combinations of the above should also be included within the scope of the computer readable media.

It is to be noted that the various embodiments in the present specification are described in a progressive manner, and the same similar parts between the various embodiments may be referred to each other, and each embodiment focuses on different embodiments from other embodiments. At the office. In particular, for the device embodiment, since it is basically similar to the method embodiment, it is described as a comparison, and the execution process of each unit specific function can be referred to the description of the method embodiment. The device embodiments described above are merely illustrative, wherein the units illustrated as separate components may or may not be physically separate, and the components displayed as units may or may not be physical units, ie may be located in one place. , or it can be distributed to multiple network elements. Some or all of the modules may be selected according to actual needs to achieve the purpose of the solution of the embodiment. Those of ordinary skill in the art can understand and implement without any creative effort.

In summary, the above description is only a preferred embodiment of the technical solution of the present invention, and is not intended to limit the scope of the present invention. Any modifications, equivalent substitutions, improvements, etc. made within the spirit and scope of the present invention are intended to be included within the scope of the present invention.

Claims

Claim

A method for realizing augmented reality, comprising:

2. The method of claim 1, wherein the storing, by the user equipment, the captured video stream comprises:

The user equipment sequentially captures a video frame, stores a correspondence between a timestamp of the captured video frame and the tracked object information, and removes the posture image of the tracked object from the captured video frame, according to removing the gesture. The video frame after the image updates the panorama, and stores the correspondence between the time stamp and the background information;

The user equipment stores a standard image of the tracked object when capturing a video frame, and stores the panorama when the user equipment stops capturing a video frame;

3. The method according to claim 2, wherein the tracked object information further comprises a homography matrix of the gesture image on the captured video frame, and the background information further comprises the capturing The angle of deflection of the video frame relative to the panorama deflection.

The method of claim 3, wherein the user equipment sequentially acquires the stored video frames in the video stream according to a sequence in which the video frames are captured, including: The user equipment acquires the stored standard image and the panorama;

The method according to claim 4, wherein the virtual content information includes an identifier of the 3 tracked object corresponding to the virtual reality information, and the virtual reality information to be acquired Superimposed on the acquired video frame, including:

When the virtual content information includes the identifier of the tracked object, the user equipment acquires the virtual reality information according to the position of the gesture image of the tracked object in the current video frame to be displayed. Superimposed on the currently displayed video frame.

The method of claim 2, wherein before the user equipment stores the enhanced reality context, the method further includes:

The user equipment sends information identifying the tracked object to the server side, where the information identifying the tracked object includes a pose image of the tracked object or feature data of a pose image of the tracked object ;

The user equipment receives the virtual content information sent by the server side, where the virtual content information is obtained by the server side according to the information of the tracking and the object, and the virtual content information includes The virtual reality information or the storage location information of the virtual reality information; and the user equipment acquiring the virtual reality information according to the stored virtual content information, including: If the virtual content information includes the virtual reality information, the user equipment directly acquires the virtual reality information; or

And if the virtual content information includes the storage location information of the virtual reality information, the user equipment acquires the virtual reality information according to the storage location information.

The method of claim 1, wherein the storing, by the user equipment, the captured video stream comprises:

The user equipment sequentially captures a video frame, updates a panoramic image according to the captured video frame, and stores a correspondence between a timestamp of the captured video frame and background information;

And when the user equipment stops capturing a video frame, the user equipment stores the panoramic image; wherein the background information includes location information of the captured video frame in the panoramic image.

8. The method of claim 7, wherein the background information further comprises a deflection angle of the captured video frame relative to the panoramic view deflection.

The method of claim 8, wherein the user equipment sequentially acquires the stored video frames in the video stream according to a sequence in which the video frames are captured, including:

The user equipment acquires the stored panorama view;

The method of claim 9, wherein the virtual content information includes location information corresponding to the virtual reality information, and the background information further includes information about a location of the user equipment, And superimposing the acquired virtual reality information on the acquired video frame, including: information about a location where the user equipment is located according to the background information, and location information included in the virtual content information, The acquired virtual reality information is superimposed on the currently displayed video frame.

The method of claim 7, wherein before the user equipment stores the enhanced reality context, the method further includes:

The user equipment sends the information about the location of the user equipment to the server side; the user equipment receives the virtual content information sent by the server side, where the virtual content information is used by the server side The information about the location of the user equipment is obtained, and the virtual content information includes the virtual reality information or the storage location information of the virtual reality information. The user equipment obtains the virtual content information according to the stored virtual content information. The virtual reality information includes:

The method according to any one of claims 1 to 11, wherein the augmented reality context further includes user operation information, and the user operation information includes an operation type, an operation parameter, and a time stamp; The method also includes:

The user equipment simulates the operation of the user according to the operation type and the operation parameter at a time corresponding to the time stamp included in the user operation information.

13. A user equipment, comprising:

a video stream capturing unit, configured to capture a video stream;

a storage unit, configured to store an augmented reality context when the user experiences an augmented reality experience, where the enhanced reality context includes the virtual content information received by the receiving unit and the video stream captured by the video stream capture unit;

a virtual reality information acquiring unit, configured to acquire virtual reality information according to the virtual content information stored by the storage unit when the user needs to experience the augmented reality experience again; a video frame acquiring unit, configured to sequentially acquire video frames in the video stream stored by the storage unit according to a sequence in which video frames are captured;

The user equipment according to claim 13, wherein the video stream capturing unit is specifically configured to sequentially capture video frames;

The user equipment according to claim 14, wherein the tracked object information further includes a homography matrix of the gesture image on the captured video frame, and the background information further includes the The angle of deflection of the captured video frame relative to the panorama deflection.

The user equipment according to claim 15, wherein the video frame obtaining unit is specifically configured to acquire the standard image and the panoramic image stored by the storage unit;

The timestamps of the video frames to be displayed are obtained in sequence according to the sequence in which the video frames are captured, and the tracked objects stored in the storage unit corresponding to the acquired timestamps are obtained according to the acquired timestamps. And the background information is obtained by performing affine transformation on the obtained standard image according to the obtained homography matrix included in the tracked object information, to obtain a posture image of the tracked object, according to the obtained background information. The position information contained and the angle of deflection, as shown The resolution captures the obtained panoramic image to obtain a background image, and according to the obtained position information included in the tracked object information, superimposes the obtained posture image on the cut background image to generate the current desired image to be displayed. Video frame.

The user equipment according to claim 16, wherein the virtual content information received by the receiving unit includes an identifier of the tracked object corresponding to the virtual reality information, and the superimposing unit is specific And when the virtual content information includes the identifier of the tracked object, according to a position of the posture image of the tracked object in the currently displayed video frame, the virtual reality information acquiring unit acquires The virtual reality information is superimposed on the current video frame to be displayed generated by the video frame acquiring unit.

The user equipment according to claim 14, wherein the user equipment further comprises a sending unit, wherein the sending unit is configured to: before the receiving unit receives the virtual content information returned from the server side Sending information identifying the tracked object to the server side, where the information identifying the tracked object includes a pose image of the tracked object or feature data of a pose image of the tracked object, so as to The receiving unit receives the virtual content information, where the virtual content information is obtained by the server side according to the information of the tracking and the object, and the virtual content information includes the virtual reality information or the Storage location information of virtual reality information;

The virtual reality information acquiring unit is specifically configured to directly acquire the virtual reality information when the virtual content information received by the receiving unit includes the virtual reality information; or the When the virtual content information includes the storage location information of the virtual reality information, the virtual reality information is acquired according to the storage location information.

The storage unit is specifically configured to update a panorama according to a video frame captured by the video stream capturing unit, and store a correspondence between a timestamp of the captured video frame and background information; And storing the panoramic image when the video stream capturing unit stops capturing video frames; wherein the background information includes location information of the captured video frame in the panoramic image.

20. The user equipment of claim 19, wherein the background information further comprises a deflection angle of the captured video frame relative to the panoramic view deflection.

The user equipment according to claim 20, wherein the video frame obtaining unit is specifically configured to acquire the panoramic image stored by the storage unit;

And obtaining, according to the sequence of the video frames, the timestamps of the currently displayed video frames, and obtaining the background information corresponding to the acquired timestamps according to the acquired timestamps, according to the obtained background. The position information included in the information and the deflection angle are intercepted according to the displayed resolution to generate the currently displayed video frame.

The user equipment according to claim 21, wherein the virtual content information received by the receiving unit includes location information corresponding to the virtual reality information, and the background information further includes where the user equipment is located. The information of the location, the superimposing unit is specifically configured to: according to the information about the location of the user equipment included in the background information and the location information included in the virtual content information, the obtained by the virtual reality information acquiring unit The virtual reality information is superimposed on the currently displayed video frame generated by the video frame obtaining unit.

The user equipment according to claim 19, wherein the user equipment further comprises a sending unit, wherein the sending unit is configured to: before the receiving unit receives the virtual content information returned from the server side Sending, to the server side, the information about the location of the user equipment, so that the receiving unit receives the virtual content information, where the virtual content information is searched by the server side according to the information of the location of the user equipment. Obtaining that the virtual content information includes the virtual reality information or storage location information of the virtual reality information;

The virtual reality information acquiring unit is specifically configured to directly acquire the virtual reality information when the virtual content information received by the receiving unit includes the virtual reality information; or the When the virtual content information includes storage location information of the virtual reality information, Obtaining the virtual reality information according to the storage location information.

The user equipment according to any one of claims 13 to 23, wherein the enhanced reality context further includes user operation information, and the user operation information includes an operation type, an operation parameter, and a time war;

The user equipment further includes:

The user operation simulation unit is configured to simulate the operation of the user according to the operation type and the operation parameter at a time corresponding to the time stamp included in the user operation information.