CN114115532B - AR labeling method and system based on display content - Google Patents

AR labeling method and system based on display content

Info

Publication number
CN114115532B
Authority
CN
China
Prior art keywords
display content
contents
content
trigger set
display
Prior art date
Legal status
Active
Application number
CN202111332392.8A
Other languages
Chinese (zh)
Other versions
CN114115532A (en)
Inventor
何培明 (He Peiming)
Current Assignee
Coral Stone Shanghai Video Technology Co ltd
Original Assignee
Coral Stone Shanghai Video Technology Co ltd
Priority date
Filing date
Publication date
Application filed by Coral Stone Shanghai Video Technology Co ltd filed Critical Coral Stone Shanghai Video Technology Co ltd
Priority to CN202111332392.8A
Publication of CN114115532A
Application granted
Publication of CN114115532B

Links

Classifications

    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F3/00 Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F3/01 Input arrangements or combined input and output arrangements for interaction between user and computer
    • G06F3/011 Arrangements for interaction with the human body, e.g. for user immersion in virtual reality
    • G06F3/013 Eye tracking input arrangements
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F3/00 Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F3/01 Input arrangements or combined input and output arrangements for interaction between user and computer
    • G06F3/017 Gesture based interaction, e.g. based on a set of recognized hand gestures

Abstract

The invention discloses an AR labeling method and system based on display content, belonging to the technical field of intelligent wearable equipment. The method comprises the following steps: S1, analyzing the display content to be labeled, extracting the content to be annotated, organizing it with AI, automatically dividing it into several trigger sets according to the completeness of its meaning, and preloading annotations for the set contents in the background over the network. By combining the user's gaze focus with an actively controlled virtual cursor, the slow triggering of the gaze focus ensures that unwanted annotations are not triggered at random, while the secondary confirmation by the virtual cursor lets the selected display content be annotated quickly and accurately. Accuracy is thus guaranteed while annotation proceeds at the highest speed, and the annotation content matches the user's intent and expectation, giving a good user experience.

Description

AR labeling method and system based on display content
Technical Field
The invention relates to an AR labeling method and system based on display content, and belongs to the technical field of intelligent wearable equipment.
Background
Augmented reality (AR) is a relatively new technology that integrates real-world information with virtual-world content. Using computers and related technologies, it simulates physical information that would otherwise be difficult to experience within the spatial range of the real world, and superimposes that virtual content onto the real world, where it can be perceived by the human senses, producing a sensory experience that goes beyond reality. Once the real environment and virtual objects are superimposed, they coexist in the same picture and space.
In practical AR applications, superimposing annotations on the content shown by a display device is the most common use. The principle is to analyze the display content in the current area, search the network for related content, and superimpose the retrieved content at the corresponding positions in the display device as annotations. The difficulty in this process is selecting which display content to annotate. The existing solution is interactive selection via eye tracking, which has several defects. First, because human attention cannot stay focused at all times, the line of sight drifts irregularly, which can trigger computer instructions at any moment and make unwanted annotation content pop up, distracting the user. Second, existing eye-movement interaction is inefficient: clicking by blinking one eye or both eyes easily tires the eyes; blinking is also a natural reflex, and when the eyes are stimulated the blink frequency becomes even harder to control, so misoperation occurs easily. As a result, users cannot annotate display content according to their own intent, which harms the user experience.
Disclosure of Invention
To address these technical problems, the invention provides an AR labeling method and system based on display content.
The technical problems to be solved by the invention are addressed by the following technical scheme:
an AR labeling method based on display content comprises the following steps:
S1, analyzing the display content to be labeled, extracting the content to be annotated, organizing it with AI, automatically dividing it into several trigger sets according to the completeness of its meaning, and preloading annotations for the set contents in the background over the network;
S2, capturing the user's gaze focus with the device's eye-tracking module; when the gaze focus stays on a trigger set longer than a set time, the display content mapped by that trigger set is highlighted to indicate that callable annotation content is available there, and after the highlight has lasted for a set time, the highlight effect is hidden;
S3, capturing finger gesture information with the device's image-capture module and forming an invisible virtual cursor at the tip of the finger; when the virtual cursor stays on a trigger set, the annotation of that trigger set enters a preloading state; after the cursor has stayed for a set time the annotation content is loaded and displayed, and after the cursor has left the trigger set for a set time the annotation content is hidden;
S4, when the gaze focus captured by the eye-tracking module and the virtual cursor are both concentrated on one trigger set, the annotation content is displayed immediately in the area near the display content mapped by that trigger set;
S5, when either the gaze focus or the virtual cursor leaves the trigger set, repeating step S2 or S3; when both leave the trigger set, restoring the display content to its original state.
As a preferred example, the annotated content includes graphics, animations, video, text and icons.
As a preferred example, in S2 the set time for the gaze focus to dwell on a trigger set is 3-5 s, and the set duration of the highlight is 1-3 s.
As a preferred example, in S2, after the highlight effect is hidden, the trigger set can be woken to highlight by the gaze focus again only after 10-15 s.
As a preferred example, in S2 the highlighted content can be converted quickly into displayed annotation content through a remote operation device.
As a preferred example, in S3 the set time for the virtual cursor to dwell on a trigger set is 5-8 s, and the set time after the virtual cursor leaves the trigger set is 1-3 s.
As a preferred example, in S3 the captured finger gesture information does not include gesture information of the thumb.
An AR labeling system based on display content comprises a display module, a system processing module, a signal module, an eye-tracking module and an image-capture module;
the display module comprises a transparent screen and is used for superimposing AR annotation content onto display content in the real world;
the system processing module comprises computer memory holding the logic and control instructions, and is used for receiving and processing signals from the sensors, providing display signals to the display module, and producing the AR annotation content;
the signal module comprises at least two of a 5G network communication module, a wireless module, a Bluetooth module or an infrared module, and is used for connecting to external signals;
the eye-tracking module is used for tracking the gaze focus of the user's eyes in real time and converting it into coordinate signals transmitted to the system processing module;
the image-capture module is used for extracting display content and capturing finger gesture information; the extracted display content is converted into a processable signal and transmitted to the system processing module, and the captured finger gesture information is converted into a coordinate signal and transmitted to the system processing module.
As a preferred example, the system also includes a remote operation device, such as a smart ring or smart bracelet/watch, connected to the signal module by a wireless signal.
The beneficial effects of the invention are as follows: by combining the user's gaze focus with an actively controlled virtual cursor, the slow triggering of the gaze focus ensures that unwanted annotations are not triggered at random, while the secondary confirmation by the virtual cursor lets the selected display content be annotated quickly and accurately; accuracy is thus guaranteed while annotation proceeds at the highest speed, and the annotation content matches the user's intent and expectation, giving a good user experience.
Drawings
FIG. 1 is a workflow diagram of the present invention;
FIG. 2 is a schematic structural diagram of the present invention.
Detailed Description
The invention will be further described below with reference to the drawings, so that the technical means, creative features, objectives and effects of the invention are easy to understand.
The invention is implemented on AR glasses or a smartphone/tablet, which must be provided with image-capture means for extracting the display content and capturing the user's eye movements.
As shown in FIG. 1, an AR labeling method based on display content includes the following steps:
S1, analyzing the display content to be labeled, extracting the content to be annotated and organizing it with AI; automatically dividing it according to completeness of meaning into units with complete semantics, such as a word, a sentence or a single graphic, and editing these units into several trigger sets; preloading annotations for the trigger-set contents in the background over the network and caching the annotation content in computer memory; converting the whole display content into two-dimensional plane data, converting the trigger sets into coordinate sets according to their original positions on the display content, and embedding the coordinate sets at the corresponding positions of the two-dimensional plane;
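For illustration, a trigger set produced by this step might be represented as below, assuming an axis-aligned bounding box on the two-dimensional plane; the class and field names are invented for the sketch and are not specified by the patent:

```python
from dataclasses import dataclass

@dataclass
class TriggerSet:
    """One semantically complete unit (a word, sentence, or single graphic)."""
    set_id: int
    bbox: tuple[float, float, float, float]  # (x_min, y_min, x_max, y_max) on the 2D plane
    annotation: str = ""                     # annotation preloaded over the network
    preloaded: bool = False                  # True once the annotation is cached in memory

    def contains(self, x: float, y: float) -> bool:
        """True if a 2D point (gaze focus or virtual cursor) falls inside this set."""
        x_min, y_min, x_max, y_max = self.bbox
        return x_min <= x <= x_max and y_min <= y <= y_max
```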
S2, capturing the user's gaze focus with the device's eye-tracking module and converting the gaze-focus position signal into a coordinate signal on the two-dimensional plane, so as to judge whether the gaze focus coincides with a trigger set; when the gaze focus stays on a trigger set longer than the set time, the display content mapped by that trigger set is highlighted, indicating that the display content there can be annotated; after the highlight has lasted for the set time, the highlight effect is hidden; when the dwell time of the gaze focus on a trigger set is below the set time, nothing happens, which prevents misoperation caused by a darting line of sight;
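The dwell-then-highlight behaviour of S2 can be sketched as a per-frame timer update. The sketch below assumes a single gazed trigger set per frame and mid-range timing defaults; all names and thresholds are illustrative assumptions:

```python
import time

class GazeDwellTracker:
    """Tracks gaze dwell on one trigger set at a time (a simplifying assumption)."""
    def __init__(self, dwell_s=4.0, highlight_s=2.0, cooldown_s=12.0):
        self.dwell_s = dwell_s          # gaze must stay this long before highlighting
        self.highlight_s = highlight_s  # highlight duration before it is hidden
        self.cooldown_s = cooldown_s    # sleep time before the same set can highlight again
        self.enter_time = None
        self.last_highlight_end = {}    # set_id -> time its last highlight ended

    def update(self, gaze_xy, trigger_set, now=None):
        """Return True while the trigger set's display content should be highlighted."""
        now = time.monotonic() if now is None else now
        if not trigger_set.contains(*gaze_xy):
            self.enter_time = None      # gaze drifted away: reset the dwell timer
            return False
        if now - self.last_highlight_end.get(trigger_set.set_id, float("-inf")) < self.cooldown_s:
            return False                # set is still in its post-highlight sleep
        if self.enter_time is None:
            self.enter_time = now
        dwelled = now - self.enter_time
        if self.dwell_s <= dwelled < self.dwell_s + self.highlight_s:
            return True                 # inside the highlight window
        if dwelled >= self.dwell_s + self.highlight_s:
            self.last_highlight_end[trigger_set.set_id] = now
            self.enter_time = None      # highlight expired; start the cooldown
        return False
```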
S3, capturing finger gesture information with the device's image-capture module: the outer contour of the finger is captured and tracked by image recognition, several measuring points are determined on the contour via a template-matching algorithm and an artificial neural network, the coordinates of these measuring points on the two-dimensional plane are determined, and the measuring point at the fingertip is taken as the invisible virtual cursor; when the virtual cursor stays on a trigger set, the annotation of that trigger set enters a preloading state; after the cursor has stayed for the set time the annotation content is loaded and displayed, and after the cursor has left the trigger set for the set time the annotation content is hidden;
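A much-simplified sketch of the virtual-cursor extraction: skin-color segmentation with OpenCV stands in for the template-matching and neural-network measuring points described above, and the fingertip is taken as the topmost contour point. The HSV range and area threshold are guesses, not values from the patent:

```python
import cv2
import numpy as np

def fingertip_cursor(frame_bgr):
    """Return the (x, y) of the fingertip as the invisible virtual cursor, or None."""
    hsv = cv2.cvtColor(frame_bgr, cv2.COLOR_BGR2HSV)
    lower = np.array([0, 30, 60], dtype=np.uint8)    # rough skin-tone range (assumed)
    upper = np.array([20, 150, 255], dtype=np.uint8)
    mask = cv2.inRange(hsv, lower, upper)
    contours, _ = cv2.findContours(mask, cv2.RETR_EXTERNAL, cv2.CHAIN_APPROX_SIMPLE)
    if not contours:
        return None
    finger = max(contours, key=cv2.contourArea)      # largest blob = the pointing finger
    if cv2.contourArea(finger) < 500:                # ignore small noise blobs
        return None
    x, y = finger[finger[:, :, 1].argmin()][0]       # topmost contour point = fingertip
    return int(x), int(y)
```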
S4, when the gaze focus captured by the eye-tracking module and the virtual cursor are both concentrated on one trigger set, that is, ignoring depth, when the two-dimensional coordinates of the gaze focus and of the virtual cursor both lie within the coordinate set of that trigger set, the annotation content is displayed immediately in the area near the display content mapped by the trigger set;
S5, when either the gaze focus or the virtual cursor leaves the trigger set, repeating step S2 or S3; when both leave the trigger set, restoring the display content to its original state.
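Steps S4 and S5 then reduce to a per-frame double-confirmation check. The sketch below reuses the illustrative TriggerSet.contains test from the S1 sketch:

```python
def annotation_state(gaze_xy, cursor_xy, trigger_sets):
    """Return the id of the trigger set whose annotation should be shown, or None.

    S4: the annotation appears only when the gaze focus AND the virtual cursor
    fall inside the same trigger set. S5: when either leaves, None is returned
    and the caller restores the original display content.
    """
    for ts in trigger_sets:
        if ts.contains(*gaze_xy) and cursor_xy is not None and ts.contains(*cursor_xy):
            return ts.set_id
    return None
```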
The annotated content includes graphics, animations, video, text and icons.
In S2, the set time for the gaze focus to dwell on a trigger set is 3-5 s and the set duration of the highlight is 1-3 s; after the highlight effect is hidden, the trigger set can be woken to highlight by the gaze focus again only after 10-15 s.
This scheme takes the user's reading or viewing speed into account: with the gaze focus required to dwell for about 4 s, brief drifts of the line of sight have no effect, yet the corresponding annotation is not delayed too long. The sleep time after the highlight is hidden prevents repeated highlighting from distracting a user who is concentrating on a passage of display content.
In S2, the highlighted content can be converted quickly into displayed annotation content through the remote operation device. The principle is that while the display content is highlighted, the user performs an active annotation operation on the remote operation device by a press or a gesture, completing the annotation conveniently and quickly.
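A sketch of this quick-conversion path as an event handler, reusing the illustrative types from the earlier sketches; the handler and display names are assumptions:

```python
def on_remote_trigger(highlighted_set, display):
    """While a trigger set is highlighted, a press/click or gesture on the remote
    operation device shows its annotation at once, skipping the cursor dwell time."""
    if highlighted_set is not None and highlighted_set.preloaded:
        display.render(annotation_for=highlighted_set.set_id)
```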
In S3, the set time for the virtual cursor to dwell on a trigger set is 5-8 s and the set time after it leaves is 1-3 s. The cursor's trigger time is set slightly longer than the gaze-focus trigger time so that a finger gesture accidentally caught by the image-capture module does not trigger an annotation immediately and block the view, while the target display content can still be annotated after a short wait.
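The timing windows named in S2 and S3 can be gathered into one configuration object. The defaults below merely sit inside the ranges stated in the description; the field names are illustrative:

```python
from dataclasses import dataclass

@dataclass(frozen=True)
class DwellConfig:
    gaze_dwell_s: float = 4.0           # gaze on a trigger set: 3-5 s before highlight
    highlight_s: float = 2.0            # highlight shown for 1-3 s
    highlight_cooldown_s: float = 12.0  # 10-15 s before the same set can highlight again
    cursor_dwell_s: float = 6.0         # virtual cursor dwell: 5-8 s before annotation loads
    cursor_leave_s: float = 2.0         # annotation hidden 1-3 s after the cursor leaves
```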
In S3, the captured finger gesture information does not include gesture information of the thumb. In normal use, a user reading with finger assistance does not need the thumb: generally only the index and middle fingers are used while the ring and little fingers are curled. Masking the thumb's gesture capture prevents the thumb from accidentally triggering annotations of unrelated display content.
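One possible realization of the thumb exclusion is to drop thumb keypoints before cursor detection. The sketch assumes hand keypoints indexed in the widely used 21-point hand-landmark convention (wrist = 0, thumb = 1-4), which the patent does not itself specify:

```python
THUMB_LANDMARKS = {1, 2, 3, 4}  # assumed 21-point hand model: indices 1-4 are the thumb

def mask_thumb(landmarks):
    """Drop thumb keypoints so the thumb can never form a virtual cursor and
    accidentally trigger an unrelated annotation.
    `landmarks` is a list of (index, x, y) tuples from the hand tracker."""
    return [(i, x, y) for (i, x, y) in landmarks if i not in THUMB_LANDMARKS]
```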
As shown in FIG. 2, an AR labeling system based on display content includes a display module, a system processing module, a signal module, an eye-tracking module and an image-capture module;
the display module comprises a transparent screen for superimposing AR annotation content onto display content in the real world;
the system processing module comprises computer memory holding the logic and control instructions, and is used for receiving and processing signals from the sensors, providing display signals to the display module and producing the AR annotation content;
the signal module comprises at least two of a 5G network communication module, a wireless module, a Bluetooth module or an infrared module, and is used for connecting to external signals, including querying annotation data over the network, exchanging data with information terminals, and receiving information instructions from the remote operation device;
the eye-tracking module is used for tracking the gaze focus of the user's eyes in real time and converting it into coordinate signals transmitted to the system processing module; its main equipment comprises an infrared device and an image-acquisition device;
the image-capture module is used for extracting display content and capturing finger gesture information; the extracted display content is converted into a processable signal and transmitted to the system processing module, and the captured finger gesture information is converted into a coordinate signal and transmitted to the system processing module.
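For illustration, the modules might be wired together as below, with each sensor module pushing converted coordinate signals to the system processing module, which runs the per-frame check from the method sketches and drives the display module; all class and method names are invented:

```python
class SystemProcessingModule:
    """Receives sensor signals, runs the S1-S5 logic, and feeds the display module."""
    def __init__(self, display, signal):
        self.display = display   # transparent-screen display module
        self.signal = signal     # 5G / wireless / Bluetooth / infrared signal module
        self.gaze_xy = None
        self.cursor_xy = None

    def on_gaze(self, xy):       # coordinate signal from the eye-tracking module
        self.gaze_xy = xy

    def on_cursor(self, xy):     # fingertip coordinate from the image-capture module
        self.cursor_xy = xy

    def on_frame(self, trigger_sets):
        """Run the double-confirmation check (annotation_state, sketched earlier)."""
        shown = None
        if self.gaze_xy is not None and self.cursor_xy is not None:
            shown = annotation_state(self.gaze_xy, self.cursor_xy, trigger_sets)
        self.display.render(annotation_for=shown)
```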
The system also includes a remote operation device connected to the signal module by a wireless signal. The remote operation device is a wearable device such as a smart ring or a smart bracelet/watch; remote instruction operations are performed by pressing/clicking on the device or by gesture movement, and are used to annotate highlighted content directly.
The foregoing has shown and described the basic principles and main features of the present invention and the advantages of the present invention. It will be appreciated by persons skilled in the art that the present invention is not limited to the embodiments described above, but is capable of numerous variations and modifications without departing from the spirit and scope of the invention as hereinafter claimed. The scope of the invention is defined by the appended claims and equivalents thereof.

Claims (7)

1. An AR labeling method based on display content, characterized by comprising the following steps:
S1, analyzing the display content to be labeled, extracting the content to be annotated, organizing it with AI, automatically dividing it into several trigger sets according to the completeness of its meaning, and preloading annotations for the set contents in the background over the network;
S2, capturing the user's gaze focus with the device's eye-tracking module; when the gaze focus stays on a trigger set longer than a set time, the display content mapped by that trigger set is highlighted to indicate that callable annotation content is available there, and after the highlight has lasted for a set time, the highlight effect is hidden;
S3, capturing finger gesture information with the device's image-capture module and forming an invisible virtual cursor at the tip of the finger; when the virtual cursor stays on a trigger set, the annotation of that trigger set enters a preloading state; after the cursor has stayed for a set time the annotation content is loaded and displayed, and after the cursor has left the trigger set for a set time the annotation content is hidden;
S4, when the gaze focus captured by the eye-tracking module and the virtual cursor are both concentrated on one trigger set, the annotation content is displayed immediately in the area near the display content mapped by that trigger set;
S5, when either the gaze focus or the virtual cursor leaves the trigger set, repeating step S2 or S3; when both leave the trigger set, restoring the display content to its original state.
2. The AR labeling method based on display content according to claim 1, wherein the annotated content includes graphics, animations, video, text and icons.
3. The AR labeling method based on display content according to claim 1, wherein in S2 the set time for the gaze focus to dwell on a trigger set is 3-5 s and the set duration of the highlight is 1-3 s.
4. The AR labeling method based on display content according to claim 1, wherein in S2, after the highlight effect is hidden, the trigger set can be woken to highlight by the gaze focus again only after 10-15 s.
5. The AR labeling method based on display content according to claim 1, wherein in S2 the highlighted content can be converted quickly into displayed annotation content through a remote operation device.
6. The AR labeling method based on display content according to claim 1, wherein in S3 the set time for the virtual cursor to dwell on a trigger set is 5-8 s and the set time after the virtual cursor leaves the trigger set is 1-3 s.
7. The AR labeling method based on display content according to claim 1, wherein in S3 the captured finger gesture information does not include gesture information of the thumb.
CN202111332392.8A 2021-11-11 2021-11-11 AR labeling method and system based on display content Active CN114115532B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202111332392.8A CN114115532B (en) 2021-11-11 2021-11-11 AR labeling method and system based on display content

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202111332392.8A CN114115532B (en) 2021-11-11 2021-11-11 AR labeling method and system based on display content

Publications (2)

Publication Number Publication Date
CN114115532A (en) 2022-03-01
CN114115532B (en) 2023-09-29

Family

ID=80378242

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202111332392.8A Active CN114115532B (en) 2021-11-11 2021-11-11 AR labeling method and system based on display content

Country Status (1)

Country Link
CN (1) CN114115532B (en)

Citations (12)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2016064073A1 (en) * 2014-10-22 2016-04-28 윤영기 Smart glasses on which display and camera are mounted, and a space touch inputting and correction method using same
CN106814854A (en) * 2016-12-29 2017-06-09 杭州联络互动信息科技股份有限公司 A kind of method and device for preventing maloperation
US10061352B1 (en) * 2017-08-14 2018-08-28 Oculus Vr, Llc Distributed augmented reality system
CN108829239A (en) * 2018-05-07 2018-11-16 北京七鑫易维信息技术有限公司 Control method, device and the terminal of terminal
CN109298780A (en) * 2018-08-24 2019-02-01 百度在线网络技术(北京)有限公司 Information processing method, device, AR equipment and storage medium based on AR
CN110187855A (en) * 2019-05-28 2019-08-30 武汉市天蝎科技有限公司 The intelligent adjusting method for avoiding hologram block vision of near-eye display device
KR20190128962A (en) * 2018-05-09 2019-11-19 서강대학교산학협력단 METHOD AND WEARABLE DISPLAY APPARATUS FOR PROVIDING eBOOK BASED ON AUGMENTED REALLITY
CN111931579A (en) * 2020-07-09 2020-11-13 上海交通大学 Automatic driving assistance system and method using eye tracking and gesture recognition technology
CN111949131A (en) * 2020-08-17 2020-11-17 陈涛 Eye movement interaction method, system and equipment based on eye movement tracking technology
WO2021073743A1 (en) * 2019-10-17 2021-04-22 Huawei Technologies Co., Ltd. Determining user input based on hand gestures and eye tracking
CN112817447A (en) * 2021-01-25 2021-05-18 暗物智能科技(广州)有限公司 AR content display method and system
KR20210073429A (en) * 2019-12-10 2021-06-18 한국전자기술연구원 Integration Interface Method and System based on Eye tracking and Gesture recognition for Wearable Augmented Reality Device

Family Cites Families (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US9323325B2 (en) * 2011-08-30 2016-04-26 Microsoft Technology Licensing, Llc Enhancing an object of interest in a see-through, mixed reality display device
US9201500B2 (en) * 2012-09-28 2015-12-01 Intel Corporation Multi-modal touch screen emulator
KR101519290B1 (en) * 2014-04-09 2015-05-11 현대자동차주식회사 Method for Controlling HUD for Vehicle

Also Published As

Publication number Publication date
CN114115532A (en) 2022-03-01

Similar Documents

Publication Publication Date Title
Yang et al. Gesture interaction in virtual reality
US9671872B2 (en) Gesture recognition method, gesture recognition system, terminal device and wearable device
CN107479691B (en) Interaction method, intelligent glasses and storage device thereof
US9857971B2 (en) System and method for receiving user input and program storage medium thereof
CN109375765B (en) Eyeball tracking interaction method and device
CN107562186B (en) 3D campus navigation method for emotion operation based on attention identification
CN108616712B (en) Camera-based interface operation method, device, equipment and storage medium
CN102915111A (en) Wrist gesture control system and method
US20240077948A1 (en) Gesture-based display interface control method and apparatus, device and storage medium
CN108027655A (en) Information processing system, information processing equipment, control method and program
CN107450717B (en) Information processing method and wearable device
Arai et al. Eye-based human computer interaction allowing phoning, reading e-book/e-comic/e-learning, internet browsing, and tv information extraction
CN114821753B (en) Eye movement interaction system based on visual image information
CN108829239A (en) Control method, device and the terminal of terminal
CN106681509A (en) Interface operating method and system
KR101488662B1 (en) Device and method for providing interface interacting with a user using natural user interface device
CN111901518B (en) Display method and device and electronic equipment
CN114115532B (en) AR labeling method and system based on display content
US11328187B2 (en) Information processing apparatus and information processing method
Raees et al. THE-3DI: Tracing head and eyes for 3D interactions: An interaction technique for virtual environments
Annachhatre et al. Virtual Mouse Using Hand Gesture Recognition-A Systematic Literature Review
CN115047966A (en) Interaction method, electronic equipment and interaction system
Strumiłło et al. A vision-based head movement tracking system for human-computer interfacing
CN204270276U (en) A kind of human-computer interaction device based on radio-frequency (RF) identification
CN111290579B (en) Control method and device of virtual content, electronic equipment and computer readable medium

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
GR01 Patent grant