CN112132076A

CN112132076A - Forgotten object addressing method based on computer vision

Info

Publication number: CN112132076A
Application number: CN202011044578.9A
Authority: CN
Inventors: 韩强; 李庆新; 王志保; 张钦海; 裴欣欣
Original assignee: Tianjin Tiandi Weiye Intelligent Security Technology Co ltd
Current assignee: Tianjin Tiandi Weiye Intelligent Security Technology Co ltd
Priority date: 2020-09-28
Filing date: 2020-09-28
Publication date: 2020-12-25

Abstract

The invention provides a forgetting object addressing method based on computer vision, which comprises the following steps: s1, collecting data, and collecting the image information of the articles easy to forget; s2, preprocessing the collected images, identifying the characteristics of the corresponding articles in the images, classifying the articles, and labeling each object in the image data according to the determined category; s3, creating a deep learning network model and carrying out model training; and S4, carrying out algorithm processing, carrying out scaling processing on the received video stream image to meet the requirements of the model, sending the processed image into the model for prediction, storing the position information of various articles predicted by the model, and recording the articles of other types closest to the article. Compared with the method for searching through the video, the method for addressing the forgotten object based on the computer vision avoids the long-time video turning and watching, does not need to manually search and position the position of the object in the video, and saves the time cost and the labor cost.

Description

Forgotten object addressing method based on computer vision

Technical Field

The invention belongs to the technical field of video monitoring, and particularly relates to a forgetting object addressing method based on computer vision.

Background

In daily life, people often have the phenomenon that articles can not be found urgently, and the daily mood and the travel efficiency of people are often influenced without the disorderly finding of purposes and ideas. The prior technical scheme needs to add foreign matters on the surface of an object to realize the purposes of positioning and searching, not only influences the attractiveness of the object, but also has limited practicability on small objects or objects which are not suitable for adding foreign matters, and the use cost is gradually increased along with the increase of the objects. The position of the article is located based on big data, the position where the article is finally found needs to be collected, the user needs to perform operation processing, and the convenience of the method needs to be improved.

Disclosure of Invention

In view of the above, in order to overcome the above-mentioned drawbacks, the present invention is directed to a method for addressing a forgotten object based on computer vision,

in order to achieve the purpose, the technical scheme of the invention is realized as follows:

a method for amnesia addressing based on computer vision, comprising:

s1, collecting data, and collecting the image information of the articles easy to forget;

s2, preprocessing the collected images, identifying the characteristics of the corresponding articles in the images, classifying the articles, and labeling each object in the image data according to the determined category;

s3, creating a deep learning network model and carrying out model training;

and S4, carrying out algorithm processing, carrying out scaling processing on the received video stream image to meet the requirements of the model, sending the processed image into the model for prediction, storing the position information of various articles predicted by the model, and recording the articles of other types closest to the article.

Further, in the step S2, the labeled information includes the category to which the item belongs and the position where the item appears.

Further, in step S3, the training process is as follows:

training prepared image data by using a yolov3 network subjected to pruning compression, testing and comparing through a plurality of trained models, and selecting a weight model with the optimal reliability through an MAP value.

Further, after step S2 is executed, in order to increase the number of samples and the robustness of the model, the image data needs to be enhanced, which is specifically as follows:

and the image data is subjected to turning, scaling, clipping and brightness adjustment processing, so that objects can be conveniently recognized in different environments.

Compared with the prior art, the forgetting object addressing method based on computer vision has the following advantages:

the method does not depend on a database or foreign matter sticking, carries out the step of collecting the characteristics of the article in advance by using a deep learning technology through non-inductive video collection and image processing, inputs the category or name of the article when the position of the article is forgotten when a client uses the method, gives the position where the article possibly appears by one key, avoids long-time watching of the video compared with searching through the video, does not need to manually search and position the position where the article appears in the video, saves the time cost and the labor cost, provides convenience for life, and promotes the progress of intelligent life.

Drawings

The accompanying drawings, which are incorporated in and constitute a part of this specification, illustrate an embodiment of the invention and, together with the description, serve to explain the invention and not to limit the invention. In the drawings:

fig. 1 is a flowchart of a method for addressing a forgotten object based on computer vision according to an embodiment of the present invention.

Detailed Description

It should be noted that the embodiments and features of the embodiments may be combined with each other without conflict.

In the description of the present invention, it is to be understood that the terms "center", "longitudinal", "lateral", "up", "down", "front", "back", "left", "right", "vertical", "horizontal", "top", "bottom", "inner", "outer", and the like, indicate orientations or positional relationships based on those shown in the drawings, and are used only for convenience in describing the present invention and for simplicity in description, and do not indicate or imply that the referenced devices or elements must have a particular orientation, be constructed and operated in a particular orientation, and thus, are not to be construed as limiting the present invention. Furthermore, the terms "first", "second", etc. are used for descriptive purposes only and are not to be construed as indicating or implying relative importance or implicitly indicating the number of technical features indicated. Thus, a feature defined as "first," "second," etc. may explicitly or implicitly include one or more of that feature. In the description of the present invention, "a plurality" means two or more unless otherwise specified.

In the description of the present invention, it should be noted that, unless otherwise explicitly specified or limited, the terms "mounted," "connected," and "connected" are to be construed broadly, e.g., as meaning either a fixed connection, a removable connection, or an integral connection; can be mechanically or electrically connected; they may be connected directly or indirectly through intervening media, or they may be interconnected between two elements. The specific meaning of the above terms in the present invention can be understood by those of ordinary skill in the art through specific situations.

The present invention will be described in detail below with reference to the embodiments with reference to the attached drawings.

In order to know the condition in the house in real time, the application of domestic intelligent camera is more and more popularized, and the video data through gathering domestic camera in real time carries out analysis processes, uses computer vision and video image processing count, provides the position that the article of forgetting appeared most often and appeared last time, provides a convenient and fast's solution for the searching of the article of forgetting. The method does not depend on a database or foreign matter sticking, the steps of collecting the characteristics of the object are carried out in advance by using a non-inductive video collection and image processing technology, the category or name of the object is input when the position of the object is forgotten when a client uses the method, the position where the object possibly appears is given by one key, and compared with the method of finding the position through a video, the method avoids long-time video turning over and watching, does not need to find and position the position where the object appears in the video manually, saves time cost and labor cost, provides convenience for life, and promotes the progress of intelligent life.

The data required by the method is video image data, and can be realized through a household intelligent camera and a corresponding mobile phone APP or a computer terminal at present when the household camera is more and more popular.

The specific implementation method is as follows (as shown in figure 1):

first, data is prepared. The method is characterized in that common household environment information and articles easy to forget in life, including sofas, television cabinets, mobile phones, remote controllers, keys, wallets and the like, are collected in advance, and a large amount of video image data are collected.

And secondly, preprocessing the image, classifying the articles, and labeling each object in the image data according to the determined category, wherein the labeling information mainly comprises the article category and the position where the article appears.

And thirdly, enhancing the data, namely, in order to increase the number of samples and the robustness of the model, the image data is subjected to processes such as overturning, scaling, clipping, brightness adjustment and the like, so that the object can be conveniently recognized in different environments.

And fourthly, preparing a deep learning network for model training. Training prepared image data by using a yolov3 network subjected to pruning compression, testing and comparing through a plurality of trained models, and selecting a weight model with the optimal reliability through an MAP value.

And fifthly, processing of the algorithm. And carrying out scaling processing on the received video stream image to meet the requirement of the model, sending the processed image into the model for prediction, storing the position information of various articles predicted by the model, and recording the articles (accompanying articles) of other types closest to the article.

The first five parts are finished in the previous processing without any operation of the user.

And the sixth is the use by the user. When a user forgets the position of a required article, a mobile phone APP or a computer terminal is opened, the function of searching the forgotten article is found, the name or the category of the forgotten article is input, and the most frequently appearing position of the article and the accompanying articles can be popped out by clicking for searching, and the position information of the most recently appearing position of the article is provided. The user can quickly and efficiently find the position of the article according to the prompt messages, and time and labor are saved.

The above description is only for the purpose of illustrating the preferred embodiments of the present invention and is not to be construed as limiting the invention, and any modifications, equivalents, improvements and the like that fall within the spirit and principle of the present invention are intended to be included therein.

Claims

1. A method for addressing a amnesia based on computer vision, comprising:

s3, creating a deep learning network model and carrying out model training;

2. A computer vision based amnestic addressing method according to claim 1, characterized by: in step S2, the labeled information includes the category to which the item belongs and the position where the item appears.

3. The computer vision based amnestic addressing method according to claim 1, wherein in step S3, the training process is as follows:

4. The method for addressing a forgotten object based on computer vision according to claim 1, wherein after step S2 is executed, in order to increase the number of samples and the robustness of the model, the image data needs to be enhanced, specifically as follows: