CN112732956A - Efficient query method based on perception multi-mode big data - Google Patents
Efficient query method based on perception multi-mode big data
- Publication number
- CN112732956A CN112732956A CN202011547371.3A CN202011547371A CN112732956A CN 112732956 A CN112732956 A CN 112732956A CN 202011547371 A CN202011547371 A CN 202011547371A CN 112732956 A CN112732956 A CN 112732956A
- Authority
- CN
- China
- Prior art keywords
- data
- image
- point cloud
- query
- modal
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Classifications
- G06F16/583—Information retrieval of still image data; retrieval characterised by using metadata automatically derived from the content
- G06F16/53—Information retrieval of still image data; querying
- G06F18/214—Pattern recognition; generating training patterns; bootstrap methods, e.g. bagging or boosting
- G06T5/70—Image enhancement or restoration; denoising, smoothing
- G06T7/50—Image analysis; depth or shape recovery
Abstract
The invention discloses an efficient query method based on perception of multi-modal big data, comprising the following steps. Step one: collect an image-modality data set, filter the images, and generate point cloud data. Step two: obtain feature values for all points in the point cloud data according to the image depth values from step one. Step three: asynchronously collect a text-modality data set and extract features from the text. Step four: establish an image-feature/text-feature training table using the CAA algorithm. Step five: randomly classify the data in the training table into a training set database and a test set database, and perform transfer learning on the data in both. Step six: establish a query model from the transfer learning and upload it to the cloud; subsequent query images or text are input into the query model, which judges their features and queries the Internet according to the judged features. The method can extrapolate subsequent data from the detected data.
Description
Technical Field
The invention relates to the field of data query, in particular to an efficient query method based on perception multi-mode big data.
Background
A "modality" (English: modal) is, colloquially, a "sense"; multi-modality means the fusion of multiple senses. The Turing OS robot operating system defines human-robot interaction as multi-modal interaction: man-machine interaction carried out through text, voice, vision, action, environment, and other modes, fully simulating the way humans interact with each other. This interaction mode matches the morphological characteristics and user expectations of robot products, and breaks away from traditional PC keyboard input and the touch-based interaction of smartphones.
In the field of water-source management, automatic control is generally adopted, including video monitoring, water-level (depth drop) detection, water-quality detection, and the like. Whether from video monitoring or depth detection, the result is an image set or a text set.
Existing processing of these image sets or text sets relies on human observation or fixed maximum-value alarms; it cannot judge, from the existing data, whether detection data from other comparable areas is reasonable.
Disclosure of Invention
To overcome these defects in the prior art, the efficient query method based on perception of multi-modal big data provided by the invention can extrapolate subsequent data from detection data.
To achieve the above object, the efficient query method based on perception of multi-modal big data of the present invention includes the following steps. Step one: collect an image-modality data set, filter the images, and generate point cloud data. Step two: obtain feature values for all points in the point cloud data according to the image depth values from step one. Step three: asynchronously collect a text-modality data set, and extract features from the text using a CountVectorizer. Step four: establish an image-feature/text-feature training table using the CAA algorithm. Step five: randomly classify the data in the training table to generate a training set database and a test set database, and perform transfer learning on the data in both. Step six: establish a query model from the transfer learning and upload it to the cloud; subsequent query images or text are input into the query model, which judges their features and queries the Internet according to the judged features.
Further, in step one, a variance method is adopted for filtering, with the formula:

g = ω0(u0 − u)²,

wherein ω0 is the proportion of background points in the image, u0 is the average gray level of the background points, u is the overall average gray level of the image, and g is the variance of the image.
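The patent gives no implementation of the variance method; as an illustrative sketch (names are mine, not from the patent), the background-variance term with the symbols defined above can be computed as:

```python
def background_variance(gray, threshold):
    """Variance term g = w0 * (u0 - u)**2 for a candidate threshold:
    w0 is the proportion of background points (gray level below threshold),
    u0 their mean gray level, and u the global mean gray level."""
    background = [v for v in gray if v < threshold]
    if not background:
        return 0.0
    w0 = len(background) / len(gray)
    u0 = sum(background) / len(background)
    u = sum(gray) / len(gray)
    return w0 * (u0 - u) ** 2
```

Sweeping the threshold and keeping the one that maximizes this variance is the classic Otsu-style approach to separating background from foreground before generating the point cloud.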
Further, in step two, a threshold method is adopted to obtain the image depth value, with the formula:

p(x, y) = p(x, y), for dis ≤ p(x, y) ≤ dis·2h,

where p(x, y) is the image depth, dis is the image height, and h is the height of the filtered image.
Further, in step two, image depths are collected to form point cloud data, and PCL (the Point Cloud Library) is adopted to process the point cloud data to obtain the feature values of the image depth.
Further, in step three, the feature-selection frequency formula of the CountVectorizer is:

IDF = log(m / g),

where IDF is the feature-selection probability, m is the number of point cloud data, and g is the number of feature values.
Further, the ratio of the training set database to the test set database is 1:2.
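A random 1:2 split as claimed can be sketched as follows (the function name and seeding are my choices, for reproducibility only):

```python
import random

def split_one_to_two(samples, seed=0):
    """Randomly partition samples into a training set and a test set at
    the claimed 1:2 ratio: one third training, two thirds test."""
    rng = random.Random(seed)
    shuffled = list(samples)
    rng.shuffle(shuffled)
    cut = len(shuffled) // 3
    return shuffled[:cut], shuffled[cut:]
```

Shuffling before the cut is what makes the classification "random" in the sense of step five, rather than a deterministic head/tail split.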
Beneficial effects: feature values of the image-modality data set and the text-modality data set are retrieved, a model over these feature values is established and trained using transfer learning, subsequently measured image or text sets are input into the transfer-learning model, and whether they conform to the model is judged, thereby judging whether the data of the image set and text set is qualified.
Drawings
The present invention will be further described and illustrated with reference to the following drawings.
FIG. 1 is a flow chart of a preferred embodiment of the present invention.
Detailed Description
The technical solution of the present invention will be more clearly and completely explained by the description of the preferred embodiments of the present invention with reference to the accompanying drawings.
As shown in fig. 1, the efficient query method based on perceptual multi-modal big data according to the preferred embodiment of the present invention includes the following steps: an image modality data set is collected, an image is filtered, and point cloud data is generated.
In the water management field, the acquired modality data generally includes two modalities, one is an image modality and one is a text modality. The image modality includes videos, photos and the like taken by the monitoring system. The text modality includes measured depth values, water quality values, and the like.
Point cloud data (point cloud data) refers to a collection of vectors in a three-dimensional coordinate system. The scan data is recorded in the form of dots, each dot containing three-dimensional coordinates, some of which may contain color information (RGB) or Intensity information (Intensity).
Step two: and C, obtaining characteristic values of all points under the point cloud data according to the image depth values under the point cloud data in the step one.
For each data point in the point cloud, a least-squares local plane p is fitted through its K nearest neighbors such that the sum of the distances from all neighbors of the data point to this plane is minimal.
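This least-squares plane fit is the standard PCA construction: the plane passes through the neighbors' centroid, and its normal is the covariance eigenvector with the smallest eigenvalue. A sketch (not the patent's or PCL's actual code) using numpy:

```python
import numpy as np

def fit_local_plane(neighbors):
    """Fit the least-squares plane through a set of K neighbor points.
    Returns (centroid, normal): the plane passes through the centroid,
    and the normal is the eigenvector of the covariance matrix with the
    smallest eigenvalue (the direction of least spread)."""
    pts = np.asarray(neighbors, dtype=float)
    centroid = pts.mean(axis=0)
    cov = np.cov((pts - centroid).T)      # 3x3 covariance of the neighborhood
    eigvals, eigvecs = np.linalg.eigh(cov)  # eigh sorts eigenvalues ascending
    normal = eigvecs[:, 0]                # smallest eigenvalue -> plane normal
    return centroid, normal
```

PCL's normal-estimation module follows the same idea for each point's K-neighborhood, yielding per-point geometric feature values.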
Step three: the text modality data set is asynchronously collected and the text is feature extracted using a CountVectori zer.
The CountVectorzer () function only considers the frequency of occurrence of each word; then, a feature matrix is formed, and each line represents a word frequency statistical result of the training text. The idea is that according to all training texts, the appearance sequence is not considered, and each appeared vocabulary in the training texts is only considered as a column of characteristics to form a vocabulary table.
Step four: and establishing a training table of image features-text features by using a CAA algorithm.
Computer-aided numerical analysis (CAA), like computer-aided design (CAD), is an analysis and design method that uses the computer as its main tool. It is an emerging discipline built on computing technology, applied mathematics, and simulation theory, and has become an important branch of computer applications. The rapid development of science and technology makes the mathematical models built in scientific research, new-product development, and engineering design increasingly complex.
Step five: and randomly classifying the data in the training table to generate a training set database and a test set database, and performing transfer learning on the data in the training set database and the test set database.
Instance-based transfer learning selects instances from the source domain that are useful for training in the target domain, for example by assigning effective weights to labeled source-domain instances so that the distribution of source instances approaches that of the target instances, thereby establishing a reliable learning model with high classification accuracy in the target domain.
Because the data distributions of the source domain and the target domain in transfer learning are inconsistent, not all labeled data instances in the source domain are necessarily useful for the target domain.
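The instance-reweighting idea can be sketched with a single TrAdaBoost-style update step (the patent does not name a specific algorithm; the function below is an illustrative assumption):

```python
def reweight_source_instances(weights, errors, beta=0.5):
    """One reweighting step for instance-based transfer: source instances
    that were misclassified on target-like data (errors[i] == 1) are
    down-weighted by factor beta, so the effective source distribution
    drifts toward the target distribution. Weights are renormalized."""
    new_w = [w * (beta if e else 1.0) for w, e in zip(weights, errors)]
    total = sum(new_w)
    return [w / total for w in new_w]
```

Iterating this update concentrates weight on source instances that behave like target instances, which is exactly why not all source instances end up mattering.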
Step six: establish a query model from the transfer learning and upload it to the cloud; subsequent query images or text are input into the query model, which judges their features and queries the Internet according to the judged features.
Once the transfer-learning result is uploaded, different data can conveniently be distinguished from multiple places and at multiple times.
The above detailed description merely describes preferred embodiments of the present invention and does not limit the scope of the invention. It should be understood that those skilled in the art can make various changes, substitutions, and alterations without departing from the spirit and scope of the invention. The scope of the invention is defined by the claims and their equivalents.
Claims (6)
1. An efficient query method based on perception of multi-modal big data, comprising the following steps:
step one: collecting an image-modality data set, filtering the images, and generating point cloud data;
step two: obtaining feature values of all points in the point cloud data according to the image depth values from step one;
step three: asynchronously collecting a text-modality data set, and extracting features from the text using a CountVectorizer;
step four: establishing an image-feature/text-feature training table using the CAA algorithm;
step five: randomly classifying the data in the training table to generate a training set database and a test set database, and performing transfer learning on the data in both;
step six: establishing a query model from the transfer learning and uploading it to the cloud, inputting subsequent query images or text into the query model, the query model judging their features and querying the Internet according to the judged features.
2. The efficient query method based on perceptual multi-modal big data of claim 1, wherein the filtering in step one is performed by a variance method with the formula:

g = ω0(u0 − u)²,

wherein ω0 is the proportion of background points in the image, u0 is the average gray level of the background points, u is the overall average gray level of the image, and g is the variance of the image.
3. The efficient query method based on perceptual multi-modal big data of claim 2, wherein the image depth value in step two is obtained by a threshold method with the formula:

p(x, y) = p(x, y), for dis ≤ p(x, y) ≤ dis·2h,

where p(x, y) is the image depth, dis is the image height, and h is the height of the filtered image.
4. The efficient query method based on perceptual multi-modal big data of claim 3, wherein in step two image depths are collected to form point cloud data, and PCL is adopted to process the point cloud data to obtain the feature values of the image depth.
5. The efficient query method based on perceptual multi-modal big data of claim 3, wherein in step three the feature-selection frequency formula of the CountVectorizer is:

IDF = log(m / g),

where IDF is the feature-selection probability, m is the number of point cloud data, and g is the number of feature values.
6. The efficient query method based on perceptual multi-modal big data of claim 3, wherein the ratio of the training set database to the test set database is 1:2.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202011547371.3A CN112732956A (en) | 2020-12-24 | 2020-12-24 | Efficient query method based on perception multi-mode big data |
Publications (1)
Publication Number | Publication Date |
---|---|
CN112732956A true CN112732956A (en) | 2021-04-30 |
Family
ID=75605556
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202011547371.3A Pending CN112732956A (en) | 2020-12-24 | 2020-12-24 | Efficient query method based on perception multi-mode big data |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN112732956A (en) |
Citations (13)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN101719140A (en) * | 2009-12-23 | 2010-06-02 | 中山大学 | Figure retrieving method |
US20120189211A1 (en) * | 2011-01-21 | 2012-07-26 | Jiebo Luo | Rapid image search in a large database |
CN107679580A (en) * | 2017-10-21 | 2018-02-09 | 桂林电子科技大学 | A kind of isomery shift image feeling polarities analysis method based on the potential association of multi-modal depth |
US20180181592A1 (en) * | 2016-12-27 | 2018-06-28 | Adobe Systems Incorporate | Multi-modal image ranking using neural networks |
CN108334574A (en) * | 2018-01-23 | 2018-07-27 | 南京邮电大学 | A kind of cross-module state search method decomposed based on Harmonious Matrix |
CN108595636A (en) * | 2018-04-25 | 2018-09-28 | 复旦大学 | The image search method of cartographical sketching based on depth cross-module state correlation study |
CN108701220A (en) * | 2016-02-05 | 2018-10-23 | 索尼公司 | System and method for handling multi-modality images |
CN109033245A (en) * | 2018-07-05 | 2018-12-18 | 清华大学 | A kind of mobile robot visual-radar image cross-module state search method |
CN110569387A (en) * | 2019-08-20 | 2019-12-13 | 清华大学 | radar-image cross-modal retrieval method based on depth hash algorithm |
CN110647904A (en) * | 2019-08-01 | 2020-01-03 | 中国科学院信息工程研究所 | Cross-modal retrieval method and system based on unmarked data migration |
US10614366B1 (en) * | 2006-01-31 | 2020-04-07 | The Research Foundation for the State University o | System and method for multimedia ranking and multi-modal image retrieval using probabilistic semantic models and expectation-maximization (EM) learning |
CN111190981A (en) * | 2019-12-25 | 2020-05-22 | 中国科学院上海微系统与信息技术研究所 | Method and device for constructing three-dimensional semantic map, electronic equipment and storage medium |
US20200388071A1 (en) * | 2019-06-06 | 2020-12-10 | Qualcomm Technologies, Inc. | Model retrieval for objects in images using field descriptors |
Non-Patent Citations (2)
Title |
---|
KAIMIN WEI; ZHIBO ZHOU: "Adversarial Attentive Multi-Modal Embedding Learning for Image-Text Matching", IEEE * |
李晓雨 (Li Xiaoyu) et al.: "Image retrieval algorithm based on transfer learning", 《计算机科学》 (Computer Science), vol. 46, no. 1 * |
Legal Events

Date | Code | Title | Description
---|---|---|---
 | PB01 | Publication | 
 | SE01 | Entry into force of request for substantive examination | 