CN111695000A - Multi-source big data loading method and system

Info

Publication number: CN111695000A
Application number: CN202010551553.1A
Authority: CN
Inventors: 董瑞朝; 董新建; 曹晓青
Original assignee: Shandong Lanhai Navigation Big Data Development Co ltd
Current assignee: Shandong Lanhai Navigation Big Data Development Co ltd
Priority date: 2020-06-16
Filing date: 2020-06-16
Publication date: 2020-09-22
Anticipated expiration: 2040-06-16
Also published as: CN111695000B

Abstract

The invention provides a multi-source big data loading method and system. A first recording unit records the storage paths of all storage nodes in a storage server together with the attribute information of the storage entities. A classifier links at least one relationship between the storage paths in the first recording unit and the attribute information of the storage entities. An extractor extracts expressions based on that relationship, and a first loading rule and a second loading rule are established from the expressions. A cloud database then determines the attribute information corresponding to the data to be loaded through at least one of the first loading rule and the second loading rule, so as to search for the corresponding entity objects in the original data set of the storage server. The invention also provides a system and a matching method. Because the content loaded by the user is compared with the first loading rule and the second loading rule, which are derived from the storage rules, the corresponding entity object can be located accurately in the original data set of the storage server.

Description

Multi-source big data loading method and system

Technical Field

The invention relates to the technical field of computers, in particular to big data loading, and specifically to a multi-source big data loading method and system.

Background

As big data sources are fused, data types become increasingly diverse, and stored objects must first of all be stored correctly and be easy to retrieve. Storing customer data, loading production plans, financial statements, company plans and production orders, searching data and similar tasks all depend on big data loading, and the loaded data must directly reflect the search results. Most current big data loading approaches load only the terms the user searched for; when those terms are wrong or imprecise, the results the customer needs cannot be returned in time. In addition, on the storage side, data stored in a tree structure can only present a single tree of results under conventional search, and cannot fully satisfy the user's search requirements.

Disclosure of Invention

The invention aims to provide a multi-source big data loading method and a multi-source big data loading system to solve the problems in the background technology.

In order to achieve the purpose, the invention provides the following technical scheme:

The multi-source big data loading method comprises the following steps:

setting at least one storage server, wherein the storage server comprises at least one first recording unit, the first recording unit is used for recording the storage paths of all storage nodes in the storage server and the attribute information of the storage entities,

providing a classifier for linking at least one relationship between the storage path within the first recording unit and the attribute information of the storage entity,

providing an extractor, the extractor extracting the expression based on the relationship,

selecting a screening rule to identify repeated items in the expression according to the expression, deleting the repeated items to establish a first loading rule,

analyzing the similar expression rules in the first loading rule, deleting the similar expression rules to establish a second loading rule,

the first loading rule and the second loading rule are saved in memory,

the memory is saved to a cloud database,

the user data terminal receives the operation command input by the user,

the loading server obtains the loading command from the user data terminal,

the load command is transmitted to the cloud database,

the cloud database determines attribute information corresponding to the loaded data through at least one of the first loading rule and the second loading rule, so that a corresponding entity object is searched in an original data set of the storage server.
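The two rule-building steps above can be sketched as follows. This is a minimal illustration, not the patented implementation: the expression format (`path -> attribute`), the function names, and the use of `difflib.SequenceMatcher` with a 0.8 threshold as the measure of "similar" are all assumptions, since the text does not define how similarity is judged.

```python
from difflib import SequenceMatcher


def build_first_rule(expressions):
    """Delete repeated expressions (compared after trimming and lower-casing),
    keeping the first occurrence of each in its original order."""
    seen = set()
    rule = []
    for expr in expressions:
        key = expr.strip().lower()
        if key not in seen:
            seen.add(key)
            rule.append(expr.strip())
    return rule


def build_second_rule(first_rule, threshold=0.8):
    """Delete expressions that are similar to one already kept.
    The similarity measure and threshold are assumptions of this sketch."""
    kept = []
    for expr in first_rule:
        is_new = all(
            SequenceMatcher(None, expr.lower(), k.lower()).ratio() < threshold
            for k in kept
        )
        if is_new:
            kept.append(expr)
    return kept


# Hypothetical expressions linking a storage path to attribute information.
expressions = [
    "/finance/2020/report -> type=financial statement",
    "/finance/2020/report -> type=financial statement",
    "/finance/2020/reports -> type=financial statement",
    "/orders/2020 -> type=production order",
]
first_rule = build_first_rule(expressions)
second_rule = build_second_rule(first_rule)
```

In this sketch the second loading rule is always a subset of the first, which matches the order in which the method later consults them: the more compact second rule first, then the fuller first rule.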

Preferably, the search information received from the user by the user data terminal is split into one or more of connection words, specific terms and pictures, and is stored in the cloud database.
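One way the splitting step might look is sketched below. The connection-word list, the `split_query` function, and the rule that a picture is recognized by its file extension are hypothetical choices made for illustration; the text does not specify how each category is detected.

```python
# Hypothetical list of connection words; a real system would need a fuller one.
CONNECTION_WORDS = {"and", "or", "of", "for", "with", "in"}


def split_query(text, attachments=()):
    """Split a search input into connection words, specific terms and pictures."""
    parts = {"connection_words": [], "specific_terms": [], "pictures": []}
    for token in text.lower().split():
        bucket = "connection_words" if token in CONNECTION_WORDS else "specific_terms"
        parts[bucket].append(token)
    # Assumption: attachments with an image-like extension count as pictures.
    for name in attachments:
        if name.lower().endswith((".png", ".jpg", ".jpeg")):
            parts["pictures"].append(name)
    return parts


parts = split_query(
    "Production orders and financial statements for 2020", ["plan.png"]
)
```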

Preferably, the method for searching the entity object in the original data set of the storage server is as follows:

the cloud database compares the connection words, specific terms and pictures into which the search information has been split with the established second loading rule; if an identical or similar loading item exists, the loading item is executed to obtain the corresponding entity object in the original data set of the storage server,

if no identical or similar loading item is found in the second loading rule, the cloud database compares the connection words, specific terms and pictures with the established first loading rule; if an identical or similar loading item exists, the loading item is executed to obtain the corresponding entity object in the original data set of the storage server,

and if no identical or similar loading item is found in the first loading rule either, the corresponding entity object in the original data set of the storage server is loaded by fuzzy query, according to the recording habits and the loading-related history recorded by the cloud database.
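The three-stage lookup described above can be sketched as a simple fallback chain. The data shapes here are assumptions: each rule and the history are modelled as dictionaries from a loading item to attribute information, and the data set as a dictionary from attribute information to entity objects. For brevity the sketch matches rule entries exactly and reserves approximate matching (via `difflib.get_close_matches`) for the history stage, whereas the text allows "similar" items at every stage.

```python
from difflib import get_close_matches


def match_rule(terms, rule):
    """Return the attribute information of every rule entry named by a term."""
    return [rule[t] for t in terms if t in rule]


def find_entities(terms, second_rule, first_rule, history, dataset):
    """Try the second rule, then the first rule, then a fuzzy query on history."""
    for tier, rule in (
        ("first display result", second_rule),
        ("second display result", first_rule),
    ):
        attrs = match_rule(terms, rule)
        if attrs:
            return tier, [e for a in attrs for e in dataset.get(a, [])]
    # Fallback: fuzzy query against previously recorded loading items.
    attrs = []
    for t in terms:
        for hit in get_close_matches(t, list(history), n=1, cutoff=0.6):
            attrs.append(history[hit])
    return "third display result", [e for a in attrs for e in dataset.get(a, [])]


# Hypothetical data for illustration only.
dataset = {
    "financial statement": ["fs_2020_q1"],
    "production order": ["po_1001", "po_1002"],
    "customer": ["cust_42"],
}
second_rule = {"statement": "financial statement"}
first_rule = {"statement": "financial statement", "order": "production order"}
history = {"customer": "customer"}

hit_second = find_entities(["statement"], second_rule, first_rule, history, dataset)
hit_first = find_entities(["order"], second_rule, first_rule, history, dataset)
hit_fuzzy = find_entities(["custmer"], second_rule, first_rule, history, dataset)
```

The misspelled term `custmer` matches neither rule, so only the history-based fuzzy query recovers the intended entity.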

Preferably, the display mode of the entity object is as follows:

acquiring a corresponding entity object in an original data set of a storage server as a first display result by using a second loading rule;

acquiring a corresponding entity object in an original data set of a storage server as a second display result by using a first loading rule;

and acquiring a corresponding entity object in the original data set of the storage server by using the recording habit and the loading related history recorded by the cloud database as a third display result.

Preferably, the expression further includes an internal node address of the storage data under the corresponding storage node determined based on the relationship.

Preferably, the expressions are sequentially extracted in a parent-level encoding and child-level data set mode.
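The text does not define "parent-level encoding and child-level data set" further. One possible reading, shown here purely as an assumption, is that each parent storage node receives a code and is emitted together with the set of its child nodes, parents being visited in order. The dotted code scheme and the `extract_expressions` function are hypothetical.

```python
def extract_expressions(node, code="0"):
    """Yield (parent_code, child_names) pairs, visiting each parent before
    its children (pre-order). Leaf nodes produce no expression."""
    children = node.get("children", [])
    if children:
        yield code, [c["name"] for c in children]
    for i, child in enumerate(children):
        yield from extract_expressions(child, f"{code}.{i}")


# Hypothetical storage-node tree.
tree = {
    "name": "root",
    "children": [
        {"name": "finance", "children": [{"name": "statements"}, {"name": "plans"}]},
        {"name": "orders"},
    ],
}
expressions = list(extract_expressions(tree))
```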

The invention also provides a multi-source big data loading system, which comprises:

a first recording unit for recording the storage paths of all storage nodes in the storage server and the attribute information of the storage entities,

a classifier for linking at least one relationship between the storage path within the first recording unit and the attribute information of the storage entity,

an extractor for extracting an expression based on the relationship,

a filter for selecting a filtering rule to identify a duplicate term in the expression according to the expression,

a first loading tag for deleting the repeated items to establish a first loading rule,

a second loading tag for analyzing the similar expression rules in the first loading rule and deleting the similar expression rules to establish a second loading rule,

a memory storing a first load tag and a second load tag,

a user data terminal for inputting and receiving the operation command of the user,

a loading server for acquiring a loading command from the user data terminal and transmitting the loading command to the cloud database,

and a cloud database for receiving the loading command and starting a loading actuator, which determines the attribute information corresponding to the loaded data through at least one of the first loading tag and the second loading tag, so as to search for the corresponding entity object in the original data set of the storage server.

Preferably, the classifier links the storage paths in a tree network.
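A tree-shaped linking of storage paths to attribute information might be sketched as a nested dictionary keyed by path segment. The record format, the `build_path_tree` function, and the `children`/`attributes` keys are assumptions made for illustration.

```python
def build_path_tree(records):
    """Link storage paths into a tree and attach attribute information
    to the node at the end of each path."""
    root = {}
    for path, attrs in records:
        node = root
        for part in path.strip("/").split("/"):
            node = node.setdefault("children", {}).setdefault(part, {})
        node["attributes"] = attrs
    return root


# Hypothetical (storage path, attribute information) records.
records = [
    ("/finance/2020/report", {"type": "financial statement"}),
    ("/finance/2020/plan", {"type": "company plan"}),
    ("/orders/1001", {"type": "production order"}),
]
tree = build_path_tree(records)
report = tree["children"]["finance"]["children"]["2020"]["children"]["report"]
```

Paths that share a prefix, such as the two under `/finance/2020`, share the same parent nodes, so the relationship between a path and its attributes can be followed from the root.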

Compared with the prior art, the invention has the beneficial effects that:

In the invention, the content loaded by the user is split into one or more of connection words, specific terms and pictures and compared with the first loading rule and the second loading rule, which are derived from the storage rules, so that the corresponding entity object can be located accurately in the original data set of the storage server.

When the user fails to provide accurate loading elements, the result the user wants can still be loaded by fuzzy query over the user's recording habits and loading-related history.

Drawings

FIG. 1 is a flow chart of a method of the present invention;

FIG. 2 is a general block diagram of the system of the present invention.

Detailed Description

The preferred embodiments are described in detail with reference to FIGS. 1-2. The invention also provides a multi-source big data loading system, which comprises

an extractor for extracting an expression based on the relationship,

a memory storing a first load tag and a second load tag,

The classifier links the storage paths in a tree network.

Example 1

The invention provides a multi-source big data loading method which comprises the following steps

the first loading rule and the second loading rule are saved in memory,

the memory is saved to a cloud database,

the user data terminal receives the operation command input by the user,

the loading server obtains the loading command from the user data terminal,

the load command is transmitted to the cloud database,

The method for searching for the entity object in the original data set of the storage server is as follows: the cloud database compares the connection words, specific terms and pictures into which the search information has been split with the established second loading rule, and if an identical or similar loading item exists, the loading item is executed to obtain the corresponding entity object in the original data set of the storage server. The entity object obtained by using the second loading rule serves as a first display result.

Example 2

the first loading rule and the second loading rule are saved in memory,

the memory is saved to a cloud database,

the user data terminal receives the operation command input by the user,

the loading server obtains the loading command from the user data terminal,

the load command is transmitted to the cloud database,

The method for searching for the entity object in the original data set of the storage server is as follows:

If no identical or similar loading item is found after the comparison with the second loading rule in Example 1, the cloud database compares the connection words, specific terms and pictures into which the search information has been split with the established first loading rule. If an identical or similar loading item exists, the loading item is executed to obtain the corresponding entity object in the original data set of the storage server. The entity object obtained by using the first loading rule serves as a second display result.

Example 3

the first loading rule and the second loading rule are saved in memory,

the memory is saved to a cloud database,

the user data terminal receives the operation command input by the user,

the loading server obtains the loading command from the user data terminal,

the load command is transmitted to the cloud database,

If no identical or similar loading item is found after the comparison with the first loading rule in Example 2, the corresponding entity object in the original data set of the storage server is loaded by fuzzy query, using the recording habits and the loading-related history recorded in the cloud database. The entity object obtained in this way serves as a third display result.

As can be seen from Examples 1 to 3, the content loaded by the user is split into one or more of connection words, specific terms and pictures and compared with the first loading rule and the second loading rule, which are derived from the storage rules, so that the corresponding entity object can be located accurately in the original data set of the storage server. When the user fails to provide accurate loading elements, the result the user wants can still be loaded by fuzzy query over the user's recording habits and loading-related history.

The principles and embodiments of the present invention are explained herein using specific examples, which are presented only to assist in understanding the method and core concepts of the present invention. The foregoing is only a preferred embodiment of the present invention. Because written description is limited while the possible specific structures are unlimited, it will be apparent to those skilled in the art that modifications, refinements or changes may be made without departing from the principle of the present invention, and that the technical features described above may be combined in any suitable manner; such modifications, variations and combinations, as well as direct application of the concept and technical solution of the invention to other uses, fall within the scope of the invention as defined by the claims.

Claims

1. The multi-source big data loading method is characterized by comprising the following steps

the first loading rule and the second loading rule are saved in memory,

the memory is saved to a cloud database,

the user data terminal receives the operation command input by the user,

the loading server obtains the loading command from the user data terminal,

the load command is transmitted to the cloud database,

2. The multi-source big data loading method according to claim 1, wherein the search information received from the user by the user data terminal is split into one or more of connection words, specific terms and pictures, and is stored in the cloud database.

3. The multi-source big data loading method according to claim 1, wherein the method for searching the entity object in the original data set of the storage server is as follows:

4. The multi-source big data loading method according to claim 1 or 3, wherein the entity object is displayed in a manner that:

5. The multi-source big data loading method according to claim 1, wherein the expression further includes an internal node address of the storage data under the corresponding storage node determined based on the relationship.

6. The multi-source big data loading method according to claim 1, wherein the expressions are sequentially extracted in a parent-level encoding and child-level data set manner.

7. A multi-source big data loading system is characterized by comprising

an extractor for extracting an expression based on the relationship,

a memory storing a first load tag and a second load tag,

8. The multi-source big data loading system according to claim 7, wherein the classifier links the storage paths in a tree network.