CN115130043A - Database-based data processing method, device, equipment and storage medium - Google Patents

Database-based data processing method, device, equipment and storage medium Download PDF

Info

Publication number
CN115130043A
CN115130043A CN202211050745.XA CN202211050745A CN115130043A CN 115130043 A CN115130043 A CN 115130043A CN 202211050745 A CN202211050745 A CN 202211050745A CN 115130043 A CN115130043 A CN 115130043A
Authority
CN
China
Prior art keywords
database
node
target
sub
databases
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN202211050745.XA
Other languages
Chinese (zh)
Other versions
CN115130043B (en
Inventor
周全
詹东远
张泽阳
王子豪
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Smartin Technology Shenzhen Co ltd
Original Assignee
Smartin Technology Shenzhen Co ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Smartin Technology Shenzhen Co ltd filed Critical Smartin Technology Shenzhen Co ltd
Priority to CN202211050745.XA priority Critical patent/CN115130043B/en
Publication of CN115130043A publication Critical patent/CN115130043A/en
Application granted granted Critical
Publication of CN115130043B publication Critical patent/CN115130043B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F17/00Digital computing or data processing equipment or methods, specially adapted for specific functions
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/25Integrating or interfacing systems involving database management systems

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Databases & Information Systems (AREA)
  • Data Mining & Analysis (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Mathematical Physics (AREA)
  • Software Systems (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

The invention relates to the field of data processing, and discloses a database-based data processing method, device, equipment and storage medium, which are used for improving the accuracy of data mining. The method comprises the following steps: reading and analyzing the initial interface to obtain a target node, and searching the target node to obtain a target child node; performing sub-node grouping on the target sub-nodes according to the number of the databases to obtain a plurality of first sub-node sets; carrying out node sequence arrangement on the plurality of first child node sets to obtain child node arrangement results, and respectively distributing the plurality of first child node sets to a plurality of databases according to the child node arrangement results to obtain second child node sets; performing database operation on the second child node set to obtain a target operation result, and performing assignment on the target operation result to obtain an assignment result; and performing data fusion on the assignment result to obtain target data, and transcoding and outputting the target data.

Description

Database-based data processing method, device, equipment and storage medium
Technical Field
The present invention relates to the field of data processing, and in particular, to a database-based data processing method, apparatus, device, and storage medium.
Background
Currently, no well-established definition or concept has been formed for big data. Big data is defined as big data (BigData), the main core of which is on big words, which means that in the process of data processing based on database, the big data is beyond the category of common data processing based on database, and the work of data processing based on database is difficult to be carried out by applying modern general software.
Therefore, how to perform data mining on the database becomes a hot research currently, but the accuracy of data mining is low in some existing data mining schemes.
Disclosure of Invention
The invention provides a database-based data processing method, a database-based data processing device, a database-based data processing equipment and a database-based data processing storage medium, which are used for improving the accuracy of data mining.
The first aspect of the present invention provides a data processing method based on a database, where the data processing method based on the database includes: receiving a target processing request to be processed, and analyzing the target processing request to obtain an initial interface corresponding to the target processing request; reading and analyzing the initial interface to obtain a target node, and performing node search on the target node to obtain a target sub-node corresponding to the target node; acquiring the number of databases corresponding to a plurality of preset databases, and performing sub-node grouping on the target sub-nodes according to the number of the databases to obtain a plurality of first sub-node sets; carrying out node sequence arrangement on the plurality of first sub-node sets to obtain sub-node arrangement results, and respectively allocating the plurality of first sub-node sets to the plurality of databases according to the sub-node arrangement results to obtain second sub-node sets corresponding to each database; performing database operation on the second sub-node set corresponding to each database to obtain a target operation result corresponding to each database, and assigning values to the target operation result corresponding to each database to obtain an assignment result corresponding to each database; and forwarding the assignment result corresponding to each database, performing data fusion on the assignment result corresponding to each database to obtain target data, and transcoding and outputting the target data.
Optionally, in a first implementation manner of the first aspect of the present invention, the reading and analyzing the initial interface to obtain a target node, and performing node search on the target node to obtain a target child node corresponding to the target node, includes: calling a preset data reading function to read the initial interface, and performing data analysis on the initial interface to obtain a target node; and calling a preset node search algorithm, and performing node search extraction on the target node to obtain a target child node corresponding to the target node.
Optionally, in a second implementation manner of the first aspect of the present invention, the obtaining a preset number of databases corresponding to a plurality of databases, and performing child node grouping on the target child node according to the number of databases to obtain a plurality of first child node sets includes: determining a plurality of databases based on preset database parameters, and calculating the number of the databases corresponding to the plurality of databases; determining the sub-node grouping number of the target sub-node according to the database number; and performing sub-node grouping on the target sub-nodes according to the sub-node grouping number to obtain a plurality of first sub-node sets.
Optionally, in a third implementation manner of the first aspect of the present invention, the performing node sequence arrangement on the plurality of first child node sets to obtain child node arrangement results, and respectively allocating the plurality of first child node sets to the plurality of databases according to the child node arrangement results to obtain a second child node set corresponding to each database includes: respectively acquiring variable data of each first sub-node set, and determining the target weight of each first sub-node set according to the variable data; carrying out node sequence arrangement on the plurality of first child node sets according to the target weight of each first child node set to obtain child node arrangement results; and respectively distributing the plurality of first sub-node sets to the plurality of databases according to a preset distribution strategy and the sub-node arrangement result to obtain a second sub-node set corresponding to each database.
Optionally, in a fourth implementation manner of the first aspect of the present invention, the performing database operation on the second child node set corresponding to each database to obtain a target operation result corresponding to each database, and assigning values to the target operation result corresponding to each database to obtain an assignment result corresponding to each database includes: performing database operation on the second child node set corresponding to each database to obtain a target operation result corresponding to each database; and calling a preset assignment function, and assigning the target operation result corresponding to each database to obtain the assignment result corresponding to each database.
Optionally, in a fifth implementation manner of the first aspect of the present invention, the forwarding the assignment result corresponding to each database, performing data fusion on the assignment result corresponding to each database to obtain target data, and transcoding and outputting the target data includes: forwarding the assignment result corresponding to each database, and performing data fusion on the assignment result corresponding to each database to obtain target data; judging whether the target data meets a preset output condition or not; if yes, transcoding and outputting the target data; and if not, repeatedly calculating the assignment result corresponding to each database.
Optionally, in a sixth implementation manner of the first aspect of the present invention, the database-based data processing method further includes: obtaining a plurality of training data and a training model, the plurality of training data comprising: a plurality of target clause sets; performing model training on the training model based on the plurality of training data to obtain a trained target model; and carrying out sub-node grouping on the target sub-nodes based on the target model to obtain a plurality of first sub-node sets.
A second aspect of the present invention provides a database-based data processing apparatus, comprising: the receiving module is used for receiving a target processing request to be processed and analyzing the target processing request to obtain an initial interface corresponding to the target processing request; the analysis module is used for reading and analyzing the initial interface to obtain a target node, and searching the target node to obtain a target sub-node corresponding to the target node; the grouping module is used for acquiring the number of databases corresponding to a plurality of preset databases, and performing sub-node grouping on the target sub-nodes according to the number of the databases to obtain a plurality of first sub-node sets; the distribution module is used for carrying out node sequence arrangement on the plurality of first sub-node sets to obtain sub-node arrangement results, and distributing the plurality of first sub-node sets to the plurality of databases respectively according to the sub-node arrangement results to obtain second sub-node sets corresponding to each database; the assignment module is used for carrying out database operation on the second sub-node set corresponding to each database to obtain a target operation result corresponding to each database, and assigning values to the target operation result corresponding to each database to obtain an assignment result corresponding to each database; and the fusion module is used for forwarding the assignment result corresponding to each database, performing data fusion on the assignment result corresponding to each database to obtain target data, and transcoding and outputting the target data.
Optionally, in a first implementation manner of the second aspect of the present invention, the parsing module is specifically configured to: calling a preset data reading function to read the initial interface, and performing data analysis on the initial interface to obtain a target node; and calling a preset node search algorithm, and performing node search extraction on the target node to obtain a target child node corresponding to the target node.
Optionally, in a second implementation manner of the second aspect of the present invention, the grouping module is specifically configured to: determining a plurality of databases based on preset database parameters, and calculating the number of the databases corresponding to the plurality of databases; determining the sub-node grouping number of the target sub-node according to the database number; and performing sub-node grouping on the target sub-nodes according to the sub-node grouping number to obtain a plurality of first sub-node sets.
Optionally, in a third implementation manner of the second aspect of the present invention, the allocation module is specifically configured to: respectively acquiring variable data of each first sub-node set, and determining the target weight of each first sub-node set according to the variable data; carrying out node sequence arrangement on the plurality of first child node sets according to the target weight of each first child node set to obtain a child node arrangement result; and respectively distributing the plurality of first sub-node sets to the plurality of databases according to a preset distribution strategy and the sub-node arrangement result to obtain a second sub-node set corresponding to each database.
Optionally, in a fourth implementation manner of the second aspect of the present invention, the assignment module is specifically configured to: performing database operation on the second child node set corresponding to each database to obtain a target operation result corresponding to each database; and calling a preset assignment function, and assigning the target operation result corresponding to each database to obtain the assignment result corresponding to each database.
Optionally, in a fifth implementation manner of the second aspect of the present invention, the fusion module is specifically configured to: forwarding the assignment result corresponding to each database, and performing data fusion on the assignment result corresponding to each database to obtain target data; judging whether the target data meets a preset output condition or not; if yes, transcoding and outputting the target data; and if not, repeatedly calculating the assignment result corresponding to each database.
Optionally, in a sixth implementation manner of the second aspect of the present invention, the database-based data processing apparatus further includes: a training module to obtain a plurality of training data and a training model, the plurality of training data comprising: a plurality of target clause sets; performing model training on the training model based on the plurality of training data to obtain a trained target model; and carrying out sub-node grouping on the target sub-nodes based on the target model to obtain a plurality of first sub-node sets.
A third aspect of the present invention provides a database-based data processing apparatus comprising: a memory and at least one processor, the memory having instructions stored therein; the at least one processor invokes the instructions in the memory to cause the database-based data processing apparatus to perform the database-based data processing method described above.
A fourth aspect of the present invention provides a computer-readable storage medium having stored therein instructions, which, when run on a computer, cause the computer to execute the above-described database-based data processing method.
In the technical scheme provided by the invention, a plurality of first child node sets are obtained by acquiring the number of databases corresponding to a plurality of preset databases, and performing child node grouping on target child nodes according to the number of the databases, node sequence arrangement is performed on the plurality of first child node sets to obtain child node arrangement results, and the plurality of first child node sets are respectively distributed to the plurality of databases according to the child node arrangement results to obtain a second child node set corresponding to each database; and performing data fusion on the assignment results corresponding to each database to obtain target data, thereby improving the accuracy of data mining.
Drawings
FIG. 1 is a schematic diagram of an embodiment of a database-based data processing method according to an embodiment of the present invention;
FIG. 2 is a diagram of another embodiment of a database-based data processing method according to an embodiment of the present invention;
FIG. 3 is a schematic diagram of an embodiment of a database-based data processing apparatus according to an embodiment of the present invention;
FIG. 4 is a schematic diagram of another embodiment of a database-based data processing apparatus according to an embodiment of the present invention;
fig. 5 is a schematic diagram of an embodiment of a database-based data processing apparatus according to the embodiment of the present invention.
Detailed Description
The embodiment of the invention provides a database-based data processing method, a database-based data processing device, a database-based data processing equipment and a database-based storage medium, which are used for improving the accuracy of data mining. The terms "first," "second," "third," "fourth," and the like in the description and in the claims, as well as in the drawings, if any, are used for distinguishing between similar elements and not necessarily for describing a particular sequential or chronological order. It will be appreciated that the data so used may be interchanged under appropriate circumstances such that the embodiments described herein may be practiced otherwise than as specifically illustrated or described herein. Furthermore, the terms "comprises," "comprising," or "having," and any variations thereof, are intended to cover non-exclusive inclusions, such that a process, method, system, article, or apparatus that comprises a list of steps or elements is not necessarily limited to those steps or elements expressly listed, but may include other steps or elements not expressly listed or inherent to such process, method, article, or apparatus.
For convenience of understanding, a detailed flow of an embodiment of the present invention is described below, and referring to fig. 1, an embodiment of a database-based data processing method in an embodiment of the present invention includes:
101. receiving a target processing request to be processed, and analyzing the target processing request to obtain an initial interface corresponding to the target processing request;
it is to be understood that the execution subject of the present invention may be a database-based data processing apparatus, and may also be a terminal or a server, which is not limited herein. The embodiment of the present invention is described by taking a server as an execution subject.
Specifically, the server receives a target processing request to be processed, analyzes the target processing request, and obtains a corresponding data identifier, and then the server obtains an initial interface corresponding to the target processing request from a preset database according to the data identifier.
102. Reading and analyzing the initial interface to obtain a target node, and searching the target node to obtain a target sub-node corresponding to the target node;
specifically, the server extracts a plurality of operator macros or mathematical expressions from a preset database, then verifies the equivalence of the mathematical expression of the target node, and if the mathematical expression is equivalent to the mathematical expression corresponding to the preset template data, the server obtains a target child node corresponding to the target node.
103. Acquiring the number of databases corresponding to a plurality of preset databases, and performing sub-node grouping on target sub-nodes according to the number of the databases to obtain a plurality of first sub-node sets;
specifically, the server obtains the number of databases corresponding to the multiple databases, further determines the number of sub-node groups according to the number of databases corresponding to the multiple databases, and finally performs sub-node grouping according to the number of sub-node groups to obtain multiple first sub-node sets.
104. Carrying out node sequence arrangement on the plurality of first sub-node sets to obtain sub-node arrangement results, respectively distributing the plurality of first sub-node sets to a plurality of databases according to the sub-node arrangement results to obtain a second sub-node set corresponding to each database;
specifically, the server obtains variable data of each first sub-node set respectively, determines a target weight of each first sub-node set according to the variable data, performs node sequence arrangement on the plurality of first sub-node sets according to the target weight of each first sub-node set to obtain a sub-node arrangement result, and finally, the server distributes the plurality of first sub-node sets to the plurality of databases according to the sub-node arrangement result to obtain a second clause corresponding to each database.
105. Performing database operation on the second sub-node set corresponding to each database to obtain a target operation result corresponding to each database, and assigning values to the target operation result corresponding to each database to obtain an assignment result corresponding to each database;
specifically, database operation is performed on the second child node set corresponding to each database, so that a target operation result corresponding to each database is obtained; calling a preset assignment function, assigning the target operation result corresponding to each database to obtain the assignment result corresponding to each database, wherein the assignment function maps the MLN structure with the target field to establish the association between the two fields, and then optimizing the structure obtained by mapping to adapt to the index of the target field, so as to assign the target operation result corresponding to each database.
106. And forwarding the assignment result corresponding to each database, performing data fusion on the assignment result corresponding to each database to obtain target data, and transcoding and outputting the target data.
Specifically, a preset random assignment restart algorithm is used for carrying out close node search on assignment results corresponding to each database to obtain a node assignment network table, then a server takes a target node as a center and carries out secondary search on the node assignment network table by combining similarity on behavior pattern vectors among different nodes to obtain the target node assignment network table, data fusion is carried out on the assignment results corresponding to each database to obtain target data, and transcoding and outputting are carried out on the target data.
In the embodiment of the invention, a plurality of first child node sets are obtained by obtaining the number of databases corresponding to a plurality of preset databases and performing child node grouping on target child nodes according to the number of the databases, a node sequence arrangement is performed on the plurality of first child node sets to obtain a child node arrangement result, and the plurality of first child node sets are respectively distributed to the plurality of databases according to the child node arrangement result to obtain a second child node set corresponding to each database; and performing data fusion on the assignment results corresponding to each database to obtain target data, thereby improving the accuracy of data mining.
Referring to fig. 2, another embodiment of the database-based data processing method according to the embodiment of the present invention includes:
201. receiving a target processing request to be processed, and analyzing the target processing request to obtain an initial interface corresponding to the target processing request;
specifically, in this embodiment, the specific implementation of step 201 is similar to that of step 101, and is not described herein again.
202. Reading and analyzing the initial interface to obtain a target node, and searching the target node to obtain a target sub-node corresponding to the target node;
specifically, a preset data reading function is called to read an initial interface, and data analysis is performed on the initial interface to obtain a target node; and calling a preset node search algorithm, and searching and extracting nodes of the target node to obtain a target sub-node corresponding to the target node.
The method comprises the steps that a server creates a CNF code, the CNF code is solved through a preset SAT solver, a preset data reading function is called to read an initial interface, data analysis is conducted on the initial interface, and a target node is obtained; invoking a preset node search algorithm, in an exemplary embodiment, since the data in the initial interface has been normalized to the same structural pattern, solving the CNF encoding of the conjunctive normal form is very simple for the SAT solver, it does not get stuck in an exponential complexity process as it is usually, and the solution can be completed in a few seconds, e.g., two equivalent logical data with the same structure, which the SAT solver can solve in polynomial time. For example, the SAT solver traverses two equivalent data and generates CNF clauses in topological order. Since the two equivalent data are exactly in the same implementation form, the SAT solver merges each pair of CNF clauses with the same inputs and subfunctions. The verification result can be obtained by traversing once in polynomial time, and finally the server obtains a target child node corresponding to the target node.
203. Acquiring the number of databases corresponding to a plurality of preset databases, and performing sub-node grouping on target sub-nodes according to the number of the databases to obtain a plurality of first sub-node sets;
specifically, a plurality of databases are determined based on preset database parameters, and the number of databases corresponding to the plurality of databases is calculated; determining the sub-node grouping number of the target sub-node according to the database number; and carrying out sub-node grouping on the target sub-nodes according to the sub-node grouping number to obtain a plurality of first sub-node sets.
The method comprises the steps that after a server screens a plurality of databases in the determining process, a corresponding CNF formula is constructed to serve as a permanent clause group, wherein the server dynamically selects a partition set, the equivalence of each pair of the databases is verified, if signals in the partition set have a dependency relationship, namely a certain signal is a fan-out signal of other signals, the signal is deleted from the partition set, in addition, signals with the largest number of fan-ins are selected to enter the partition set, the server further realizes an increment satisfiability algorithm by grouping clauses of the CNF formula, and after the signals of one pair are verified to be equivalent, the target sub-nodes are grouped according to the grouping number of the sub-nodes to obtain a plurality of first sub-node sets.
204. Respectively acquiring variable data of each first sub-node set, and determining the target weight of each first sub-node set according to the variable data;
205. carrying out node sequence arrangement on the plurality of first child node sets according to the target weight of each first child node set to obtain a child node arrangement result;
206. respectively distributing the plurality of first sub-node sets to the plurality of databases according to a preset distribution strategy and the arrangement result of the sub-nodes to obtain a second sub-node set corresponding to each database;
specifically, a weight value corresponding to each first sub-node set in the current process and the use times of each first sub-node set in the use record of the first sub-node set are obtained, and the weight optimization value of each first sub-node set is calculated by using the use times and the weight value; and according to the weight optimization value, the total number of the first child node sets and identification information of each first child node set, calculating an arrangement position value of each first child node set, arranging each first child node set according to the arrangement position value to obtain a first child node set sequence, and then respectively allocating the plurality of first child node sets to a plurality of databases by the server according to a preset distribution strategy and a child node arrangement result to obtain a second child node set corresponding to each database.
207. Performing database operation on the second sub-node set corresponding to each database to obtain a target operation result corresponding to each database, and assigning values to the target operation result corresponding to each database to obtain an assignment result corresponding to each database;
specifically, database operation is performed on the second child node set corresponding to each database to obtain a target operation result corresponding to each database; and calling a preset assignment function, and assigning the target operation result corresponding to each database to obtain the assignment result corresponding to each database.
Wherein, the encoding operation algorithm outputs code words according to the syntax element values. The encoding algorithm includes a look-up table, a suffix generator, and a combiner. The lookup table outputs a second string of cells based on the syntax element values. The suffix generator performs exponential golomb binary encoding to generate a string according to the syntax element value. When the syntax element value is less than or equal to a threshold value, the meta-string is output as the codeword. When the syntax element value is larger than a critical value, the combiner combines the second binary character string and the second binary character string into the code word, and then the server obtains a target operation result corresponding to each database; and calling a preset assignment function, and assigning the target operation result corresponding to each database to obtain the assignment result corresponding to each database.
208. And forwarding the assignment result corresponding to each database, performing data fusion on the assignment result corresponding to each database to obtain target data, and transcoding and outputting the target data.
Specifically, the assignment result corresponding to each database is forwarded, and data fusion is performed on the assignment result corresponding to each database to obtain target data; judging whether the target data meet a preset output condition or not; if yes, transcoding and outputting the target data; and if not, repeatedly calculating the assignment result corresponding to each database.
The method comprises the steps that assignment results corresponding to each database of a server are forwarded, data fusion is carried out on the assignment results corresponding to each database, it is required to be explained that a preset system database stores received production data, basic data are provided for calculation assignment and verification comparison, a fixed assignment calculation model or a variable assignment calculation model is provided through correlation analysis of assignment and assigned values, assignment results are obtained, and whether target data meet preset output conditions or not is judged for the assigned target data; if yes, transcoding and outputting the target data; and if not, repeatedly calculating the assignment result corresponding to each database.
Optionally, a plurality of training data and training models are obtained, where the plurality of training data includes: a plurality of target clause sets; performing model training on the training model based on a plurality of training data to obtain a trained target model; and carrying out sub-node grouping on the target sub-nodes based on the target model to obtain a plurality of first sub-node sets.
In the embodiment of the invention, a plurality of first child node sets are obtained by obtaining the number of databases corresponding to a plurality of preset databases and performing child node grouping on target child nodes according to the number of the databases, a node sequence arrangement is performed on the plurality of first child node sets to obtain a child node arrangement result, and the plurality of first child node sets are respectively distributed to the plurality of databases according to the child node arrangement result to obtain a second child node set corresponding to each database; and performing data fusion on the assignment results corresponding to each database to obtain target data, thereby improving the accuracy of data mining.
With reference to fig. 3, the data processing method based on the database in the embodiment of the present invention is described above, and the following describes the data processing apparatus based on the database in the embodiment of the present invention, where an embodiment of the data processing apparatus based on the database in the embodiment of the present invention includes:
a receiving module 301, configured to receive a target processing request to be processed, and analyze the target processing request to obtain an initial interface corresponding to the target processing request;
an analysis module 302, configured to read and analyze the initial interface to obtain a target node, and perform node search on the target node to obtain a target child node corresponding to the target node;
the grouping module 303 is configured to obtain the number of databases corresponding to a plurality of preset databases, and perform child node grouping on the target child node according to the number of databases to obtain a plurality of first child node sets;
the allocating module 304 is configured to perform node sequence arrangement on the plurality of first child node sets to obtain child node arrangement results, and allocate the plurality of first child node sets to the plurality of databases according to the child node arrangement results to obtain a second child node set corresponding to each database;
an assignment module 305, configured to perform database operation on the second child node set corresponding to each database to obtain a target operation result corresponding to each database, and assign a value to the target operation result corresponding to each database to obtain an assignment result corresponding to each database;
and the fusion module 306 is configured to forward the assignment result corresponding to each database, perform data fusion on the assignment result corresponding to each database to obtain target data, and transcode and output the target data.
In the embodiment of the invention, a plurality of first child node sets are obtained by acquiring the number of databases corresponding to a plurality of preset databases, performing child node grouping on target child nodes according to the number of the databases, performing node sequence arrangement on the plurality of first child node sets to obtain child node arrangement results, and distributing the plurality of first child node sets to the plurality of databases respectively according to the child node arrangement results to obtain a second child node set corresponding to each database; and performing data fusion on the assignment results corresponding to each database to obtain target data, thereby improving the accuracy of data mining.
Referring to fig. 4, another embodiment of the database-based data processing apparatus according to the embodiment of the present invention includes:
a receiving module 301, configured to receive a target processing request to be processed, and analyze the target processing request to obtain an initial interface corresponding to the target processing request;
an analysis module 302, configured to read and analyze the initial interface to obtain a target node, and perform node search on the target node to obtain a target child node corresponding to the target node;
the grouping module 303 is configured to obtain the number of databases corresponding to a plurality of preset databases, and perform child node grouping on the target child node according to the number of databases to obtain a plurality of first child node sets;
the allocating module 304 is configured to perform node sequence arrangement on the plurality of first child node sets to obtain child node arrangement results, and allocate the plurality of first child node sets to the plurality of databases according to the child node arrangement results to obtain a second child node set corresponding to each database;
an assignment module 305, configured to perform database operation on the second child node set corresponding to each database to obtain a target operation result corresponding to each database, and assign a value to the target operation result corresponding to each database to obtain an assignment result corresponding to each database;
and the fusion module 306 is configured to forward the assignment result corresponding to each database, perform data fusion on the assignment result corresponding to each database to obtain target data, and transcode and output the target data.
Optionally, the parsing module 302 is specifically configured to: calling a preset data reading function to read the initial interface, and performing data analysis on the initial interface to obtain a target node; and calling a preset node search algorithm, and performing node search extraction on the target node to obtain a target child node corresponding to the target node.
Optionally, the grouping module 303 is specifically configured to: determining a plurality of databases based on preset database parameters, and calculating the number of the databases corresponding to the plurality of databases; determining the sub-node grouping number of the target sub-node according to the database number; and performing sub-node grouping on the target sub-nodes according to the sub-node grouping number to obtain a plurality of first sub-node sets.
Optionally, the allocating module 304 is specifically configured to: respectively acquiring variable data of each first sub-node set, and determining the target weight of each first sub-node set according to the variable data; carrying out node sequence arrangement on the plurality of first child node sets according to the target weight of each first child node set to obtain child node arrangement results; and respectively distributing the plurality of first sub-node sets to the plurality of databases according to a preset distribution strategy and the sub-node arrangement result to obtain a second sub-node set corresponding to each database.
Optionally, the assignment module 305 is specifically configured to: performing database operation on the second child node set corresponding to each database to obtain a target operation result corresponding to each database; and calling a preset assignment function, and assigning the target operation result corresponding to each database to obtain the assignment result corresponding to each database.
Optionally, the fusion module 306 is specifically configured to: forwarding the assignment result corresponding to each database, and performing data fusion on the assignment result corresponding to each database to obtain target data; judging whether the target data meets a preset output condition or not; if yes, transcoding and outputting the target data; and if not, repeatedly calculating the assignment result corresponding to each database.
Optionally, the database-based data processing apparatus further includes:
a training module 307, configured to obtain a plurality of training data and a training model, where the plurality of training data includes: a plurality of target clause sets; performing model training on the training model based on the plurality of training data to obtain a trained target model; and carrying out sub-node grouping on the target sub-nodes based on the target model to obtain a plurality of first sub-node sets.
In the embodiment of the invention, a plurality of first child node sets are obtained by obtaining the number of databases corresponding to a plurality of preset databases and performing child node grouping on target child nodes according to the number of the databases, a node sequence arrangement is performed on the plurality of first child node sets to obtain a child node arrangement result, and the plurality of first child node sets are respectively distributed to the plurality of databases according to the child node arrangement result to obtain a second child node set corresponding to each database; and performing data fusion on the assignment results corresponding to each database to obtain target data, thereby improving the accuracy of data mining.
Fig. 3 and 4 above describe the database-based data processing apparatus in the embodiment of the present invention in detail from the perspective of the modular functional entity, and the database-based data processing apparatus in the embodiment of the present invention is described in detail from the perspective of hardware processing.
Fig. 5 is a schematic structural diagram of a database-based data processing apparatus 500 according to an embodiment of the present invention, which may include one or more processors (CPUs) 510 (e.g., one or more processors) and a memory 520, and one or more storage media 530 (e.g., one or more mass storage devices) for storing applications 533 or data 532, where the database-based data processing apparatus 500 may generate relatively large differences due to different configurations or performances. Memory 520 and storage media 530 may be, among other things, transient or persistent storage. The program stored on the storage medium 530 may include one or more modules (not shown), each of which may include a series of instruction operations for the database-based data processing apparatus 500. Still further, the processor 510 may be configured to communicate with the storage medium 530 to execute a series of instruction operations in the storage medium 530 on the database-based data processing apparatus 500.
The database-based data processing apparatus 500 may also include one or more power supplies 540, one or more wired or wireless network interfaces 550, one or more input-output interfaces 560, and/or one or more operating systems 531, such as Windows Server, Mac OS X, Unix, Linux, FreeBSD, and the like. Those skilled in the art will appreciate that the database-based data processing apparatus configuration shown in fig. 5 does not constitute a limitation of database-based data processing apparatus and may include more or fewer components than those shown, or some components may be combined, or a different arrangement of components.
The present invention also provides a database-based data processing apparatus, which includes a memory and a processor, wherein the memory stores computer-readable instructions, and the computer-readable instructions, when executed by the processor, cause the processor to execute the steps of the database-based data processing method in the above embodiments.
The present invention also provides a computer readable storage medium, which may be a non-volatile computer readable storage medium, and which may also be a volatile computer readable storage medium, having stored therein instructions, which, when executed on a computer, cause the computer to perform the steps of the database-based data processing method.
Further, the computer-readable storage medium may mainly include a storage program area and a storage data area, wherein the storage program area may store an operating system, an application program required for at least one function, and the like; the storage data area may store data created according to the use of the blockchain node, and the like.
The block chain is a novel application mode of computer technologies such as distributed data storage, point-to-point transmission, a consensus mechanism, an encryption algorithm and the like. A block chain (Blockchain), which is essentially a decentralized database, is a series of data blocks associated by using a cryptographic method, and each data block contains information of a batch of network transactions, so as to verify the validity (anti-counterfeiting) of the information and generate a next block. The blockchain may include a blockchain underlying platform, a platform product service layer, an application service layer, and the like.
It is clear to those skilled in the art that, for convenience and brevity of description, the specific working processes of the above-described systems, apparatuses and units may refer to the corresponding processes in the foregoing method embodiments, and are not described herein again.
The integrated unit, if implemented in the form of a software functional unit and sold or used as a stand-alone product, may be stored in a computer readable storage medium. Based on such understanding, the technical solution of the present invention may be embodied in the form of a software product, which is stored in a storage medium and includes instructions for causing a computer device (which may be a personal computer, a server, or a network device) to execute all or part of the steps of the method according to the embodiments of the present invention. And the aforementioned storage medium includes: various media capable of storing program codes, such as a usb disk, a removable hard disk, a read-only memory (ROM), a Random Access Memory (RAM), a magnetic disk, or an optical disk.
The above-mentioned embodiments are only used for illustrating the technical solutions of the present invention, and not for limiting the same; although the present invention has been described in detail with reference to the foregoing embodiments, it will be understood by those of ordinary skill in the art that: the technical solutions described in the foregoing embodiments may still be modified, or some technical features may be equivalently replaced; and such modifications or substitutions do not depart from the spirit and scope of the corresponding technical solutions of the embodiments of the present invention.

Claims (10)

1. A database-based data processing method, characterized in that the database-based data processing method comprises:
receiving a target processing request to be processed, and analyzing the target processing request to obtain an initial interface corresponding to the target processing request;
reading and analyzing the initial interface to obtain a target node, and performing node search on the target node to obtain a target sub-node corresponding to the target node;
acquiring the number of databases corresponding to a plurality of preset databases, and performing sub-node grouping on the target sub-nodes according to the number of the databases to obtain a plurality of first sub-node sets;
carrying out node sequence arrangement on the plurality of first sub-node sets to obtain sub-node arrangement results, and respectively allocating the plurality of first sub-node sets to the plurality of databases according to the sub-node arrangement results to obtain second sub-node sets corresponding to each database;
performing database operation on the second sub-node set corresponding to each database to obtain a target operation result corresponding to each database, and assigning values to the target operation result corresponding to each database to obtain an assignment result corresponding to each database;
and forwarding the assignment result corresponding to each database, performing data fusion on the assignment result corresponding to each database to obtain target data, and transcoding and outputting the target data.
2. The database-based data processing method according to claim 1, wherein the reading and analyzing the initial interface to obtain a target node, and performing node search on the target node to obtain a target child node corresponding to the target node, comprises:
calling a preset data reading function to read the initial interface, and performing data analysis on the initial interface to obtain a target node;
and calling a preset node search algorithm, and performing node search extraction on the target node to obtain a target child node corresponding to the target node.
3. The database-based data processing method according to claim 1, wherein the obtaining of the preset number of databases corresponding to the plurality of databases and the performing of the sub-node grouping on the target sub-node according to the number of databases to obtain a plurality of first sub-node sets comprises:
determining a plurality of databases based on preset database parameters, and calculating the number of the databases corresponding to the plurality of databases;
determining the sub-node grouping number of the target sub-node according to the database number;
and carrying out sub-node grouping on the target sub-nodes according to the sub-node grouping number to obtain a plurality of first sub-node sets.
4. The database-based data processing method according to claim 1, wherein the performing node sequence arrangement on the plurality of first child node sets to obtain child node arrangement results, and respectively allocating the plurality of first child node sets to the plurality of databases according to the child node arrangement results to obtain a second child node set corresponding to each database includes:
respectively acquiring variable data of each first sub-node set, and determining the target weight of each first sub-node set according to the variable data;
carrying out node sequence arrangement on the plurality of first child node sets according to the target weight of each first child node set to obtain child node arrangement results;
and respectively distributing the plurality of first sub-node sets to the plurality of databases according to a preset distribution strategy and the sub-node arrangement result to obtain a second sub-node set corresponding to each database.
5. The database-based data processing method of claim 1, wherein the performing database operations on the second child node set corresponding to each database to obtain a target operation result corresponding to each database, and assigning values to the target operation result corresponding to each database to obtain an assignment result corresponding to each database comprises:
performing database operation on the second child node set corresponding to each database to obtain a target operation result corresponding to each database;
and calling a preset assignment function, and assigning the target operation result corresponding to each database to obtain the assignment result corresponding to each database.
6. The database-based data processing method of claim 1, wherein the forwarding of the assignment result corresponding to each database, the data fusion of the assignment results corresponding to each database to obtain target data, and the transcoding and outputting of the target data comprise:
forwarding the assignment result corresponding to each database, and performing data fusion on the assignment result corresponding to each database to obtain target data;
judging whether the target data meets a preset output condition or not;
if yes, transcoding and outputting the target data;
and if not, repeatedly calculating the assignment result corresponding to each database.
7. The database-based data processing method according to any of claims 1-6, wherein the database-based data processing method further comprises:
obtaining a plurality of training data and a training model, the plurality of training data comprising: a plurality of target clause sets;
performing model training on the training model based on the plurality of training data to obtain a trained target model;
and carrying out sub-node grouping on the target sub-nodes based on the target model to obtain a plurality of first sub-node sets.
8. A database-based data processing apparatus, characterized in that the database-based data processing apparatus comprises:
the receiving module is used for receiving a target processing request to be processed and analyzing the target processing request to obtain an initial interface corresponding to the target processing request;
the analysis module is used for reading and analyzing the initial interface to obtain a target node, and searching the target node to obtain a target sub-node corresponding to the target node;
the grouping module is used for acquiring the number of databases corresponding to a plurality of preset databases, and performing sub-node grouping on the target sub-nodes according to the number of the databases to obtain a plurality of first sub-node sets;
the distribution module is used for carrying out node sequence arrangement on the plurality of first sub-node sets to obtain sub-node arrangement results, and distributing the plurality of first sub-node sets to the plurality of databases respectively according to the sub-node arrangement results to obtain second sub-node sets corresponding to each database;
the assignment module is used for performing database operation on the second sub-node set corresponding to each database to obtain a target operation result corresponding to each database, and assigning values to the target operation result corresponding to each database to obtain an assignment result corresponding to each database;
and the fusion module is used for forwarding the assignment result corresponding to each database, performing data fusion on the assignment result corresponding to each database to obtain target data, and transcoding and outputting the target data.
9. A database-based data processing apparatus, characterized in that the database-based data processing apparatus comprises: a memory and at least one processor, the memory having instructions stored therein;
the at least one processor invokes the instructions in the memory to cause the database-based data processing apparatus to perform the database-based data processing method of any of claims 1-7.
10. A computer-readable storage medium having instructions stored thereon, wherein the instructions, when executed by a processor, implement the database-based data processing method of any one of claims 1-7.
CN202211050745.XA 2022-08-30 2022-08-30 Database-based data processing method, device, equipment and storage medium Active CN115130043B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202211050745.XA CN115130043B (en) 2022-08-30 2022-08-30 Database-based data processing method, device, equipment and storage medium

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202211050745.XA CN115130043B (en) 2022-08-30 2022-08-30 Database-based data processing method, device, equipment and storage medium

Publications (2)

Publication Number Publication Date
CN115130043A true CN115130043A (en) 2022-09-30
CN115130043B CN115130043B (en) 2022-11-25

Family

ID=83387894

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202211050745.XA Active CN115130043B (en) 2022-08-30 2022-08-30 Database-based data processing method, device, equipment and storage medium

Country Status (1)

Country Link
CN (1) CN115130043B (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN115329151A (en) * 2022-10-17 2022-11-11 北方健康医疗大数据科技有限公司 Graph database optimization method and device, electronic equipment and storage medium

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20170046391A1 (en) * 2015-08-14 2017-02-16 California Institute Of Technology Algebraic query language (aql) database management system
US20200026705A1 (en) * 2018-07-20 2020-01-23 Dan Benanav Automatic object inference in a database system
US20200242150A1 (en) * 2018-10-09 2020-07-30 Elnaserledinellah Mahmoud Elsayed Abdelwahab Method for creating an efficient, logically complete, ontological level in the extended relational database concept
CN113010547A (en) * 2021-05-06 2021-06-22 电子科技大学 Database query optimization method and system based on graph neural network

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20170046391A1 (en) * 2015-08-14 2017-02-16 California Institute Of Technology Algebraic query language (aql) database management system
US20200026705A1 (en) * 2018-07-20 2020-01-23 Dan Benanav Automatic object inference in a database system
US20200242150A1 (en) * 2018-10-09 2020-07-30 Elnaserledinellah Mahmoud Elsayed Abdelwahab Method for creating an efficient, logically complete, ontological level in the extended relational database concept
CN113010547A (en) * 2021-05-06 2021-06-22 电子科技大学 Database query optimization method and system based on graph neural network

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN115329151A (en) * 2022-10-17 2022-11-11 北方健康医疗大数据科技有限公司 Graph database optimization method and device, electronic equipment and storage medium

Also Published As

Publication number Publication date
CN115130043B (en) 2022-11-25

Similar Documents

Publication Publication Date Title
CN110609759B (en) Fault root cause analysis method and device
US7809758B2 (en) Database and method of generating same
CN110309196A (en) Block chain data storage and query method, apparatus, equipment and storage medium
US8271523B2 (en) Coordination server, data allocating method, and computer program product
CN110866029B (en) sql statement construction method, device, server and readable storage medium
CN101751333A (en) Method, computer program and computer system for assisting in analyzing program
CN105550225A (en) Index construction method and query method and apparatus
CN113420537B (en) Method, device, equipment and storage medium for processing electronic form data
CN115130043B (en) Database-based data processing method, device, equipment and storage medium
CN111209341B (en) Data storage method, device, equipment and medium of block chain
CN111045670B (en) Method and device for identifying multiplexing relationship between binary code and source code
US20100005203A1 (en) Method of Merging and Incremantal Construction of Minimal Finite State Machines
CN114723014A (en) Tensor segmentation mode determination method and device, computer equipment and medium
CN110888876A (en) Method and device for generating database script, storage medium and computer equipment
CN112529543A (en) Method, device and equipment for verifying mutual exclusion relationship of workflow and storage medium
EP3438818A1 (en) Method and device for compiling computer language
CN115147020B (en) Decoration data processing method, device, equipment and storage medium
CN111914537A (en) Data verification method, device and equipment and readable storage medium
CN107122849B (en) Spark R-based product detection total completion time minimization method
CN114328525A (en) Data processing method and device
CN104636474A (en) Method and equipment for establishment of audio fingerprint database and method and equipment for retrieval of audio fingerprints
CN114995719A (en) List rendering method, device, equipment and storage medium
US10003492B2 (en) Systems and methods for managing data related to network elements from multiple sources
JP2016115287A (en) Test case generation program, test case generation method and test case generator
CN113408562A (en) Sample enhancement method, device and system for model training

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant