CN110716950B - Caliber system establishment method, caliber system establishment device, caliber system establishment equipment and computer storage medium - Google Patents

Caliber system establishment method, caliber system establishment device, caliber system establishment equipment and computer storage medium Download PDF

Info

Publication number
CN110716950B
CN110716950B CN201910895112.0A CN201910895112A CN110716950B CN 110716950 B CN110716950 B CN 110716950B CN 201910895112 A CN201910895112 A CN 201910895112A CN 110716950 B CN110716950 B CN 110716950B
Authority
CN
China
Prior art keywords
keyword
source data
dimension
index
data table
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201910895112.0A
Other languages
Chinese (zh)
Other versions
CN110716950A (en
Inventor
黄沙沙
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beijing Shenzhou Digital Cloud Information Technology Co ltd
Shenzhou Kuntai Xiamen Information Technology Co ltd
Original Assignee
Shenzhou Kuntai Xiamen Information Technology Co ltd
Beijing Shenzhou Digital Cloud Information Technology Co ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Shenzhou Kuntai Xiamen Information Technology Co ltd, Beijing Shenzhou Digital Cloud Information Technology Co ltd filed Critical Shenzhou Kuntai Xiamen Information Technology Co ltd
Priority to CN201910895112.0A priority Critical patent/CN110716950B/en
Publication of CN110716950A publication Critical patent/CN110716950A/en
Application granted granted Critical
Publication of CN110716950B publication Critical patent/CN110716950B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/24Querying
    • G06F16/245Query processing
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/24Querying
    • G06F16/242Query formulation
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/27Replication, distribution or synchronisation of data between databases or within a distributed database system; Distributed database system architectures therefor

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Databases & Information Systems (AREA)
  • Physics & Mathematics (AREA)
  • Data Mining & Analysis (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Computational Linguistics (AREA)
  • Mathematical Physics (AREA)
  • Computing Systems (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

The invention provides a caliber system establishment method, a caliber system establishment device, caliber system establishment equipment and a computer storage medium, wherein the caliber system establishment method specifically comprises the following steps: the embodiment of the invention establishes a keyword table; selecting at least one keyword as a dimension keyword according to a preset rule based on the keywords of the keyword table; selecting at least one keyword as an index keyword according to a preset rule based on the keywords of the keyword table; and automatically generating a query instruction corresponding to the dimension-index combination according to the dimension key words and the index key words, and establishing a caliber system by utilizing the dimension-index combination and the query instruction corresponding to the dimension-index combination. By the method, dimension-indexes are not required to be set in a considered mode, so that the caliber of the query instruction corresponding to the dimension-index combination can be automatically generated, a caliber system is built, the workload of a user is reduced, and the processing efficiency of big data is improved.

Description

Caliber system establishment method, caliber system establishment device, caliber system establishment equipment and computer storage medium
Technical Field
The present invention relates to the field of big data, and in particular, to a method, an apparatus, a device, and a computer storage medium for establishing a caliber system.
Background
With the rapid development of the internet, the large data age is coming, the exploration phase and the market starting phase are walked, and the large data now enter a key turning period from concept to practical application. Big data tend to mature in various aspects of acceptance, technology, application and the like, and start to walk into the rapid development stage of industry. The huge application value of big data drives the huge development of big data industry.
In the present day when the volume of a data warehouse is larger and larger in the big data era, when data information stored in the data warehouse is queried or modified, a modeling method is generally utilized to query or modify data confidence stored in the data warehouse in the prior art, but at present, corresponding scripts are required to be rewritten for each query or modification operation to realize the query or modification operation, and the workload is huge due to the mode, so that the development of the big data industry is not facilitated.
Disclosure of Invention
The embodiment of the invention provides a method, a device, equipment and a computer storage medium for establishing a caliber system, which mainly aims at solving the problems that in the prior art, corresponding scripts are required to be rewritten for each query or modification operation, so that the workload is huge and the development of big data industry is not facilitated.
In a first aspect, an embodiment of the present invention provides a method for establishing a caliber system, where the method specifically includes:
Establishing a keyword table;
Selecting at least one keyword as a dimension keyword according to a preset rule based on the keywords of the keyword table;
Selecting at least one keyword as an index keyword according to a preset rule based on the keywords of the keyword table;
And automatically generating a query instruction corresponding to the dimension-index combination according to the dimension key words and the index key words, and establishing a caliber system by utilizing the dimension-index combination and the query instruction corresponding to the dimension-index combination.
Preferably, the establishing a keyword table specifically includes:
Checking the data warehouse at intervals of preset time, and counting a source data table generated in the preset time, wherein the source data table comprises description fields;
And replacing the description information in the description field by using a keyword, and establishing a keyword table.
Preferably, the keyword based on the keyword table selects at least one keyword as a dimension keyword according to a preset rule, and specifically includes:
acquiring a first source data table in the source data tables, and determining at least one primary key in the first source data table;
Calculating the times of taking at least one primary key in each first source data table as an external key of a second source data table;
selecting a primary key, which is used as a secondary source data table, from at least one primary key in the first source data table, wherein the primary key, which is used as a primary key of a secondary source data table, has the times greater than or equal to a preset times threshold value, as a dimension key of the first source data table, and the first source data table and the second source data table are any one source data table in the source data tables;
Or alternatively
Calculating the proportion of at least one main key in each first source data table as an external key of a second source data table;
and selecting a primary key with the proportion of the external key being larger than or equal to a preset proportion threshold value from at least one primary key in the first source data table as a dimension key of the first source data table, wherein the first source data table and the second source data table are any one source data table in the source data tables.
Preferably, the keyword based on the keyword table selects at least one keyword as an index keyword according to a preset rule, and specifically includes:
Acquiring a second source data table in the source data tables, and determining at least one keyword except for an external key in the second source data table, wherein the second source data table is any source data table except for the first source data table in the source data tables;
calculating the use quantity of each keyword except the external key in the second source data table;
Selecting at least one keyword from at least one keyword except the external key in the second source data table as an index keyword, wherein the index keyword is the keyword with the use quantity being greater than or equal to a preset use quantity threshold value;
Or alternatively
Calculating the utilization rate of each keyword except the external key in the second source data table;
Selecting at least one keyword from at least one keyword except the external key in the second source data table as an index keyword, wherein the index keyword is a keyword with the utilization rate being greater than or equal to a preset utilization rate threshold value;
Or alternatively
Calculating the use frequency of each keyword except the external key in the second source data table;
and selecting at least one keyword from at least one keyword except the external key in the second source data table as an index keyword, wherein the index keyword is the keyword with the use quantity larger than or equal to a preset use frequency threshold value.
Preferably, on the basis of any one of the foregoing embodiments, the automatically generating, according to the dimension keyword and the index keyword, a query instruction corresponding to a dimension-index combination, and building a caliber system by using the dimension-index combination and the query instruction corresponding to the dimension-index combination, specifically includes:
Carrying out dimension splitting according to different theme categories according to the dimension keywords, and storing the split dimension keywords into a plurality of dimension tables, wherein the plurality of dimension tables correspond to a plurality of theme categories;
Carrying out index splitting according to the index keywords and different dimensions, and storing the split index keywords into a plurality of index tables, wherein the index tables correspond to the dimensions;
and automatically generating a dimension-index combination table according to the external keys of the index tables and the dimension tables, wherein the dimension-index combination table comprises the dimension-index combination and query instructions corresponding to the dimension-index combination, and establishing a caliber system by utilizing the dimension-index combination and the query instructions corresponding to the dimension-index combination.
In a second aspect, an embodiment of the present invention further provides an apparatus for establishing a caliber system, where the apparatus includes:
The first establishing module is used for establishing a keyword table;
The dimension selection module is used for selecting at least one keyword as a dimension keyword according to a preset rule based on the keywords of the keyword table;
The index selection module is used for selecting at least one keyword as an index keyword according to a preset rule based on the keywords of the keyword table;
And the second establishing module is used for automatically generating a query instruction corresponding to the dimension-index combination according to the dimension keywords and the index keywords, and establishing a caliber system by utilizing the dimension-index combination and the query instruction corresponding to the dimension-index combination.
In a third aspect, the present invention provides an aperture system establishment apparatus, the apparatus comprising a memory, a processor and a computer program stored in the memory and executable on the processor, wherein the processor implements part or all of the steps of any one of the aperture system establishment methods described in the above embodiments when executing the computer program.
In a fourth aspect, the present invention also provides a computer storage medium, where the computer storage medium may store a program, where the program executes to implement part or all of the steps of any one of the caliber system establishing methods described in the above embodiments.
The beneficial effects are that: the embodiment of the invention establishes a keyword table; selecting at least one keyword as a dimension keyword according to a preset rule based on the keywords of the keyword table; selecting at least one keyword as an index keyword according to a preset rule based on the keywords of the keyword table; and automatically generating a query instruction corresponding to the dimension-index combination according to the dimension key words and the index key words, and establishing a caliber system by utilizing the dimension-index combination and the query instruction corresponding to the dimension-index combination. By the method, the dimension-index is not required to be set manually, so that the caliber of the query instruction corresponding to the dimension-index combination can be automatically generated, a caliber system is built, the workload of a user is reduced, and the processing efficiency of big data is improved.
These and other aspects of the invention will be more readily apparent from the following description of the embodiments.
Drawings
In order to more clearly illustrate the embodiments of the invention or the technical solutions in the prior art, the drawings that are required in the embodiments or the description of the prior art will be briefly described, it being obvious that the drawings in the following description are only some embodiments of the invention, and that other drawings may be obtained according to these drawings without inventive effort for a person skilled in the art.
FIG. 1 is a schematic flow chart of a method for establishing a caliber system according to an embodiment of the present invention;
FIG. 2 is a flow chart of another method for establishing a caliber system according to a second embodiment of the present invention;
FIG. 3 is a flow chart of another method for establishing a caliber system according to the third embodiment of the invention;
Fig. 4 is a schematic structural diagram of an apparatus for establishing a caliber system according to an embodiment of the present invention;
Fig. 5 is a schematic structural diagram of an apparatus for establishing a caliber system according to an embodiment of the present invention.
Detailed Description
In order that those skilled in the art will better understand the present invention, a technical solution in the embodiments of the present invention will be clearly and completely described below with reference to the accompanying drawings in which it is apparent that the described embodiments are only some embodiments of the present invention, not all embodiments. All other embodiments, which can be made by those skilled in the art based on the embodiments of the present invention without making any inventive effort, shall fall within the scope of the present invention.
The following will describe in detail.
The terms "first," "second," "third," and "fourth" and the like in the description and in the claims and drawings are used for distinguishing between different objects and not necessarily for describing a particular sequential or chronological order. Furthermore, the terms "comprise" and "have," as well as any variations thereof, are intended to cover a non-exclusive inclusion. For example, a process, method, system, article, or apparatus that comprises a list of steps or elements is not limited to only those listed steps or elements but may include other steps or elements not listed or inherent to such process, method, article, or apparatus. The term "and/or" is used to describe an association relationship of associated objects, meaning that there may be three relationships, e.g., a and/or B, which may represent: a exists alone, A and B exist together, and B exists alone. The character "/" generally indicates that the context-dependent object is an "or" relationship.
Reference herein to "an embodiment" means that a particular feature, structure, or characteristic described in connection with the embodiment may be included in at least one embodiment of the invention. The appearances of such phrases in various places in the specification are not necessarily all referring to the same embodiment, nor are separate or alternative embodiments mutually exclusive of other embodiments. Those of skill in the art will explicitly and implicitly appreciate that the embodiments described herein may be combined with other embodiments.
Embodiments of the present application are described below with reference to the accompanying drawings.
Example 1
According to one aspect of the present invention, there is provided a caliber system establishing method, which specifically includes the following steps, as shown in fig. 1:
S1, establishing a keyword table;
It should be noted that, the execution body of the embodiment of the present invention is a data management server, which is formed by one or more computers running in a local area network and data management system software, and the data management server provides management services for user data.
Specifically, the source data table in the data warehouse during the verification of the data daily can be counted, wherein the source data table comprises table names, description fields and description information corresponding to the description fields, keywords are extracted from the description information in each row of source data in the counted source data table, the keywords obtained through extraction are generated into a keyword table, and the keywords in the keyword table are in one-to-one correspondence with the description information in each row of source data.
S2, selecting at least one keyword as a dimension keyword according to a preset rule based on the keywords of the keyword table;
and selecting at least one keyword from each generated keyword table according to a preset rule as a dimension keyword of the keyword table, and setting the dimension keyword as the dimension of the caliber system.
S3, selecting at least one keyword as an index keyword according to a preset rule based on the keywords of the keyword table;
and selecting at least one keyword from each generated keyword table according to a preset rule as an index keyword of the keyword table, and setting the dimension keyword as an index of the caliber system. The index is the attribute value of the dimension, for example, the dimension is a city, the corresponding index is Shenzhen, and Shenzhen is the attribute value of the dimension of the city.
S4, automatically generating a query instruction corresponding to the dimension-index combination according to the dimension keywords and the index keywords, and establishing a caliber system by utilizing the dimension-index combination and the query instruction corresponding to the dimension-index combination.
And automatically generating a dimension-index combination, such as city-Shanghai, by using the selected dimension keywords and index keywords, if the source data of the Shanghai is required to be queried from a data warehouse, automatically generating and obtaining a corresponding data query instruction according to the generated flow web page access daily table-city-Shanghai, wherein the flow web page access daily table is a table name required in the data query instruction, determining a field name as a city by using the flow web page access daily table and the dimension, generating a query instruction corresponding to the determined table name, the field name and the dimension-index combination, and further, generating query instructions corresponding to all the dimension-index combinations by using a traversal principle. And establishing a caliber system by using the dimension-index combination and the corresponding query instruction. The caliber system comprises a table name, a table field, a dimension keyword, an index keyword and query instructions corresponding to all dimension-index combinations. And corresponding query instructions are automatically generated according to the dimension-index combination, and the corresponding query instructions are executed when the query is needed, so that the needed source data can be queried from the data warehouse, the workload of a user is reduced, and the data processing efficiency is improved.
The beneficial effects are that: the embodiment of the invention establishes a keyword table; selecting at least one keyword as a dimension keyword according to a preset rule based on the keywords of the keyword table; selecting at least one keyword as an index keyword according to a preset rule based on the keywords of the keyword table; and automatically generating a query instruction corresponding to the dimension-index combination according to the dimension key words and the index key words, and establishing a caliber system by utilizing the dimension-index combination and the query instruction corresponding to the dimension-index combination. By the method, the dimension-index is not required to be set manually, so that the caliber of the query instruction corresponding to the dimension-index combination can be automatically generated, a caliber system is built, the workload of a user is reduced, and the processing efficiency of big data is improved.
Example two
The embodiment of the invention also provides another caliber system establishing method, and specifically, as shown in fig. 2, the keyword table can be established through the following steps:
S1, establishing a keyword table;
The specific implementation manner of step S1 has been described in detail in the foregoing embodiments, which is not repeated herein, and further, step S1 may be implemented to establish a keyword table by the following steps, which are specifically as follows:
s11, checking the data warehouse at intervals of preset time, and counting a source data table generated in the preset time, wherein the source data table comprises description fields;
At preset intervals, for example, checking the data warehouse every day, verifying whether the original data in the data warehouse and the final data after daily updating are balanced, so that a user can extract and clean data which has value for enterprises from the source data, clean the source data in the data warehouse, in which the original data and the final data after daily updating are unbalanced, and finally store the source data in the data warehouse. The data in the data warehouse will typically be retained for a long period of time for the enterprise to perform query operations for use in enterprise decisions. Thus, checking the data warehouse every day is the data base for the enterprise to perform the query operation.
S12, replacing the description information in the description field by using a keyword, and establishing a keyword table.
And counting a source data table generated in the preset time, wherein the source data table comprises at least one description field, the description field comprises corresponding description information, the corresponding description information in the description field is replaced by a keyword, and a keyword table is established.
The keywords in the keyword table uniquely correspond to the description information corresponding to each line of source data. As shown in table 1 below:
TABLE 1
Table 1 is a key table 1 established after replacing description information in the flow Web page access history table with a key according to the definition of the source data table.
Also for example as shown in table 2 below:
TABLE 2
Table 2 is a key table 2 established after replacing the description information in the access daily table for the traffic Web page with a key according to the definition of the source data table.
S2, selecting at least one keyword as a dimension keyword according to a preset rule based on the keywords of the keyword table;
preferably, on the basis of the above embodiment, at least one keyword is selected as a dimension keyword according to a preset rule based on the keywords in the keyword table in step S2, and specifically the following manner may be adopted:
Mode one
S21, acquiring a first source data table in the source data tables, and determining at least one primary key in the first source data table;
And randomly acquiring a first source data table from a source data table generated in the process of counting the data warehouse in the preset time, and determining at least one primary key in the first source data table, wherein the primary key is also called a primary key word (PRIMARY KEY), is a unique key in the table, and the value of the key is used for uniquely identifying a record in the table and can uniquely determine a certain line of data in the table or uniquely determine a certain entity.
S22, calculating the times of taking at least one main key in each first source data table as an external key;
And respectively calculating the times of each primary key as an external key by using at least one primary key determined in the first source data table. It should be noted that, the foreign key is also called a foreign key, and if the common key is a primary key in one relationship, the common key is called a foreign key of another relationship, and the foreign key represents a related relationship between two relationships. A table in which foreign keys of another relationship are primary keys is called a primary table, and a table having the foreign keys is called a secondary table of the primary table. That is, the first source data table is a master table, and the source data tables other than the first source data table are slave tables, and the number of times that each master key in the first source data is used as the external key of the source data table other than the first source data table is calculated.
S23, selecting a primary key, which is used as an external key, from at least one primary key in the first source data table, wherein the number of times of the primary key is larger than or equal to a preset number of times threshold value as a dimension key of the first source data table, and the first source data table is any one source data table in the source data tables;
The number of times that a certain primary key in the first source data table is used as an external key can be understood as the number of source data tables in a master-slave relationship between the first source data table and other source data tables in a data warehouse, and if the number of times that a certain primary key in the first source data table is used as an external key is greater than or equal to a preset number of times threshold, the primary key is used as a dimension key of the first source data table, and the dimension key is set as a dimension of the caliber system.
The key word with the number of times larger than or equal to the preset number of times threshold is selected as the dimension key word, so that the accuracy of dimension selection is improved, the dimension key word can be more suitable for the requirement of large data processing, and meanwhile, the dimension key word is selected by using the foreign key statistics, so that the processing efficiency of later-stage data is improved, and the consistency of the data is maintained.
Mode two
Step 1, acquiring a first source data table in the source data table, and determining at least one primary key in the first source data table;
And randomly acquiring a first source data table from a source data table generated in the process of counting the data warehouse in the preset time, and determining at least one primary key in the first source data table, wherein the primary key is also called a primary key word (PRIMARY KEY), is a unique key in the table, and the value of the key is used for uniquely identifying a record in the table and can uniquely determine a certain line of data in the table or uniquely determine a certain entity.
Step 2, calculating the proportion of at least one main key in each first source data table as an external key;
And respectively calculating the proportion of each primary key serving as an external key of at least one primary key determined in the first source data table. It should be noted that, the foreign key is also called a foreign key, and if the common key is a primary key in one relationship, the common key is called a foreign key of another relationship, and the foreign key represents a related relationship between two relationships. A table in which foreign keys of another relationship are primary keys is called a primary table, and a table having the foreign keys is called a secondary table of the primary table. That is, the first source data table is a master table, the source data tables other than the first source data table are assumed to be slave tables, and the number of times each master key in the first source data is used as the external key of the source data table other than the first source data table is calculated.
And 3, selecting a primary key with the proportion larger than or equal to a preset proportion threshold value as a dimension key of the first source data table from at least one primary key in the first source data table, wherein the first source data table is any one source data table in the source data tables.
The proportion of a certain primary key in the first source data table as an external key may be understood as a proportion of the number of source data tables in a master-slave relationship between the first source data table and other source data tables in a data warehouse to the number of source data tables except the first source data table, and if the proportion of the number of source data tables in a master-slave relationship between the first source data table and other source data tables in the data warehouse to the number of source data tables except the first source data table is greater than or equal to a preset proportion threshold, the primary key is used as a dimension key of the first source data table, and the dimension key is set as a dimension of the caliber system.
The key word with the outer key sub-proportion being larger than or equal to the preset proportion threshold is selected as the dimension key word, so that the accuracy of dimension selection is improved, the dimension key word can be more suitable for large data processing requirements, and meanwhile, the dimension key word is selected by using outer key statistics, so that the processing efficiency of later-stage data is improved, and the consistency of the data is maintained.
S3, selecting at least one keyword as an index keyword according to a preset rule based on the keywords of the keyword table;
preferably, on the basis of the above embodiment, at least one keyword is selected as an index keyword according to a preset rule based on the keywords in the keyword table in step S3, which may specifically be performed by:
Mode one
S31, acquiring a second source data table in the source data tables, and determining at least one keyword except an external key in the second source data table;
S32, calculating the use quantity of each keyword except the external key in the second source data table;
S33, selecting at least one keyword from at least one keyword except the external key in the second source data table as an index keyword, wherein the index keyword is the keyword with the use quantity being larger than or equal to a preset use quantity threshold value;
The second source data table is a data table of a subordinate relationship in a master-slave relationship of the first source data table, the foreign key is excluded from the second source data table, at least one keyword is determined, and the at least one keyword does not comprise a main keyword which is used as the foreign key and the number of times or the proportion of the foreign key is greater than or equal to a preset number of times threshold value or a preset proportion threshold value.
And respectively calculating the use quantity of at least one keyword except the external key in each second source data table, selecting the keywords with the use quantity larger than or equal to a preset use quantity threshold as index keywords, and setting the index keywords as indexes of the caliber system. By the method, unusual keywords or inconsistent keywords can be screened, so that data cleaning of a data warehouse is facilitated, data utilization efficiency is improved, and data consistency is maintained.
Mode two
Step 1, acquiring a second source data table in the source data table, and determining at least one keyword except an external key in the second source data table;
Step 2, calculating the utilization rate of each keyword except the external key in the second source data table;
Step 3, selecting at least one keyword from at least one keyword except the external key in the second source data table as an index keyword, wherein the index keyword is a keyword with the utilization rate being greater than or equal to a preset utilization rate threshold value;
The second source data table is a data table of a subordinate relationship in a master-slave relationship of the first source data table, the foreign key is excluded from the second source data table, at least one keyword is determined, and the at least one keyword does not comprise a main keyword which is used as the foreign key and the number of times or the proportion of the foreign key is greater than or equal to a preset number of times threshold value or a preset proportion threshold value.
And respectively calculating the utilization rate of at least one keyword except the external key in each second source data table, selecting the keyword with the utilization rate larger than or equal to a preset utilization rate threshold as an index keyword, and setting the index keyword as the index of the caliber system. By the method, keywords with low utilization rate or inconsistent keywords can be screened, data cleaning of a data warehouse is facilitated, data utilization efficiency is improved, and data consistency is maintained.
Mode three
Step 1, acquiring a second source data table in the source data table, and determining at least one keyword except an external key in the second source data table;
Step 2, calculating the use frequency of each keyword except the external key in the second source data table;
And 3, selecting at least one keyword from at least one keyword except the external key in the second source data table as an index keyword, wherein the index keyword is the keyword with the use number larger than or equal to a preset use frequency threshold value.
The second source data table is a data table of a subordinate relationship in a master-slave relationship of the first source data table, the foreign key is excluded from the second source data table, at least one keyword is determined, and the at least one keyword does not comprise a main keyword which is used as the foreign key and the number of times or the proportion of the foreign key is greater than or equal to a preset number of times threshold value or a preset proportion threshold value.
And calculating the use frequency of at least one keyword except the external key in each second source data table, selecting the keyword with the use frequency larger than or equal to a preset use frequency threshold as an index keyword, and setting the index keyword as an index of the caliber system. By the method, keywords with low utilization rate or inconsistent keywords can be screened, data cleaning of a data warehouse is facilitated, data utilization efficiency is improved, and data consistency is maintained.
S4, automatically generating a query instruction corresponding to the dimension-index combination according to the dimension keywords and the index keywords, and establishing a caliber system by utilizing the dimension-index combination and the query instruction corresponding to the dimension-index combination.
The beneficial effects are that: according to the embodiment of the invention, the keyword is utilized to replace the description information corresponding to the description field in the source data table by periodically carrying out the source data table of the data warehouse verification statistics, so that the keyword table is established, whether the source data in the data warehouse are balanced with the updated final data or not is verified periodically, the data consistency is improved, the user can extract and clear the data valuable to the enterprise from the source data, and finally the data are stored in the data warehouse, and the data basis of the enterprise for query operation is provided; automatically generating aperture combinations of dimension-indexes according to the keyword table, and establishing an aperture system, so that the generated aperture combinations of dimension-indexes can be adapted to the data change of the data warehouse, and the data change of the data warehouse can be reflected better; based on the keywords of the keyword table, the accuracy of dimension selection is improved by selecting keywords with the number of external keys or the proportion being greater than or equal to a preset number of times threshold or a preset proportion threshold as dimension keywords, so that the dimension keywords can be more suitable for large data processing requirements, and meanwhile, dimension keywords are selected by using external key statistics, so that the processing efficiency of later-stage data is improved and the consistency of the data is maintained; based on the keywords of the keyword table, keywords with low use rate or inconsistent keywords can be screened by selecting the keywords with the use number, the use rate and the use frequency reaching the threshold value from the keywords except the keys as index keywords, so that the data cleaning of a data warehouse is facilitated, the data utilization efficiency is improved, and the data consistency is maintained; and automatically generating a query instruction corresponding to the dimension-index combination according to the dimension key words and the index key words, and establishing a caliber system by utilizing the dimension-index combination and the query instruction corresponding to the dimension-index combination. By the method, the dimension-index is not required to be set manually, so that the caliber of the query instruction corresponding to the dimension-index combination can be automatically generated, a caliber system is built, the workload of a user is reduced, and the processing efficiency of big data is improved.
Example III
On the basis of the above embodiment, the present invention further provides another caliber system establishing method, which specifically includes the following steps, as shown in fig. 3:
S1, establishing a keyword table;
s2, selecting at least one keyword as a dimension keyword according to a preset rule based on the keywords of the keyword table;
s3, selecting at least one keyword as an index keyword according to a preset rule based on the keywords of the keyword table;
the specific implementation of steps S1-S3 is described in detail in the above embodiments, and will not be repeated here.
S4, automatically generating a query instruction corresponding to the dimension-index combination according to the dimension keywords and the index keywords, and establishing a caliber system by utilizing the dimension-index combination and the query instruction corresponding to the dimension-index combination.
Preferably, on the basis of the above embodiment, in step S4, a query instruction corresponding to a dimension-index combination is automatically generated according to the dimension keyword and the index keyword, and a caliber system is built by using the dimension-index combination and the query instruction corresponding to the dimension-index combination, which specifically includes the following steps:
s41, carrying out dimension splitting according to different theme categories according to the dimension keywords, and storing the split dimension keywords into a plurality of dimension tables, wherein the plurality of dimension tables correspond to a plurality of theme categories;
And carrying out dimension splitting on the dimension keywords according to different theme categories, wherein the theme categories can be preset, carrying out dimension splitting on all the dimension keywords, storing the split dimension keywords into a plurality of different dimension tables, and each dimension table corresponds to one theme category. For example, the dimension keyword login ID, the page ID and the browser cookie in table 1 and the theme categories of the dimension keyword login ID, the page ID, the browser cookie, the IP, the city and the channel identifier in table 2 are Web, the theme categories can be preset to be App, user behavior and the like, the dimension keyword is split according to the theme categories, and the management of massive data in a dimension and layering manner is realized, so that the collection, storage and rapid and accurate data query of massive data are realized, the data processing efficiency is improved, and timely and accurate data basis is provided for the decision of enterprises.
S42, carrying out index splitting according to the index keywords and different dimensions, and storing the split index keywords into a plurality of index tables, wherein the index tables correspond to the dimensions;
It should be noted that, in the same manner as in step S41, the index keywords are split in dimensions according to different topic categories, where the topic categories may be preset, all the index keywords are split in dimensions, the split index keywords are stored in a plurality of different index tables, and each index table corresponds to one topic category. For example, the first access date, the last access date, the time of insertion, the user insertion and the partition date of the index key words in table 1, and the topic categories of the index key words in table 2, such as the access times, the landing page times, the exit page times, the insertion time, the user insertion and the partition date, are Web, so that the index key words with the topic categories being Web can be seen to be the attribute values of the dimension key words in the step S41, the topic categories can be preset to be App, user behaviors and the like, the index key words can be split according to the topic categories, and the management of the massive data in a dimension and hierarchy manner can be realized, thereby realizing the collection, storage, rapid and accurate data query of the massive data, improving the data processing efficiency and providing timely and accurate data basis for the decision of enterprises.
S43, automatically generating a dimension-index combination table according to the plurality of index tables and the plurality of dimension tables, wherein the dimension-index combination table comprises the dimension-index combination and query instructions corresponding to the dimension-index combination, and establishing a caliber system by utilizing the dimension-index combination and the query instructions corresponding to the dimension-index combination.
Automatically generating a dimension-index combination table from the plurality of index tables and the plurality of dimension tables, for example as shown in table 3 below:
TABLE 3 Table 3
Table 3 is a dimension-index combination table, including subject categories, dimensions, and indexes, and is presented herein for illustrative purposes only and is not intended to limit the scope of the present invention.
According to the dimension-index combination in the table 3, a corresponding query instruction is automatically generated, for example, city-access times, namely, the query theme type is Web from a data warehouse, the website access times of the city on the day are designated, the query theme type is Web, the query instruction of the website access times of the city on the day is designated, in the query instruction, parameters about specific designated cities or access times can be variables, according to specific required parameters input by a user, corresponding source data meeting the user input parameters can be queried from the data warehouse through executing the query instruction after assignment, and the source data query efficiency is improved.
Each dimension-index combination in table 3 corresponds to a query instruction, and the dimension-index combination and the query instruction corresponding thereto are used to build a caliber system, for example, as shown in table 4:
TABLE 4 Table 4
If a user needs to inquire source data corresponding to a dimension-index combination, determining a corresponding inquiry instruction according to the dimension-index combination, inputting a specific inquiry parameter, automatically executing the assigned inquiry instruction, and finishing inquiry of the source data corresponding to the dimension-index combination from the data warehouse.
The beneficial effects are that: the embodiment of the invention establishes a keyword table; selecting at least one keyword as a dimension keyword according to a preset rule based on the keywords of the keyword table; selecting at least one keyword as an index keyword according to a preset rule based on the keywords of the keyword table; according to the dimension keywords, dimension splitting is carried out according to different theme categories, the split dimension keywords are stored into a plurality of dimension tables, the plurality of dimension tables correspond to a plurality of theme categories, the dimension keywords are split according to the theme categories, and the dimension hierarchical management of mass data is realized, so that the collection, storage and rapid and accurate data query of the mass data are realized, the data processing efficiency is improved, and a timely and accurate data basis is provided for the decision of enterprises; carrying out index splitting according to the index keywords and different dimensions, and storing the split index keywords into a plurality of index tables, wherein the index tables correspond to the dimensions; and automatically generating a dimension-index combination table according to the plurality of index tables and the plurality of dimension tables, wherein the dimension-index combination table comprises the dimension-index combination and query instructions corresponding to the dimension-index combination, and establishing a caliber system by utilizing the dimension-index combination and the query instructions corresponding to the dimension-index combination. By the method, the dimension-index is not required to be set manually, so that the caliber of the query instruction corresponding to the dimension-index combination can be automatically generated, and a caliber system is established.
Example IV
An embodiment of the present invention provides an apparatus 400 for establishing a caliber system, as shown in fig. 4, where the apparatus includes:
a first establishing module 410, configured to establish a keyword table;
The dimension selection module 420 is configured to select at least one keyword as a dimension keyword according to a preset rule based on the keywords of the keyword table;
The index selection module 430 is configured to select at least one keyword as an index keyword according to a preset rule based on the keywords in the keyword table;
a second establishing module 440, configured to automatically generate a query instruction corresponding to the dimension-index combination according to the dimension keyword and the index keyword, and establish a caliber system by using the dimension-index combination and the query instruction corresponding to the dimension-index combination
The embodiment of the present invention provides a device 400, which corresponds to the above embodiment, and the device 400 further includes a plurality of units, which are configured to implement corresponding functions according to corresponding steps of the above method. Since the steps of the method have been described in detail in the above embodiments, the details of the method in the apparatus 400 are not repeated.
Preferably, the first establishing module 410 specifically includes:
The statistics unit is used for checking the data warehouse at intervals of preset time and counting a source data table generated in the preset time, wherein the source data table comprises description fields;
And the first establishing unit is used for replacing the description information in the description field by using a keyword to establish a keyword table.
Preferably, the second establishing module 440 specifically includes:
The dimension splitting unit is used for carrying out dimension splitting according to the dimension keywords and different theme categories, and storing the split dimension keywords into a plurality of dimension tables, wherein the plurality of dimension tables correspond to the theme categories;
The index splitting unit is used for splitting indexes according to the index keywords according to different dimensions, and storing the split index keywords into a plurality of index tables, wherein the index tables correspond to the dimensions;
The second establishing unit is used for automatically generating a dimension-index combination table according to the external keys of the index tables and the dimension tables, wherein the dimension-index combination table comprises the dimension-index combination and query instructions corresponding to the dimension-index combination, and the dimension-index combination and the query instructions corresponding to the dimension-index combination are utilized to establish a caliber system.
The beneficial effects are that: in the embodiment of the invention, a keyword table is established through a first establishing module; the dimension selection module selects at least one keyword as a dimension keyword according to a preset rule based on the keywords of the keyword table; the index selection module selects at least one keyword as an index keyword according to a preset rule based on keywords of the keyword table, the dimension splitting unit of the second establishment module performs dimension splitting according to different theme categories according to the dimension keyword and stores the split dimension keyword into a plurality of dimension tables, the plurality of dimension tables correspond to the plurality of theme categories, and the index splitting unit of the second establishment module splits the dimension keyword according to the theme categories to realize the dimension hierarchical management of mass data, thereby realizing the collection, storage and rapid and accurate data query of mass data, improving the data processing efficiency and providing timely and accurate data basis for decision making of enterprises; carrying out index splitting according to the index keywords and different dimensions, and storing the split index keywords into a plurality of index tables, wherein the index tables correspond to the dimensions; and automatically generating a dimension-index combination table according to the plurality of index tables and the plurality of dimension tables, wherein the dimension-index combination table comprises the dimension-index combination and query instructions corresponding to the dimension-index combination, and establishing a caliber system by utilizing the dimension-index combination and the query instructions corresponding to the dimension-index combination. By the method, the query efficiency of the source data of the data warehouse can be effectively improved, the workload of a user is reduced, and the processing efficiency of big data is improved.
Example five
The embodiment of the present invention further provides a caliber system establishing device 500, where the device includes a memory 510, a processor 520, and a computer program 530 stored in the memory 510 and capable of running on the processor 520, and the steps of any caliber system establishing method described in the foregoing embodiment are implemented when the processor 520 executes the computer program 530.
The embodiment of the invention also provides a computer storage medium, wherein the computer storage medium can store a program, and the program can be executed to include part or all of the steps of any one of the caliber system establishing methods described in the method embodiment.
It should be noted that, for simplicity of description, the foregoing method embodiments are all described as a series of acts, but it should be understood by those skilled in the art that the present invention is not limited by the order of acts described, as some steps may be performed in other orders or concurrently in accordance with the present invention. Further, those skilled in the art will also appreciate that the embodiments described in the specification are all preferred embodiments, and that the acts and modules referred to are not necessarily required for the present invention.
In the foregoing embodiments, the descriptions of the embodiments are emphasized, and for parts of one embodiment that are not described in detail, reference may be made to related descriptions of other embodiments.
In the several embodiments provided by the present application, it should be understood that the disclosed apparatus may be implemented in other manners. For example, the apparatus embodiments described above are merely illustrative, such as the division of the units, merely a logical function division, and there may be additional manners of dividing the actual implementation, such as multiple units or components may be combined or integrated into another system, or some features may be omitted, or not performed. Alternatively, the coupling or direct coupling or communication connection shown or discussed with each other may be an indirect coupling or communication connection via some interfaces, devices or units, or may be in electrical or other forms.
The units described as separate units may or may not be physically separate, and units shown as units may or may not be physical units, may be located in one place, or may be distributed on a plurality of network units. Some or all of the units may be selected according to actual needs to achieve the purpose of the solution of this embodiment.
In addition, each functional unit in the embodiments of the present invention may be integrated in one processing unit, or each unit may exist alone physically, or two or more units may be integrated in one unit. The integrated units may be implemented in hardware or in software functional units.
The integrated units, if implemented in the form of software functional units and sold or used as stand-alone products, may be stored in a computer readable memory. Based on this understanding, the technical solution of the present invention may be embodied essentially or partly in the form of a software product, or all or part of the technical solution, which is stored in a memory, and includes several instructions for causing a computer device (which may be a personal computer, a server, a network device, or the like) to perform all or part of the steps of the method according to the embodiments of the present invention. And the aforementioned memory includes: a usb disk, a Read-Only Memory (ROM), a random access Memory (RAM, random Access Memory), a removable hard disk, a magnetic disk, or an optical disk, or other various media capable of storing program codes.
Those of ordinary skill in the art will appreciate that all or a portion of the steps in the various methods of the above embodiments may be implemented by a program that instructs associated hardware, and the program may be stored in a computer readable memory, which may include: flash disk, read-Only Memory (ROM), random access Memory (Random Access Memory, RAM), magnetic disk or optical disk.
The foregoing has outlined rather broadly the more detailed description of embodiments of the invention, wherein the principles and embodiments of the invention are explained in detail using specific examples, the above examples being provided solely to facilitate the understanding of the method and core concepts of the invention; meanwhile, as those skilled in the art will have variations in the specific embodiments and application scope in accordance with the ideas of the present invention, the present description should not be construed as limiting the present invention in view of the above.

Claims (6)

1. A method of establishing a caliber system, the method comprising:
Establishing a keyword table;
Selecting at least one keyword as a dimension keyword according to a preset rule based on the keywords of the keyword table;
Selecting at least one keyword as an index keyword according to a preset rule based on the keywords of the keyword table;
Carrying out dimension splitting according to different theme categories according to the dimension keywords, and storing the split dimension keywords into a plurality of dimension tables, wherein the plurality of dimension tables correspond to a plurality of theme categories;
Carrying out index splitting according to the index keywords and different dimensions, and storing the split index keywords into a plurality of index tables, wherein the index tables correspond to the dimensions;
automatically generating a dimension-index combination table according to the external keys of the index tables and the dimension tables, wherein the dimension-index combination table comprises the dimension-index combination and query instructions corresponding to the dimension-index combination, and establishing a caliber system by utilizing the dimension-index combination and the query instructions corresponding to the dimension-index combination;
the keyword based on the keyword table selects at least one keyword as a dimension keyword according to a preset rule, and specifically includes:
acquiring a first source data table in a source data table, and determining at least one primary key in the first source data table;
calculating the times of taking at least one main key in each first source data table as an external key;
selecting a primary key, which is used as an external key, from at least one primary key in the first source data table, wherein the number of times of the primary key is larger than or equal to a preset number of times threshold value as a dimension key of the first source data table, and the first source data table is any one source data table in the source data tables;
Or alternatively
Calculating the proportion of at least one main key in each first source data table as an external key;
Selecting a primary key with the proportion being larger than or equal to a preset proportion threshold value from at least one primary key in the first source data table as a dimension key of the first source data table, wherein the first source data table is any one source data table in the source data tables;
the keyword based on the keyword table selects at least one keyword as an index keyword according to a preset rule, and specifically includes:
Acquiring a second source data table in the source data tables, and determining at least one keyword except for an external key in the second source data table, wherein the second source data table is any source data table except for the first source data table in the source data tables;
calculating the use quantity of each keyword except the external key in the second source data table;
Selecting at least one keyword from at least one keyword except the external key in the second source data table as an index keyword, wherein the index keyword is the keyword with the use quantity being greater than or equal to a preset use quantity threshold value;
Or alternatively
Calculating the utilization rate of each keyword except the external key in the second source data table;
Selecting at least one keyword from at least one keyword except the external key in the second source data table as an index keyword, wherein the index keyword is a keyword with the utilization rate being greater than or equal to a preset utilization rate threshold value;
Or alternatively
Calculating the use frequency of each keyword except the external key in the second source data table;
and selecting at least one keyword from at least one keyword except the external key in the second source data table as an index keyword, wherein the index keyword is the keyword with the use quantity larger than or equal to a preset use frequency threshold value.
2. The caliber system establishing method according to claim 1, wherein the establishing keyword table specifically comprises:
Checking the data warehouse at intervals of preset time, and counting a source data table generated in the preset time, wherein the source data table comprises description fields;
And replacing the description information in the description field by using a keyword, and establishing a keyword table.
3. An aperture system establishing device, characterized in that the device specifically comprises:
The first establishing module is used for establishing a keyword table;
The dimension selection module is used for selecting at least one keyword as a dimension keyword according to a preset rule based on the keywords of the keyword table;
The index selection module is used for selecting at least one keyword as an index keyword according to a preset rule based on the keywords of the keyword table; the keyword based on the keyword table selects at least one keyword as a dimension keyword according to a preset rule, and specifically includes: acquiring a first source data table in a source data table, and determining at least one primary key in the first source data table; calculating the times of taking at least one main key in each first source data table as an external key; selecting a primary key, which is used as an external key, from at least one primary key in the first source data table, wherein the number of times of the primary key is larger than or equal to a preset number of times threshold value as a dimension key of the first source data table, and the first source data table is any one source data table in the source data tables; or calculating the proportion of at least one main key in each first source data table as an external key; selecting a primary key with the proportion being larger than or equal to a preset proportion threshold value from at least one primary key in the first source data table as a dimension key of the first source data table, wherein the first source data table is any one source data table in the source data tables; the keyword based on the keyword table selects at least one keyword as an index keyword according to a preset rule, and specifically includes: acquiring a second source data table in the source data tables, and determining at least one keyword except for an external key in the second source data table, wherein the second source data table is any source data table except for the first source data table in the source data tables; calculating the use quantity of each keyword except the external key in the second source data table; selecting at least one keyword from at least one keyword except the external key in the second source data table as an index keyword, wherein the index keyword is the keyword with the use quantity being greater than or equal to a preset use quantity threshold value; or calculating the utilization rate of each keyword except the external key in the second source data table; selecting at least one keyword from at least one keyword except the external key in the second source data table as an index keyword, wherein the index keyword is a keyword with the utilization rate being greater than or equal to a preset utilization rate threshold value; or calculating the use frequency of each keyword except the external key in the second source data table; selecting at least one keyword from at least one keyword except the external key in the second source data table as an index keyword, wherein the index keyword is the keyword with the use quantity larger than or equal to a preset use frequency threshold value;
The second building module is used for carrying out dimension splitting according to the dimension keywords and different theme categories, and storing the split dimension keywords into a plurality of dimension tables, wherein the plurality of dimension tables correspond to the theme categories; carrying out index splitting according to the index keywords and different dimensions, and storing the split index keywords into a plurality of index tables, wherein the index tables correspond to the dimensions; and automatically generating a dimension-index combination table according to the external keys of the index tables and the dimension tables, wherein the dimension-index combination table comprises the dimension-index combination and query instructions corresponding to the dimension-index combination, and establishing a caliber system by utilizing the dimension-index combination and the query instructions corresponding to the dimension-index combination.
4. A caliber system establishing apparatus according to claim 3, wherein the first establishing module specifically comprises:
The statistics unit is used for checking the data warehouse at intervals of preset time and counting a source data table generated in the preset time, wherein the source data table comprises description fields;
And the first establishing unit is used for replacing the description information in the description field by using a keyword to establish a keyword table.
5. An aperture system apparatus comprising a memory, a processor and a computer program stored in the memory and executable on the processor, characterized in that the processor implements the steps of an aperture system establishing method according to claim 1 or 2 when the computer program is executed by the processor.
6. A computer-readable storage medium storing a computer program, characterized in that the computer program when executed by a processor implements the steps of a caliber system establishing method according to any one of claims 1 or 2.
CN201910895112.0A 2019-09-20 2019-09-20 Caliber system establishment method, caliber system establishment device, caliber system establishment equipment and computer storage medium Active CN110716950B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201910895112.0A CN110716950B (en) 2019-09-20 2019-09-20 Caliber system establishment method, caliber system establishment device, caliber system establishment equipment and computer storage medium

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201910895112.0A CN110716950B (en) 2019-09-20 2019-09-20 Caliber system establishment method, caliber system establishment device, caliber system establishment equipment and computer storage medium

Publications (2)

Publication Number Publication Date
CN110716950A CN110716950A (en) 2020-01-21
CN110716950B true CN110716950B (en) 2024-05-17

Family

ID=69210789

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201910895112.0A Active CN110716950B (en) 2019-09-20 2019-09-20 Caliber system establishment method, caliber system establishment device, caliber system establishment equipment and computer storage medium

Country Status (1)

Country Link
CN (1) CN110716950B (en)

Families Citing this family (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110837508A (en) * 2019-11-08 2020-02-25 深圳市彬讯科技有限公司 Method, device and equipment for establishing aperture system and computer storage medium
CN112256804A (en) * 2020-10-27 2021-01-22 武汉市钱鲸科技有限公司 Retail service crowd statistical classification method
CN112418721A (en) * 2020-12-08 2021-02-26 中国建设银行股份有限公司 Index determination method and device
CN113076317B (en) * 2021-04-08 2024-06-18 深圳前海微众银行股份有限公司 Big data-based data processing method, device, equipment and readable storage medium
CN113268468A (en) * 2021-06-09 2021-08-17 上海通联金融服务有限公司 Method for designing data warehouse

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107729336A (en) * 2016-08-11 2018-02-23 阿里巴巴集团控股有限公司 Data processing method, equipment and system
CN109062946A (en) * 2018-06-21 2018-12-21 深圳市彬讯科技有限公司 It is a kind of to highlight method and device based on multiple web pages
CN109145110A (en) * 2018-06-29 2019-01-04 深圳市彬讯科技有限公司 Information classification processing, tag queries method and apparatus based on label

Family Cites Families (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20130185304A1 (en) * 2012-01-17 2013-07-18 International Business Machines Rule-driven runtime customization of keyword search engines

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107729336A (en) * 2016-08-11 2018-02-23 阿里巴巴集团控股有限公司 Data processing method, equipment and system
CN109062946A (en) * 2018-06-21 2018-12-21 深圳市彬讯科技有限公司 It is a kind of to highlight method and device based on multiple web pages
CN109145110A (en) * 2018-06-29 2019-01-04 深圳市彬讯科技有限公司 Information classification processing, tag queries method and apparatus based on label

Also Published As

Publication number Publication date
CN110716950A (en) 2020-01-21

Similar Documents

Publication Publication Date Title
CN110716950B (en) Caliber system establishment method, caliber system establishment device, caliber system establishment equipment and computer storage medium
US10296658B2 (en) Use of context-dependent statistics to suggest next steps while exploring a dataset
Zhang et al. Reducing uncertainty of schema matching via crowdsourcing
Angles et al. Benchmarking database systems for social network applications
CN105243067B (en) A kind of method and device for realizing real-time incremental synchrodata
US11172022B2 (en) Migrating cloud resources
US10452625B2 (en) Data lineage analysis
CN108027818A (en) Inquiry based on figure
CN107820613A (en) Database index is recommended and created automatically
CN111008521B (en) Method, device and computer storage medium for generating wide table
CN110023925A (en) It generates, access and display follow metadata
CN110737432B (en) Script aided design method and device based on root list
EP3751427A1 (en) Rules-based dataset cleaning
JP2016100005A (en) Reconcile method, processor and storage medium
CN113420026B (en) Database table structure changing method, device, equipment and storage medium
CN102609465A (en) Information recommendation method based on potential communities
CN109213826A (en) Data processing method and equipment
Szárnyas et al. The LDBC social network benchmark: Business intelligence workload
CN114398669A (en) Joint credit scoring method and device based on privacy protection calculation and cross-organization
CN115203435A (en) Entity relation generation method and data query method based on knowledge graph
CN111340404A (en) Method and device for constructing index system and computer storage medium
US11928083B2 (en) Determining collaboration recommendations from file path information
CN111639068A (en) Multi-system-based public data pool generation method, device, equipment and readable storage medium
CN105354339B (en) Content personalization providing method based on context
CN112667869B (en) Data processing method, device, system and storage medium

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
TA01 Transfer of patent application right

Effective date of registration: 20240422

Address after: Room 301, 3rd floor, 9 shangdijiu street, Haidian District, Beijing

Applicant after: Beijing Shenzhou Digital Cloud Information Technology Co.,Ltd.

Country or region after: China

Applicant after: Shenzhou Kuntai (Xiamen) Information Technology Co.,Ltd.

Address before: 330000 Huangjia Natural Village, Guangming Village, Changdong Town, High tech Development Zone, Nanchang City, Jiangxi Province 75

Applicant before: Huang Shasha

Country or region before: China

TA01 Transfer of patent application right
GR01 Patent grant
GR01 Patent grant