CN110941601A - Method and device for determining standard caliber of index, electronic equipment and storage medium - Google Patents

Method and device for determining standard caliber of index, electronic equipment and storage medium

Info

Publication number
CN110941601A
Authority
CN
China
Prior art keywords
index
caliber
determining
standard
analysis
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201911103196.6A
Other languages
Chinese (zh)
Other versions
CN110941601B (en)
Inventor
杨冬冬
刘强
魏建钟
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beijing Sankuai Online Technology Co Ltd
Original Assignee
Beijing Sankuai Online Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beijing Sankuai Online Technology Co Ltd
Priority to CN201911103196.6A
Publication of CN110941601A
Application granted
Publication of CN110941601B
Legal status: Active
Anticipated expiration

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/21Design, administration or maintenance of databases
    • G06F16/211Schema design and management
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/21Design, administration or maintenance of databases
    • G06F16/211Schema design and management
    • G06F16/212Schema design and management with details for data modelling support
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/28Databases characterised by their database models, e.g. relational or object models
    • G06F16/283Multi-dimensional databases or data warehouses, e.g. MOLAP or ROLAP
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y02TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
    • Y02PCLIMATE CHANGE MITIGATION TECHNOLOGIES IN THE PRODUCTION OR PROCESSING OF GOODS
    • Y02P90/00Enabling technologies with a potential contribution to greenhouse gas [GHG] emissions mitigation
    • Y02P90/30Computing systems specially adapted for manufacturing

Landscapes

  • Engineering & Computer Science (AREA)
  • Databases & Information Systems (AREA)
  • Theoretical Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

The embodiment of the application discloses a method and a device for determining an index standard caliber, electronic equipment and a storage medium, wherein the method comprises the following steps: acquiring an index of a standard caliber to be determined and a plurality of corresponding table identifications; respectively determining data models corresponding to the table identifications according to metadata, wherein the data models comprise a plurality of analysis dimensions related to the index caliber and values of the analysis dimensions; and determining the standard caliber of the index according to the data model and a pre-configured index dictionary, wherein the index dictionary stores values of a plurality of analysis dimensions corresponding to the index with the standard caliber. In this way, the standard caliber of the index is determined according to the analysis dimensions related to the index, without relying on manual investigation and confirmation, which improves the efficiency of determining the standard caliber of the index, avoids human errors and improves the accuracy of the determined standard caliber.

Description

Method and device for determining standard caliber of index, electronic equipment and storage medium
Technical Field
The embodiment of the application relates to the technical field of data warehouses, in particular to a method and a device for determining standard caliber of an index, electronic equipment and a storage medium.
Background
A data warehouse is a strategic set that provides all types of data support for all levels of decision-making processes of an enterprise. Providing analytical decisions for businesses and responding to large-scale complex queries is one of the primary functions of data warehouses.
Data models and indexes are important assets of a data warehouse. However, due to some characteristics of the data warehouse, indexes may be duplicated and their calibers may be inconsistent. The process of data extraction (Extract), transformation (Transform), cleaning (Cleansing) and loading (Load) is an important part of constructing a data warehouse: a user extracts the required data from a data source and, after data cleaning, loads the data into the data warehouse according to a predefined data warehouse model. The multi-layer, multi-table structure of the data warehouse requires redundancy of data indexes, and rapid iteration of services causes indexes with different dimensions to be built repeatedly.
In the prior art, when the standard caliber of the same index is determined, each index needs to be checked manually, so the efficiency of determining the standard caliber of the index is low, and the accuracy is insufficient due to human factors.
Disclosure of Invention
The embodiment of the application provides a method and a device for determining an index standard caliber, electronic equipment and a storage medium, which help to improve the efficiency and accuracy of determining the index standard caliber.
In order to solve the above problem, in a first aspect, an embodiment of the present application provides a method for determining an index standard caliber, including:
acquiring an index of a standard caliber to be determined and a plurality of corresponding table identifications;
respectively determining data models corresponding to the table identifications according to metadata, wherein the data models comprise a plurality of analysis dimensions related to the index caliber and values of the analysis dimensions;
and determining the standard caliber of the index according to the data model and a pre-configured index dictionary, wherein the index dictionary stores values of a plurality of analysis dimensions corresponding to the index with the standard caliber.
Optionally, the determining the standard caliber of the index according to the data model and a pre-configured index dictionary includes:
respectively matching the data models corresponding to the plurality of table identifications with the values of the analysis dimensions corresponding to the indexes in the index dictionary;
and determining the index caliber corresponding to the successfully matched table identifier as the standard caliber of the index.
Optionally, after the respectively matching the data models corresponding to the plurality of table identifiers with the values of the analysis dimensions corresponding to the indexes in the index dictionary, the method further includes:
if the data models corresponding to the table identifications and the index dictionary fail to be matched, calculating index caliber scores corresponding to the table identifications according to the data models corresponding to the table identifications and pre-configured index weights, wherein the index weights comprise full scores corresponding to the analysis dimensions and weights corresponding to the range of the analysis dimensions;
and determining the standard caliber of the index according to the index caliber scores corresponding to the plurality of table identifications.
Optionally, the calculating the index caliber scores corresponding to the plurality of table identifiers according to the data models corresponding to the plurality of table identifiers and the preconfigured index weights respectively includes:
selecting one table identifier from the plurality of table identifiers as a current table identifier;
inquiring the index weight according to the values of the plurality of analysis dimensions corresponding to the current table identification, and determining the full score of each analysis dimension and the weight corresponding to the value of the analysis dimension;
weighting and summing the full scores of the multiple analysis dimensions and the weights corresponding to the values of the analysis dimensions to obtain an index caliber score corresponding to the current table identifier;
and circularly executing the operation of selecting the current table identifier and determining the index caliber score corresponding to the current table identifier until the index caliber scores corresponding to the plurality of table identifiers are obtained.
Optionally, the determining the standard caliber of the index according to the index caliber scores corresponding to the plurality of table identifiers includes:
and taking the index caliber corresponding to the table identifier with the highest index caliber score as the standard caliber of the index.
Optionally, after determining the standard caliber of the index according to the index caliber scores corresponding to the plurality of table identifiers, the method further includes:
and correspondingly storing the value of the analysis dimension corresponding to the determined standard caliber and the index into the index dictionary.
Optionally, the plurality of analysis dimensions includes coverage, blood margin, heat, hierarchy, and topic.
In a second aspect, an embodiment of the present application provides an apparatus for determining an index standard caliber, including:
the index data acquisition module is used for acquiring indexes of the standard caliber to be determined and a plurality of corresponding table identifications;
the data model determining module is used for respectively determining data models corresponding to the table identifications according to metadata, and the data models comprise a plurality of analysis dimensions related to the index caliber and values of the analysis dimensions;
and the standard caliber determining module is used for determining the standard caliber of the index according to the data model and a pre-configured index dictionary, and the index dictionary stores values of a plurality of analysis dimensions corresponding to the index with the standard caliber.
Optionally, the standard caliber determining module includes:
a matching unit, configured to match the data models corresponding to the plurality of table identifiers with the values of the analysis dimensions corresponding to the indexes in the index dictionary, respectively;
and the first standard caliber determining unit is used for determining the index caliber corresponding to the successfully matched table identifier as the standard caliber of the index.
Optionally, the standard caliber determining module further includes:
the score calculating unit is used for calculating the index caliber scores corresponding to the plurality of table identifications according to the data models corresponding to the plurality of table identifications and pre-configured index weights respectively if the data models corresponding to the plurality of table identifications and the index dictionary fail to be matched, wherein the index weights comprise full scores corresponding to the plurality of analysis dimensions and weights corresponding to the range of the analysis dimensions;
and the second standard caliber determining unit is used for determining the standard caliber of the index according to the index caliber scores corresponding to the plurality of table identifications.
Optionally, the score calculating unit is specifically configured to:
selecting one table identifier from the plurality of table identifiers as a current table identifier;
inquiring the index weight according to the values of the plurality of analysis dimensions corresponding to the current table identification, and determining the full score of each analysis dimension and the weight corresponding to the value of the analysis dimension;
weighting and summing the full scores of the multiple analysis dimensions and the weights corresponding to the values of the analysis dimensions to obtain an index caliber score corresponding to the current table identifier;
and circularly executing the operation of selecting the current table identifier and determining the index caliber score corresponding to the current table identifier until the index caliber scores corresponding to the plurality of table identifiers are obtained.
Optionally, the second standard caliber determining unit is specifically configured to:
and taking the index caliber corresponding to the table identifier with the highest index caliber score as the standard caliber of the index.
Optionally, the apparatus further comprises:
and the index dictionary expansion module is used for correspondingly storing the values of the analysis dimensions corresponding to the determined standard caliber and the index into the index dictionary after the standard caliber of the index is determined according to the index caliber scores corresponding to the plurality of table identifications.
Optionally, the plurality of analysis dimensions includes coverage, blood margin, heat, hierarchy, and topic.
In a third aspect, an embodiment of the present application further discloses an electronic device, which includes a memory, a processor, and a computer program stored in the memory and executable on the processor, where the processor executes the computer program to implement the method for determining a standard caliber according to the embodiment of the present application.
In a fourth aspect, the present application provides a computer-readable storage medium, on which a computer program is stored, which, when executed by a processor, implements the steps of the method for determining a standard caliber disclosed in the embodiments of the present application.
With the method, the device, the electronic equipment and the storage medium for determining the standard caliber of the index disclosed in the embodiments of the application, an index of the standard caliber to be determined and a plurality of corresponding table identifications are acquired, data models corresponding to the table identifications are respectively determined according to the metadata, and the standard caliber of the index is determined according to the data models and a pre-configured index dictionary. Because the data model includes a plurality of analysis dimensions related to the index caliber and their values, and the index dictionary stores the values of a plurality of analysis dimensions corresponding to the index with the standard caliber, the standard caliber of the index can be determined by matching the data models with the index dictionary. The standard caliber is thus determined according to the analysis dimensions related to the index without relying on manual investigation and confirmation, which improves the efficiency of determining the standard caliber of the index, avoids human errors and improves the accuracy of the determined standard caliber.
Drawings
In order to more clearly illustrate the technical solutions of the embodiments of the present application, the drawings needed in the description of the embodiments or the prior art are briefly described below. The drawings in the following description show only some embodiments of the present application, and those skilled in the art can obtain other drawings based on these drawings without inventive effort.
Fig. 1 is a flowchart of a method for determining a standard caliber of an index according to a first embodiment of the present application;
Fig. 2 is a schematic structural diagram of an apparatus for determining a standard caliber of an index according to a third embodiment of the present application.
Detailed Description
The technical solutions in the embodiments of the present application will be clearly and completely described below with reference to the drawings in the embodiments of the present application, and it is obvious that the described embodiments are some, but not all, embodiments of the present application. All other embodiments, which can be derived by a person skilled in the art from the embodiments given herein without making any creative effort, shall fall within the protection scope of the present application.
Example one
As shown in fig. 1, a method for determining an index standard caliber disclosed in this embodiment includes: step 110 to step 130.
Step 110, acquiring an index of the standard caliber to be determined and a plurality of corresponding table identifications.
When the same index exists in a plurality of tables of the data warehouse, whether the index calibers of the index in the plurality of tables are the same or not needs to be determined, and the standard calibers of the index need to be determined when the index calibers are different.
The index of the standard caliber to be determined and the plurality of corresponding table identifications may both be given by a user and obtained through an interface; alternatively, only the index of the standard caliber to be determined is obtained from the user through an interface, the data warehouse is searched for a plurality of tables that include the index, and the table identifications corresponding to these tables are determined.
Step 120, respectively determining data models corresponding to the plurality of table identifications according to the metadata, wherein the data models comprise a plurality of analysis dimensions related to the index caliber and values of the plurality of analysis dimensions.
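The second acquisition path, searching the data warehouse for every table that includes the index, might be sketched as follows. The catalog layout (a mapping from table identifier to field names) and the name `find_tables_with_index` are assumptions made for illustration; the application does not specify how the search is performed.

```python
from typing import Dict, List


def find_tables_with_index(catalog: Dict[str, List[str]], index_field: str) -> List[str]:
    """Return the identifiers of all tables whose fields include the index.

    `catalog` is a hypothetical stand-in for warehouse metadata: it maps
    each table identifier to the list of field names in that table.
    """
    return sorted(table_id for table_id, fields in catalog.items()
                  if index_field in fields)


# Hypothetical catalog: tables "a" and "b" both carry the "selling price" field.
catalog = {
    "a": ["order_id", "selling price"],
    "b": ["settlement_id", "selling price"],
    "c": ["user_id"],
}
table_ids = find_tables_with_index(catalog, "selling price")
print(table_ids)  # ['a', 'b']
```

Since one index corresponds to one database field, a plain membership test on field names is enough for this sketch; a real warehouse would presumably query its metadata service instead.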
Metadata is data about data in a data warehouse, and information such as a logical data structure, a file, an address, an index and the like is stored, and metadata is data describing the structure and the building method of the data in the data warehouse. One index corresponds to one database field. The index caliber is index description information, namely the access logic of the index.
The plurality of analysis dimensions includes coverage, blood margin, heat, hierarchy, and theme. Coverage refers to the coverage of the current index in the whole data link, and can be understood as the influence on the downstream. The blood margin (that is, the data lineage) refers to the upstream and downstream dependency relationships. Heat refers to the frequency of queries that are not outside the ETL. The hierarchy refers to the level of a table in the data warehouse. A theme is an abstract concept for integrating, classifying and analyzing the data of the enterprise information system at a high level, and each theme basically corresponds to a macroscopic analysis field. The value of the coverage and the value of the heat are specific numerical values, the value of the blood margin is the name of the root node corresponding to the current table identifier, the value of the hierarchy is the level in the data warehouse of the table corresponding to the current table identifier, and the value of the theme is the specific theme corresponding to the current table identifier.
For each table identifier, the corresponding data model is obtained by analyzing the metadata. Specifically, one of the plurality of table identifiers is taken as the current table identifier, the metadata of the data warehouse is analyzed according to the plurality of analysis dimensions in the data model, and the value of each of the analysis dimensions corresponding to the current table identifier is determined, so that the data model corresponding to the current table identifier is obtained.
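As a minimal sketch of what such a data model could look like in code, the five analysis dimensions can be held in a small record type. The class name `DataModel`, the helper `build_data_model` and the metadata layout are all hypothetical; the application does not prescribe a concrete representation, and the sample values are illustrative only.

```python
from dataclasses import dataclass
from typing import Any, Dict


@dataclass(frozen=True)
class DataModel:
    """Values of the five analysis dimensions for one table identifier."""
    coverage: int      # specific numerical value
    blood_margin: str  # name of the root node of the table
    heat: int          # specific numerical value
    hierarchy: str     # level of the table in the data warehouse
    theme: str         # theme the table belongs to


def build_data_model(metadata: Dict[str, Dict[str, Any]], table_id: str) -> DataModel:
    """Read the dimension values of `table_id` from a hypothetical metadata map."""
    record = metadata[table_id]
    return DataModel(
        coverage=record["coverage"],
        blood_margin=record["blood_margin"],
        heat=record["heat"],
        hierarchy=record["hierarchy"],
        theme=record["theme"],
    )


# Illustrative metadata for a single table.
metadata = {
    "a": {"coverage": 5000, "blood_margin": "order system",
          "heat": 1000, "hierarchy": "B3", "theme": "transaction"},
}
model_a = build_data_model(metadata, "a")
print(model_a)
```

A frozen dataclass is used so that two data models compare equal exactly when all five dimension values are equal, which is convenient for the dictionary matching described later.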
Step 130, determining a standard caliber of the index according to the data model and a pre-configured index dictionary, wherein the index dictionary stores values of a plurality of analysis dimensions corresponding to the index with the standard caliber.
For the data model corresponding to each table identifier, the index dictionary is queried according to the values of the plurality of analysis dimensions in the data model, these values are matched with the values of the plurality of analysis dimensions corresponding to the index in the index dictionary, and the standard caliber of the index is determined according to the matching result.
In an embodiment of the application, the determining a standard caliber of the index according to the data model and a pre-configured index dictionary includes: respectively matching the data models corresponding to the plurality of table identifications with the values of the analysis dimensions corresponding to the indexes in the index dictionary; and determining the index caliber corresponding to the successfully matched table identifier as the standard caliber of the index.
The data model corresponding to each table identifier is matched with the index dictionary separately. One table identifier is selected from the plurality of table identifiers as the current table identifier, the index dictionary is queried to obtain the values of the analysis dimensions corresponding to the index, and the data model corresponding to the current table identifier is matched with these values. If the value of every analysis dimension in the data model is the same as the value of the same analysis dimension in the index dictionary, the matching is determined to be successful; if the value of any analysis dimension in the data model differs from the value of the same analysis dimension in the index dictionary, the matching is determined to have failed. The index caliber corresponding to a successfully matched table identifier is determined as the standard caliber of the index; if the data models of several table identifiers are all successfully matched with the index dictionary, the index calibers corresponding to these table identifiers are consistent and are all standard calibers.
According to the method for determining the standard caliber of the index, after the index of the standard caliber to be determined and the corresponding table identifications are obtained, the data models corresponding to the table identifications are respectively determined according to the metadata, and the standard caliber of the index is determined according to the data models and the pre-configured index dictionary. Because the data models comprise a plurality of analysis dimensions related to the index caliber and their values, and the index dictionary stores the values of a plurality of analysis dimensions corresponding to the index with the standard caliber, the standard caliber of the index can be determined by matching the data models with the index dictionary. The standard caliber of the index is thus determined according to the analysis dimensions related to the index without relying on manual investigation and confirmation, which improves the efficiency of determining the standard caliber of the index, avoids human errors and improves the accuracy of the determined standard caliber.
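The matching rule described above, success only when every analysis dimension agrees, can be sketched as follows. The dictionary layout, the index name `example_index` and the function names are assumptions for illustration, not the application's own interfaces.

```python
from typing import List, Mapping, Optional

DIMENSIONS = ("coverage", "blood_margin", "heat", "hierarchy", "theme")

Model = Mapping[str, object]


def is_match(model: Model, standard: Model) -> bool:
    """Succeed only if every analysis dimension has the same value."""
    return all(model[dim] == standard[dim] for dim in DIMENSIONS)


def matched_tables(models: Mapping[str, Model],
                   index_dictionary: Mapping[str, Model],
                   index_name: str) -> List[str]:
    """Return the table identifiers whose data model matches the dictionary.

    An empty list means matching failed for every table, including the
    case where the dictionary holds no entry for the index at all.
    """
    standard: Optional[Model] = index_dictionary.get(index_name)
    if standard is None:
        return []
    return [table_id for table_id, model in models.items()
            if is_match(model, standard)]


# Hypothetical data: the dictionary entry equals the data model of table "t1".
index_dictionary = {
    "example_index": {"coverage": 5000, "blood_margin": "order system",
                      "heat": 1000, "hierarchy": "B3", "theme": "transaction"},
}
models = {
    "t1": dict(index_dictionary["example_index"]),
    "t2": {"coverage": 90, "blood_margin": "settlement system",
           "heat": 600, "hierarchy": "B3", "theme": "settlement"},
}
print(matched_tables(models, index_dictionary, "example_index"))  # ['t1']
```

Returning a list rather than a single identifier reflects the case in which several tables match and therefore all carry the standard caliber.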
In an embodiment of the present application, after the respectively matching the data models corresponding to the plurality of table identifiers with the values of the analysis dimensions corresponding to the indexes in the index dictionary, further optionally includes: if the data models corresponding to the table identifications and the index dictionary fail to be matched, calculating index caliber scores corresponding to the table identifications according to the data models corresponding to the table identifications and pre-configured index weights, wherein the index weights comprise full scores corresponding to the analysis dimensions and weights corresponding to the range of the analysis dimensions; and determining the standard caliber of the index according to the index caliber scores corresponding to the plurality of table identifications.
If the data model corresponding to the plurality of table identifiers and the index dictionary fail to be matched, determining that the index dictionary does not store the value of the analysis dimension corresponding to the standard caliber of the index, calculating the index caliber scores corresponding to the plurality of table identifiers according to the preset index weight, and determining the standard caliber of the index according to the index caliber scores corresponding to the plurality of table identifiers. The index weight includes a full score corresponding to each of the plurality of analysis dimensions and a weight corresponding to a range included in each of the analysis dimensions. When the index caliber score corresponding to one table identifier is calculated, aiming at each analysis dimension, determining the range of the analysis dimension in which the value of the analysis dimension is positioned, multiplying the full score of the analysis dimension and the weight of the range of the analysis dimension to obtain the score of the analysis dimension corresponding to the table identifier, and adding the scores of the analysis dimensions to obtain the index caliber score corresponding to the table identifier. The index caliber scores respectively corresponding to the plurality of table identifications are determined according to the preset index weight, and the standard caliber of the index is determined according to the index caliber scores respectively corresponding to the plurality of table identifications, so that the defect that the standard caliber related data of the index are not stored in an index dictionary is overcome, and the accuracy of determining the standard caliber is further improved.
In an embodiment of the present application, the calculating the index caliber scores corresponding to the plurality of table identifiers according to the data models corresponding to the plurality of table identifiers and the preconfigured index weights respectively includes: selecting one table identifier from the plurality of table identifiers as a current table identifier; inquiring the index weight according to the values of the plurality of analysis dimensions corresponding to the current table identification, and determining the full score of each analysis dimension and the weight corresponding to the value of the analysis dimension; carrying out weighted summation on the full score of the analysis dimension and the weight corresponding to the value of the analysis dimension to obtain an index caliber score corresponding to the current table identifier; and circularly executing the operation of selecting the current table identifier and determining the index caliber score corresponding to the current table identifier until the index caliber scores corresponding to the plurality of table identifiers are obtained. Processing each table identifier respectively, selecting one table identifier from the multiple table identifiers as a current table identifier, inquiring index weight according to the values of the multiple analysis dimensions corresponding to the current table identifier, determining the range included by each analysis dimension and determining the range in which the value of the analysis dimension is located, taking the weight corresponding to the range as the weight corresponding to the value of the analysis dimension, calculating the product of the full score of each analysis dimension and the weight corresponding to the value of the analysis dimension as the score of the analysis dimension, and adding the scores of the multiple analysis dimensions to be used as the index caliber score corresponding to the current table identifier. 
The corresponding index caliber score is calculated separately for each table identifier, which avoids confusion in the calculation and improves the accuracy of the calculation result.
In an embodiment of the application, the determining the standard caliber of the index according to the index caliber scores corresponding to the plurality of table identifiers includes: taking the index caliber corresponding to the table identifier with the highest index caliber score as the standard caliber of the index. The higher the index caliber score is, the more accurate the caliber of the index is, so the index caliber corresponding to the table identifier with the highest index caliber score is used as the standard caliber of the index.
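The scoring procedure can be sketched as below: for each analysis dimension, find the range that contains the value, multiply the full score of the dimension by the weight of that range, and add the results. The weight configuration here is hypothetical and limited to two dimensions for brevity; it is not the configuration of the application, and all names are illustrative.

```python
from typing import Dict, List, Mapping, Tuple

# Hypothetical index weights (assumed values, for illustration only).
FULL_SCORES: Dict[str, float] = {"coverage": 15, "hierarchy": 20}
# Numeric dimensions: list of (low, high, weight) ranges, bounds inclusive.
NUMERIC_RANGES: Dict[str, List[Tuple[float, float, float]]] = {
    "coverage": [(0, 500, 0.2), (501, 10000, 0.5)],
}
# Categorical dimensions: each possible value maps directly to a weight.
CATEGORY_WEIGHTS: Dict[str, Dict[str, float]] = {
    "hierarchy": {"B1": 1.0, "B3": 0.6},
}


def dimension_weight(dimension: str, value) -> float:
    """Weight of the range that contains `value`; 0.0 if no range matches."""
    if dimension in NUMERIC_RANGES:
        for low, high, weight in NUMERIC_RANGES[dimension]:
            if low <= value <= high:
                return weight
        return 0.0
    return CATEGORY_WEIGHTS[dimension].get(value, 0.0)


def caliber_score(model: Mapping[str, object]) -> float:
    """Sum over dimensions of (full score x weight of the value's range)."""
    return sum(FULL_SCORES[dim] * dimension_weight(dim, model[dim])
               for dim in FULL_SCORES)


def score_all(models: Mapping[str, Mapping[str, object]]) -> Dict[str, float]:
    """Score every table identifier in turn."""
    return {table_id: caliber_score(model) for table_id, model in models.items()}


models = {
    "t1": {"coverage": 5000, "hierarchy": "B3"},
    "t2": {"coverage": 90, "hierarchy": "B3"},
}
scores = score_all(models)
print(scores)
```

Treating a value that falls in no configured range as weight 0.0 is a design choice of this sketch; the application does not say how such values are handled.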
In an embodiment of the present application, after determining the standard caliber of the index according to the index caliber scores corresponding to the plurality of table identifiers, the method may further include: and correspondingly storing the value of the analysis dimension corresponding to the determined standard caliber and the index into the index dictionary. After the standard caliber of the index is determined according to the index caliber scores corresponding to the plurality of table identifications, the value of the analysis dimension corresponding to the standard caliber and the index are correspondingly stored in the index dictionary, so that the index dictionary is expanded, and the determination efficiency of the subsequent index standard caliber can be further improved.
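Selecting the highest-scoring table and then expanding the index dictionary might look like the following sketch. The function names, the dictionary layout and the sample scores are hypothetical.

```python
from typing import Dict, Mapping


def select_standard_table(scores: Mapping[str, float]) -> str:
    """Return the table identifier with the highest index caliber score.

    On a tie, the identifier that appears first in `scores` is returned;
    the application does not state how ties should be resolved.
    """
    return max(scores, key=lambda table_id: scores[table_id])


def expand_index_dictionary(index_dictionary: Dict[str, Dict[str, object]],
                            index_name: str,
                            model: Mapping[str, object]) -> None:
    """Store the dimension values of the standard caliber under the index."""
    index_dictionary[index_name] = dict(model)


# Hypothetical scores and data models for two tables.
scores = {"t1": 52.0, "t2": 30.0}
models = {
    "t1": {"coverage": 5000, "hierarchy": "B3"},
    "t2": {"coverage": 90, "hierarchy": "B3"},
}
index_dictionary: Dict[str, Dict[str, object]] = {}

standard_table = select_standard_table(scores)
expand_index_dictionary(index_dictionary, "example_index", models[standard_table])
print(standard_table, index_dictionary)
```

Once the entry is stored, a later request for the same index can be resolved by dictionary matching alone, without recomputing the scores.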
Example two
The present embodiment describes the process of determining an index standard caliber by using a specific example. In this embodiment, two tables, table a and table b, have the same index, and the field corresponding to the index is "selling price".
The index of the standard caliber to be determined is acquired as "selling price", and the table identifications corresponding to the index are table a and table b. The metadata of the data warehouse is analyzed to determine the values of the plurality of analysis dimensions in the data model; the data models corresponding to table a and table b are shown in Table 1. By analyzing the metadata, for table a, the value of coverage in the obtained data model is 5000, the value of blood margin is the order system (that is, the root node corresponding to table a is the order system), the value of heat is 1000, the value of hierarchy is B3, and the value of theme is transaction; for table b, the value of coverage is 90, the value of blood margin is the settlement system (that is, the root node corresponding to table b is the settlement system), the value of heat is 600, the value of hierarchy is B3, and the value of theme is settlement.
TABLE 1 data model
Table identifier | Coverage | Blood margin      | Heat | Hierarchy | Theme
a                | 5000     | Order system      | 1000 | B3        | Transaction
b                | 90       | Settlement system | 600  | B3        | Settlement
After the data models corresponding to table a and table b are determined, each is matched against the values of the analysis dimensions stored for the index "selling price" in the index dictionary. In this example, neither data model matches the index dictionary. It is therefore necessary to calculate the index caliber score corresponding to table a from its data model and the pre-configured index weights, and likewise the index caliber score corresponding to table b. Assume that the pre-configured index weights are as shown in Table 2.
TABLE 2 index weights
[Table 2 is reproduced as images in the original publication: Figures BDA0002270460660000091 and BDA0002270460660000101.]
By querying the pre-configured index weights, the full score of coverage is found to be 15, that of blood margin 30, that of heat 15, that of hierarchy 20, and that of theme 20.
For table a: the coverage 5000 falls in the range starting at 501, the weight corresponding to this range is 50%, and the coverage score is 15 × 50% = 7.5; the blood margin is the order system, whose weight is 50%, so the blood margin score is 30 × 50% = 15; the heat 1000 falls in the range 501-10000, whose weight is 50%, so the heat score is 15 × 50% = 7.5; the hierarchy is B3, whose weight is 60%, so the hierarchy score is 20 × 60% = 12; the theme is transaction, which by querying the index weights belongs to the order system, whose weight is 50%, so the theme score is 20 × 50% = 10. Summing the coverage score 7.5, the blood margin score 15, the heat score 7.5, the hierarchy score 12, and the theme score 10 gives an index caliber score of 52 for table a.
For table b: the coverage 90 falls in the range 1-500, whose weight is 20%, so the coverage score is 15 × 20% = 3; the blood margin is the settlement system, whose weight is 30%, so the blood margin score is 30 × 30% = 9; the heat 600 falls in the range 501-10000, whose weight is 50%, so the heat score is 15 × 50% = 7.5; the hierarchy is B3, whose weight is 60%, so the hierarchy score is 20 × 60% = 12; the theme is settlement, which by querying the index weights belongs to the settlement system, whose weight is 30%, so the theme score is 20 × 30% = 6. Summing the coverage score 3, the blood margin score 9, the heat score 7.5, the hierarchy score 12, and the theme score 6 gives an index caliber score of 37.5 for table b.
Since the index caliber score corresponding to table a is greater than the index caliber score corresponding to table b, the index caliber corresponding to table a is taken as the standard caliber of the index "selling price".
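The arithmetic of this example can be checked with a short sketch. It uses only the full scores and weights stated in the text above; the coverage weight for table a is the 50% implied by its stated score of 7.5 out of a full score of 15. The variable and function names are hypothetical, and weights are kept as integer percentages so that the floating-point results are exact.

```python
# Full score of each analysis dimension, as stated in the example.
FULL_SCORES = {"coverage": 15, "blood_margin": 30, "heat": 15, "hierarchy": 20, "theme": 20}

# Weight (in percent) looked up for each table's dimension values.
# Table a's coverage weight is implied by its stated score 7.5 = 15 x 50%.
WEIGHT_PERCENT = {
    "a": {"coverage": 50, "blood_margin": 50, "heat": 50, "hierarchy": 60, "theme": 50},
    "b": {"coverage": 20, "blood_margin": 30, "heat": 50, "hierarchy": 60, "theme": 30},
}


def caliber_score(weights: dict) -> float:
    """Sum of full score times weight over all analysis dimensions."""
    return sum(FULL_SCORES[dim] * weights[dim] / 100 for dim in FULL_SCORES)


scores = {table: caliber_score(w) for table, w in WEIGHT_PERCENT.items()}
standard_table = max(scores, key=scores.get)  # table with the highest score

print(scores)          # {'a': 52.0, 'b': 37.5}
print(standard_table)  # a
```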
This embodiment illustrates the process of determining the standard caliber of an index with a specific example. The standard caliber is determined from the analysis dimensions related to the index, without relying on manual investigation and confirmation, which improves the efficiency of the determination, avoids human error, and improves the accuracy of the determined standard caliber.
Example three
This embodiment discloses an apparatus for determining the standard caliber of an index. As shown in Fig. 2, the apparatus 200 includes:
an index data obtaining module 210, configured to obtain an index whose standard caliber is to be determined and a plurality of corresponding table identifiers;
a data model determining module 220, configured to determine, according to the metadata, data models corresponding to the plurality of table identifiers, where each data model includes a plurality of analysis dimensions related to the index caliber and values of the plurality of analysis dimensions;
a standard caliber determining module 230, configured to determine the standard caliber of the index according to the data models and a pre-configured index dictionary, where the index dictionary stores values of a plurality of analysis dimensions corresponding to the index with the standard caliber.
Optionally, the standard caliber determining module includes:
a matching unit, configured to match the data models corresponding to the plurality of table identifiers with the values of the analysis dimensions corresponding to the indexes in the index dictionary, respectively;
a first standard caliber determining unit, configured to determine the index caliber corresponding to the successfully matched table identifier as the standard caliber of the index.
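The behavior of the matching unit and the first standard caliber determining unit can be sketched as follows. The application does not spell out the matching rule, so this sketch assumes that a table matches when every dimension value stored in the dictionary entry equals the corresponding value in the table's data model. The function name `match_against_dictionary`, the key names, and the sample dictionary entry are all hypothetical.

```python
from typing import Dict, Optional, Union

DimensionValues = Dict[str, Union[int, str]]


def match_against_dictionary(
    index_name: str,
    data_models: Dict[str, DimensionValues],
    index_dictionary: Dict[str, DimensionValues],
) -> Optional[str]:
    """Return the first table identifier whose data model matches, else None."""
    entry = index_dictionary.get(index_name)
    if not entry:
        return None  # no stored standard caliber for this index
    for table_id, model in data_models.items():
        # Assumed rule: exact equality on every dimension stored in the entry.
        if all(model.get(dim) == value for dim, value in entry.items()):
            return table_id
    return None


data_models = {
    "a": {"coverage": 5000, "blood_margin": "order system", "heat": 1000,
          "hierarchy": "B3", "theme": "transaction"},
    "b": {"coverage": 90, "blood_margin": "settlement system", "heat": 600,
          "hierarchy": "B3", "theme": "settlement"},
}
# Hypothetical dictionary entry for illustration only.
index_dictionary = {
    "selling price": {"blood_margin": "order system", "hierarchy": "B3", "theme": "transaction"},
}

matched = match_against_dictionary("selling price", data_models, index_dictionary)
unmatched = match_against_dictionary("selling price", data_models, {})
```

A return value of `None` corresponds to the matching failure case, in which the index caliber scores would be calculated instead.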
Optionally, the standard caliber determining module further includes:
a score calculating unit, configured to, if the data models corresponding to the plurality of table identifiers all fail to match the index dictionary, calculate the index caliber scores corresponding to the plurality of table identifiers according to the data models corresponding to the plurality of table identifiers and pre-configured index weights, where the index weights include the full scores corresponding to the plurality of analysis dimensions and the weights corresponding to the value ranges of the analysis dimensions;
a second standard caliber determining unit, configured to determine the standard caliber of the index according to the index caliber scores corresponding to the plurality of table identifiers.
Optionally, the score calculating unit is specifically configured to:
selecting one table identifier from the plurality of table identifiers as a current table identifier;
querying the index weights according to the values of the plurality of analysis dimensions corresponding to the current table identifier, and determining the full score of each analysis dimension and the weight corresponding to the value of that analysis dimension;
computing the weighted sum of the full scores of the plurality of analysis dimensions, using the weights corresponding to the values of the analysis dimensions, to obtain the index caliber score corresponding to the current table identifier;
repeating the operations of selecting a current table identifier and determining the index caliber score corresponding to the current table identifier until the index caliber scores corresponding to the plurality of table identifiers are obtained.
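The loop performed by the score calculating unit can be sketched as follows, here with a weight lookup by value range. The configuration layout and the names `INDEX_WEIGHTS`, `weight_percent`, and `score_tables` are hypothetical. The heat range 501-10000 with weight 50% and the hierarchy value B3 with weight 60% come from Example two; the heat range 1-500 with weight 20% is an assumption added only so that the lookup has two buckets.

```python
from typing import Dict, Union

Value = Union[int, str]

# Hypothetical index-weight configuration. Weights are integer percentages.
INDEX_WEIGHTS: Dict[str, dict] = {
    # (low, high, percent); the 1-500 bucket is assumed for illustration.
    "heat": {"full_score": 15, "ranges": [(1, 500, 20), (501, 10000, 50)]},
    "hierarchy": {"full_score": 20, "categories": {"B3": 60}},
}


def weight_percent(dimension: str, value: Value) -> int:
    """Look up the weight for one dimension value; 0 if nothing applies."""
    config = INDEX_WEIGHTS[dimension]
    if "ranges" in config:
        for low, high, percent in config["ranges"]:
            if low <= value <= high:
                return percent
        return 0
    return config["categories"].get(value, 0)


def score_tables(data_models: Dict[str, Dict[str, Value]]) -> Dict[str, float]:
    """Compute one index caliber score per table identifier."""
    scores: Dict[str, float] = {}
    for table_id, model in data_models.items():  # select the current table identifier
        scores[table_id] = sum(
            INDEX_WEIGHTS[dim]["full_score"] * weight_percent(dim, value) / 100
            for dim, value in model.items()
        )
    return scores


scores = score_tables({
    "a": {"heat": 1000, "hierarchy": "B3"},
    "b": {"heat": 300, "hierarchy": "B3"},
})
```

Only two analysis dimensions are configured here to keep the sketch short; a full configuration would carry one entry per analysis dimension.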
Optionally, the second standard caliber determining unit is specifically configured to:
take the index caliber corresponding to the table identifier with the highest index caliber score as the standard caliber of the index.
Optionally, the apparatus further comprises:
an index dictionary expansion module, configured to store the values of the analysis dimensions corresponding to the determined standard caliber, together with the index, in the index dictionary after the standard caliber of the index is determined according to the index caliber scores corresponding to the plurality of table identifiers.
Optionally, the plurality of analysis dimensions includes coverage, blood margin, heat, hierarchy, and theme.
The apparatus for determining the standard caliber of an index provided in the embodiment of the present application is configured to implement each step of the method for determining the standard caliber of an index described in the embodiments of the present application. For the specific implementation of each module of the apparatus, refer to the corresponding step; details are not repeated here.
In the apparatus for determining the standard caliber of an index disclosed in this embodiment, the index data obtaining module obtains the index whose standard caliber is to be determined and a plurality of corresponding table identifiers; the data model determining module determines, according to the metadata, the data models corresponding to the plurality of table identifiers; and the standard caliber determining module determines the standard caliber of the index according to the data models and a pre-configured index dictionary. Because each data model includes a plurality of analysis dimensions related to the index caliber together with their values, and the index dictionary stores the values of the plurality of analysis dimensions corresponding to indexes that have a standard caliber, the standard caliber of the index can be determined by matching the two. The standard caliber is thus determined from the analysis dimensions related to the index without relying on manual investigation and confirmation, which improves the efficiency of the determination, avoids human error, and improves the accuracy of the determined standard caliber.
Correspondingly, the embodiment of the present application also discloses an electronic device, which includes a memory, a processor, and a computer program stored on the memory and executable on the processor, where the processor implements the method for determining the standard caliber of an index when executing the computer program. The electronic device may be a PC, a mobile terminal, a personal digital assistant, a tablet computer, or the like.
The embodiment of the present application also discloses a computer-readable storage medium on which a computer program is stored, where the computer program, when executed by a processor, implements the steps of the method for determining the standard caliber of an index according to the embodiments of the present application.
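The cooperation of the three modules can be sketched as a small pipeline. This is only a structural illustration, assuming each module is supplied as a callable; the class `StandardCaliberApparatus`, its field names, and the stub functions below are hypothetical, and the stub decision rule is a placeholder rather than the method of the application.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List, Tuple

DataModel = Dict[str, object]


@dataclass
class StandardCaliberApparatus:
    # Index data obtaining module (210): returns the index and its table identifiers.
    obtain_index_data: Callable[[], Tuple[str, List[str]]]
    # Data model determining module (220): builds one data model per table identifier.
    determine_data_model: Callable[[str], DataModel]
    # Standard caliber determining module (230): picks the table whose caliber is standard.
    determine_standard_caliber: Callable[[str, Dict[str, DataModel]], str]

    def run(self) -> str:
        index_name, table_ids = self.obtain_index_data()
        models = {t: self.determine_data_model(t) for t in table_ids}
        return self.determine_standard_caliber(index_name, models)


# Stub modules for illustration only.
apparatus = StandardCaliberApparatus(
    obtain_index_data=lambda: ("selling price", ["a", "b"]),
    determine_data_model=lambda table_id: {"hierarchy": "B3", "table": table_id},
    # Placeholder rule: choose the alphabetically first table identifier.
    determine_standard_caliber=lambda name, models: sorted(models)[0],
)
result = apparatus.run()
```

Passing the modules in as callables keeps the metadata analysis and the decision logic replaceable independently of one another.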
The embodiments in this specification are described in a progressive manner: each embodiment focuses on its differences from the other embodiments, and for the parts that are the same or similar, the embodiments may be referred to one another. Because the apparatus embodiment is basically similar to the method embodiment, it is described briefly; for the relevant points, refer to the corresponding description of the method embodiment.
The method, apparatus, electronic device, and storage medium for determining the standard caliber of an index provided by the embodiments of the present application have been described in detail above. Specific examples are used herein to illustrate the principle and implementation of the present application, and the description of the embodiments is intended only to help in understanding the method and core idea of the present application. Meanwhile, a person skilled in the art may, following the idea of the present application, vary the specific embodiments and the scope of application. In summary, the content of this specification should not be construed as limiting the present application.
Through the above description of the embodiments, those skilled in the art will clearly understand that each embodiment can be implemented by software plus a necessary general hardware platform, and certainly can also be implemented by hardware. With this understanding in mind, the above-described technical solutions may be embodied in the form of a software product, which can be stored in a computer-readable storage medium such as ROM/RAM, magnetic disk, optical disk, etc., and includes instructions for causing a computer device (which may be a personal computer, a server, or a network device, etc.) to execute the methods described in the embodiments or some parts of the embodiments.

Claims (10)

1. A method for determining the standard caliber of an index, comprising:
acquiring an index whose standard caliber is to be determined and a plurality of corresponding table identifiers;
respectively determining data models corresponding to the table identifications according to metadata, wherein the data models comprise a plurality of analysis dimensions related to the index caliber and values of the analysis dimensions;
and determining the standard caliber of the index according to the data model and a pre-configured index dictionary, wherein the index dictionary stores values of a plurality of analysis dimensions corresponding to the index with the standard caliber.
2. The method of claim 1, wherein determining the standard caliber of the index according to the data model and a pre-configured index dictionary comprises:
respectively matching the data models corresponding to the plurality of table identifications with the values of the analysis dimensions corresponding to the indexes in the index dictionary;
and determining the index caliber corresponding to the successfully matched table identifier as the standard caliber of the index.
3. The method of claim 2, further comprising, after respectively matching the data models corresponding to the plurality of table identifiers with the values of the analysis dimensions corresponding to the index in the index dictionary:
if the data models corresponding to the table identifications and the index dictionary fail to be matched, calculating index caliber scores corresponding to the table identifications according to the data models corresponding to the table identifications and pre-configured index weights, wherein the index weights comprise full scores corresponding to the analysis dimensions and weights corresponding to the range of the analysis dimensions;
and determining the standard caliber of the index according to the index caliber scores corresponding to the plurality of table identifications.
4. The method of claim 3, wherein calculating the index caliber scores corresponding to the plurality of table identifiers according to the data models corresponding to the plurality of table identifiers and the pre-configured index weights respectively comprises:
selecting one table identifier from the plurality of table identifiers as a current table identifier;
querying the index weights according to the values of the plurality of analysis dimensions corresponding to the current table identifier, and determining the full score of each analysis dimension and the weight corresponding to the value of that analysis dimension;
computing the weighted sum of the full scores of the plurality of analysis dimensions, using the weights corresponding to the values of the analysis dimensions, to obtain the index caliber score corresponding to the current table identifier;
and repeating the operations of selecting a current table identifier and determining the index caliber score corresponding to the current table identifier until the index caliber scores corresponding to the plurality of table identifiers are obtained.
5. The method of claim 3, wherein determining the standard caliber of the index according to the index caliber scores corresponding to the plurality of table identifications comprises:
and taking the index caliber corresponding to the table identifier with the highest index caliber score as the standard caliber of the index.
6. The method of claim 3, further comprising, after determining the standard caliber of the index according to the index caliber scores corresponding to the plurality of table identifiers:
and correspondingly storing the value of the analysis dimension corresponding to the determined standard caliber and the index into the index dictionary.
7. The method of claim 1, wherein the plurality of analysis dimensions comprise coverage, blood margin, heat, hierarchy, and theme.
8. An apparatus for determining the standard caliber of an index, comprising:
an index data obtaining module, configured to obtain an index whose standard caliber is to be determined and a plurality of corresponding table identifiers;
a data model determining module, configured to respectively determine, according to metadata, data models corresponding to the plurality of table identifiers, wherein the data models comprise a plurality of analysis dimensions related to the index caliber and values of the plurality of analysis dimensions;
and a standard caliber determining module, configured to determine the standard caliber of the index according to the data model and a pre-configured index dictionary, wherein the index dictionary stores values of a plurality of analysis dimensions corresponding to the index with the standard caliber.
9. An electronic device comprising a memory, a processor and a computer program stored on the memory and executable on the processor, wherein the processor implements the method for determining the standard caliber of an index according to any one of claims 1 to 7 when executing the computer program.
10. A computer-readable storage medium, on which a computer program is stored, wherein the program, when executed by a processor, implements the steps of the method for determining the standard caliber of an index according to any one of claims 1 to 7.
CN201911103196.6A 2019-11-12 2019-11-12 Method and device for determining standard caliber of index, electronic equipment and storage medium Active CN110941601B (en)

Publications (2)

Publication Number Publication Date
CN110941601A true CN110941601A (en) 2020-03-31
CN110941601B CN110941601B (en) 2023-05-30

Family

ID=69907461

Country Status (1)

Country Link
CN (1) CN110941601B (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113535938A (en) * 2021-07-22 2021-10-22 北京明略软件系统有限公司 Standard data construction method, system, device and medium based on content identification

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101567068A (en) * 2009-05-18 2009-10-28 北京方正春元科技发展有限公司 Data-processing system used for financial information management
CN102013049A (en) * 2010-10-20 2011-04-13 浪潮集团山东通用软件有限公司 Virtual organization-based KPI analysis method and statistical analysis system
CN105335401A (en) * 2014-07-22 2016-02-17 阿里巴巴集团控股有限公司 Data warehouse index management method, apparatus and system
CN106358213A (en) * 2016-09-22 2017-01-25 中国联合网络通信集团有限公司 Indoor distribution system evaluation method and indoor distribution system evaluation device
CN107688580A (en) * 2016-08-05 2018-02-13 北京京东尚科信息技术有限公司 The method, apparatus and system of commodity classification based on Distributed Data Warehouse
CN107844962A (en) * 2017-11-24 2018-03-27 广东电网有限责任公司电网规划研究中心 A kind of distribution Construction Cost Data based on standard data structure collects system

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
GR01 Patent grant