CN105630475B - A kind of data label organization system and method for organizing - Google Patents

A kind of data label organization system and method for organizing Download PDF

Info

Publication number
CN105630475B
CN105630475B CN201410624275.2A CN201410624275A CN105630475B CN 105630475 B CN105630475 B CN 105630475B CN 201410624275 A CN201410624275 A CN 201410624275A CN 105630475 B CN105630475 B CN 105630475B
Authority
CN
China
Prior art keywords
data label
label
data
information
module
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201410624275.2A
Other languages
Chinese (zh)
Other versions
CN105630475A (en
Inventor
沈金
甘云锋
黄晓婧
李小健
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Zhejiang Tmall Technology Co Ltd
Original Assignee
Alibaba Group Holding Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Alibaba Group Holding Ltd filed Critical Alibaba Group Holding Ltd
Priority to CN201410624275.2A priority Critical patent/CN105630475B/en
Publication of CN105630475A publication Critical patent/CN105630475A/en
Application granted granted Critical
Publication of CN105630475B publication Critical patent/CN105630475B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Landscapes

  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

The application provides a kind of data label organization system, comprising: data label application module, for business datum label needed for applying according to user instructions;Data label collector, for required business datum label to be compiled as the SQL statement based on stsndard SQL according to the metadata information of definition;And memory module is executed, for executing and storing the SQL statement based on stsndard SQL after compiling.The application also provides a kind of data label method for organizing, comprising: request for data label;Define metadata information;Required business datum label is compiled as the SQL statement based on stsndard SQL according to metadata information;And it executes and stores the SQL statement based on stsndard SQL.The application can obtain the data label specified in various data platforms by once defining data service filtering rule automatically, and so as to meet, user is easy, the requirements of various data is efficiently and accurately obtained from different data platforms.

Description

A kind of data label organization system and method for organizing
Technical field
This application involves a kind of data label administrative skill more particularly to a kind of data label organization system and organizers Method.
Background technique
Currently, there are many different data suppliers in open data platform trade market, provide up to ten million Data label.Specified data label is obtained to usually require by manually carrying out data processing, and data processing is roughly divided into Two major classes: Transaction Processing (OLTP) and on-line analytical processing (OLAP), OLTP are the main application of relevant database, property Can it is upper by the response time be measurement standard;OLAP is the main application of data warehouse, using handling capacity as main criterion. In both data application environment, it is necessary to expend a large amount of manpower and material resources, by artificial a large amount of operation business logic codes, It can support various data label demands complicated and changeable.
Preceding method from requirement description business to final result, is needing third party technology personnel to intervene, inevitably can be because of industry Business understands that difference causes final development result different from demand;Or presence although different demands there are general character, it is still desirable to weight Multiple exploitation, causes development efficiency not high, the defect of generally applicable property difference.
In view of this, it is necessary to provide a kind of suitable for different data platform, can carry out data label tissue is System and method, to meet, user is easy, the requirements of various data is efficiently and accurately obtained from different data platforms.
Summary of the invention
This application provides a kind of data label organization systems, comprising: data label application module, for being referred to according to user Enable business datum label needed for applying;And data label collector, for the metadata information according to definition by required industry Business data label is compiled as the SQL statement based on stsndard SQL.
Present invention also provides a kind of data label method for organizing, comprising: request for data label;Define metadata information; And required business datum label is compiled as the SQL statement based on stsndard SQL according to the metadata information.
It, can be by once defining data service filtering rule using the data label organization system and method for organizing of the application Then, the data label specified in various data platforms can be obtained automatically, so as to meet user it is easy, efficiently and accurately from Different data platform obtains the requirement of various data.
Detailed description of the invention
Reader is after the specific embodiment for having read the application referring to attached drawing, it will more clearly understands the application's Various aspects.Wherein,
Fig. 1 is the module diagram of the data label organization system of the application;
Fig. 2 is the submodule schematic diagram of the module 121 in the data label organization system of Fig. 1;
Fig. 3 is the data tag information parameter E-R schematic diagram in the data label organization system of the application;
Fig. 4 is the data tag information parameter SQL list schematic diagram of Fig. 3;
Fig. 5 is the preferred flow schematic diagram of the data label method for organizing of the application;
Fig. 6 is the preferred flow schematic diagram of the step 200 in the data label method for organizing of Fig. 5;
Fig. 7 is the preferred flow schematic diagram of the step 300 in the data label method for organizing of Fig. 5.
Specific embodiment
In order to keep techniques disclosed in this application content more detailed with it is complete, can refer to the following of attached drawing and the application Various specific embodiments, identical label represents the same or similar component in attached drawing.However, those skilled in the art It should be appreciated that embodiment provided hereinafter not is used to limit the range that the application is covered.In addition, attached drawing is used only for It is schematically illustrated, and is drawn not according to its full size.
In a typical configuration of this application, terminal, the equipment of service network and trusted party include one or more Processor (CPU), input/output interface, network interface and memory.Memory may include impermanent in computer-readable medium Property memory, the forms such as random access memory (RAM) and/or Nonvolatile memory, such as read-only memory (ROM) or flash memory (flash RAM).Memory is the example of computer-readable medium.Computer-readable medium includes permanent and impermanency, can Mobile and non-removable media can be accomplished by any method or technique information storage.Information can be computer-readable finger It enables, the subelement of data structure, program or other data.The example of the storage medium of computer includes, but are not limited in phase transformation Deposit (PRAM), static random access memory (SRAM), dynamic random access memory (DRAM), other kinds of arbitrary access Memory (RAM), read-only memory (ROM), electrically erasable programmable read-only memory (EEPROM), flash memory or other Memory techniques, read-only disc read only memory (CD-ROM) (CD-ROM), digital versatile disc (DVD) or other optical storages, magnetic holder formula Tape, the storage of tape magnetic hard disk or other magnetic storage devices or any other non-transmission medium, can be used for storing can be counted Calculate the information of equipment access.As defined in this article, computer-readable medium does not include non-temporary computer readable media (transitory media), such as data-signal and carrier wave of modulation.
With reference to the accompanying drawings, the specific embodiment of the application various aspects is described in further detail.
Referring to Fig. 1, showing the module diagram of the data label organization system of the application.The data label tissue System 1 is interactively communicated with user 2 by visualization interface, can be according to the instruction of user 2, according to user 2 to the need of data label Tissue is asked to compile, to provide the SQL statement based on stsndard SQL for inquiry for user 2.
In the preferred embodiment of the application, the data label organization system 1 includes data label application module 11, data label collector 12 and execute memory module 13.Wherein, data label application module 11 according to user for referring to Enable business datum label needed for applying, data label collector 12 is used for the metadata information according to definition for required business number Be compiled as the SQL statement based on stsndard SQL according to label, execute memory module 13 be used to execute and store after compiling based on standard The SQL statement of SQL.
Specifically, in another preferred embodiment of the application, data label collector 12 further include: data label is fixed Adopted module 120 and program module 121.Wherein, data label definition module 120 is used for pre- according to metadata information definition If data tag information, program module 121 is for compiling required business datum label according to the preset data label information For the SQL statement based on stsndard SQL.
It executes memory module 13 and still further comprises execution module 130 and memory module 131.Wherein, execution module 130 for executes compiling after the SQL statement based on stsndard SQL, memory module 131 be used for store compile after based on standard The SQL statement of SQL.
In the present embodiment, data label definition module 120 defines preset data label letter according to the metadata information Breath.Wherein the metadata information include building data label Entity-Relationship figure (Entity Relationship Diagram, E-R figure), and logical message and physical message according to E-R figure setting data label.For example, the E-R figure includes Preset data definition for tag information is as follows:
Data label data source, for determining the storage information of basic data label;
Data label amalgamation mode, for determining the amalgamation mode of the basic data label;
Data label factor logic, for determine the data label and basic data label relationship and the basic number According to the filtering rule of label;
The service logic of data label, for determining that the service logic of the data label and the data label factor joins System;Applied data label demand polymerize dimension for determine applied data label and data label.
Data label container, for determining the storage location of the data label;And
Data label quality, for determining that the quality of data of the data label meets the requirement of the metadata information.
Specifically, it please cooperate while be shown refering to fig. 1 with Fig. 3, Fig. 3 in another embodiment of the application and define described preset Data tag information schematic diagram concisely shows the connection relationship example of the label entries in preset E-R figure.Wherein, number It is that multi-to-multi (M:N) quotes relationship type according to tagging element logical AND data label data source;Data label factor logic and number It is that multi-to-multi (M:N) constrains relationship type according to tag fusion mode;Data label factor logic and data label service logic are more Relationship type is defined to more (M:N).Applied data label demand and data label service logic are (1:N) more than 1 pair definition Relationship type;Applied data label demand and label container are 1 pair 1 (1:1) storage relationship type, applied data mark Label demand and label quality are 1 pair 1 (1:1) monitoring relationship type.
Further, to solid data label information each in the E-R figure carry out logic setting, please refer to Fig. 3 with Fig. 4 schematically illustrates the logic set content.In the present embodiment, the logic setting includes according to the data label Factor logic is arranged data label factor logic table, data label service logic is arranged according to the data label service logic Basic label is arranged according to data label demand setting data label demand schedule, according to the data label data source in table Basic label fusion table is arranged according to the tag fusion mode, holds according to data label setting data label for container table Device table and according to the data label quality settings data label quality table.
Wherein, data label factor logic table and data label service logic table are multi-to-multi (M:N) connection.In this implementation In example, it is named as [tags_factor_tab] according to the data label factor logic table, for determining tagging element and basis The relationship of label and the storage position of basic label, the service filtering rule of basic label.It wherein at least may include data Tagging element identifies (factor_id), and basic label container table identifies (src_tab_id) and availability deciding (is_ Validate), to judge the validity of the relationship and storage position.In the other embodiments of the application, the number It can also further comprise other content, such as continuous sex determination (is_constant) etc. according to tagging element logical table.
Data label service logic table and data label demand schedule are multipair 1 (N:1) connection.In the present embodiment, described Data label service logic table is named as [tags_expr_tab], for determine data label and tagging element logical relation, Data label alias and whether polymerizable, including at least (control is in TAB or COL for the definition of data label expression formula and type On).In the present embodiment, the data label service logic table includes at least data label service logic mark (expr_id), Service logic type (expr_type), the service logic factor (expr_factor) and availability deciding (is_validate).? In the other embodiments of the application, data label service logic table can further include polymerization dimension and determine (is_ Aggregate) etc..
Data label demand schedule is 1 pair 1 with data label container table and contacts.In the present embodiment, the data label needs Table is asked to be named as [tags_demand_tab], for determining the expression list of data label and the expression formula column of label aggregation Table, while defining the data storage position of data label.In the present embodiment, the data label demand schedule includes at least data Tagged traffic logical identifier (expr_id), convergence service logical identifier (aggre_expr_id), data label container table mark (target_tab_id) and availability deciding (is_validate), for judging the expression formula and the storage position is It is no effective.In the other embodiments of the application, data label demand schedule can further include other data label numbers It is believed that breath.
Basic label container table and data label factor logic table are multi-to-multi (M:N) connection.In the present embodiment, described Basic label container table is named as [tags_src_tab], for determining the specific storage position of basic label.In the present embodiment In, the basic label container table includes at least mark (tab_id), table name (tab_name) and the availability deciding of the table (is_validate).In the other embodiments of the application, basic label container table can further include other bases Label data information.
Basic label merges table and data label factor logic table as multi-to-multi (M:N) connection.In the present embodiment, described Basic label container table is named as [tags_join_tab], and for determining the data fusion mode of basic label, i.e. determination passes through Naturally it is associated with, or the one of outer association, field association is associated and its Correlation Criteria.In the present embodiment, the basis Tag fusion table includes at least left-handed watch mark (left_tab_id), right table identifies (right_tab_id), fused type (join_ ) and availability deciding (is_validate) type.In the other embodiments of the application, basic label merges table can be with It further comprise other basic label fused data information.
Data label container table and data label demand schedule are 1 pair 1 (1:1) connection.In the present embodiment, the data mark Label container table is named as [tags_target_tab], for determining the specific storage position of data label.In the present embodiment, The data label container table include at least the mark (tab_id) of the table, table name (tab_name), table type (tab_type), And availability deciding (is_validate).In the other embodiments of the application, data label container table can also be further Including (partition_ during such as subregion judgement (is_partition), zone name (partition_name), subregion Other data label data informations such as period).
Data label quality table and data label demand schedule are 1 pair 1 (1:1) connection.In the present embodiment, the data mark Label container table is named as [tags_quality_tab], for determining the quality of data of data label.In the present embodiment, described Data label quality table includes at least the mark (demand_id) of data label demand schedule, data label quality table name (qa_ ) and tabular value (qa_value) name.In the other embodiments of the application, data label quality table be can further include The quality data information of other data labels.
It should be noted that in the present embodiment, aforementioned each table is disposed as corresponding preset data tag information ginseng Number is after logic is arranged, suitable for standardizing the ANSI SQL list of SQL operation.Further, to aforementioned each ANSI SQL table And its design parameter carries out physics setting.In the present embodiment, major key mark (id) is respectively provided in all lists;Data label Creation time records (gmt-created), to obtain the data label creation time parameter;And data label modification time It records (gmt-modified), to obtain the data label modification time parameter.In the present embodiment, the number in aforementioned each table It is disposed as character string (STRING) type according to tag parameter, major key mark is set as integer (INT) type, the data label Creation and modification time record are set as date (DATE) type.In the other embodiments of the application, the word of the parameter Symbol type is arranged can give accommodation according to actual demand and Platform Requirements, and cited aforesaid way is not answered in the present embodiment It is considered as any restrictions or constraint to the application.
The main idea of the application is illustrated for simplicity, the relationship and function of each module in the present embodiment is elaborated, please simultaneously The submodule schematic diagram of module 121 in data label organization system in Fig. 1 is shown with Fig. 2, Fig. 2 refering to fig. 1.In the present embodiment In, preset metadata and data label parameter support SQL type set, and are stored in corresponding tabular form It can run in the computer hardware, software and network of SQL.The present embodiment is only explained by taking the table class of ANSI SQL as an example.
In the present embodiment, program module 121 further comprises selecting module 1211, source module 1213, condition module 1215, aggregation module 1217, insertion module 1219 and package module 1220.Wherein, selecting module 1211 is described for obtaining The output field of the corresponding SQL list of data tag information parameter, to obtain the output field parameter of the data label.Selection Module 1211 can be instructed by [select] and be interacted with user.
Source module 1213 is used to obtain the metadata of data label in the ANSI SQL table (table), and according to institute The interrelational form parameter that data label data information determines the data label is stated, including dynamic obtains and needs associated table, And the correct association sequence etc. of the table is obtained according to metadata information dynamic above-mentioned.Source module 1213 can pass through [from] Instruction is interacted with user.For example, according to data label demand, need to be associated with A, B, C, D, E5 tables, and AB passes through interior company Association is connect, B is associated with D, E by left outside connection, and B is associated with A, C by left outside connection, schemes to be arranged according to aforementioned E-R, Yi Jixiang The algorithm answered, such as by the breadth-first search algorithm of digraph, traverse all nodes and finally obtain as shown in following table one Five kinds of interrelational forms:
Table one
It is final association sequence that all tabular sequences, which can wherein be covered, thus the 5th kind of table interrelational form can be selected, from And determine the interrelational form parameter of wherein data label.
Condition module 1215 is used to obtain the alternative condition of its data label in the ANSI SQL table, determines the number According to the filtering rule parameter of label.Condition module 1215 can be instructed by [where] and be interacted with user.
Aggregation module 1217 is used to obtain the aggregation information of its data label in the ANSL SQL table, determines the number According to the polymerization dimensional parameter of label.Aggregation module 1217 can be instructed by [group] and be interacted with user;.
Insertion module 1219 is used to determine the object table according to the insertion of its data label in the ANSL SQL table The detail parameters of data label in lattice.Insertion module 1219 can be instructed by [group] and be interacted with user.
Package module 1220 is for controlling the selecting module 1211, source module 1213, condition module 1215, aggregation module 1217 and insertion module 1219 according to the output field of aforementioned data label, data label interrelational form, filtering rule, polymerization Dimension and the detail parameters of target data label are compiled, to obtain the SQL statement based on stsndard SQL.Package module 1220 can be interacted by [package] instruction with user.
To which user 2 is interacted by general visualization interface with standard ANSI SQL, and data label organization system passes through Receive label requirements instruction, and feedback target data label is to user 2.
It, can be by once defining data service filtering rule, just as a result, by the data label organization system of the application The data label specified in various data platforms can be obtained, it is easy, efficiently and accurately flat from different data so as to meet user Platform obtains the requirement of various data.
It please refers to shown in Fig. 5, is the preferred flow schematic diagram of data label method for organizing in one embodiment of the application.It please be same When refering to fig. 1~Fig. 5, Fig. 5 is clear from below in conjunction with the data label organization system of FIG. 1 to FIG. 4.
In the present embodiment, the data label method for organizing the following steps are included:
Step 100, request for data label.In the present embodiment, the data label system of the application passes through the data mark Business datum label needed for label application module is applied according to user instructions.
Step 200, metadata information is defined.In the present embodiment, the metadata information system passes through the data label Definition module definition, defining the metadata information includes building data label Entity-Relationship figure (Entity Relationship Diagram, E-R figure), and scheme according to the E-R to define the SQL list of data label parameter logistics setting with And physics setting.
Please refer to Fig. 6, it show the preferred flow schematic diagram of step 200 in Fig. 5.In another preferred reality of the application It applies in example, the step 200 further comprises the step of defining preset data label information according to the metadata information, specifically Ground:
Step 2001, data label data source is defined, to determine the storage information of basic data label.In the present embodiment In, the data label data source is realized by the basic label container table being arranged through logic.In the present embodiment, described through patrolling Collecting the corresponding data tag parameter list being arranged is the parameter list suitable for ANSI SQL, wherein corresponding data label is joined Number is disposed as character string (STRING) type, and specific implementation please refers to the tool previously in conjunction with Fig. 1, Fig. 2, Fig. 3 and Fig. 4 Body illustrates that details are not described herein.
Step 2003, data label amalgamation mode is defined, with the amalgamation mode of the determination basic data label.In this reality It applies in example, the amalgamation mode of the data label is realized by the data label fusion table being arranged through logic.Its specific implementation side Formula also please refers to aforementioned, and details are not described herein.
Step 2005, data label factor logic is defined, with the relationship of the determination data label and basic data label And the filtering rule of the basic data label.In the present embodiment, the data label factor logic through logic by being arranged Data label factor logic table is realized.Its specific implementation also please refers to aforementioned, and details are not described herein.
Step 2007, the service logic of data label is defined, with the determination data label and the data label factor Service logic connection.In the present embodiment, the service logic of the data label is by the data label industry being arranged through logic Business logical table is realized.Its specific implementation also please refers to aforementioned, and details are not described herein.
Step 2009, applied data label demand is defined, with the applied data label of determination and the data mark Sign the expression parameter of polymerization.In the present embodiment, it defines applied data label demand and has further determined data label Data storage location.The polymerization dimension of the data label is realized by the data label demand schedule being arranged through logic.It has Body implementation also please refers to aforementioned, and details are not described herein.
Step 2011, data label container is defined, with the storage location of the determination data label.In the present embodiment, The data label container is realized by the data label container table being arranged through logic.Its specific implementation please refer to it is aforementioned, Details are not described herein.
Step 2013, data label quality is defined, the metadata is met with the quality of data of the determination data label The requirement of information.In the present embodiment, the data label quality is realized by the data label quality table being arranged through logic.Its Specific implementation also please refers to aforementioned, and details are not described herein.
It preferably, can also be to aforementioned defined data label parameter and its corresponding in another embodiment of the application SQL list carries out physics setting, and the physics set content is refering to aforementioned, and details are not described herein.The physics is arranged also into one Step the following steps are included:
Step 2015, the major key mark of the SQL table is defined.In the present embodiment, the present count of step 2001~2013 Realize that then in the present embodiment, the primary key is sensible by the corresponding SQL parameter list being arranged through logic according to label information parameter Should be set in the data tag information parameter SQL table, and the major key mark be set as integer (INT) type, so as to Family is interacted with the data label organization system, and is safeguarded to respective list.
Step 2017, data label creation time record is defined, to obtain the data label creation time parameter, so as to Corresponding SQL table is safeguarded.
Step 2019, data label creation time record is defined, to obtain the data label creation time parameter, so as to Corresponding SQL table is safeguarded.In the present embodiment, the data label creation and modification time record are set as the date (DATE) type.
Step 300, required business datum label is compiled as to the SQL language based on stsndard SQL according to the metadata information Sentence.
Please refer to Fig. 7, it show the preferred flow schematic diagram of the step 300 in another embodiment of the application.At this In another preferred embodiment of application, the step 300 further comprises according to the preset data label information by required industry The step of business data label is compiled as the SQL statement based on stsndard SQL, specifically:
Step 3001, the output field parameter of the data label is obtained.In the present embodiment, which passes through selection mould Block 1211 obtains the output field parameter of the data label.The specific working mode of the selecting module please refers to be explained above It states, details are not described herein.
Step 3003, the metadata of the data label is obtained, and according to preset data label information determination The interrelational form parameter of data label.In the present embodiment, source module 1213 is used to obtain the metadata of the data label, and The interrelational form parameter of the data label is determined according to the preset data label information.It is associated that needs are obtained including dynamic Table, and the correct association sequence etc. of the table is obtained according to metadata information above-mentioned dynamic, specific working mode is asked Refering to already explained, details are not described herein.
Step 3005, the alternative condition for obtaining the data label determines the filtering rule parameter of the data label.? In the present embodiment, condition module 1215 is used to obtain the alternative condition of data label, determines the filtering rule of the data label Parameter, specific working mode please refer to already explained, and details are not described herein.
Step 3007, the aggregation information for obtaining the data label determines the polymerization dimensional parameter of the data label.? In the present embodiment, aggregation module 1217 is used to obtain the aggregation information of its data label in the ANSL SQL table, determines institute State the polymerization dimensional parameter of data label, specific working mode please refers to already explained, and details are not described herein.
Step 3009, the detail parameters of the target data label are determined according to the insertion of the data label.In this reality It applies in example, aggregation module 1217 is used to obtain the aggregation information of its data label in the ANSL SQL table, determines the number According to the polymerization dimensional parameter of label, specific working mode please refers to already explained, and details are not described herein.
Step 3011, the detail parameters of the data label are determined according to the insertion of the data label.In the present embodiment, Package module 1220 for control the selecting module 1211, source module 1213, condition module 1215, aggregation module 1217 and Be inserted into module 1219 according to the output field of aforementioned data label, data label interrelational form, filtering rule, polymerization dimension and The detail parameters of target data label are compiled, to obtain the standardization SQL.Its specific working mode please refers to be explained above It states, details are not described herein.
In view of this, user 2 is interacted by general visualization interface with standard ANSI SQL, pass through the data label Method for organizing receives label requirements instruction, and feedback target data label is to user 2.
It further, further include step 400 in the another preferred embodiment of the application, to execute and store the volume The SQL statement based on stsndard SQL after translating.
It from the foregoing, can be by once defining data industry by the data label organization system and method for the application Be engaged in filtering rule, the data label specified in various data platforms can be obtained automatically, so as to meet user it is easy, efficiently, The requirement of various data is accurately obtained from different data platforms.
Above, the specific embodiment of the application is described with reference to the accompanying drawings.But those skilled in the art It is understood that can also make to the specific embodiment of the application each without departing from spirit and scope Kind change and replacement.These changes and replacement are all fallen in the claim of this application book limited range.

Claims (7)

1. a kind of data label organization system, for the data platform of data label interaction can be carried out, to be obtained according to user demand The fixed data label of fetching, which is characterized in that the data label organization system includes:
Data label application module, for business datum label needed for applying according to user instructions;And data label defines mould Block, for defining preset data label information according to metadata information;
Program module, for required business datum label to be compiled as based on stsndard SQL according to the preset data label information SQL statement;
Wherein, the preset data definition for tag information is as follows:
Data label data source, for determining the storage information of basic data label;
Data label amalgamation mode, for determining the amalgamation mode of the basic data label;
Data label factor logic, for determine the data label and basic data label relationship and the basic data mark The filtering rule of label;
The service logic of data label, for determining that the data label and the service logic of the data label factor contact;
Applied data label demand polymerize dimension for determine applied data label and data label;
Data label container, for determining the storage location of the data label;And
Data label quality, for determining that the quality of data of the data label meets the requirement of the metadata information.
2. data label organization system according to claim 1, which is characterized in that the data label organization system is also wrapped Execution memory module is included, for executing and storing the SQL statement based on stsndard SQL after compiling.
3. data label organization system according to claim 1, which is characterized in that described program module further include:
Selecting module, for obtaining the output field parameter of the data label;
Source module determines the number for obtaining the metadata of the data label, and according to the preset data label information According to the interrelational form parameter of label;
Condition module determines the filtering rule parameter of the data label for obtaining the alternative condition of the data label;
Aggregation module determines the polymerization dimensional parameter of the data label for obtaining the aggregation information of the data label;
It is inserted into module, the detail parameters of the data label are determined for the insertion according to the data label;And
Package module is compiled for controlling the selecting module, source module, condition module, aggregation module and insertion module It translates, to obtain the SQL statement based on stsndard SQL.
4. a kind of data label method for organizing, for the data platform of data label interaction can be carried out, to be obtained according to user demand The fixed data label of fetching, which is characterized in that the data label method for organizing includes:
Request for data label;
Define metadata information;And
Preset data label information is defined according to the metadata information;And
Required business datum label is compiled as the SQL statement based on stsndard SQL according to the preset data label information;
Wherein, the preset data definition for tag information is as follows:
Data label data source, for determining the storage information of basic data label;
Data label amalgamation mode, for determining the amalgamation mode of the basic data label;
Data label factor logic, for determine the data label and basic data label relationship and the basic data mark The filtering rule of label;
The service logic of data label, for determining that the data label and the service logic of the data label factor contact;
Applied data label demand polymerize dimension for determine applied data label and data label;
Data label container, for determining the storage location of the data label;And
Data label quality, for determining that the quality of data of the data label meets the requirement of the metadata information.
5. data label method for organizing according to claim 4, which is characterized in that further include:
It executes and stores the SQL statement based on stsndard SQL after compiling.
6. data label method for organizing according to claim 4, which is characterized in that above-mentioned according to the preset data label Required business datum label is compiled as the SQL statement based on stsndard SQL by information further include:
Obtain the output field parameter of the data label;
The metadata of the data label is obtained, and determines the association of the data label according to the preset data label information Mode parameter;
The alternative condition for obtaining the data label determines the filtering rule parameter of the data label;
The aggregation information for obtaining the data label determines the polymerization dimensional parameter of the data label;
The detail parameters of the data label are determined according to the insertion of the data label;And
According to the output field parameter of the data label, interrelational form parameter, filtering rule parameter, polymerization dimensional parameter and Detail parameters are compiled, to obtain the SQL statement based on stsndard SQL.
7. data label method for organizing according to claim 4, which is characterized in that above-mentioned fixed according to the metadata information Adopted preset data label information further comprises:
The information parameter SQL list of the data label is set, and defines its major key mark;
Data label creation time record is defined, to obtain the creation time parameter of the data label;And
Data label modification time record is defined, to obtain the modification time parameter of the data label.
CN201410624275.2A 2014-11-06 2014-11-06 A kind of data label organization system and method for organizing Active CN105630475B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201410624275.2A CN105630475B (en) 2014-11-06 2014-11-06 A kind of data label organization system and method for organizing

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201410624275.2A CN105630475B (en) 2014-11-06 2014-11-06 A kind of data label organization system and method for organizing

Publications (2)

Publication Number Publication Date
CN105630475A CN105630475A (en) 2016-06-01
CN105630475B true CN105630475B (en) 2018-12-21

Family

ID=56045465

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201410624275.2A Active CN105630475B (en) 2014-11-06 2014-11-06 A kind of data label organization system and method for organizing

Country Status (1)

Country Link
CN (1) CN105630475B (en)

Families Citing this family (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108874971B (en) * 2018-06-07 2021-09-24 北京赛思信安技术股份有限公司 Tool and method applied to mass tagged entity data storage
CN108809770A (en) * 2018-07-26 2018-11-13 郑州云海信息技术有限公司 A kind of resource monitoring method and system
CN109063151B (en) * 2018-08-08 2022-07-12 中国建设银行股份有限公司 Commercial bank data fusion method and device
CN109189774A (en) * 2018-09-14 2019-01-11 南威软件股份有限公司 A kind of user tag method for transformation and system based on script rule
CN110765100B (en) * 2019-09-09 2022-08-02 天云软件技术有限公司 Label generation method and device, computer readable storage medium and server
CN111858280B (en) * 2020-07-16 2024-02-27 中国工商银行股份有限公司 SQL information processing method, device, equipment and system
CN112785368A (en) * 2020-12-24 2021-05-11 江苏苏宁云计算有限公司 Label production method, management method, device and system
CN116361341B (en) * 2023-03-20 2024-02-13 北京白驹易行科技有限公司 Crowd-sourced circle selection method, crowd-sourced circle selection device, computer equipment and medium

Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20040254948A1 (en) * 2003-06-12 2004-12-16 International Business Machines Corporation System and method for data ETL in a data warehouse environment
CN101076793A (en) * 2004-08-31 2007-11-21 国际商业机器公司 System structure for enterprise data integrated system
CN101324846A (en) * 2008-07-08 2008-12-17 国电南瑞科技股份有限公司 Method for creating data model according to ASN.1 information dynamic state
CN101788992A (en) * 2009-05-06 2010-07-28 厦门东南融通系统工程有限公司 Method and system for converting query sentence of database
CN102254008A (en) * 2011-07-18 2011-11-23 深圳证券信息有限公司 Method and system for setting dynamic data label
CN103177008A (en) * 2011-12-22 2013-06-26 北大方正集团有限公司 Method and system used for generating and executing structured query language (SQL) statement
CN103559243A (en) * 2013-10-28 2014-02-05 陶睿 Method and system for searching users in mobile devices on basis of labels

Patent Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20040254948A1 (en) * 2003-06-12 2004-12-16 International Business Machines Corporation System and method for data ETL in a data warehouse environment
CN101076793A (en) * 2004-08-31 2007-11-21 国际商业机器公司 System structure for enterprise data integrated system
CN101324846A (en) * 2008-07-08 2008-12-17 国电南瑞科技股份有限公司 Method for creating data model according to ASN.1 information dynamic state
CN101788992A (en) * 2009-05-06 2010-07-28 厦门东南融通系统工程有限公司 Method and system for converting query sentence of database
CN102254008A (en) * 2011-07-18 2011-11-23 深圳证券信息有限公司 Method and system for setting dynamic data label
CN103177008A (en) * 2011-12-22 2013-06-26 北大方正集团有限公司 Method and system used for generating and executing structured query language (SQL) statement
CN103559243A (en) * 2013-10-28 2014-02-05 陶睿 Method and system for searching users in mobile devices on basis of labels

Also Published As

Publication number Publication date
CN105630475A (en) 2016-06-01

Similar Documents

Publication Publication Date Title
CN105630475B (en) A kind of data label organization system and method for organizing
US8108360B2 (en) Database object update order determination
US8364723B1 (en) Apparatus and method for realizing big data into a big object and non-transitory tangible machine-readable medium thereof
CN105760419A (en) Method And Systme For Join Processing
CN108932257A (en) The querying method and device of multi-dimensional data
CN108334515A (en) The method, apparatus and system of stack address in file are collapsed in a kind of processing
US11853279B2 (en) Data storage using vectors of vectors
CN109117433B (en) Index tree object creation and index method and related device thereof
CN106095964A (en) A kind of method that data are carried out visualization filing and search
CN108009296A (en) A kind of SQL query method, system and relevant apparatus based on Hbase
CN106547870A (en) Point table method and device of data base
CN110427364A (en) A kind of data processing method, device, electronic equipment and storage medium
CN109003012B (en) Goods location recommendation link information acquisition method, goods location recommendation method, device and system
CN117235285B (en) Method and device for fusing knowledge graph data
CN109460312A (en) Request the processing method and processing device of failure
US11645283B2 (en) Predictive query processing
RU2020113119A (en) Method for providing a software tool and a data processing system
CN115293243A (en) Method, device and equipment for realizing intelligent matching of data assets
CN107562533A (en) A kind of data loading processing method and device
JP2014157598A (en) Software asset management device, software asset management method, and software asset management program
US10430775B1 (en) Validation and lookup techniques for rule-based data categorization
US20120078943A1 (en) High quantitative pattern searching using spatial indexing
CN110245265A (en) A kind of object classification method, device, storage medium and computer equipment
CN110442604A (en) Data flow querying method, abstracting method, processing method and relevant apparatus
CN110019186A (en) The method and device of data storage

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant
TR01 Transfer of patent right
TR01 Transfer of patent right

Effective date of registration: 20211104

Address after: Room 507, floor 5, building 3, No. 969, Wenyi West Road, Wuchang Street, Yuhang District, Hangzhou City, Zhejiang Province

Patentee after: ZHEJIANG TMALL TECHNOLOGY Co.,Ltd.

Address before: A four-storey 847 mailbox in Grand Cayman Capital Building, British Cayman Islands

Patentee before: ALIBABA GROUP HOLDING Ltd.