Summary of the Invention
To solve the above technical problem, the invention provides a data processing method and device that can find the data points and unified parameters matching the characteristics of a business analysis requirement, thereby improving the efficiency of data processing and analysis.
The embodiments of the invention disclose the following technical solutions:
A data processing method, in which data points are summarized from data, each data point identifying the acquisition logic for obtaining a characteristic value; corresponding unified parameters are determined according to the data points, the unified parameters identifying the contextual information needed to obtain the characteristic value under that acquisition logic; and the data points and unified parameters are saved in advance. The method includes:
determining, according to a business analysis requirement, the business variable needed to fulfil the business analysis requirement;
finding the data point and the unified parameters used to calculate the business variable;
retrieving the value of the characteristic value according to the data point and the unified parameters;
calculating the value of the business variable from the value of the characteristic value;
fulfilling the business analysis requirement according to the value of the business variable.
Optionally, before finding the data point and the unified parameters used to calculate the business variable, the method further includes:
determining the target variable and the transfer function used to calculate the business variable, the target variable including information about the data point and the unified parameters;
and finding the data point and the unified parameters used to calculate the business variable includes:
finding the pre-saved data point and unified parameters according to the information about the data point and the unified parameters.
Optionally, calculating the value of the business variable from the value of the characteristic value includes:
calculating the value of the business variable from the transfer function and the value of the characteristic value.
Optionally, the data point is defined in a dynamic scripting language.
Optionally, the target variable is obtained by instantiating the data point and the unified parameters.
A data processing device, including:
a summarizing unit, configured to summarize data points from data, each data point identifying the acquisition logic for obtaining a characteristic value; to determine corresponding unified parameters according to the data points, the unified parameters identifying the contextual information needed to obtain the characteristic value under that acquisition logic; and to save the data points and unified parameters in advance;
a determining unit, configured to determine, according to a business analysis requirement, the business variable needed to fulfil the business analysis requirement;
a searching unit, configured to find the data point and the unified parameters used to calculate the business variable;
a retrieving unit, configured to retrieve the value of the characteristic value according to the data point and the unified parameters;
a computing unit, configured to calculate the value of the business variable from the value of the characteristic value;
an analyzing unit, configured to fulfil the business analysis requirement according to the value of the business variable.
Optionally, the determining unit is further configured to determine, before the searching unit is triggered, the target variable and the transfer function used to calculate the business variable, the target variable including information about the data point and the unified parameters;
the searching unit is further configured to find the pre-saved data point and unified parameters according to the information about the data point and the unified parameters.
Optionally, the computing unit is further configured to calculate the value of the business variable from the transfer function and the value of the characteristic value.
Optionally, the data point is defined in a dynamic scripting language.
Optionally, the target variable is obtained by instantiating the data point and the unified parameters.
As can be seen from the above technical solutions, data points are summarized from data in advance, each data point identifying the acquisition logic for obtaining a characteristic value; corresponding unified parameters are determined according to the data points, the unified parameters identifying the contextual information needed to obtain the characteristic value under that acquisition logic; and the data points and unified parameters are saved in advance. Because almost any business analysis has to obtain characteristic values through various acquisition logics, data points summarized by acquisition logic are applicable to different business analysis requirements. When a business analysis requirement arises, the corresponding data points and unified parameters can be found according to the characteristics of that requirement, which improves the efficiency of data processing and analysis.
Detailed Description of the Embodiments
To make the purpose, technical solutions and advantages of the embodiments of the invention clearer, the technical solutions in the embodiments are described clearly below with reference to the accompanying drawings. Obviously, the described embodiments are some, not all, of the embodiments of the invention. All other embodiments obtained by a person of ordinary skill in the art from the embodiments of the invention without creative effort fall within the scope of protection of the invention.
In a big data setting, fulfilling a business analysis requirement means retrieving a large amount of targeted data and analyzing it. To make the analysis accurate, the traditional approach is for an engineer to write hard-coded logic tailored to the characteristics and specific needs of that one business analysis requirement, and then to retrieve the corresponding data from the big data according to the hard-coded logic and analyze it to obtain a result that meets the requirement. However, different business analysis requirements have different characteristics, so the targeted coding often has to be redone for each one, which makes data processing and analysis inefficient.
Therefore, the embodiments of the invention provide a data processing method and device. Data points are summarized from data in advance, each data point identifying the acquisition logic for obtaining a characteristic value; corresponding unified parameters are determined according to the data points, the unified parameters identifying the contextual information needed to obtain the characteristic value under that acquisition logic; and the data points and unified parameters are saved in advance. Because almost any business analysis has to obtain characteristic values through various acquisition logics, data points summarized by acquisition logic are applicable to different business analysis requirements; the data points and unified parameters act as the basic units of the standardized data. When a business analysis requirement arises, the corresponding data points and unified parameters can be found according to the characteristics of that requirement, which improves the efficiency of data processing and analysis.
Before describing how the embodiments of the invention process a business analysis requirement, the pre-standardization of the data is explained first.
Data points can be summarized from the data, each data point identifying the acquisition logic for obtaining a characteristic value. Corresponding unified parameters are determined according to the data points; the unified parameters identify the contextual information needed to obtain the characteristic value under that acquisition logic.
The data mentioned here can be understood as a data set to be analyzed that serves as the basis for business analysis, or as a part of that data set. A characteristic value is a specific indicator that can be used for business analysis, for example a user's account balance, a user's (buyer's or seller's) transaction history, a user's email, a trusted address, or a mobile phone's home location. A data point is the acquisition logic for obtaining a characteristic value and can be understood as an action, for example obtaining the user account balance, obtaining the trusted address, or obtaining the mobile phone home location.
The unified parameters are the information needed to carry out the action of obtaining the characteristic value. If the data point is obtaining the user account balance, the unified parameters can include the user name (user id) or other information that locates the user account and is needed to obtain the account balance.
It should be noted that summarization can yield a large number of data points and a large number of unified parameters. One data point can correspond to multiple unified parameters, and one unified parameter can correspond to multiple data points; the specific correspondence is determined by the specific acquisition logic.
Once the data points and unified parameters have been determined, they can be stored in a database so that they can be called during data analysis.
Optionally, the data point is defined in a dynamic scripting language. The dynamic scripting language can be Groovy (an agile development language that runs on the Java Virtual Machine).
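The pre-standardization described above can be sketched roughly as follows. This is an illustrative sketch only: it uses Python rather than Groovy, an in-memory dictionary stands in for the database, and the names `DataPoint`, `REGISTRY`, `save_data_point`, `ACCOUNTS` and the sample values are all assumptions introduced for the example.

```python
from dataclasses import dataclass
from typing import Any, Callable, Dict, Tuple

# Hypothetical in-memory stand-in for the underlying data set.
ACCOUNTS = {"u1": {"balance": 120.0, "email": "u1@example.com"}}


@dataclass(frozen=True)
class DataPoint:
    name: str                     # identifies the acquisition logic
    param_names: Tuple[str, ...]  # unified parameters the logic needs
    fetch: Callable[..., Any]     # the acquisition logic itself


# Hypothetical registry standing in for the database of pre-saved data points.
REGISTRY: Dict[str, DataPoint] = {}


def save_data_point(dp: DataPoint) -> None:
    """Pre-save a data point so later analyses can look it up by name."""
    REGISTRY[dp.name] = dp


save_data_point(DataPoint(
    "get_account_balance", ("user_id",),
    lambda user_id: ACCOUNTS[user_id]["balance"]))
save_data_point(DataPoint(
    "get_login_email", ("user_id",),
    lambda user_id: ACCOUNTS[user_id]["email"]))
```

Both data points here share the single unified parameter `user_id`, which mirrors the statement that one unified parameter can correspond to multiple data points.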
Fig. 1 is a flowchart of a data processing method provided by an embodiment of the invention. The method includes:
S101: Determine, according to a business analysis requirement, the business variable needed to fulfil the business analysis requirement.
The business analysis requirement may be any of the analysis requirements that can arise for big data, for example analyzing whether an account is risky, or analyzing whether a group of accounts are high-value accounts. The business variable is the specific, core data used to fulfil the business analysis requirement. If the requirement is to analyze whether an account is risky, the business variable can be the total value of the account; if the requirement is to analyze whether a group of accounts are high-value accounts, the business variable can be, for each account in the group, the quotient of its account balance and its monthly consumption.
S102: Find the data point and the unified parameters used to calculate the business variable.
For example, based on the specific content of the determined business variable, the data point and the unified parameters used to calculate it can be found through computation logic. A single business variable may be calculated from multiple data points and multiple unified parameters.
For example, if the business variable is the total value of an account, the data point used to calculate it can be obtaining the account value, and the unified parameter can be the user name of the account. If the business variable is the quotient of account balance and monthly consumption for each account in a group, the data points used to calculate it can be obtaining the balance of each account in the group and obtaining the monthly consumption of each account in the group, and the unified parameters can be the user name of each account in the group.
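The second example above, two data points sharing one unified parameter per account, might look like the following minimal sketch. The account data, the function names and the dictionary layout are hypothetical and chosen only to make the example self-contained.

```python
from typing import Dict, Iterable

# Hypothetical account data; in practice this would come from the data set.
ACCOUNTS = {
    "a": {"balance": 9000.0, "monthly_consumption": 300.0},
    "b": {"balance": 100.0, "monthly_consumption": 50.0},
}


def get_account_balance(user_id: str) -> float:
    """Data point: acquisition logic for an account's balance."""
    return ACCOUNTS[user_id]["balance"]


def get_monthly_consumption(user_id: str) -> float:
    """Data point: acquisition logic for an account's monthly consumption."""
    return ACCOUNTS[user_id]["monthly_consumption"]


def balance_to_consumption(group: Iterable[str]) -> Dict[str, float]:
    """Business variable: balance divided by monthly consumption, per account.

    The user id of each account is the unified parameter for both data points.
    Accounts with zero monthly consumption are not handled in this sketch.
    """
    return {uid: get_account_balance(uid) / get_monthly_consumption(uid)
            for uid in group}
```

Calling `balance_to_consumption(["a", "b"])` evaluates both data points once per account with the same `user_id`.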
Optionally, the embodiment of the invention provides a process for deriving, from the business variable, the data point and the unified parameters used to calculate it. Before S102 is performed, the method further includes: determining the target variable and the transfer function used to calculate the business variable, the target variable including information about the data point and the unified parameters.
The target variable here can be obtained by instantiating the data point and the unified parameters. It has a definite business meaning and consists of a data point plus unified parameters. For example, if the target variable is the buyer's "user Email", it can consist of the data point (the user's login Email) plus the unified parameter (the buyer's UserId). One data point can correspond to multiple target variables: if the target variable is the seller's "user Email", it can consist of the same data point (the user's login Email) plus the unified parameter (the seller's UserId).
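One way to picture this instantiation is as a binding between a data point and the context fields that supply its unified parameters. The sketch below is an assumption-laden illustration: `TargetVariable`, `resolve`, `DATA_POINTS`, the transaction record and the email addresses are all hypothetical names and values, not part of the described method.

```python
from dataclasses import dataclass
from typing import Callable, Dict

# Hypothetical data: login emails and one transaction with a buyer and a seller.
EMAILS = {"b42": "buyer@example.com", "s07": "seller@example.com"}
TRANSACTION = {"buyer_user_id": "b42", "seller_user_id": "s07"}


def get_login_email(user_id: str) -> str:
    """Data point: acquisition logic for a user's login Email."""
    return EMAILS[user_id]


# Pre-saved data points, keyed by name.
DATA_POINTS: Dict[str, Callable[..., str]] = {"get_login_email": get_login_email}


@dataclass
class TargetVariable:
    name: str                       # business meaning, e.g. "buyer_email"
    data_point: str                 # name of the pre-saved data point
    param_bindings: Dict[str, str]  # unified parameter -> context field


def resolve(var: TargetVariable, context: Dict[str, str]) -> str:
    """Look up the data point and call it with the bound unified parameters."""
    fetch = DATA_POINTS[var.data_point]
    kwargs = {param: context[field]
              for param, field in var.param_bindings.items()}
    return fetch(**kwargs)


# Two target variables instantiated from the same data point.
buyer_email = TargetVariable(
    "buyer_email", "get_login_email", {"user_id": "buyer_user_id"})
seller_email = TargetVariable(
    "seller_email", "get_login_email", {"user_id": "seller_user_id"})
```

The two target variables differ only in which context field feeds the `user_id` parameter, which is what gives each one its own business meaning.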
The information about the data point can be the name, storage location or identifier of the data point. The information about the unified parameters can be the name, storage location or identifier of the unified parameters. This information identifies the data point and the unified parameters respectively, so that the corresponding data point and unified parameters can be found from it.
The transfer function is the computation logic needed to calculate the business variable. It can be a basic arithmetic function such as max (maximum), min (minimum), add (sum) or div (quotient).
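As a minimal sketch, the four transfer functions named above could be kept in a lookup table keyed by name. The table name `TRANSFER_FUNCTIONS` and the convention that each function takes a sequence of numbers are assumptions made for this example.

```python
from typing import Callable, Dict, Sequence

# Hypothetical table of basic transfer functions, keyed by the names in the text.
TRANSFER_FUNCTIONS: Dict[str, Callable[[Sequence[float]], float]] = {
    "max": lambda xs: max(xs),        # largest value
    "min": lambda xs: min(xs),        # smallest value
    "add": lambda xs: sum(xs),        # sum of all values
    "div": lambda xs: xs[0] / xs[1],  # quotient of the first two values
}
```

A caller would pick a function by name, for example `TRANSFER_FUNCTIONS["add"]([1.0, 2.0, 3.0])`. This sketch does not guard `div` against a zero divisor.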
Accordingly, S102 can specifically include: finding the pre-saved data point and unified parameters according to the information about the data point and the unified parameters.
S103: Retrieve the value of the characteristic value according to the data point and the unified parameters.
For example, because the data point and the unified parameters have already been determined and stored, the specific or real-time value of the characteristic value can be determined directly from the unified parameters (the information needed to obtain the characteristic value) and the data point (the logic for obtaining the characteristic value). If the data point is obtaining the account value, the value of the characteristic value can be information such as the amounts of the various deposits currently held in the account.
S104: Calculate the value of the business variable from the value of the characteristic value.
Optionally, if the target variable and the transfer function used to calculate the business variable were determined before S102 was performed, S104 can specifically be: calculating the value of the business variable from the transfer function and the value of the characteristic value.
For example, if the data point is obtaining the account value, and the value of the characteristic value is information such as the amounts of the various deposits currently held in the account, then the business variable, the total value of the account, can be the sum of those deposit amounts. The summation function can be the transfer function determined earlier.
As another example, if the business variable is the ratio of the seller's account balance to the seller's Yuebao amount, it can be defined as div(target variable: seller's account balance, target variable: seller's Yuebao amount).
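A business variable defined this way can be thought of as a transfer function applied to named target variables. The following is only a sketch under assumptions: the `BusinessVariable` class, the variable names and the two amounts are hypothetical, and the characteristic values are taken as already retrieved.

```python
from dataclasses import dataclass
from typing import Callable, Dict, Tuple

# Hypothetical characteristic values already retrieved for one seller.
FEATURE_VALUES: Dict[str, float] = {
    "seller_account_balance": 500.0,
    "seller_yuebao_amount": 2000.0,
}


def div(a: float, b: float) -> float:
    """Transfer function: quotient of two values (no zero check in this sketch)."""
    return a / b


@dataclass(frozen=True)
class BusinessVariable:
    name: str
    transfer: Callable[..., float]     # transfer function to apply
    target_variables: Tuple[str, ...]  # names of the target variables, in order

    def evaluate(self, values: Dict[str, float]) -> float:
        """Apply the transfer function to the target variables' values."""
        return self.transfer(*(values[t] for t in self.target_variables))


ratio = BusinessVariable(
    "balance_to_yuebao_ratio", div,
    ("seller_account_balance", "seller_yuebao_amount"))
```

Evaluating `ratio.evaluate(FEATURE_VALUES)` divides the first target variable's value by the second.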
S105: Fulfil the business analysis requirement according to the value of the business variable.
For example, once the value of the business variable has been determined, it can be analyzed according to the specific needs of the business analysis requirement. How the analysis is carried out may depend on the specific application scenario and on the specific analysis criteria. Suppose the business analysis requirement is to analyze whether an account is risky, the basis for the analysis is the total value of the account, and the above procedure yields a total value of 13,000 yuan. If, in the specific application scenario, an account with a total value above 10,000 is considered risky, then this account, with a total value of 13,000, is judged to be a risky account.
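Steps S101 to S105 for this risk example can be sketched end to end as follows. The deposit amounts, account identifiers and function names are hypothetical; only the 10,000 threshold and the 13,000 total come from the example in the text.

```python
from typing import Callable, Dict, List

# Hypothetical deposits per account; "acct-1" totals 13,000 as in the text.
DEPOSITS: Dict[str, List[float]] = {
    "acct-1": [8000.0, 5000.0],
    "acct-2": [1200.0],
}


def get_account_value(user_id: str) -> List[float]:
    """Data point: acquisition logic for the deposits held in an account."""
    return DEPOSITS[user_id]


# Pre-saved data points, keyed by name.
DATA_POINTS: Dict[str, Callable[..., List[float]]] = {
    "get_account_value": get_account_value,
}

RISK_THRESHOLD = 10000.0  # total value above this is considered risky


def analyze_account_risk(user_id: str) -> bool:
    # S101: the business variable for this requirement is the account total value.
    # S102: find the pre-saved data point and its unified parameter.
    fetch = DATA_POINTS["get_account_value"]
    params = {"user_id": user_id}
    # S103: retrieve the characteristic values (the individual deposits).
    deposits = fetch(**params)
    # S104: apply the transfer function (sum) to get the business variable.
    total_value = sum(deposits)
    # S105: fulfil the requirement by comparing against the threshold.
    return total_value > RISK_THRESHOLD
```

A different business analysis requirement would reuse `get_account_value` unchanged and swap only the transfer function or the final comparison.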
As can be seen from the above embodiment, data points are summarized from data in advance, each data point identifying the acquisition logic for obtaining a characteristic value; corresponding unified parameters are determined according to the data points, the unified parameters identifying the contextual information needed to obtain the characteristic value under that acquisition logic; and the data points and unified parameters are saved in advance. Because almost any business analysis has to obtain characteristic values through various acquisition logics, data points summarized by acquisition logic are applicable to different business analysis requirements; the data points and unified parameters act as the basic units of the standardized data. When a business analysis requirement arises, the corresponding data points and unified parameters can be found according to the characteristics of that requirement, which improves the efficiency of data processing and analysis and provides good generality and scalability.
Fig. 2 is a structural diagram of a data processing device provided by an embodiment of the invention. The device includes:
A summarizing unit 200, configured to summarize data points from data, each data point identifying the acquisition logic for obtaining a characteristic value; to determine corresponding unified parameters according to the data points, the unified parameters identifying the contextual information needed to obtain the characteristic value under that acquisition logic; and to save the data points and unified parameters in advance.
A determining unit 201, configured to determine, according to a business analysis requirement, the business variable needed to fulfil the business analysis requirement.
A searching unit 202, configured to find the data point and the unified parameters used to calculate the business variable.
A retrieving unit 203, configured to retrieve the value of the characteristic value according to the data point and the unified parameters.
A computing unit 204, configured to calculate the value of the business variable from the value of the characteristic value.
An analyzing unit 205, configured to fulfil the business analysis requirement according to the value of the business variable.
Optionally, the determining unit is further configured to determine, before the searching unit is triggered, the target variable and the transfer function used to calculate the business variable, the target variable including information about the data point and the unified parameters;
the searching unit is further configured to find the pre-saved data point and unified parameters according to the information about the data point and the unified parameters.
Optionally, the computing unit is further configured to calculate the value of the business variable from the transfer function and the value of the characteristic value.
Optionally, the data point is defined in a dynamic scripting language.
Optionally, the target variable is obtained by instantiating the data point and the unified parameters.
As can be seen from the above device embodiment, data points are summarized from data in advance, each data point identifying the acquisition logic for obtaining a characteristic value; corresponding unified parameters are determined according to the data points, the unified parameters identifying the contextual information needed to obtain the characteristic value under that acquisition logic; and the data points and unified parameters are saved in advance. Because almost any business analysis has to obtain characteristic values through various acquisition logics, data points summarized by acquisition logic are applicable to different business analysis requirements; the data points and unified parameters act as the basic units of the standardized data. When a business analysis requirement arises, the corresponding data points and unified parameters can be found according to the characteristics of that requirement, which improves the efficiency of data processing and analysis and provides good generality and scalability.
A person of ordinary skill in the art will understand that all or some of the steps of the above method embodiment may be carried out by hardware under the control of program instructions. The program may be stored in a computer-readable storage medium and, when executed, performs the steps of the above method embodiment. The storage medium may be at least one of the following: read-only memory (ROM), RAM, a magnetic disk, an optical disc, or any other medium that can store program code.
It should be noted that the embodiments in this specification are described progressively: identical or similar parts of the embodiments may be referred to one another, and each embodiment focuses on its differences from the others. In particular, the device and system embodiments are described relatively briefly because they are substantially similar to the method embodiment; for related details, refer to the corresponding description of the method embodiment. The device and system embodiments described above are merely illustrative. Units described as separate components may or may not be physically separate, and components shown as units may or may not be physical units; that is, they may be located in one place or distributed across multiple network units. Some or all of the modules may be selected according to actual needs to achieve the purpose of the solution of this embodiment. A person of ordinary skill in the art can understand and implement it without creative effort.
The above is only a preferred embodiment of the invention, and the scope of protection of the invention is not limited to it. Any change or replacement that a person skilled in the art could readily conceive within the technical scope disclosed by the invention shall fall within the scope of protection of the invention. The scope of protection of the invention shall therefore be defined by the claims.