CN106528795A - Data mining method and apparatus - Google Patents

Data mining method and apparatus Download PDF

Info

Publication number
CN106528795A
CN106528795A CN201610991856.9A CN201610991856A CN106528795A CN 106528795 A CN106528795 A CN 106528795A CN 201610991856 A CN201610991856 A CN 201610991856A CN 106528795 A CN106528795 A CN 106528795A
Authority
CN
China
Prior art keywords
data
result
cluster
mining
request
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201610991856.9A
Other languages
Chinese (zh)
Other versions
CN106528795B (en
Inventor
陈萌
杜锐
赵焕芳
杨声钢
苑洪林
吴洋
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Agricultural Bank of China
Original Assignee
Agricultural Bank of China
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Agricultural Bank of China filed Critical Agricultural Bank of China
Priority to CN201610991856.9A priority Critical patent/CN106528795B/en
Publication of CN106528795A publication Critical patent/CN106528795A/en
Application granted granted Critical
Publication of CN106528795B publication Critical patent/CN106528795B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/27Replication, distribution or synchronisation of data between databases or within a distributed database system; Distributed database system architectures therefor
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/24Querying
    • G06F16/245Query processing
    • G06F16/2458Special types of queries, e.g. statistical queries, fuzzy queries or distributed queries
    • G06F16/2462Approximate or statistical queries
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/24Querying
    • G06F16/245Query processing
    • G06F16/2458Special types of queries, e.g. statistical queries, fuzzy queries or distributed queries
    • G06F16/2465Query processing support for facilitating data mining operations in structured databases
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/28Databases characterised by their database models, e.g. relational or object models
    • G06F16/284Relational databases
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/28Databases characterised by their database models, e.g. relational or object models
    • G06F16/284Relational databases
    • G06F16/285Clustering or classification
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F2216/00Indexing scheme relating to additional aspects of information retrieval not explicitly covered by G06F16/00 and subgroups
    • G06F2216/03Data mining

Abstract

The present invention discloses a data mining method and apparatus that are applicable to a data mining system. The data mining system comprises a first cluster and a second cluster. The first cluster comprises multiple first servers. The second cluster comprises multiple second servers. The first servers are configured with, based on an ILog rule engine, a first mining model. The second servers are configured with, based on a statistical analysis system (SAS), a second mining model. The method comprises the steps of receiving data mining requests comprising request types; classifying the data mining requests; transmitting the data mining requests with the request type of a rapid response type to the first cluster, and performing, by the first servers, data mining processing on data by using the first mining model according to the data mining requests, so as to obtain a first mining result; and transmitting the data mining requests with the data type of a non-rapid response type to the second cluster, and performing, by the second servers, data mining processing on data by using the second mining model according to the data mining requests, so as to obtain a second mining result.

Description

A kind of data digging method and device
Technical field
The application is related to data mining technology field, more particularly to a kind of data digging method and device.
Background technology
With the development of science and technology, the fast-developing and big data technology of business intelligence is maked rapid progress, big data Value is increasingly taken seriously, and particularly banking system can accumulate the business number of magnanimity during its routine work is handled According to carrying out data mining using these big datas, Result can be widely applied to client's marketing, products perfection, wind The numerous areas such as dangerous management and control, have great importance for core competitiveness is lifted.
Thus, need a kind of implementation that effectively can be excavated to data in real time badly.
The content of the invention
In view of this, the purpose of the application is to provide a kind of data digging method and device, to solve in prior art The technical problem that effectively data cannot be excavated in real time.
This application provides a kind of data digging method, it is adaptable to which data digging system, the data digging system include First cluster and the second cluster, first cluster include multiple first servers, and second cluster includes multiple Two servers, the first server are configured with the first mining model based on ILog regulation engines, and the second server is based on SAS is configured with the second mining model, and methods described includes:
At least one data mining request is received, in the data mining request, at least includes request type;
Data mining request is classified based on its request type;
Data mining request of the request type for quick response type is transferred to into first cluster, is collected by described first First server in group is carried out to the data in data source using first mining model based on data mining request Data mining is processed, and obtains the first Result;
By data type be not quick response type data mining request be transferred to second cluster, by described second Second server in cluster is entered to the data in data source using second mining model based on data mining request Row data mining is processed, and obtains the second Result.
Said method, it is preferred that after first Result and second Result is obtained, methods described Also include:
First Result and second Result are returned.
Said method, it is preferred that after first Result and second Result is obtained, methods described Also include:
First Result and second Result are stored.
Said method, it is preferred that after first Result and second Result is obtained, methods described Also include:
First Result and second Result are transferred to into second cluster, by second cluster In second server first Result and second Result are handed over using second mining model Fork checking.
Said method, it is preferred that also include:
First mining model is transferred to into second cluster, is utilized by the second server in second cluster Second mining model carries out model training and checking.
Present invention also provides a kind of data mining device, is connected with data digging system, the data digging system Including the first cluster and the second cluster, first cluster includes multiple first servers, and second cluster includes many Individual second server, the first server are configured with the first mining model, the second server based on ILog regulation engines Second mining model, described device are configured with based on SAS (STATISTICAL ANALYSIS SYSTEM, statistical analysis system) Including:
Request reception unit, for receiving at least one data mining request, at least includes in the data mining request Request type;
Requests classification unit, for being classified based on its request type to data mining request;
First transmission unit, is transferred to described first for the data mining request by request type for quick response type Cluster, by the first server in first cluster based on data mining request using the first mining model logarithm Data mining is carried out according to the data in source, the first Result is obtained;
Second transmission unit, the data mining request for by data type not being quick response type are transferred to described the Two clusters, by the second server in second cluster based on data mining request using second mining model pair Data in data source carry out data mining process, obtain the second Result.
Said apparatus, it is preferred that also include:
As a result returning unit, for after first Result and second Result is obtained, will be described First Result and second Result are returned.
Said apparatus, it is preferred that also include:
As a result memory element, for first Result and second Result are stored.
Said apparatus, it is preferred that also include:
3rd transmission unit, for after first Result and second Result is obtained, will be described First Result and second Result are transferred to second cluster, by the second server in second cluster Cross validation is carried out to first Result and second Result using second mining model.
Said apparatus, it is preferred that also include:
4th transmission unit, for first mining model is transferred to second cluster, by second cluster In second server carry out model training and checking using second mining model.
From such scheme, a kind of data digging method and device that the application is provided, by by ILog clusters and SAS Cluster configuration in same system, so as to receive data mining ask when, can according to data mining request request Type is determining the excavation mode of excavation mode or SAS using Ilog so that the application can be provided simultaneously with the energy of Ilog The enough data mining characteristic based on expert model and excavations to data model of SAS that quick response is carried out to data mining And checking etc. data mining duty characteristic, so as to collect two kinds of data mining spies of Ilog and SAS on the basis of originating in identical data Property, in the case where legacy data mining task disposal ability is not affected, significantly lifted to different response times, different excavations The response efficiency of complexity task.
Description of the drawings
For the technical scheme being illustrated more clearly that in the embodiment of the present application, below will be to making needed for embodiment description Accompanying drawing is briefly described, it should be apparent that, drawings in the following description are only some embodiments of the present application, for For those of ordinary skill in the art, without having to pay creative labor, can be obtaining which according to these accompanying drawings His accompanying drawing.
Fig. 1 is a kind of flow chart of data digging method that the embodiment of the present application one is provided;
Application exemplary plots of the Fig. 2 for the embodiment of the present application;
Fig. 3 is a kind of flow chart of data digging method that the embodiment of the present application two is provided;
Fig. 4 is a kind of flow chart of data digging method that the embodiment of the present application three is provided;
Fig. 5 is a kind of flow chart of data digging method that the embodiment of the present application four is provided;
Fig. 6 is a kind of partial process view of data digging method that the embodiment of the present application five is provided;
Fig. 7 is a kind of structural representation of data mining device that the embodiment of the present application six is provided;
Fig. 8 is a kind of structural representation of data mining device that the embodiment of the present application seven is provided;
Fig. 9 is a kind of structural representation of data mining device that the embodiment of the present application eight is provided;
Figure 10 is a kind of structural representation of data mining device that the embodiment of the present application nine is provided;
Figure 11 is a kind of structural representation of data mining device that the embodiment of the present application ten is provided.
Specific embodiment
Below in conjunction with the accompanying drawing in the embodiment of the present application, the technical scheme in the embodiment of the present application is carried out clear, complete Site preparation is described, it is clear that described embodiment is only some embodiments of the present application, rather than the embodiment of whole.It is based on Embodiment in the application, it is every other that those of ordinary skill in the art are obtained under the premise of creative work is not made Embodiment, belongs to the scope of the application protection.
With reference to Fig. 1, it is a kind of flowchart of data digging method that the embodiment of the present application one is provided, it is adaptable to Fig. 2 Shown data digging system, data digging system are connected with access terminal and data source, as shown in Figure 2.
Wherein, can include in data digging system:First cluster and the second cluster, can include in the first cluster many Individual first server, can include multiple second servers in the second cluster, first server is configured based on ILog regulation engines There is the first mining model, the first mining model as quickly can carry out excavating the expert model of response, thus, first to data It is capable of the data mining demand based on expert model of quick response and deployment user in server based on ILog regulation engines, the Two servers are configured with the second mining model based on SAS (STATISTICAL ANALYSIS SYSTEM, statistical analysis system), Second mining model is data model, second server based on SAS can the data mining task higher to complexity carry out Response.
In the present embodiment, can include having the following steps, realize data mining:
Step 101:Receive at least one data mining request.
Wherein, data mining request characterizes the demand excavated required for user by generating and sending in access terminal, The request type for characterizing user's request is at least included in each data mining request, type if desired for quick response or Request type that big data is counted or complexity is higher etc..
Step 102:Data mining request is classified based on its request type.
In the present embodiment, the classification that data mining is asked is referred to, the demand to parsing user in data mining request Carry out cutting, that is to say, that user generates data mining request by accessing terminal, energy in the data mining request for now generating Enough characterizing user needs to excavate the data in data source using which kind of mode.
Step 103:Data mining request of the request type for quick response type is transferred to into the first cluster, by the first collection First server in group carries out data mining to the data in data source using the first mining model based on data mining request Process, obtain the first Result.
Wherein, after data mining request is transferred to the first cluster, the first cluster can be according to each first server Present load, determine that one or more first servers carry out data mining, realize the load balance scheduling of data mining.
Step 104:By data type be not quick response type data mining request be transferred to the second cluster, by second Second server in cluster carries out data digging to the data in data source using the second mining model based on data mining request Pick is processed, and obtains the second Result.
Wherein, after data mining request is transferred to the second cluster, the second cluster can be according to each second server Present load, determine that one or more second servers carry out data mining, realize the load balance scheduling of data mining.
That is, in the present embodiment after cutting is carried out according to its request type to data mining request, by difference The data mining request of type adopts different processing modes, for example:The data mining for needing quick response is asked to be transferred to First cluster, carries out ageing higher data mining, it would be desirable to which the data mining that data volume is big or complexity demand is higher please Ask the data mining for being transferred to that the second cluster is more improved or depth is higher.
It should be noted that data source here can be various types of data sources, such as relevant database, Hadoop Data acquisition system of data base or data file etc..
From such scheme, a kind of data digging method that the embodiment of the present application one is provided, by by ILog clusters with SAS cluster configurations in same system, so as to receive data mining ask when, can according to data mining request please Seek type to determine the excavation mode of excavation mode or SAS using Ilog so that the application can be provided simultaneously with Ilog's The data mining characteristic based on expert model and the digging to data model of SAS of quick response can be carried out to data mining The characteristic of the data mining duty such as pick and checking, so as to collect two kinds of data minings of Ilog and SAS on the basis of originating in identical data Characteristic, in the case where legacy data mining task disposal ability is not affected, is significantly lifted to different response times, different diggings The response efficiency of pick complexity task.
It should be noted that in actual applications, the program code with the methodological function in the present embodiment may operate at In application server cluster, in application server cluster can contain multiple application servers, data mining can be asked into Data mining request is carried out processing forward in corresponding first cluster or the second cluster by row response.
And in order to realize load balancing, data mining produced by the access terminal of user request can be sent initially to In the load-balanced server that application server cluster is connected, balance dispatching commander is carried out by load-balanced server and is forwarded again To in the application server of suitable application server cluster, and then realize data mining.
In one implementation, with reference to Fig. 3, it is a kind of realization of data digging method that the embodiment of the present application two is provided Flow chart, after the step 103 and the step 104, methods described can also be comprised the following steps:
Step 105:First Result and second Result are returned.
Specifically, in the present embodiment, first Result and second Result can be returned to user Access terminal.
In one implementation, with reference to Fig. 4, it is a kind of realization of data digging method that the embodiment of the present application three is provided Flow chart, after the step 103 and the step 104, methods described can also be comprised the following steps:
Step 106:First Result and second Result are stored.
Specifically, in the present embodiment, can be by first Result and second Result storage to the In the database storage system of one cluster and the connection of the second cluster.
In one implementation, with reference to Fig. 5, it is that a kind of data digging method that the embodiment of the present application four is provided realizes flow process Figure, wherein, after the step 103 and the step 104, methods described can also be comprised the following steps:
Step 107:First Result and second Result are transferred to into the second cluster, by the second cluster In second server first Result and second Result are handed over using second mining model Fork checking.
That is, the second server in the second cluster is built with the second mining model based on SAS so that second service Device can carry out the excavation of data model and training checking, thus, in the present embodiment, can obtain the first Result and The result of the first Result such as expert model result and the second Result such as data model is entered after second Result Row cross validation.
In one implementation, with reference to Fig. 6, it is a kind of part flow process of data digging method that the embodiment of the present application five is provided Figure, methods described can also be comprised the following steps:
Step 108:First mining model is transferred in the second cluster, is utilized by the second server in the second cluster Two mining models carry out model training and checking.
That is, the second server in the second cluster is built with the second mining model based on SAS so that second service Device can carry out the excavation of data model and training checking, thus, in the present embodiment, can be by first service in the first cluster First mining model of device such as expert model carries out the training and checking of model in being put into the second cluster, afterwards, the second cluster can So that model training result is fed back to the first cluster, carry out perfect grade of model and process.
With reference to Fig. 7, be a kind of structural representation of data mining device that the embodiment of the present application six is provided, described device with Data digging system shown in Fig. 2 is connected, and data mining device is connected with terminal is accessed, data digging system and data source It is connected.
Wherein, can include in data digging system:First cluster and the second cluster, can include in the first cluster many Individual first server, can include multiple second servers in the second cluster, first server is configured based on ILog regulation engines There is the first mining model, the first mining model as quickly can carry out excavating the expert model of response, thus, first to data It is capable of the data mining demand based on expert model of quick response and deployment user in server based on ILog regulation engines, the Two servers are configured with the second mining model based on SAS (STATISTICAL ANALYSIS SYSTEM, statistical analysis system), Second mining model is data model, second server based on SAS can the data mining task higher to complexity carry out Response.
In the present embodiment, described device can include following structure, realize data mining:
Request reception unit 701, for receiving at least one data mining request, at least wraps in the data mining request Include request type.
Wherein, data mining request characterizes the demand excavated required for user by generating and sending in access terminal, The request type for characterizing user's request is at least included in each data mining request, type if desired for quick response or Request type that big data is counted or complexity is higher etc..
It should be noted that request reception unit 701 can adopt the interface that can carry out data transmission to realize, to connect Receive the data mining request for accessing that terminal sends.
Requests classification unit 702, for being classified based on its request type to data mining request.
In the present embodiment, the classification that data mining is asked is referred to, the demand to parsing user in data mining request Carry out cutting, that is to say, that user generates data mining request by accessing terminal, energy in the data mining request for now generating Enough characterizing user needs to excavate the data in data source using which kind of mode.
It should be noted that requests classification unit 702 can be realized using grader, based on request type by data mining Request is classified.
First transmission unit 703, it is described for the data mining request that request type is quick response type is transferred to First cluster, by the first server in first cluster based on data mining request using first mining model Data mining is carried out to the data in data source, the first Result is obtained.
Wherein, after data mining request is transferred to the first cluster, the first cluster can be according to each first server Present load, determine that one or more first servers carry out data mining, realize the load balance scheduling of data mining.
It should be noted that the first transmission unit 703 can adopt the interface that can carry out data transmission to realize, to incite somebody to action Data mining request is transferred to the first cluster.
Second transmission unit 704, the data mining request for by data type not being quick response type are transferred to institute The second cluster is stated, mould is excavated using described second based on data mining request by the second server in second cluster Type carries out data mining process to the data in data source, obtains the second Result.
Wherein, after data mining request is transferred to the second cluster, the second cluster can be according to each second server Present load, determine that one or more second servers carry out data mining, realize the load balance scheduling of data mining.
It should be noted that the second transmission unit 704 can adopt the interface that can carry out data transmission to realize, to incite somebody to action Data mining request is transferred to the second cluster.
That is, in the present embodiment after cutting is carried out according to its request type to data mining request, by difference The data mining request of type adopts different processing modes, for example:The data mining for needing quick response is asked to be transferred to First cluster, carries out ageing higher data mining, it would be desirable to which the data mining that data volume is big or complexity demand is higher please Ask the data mining for being transferred to that the second cluster is more improved or depth is higher.
It should be noted that data source here can be various types of data sources, such as relevant database, Hadoop Data acquisition system of data base or data file etc..
From such scheme, a kind of data mining device that the embodiment of the present application six is provided, by by ILog clusters with SAS cluster configurations in same system, so as to receive data mining ask when, can according to data mining request please Seek type to determine the excavation mode of excavation mode or SAS using Ilog so that the application can be provided simultaneously with Ilog's The data mining characteristic based on expert model and the digging to data model of SAS of quick response can be carried out to data mining The characteristic of the data mining duty such as pick and checking, so as to collect two kinds of data minings of Ilog and SAS on the basis of originating in identical data Characteristic, in the case where legacy data mining task disposal ability is not affected, is significantly lifted to different response times, different diggings The response efficiency of pick complexity task.
It should be noted that in actual applications, the program code with the methodological function in the present embodiment may operate at In application server cluster, in application server cluster can contain multiple application servers, data mining can be asked into Data mining request is carried out processing forward in corresponding first cluster or the second cluster by row response.
And in order to realize load balancing, data mining produced by the access terminal of user request can be sent initially to In the load-balanced server that application server cluster is connected, balance dispatching commander is carried out by load-balanced server and is forwarded again To in the application server of suitable application server cluster, and then realize data mining.
With reference to Fig. 8, it is a kind of structural representation of data mining device that the embodiment of the present application seven is provided, described device is also Following structure can be included:
As a result returning unit 705, are connected with the first cluster and the second cluster, for obtaining described in the first cluster After one Result and the second cluster obtain second Result, first Result and described second are excavated As a result returned.
Specifically, the result returning unit 705 can using with first transmission unit 703 and described second transmission 704 identical coffret of unit, the first Result and the second Result are returned to access terminal.
With reference to Fig. 9, it is a kind of structural representation of data mining device that the embodiment of the present application eight is provided, wherein, it is described Device can also include following structure:
As a result memory element 706, are connected with the first cluster and the second cluster, and the first cluster is deposited with the second cluster and data Storage system is connected, and the result memory element 706 is for first Result that obtains the first cluster and the second collection Second Result that group obtains is stored.
Wherein, the result memory element 706 can be to for example various types of database transmissions of data-storage system The data-interface of one Result and the second Result.
With reference to Figure 10, it is a kind of structural representation of data mining device that the embodiment of the present application nine is provided, wherein, it is described Device can also include following structure:
3rd transmission unit 707, is connected with the second cluster, for obtaining first Result and described second After Result, first Result and second Result are transferred to into second cluster, by described Second server in two clusters is using second mining model to first Result and second Result Carry out cross validation.
It should be noted that the 3rd transmission unit 707 can adopt the interface that can carry out data transmission to realize, to incite somebody to action First Result and the second Result are transferred to the second cluster, carry out intersection by the second server in the second cluster and test Card.For example, the first Result represents the modeling result of expert model, and the second Result represents the modeling knot of the data evil spirit heart Really, second server carries out cross one another checking using real data to the result of two models, discovery two is mutually authenticated Problem and defect that class model is present, in this, as the optimization foundation of two class models, the accuracy of lift scheme.
With reference to Figure 11, it is a kind of structural representation of data mining device that the embodiment of the present application ten is provided, wherein, it is described Device can also include following structure:
4th transmission unit 708, is connected between the first cluster and the second cluster, for first mining model is passed It is defeated to second cluster, model training is carried out using second mining model by the second server in second cluster And checking.
It should be noted that the 4th transmission unit 708 can adopt the interface that can carry out data transmission to realize, by first The first mining model such as expert model in cluster is transferred to the second cluster, carries out model by the second server in the second cluster Training and checking.For example, the first mining model (Ilog) can only quickly develop expert model, itself without model training and Authentication function, and the second mining model (SAS) is, with such function, to therefore, it can for the first mining model to be put into Two mining models carry out model training and checking.
It should be noted that each embodiment in this specification is described by the way of progressive, each embodiment weight Point explanation is all difference with other embodiment, between each embodiment identical similar part mutually referring to.
Finally, in addition it is also necessary to explanation, herein, such as first and second or the like relational terms be used merely to by One entity or operation are made a distinction with another entity or operation, and are not necessarily required or implied these entities or operation Between there is any this actual relation or order.And, term " including ", "comprising" or its any other variant are anticipated Covering including for nonexcludability, so that a series of process, method, article or equipment including key elements not only includes that A little key elements, but also including other key elements being not expressly set out, or also include for this process, method, article or The intrinsic key element of equipment.In the absence of more restrictions, the key element for being limited by sentence "including a ...", does not arrange Except also there is other identical element in including the process of the key element, method, article or equipment.
Above a kind of data digging method provided by the present invention and device are described in detail, to disclosed reality The described above of example is applied, professional and technical personnel in the field is realized or using the present invention.The various of these embodiments are repaiied Change and will be apparent for those skilled in the art, generic principles defined herein can without departing from In the case of the spirit or scope of the present invention, realize in other embodiments.Therefore, the present invention is not intended to be limited to this paper institutes These embodiments shown, and it is to fit to the most wide scope consistent with principles disclosed herein and features of novelty.

Claims (10)

1. a kind of data digging method, it is characterised in that suitable for data digging system, the data digging system includes first Cluster and the second cluster, first cluster include multiple first servers, and second cluster includes multiple second clothes Business device, the first server are configured with the first mining model based on ILog regulation engines, and the second server is based on SAS (STATISTICAL ANALYSIS SYSTEM, statistical analysis system) is configured with the second mining model, and methods described includes:
At least one data mining request is received, in the data mining request, at least includes request type;
Data mining request is classified based on its request type;
Data mining request of the request type for quick response type is transferred to into first cluster, by first cluster First server based on the data mining request data are carried out to the data in data source using first mining model Excavation is processed, and obtains the first Result;
By data type be not quick response type data mining request be transferred to second cluster, by second cluster In second server based on the data mining request line number is entered to the data in data source using second mining model Process according to excavation, obtain the second Result.
2. method according to claim 1, it is characterised in that excavate first Result and described second is being obtained As a result, after, methods described also includes:
First Result and second Result are returned.
3. method according to claim 1, it is characterised in that excavate first Result and described second is being obtained As a result, after, methods described also includes:
First Result and second Result are stored.
4. method according to claim 1, it is characterised in that excavate first Result and described second is being obtained As a result, after, methods described also includes:
First Result and second Result are transferred to into second cluster, by second cluster Second server carries out intersection to first Result and second Result using second mining model and tests Card.
5. method according to claim 1, it is characterised in that also include:
First mining model is transferred to into second cluster, by the second server in second cluster using described Second mining model carries out model training and checking.
6. a kind of data mining device, it is characterised in that be connected with data digging system, the data digging system includes One cluster and the second cluster, first cluster include multiple first servers, and second cluster includes multiple second Server, the first server are configured with the first mining model based on ILog regulation engines, and the second server is based on SAS (STATISTICAL ANALYSIS SYSTEM, statistical analysis system) is configured with the second mining model, and described device includes:
Request reception unit, for receiving at least one data mining request, at least includes request in the data mining request Type;
Requests classification unit, for being classified based on its request type to data mining request;
First transmission unit, is transferred to first collection for the data mining request by request type for quick response type Group, utilizes first mining model to data based on data mining request by the first server in first cluster Data in source carry out data mining, obtain the first Result;
Second transmission unit, the data mining request for by data type not being quick response type are transferred to second collection Group, utilizes second mining model to data based on data mining request by the second server in second cluster Data in source carry out data mining process, obtain the second Result.
7. device according to claim 6, it is characterised in that also include:
As a result returning unit, for after first Result and second Result is obtained, by described first Result and second Result are returned.
8. device according to claim 6, it is characterised in that also include:
As a result memory element, for first Result and second Result are stored.
9. device according to claim 6, it is characterised in that also include:
3rd transmission unit, for after first Result and second Result is obtained, by described first Result and second Result are transferred to second cluster, are utilized by the second server in second cluster Second mining model carries out cross validation to first Result and second Result.
10. device according to claim 6, it is characterised in that also include:
4th transmission unit, for first mining model is transferred to second cluster, by second cluster Second server carries out model training and checking using second mining model.
CN201610991856.9A 2016-11-10 2016-11-10 Data mining method and device Active CN106528795B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201610991856.9A CN106528795B (en) 2016-11-10 2016-11-10 Data mining method and device

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201610991856.9A CN106528795B (en) 2016-11-10 2016-11-10 Data mining method and device

Publications (2)

Publication Number Publication Date
CN106528795A true CN106528795A (en) 2017-03-22
CN106528795B CN106528795B (en) 2023-10-13

Family

ID=58351094

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201610991856.9A Active CN106528795B (en) 2016-11-10 2016-11-10 Data mining method and device

Country Status (1)

Country Link
CN (1) CN106528795B (en)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107025288A (en) * 2017-04-14 2017-08-08 四川九鼎瑞信软件开发有限公司 Distributed data digging method and system
CN112363831A (en) * 2020-11-10 2021-02-12 上海华锐软件有限公司 Wind control processing method and device, computer equipment and storage medium

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20050283459A1 (en) * 2004-06-22 2005-12-22 Microsoft Corporation Combining multidimensional expressions and data mining extensions to mine OLAP cubes
US20090094174A1 (en) * 2007-10-09 2009-04-09 International Business Machines Corporation Method, system and program product for on demand data mining server with dynamic mining models
CN103024027A (en) * 2012-12-07 2013-04-03 中国电信股份有限公司云计算分公司 Data mining achieving method and system based on cloud computing
CN206209712U (en) * 2016-11-10 2017-05-31 中国农业银行股份有限公司 A kind of data mining device, application server and server cluster

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20050283459A1 (en) * 2004-06-22 2005-12-22 Microsoft Corporation Combining multidimensional expressions and data mining extensions to mine OLAP cubes
US20090094174A1 (en) * 2007-10-09 2009-04-09 International Business Machines Corporation Method, system and program product for on demand data mining server with dynamic mining models
CN103024027A (en) * 2012-12-07 2013-04-03 中国电信股份有限公司云计算分公司 Data mining achieving method and system based on cloud computing
CN206209712U (en) * 2016-11-10 2017-05-31 中国农业银行股份有限公司 A kind of data mining device, application server and server cluster

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
王春霞;: "基于Web数据挖掘的资源服务系统" *

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107025288A (en) * 2017-04-14 2017-08-08 四川九鼎瑞信软件开发有限公司 Distributed data digging method and system
CN112363831A (en) * 2020-11-10 2021-02-12 上海华锐软件有限公司 Wind control processing method and device, computer equipment and storage medium

Also Published As

Publication number Publication date
CN106528795B (en) 2023-10-13

Similar Documents

Publication Publication Date Title
US20160335315A1 (en) System and method for data capture and reporting
CN111787090B (en) Intelligent treatment platform based on block chain technology
CN107908566A (en) Automatic test management method, device, terminal device and storage medium
CN106709012A (en) Method and device for analyzing big data
CN102893275A (en) Automated social networking graph mining and visualization
KR20200057903A (en) Artificial intelligence model platform and operation method thereof
CN108038052A (en) Automatic test management method, device, terminal device and storage medium
CN107872454A (en) A kind of monitoring of ultra-large type internet platform protection based on security rank threat information and analysis system and method based on big data technology
CN103793399A (en) Method for integrating information resources of system of politics and law
JP2002032773A (en) Device and method for processing map data
CN102594607A (en) Telecommunication application service security test and evaluation universal platform system and method thereof
CN106682206A (en) Method and system for big data processing
CN106528795A (en) Data mining method and apparatus
CN206209712U (en) A kind of data mining device, application server and server cluster
CN104239212B (en) Query method and device of test case and establishment method and device of test case
CN113326400A (en) Model evaluation method and system based on depth counterfeit video detection
CN112269324A (en) Digital exhibition cloud platform data analysis control system
CN107093018A (en) Communication engineering project information method for visualizing and device based on health model
CN109949175B (en) User attribute inference method based on collaborative filtering and similarity measurement
CN111581302A (en) Decision-making assisting system based on data warehouse
CN113449318B (en) Data classification model training method and device and data classification method and device
CN104462480A (en) Typicality-based big comment data mining method
CN114548820A (en) Big data wind control method and server for distance education service
CN101345691A (en) Method for intelligently generating data analysis report and dispatching through wireless/wired technology
CN116226204A (en) Scene determination method, device, equipment and storage medium based on joint learning platform

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant