CN109710233A

CN109710233A - An indicator computation method for a business risk rule engine

Info

Publication number: CN109710233A
Application number: CN201811604910.5A
Authority: CN
Inventors: 陈玮; 刘德彬; 孙世通; 严开
Original assignee: Chongqing Yu Yu Da Data Technology Co Ltd
Current assignee: Chongqing Yu Yu Da Data Technology Co Ltd
Priority date: 2018-12-26
Filing date: 2018-12-26
Publication date: 2019-05-03

Abstract

An indicator computation method for a business risk rule engine, comprising the following steps: S1, configuring a Spark cluster on the master server; S2, setting up an indicator script driver module on the master server and, when setting up the script driver module, setting logic control parameters and a SparkContext, the logic control parameters being passed through the SparkContext to the Cluster Manager of the Spark cluster; S3, the script driver module assigning the indicator computation task to the Cluster Manager of the Spark cluster; S4, decomposing the overall indicator computation task by means of the MapReduce mechanism; S5, the Cluster Manager placing the decomposed indicator computation tasks on the relatively idle other servers according to the "busyness" of the other servers; S6, each server, after executing its indicator computation task, sending the execution result to a cache module for storage and returning it to the master server. The invention enables a rule engine for semi-structured text to compute results quickly, alleviating the rule engine's shortage of computing capacity.

Description

An indicator computation method for a business risk rule engine

Technical field

The present invention relates to the technical field of computer science and software information, and in particular to an indicator computation method for a business risk rule engine.

Background art

In recent years, rule engines have been widely used in finance and anti-fraud, where they help monitor target customers and detect anomalies, risks, business opportunities and the like. In overall design, most rule engines can be divided into two major parts: the construction of the rule system, and the construction of the computation system for the data stream. At present, the data used by rule engines in the industry mainly comprises user behavior data (such as logging in, registering, browsing, bookmarking and purchasing), enterprise financial data and the like; such data is mostly structured and measurable. User behavior data, for example, is inseparable from concepts such as count, frequency, price and time. In a rule engine based on semi-structured text, an indicator is a quantized value describing a specific application scenario of a customer, equivalent to the concepts of count and frequency in a structured rule engine, and a rule is a logical comparison between an indicator and a threshold. In the operation of the whole rule engine, indicators are the lowest-level and most reusable basic elements, so the efficiency of their computation directly affects the real-time performance of the system. Moreover, when rule results are computed in batches, problems arise of high concurrency, a high number of connections and insufficient overall computing power.

Summary of the invention

In view of the above shortcomings of the prior art, the present invention provides an indicator computation method for a business risk rule engine that enables a rule engine for semi-structured text to compute results quickly, alleviating the rule engine's shortage of computing capacity.

To solve the above technical problem, the present invention adopts the following technical solution:

An indicator computation method for a business risk rule engine, comprising the following steps:

S1, configuring a Spark cluster on the master server, and configuring the IP addresses of all servers other than the master server into the Spark cluster of the master server;

S2, setting up an indicator script driver module on the master server and, when setting up the script driver module, setting logic control parameters and a SparkContext, the logic control parameters being passed through the SparkContext to the Cluster Manager of the Spark cluster;

S3, the script driver module assigning the indicator computation task to the Cluster Manager of the Spark cluster;

S4, the Cluster Manager decomposing the overall indicator computation task by means of the MapReduce mechanism;

S5, the Cluster Manager placing the decomposed indicator computation tasks on the relatively idle other servers according to the "busyness" of the other servers;

S6, each server, after executing its indicator computation task, sending the execution result to a cache module for storage and returning it to the master server.

Preferably, in step S5, the "busyness" of the other servers is judged by means of Nginx load balancing.

Preferably, the strategy by which Nginx implements load balancing is round-robin distribution: the indicator computation tasks are assigned to the other servers one by one in chronological order, and if a server goes down it is automatically removed and the round robin continues over the remaining servers.

Preferably, the strategy by which Nginx implements load balancing is weighted distribution: the weight of each accessed server is configured by monitoring the CPU usage of the other servers, which specifies the probability of that server being accessed, the weight being proportional to the access probability.

Preferably, the cache module is a cache, i.e. a high-speed cache memory.

Preferably, the nginx load-balancing strategy by which the Cluster Manager judges the "busyness" of the other servers is round-robin distribution: the indicator computation tasks are assigned to the other servers one by one in chronological order, and if a server goes down it is automatically removed and the round robin continues over the remaining servers.

Preferably, the nginx load-balancing strategy by which the Cluster Manager judges the "busyness" of the other servers is weighted distribution: the weight of each accessed server is configured by monitoring the CPU usage of the other servers, which specifies the probability of that server being accessed, the weight being proportional to the access probability.

The beneficial effects of the present invention are as follows:

The invention enables a rule engine for semi-structured text to compute results quickly, alleviating the rule engine's shortage of computing capacity.

Brief description of the drawings

Fig. 1 is a flow chart of the indicator computation method for a business risk rule engine according to the present invention.

Fig. 2 is a partial system architecture diagram of the cluster computation of indicators.

Detailed description of the embodiments

The present invention is described in further detail below with reference to the accompanying drawings.

As shown in Fig. 1, an indicator computation method for a business risk rule engine comprises the following steps:

S1, configuring a Spark cluster on the master server, and configuring the IP addresses of all servers other than the master server into the Spark cluster of the master server.

For example:

Spark1: 192.168.156.101

Spark2: 192.168.156.102

Spark3: 192.168.156.103

Spark4: 192.168.156.104

Here, 192.168.156.101, 192.168.156.102, 192.168.156.103 and 192.168.156.104 are the IP addresses of the servers.

S2, setting up an indicator script driver module on the master server and, when setting up the script driver module, setting logic control parameters and a SparkContext, the logic control parameters being passed through the SparkContext to the Cluster Manager of the Spark cluster. The SparkContext is the API interface that connects the script driver module to the Cluster Manager.

S3, the script driver module assigning the indicator computation task to the Cluster Manager of the Spark cluster.

S4, the Cluster Manager decomposing the overall indicator computation task by means of the MapReduce mechanism.

MapReduce is a distributed computing framework consisting of two stages, Map and Reduce. Map applies where a one-to-one mapping or conversion of data elements is needed, such as truncation, filtering or any other conversion; these one-to-one element conversions are called Map. Reduce is mainly the aggregation of elements, that is, aggregating multiple elements into a single element, for example computing a sum; this is Reduce.
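As a minimal illustration of the two stages described above, the following Python sketch applies the built-in map and functools.reduce to a small list. The sample strings and the helper names to_int and add are assumptions made purely for illustration and are not part of the invention; in the method itself the same two stages would be carried out across the servers of the Spark cluster.

```python
from functools import reduce

# Hypothetical raw readings (illustrative data only, not from the patent).
raw = [" 3", "5 ", " 7 ", "10"]


def to_int(text: str) -> int:
    """Map stage: convert one element into exactly one new element."""
    return int(text.strip())


def add(total: int, value: int) -> int:
    """Reduce stage: fold many elements into a single element (a sum)."""
    return total + value


mapped = list(map(to_int, raw))  # one-to-one conversion
total = reduce(add, mapped, 0)   # many-to-one aggregation
```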

S5, the Cluster Manager placing the decomposed indicator computation tasks on the relatively idle other servers according to the "busyness" of the other servers.

The specific steps are as follows:

1. Map reads the overall indicator computation task and parses it, with the indicator as the smallest unit, into <key, value> pairs, calling the map function once for each <key, value> pair. For example, if the overall indicator task contains 10 indicators, the whole set of indicators can be parsed into <1, A1>, <2, A2>, <3, A1>, <4, A3>, <5, A2>, <6, A1>, <7, A2>, <8, A2>, <9, A3>, <10, A1>;

2. Override map(): receive the <key, value> pairs generated in step 1 and convert them into new <key, value> pairs for output:

<A1, 1>, <A1, 1>, <A1, 1>, <A1, 1>; <A2, 1>, <A2, 1>, <A2, 1>, <A2, 1>; <A3, 1>, <A3, 1>;

3. The <key, value> pairs output in step 2 are grouped.

4. The data is grouped by key, with the values of identical keys placed in one set. After grouping: <A1, {1, 1, 1, 1}>, <A2, {1, 1, 1, 1}>, <A3, {1, 1}>.

5. The Cluster Manager judges the "busyness" of the other servers and, according to the different groups, copies the outputs of the multiple map tasks over the network to the other servers for processing.

6. The other servers finally output <A1, {4}>, <A2, {4}>, <A3, {2}>.
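The numbered steps above can be sketched on a single machine as follows. This is only an illustrative Python sketch under stated assumptions: the function names map_pair, group_by_key and reduce_group are hypothetical, and step 5 (copying each group to a relatively idle server) is left out, so every group is reduced locally.

```python
from collections import defaultdict

# The ten <key, value> pairs of step 1: (sequence number, indicator name).
tasks = [(1, "A1"), (2, "A2"), (3, "A1"), (4, "A3"), (5, "A2"),
         (6, "A1"), (7, "A2"), (8, "A2"), (9, "A3"), (10, "A1")]


def map_pair(pair):
    """Step 2: turn <number, indicator> into <indicator, 1>."""
    _, indicator = pair
    return indicator, 1


def group_by_key(pairs):
    """Steps 3-4: collect the values of identical keys into one list."""
    groups = defaultdict(list)
    for key, value in pairs:
        groups[key].append(value)
    return dict(groups)


def reduce_group(values):
    """Step 6: aggregate one group into a single count."""
    return sum(values)


mapped = [map_pair(pair) for pair in tasks]
grouped = group_by_key(mapped)
counts = {key: reduce_group(values) for key, values in grouped.items()}
print(counts)  # {'A1': 4, 'A2': 4, 'A3': 2}
```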

Embodiment one: the nginx load-balancing strategy by which the Cluster Manager judges the "busyness" of the other servers is round-robin distribution. The indicator computation tasks are assigned to the other servers one by one in chronological order, and if a server goes down it is automatically removed and the round robin continues over the remaining servers.
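The round-robin behaviour of embodiment one, including the removal of a failed server, might look roughly like the Python sketch below. The class RoundRobinDispatcher and its methods are hypothetical names introduced for illustration; the sketch imitates the distribution policy in plain Python and does not call Nginx or Spark. Treating three of the example addresses as the "other servers" is likewise an assumption.

```python
class RoundRobinDispatcher:
    """Assigns tasks to servers in turn, dropping servers marked as down."""

    def __init__(self, servers):
        self.servers = list(servers)
        self.position = 0  # index of the next server to receive a task

    def mark_down(self, server):
        """Remove a failed server; polling continues over the rest."""
        index = self.servers.index(server)
        self.servers.remove(server)
        if index < self.position:
            self.position -= 1  # keep pointing at the same next server

    def assign(self, task):
        """Return (server, task) for the next server in the rotation."""
        if not self.servers:
            raise RuntimeError("no servers available")
        self.position %= len(self.servers)
        server = self.servers[self.position]
        self.position += 1
        return server, task


dispatcher = RoundRobinDispatcher(
    ["192.168.156.102", "192.168.156.103", "192.168.156.104"])
first = [dispatcher.assign(task)[0] for task in ["t1", "t2"]]
dispatcher.mark_down("192.168.156.103")  # simulated failure
rest = [dispatcher.assign(task)[0] for task in ["t3", "t4", "t5"]]
```

After the simulated failure, the rotation simply continues with the two remaining addresses.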

Embodiment two: the nginx load-balancing strategy by which the Cluster Manager judges the "busyness" of the other servers is weighted distribution. The weight of each accessed server is configured by monitoring the CPU usage of the other servers, which specifies the probability of that server being accessed; the weight is proportional to the access probability.
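The text does not say how a CPU usage reading is turned into a weight. The Python sketch below assumes one simple possibility, namely that the weight equals the idle CPU percentage, and then derives access probabilities proportional to the weights. The function names, the server names and the usage figures are all hypothetical.

```python
def weights_from_cpu(cpu_usage):
    """Assumed rule (not from the patent): weight = idle CPU percentage."""
    return {server: 100 - usage for server, usage in cpu_usage.items()}


def access_probabilities(weights):
    """Access probability of a server is proportional to its weight."""
    total = sum(weights.values())
    if total == 0:
        raise ValueError("all servers are fully busy")
    return {server: weight / total for server, weight in weights.items()}


# Hypothetical CPU usage readings, in percent.
cpu_usage = {"server_a": 20, "server_b": 60, "server_c": 80}
weights = weights_from_cpu(cpu_usage)
probabilities = access_probabilities(weights)
```

Under this assumed rule the least loaded server receives the largest weight and therefore the highest access probability.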

Embodiment two is an upgrade of embodiment one. It can be realized by appending the specified parameter after the application server IP added in the upstream parameters, for example:

With the above configuration, all indicator computation tasks first pass through the nginx reverse proxy server. When the master server forwards a request to the other servers, it reads the upstream address tomcatsever1 and the distribution strategy. Since the weight of tomcat1 is configured as 3, nginx sends most requests to tomcat1 on the "49" server, i.e. port 8080, and a smaller share to tomcat2, thereby achieving conditional load balancing.
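To make the effect of a weight of 3 concrete, the following Python sketch distributes requests between tomcat1 and tomcat2 with a smooth weighted round-robin scheme. Both the weight of 1 for tomcat2 and the choice of this particular selection algorithm are assumptions made for illustration; the text above only states that tomcat1 has weight 3 and receives most of the requests.

```python
from collections import Counter


def smooth_weighted_round_robin(weights, requests):
    """Return the backend chosen for each request, in order."""
    total = sum(weights.values())
    current = {name: 0 for name in weights}
    chosen = []
    for _ in range(requests):
        # Every backend gains its own weight on each round.
        for name, weight in weights.items():
            current[name] += weight
        # The backend with the highest running score is selected
        # (on a tie, the one listed first wins) and pays back the total.
        best = max(current, key=current.get)
        current[best] -= total
        chosen.append(best)
    return chosen


# tomcat1 weight 3 is from the text; tomcat2 weight 1 is an assumption.
weights = {"tomcat1": 3, "tomcat2": 1}
order = smooth_weighted_round_robin(weights, 8)
share = Counter(order)
```

With these weights, three out of every four requests go to tomcat1, which matches the description that most requests reach tomcat1 and a smaller share reaches tomcat2.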

Finally, it should be noted that those skilled in the art can make various changes and modifications to the present invention without departing from its spirit and scope. Thus, if such modifications and variations fall within the scope of the claims of the present invention and their equivalents, the present invention is also intended to cover them.

Claims

1. An indicator computation method for a business risk rule engine, characterized by comprising the following steps:

S1, configuring a Spark cluster on the master server, and configuring the IP addresses of all servers other than the master server into the Spark cluster of the master server;

S2, setting up an indicator script driver module on the master server and, when setting up the script driver module, setting logic control parameters and a SparkContext, the logic control parameters being passed through the SparkContext to the Cluster Manager of the Spark cluster;

S5, the Cluster Manager placing the decomposed indicator computation tasks on the relatively idle other servers according to the "busyness" of the other servers;

S6, each server, after executing its indicator computation task, sending the execution result to a cache module for storage and returning it to the master server.

2. The indicator computation method for a business risk rule engine according to claim 1, characterized in that, in step S5, the "busyness" of the other servers is judged by means of Nginx load balancing.

3. The indicator computation method for a business risk rule engine according to claim 2, characterized in that the strategy by which Nginx implements load balancing is round-robin distribution: the indicator computation tasks are assigned to the other servers one by one in chronological order, and if a server goes down it is automatically removed and the round robin continues over the remaining servers.

4. The indicator computation method for a business risk rule engine according to claim 2, characterized in that the strategy by which Nginx implements load balancing is weighted distribution: the weight of each accessed server is configured by monitoring the CPU usage of the other servers, which specifies the probability of that server being accessed, the weight being proportional to the access probability.

5. The indicator computation method for a business risk rule engine according to claim 1, characterized in that the cache module is a cache, i.e. a high-speed cache memory.

6. The indicator computation method for a business risk rule engine according to claim 1, characterized in that, in step S5, the nginx load-balancing strategy by which the Cluster Manager judges the "busyness" of the other servers is round-robin distribution: the indicator computation tasks are assigned to the other servers one by one in chronological order, and if a server goes down it is automatically removed and the round robin continues over the remaining servers.

7. The indicator computation method for a business risk rule engine according to claim 1, characterized in that, in step S5, the nginx load-balancing strategy by which the Cluster Manager judges the "busyness" of the other servers is weighted distribution: the weight of each accessed server is configured by monitoring the CPU usage of the other servers, which specifies the probability of that server being accessed, the weight being proportional to the access probability.