CN112734583A

CN112734583A - Multithreading parallel computing method for life insurance actuarial model

Info

Publication number: CN112734583A
Application number: CN202110054321.XA
Authority: CN
Inventors: 陈曦; 陈森
Original assignee: Shenzhen Light Shanghai Technology Co ltd
Current assignee: Shenzhen Light Shanghai Technology Co ltd
Priority date: 2021-01-15
Filing date: 2021-01-15
Publication date: 2021-04-30

Abstract

The invention discloses a multithreading parallel computing method of a life insurance actuarial model. The method comprises the following steps: setting a plurality of computing threads, and copying all computing modules of the life insurance actuarial model to each computing thread; after the life insurance actuatedlodel starts to operate, inputting a data block with the number of records not exceeding a set threshold value for each calculation thread from an evaluation database; new data blocks are entered for each computing thread that completes the task of entering data block computations until all records in the evaluation database have been processed. The invention can obviously reduce the time consumption difference of different calculation threads for completing all calculation tasks, thereby reducing the waiting time for result summarization and improving the running speed of the life insurance actuarial model.

Description

Multithreading parallel computing method for life insurance actuarial model

Technical Field

The invention belongs to the technical field of life insurance actuarial, and particularly relates to a multi-thread parallel computing method of a life insurance actuarial model.

Background

The life insurance actuarial evaluation work needs to be carried out by using a life insurance actuarial model. The life insurance actuarial model for the life insurance evaluation work is a mathematical model which quantitatively describes various cash flows, liabilities, repayment capacity costs, profits, and the reduction values of the cash flows and the profits of the insurance company including premium and claim expenses and the like in the future by using actuarial and statistical professional methods on the basis of products sold by the insurance company. The actuarial model is generally composed of a series of calculation modules, and the most important basic constituent unit of each calculation module is a time series, such as the number of deaths per unit time period in the future, the expected death claim amount, the daily management cost of an insurance company, and the like. A actuarial model is typically made up of tens to hundreds of computing modules, containing thousands to tens of thousands of time series.

When a specific life insurance assessment work is carried out, the assessment personnel butt the life insurance actuarial model to the assessment database of the company. The database contains information on all the valid policies of the company, typically hundreds of thousands to millions of records. For each record, the actuarial evaluation model will calculate the results of all involved calculation modules. For the calculation modules needing to be output, the actuarial evaluation model can aggregate the calculation results. Due to the fact that a plurality of calculation modules are involved, the calculation is complex, the data size is large, the time consumption of the life risk assessment process is long, and the longest time can reach dozens of hours or even hundreds of hours. Therefore, in the prior art, a plurality of computing threads are generally adopted to run in parallel to reduce the operation time. For example, the evaluation database is divided into a plurality of sub-databases, and they are respectively connected to different computing threads. Although the method can effectively improve the calculation speed, certain problems exist, for example, because the number of data records of each sub-database and the complexity of prediction for the records are different, the time consumption for each calculation thread to complete the calculation task is different, sometimes the time consumption difference may be large, and final result summarization needs to be performed after the calculation tasks of all threads are completed, so that the overall speed of life risk assessment is influenced by waiting for one or two calculation threads consuming a lot of time.

Disclosure of Invention

In order to solve the problems in the prior art, the invention provides a multi-thread parallel computing method of a life insurance actuarial model.

In order to achieve the purpose, the invention adopts the following technical scheme:

a life insurance actuarial model multithread parallel computing method comprises the following steps:

step 1, setting a plurality of computing threads, and copying all computing modules of a life insurance actuarial model to each computing thread;

step 2, after the life insurance actuarial model starts to operate, inputting a data block with the number of records not exceeding a set threshold value for each calculation thread from an evaluation database;

and 3, inputting a new data block for each calculation thread which finishes the calculation task of the input data block until all records in the evaluation database are processed.

Compared with the prior art, the invention has the following beneficial effects:

according to the invention, a plurality of calculation threads are arranged, a data block with the number of records not exceeding a set threshold value is input for each calculation thread from the evaluation database, a new data block is input for each calculation thread for completing the calculation task of the input data block until all records in the evaluation database are processed, so that the time consumption difference of completing all calculation tasks by different calculation threads is obviously reduced, the waiting time for result summarization is reduced, and the running speed of the life insurance actuarial model is improved.

Drawings

Fig. 1 is a flowchart of a multi-thread parallel computing method of a life insurance actuarial model according to an embodiment of the present invention.

Detailed Description

The present invention will be described in further detail with reference to the accompanying drawings.

The embodiment of the invention provides a multithreading parallel computing method of a life insurance actuarial model, a flow chart is shown in figure 1, and the method comprises the following steps:

s101, setting a plurality of computing threads, and copying all computing modules of the life insurance actuarial model to each computing thread;

s102, after the life insurance actuarial model starts to operate, inputting a data block with the number of records not exceeding a set threshold value for each calculation thread from an evaluation database;

and S103, inputting a new data block for each calculation thread completing the calculation task of the input data block until all records in the evaluation database are processed.

In this embodiment, step S101 is mainly used to set up a plurality of computing threads, and copy all computing modules of the life insurance actuarial model to each computing thread. All the calculation modules are copied to each calculation thread, mainly considering that for most calculation data points, all the calculation modules are needed in the calculation process. Of course, there are few cases, such as when a product has not been delivered by death, and the calculation process does not use the calculation module for the delivery by death. However, this embodiment copies all of the compute modules onto each thread, considering that it is difficult to determine whether a thread will be assigned to a data point for a particular product, and the copy process for the compute modules does not incur much resource overhead.

In this embodiment, step S102 is mainly used to input one data block for each computing thread. The data block contains a small number of data records, colloquially referred to as small data blocks, from the evaluation database, which are small in number (less than a set threshold). The assessment database contains all the information of the valid policy, typically hundreds of thousands to millions of records. The prior art divides the evaluation database into a plurality of sub-databases and connects them to different computing threads, respectively. The method has the problem that the time consumption for each calculation thread to complete the calculation task is different even if the same number of data records are allocated to each sub-database because the calculation complexity of the data records for different products is different. Sometimes, the difference in the length of the consumed time may be large, which may affect the overall speed of the life risk assessment by waiting for one or two long computing threads. For this reason, in the embodiment, only one small data block is input to each calculation thread each time, and each calculation thread inputs a new small data block after completing the calculation of inputting the small data block until completing the calculation of all the data records in the evaluation database. The maximum waiting time after the mechanism is adopted is less than the time Tm for completing the calculation of one small data block by one calculation thread with the slowest calculation speed, and the calculation thread just starts the calculation of a new data block corresponding to the worst condition that other calculation threads complete the calculation. Since the data size of the small data block is small, Tm is also small and latency will be greatly reduced. If only from the standpoint of minimizing latency, the smaller the data block (the fewer the number of data records contained), the better; however, when the data block is too small, the data block needs to be input frequently for each computing thread, thereby affecting the operation speed to a certain extent, and therefore the data block size should be compromised. In this embodiment, for the sake of processing simplicity, each data block may contain the same number of data records; of course, each data block may contain different numbers of data records according to actual conditions.

In the present embodiment, step S103 is mainly used to input a new data block for each computing thread that completes the task of inputting a data block computation. Since a data block contains a small number of records, each computing thread quickly performs the task of computing a data block, requiring new data blocks to be continuously entered for them until all records in the evaluation database have been processed.

As an alternative embodiment, the S102 further includes: starting a thread for storing a certain number of said data blocks obtained from the evaluation database into a buffer and inputting a data block from the buffer for each calculation thread.

The embodiment provides a technical scheme for improving the operation speed by setting a cache area for storing data blocks. Since the number of records contained in the evaluation database is huge and takes up a large storage space, the evaluation database is generally stored on a hard disk in the form of a database file. It is known that the speed of data read/write on a hard disk is significantly slower than that of data read/write on a memory. Therefore, in order to increase the operation speed, in this embodiment, a memory buffer is provided instead of directly reading data blocks from the evaluation database and inputting the data blocks to the computing thread, a separate thread is started, a certain number of data blocks read from the evaluation database are stored in the buffer, and then the data blocks are read from the buffer and input to the computing thread.

As an alternative embodiment, the method further comprises: and monitoring the consumed time of each computing thread for completing the computation of one data block in real time, and automatically adjusting the number of records contained in the data block input to each computing thread according to the consumed time to ensure that the consumed time of each computing thread is approximately equal.

The embodiment provides a technical scheme for automatically adjusting the size of a data block. As mentioned above, since the data records of different products have different calculation complexity, even if each data block contains the same number of data records, the time taken by each calculation thread to complete the calculation task of one data block is different. In order to further reduce the waiting time, the embodiment appropriately reduces the number of records contained in the input data block for the calculation thread with longer time consumption by monitoring the time consumption of completing the calculation of one data block by each calculation thread in real time; for the calculation thread with shorter time consumption, the number of records contained in the data block input by the calculation thread is properly increased, so that the time consumption of all the calculation threads tends to be equal.

The above description is only for the purpose of illustrating a few embodiments of the present invention, and should not be taken as limiting the scope of the present invention, in which all equivalent changes, modifications, or equivalent scaling-up or down, etc. made in accordance with the spirit of the present invention should be considered as falling within the scope of the present invention.

Claims

1. A life insurance actuarial model multithread parallel computing method is characterized by comprising the following steps:

2. The life insurance actuarial model multithread parallel computing method according to claim 1, wherein the step 2 further comprises: starting a thread for storing a certain number of said data blocks obtained from the evaluation database into a buffer and inputting a data block from the buffer for each calculation thread.

3. The life insurance actuarial model multithread parallel computing method according to claim 1, wherein the method further comprises: and monitoring the consumed time of each computing thread for completing a data block computing task in real time, and automatically adjusting the number of records contained in the data block input to each computing thread according to the consumed time so as to approximately equalize the consumed time of each computing thread.