CN112949866A

CN112949866A - Poisson regression model training method and device, electronic equipment and storage medium

Info

Publication number: CN112949866A
Application number: CN202110322421.6A
Authority: CN
Inventors: 吴玙; 范涛; 马国强; 谭明超; 魏文斌; 郑会钿; 陈天健; 杨强
Original assignee: WeBank Co Ltd
Current assignee: WeBank Co Ltd
Priority date: 2021-03-25
Filing date: 2021-03-25
Publication date: 2021-06-11

Abstract

The application provides a method and a device for training a Poisson regression model, which are applied to a federal learning system; the method comprises the following steps: the parameter aggregation equipment acquires model parameter pairs corresponding to at least two Poisson regression models, and the model parameter pairs and the Poisson regression models are in one-to-one correspondence; each model parameter pair includes: model parameters of a first submodel in a first participant device and model parameters of a corresponding second submodel in a second participant device; respectively carrying out model parameter polymerization on model parameters of a first sub-model and model parameters of a second sub-model in at least two model parameter pairs to obtain corresponding first polymerization model parameters and second polymerization model parameters; and sending the first aggregation model parameters to the first participant equipment, and sending the second aggregation model parameters to each second participant equipment so as to update the model parameters of the Poisson regression model. By the method and the device, the number of feature providers in the Poisson regression model in federal learning can be increased, and the model prediction capability is improved.

Description

Poisson regression model training method and device, electronic equipment and storage medium

Technical Field

The present application relates to the field of artificial intelligence technologies, and in particular, to a method and an apparatus for training a poisson regression model, an electronic device, and a computer-readable storage medium.

Background

The federal learning technology is a novel privacy protection technology, and can effectively combine data of all parties to carry out model training on the premise that the data cannot be out of the local.

In the related longitudinal poisson regression model training, limited by the characteristics of poisson modeling, currently, a single modeling can only support a single label provider and a single feature provider to participate in the model training. In practical applications, a scenario exists in which one tag provider needs to perform poisson regression modeling with multiple feature providers. For example, when an insurance enterprise predicts the future number of claims of a user, the insurance enterprise needs to combine a plurality of feature providers because of the newer insurance types or the smaller number of specific samples of a certain type. Then, the training by using the relevant longitudinal poisson regression model easily leads to overfitting of the result model, and the model prediction result is inaccurate, such as leading to too high or too low number of predicted claims.

Disclosure of Invention

The embodiment of the application provides a method and a device for training a poisson regression model, an electronic device, a computer readable storage medium and a computer program product, which can expand the number of feature providers in the poisson regression model in federal learning, expand the application range of the poisson regression model and improve the prediction capability of the model.

The technical scheme of the embodiment of the application is realized as follows:

the embodiment of the application provides a method for training a Poisson regression model, which is applied to a federated learning system, wherein the system comprises a first participant device with a model training label, at least two second participant devices and a parameter aggregation device, and the method comprises the following steps:

the parameter aggregation equipment acquires at least two model parameter pairs corresponding to the Poisson regression models, wherein the model parameter pairs and the Poisson regression models are in one-to-one correspondence;

wherein each of the model parameter pairs comprises: model parameters of a first submodel in the first participant device and model parameters of a corresponding second submodel in the second participant device;

performing model parameter aggregation on the model parameters of the first sub-model and the model parameters of the second sub-model in the at least two model parameter pairs respectively to obtain corresponding first aggregation model parameters and second aggregation model parameters;

and sending the first aggregation model parameters to the first participant equipment, and sending the second aggregation model parameters to each second participant equipment so as to update the model parameters of the Poisson regression model.

The embodiment of the application provides a training device of a Poisson regression model, which is applied to a federated learning system, wherein the system comprises a first participant device with a model training label, at least two second participant devices and a parameter aggregation device, and the device comprises:

the acquisition module is used for acquiring model parameter pairs corresponding to at least two Poisson regression models by parameter aggregation equipment, wherein the model parameter pairs and the Poisson regression models are in one-to-one correspondence;

the aggregation module is used for respectively carrying out model parameter aggregation on the model parameters of the first sub-model and the model parameters of the second sub-model in the at least two model parameter pairs to obtain corresponding first aggregation model parameters and second aggregation model parameters;

and the training module is used for sending the first aggregation model parameters to the first participant equipment and sending the second aggregation model parameters to the second participant equipment so as to update the model parameters of the Poisson regression model.

In the above scheme, the aggregation module is further configured to obtain a first weight of a model parameter of a first sub-model in each of the model parameter pairs and a second weight of a model parameter of a second sub-model in each of the model parameter pairs, respectively;

based on the obtained first weights, carrying out weighted summation on model parameters of at least two first submodels to obtain a first aggregation model parameter;

and carrying out weighted summation on the model parameters of at least two second submodels based on the obtained second weights to obtain second aggregation model parameters.

In the above scheme, the obtaining module is further configured to obtain a first number of training samples provided by the first participant device for performing model training with each of the second participant devices, a second number of training samples provided by each of the second participant devices, and a total number of training samples used for poisson regression model training;

determining a first ratio of the first number to the total number, the first ratio being the first weight;

determining a second ratio of the second number to the total number, the second ratio being used as a second weight of the model parameters of the respective second submodel.

In the above scheme, when the number of the first party devices is the same as the number of the second party devices, the first party devices and the second party devices are in a one-to-one correspondence relationship; the obtaining module is further configured to obtain a third number of training samples provided by the first participant device, a fourth number of training samples provided by the corresponding second participant device, and a total number of training samples used for poisson regression model training, respectively;

determining a third ratio of the third number to the total number, the third ratio being the first weight;

determining a fourth ratio of the fourth number to the total number, the fourth ratio being the second weight.

In the above scheme, the obtaining module is further configured to enable the creating module to create a key pair, where the key pair includes a public key and a private key;

distributing the public key to the first participant device and the at least two second participant devices;

the public key is used for the first participant device and each second participant device to encrypt the model parameter based on the public key.

In the above scheme, the obtaining module is further configured to receive the model parameters of the encrypted first sub-model sent by the first party device and the model parameters of the encrypted second sub-model sent by the at least two second party devices;

constructing at least two encrypted model parameter pairs corresponding to the Poisson regression model based on the encrypted model parameters of the first sub-model and the encrypted model parameters of the second sub-model;

and respectively decrypting the model parameter of the first sub-model and the model parameter of the second sub-model in the encrypted model parameter pair based on the private key to obtain the decrypted model parameter pair.

In the foregoing scheme, when the number of the first participant devices is at least two, the aggregation module is further configured to obtain a mapping relationship between the first participant device and the second participant device, where the mapping relationship is used to indicate the second participant device performing poisson regression model training with each of the first participant devices;

constructing a model parameter group corresponding to each first participant device based on the mapping relation and the obtained at least two model parameter pairs;

wherein the model parameter set comprises a first number of model parameter pairs; the first number is the number of the second participant devices performing poisson regression model training with the first parameter party device;

performing parameter aggregation on the model parameter pairs in each model parameter group respectively to obtain initial aggregation parameters corresponding to each first sub-model and initial aggregation parameters corresponding to each second sub-model;

and carrying out parameter polymerization on the initial polymerization parameters corresponding to the first submodels to obtain first polymerization model parameters, and carrying out parameter polymerization on the initial polymerization parameters corresponding to the second submodels to obtain second polymerization model parameters.

In the foregoing scheme, the obtaining module is further configured to receive a first notification message, where the first notification message is sent by a target second participant device in at least two second participant devices, and is used to indicate that a poisson regression model in which the target second participant device participates in training meets a convergence condition;

storing a pair of model parameters when the Poisson regression model reaches a convergence condition based on the first notification message;

correspondingly, obtaining at least two model parameter pairs corresponding to the poisson regression model includes:

obtaining a model parameter pair when the Poisson regression models corresponding to other second participant devices except the target second participant device do not reach a convergence condition;

correspondingly, the aggregation module is further configured to aggregate the model parameters of the first sub-model and the model parameters of the second sub-model in the stored model parameter pairs, the model parameter pairs of the poisson regression model that do not reach the convergence condition, and obtain corresponding first aggregation model parameters and second aggregation model parameters.

In the above scheme, the training module is further configured to obtain a model convergence condition of the at least two poisson regression models;

based on the model convergence condition, when determining that a Poisson regression model meeting the model convergence condition exists in the at least two Poisson regression models, sending a second notification message to second participant equipment corresponding to the Poisson regression model meeting the model convergence condition;

wherein the second notification message is configured to notify the second participant device to stop training of the corresponding poisson regression model.

An embodiment of the present application provides an electronic device, including:

a memory for storing executable instructions;

and the processor is used for realizing the training method of the Poisson regression model provided by the embodiment of the application when the executable instructions stored in the memory are executed.

An embodiment of the present application provides a computer-readable storage medium, which stores executable instructions for causing a processor to execute the method for training a poisson regression model provided in the embodiment of the present application.

The embodiment of the present application provides a computer program product, which includes a computer program, and when the computer program is executed by a processor, the computer program implements the method for training a poisson regression model provided in the embodiment of the present application.

The embodiment of the application has the following beneficial effects:

compared with the technology that the modeling of the related single Poisson regression can only support the modeling of one first participant device and one second participant device, the Federal learning system which comprises the first participant device with the model training label, at least two second participant devices and the parameter aggregation device is built to support the multi-party combined Poisson regression modeling. The parameter aggregation equipment obtains model parameter pairs corresponding to at least two Poisson regression models, each model parameter pair comprises a model parameter of a first sub-model in the first participant equipment and a model parameter of a corresponding second sub-model in the second participant equipment, namely the first participant equipment and each second parameter party equipment respectively carry out Poisson regression model training, each training obtains model parameter pairs with the same number as the second participant equipment, and the model parameter pairs are uploaded to the parameter aggregation equipment, so that the number of the second participant equipment trained by the Poisson regression models can be supplemented. The parameter aggregation equipment carries out model parameter aggregation on model parameters of a first sub-model and model parameters of a second sub-model in at least two model parameter pairs respectively to obtain corresponding first aggregation model parameters and second aggregation model parameters; and sending a first aggregation model parameter to the first participant device, and sending a second aggregation model parameter to each second participant device to update the model parameters of the Poisson regression model.

Drawings

Fig. 1 is an alternative structural diagram of a training system of a poisson regression model provided in an embodiment of the present application;

fig. 2 is an alternative structural schematic diagram of an electronic device provided in an embodiment of the present application;

FIG. 3 is a schematic flow chart of an alternative method for training a Poisson regression model according to an embodiment of the present disclosure;

4A-4C are schematic flow diagrams illustrating an alternative method for training a Poisson regression model provided by an embodiment of the present application;

FIG. 5 is a schematic flow chart of an alternative method for training a Poisson regression model according to an embodiment of the present disclosure;

FIG. 6 is a schematic flow chart of an alternative method for training a Poisson regression model according to an embodiment of the present disclosure;

FIG. 7 is a schematic flow chart of an alternative method for training a Poisson regression model according to an embodiment of the present disclosure;

fig. 8 is an alternative flowchart of a method for training a poisson regression model according to an embodiment of the present disclosure.

Detailed Description

In order to make the objectives, technical solutions and advantages of the present application clearer, the present application will be described in further detail with reference to the attached drawings, the described embodiments should not be considered as limiting the present application, and all other embodiments obtained by a person of ordinary skill in the art without creative efforts shall fall within the protection scope of the present application.

In the following description, reference is made to "some embodiments" which describe a subset of all possible embodiments, but it is understood that "some embodiments" may be the same subset or different subsets of all possible embodiments, and may be combined with each other without conflict.

Where similar language of "first/second" appears in the specification, the following description is added, and where reference is made to the term "first \ second \ third" merely for distinguishing between similar items and not for indicating a particular ordering of items, it is to be understood that "first \ second \ third" may be interchanged both in particular order or sequence as appropriate, so that embodiments of the application described herein may be practiced in other than the order illustrated or described herein.

Unless defined otherwise, all technical and scientific terms used herein have the same meaning as commonly understood by one of ordinary skill in the art to which this application belongs. The terminology used herein is for the purpose of describing embodiments of the present application only and is not intended to be limiting of the application.

Before further detailed description of the embodiments of the present application, terms and expressions referred to in the embodiments of the present application will be described, and the terms and expressions referred to in the embodiments of the present application will be used for the following explanation.

1) Federal Learning (fed Learning) is a machine Learning framework of distributed Learning, and efficient machine Learning is developed among computing devices of multiple parties on the premise of guaranteeing information security during data exchange, protecting privacy of terminal data and personal data and guaranteeing legal compliance.

2) Vertical federal Learning (also called sample-aligned federal Learning), which increases the feature dimension of training data, and under the condition that the data feature overlap of participants is small and the users overlap more, takes out the part of users and data with the same users and different user data features to jointly train the machine Learning model.

For example, there are two participants a and B belonging to the same region, where participant a is a bank and participant B is an e-commerce platform. Participants a and B have more users in the same area, but a and B have different services and different recorded user data characteristics. In particular, the user data characteristics of the a and B records may be complementary. In such a scenario, vertical federated learning may be used to help a and B build a joint machine learning predictive model, helping a and B provide better service to their customers.

3) Poisson regression is mainly applicable to analysis of rare outcome incidence, and the data characteristics are generally as follows: the denominator of the occurrence is large relative to the numerator, or the event occurs in a limited temporal, spatial sense. The number of event occurrences may be approximated as obeying a poisson distribution. Common examples are the number of accidental deaths by soldiers, the number of telephone dialing errors, the incidence of rare illnesses, etc.

The inventor finds that in the related longitudinal poisson regression model technology, due to the characteristics of poisson modeling, a single modeling can only support modeling of a single label provider and a single feature provider, so that overfitting of a result model is easily caused, and a model prediction result is inaccurate.

Based on this, embodiments of the present application provide a method, an apparatus, an electronic device, a computer-readable storage medium, and a computer program product for training a poisson regression model, which can extend an application range of the poisson regression model, support multi-party combined modeling, ensure security of multi-party data, and improve prediction capability of the model.

First, a description is given to a training system of a poisson regression model provided in this embodiment, referring to fig. 1, fig. 1 is an optional architecture schematic diagram of the training system 100 of the poisson regression model provided in this embodiment, a first participant device 400, a second participant device 410 (2 second participant devices are exemplarily shown, and are respectively denoted as 410-1 and a terminal 410-2 for distinction), the first participant device 400 and the second participant device 410 are connected to each other through a network 300 while being connected to a parameter aggregation device 200 through the network 300, the network 300 may be a wide area network or a local area network, or a combination of the two, and data transmission is achieved using a wireless link.

In some embodiments, the first participant device 400 and the second participant device 410 may be, but are not limited to, a laptop computer, a tablet computer, a desktop computer, a smart phone, a dedicated messaging device, a portable gaming device, a smart speaker, a smart watch, etc., and may also be client terminals of federal learning participants, such as participant devices storing user characteristic data at various banks or financial institutions, etc. The parameter aggregation device 200 may be an independent physical server, a server cluster or a distributed system formed by a plurality of physical servers, or a cloud server providing basic cloud computing services such as a cloud service, a cloud database, cloud computing, a cloud function, cloud storage, a Network service, cloud communication, a middleware service, a domain name service, a security service, a Content Delivery Network (CDN), a big data and artificial intelligence platform, and is configured to assist each participant device in performing federal learning to obtain a federal learning model. The network 300 may be a wide area network or a local area network, or a combination of both. The first participant device 400, the second participant device 410 and the parameter aggregation device 200 may be directly or indirectly connected through wired or wireless communication, and the embodiments of the present application are not limited thereto.

The first participant device 400 is configured to perform poisson regression model training with the second participant device 410-1 and the second parameter device 410-2, respectively, generate model parameters of a local first sub-model, send the model parameters of the first sub-model to the parameter aggregation device 200, receive the first aggregation model parameters sent by the parameter aggregation device 200, and update the model parameters of the poisson regression model according to the first aggregation model parameters.

The second participant device 410 is configured to perform poisson regression model training with the first participant device 400, generate a model parameter of a local second sub-model, send the model parameter of the second sub-model to the parameter aggregation device 200, receive the second aggregation model parameter sent by the parameter aggregation device 200, and update the model parameter of the poisson regression model according to the second aggregation model parameter.

The parameter aggregation device 200 is configured to obtain at least two model parameter pairs corresponding to poisson regression models, where the model parameter pairs and the poisson regression models are in a one-to-one correspondence relationship; the model parameter pairs comprise model parameters of a first sub-model in the first participant device 400 and model parameters of a corresponding second sub-model in the second participant device 410; respectively carrying out model parameter polymerization on model parameters of a first sub-model and model parameters of a second sub-model in at least two model parameter pairs to obtain corresponding first polymerization model parameters and second polymerization model parameters; the first aggregation model parameters are sent to the first participant device 400, and the second aggregation model parameters are sent to the second participant device 410-1 and the second participant device 410-2, so as to update the model parameters of the poisson regression model.

Referring to fig. 2 and fig. 2 are schematic structural diagrams of an optional electronic device 500 provided in the embodiment of the present application, in practical applications, the electronic device 500 may be implemented as the terminal 400 or the server 200 in fig. 1, and the electronic device implementing the method for training a poisson regression model in the embodiment of the present application is described by taking the electronic device as the server 200 shown in fig. 1 as an example. The electronic device 500 shown in fig. 2 includes: at least one processor 510, memory 550, at least one network interface 520, and a user interface 530. The various components in the electronic device 500 are coupled together by a bus system 540. It will be appreciated that the bus system 540 is used to enable communications among the components. The bus system 540 includes a power bus, a control bus, and a status signal bus in addition to a data bus. For clarity of illustration, however, the various buses are labeled as bus system 540 in fig. 2.

The Processor 510 may be an integrated circuit chip having Signal processing capabilities, such as a general purpose Processor, a Digital Signal Processor (DSP), or other programmable logic device, discrete gate or transistor logic device, discrete hardware components, or the like, wherein the general purpose Processor may be a microprocessor or any conventional Processor, or the like.

The user interface 530 includes one or more output devices 531 enabling presentation of media content, including one or more speakers and/or one or more visual display screens. The user interface 530 also includes one or more input devices 532, including user interface components to facilitate user input, such as a keyboard, mouse, microphone, touch screen display, camera, other input buttons and controls.

The memory 550 may be removable, non-removable, or a combination thereof. Exemplary hardware devices include solid state memory, hard disk drives, optical disk drives, and the like. Memory 550 optionally includes one or more storage devices physically located remote from processor 510.

The memory 550 may comprise volatile memory or nonvolatile memory, and may also comprise both volatile and nonvolatile memory. The nonvolatile Memory may be a Read Only Memory (ROM), and the volatile Memory may be a Random Access Memory (RAM). The memory 550 described in embodiments herein is intended to comprise any suitable type of memory.

In some embodiments, memory 550 can store data to support various operations, examples of which include programs, modules, and data structures, or subsets or supersets thereof, as exemplified below.

An operating system 551 including system programs for processing various basic system services and performing hardware-related tasks, such as a framework layer, a core library layer, a driver layer, etc., for implementing various basic services and processing hardware-based tasks;

a network communication module 552 for communicating to other computing devices via one or more (wired or wireless) network interfaces 520, exemplary network interfaces 520 including: bluetooth, wireless compatibility authentication (WiFi), and Universal Serial Bus (USB), etc.;

a presentation module 553 for enabling presentation of information (e.g., a user interface for operating peripherals and displaying content and information) via one or more output devices 531 (e.g., a display screen, speakers, etc.) associated with the user interface 530;

an input processing module 554 to detect one or more user inputs or interactions from one of the one or more input devices 532 and to translate the detected inputs or interactions.

In some embodiments, the training apparatus of the poisson regression model provided in the embodiments of the present application may be implemented in software, and fig. 2 illustrates the training apparatus 555 of the poisson regression model stored in the memory 550, which may be software in the form of programs and plug-ins, and includes the following software modules: an acquisition module 5551, an aggregation module 5552, and a training module 5553, which are logical and thus may be arbitrarily combined or further split depending on the functions implemented. The functions of the respective modules will be explained below.

In other embodiments, the training Device of the poisson regression model provided in the embodiments of the present Application may be implemented in hardware, and as an example, the training Device of the poisson regression model provided in the embodiments of the present Application may be a processor in the form of a hardware decoding processor, which is programmed to execute the training method of the poisson regression model provided in the embodiments of the present Application, for example, the processor in the form of the hardware decoding processor may employ one or more Application Specific Integrated Circuits (ASICs), DSPs, Programmable Logic Devices (PLDs), Complex Programmable Logic Devices (CPLDs), Field Programmable Gate Arrays (FPGAs), or other electronic components.

In a poisson regression model training system in the related art, a single modeling only supports a single label provider and a single feature provider, and the overfitting of a prediction result is easy to occur when model prediction is performed under the condition of few training samples. Based on this, in order to increase the number of training samples and adapt to an application scenario in which multiple feature providers participate jointly, the application provides a poisson regression model training system based on federated learning, wherein the training system comprises at least one first participant device (i.e. a label provider) with a model training label, at least two second participant devices (i.e. multiple feature providers) and a parameter aggregation device; compared with the framework of a model training system in the related art, the number of feature providers is increased, a plurality of feature providers and label providers jointly model, and the plurality of feature providers provide training samples to increase the number of the training samples; on the basis, the label provider and each feature provider respectively perform Poisson regression modeling to obtain model parameter pairs with the same number as the feature providers, each model parameter pair comprises a model parameter of a local submodel of the label provider and a model parameter of a corresponding local submodel of the feature provider, and the parameter aggregation equipment respectively performs parameter aggregation on all model parameters sent by the label provider based on the obtained model parameter pairs to obtain aggregation model parameters corresponding to the label provider, and performs parameter aggregation on the model parameters sent by all the feature providers to obtain aggregation model parameters corresponding to the feature providers; the method comprises the steps that aggregation model parameters used for updating label providers and local models of all the feature providers are obtained through parameter aggregation, and therefore training of the Poisson regression model is achieved under the scene of combined modeling of the feature providers and the label providers. Next, with reference to an exemplary application and implementation of the parameter aggregation device provided in the embodiment of the present application, a method for training a poisson regression model provided in the embodiment of the present application is described, where the method is applied to a federated learning system, and the federated learning system includes at least one first participant device having a model training tag, at least two second participant devices, and a parameter aggregation device.

Referring to fig. 3, fig. 3 is an alternative flowchart of a method for training a poisson regression model provided in the embodiment of the present application, and will be described with reference to the steps shown in fig. 3.

In step 101, the parameter aggregation device obtains at least two model parameter pairs corresponding to the poisson regression models, where the model parameter pairs are in a one-to-one correspondence with the poisson regression models, and each model parameter pair includes a model parameter of a first sub-model in the first participant device and a model parameter of a corresponding second sub-model in the second participant device.

Here, the first participant device and the at least two second participant devices perform poisson regression model training, respectively, and there are poisson regression models equal in number to the second participant devices.

Explaining at least two Poisson regression models, wherein each second participant device and the corresponding first participant device cooperatively train a Poisson regression model, the number of the Poisson regression models is the same as that of the second participant devices, and the reason why a plurality of Poisson regression models exist is that sample intersections of training samples provided by the first participant device and the second participant device are different, that is, the number of samples provided by the first participant device and the second participant device for Poisson regression model training is often different, but meanings of all Poisson regression models are the same, that is, meanings of prediction results obtained by trained models are the same.

In some embodiments, each poisson regression model, after each round of training, generates a corresponding pair of model parameters, where each pair of model parameters includes a model parameter of a first sub-model in a first participant device and a model parameter of a corresponding second sub-model in a second participant device. That is, for each round of model training, a pair of model parameters corresponding to the poisson regression model one to one is generated.

In practical implementation, when the number of the first participant devices is one, the sample intersection of the first participant devices and each of the second participant devices is different, each of the second participant devices trains a poisson regression model through a common training sample with the first participant device, and the number of the poisson regression models is consistent with the number of the second participant devices.

Referring to fig. 4A, fig. 4A is an alternative flowchart of a method for training a poisson regression model according to an embodiment of the present disclosure. The first participant device provides training sample 1 and training sample 2, training poisson regression model 1 with training sample 1 (providing feature data x1, x2, label data y) and second participant 1 (providing feature data x3, x4) to obtain model parameter pairs 1{ Θ 1, Θ 2, Θ 3, Θ 4}, where { Θ 1, Θ 2} is from a first sub-model corresponding to the first participant device, and { Θ 3, Θ 4} is from a second sub-model corresponding to the second participant device; training sample 2 (providing characteristic data x1, x2) and the second participant 2 (providing characteristic data x3, x4) train the poisson regression model 2, and the meaning of the obtained model parameter pair 2, poisson regression model 1 and poisson regression model 2 is the same, namely the meaning of the model predicted value y is the same (for example, for a certain newly added vehicle risk type, the number of reimbursements in the next year is predicted). Wherein x1, x2, x3 and x4 are all the same meaning and only have different specific data.

In practical implementation, when the first participant device corresponds to the second participant device, each first participant device and one second participant device train a poisson regression model together, that is, the number of poisson regression models is consistent with the number of second participants.

For example, referring to fig. 4B, fig. 4B is an alternative flowchart of a method for training a poisson regression model provided in the embodiment of the present application. A first participant device 1 (providing characteristic data x1, x2 and label data y) and a second participant device 1 (providing characteristic data x3 and x4) train a poisson regression model 1 to obtain model parameter pairs 1{ Θ 1, Θ 2, Θ 3 and Θ 4 }; the first participant device 2 and the second participant device 2 train the poisson regression model 2 to obtain a model parameter pair 2, and the poisson regression model 1 and the poisson regression model 2 have the same meaning, namely the predicted values of the models are y.

In practical implementation, the first participant device and the second participant device are both multiple, and the first participant device can perform model training with any second participant device.

For example, referring to fig. 4C, fig. 4C is an alternative flowchart of a method for training a poisson regression model provided in the embodiment of the present application. The number of the first participant devices is two, the number of the second participant devices is two, the first participant device 1 (providing the feature data x1, x2 and the label data y) provides a training sample 1 and a training sample 2, and the training sample 1 and the second participant 1 (providing the feature data x3 and x4) jointly train a poisson regression model 1 to obtain a model parameter pair 1; the training sample 2 and a training sample 3 provided by the second participant device 2 jointly train a Poisson regression model 2 to obtain a model parameter pair 2; the first participant device 2 (providing the feature data x1, x2, tag data y) trains the poisson regression model 3 together with the training samples 4 provided by the second participant 2 (providing the feature data x3, x4), resulting in model parameter pairs 3. The Poisson regression model 1, the Poisson regression model 2 and the Poisson regression model 3 have the same meaning, namely the predicted values of the models are y.

In some embodiments, before the parameter aggregation device obtains the model parameter pairs corresponding to at least two poisson regression models, each poisson regression model needs to be trained for a preset round, specifically, the parameter aggregation device creates a key pair, and the key pair includes a public key and a private key; distributing the public key to the first participant device and to at least two second participant devices; and the public key is used for encrypting the model parameters by the first participant equipment and each second participant equipment based on the public key.

Further, based on the received public key, the first participant device performing poisson regression model training, and the corresponding second participant device perform the following operations, respectively: respectively calculating local intermediate results based on the public key to obtain encrypted intermediate results; respectively calculating encryption model parameters of respective local models based on the encryption intermediate result to obtain the model parameters of the encrypted first sub-model and the encrypted second sub-model; and respectively sending the encrypted model parameters of the first sub-model and the encrypted model parameters of the second sub-model to the parameter aggregation equipment.

In some embodiments, the parameter aggregation device performs a decryption operation on the received model parameters of the first sub-model encrypted by using the public key and the encrypted model parameters of the second sub-model by using the corresponding private key, specifically, receives the encrypted model parameters of the first sub-model sent by the first participant device and the encrypted model parameters of the second sub-model sent by at least two second participant devices; constructing at least two encrypted model parameter pairs corresponding to the Poisson regression model based on the encrypted model parameters of the first sub-model and the encrypted model parameters of the second sub-model; and respectively decrypting the model parameter of the first sub-model and the model parameter of the second sub-model in the encrypted model parameter pair based on the private key to obtain the decrypted model parameter pair.

Illustratively, based on fig. 3, before step 101, a first participant device and a corresponding second participant device perform longitudinal poisson regression model training, referring to fig. 5, fig. 5 is an optional flowchart of a method for training a poisson regression model provided in this embodiment of the present application, and a training process of the longitudinal poisson regression model will be described with reference to each step.

In step 201, the coordinator device creates a key pair and sends the public key to the first participant device and the corresponding second participant device.

Here, the coordinator device is a device that passes a security authentication protocol, and may be the aforementioned parameter aggregation device, a newly added trusted third party server, or a first participant device.

In some embodiments, the computational formula for the model tag in the first participant device is set:

y＝exp(wx)

in the above formula, y is a model label, and can be understood as the number of times a certain event occurs in a certain indefinite time range, such as the number of claims of a certain insurance type in a year. The model label y is directly predicted, multiplication homomorphic encryption is used when encryption residual is subsequently calculated, and the following formula is generally adopted in the calculation formula of the label in the relevant Poisson regression model

log y＝ax+b

The formula actually takes y as log, then carries out linear model related training to obtain model parameters a and b when the model converges, then predicts the value of logy, the actually predicted value y has changed, and when subsequently calculating the encryption residual, the addition homomorphic encryption is used.

In step 202, the first and second participant devices encrypt and exchange intermediate results, where the intermediate results are used to calculate model parameters of the first submodel and model parameters of the second submodel.

Here, the first sub-model is provided by the first participant device, the second sub-model is provided by the second participant device, and the second participant device (i.e., the Host party) obtains the public key in step 201 and uses the public key to calculate the result w of the intermediate calculation of the local model^hjx^hjAnd exp (w)^hjx^hj) Encrypting to obtain encrypted intermediate calculation result [ [ exp (w) ]^hjx^hj)]]And sending the encrypted intermediate calculation result to the corresponding first participant equipment for model training, wherein the first participant equipment uses respective characteristic data

To calculate correspondences

And

then, obtaining an encryption residual value, wherein the encryption mode adopts multiplication homomorphic encryption, and a formula for calculating the encryption residual value is as follows:

in the above-mentioned formula,

is provided by the first participant device [ [ exp (w)^hjx^hj)]]The calculation of the residual error is performed by the second party device, and based on the model label prediction formula corresponding to the poisson regression model provided in the embodiment of the present application, multiplicative homomorphic encryption is used.

It should be noted that, when using the multiplicative homomorphic encryption, the data provided by the first party device is unencrypted, because in the multiplicative homomorphic encryption, the encrypted information and the encrypted information cannot be multiplied, but the encrypted information and the unencrypted information are multiplied, and the above formula for calculating the encryption residue is for the case of one Guest and one Host, and if a plurality of hosts occur, the formula becomes:

and [ [ exp (w) in the above formula^h2x^h2)]]And [ [ exp (w) ]^h1x^h1)]]Both are homomorphic encryption parameters, the two cannot carry out multiplication operation, the result obtained by multiplying the two cannot be restored (in cryptography, if p and q can also be decomposed, the decomposition form of the product of p and q is various, so that decryption cannot be carried out after encryption), and the model parameters received in model training must be restored.

In step 203, the first participant device calculates a first model parameter corresponding to the local first sub-model, and the second participant device calculates a second model parameter corresponding to the local second sub-model, and encrypts the respective model parameters by a public key and sends the encrypted model parameters to the coordinator device.

In step 204, the coordinator device decrypts the encrypted model parameters of the first sub-model and the encrypted model parameters of the second sub-model through the private key, sends the decrypted model parameters of the first sub-model to the first participant device, and sends the decrypted model parameters of the second sub-model to the second participant device, so that the first participant device and the second participant device update the model parameters of their respective local models respectively.

In some embodiments, when the poisson regression model in which the second participant device participates in the training reaches the convergence condition, the second participant device sends a notification message to the parameter aggregation device, and the parameter aggregation device stores a model parameter pair when the poisson regression model reaches the convergence condition, and specifically, receives a first notification message, which is sent by a target second participant device of at least two second participant devices and is used for indicating that the poisson regression model in which the target second participant device participates in the training reaches the convergence condition; storing a pair of model parameters when the Poisson regression model reaches a convergence condition based on the first notification message; correspondingly, obtaining at least two model parameter pairs corresponding to the poisson regression model includes: and obtaining a model parameter pair when the Poisson regression models corresponding to other second participant devices except the target second participant device do not reach the convergence condition.

In practical implementation, when the poisson regression model in which any one of the second participant devices participates in training reaches the convergence condition, the model parameter pair corresponding to the poisson regression model needs to be prestored for use in parameter aggregation.

Illustratively, referring to fig. 4A, when a poisson regression model 1 trained by a training sample 1 together with a second participant device reaches a convergence condition, the obtained model parameter pair 1 is pre-stored.

In step 102, model parameter aggregation is performed on model parameters of a first sub-model and model parameters of a second sub-model in at least two model parameter pairs, so as to obtain corresponding first aggregation model parameters and second aggregation model parameters

Here, the first aggregate model parameter is obtained by performing parameter aggregation on the model parameter of the first submodel, and the second aggregate model parameter is obtained by performing parameter aggregation on the model parameter of the second submodel.

Illustratively, referring to fig. 4A, training the poisson regression model 1 with the training sample 1 provided by the first participant and the second participant 1 yields a model parameter pair 1{ Θ 1, Θ 2, Θ 3, Θ 4}, training the poisson regression model 2 with the training sample 2 provided by the first participant and the second participant 2 yields a model parameter pair 2{ Θ 1, Θ 2, Θ 3, Θ 4}, assuming that { Θ 1, Θ 2} in each model parameter pair is from the first sub-model of the first participant device and { Θ 3, Θ 4} is from the second sub-model of the second participant device. And carrying out safe polymerization on the model parameter pair 1 and the model parameter pair 2, actually carrying out safe polymerization on { theta 1, theta 2} in the model parameter pair 1 and { theta 1, theta 2} in the model parameter pair 2 to obtain a first polymerization model parameter, and carrying out safe polymerization on { theta 3, theta 4} in the model parameter pair 1 and { theta 3, theta 4} in the model parameter pair 2 to obtain a second polymerization model parameter.

In some embodiments, the aggregated model parameter is calculated by obtaining a weight corresponding to the model parameter, specifically, a first weight of the model parameter of the first sub-model in each model parameter pair and a second weight of the model parameter of the second sub-model in each model parameter pair are respectively obtained; based on the obtained first weights, carrying out weighted summation on the model parameters of at least two first submodels to obtain first aggregation model parameters; and carrying out weighted summation on the model parameters of the at least two second submodels based on the obtained second weights to obtain second aggregation model parameters.

Illustratively, 1 first participant device (Guest) and J Host send the encrypted model parameters of the respective local models to the third party arbiters, the parameter aggregation device aggregates all the model parameters provided by the first participant device as first aggregated model parameters by a weighted average value, and the parameter aggregation device aggregates all the model parameters of the second participant device as second aggregated model parameters by a weighted average value.

In some embodiments, the weights corresponding to the model parameters may be determined according to the number of training samples provided by the first participant device and each second participant device and the total data amount of the training samples participating in the model training when the poisson regression model training is performed. Specifically, a first number of training samples provided by the first participant device for performing model training with each second participant device, a second number of training samples provided by each second participant device, and a total number of training samples used for poisson regression model training are obtained; determining a first ratio of the first number to the total number, the first ratio being the first weight; determining a second ratio of the second number to the total number, the second ratio being used as a second weight of the model parameters of the respective second submodel.

Illustratively, the first participant device and the second participant device respectively send each encrypted model parameter and the data amount used for model modeling to the parameter aggregation device. The first participant device sends the encrypted model parameter [ w [ [ w ]_g]]Data quantity n used with model modeling_gThe second participant device sends the encrypted model parameter [ w [ [ w ]_h]]Data quantity n used with model modeling_h. The parameter polymerization equipment carries out safe polymerization operation on each model parameter to obtain a polymerization model parameter, and when parameter polymerization is carried out, the usable parameter polymerization formula is as follows:

wherein the content of the first and second substances,

is the parameter of polymerization model, sigma, obtained after safe polymerization]]For the received first model parameter, second model parameter, n_ghModeling data volumes for the first participant device and the second participant device. For example, 1 first participant device Guest, 2 second participant device Host, the first stage is modeled twice, the first modeling, the intersection data volume of Guest and Host1 is 1000, the second modeling, the intersection data volume of Guest and Host2 is 2000, then the total modeled data volume is 1000+2000 to 3000, corresponding

The calculation method comprises the following steps:

in some embodiments, when the number of the first participant devices is the same as the number of the second participant devices, the first participant devices and the second participant devices have a one-to-one correspondence relationship, and the weights for obtaining the corresponding model parameters are described, specifically, a third number of training samples provided by the first participant devices, a fourth number of training samples provided by the second participant devices, and a total number of training samples used for poisson regression model training are respectively obtained; determining a third ratio of the third number to the total number, the third ratio being the first weight; determining a fourth ratio of the fourth number to the total number, the fourth ratio being the second weight.

In some embodiments, when the number of the first participant devices is at least two, performing poisson regression model training with each second participant device, where each first participant device marks the second participant device performing model training with itself, and constructing a mapping relationship, specifically, when the number of the first participant devices is at least two, obtaining a mapping relationship between the first participant device and the second participant device, where the mapping relationship is used to indicate the second participant device performing poisson regression model training with each first participant device; constructing a model parameter group corresponding to each first participant device based on the mapping relation and the obtained at least two model parameter pairs; the model parameter group comprises a first number of model parameter pairs; the first number is the number of second participant devices performing poisson regression model training with the first parameter party device; performing parameter aggregation on the model parameter pairs in each model parameter group respectively to obtain initial aggregation parameters corresponding to each first sub-model and initial aggregation parameters corresponding to each second sub-model; and carrying out parameter aggregation on the initial aggregation parameters corresponding to the first submodels to obtain first aggregation model parameters, and carrying out parameter aggregation on the initial aggregation parameters corresponding to the second submodels to obtain second aggregation model parameters.

In some embodiments, if there is a pair of prestored model parameters in step 101, performing secure aggregation by combining the pair of prestored model parameters and the pair of model parameters that do not reach the convergence condition, specifically, performing model parameter aggregation on the model parameters of the first sub-model and the model parameters of the second sub-model in the pair of at least two model parameters, respectively, to obtain corresponding first aggregation model parameters and second aggregation model parameters, includes: and respectively carrying out model parameter aggregation on the stored model parameter pair, the model parameter of the first sub-model and the model parameter of the second sub-model in the model parameter pair of the Poisson regression model which does not reach the convergence condition, and obtaining corresponding first aggregation model parameters and second aggregation model parameters.

For example, referring to fig. 4A, there are 2 model parameter pairs in the graph, when a poisson regression model 1 trained by a training sample 1 and a second participant device reaches a convergence condition, the obtained model parameter pair 1 is pre-stored, a parameter aggregation operation is performed on the model parameter pair 2 uploaded during the poisson regression model 2 training which does not reach the convergence condition, so as to obtain a first aggregation model parameter and a second aggregation model parameter, the first participant updates a local model parameter according to the first aggregation model parameter, and the second participant 1 and the second participant 2 update the local model parameter according to the second aggregation model parameter.

In step 103, the first aggregation model parameter is sent to the first participant device, and the second aggregation model parameter is sent to each second participant device, so as to update the model parameter of the poisson regression model.

Here, the first aggregation model parameter is distributed to each first participant device, the second aggregation model parameter is distributed to each second participant device, and each first participant device and each second participant device acquire the relevant model parameter of the local submodel, decrypt the model parameter, and update the relevant model parameter of the local model.

In some embodiments, after the last security parameter aggregation is performed, a second notification message for stopping model training is also sent to each participant, specifically, a model convergence condition of the at least two poisson regression models is obtained; based on the model convergence condition, when determining that a Poisson regression model meeting the model convergence condition exists in the at least two Poisson regression models, sending a second notification message to second participant equipment corresponding to the Poisson regression model meeting the model convergence condition; wherein the second notification message is configured to notify the second participant device to stop training of the corresponding poisson regression model.

According to the method and the device, a federated learning system comprising a first participant device with a model training label, at least two second participant devices and a parameter aggregation device is built to support multi-party combined Poisson regression modeling, so that each first model parameter and each second model parameter are obtained, each first model parameter and each second model parameter are subjected to safety parameter aggregation through a safety aggregation technology, a first aggregation model parameter and a corresponding second aggregation model parameter are obtained, and the first participant device and each second participant device update the model parameters of respective local models based on the obtained first aggregation model parameter and the obtained second aggregation model parameter. In the longitudinal poisson regression model training process, local training data are not mutually exposed by all the participants, so that the data safety of all the participants is guaranteed, the application range of the poisson regression model in federal learning is expanded by the mode of safely aggregating model parameters corresponding to the multiparty poisson regression model, the model prediction accuracy is improved, the interaction times among the participants are greatly reduced, and the communication requirement is reduced.

In the following, an exemplary application of the embodiment of the present application in an actual application scenario will be described, where the embodiment is applied in a scenario where multiple parties jointly perform poisson regression model training in a federated learning system.

In a federal learning scenario, a parameter aggregation device is set as an Arbiter (Arbiter), a first participant device is also called a tag provider or a data application (Guest), and a second participant device is also called a feature provider or a data provider (Host). For convenience of calculation, characteristics of sample data in each Host party are consistent but contents of the sample data are inconsistent, and the Arbiter party is used as a coordinating party to coordinate joint modeling of each participant party. And both Host and Guest need to train and predict the Poisson regression model on the premise of not revealing labels and specific characteristic contents of each other.

In practical applications, there may be 1 scenario where tag providers (Guest) need to model with multiple feature complements (Host). For example, when an insurance enterprise predicts the future reimbursement times of a user, the result model is over-fitted easily because the insurance type is newer or the number of certain samples is small, if the model is only modeled with a single Host, and the model training is stopped in advance, so that the effect is not ideal in practical application.

Therefore, if a plurality of data providers Host (such as retail enterprises and banks) providing features are jointly modeled, namely a plurality of hosts with the same complementary features participate in modeling, the features can be supplemented, and the actual effect of the model is improved. However, the method is limited by the characteristics of Poisson modeling, single Poisson regression modeling in the related technology can only support single Host modeling and single Guest modeling, and the application range of common Poisson regression modeling is expanded through a safety aggregation multi-party model, so that multi-party simultaneous combined modeling is achieved.

The embodiment of the application provides a multi-label provider combined Poisson regression training method for integrating a plurality of longitudinal federated Poisson regression models through safe aggregation.

1 Guest party and J Host parties are laid, wherein J is an integer larger than 1, and the Guest parties have data samples X with the same characteristics and the same meaning labels_gkThe Host parties respectively have the same characteristic data sample X_hjAs can be seen from the figure, the process of the multiparty joint modeling provided by the embodiment of the present application is roughly divided into two stages:

the first stage is a poisson regression model training stage (1. model training), and 1 Guest and J hosts respectively carry out longitudinal modeling. Calculating a local model result by a data application (Guest), calculating an encryption gradient according to an encryption intermediate result received from a data provider (Host), and sending the encryption gradient to an Arbiter (Arbiter); the data provider calculates a local model result based on an encrypted intermediate result exchanged with the data application party, calculates an encryption gradient and sends the encryption gradient to the coordinator; and the coordinating party decrypts the encryption gradient of each party and distributes the encryption gradient to each party, and each party updates local model parameters to carry out model training.

And the second stage is a parameter aggregation stage (2. safety aggregation), wherein 1 Guest and J Host carry out encryption operation on the model parameters of the respective local model and then send the encrypted model parameters to the Arbiter, the Arbiter aggregates all the model parameters through a weighted average value to obtain aggregation model parameters, and the aggregation model parameters are sent to Guest and each Host so that each participant can obtain the model parameters corresponding to the respective local model from the aggregation model parameters and update the model parameters of the respective local model to the weighted average parameter value.

It can be understood that the embodiment of the present application is an extension of the original scheme of the common single Host participating in the trained poisson regression model, and the extension aims to allow multiple feature providers to participate in the longitudinal poisson regression modeling.

For a specific process of the first stage poisson regression model training, referring to fig. 7, fig. 7 is an optional schematic diagram of a training method of the poisson regression model provided in an embodiment of the present application, in the first stage longitudinal poisson regression model training, Guest performs longitudinal poisson regression modeling with J Host parties, where J ═ 2, 3, 4, … …, J }.

Precondition: guest and each Host party complete common sample screening through encrypted ID intersection, and only common samples are interacted in the following operation steps.

The Guest party and each Host party respectively carry out Poisson regression model training and execute the following operations:

in step 301, the Arbiter generates a public-private key pair and transmits a public key in the public-private key pair to Guest and Guest participating in the round of training modelHost_j。

In step 302, Guest determines the amount of training data in each round and sends the training data to the corresponding Host party participating in modeling.

In step 303, the Host and Guest parties respectively initialize the model parameters of the local model, and calculate the intermediate calculation result of the local model.

In step 304, the current Host side encrypts the intermediate calculation result of the local model and sends the intermediate calculation result to the Guest side.

Here, Host obtains the public key in step 401, and uses the public key to calculate the result w of the intermediate calculation of the local model^hjx^hjAnd exp (w)^hjx^hj) And encrypting to obtain an encrypted intermediate calculation result, and sending the encrypted intermediate calculation result to the corresponding Guest for model training.

In step 305, the Guest party calculates the encryption residual value according to the intermediate calculation result of the Host party and sends the encryption residual value to the corresponding Host party.

Here, the encryption method adopts multiplication homomorphic encryption, and the formula for calculating the encryption residual value is as follows:

in the above formula, the Host side is based on the local sample feature data x^hjTo calculate correspondences

w^hjx^hjAnd exp (w)^hjx^hj) And encrypting the result of the encryption to obtain [ exp (w)^hjx^hj)]]. Guest parties use respective feature data

To calculate correspondences

And

and calculated in conjunction with the tag data y

Where y is the actual tag value, and is the number of times a certain event has occurred within a certain indefinite time frame, such as the number of claims made within a year for a certain type of insurance. Residual value [ [ d ]]]Comprises w^hjx^hjAnd

therefore, the residual value cannot be calculated by either party alone, and requires the Host party and the Guest party to be calculated cooperatively.

In step 306, Guest and Host respectively calculate an encrypted local gradient by using the own characteristic data and the encrypted residual error and send the encrypted local gradient to the Arbiter.

Here, after [ [ d ] is set, the gradient of the loss function with respect to the training parameter can be expressed by the following formula:

in the above formula, L represents a loss function in model training, and derives a training parameter w to obtain a corresponding derivative, i.e., gradient information.

In practical implementation, the Arbiter party is mainly responsible for decrypting the encrypted model parameters sent by the Host party and the Guest party, and coordinating the training process and the evaluation process. To prevent the Arbiter party from learning the relevant information from the Host party or the Guest party, the Host party or the Guest party may add the respective corresponding model parameters (e.g., gradient information) to the encrypted random mask. Every iteration can not expose any information to the Arbiter party, because all information obtained by the Arbiter party only has a gradient which is hidden (namely processed by random masking), and randomness and confidentiality can be ensured.

In step 307, the Arbiter party decrypts all gradients by using the private key in the public and private key pair and sends the decrypted gradients to the Guest party and the Host party.

In step 308, the Host party and the Guest party update the model parameters of the local models respectively, and determine whether the local models reach the convergence condition.

Here, the determination as to whether the convergence condition is reached may be to determine whether the model converges according to a gradient norm change.

The iteration step 303 is repeated 308 until all data is modeled.

The iteration steps 302-308 are repeated until the convergence condition is achieved or the maximum number of model training rounds is reached.

Second stage parameter aggregation: after model training in the first stage, a trusted third party server (which may be an arbiter in the first training stage) obtains first model parameters and second model parameters corresponding to Guest and local models of the Host party respectively when the Guest and the local models of the Host party reach a convergence condition, wherein the first model parameters and the second model parameters are encrypted first model parameters and encrypted second model parameters which are subjected to encryption processing (such as adopting a sha256 encryption algorithm), and security parameter aggregation is performed on the encrypted first model parameters provided by the Guest party and the encrypted second model parameters provided by the Host party to obtain aggregation model parameters. The times of parameter aggregation can be preset, and in actual implementation, if the poisson regression model needs to be applied to a rapid modeling or preliminary modeling scene, the parameter aggregation can be directly performed only once, so that the interaction times among all the participants can be reduced, and the network communication overhead is reduced.

For the process of parameter aggregation in the second stage, referring to fig. 8, fig. 8 is an alternative schematic diagram of the training method of the poisson regression model provided in the embodiment of the present application.

In step 401, the Guest party and the Host party respectively send each encrypted model parameter and the data amount used for model modeling to the Arbiter party.

Here, the Guest party transmits the encrypted model parameter [ [ w ]_g]]Data quantity n used with model modeling_gThe Host side sends the encrypted model parameter [ w ]_h]]Data quantity n used with model modeling_h。

In step 402, the Arbiter performs a secure polymerization operation on each model parameter to obtain a polymerization model parameter, where the polymerization model parameter includes a first polymerization model parameter and a second polymerization model parameter.

Here, arbiters may be Guest itself.

In carrying out the parametric polymerization, the parametric polymerization formula that can be used is as follows:

the above formula can be understood as the model parameters for each run are directly multiplied by the amount of data involved in modeling and then divided by the total modeled data amount, where,

is the parameter of polymerization model, sigma, obtained after safe polymerization]]For the received first model parameter, second model parameter, n_ghThe modeled data volumes for Guest and Host.

Illustratively, 1 Guest, 2 hosts are modeled twice in the first stage, the first modeling, the amount of intersection data with Host1 is 1000, the second modeling, the amount of intersection data with Host2 is 2000, then the total modeled data amount is 1000+ 2000-3000, and the corresponding data amount is 3000

The calculation method comprises the following steps:

in step 403, Arbiter sends the first polymerization model parameter to the Guest party, and sends the second polymerization model parameter to the corresponding Host party.

In step 404, the Guest party decrypts the first aggregation model parameters and updates the model parameters of the local model according to the decrypted first aggregation model parameters; and the Host side decrypts the second aggregation model parameters and updates the model parameters of the local model according to the decrypted first aggregation model parameters.

Here, after receiving the aggregation model parameters, the Guest party and the Host party screen model parameters matching with their respective local models, and update the model parameters of their respective local models.

The Guest party and the Host party repeat the iteration step 401-404 until the Poisson regression model reaches the convergence condition or reaches the preset number of training rounds.

The embodiment of the application achieves participation of multiple data providers in training by providing the poisson regression model parameters which are integrated with multiple participants through safe aggregation, expands the application range of the poisson regression model in federal learning, allows multiple feature providers to participate in modeling at the same time, enhances the application range of the poisson regression model, and improves the accuracy and the applicability of model prediction.

Continuing with the exemplary structure of the poisson regression model training device 555 provided in the embodiments of the present application as a software module, in some embodiments, as shown in fig. 2, the software module stored in the poisson regression model training device 555 in the memory 540 may include:

an obtaining module 5551, configured to, by a parameter aggregation device, obtain at least two model parameter pairs corresponding to the poisson regression models, where the model parameter pairs and the poisson regression models are in a one-to-one correspondence relationship;

the aggregation module 5552 is configured to perform model parameter aggregation on a model parameter of a first sub-model and a model parameter of a second sub-model in the at least two model parameter pairs, respectively, to obtain a corresponding first aggregation model parameter and a corresponding second aggregation model parameter;

a training module 5553, configured to send the first aggregation model parameter to the first participant device, and send the second aggregation model parameter to each second participant device, so as to update the model parameter of the poisson regression model.

In some embodiments, the aggregating module 5552 is further configured to obtain a first weight of the model parameter of the first sub-model in each of the model parameter pairs and a second weight of the model parameter of the second sub-model in each of the model parameter pairs, respectively;

In some embodiments, the obtaining module 5551 is further configured to obtain a first number of training samples provided by the first participant device for performing model training with each of the second participant devices, a second number of training samples provided by each of the second participant devices, and a total number of training samples used for poisson regression model training;

In some embodiments, when the number of the first participant devices is the same as the number of the second participant devices, the first participant devices are in a one-to-one correspondence with the second participant devices; the obtaining module 5551 is further configured to obtain a third number of training samples provided by the first participant device, a fourth number of training samples provided by the corresponding second participant device, and a total number of training samples used for poisson regression model training, respectively;

In some embodiments, the obtaining module 5551 is further configured to enable the creating module to create a key pair, where the key pair includes a public key and a private key;

In some embodiments, the obtaining module 5551 is further configured to receive the model parameters of the encrypted first sub-model sent by the first participant device and the model parameters of the encrypted second sub-model sent by the at least two second participant devices;

In some embodiments, when the number of the first participant devices is at least two, the aggregating module 5552 is further configured to obtain a mapping relationship between the first participant device and the second participant device, where the mapping relationship is used to indicate the second participant devices performing poisson regression model training with the first participant devices;

In some embodiments, the obtaining module 5551 is further configured to receive a first notification message, where the first notification message is sent by a target second participant device in at least two second participant devices, and is used to indicate that a poisson regression model in which the target second participant device participates in training reaches a convergence condition;

In some embodiments, the training module 5553 is further configured to obtain a model convergence condition of the at least two poisson regression models;

It should be noted that the description of the apparatus in the embodiment of the present application is similar to the description of the method embodiment, and has similar beneficial effects to the method embodiment, and therefore, the description is not repeated.

The embodiment of the present application provides a computer program product, which includes a computer program, and is characterized in that when being executed by a processor, the computer program implements the method for training a poisson regression model provided in the embodiment of the present application.

Embodiments of the present application provide a computer-readable storage medium storing executable instructions, which when executed by a processor, will cause the processor to perform a method provided by embodiments of the present application, for example, a method for training a poisson regression model as shown in fig. 3.

In some embodiments, the computer-readable storage medium may be memory such as FRAM, ROM, PROM, EPROM, EEPROM, flash, magnetic surface memory, optical disk, or CD-ROM; or may be various devices including one or any combination of the above memories.

In some embodiments, executable instructions may be written in any form of programming language (including compiled or interpreted languages), in the form of programs, software modules, scripts or code, and may be deployed in any form, including as a stand-alone program or as a module, component, subroutine, or other unit suitable for use in a computing environment.

By way of example, executable instructions may correspond, but do not necessarily have to correspond, to files in a file system, and may be stored in a portion of a file that holds other programs or data, such as in one or more scripts in a hypertext Markup Language (HTML) document, in a single file dedicated to the program in question, or in multiple coordinated files (e.g., files that store one or more modules, sub-programs, or portions of code).

By way of example, executable instructions may be deployed to be executed on one computing device or on multiple computing devices at one site or distributed across multiple sites and interconnected by a communication network.

In conclusion, the number of feature providers in the poisson regression model in federal learning can be increased, the application range of the poisson regression model is expanded, and the model prediction capability is improved.

The above description is only an example of the present application, and is not intended to limit the scope of the present application. Any modification, equivalent replacement, and improvement made within the spirit and scope of the present application are included in the protection scope of the present application.

Claims

1. A method for training a Poisson regression model, which is applied to a federated learning system including at least one first participant device having a model training label, at least two second participant devices, and a parameter aggregation device, the method comprising:

2. The method of claim 1, wherein the performing model parameter aggregation on the model parameters of the first sub-model and the model parameters of the second sub-model in the at least two model parameter pairs to obtain corresponding first aggregation model parameters and second aggregation model parameters comprises:

respectively obtaining a first weight of a model parameter of a first submodel in each model parameter pair and a second weight of a model parameter of a second submodel in each model parameter pair;

3. The method of claim 2, wherein separately obtaining a first weight of a model parameter of a first sub-model in each of the pairs of model parameters and a second weight of a model parameter of a second sub-model in each of the pairs of model parameters comprises:

acquiring a first number of training samples provided by the first participant device for model training with the second participant devices, a second number of training samples provided by the second participant devices, and a total number of training samples for Poisson regression model training;

4. The method of claim 2, wherein separately obtaining a first weight of a model parameter of a first sub-model in each of the pairs of model parameters and a second weight of a model parameter of a second sub-model in each of the pairs of model parameters comprises:

when the number of the first participant devices is the same as the number of the second participant devices, the first participant devices and the second participant devices are in one-to-one correspondence;

respectively acquiring a third number of training samples provided by the first participant device, a fourth number of training samples provided by the corresponding second participant device, and a total number of training samples used for poisson regression model training;

5. The method according to claim 1, wherein before the parameter aggregation device obtains at least two corresponding pairs of model parameters of the poisson regression model, the method comprises:

creating a key pair, the key pair comprising a public key and a private key;

6. The method according to claim 5, wherein the obtaining, by the parameter aggregation device, at least two corresponding pairs of model parameters of the Poisson regression model comprises:

receiving the model parameters of the encrypted first sub-model sent by the first participant device and the model parameters of the encrypted second sub-model sent by the at least two second participant devices;

7. The method of claim 1, wherein the performing model parameter aggregation on the model parameters of the first sub-model and the model parameters of the second sub-model in the at least two model parameter pairs to obtain corresponding first aggregation model parameters and second aggregation model parameters comprises:

when the number of the first participant devices is at least two, obtaining a mapping relationship between the first participant device and the second participant device, where the mapping relationship is used to indicate the second participant devices performing poisson regression model training with the first participant devices;

and carrying out parameter aggregation on the initial aggregation parameters corresponding to the first submodels to obtain first aggregation model parameters, and carrying out parameter aggregation on the initial aggregation parameters corresponding to the second submodels to obtain second aggregation model parameters.

8. The method of claim 1, further comprising:

receiving a first notification message, wherein the first notification message is sent by a target second participant device in at least two second participant devices and is used for indicating that a Poisson regression model in which the target second participant device participates in training reaches a convergence condition;

correspondingly, the performing model parameter aggregation on the model parameter of the first sub-model and the model parameter of the second sub-model in the at least two model parameter pairs respectively to obtain a corresponding first aggregation model parameter and a corresponding second aggregation model parameter includes:

and respectively carrying out model parameter aggregation on the stored model parameter pair, the model parameter of the first sub-model and the model parameter of the second sub-model in the model parameter pair of the Poisson regression model which does not reach the convergence condition, and obtaining corresponding first aggregation model parameters and second aggregation model parameters.

9. The method of claim 1, wherein after sending the first aggregation model parameters to the first participant device and the second aggregation model parameters to each of the second participant devices, the method further comprises:

obtaining model convergence conditions of the at least two Poisson regression models;

10. A Poisson regression model training device applied to a federated learning system, wherein the system comprises a first participant device with a model training label, at least two second participant devices and a parameter aggregation device, the device comprises:

and the sending module is used for sending the first aggregation model parameters to the first participant equipment and sending the second aggregation model parameters to each second participant equipment so as to update the model parameters of the Poisson regression model.

11. An electronic device, comprising:

a memory for storing executable instructions;

a processor for implementing the method of training a poisson regression model as claimed in any one of claims 1 to 9 when executing executable instructions stored in the memory.

12. A computer-readable storage medium storing executable instructions for implementing the method of training a poisson regression model according to any one of claims 1 to 9 when executed by a processor.

13. A computer program product comprising a computer program, characterized in that the computer program, when being executed by a processor, implements the method of training a poisson regression model according to any one of claims 1 to 9.