CN115935189A

CN115935189A - Method, device, system and equipment for training trans-city federal migration model

Info

Publication number: CN115935189A
Application number: CN202211677031.1A
Authority: CN
Inventors: 陈高德; 张钧波; 苏义军; 郑宇�
Original assignee: Jingdong City Beijing Digital Technology Co Ltd
Current assignee: Jingdong City Beijing Digital Technology Co Ltd
Priority date: 2022-12-26
Filing date: 2022-12-26
Publication date: 2023-04-07

Abstract

The invention discloses a method, a device, a system and equipment for training a trans-city federal migration model, which relate to the technical field of intelligent cities, and the method comprises the following steps: determining prediction index data and prediction city labels based on an initial cross-city federal migration model and training index data in an obtained training index data set; adjusting model parameters of the initial cross-city federal migration model based on standard index data corresponding to the prediction index data and standard city labels corresponding to the prediction city labels to obtain a trained target cross-city federal migration model; and sending the model parameters of the target cross-city federal migration model to the target city client, so that the target city client builds a standard cross-city federal migration model based on the received model parameters, and determines a target city label based on the standard cross-city federal migration model. The embodiment of the invention solves the problem that the target city cannot finish the city portrait task due to the missing of index data.

Description

Training method, device, system and equipment for cross-city federated migration model

技术领域technical field

本发明实施例涉及智能城市技术领域，尤其涉及一种跨城市联邦迁移模型的训练方法、装置、系统及设备。The embodiments of the present invention relate to the technical field of smart cities, and in particular to a training method, device, system and equipment for a cross-city federation migration model.

背景技术Background technique

大数据与城市发展相结合是未来城市的重要发展方向，城市画像任务是指基于城市发展过程中产生的指标数据(如人口数据、交通数据等)，预测可用于描述城市运行状态的城市标签(如商业流行度指标、消费指数等)，以指导城市的未来发展方向。The combination of big data and urban development is an important development direction of cities in the future. The city portrait task refers to predicting city labels (such as population data, traffic data, etc.) Such as commercial popularity index, consumption index, etc.) to guide the future development direction of the city.

训练神经网络模型被作为预测城市标签的有效方法，但由于不同城市的发展水平和发展方向不同，有些城市可能会存在指标数据的数据量较少的情况，使其不具备训练神经网络模型的条件。为解决该问题，现有采用的跨城市迁移方法是在数据资源丰富的源城市训练得到跨城市联邦迁移模型，并将跨城市联邦迁移模型迁移到数据资源匮乏的目标城市中，以辅助目标城市完成城市画像任务。Training neural network models is an effective way to predict city labels. However, due to the different development levels and directions of different cities, some cities may have a small amount of indicator data, making them unsuitable for training neural network models. . To solve this problem, the existing cross-city migration method is to train the cross-city federated migration model in the source city with rich data resources, and migrate the cross-city federated migration model to the target city with scarce data resources to assist the target city. Complete the city portrait mission.

在实现本发明的过程中，发现现有技术中至少存在以下技术问题：In the process of realizing the present invention, it is found that there are at least the following technical problems in the prior art:

在实际城市场景中，目标城市不仅会存在缺失城市标签的情况，还可能会存在缺失某种类型的指标数据的情况。因此，迁移到目标城市中的跨城市联邦迁移模型无法在同时缺失城市标签和缺失指标数据的情况下完成城市画像任务。In actual urban scenarios, not only will there be missing city labels in the target city, but some types of indicator data may also be missing. Therefore, the cross-city federated migration model migrating to the target city cannot complete the city portrait task in the absence of both city labels and missing indicator data.

发明内容Contents of the invention

本发明实施例提供了一种跨城市联邦迁移模型的训练方法、装置、系统及设备，以解决目标城市由于缺失指标数据导致无法完成城市画像任务的问题，扩宽了跨城市迁移方法的应用场景。Embodiments of the present invention provide a training method, device, system, and equipment for a cross-city federated migration model to solve the problem that the target city cannot complete the city portrait task due to lack of index data, and broaden the application scenarios of the cross-city migration method .

根据本发明一个实施例提供了一种跨城市联邦迁移模型的训练方法，应用于源城市客户端，该方法包括：According to an embodiment of the present invention, a training method for a cross-city federation migration model is provided, which is applied to a source city client, and the method includes:

响应于获取到训练指标数据集，基于初始跨城市联邦迁移模型和所述训练指标数据集中的训练指标数据，确定预测指标数据和预测城市标签；In response to obtaining the training indicator data set, based on the initial cross-city federation transfer model and the training indicator data in the training indicator data set, determine the predictive indicator data and the predicted city label;

基于与所述预测指标数据对应的标准指标数据以及与所述预测城市标签对应的标准城市标签，对所述初始跨城市联邦迁移模型的模型参数进行调整，得到训练完成的目标跨城市联邦迁移模型；Based on the standard index data corresponding to the predictive index data and the standard city label corresponding to the predicted city label, adjust the model parameters of the initial cross-city federated migration model to obtain a trained target cross-city federated migration model ;

将所述目标跨城市联邦迁移模型的模型参数发送给目标城市客户端，以使所述目标城市客户端基于接收到的模型参数构建标准跨城市联邦迁移模型，并基于所述标准跨城市联邦迁移模型确定目标城市标签。Send the model parameters of the target cross-city federation migration model to the target city client, so that the target city client builds a standard cross-city federation migration model based on the received model parameters, and migrates across cities based on the standard The model determines the target city labels.

根据本发明另一个实施例提供了一种跨城市联邦迁移模型的训练方法，应用于目标城市客户端，该方法包括：According to another embodiment of the present invention, a training method of a cross-city federation migration model is provided, which is applied to a target city client, and the method includes:

接收源城市客户端发送的训练完成的目标跨城市联邦迁移模型的模型参数，并基于所述模型参数，构建标准跨城市联邦迁移模型；Receive the model parameters of the trained target cross-city federation migration model sent by the source city client, and build a standard cross-city federation migration model based on the model parameters;

将测试指标数据输入到所述标准跨城市联邦迁移模型中，得到输出的目标城市标签；Input the test indicator data into the standard cross-city federation migration model to obtain the output target city label;

其中，所述目标跨城市联邦迁移模型是所述源城市客户端基于初始跨城市联邦迁移模型和训练指标数据集中的训练指标数据，确定预测指标数据和预测城市标签，以及基于与所述预测指标数据对应的标准指标数据以及与所述预测城市标签对应的标准城市标签对初始跨城市联邦迁移模型执行训练操作得到的。Wherein, the target cross-city federated migration model is that the source city client determines the predictive index data and the predicted city label based on the initial cross-city federated migration model and the training index data in the training index data set, and based on the correlation with the predictive index The standard index data corresponding to the data and the standard city label corresponding to the predicted city label are obtained by performing a training operation on the initial cross-city federation transfer model.

根据本发明另一个实施例提供了一种跨城市联邦迁移模型的训练装置，应用于源城市客户端，该装置包括：According to another embodiment of the present invention, a training device for a cross-city federation migration model is provided, which is applied to a source city client, and the device includes:

预测城市标签确定模块，用于响应于获取到训练指标数据集，基于初始跨城市联邦迁移模型和所述训练指标数据集中的训练指标数据，确定预测指标数据和预测城市标签；A predictive city label determination module, configured to determine predictive index data and predictive city labels based on the initial cross-city federation transfer model and the training index data in the training index data set in response to obtaining the training index data set;

目标跨城市联邦迁移模型确定模块，用于基于与所述预测指标数据对应的标准指标数据以及与所述预测城市标签对应的标准城市标签，对所述初始跨城市联邦迁移模型的模型参数进行调整，得到训练完成的目标跨城市联邦迁移模型；A target inter-city federated migration model determination module, configured to adjust the model parameters of the initial inter-city federated migration model based on the standard indicator data corresponding to the predicted indicator data and the standard city label corresponding to the predicted city label , to obtain the trained target inter-city federation migration model;

模型参数发送模块，用于将所述目标跨城市联邦迁移模型的模型参数发送给目标城市客户端，以使所述目标城市客户端基于接收到的模型参数构建标准跨城市联邦迁移模型，并基于所述标准跨城市联邦迁移模型确定目标城市标签。The model parameter sending module is used to send the model parameters of the target cross-city federation migration model to the target city client, so that the target city client builds a standard cross-city federation migration model based on the received model parameters, and based on The standard cross-city federated migration model determines the destination city label.

根据本发明另一个实施例提供了一种跨城市联邦迁移模型的训练装置，应用于目标城市客户端，该装置包括：According to another embodiment of the present invention, a training device for a cross-city federation migration model is provided, which is applied to a target city client, and the device includes:

标准跨城市联邦迁移模型构建模块，用于接收源城市客户端发送的训练完成的目标跨城市联邦迁移模型的模型参数，并基于所述模型参数，构建标准跨城市联邦迁移模型；A standard cross-city federated migration model building module, used to receive the model parameters of the trained target cross-city federated migration model sent by the source city client, and build a standard cross-city federated migration model based on the model parameters;

目标城市标签确定模块，用于将测试指标数据输入到所述标准跨城市联邦迁移模型中，得到输出的目标城市标签；The target city label determination module is used to input the test indicator data into the standard cross-city federation migration model to obtain the output target city label;

根据本发明另一个实施例提供了一种跨城市联邦迁移模型的训练系统，该系统包括：源城市客户端和目标城市客户端；According to another embodiment of the present invention, a training system for cross-city federation migration model is provided, the system includes: a source city client and a target city client;

其中，所述源城市客户端，用于响应于获取到训练指标数据集，基于初始跨城市联邦迁移模型和所述训练指标数据集中的训练指标数据，确定预测指标数据和预测城市标签，以及基于与所述预测指标数据对应的标准指标数据以及与所述预测城市标签对应的标准城市标签，对所述初始跨城市联邦迁移模型的模型参数进行调整，得到训练完成的目标跨城市联邦迁移模型，并将所述目标跨城市联邦迁移模型的模型参数发送给目标城市客户端；Wherein, the source city client is configured to determine predictive index data and predicted city labels based on the initial cross-city federation transfer model and the training index data in the training index data set in response to obtaining the training index data set, and based on The standard index data corresponding to the predictive index data and the standard city label corresponding to the predicted city label are adjusted to the model parameters of the initial cross-city federated migration model to obtain a trained target cross-city federated migration model, and send the model parameters of the target cross-city federation migration model to the target city client;

所述目标城市客户端，用于基于接收到的模型参数，构建标准跨城市联邦迁移模型，并基于所述标准跨城市联邦迁移模型，确定目标城市标签。The target city client is configured to construct a standard cross-city federated migration model based on the received model parameters, and determine a target city label based on the standard cross-city federated migration model.

根据本发明另一个实施例提供了一种终端设备，所述终端设备包括：According to another embodiment of the present invention, a terminal device is provided, and the terminal device includes:

至少一个处理器；以及at least one processor; and

与所述至少一个处理器通信连接的存储器；其中，a memory communicatively coupled to the at least one processor; wherein,

所述存储器存储有可被所述至少一个处理器执行的计算机程序，所述计算机程序被所述至少一个处理器执行，以使所述至少一个处理器能够执行本发明任一实施例所述的跨城市联邦迁移模型的训练方法。The memory stores a computer program that can be executed by the at least one processor, and the computer program is executed by the at least one processor, so that the at least one processor can execute the method described in any embodiment of the present invention. A training method for federated transfer models across cities.

根据本发明另一个实施例，提供了一种计算机可读存储介质，所述计算机可读存储介质存储有计算机指令，所述计算机指令用于使处理器执行时实现本发明任一实施例所述的跨城市联邦迁移模型的训练方法。According to another embodiment of the present invention, a computer-readable storage medium is provided, the computer-readable storage medium stores computer instructions, and the computer instructions are used to enable a processor to implement any of the embodiments of the present invention when executed. A training method for cross-city federated transfer models.

本发明实施例的技术方案，通过响应于获取到训练指标数据集，基于初始跨城市联邦迁移模型和训练指标数据集中的训练指标数据，确定预测指标数据和预测城市标签，基于与预测指标数据对应的标准指标数据以及与预测城市标签对应的标准城市标签，对初始跨城市联邦迁移模型的模型参数进行调整，得到训练完成的目标跨城市联邦迁移模型，使得训练得到的目标跨城市联邦迁移模型同时具备预测指标数据和预测城市标签的能力，解决了目标城市由于缺失指标数据导致无法完成城市画像任务的问题，扩宽了跨城市迁移方法的应用场景。In the technical solution of the embodiment of the present invention, by responding to the acquisition of the training index data set, based on the initial cross-city federation transfer model and the training index data in the training index data set, the prediction index data and the prediction city label are determined, based on the corresponding prediction index data The standard index data and the standard city labels corresponding to the predicted city labels are used to adjust the model parameters of the initial cross-city federated migration model to obtain the trained target cross-city federated migration model, so that the trained target cross-city federated migration model can be simultaneously With the ability to predict index data and predict city labels, it solves the problem that the target city cannot complete the city portrait task due to lack of index data, and broadens the application scenarios of the cross-city migration method.

应当理解，本部分所描述的内容并非旨在标识本发明的实施例的关键或重要特征，也不用于限制本发明的范围。本发明的其它特征将通过以下的说明书而变得容易理解。It should be understood that the content described in this section is not intended to identify key or important features of the embodiments of the present invention, nor is it intended to limit the scope of the present invention. Other features of the present invention will be easily understood from the following description.

附图说明Description of drawings

为了更清楚地说明本发明实施例中的技术方案，下面将对实施例描述中所需要使用的附图作简单地介绍，显而易见地，下面描述中的附图仅仅是本发明的一些实施例，对于本领域普通技术人员来讲，在不付出创造性劳动的前提下，还可以根据这些附图获得其他的附图。In order to more clearly illustrate the technical solutions in the embodiments of the present invention, the drawings that need to be used in the description of the embodiments will be briefly introduced below. Obviously, the drawings in the following description are only some embodiments of the present invention. For those skilled in the art, other drawings can also be obtained based on these drawings without creative effort.

图1为本发明一个实施例所提供的一种跨城市联邦迁移模型的训练方法的流程图；Fig. 1 is the flowchart of the training method of a kind of cross-city federation migration model provided by one embodiment of the present invention;

图2为本发明一个实施例所提供的一种不同城市的不同指标数据之间的关系图；Fig. 2 is a relationship diagram between different index data of different cities provided by an embodiment of the present invention;

图3为本发明一个实施例所提供的一种跨城市联邦迁移模型的训练方法的具体实例的流程图；Fig. 3 is the flowchart of the concrete example of the training method of a kind of cross-city federation migration model provided by one embodiment of the present invention;

图4为本发明一个实施例所提供的另一种跨城市联邦迁移模型的训练方法的流程图；FIG. 4 is a flow chart of another training method for a cross-city federation migration model provided by an embodiment of the present invention;

图5为本发明一个实施例所提供的一种跨城市联邦迁移模型的训练方法的流程图；FIG. 5 is a flowchart of a training method for a cross-city federation migration model provided by an embodiment of the present invention;

图6为本发明一个实施例所提供的一种跨城市联邦迁移模型的训练装置的结构示意图；FIG. 6 is a schematic structural diagram of a training device for a cross-city federation migration model provided by an embodiment of the present invention;

图7为本发明一个实施例所提供的一种跨城市联邦迁移模型的训练装置的结构示意图；FIG. 7 is a schematic structural diagram of a training device for a cross-city federation migration model provided by an embodiment of the present invention;

图8为本发明一个实施例所提供的一种跨城市联邦迁移模型的训练系统的结构示意图；FIG. 8 is a schematic structural diagram of a training system for a cross-city federation migration model provided by an embodiment of the present invention;

图9为本发明一个实施例所提供的一种跨城市联邦迁移模型的训练系统的系统原理图；FIG. 9 is a system schematic diagram of a training system for a cross-city federation migration model provided by an embodiment of the present invention;

图10为本发明一个实施例所提供的一种终端设备的结构示意图。Fig. 10 is a schematic structural diagram of a terminal device provided by an embodiment of the present invention.

具体实施方式Detailed ways

为了使本技术领域的人员更好地理解本发明方案，下面将结合本发明实施例中的附图，对本发明实施例中的技术方案进行清楚、完整地描述，显然，所描述的实施例仅仅是本发明一部分的实施例，而不是全部的实施例。基于本发明中的实施例，本领域普通技术人员在没有做出创造性劳动前提下所获得的所有其他实施例，都应当属于本发明保护的范围。In order to enable those skilled in the art to better understand the solutions of the present invention, the following will clearly and completely describe the technical solutions in the embodiments of the present invention in conjunction with the drawings in the embodiments of the present invention. Obviously, the described embodiments are only It is an embodiment of a part of the present invention, but not all embodiments. Based on the embodiments of the present invention, all other embodiments obtained by persons of ordinary skill in the art without making creative efforts shall fall within the protection scope of the present invention.

需要说明的是，本发明的说明书和权利要求书及上述附图中的术语“第一”、“第二”等是用于区别类似的对象，而不必用于描述特定的顺序或先后次序。应该理解这样使用的数据在适当情况下可以互换，以便这里描述的本发明的实施例能够以除了在这里图示或描述的那些以外的顺序实施。此外，术语“包括”和“具有”以及他们的任何变形，意图在于覆盖不排他的包含，例如，包含了一系列步骤或单元的过程、方法、系统、产品或设备不必限于清楚地列出的那些步骤或单元，而是可包括没有清楚地列出的或对于这些过程、方法、产品或设备固有的其它步骤或单元。It should be noted that the terms "first" and "second" in the description and claims of the present invention and the above drawings are used to distinguish similar objects, but not necessarily used to describe a specific sequence or sequence. It is to be understood that the data so used are interchangeable under appropriate circumstances such that the embodiments of the invention described herein can be practiced in sequences other than those illustrated or described herein. Furthermore, the terms "comprising" and "having", as well as any variations thereof, are intended to cover a non-exclusive inclusion, for example, a process, method, system, product or device comprising a sequence of steps or elements is not necessarily limited to the expressly listed instead, may include other steps or elements not explicitly listed or inherent to the process, method, product or apparatus.

图1为本发明一个实施例所提供的一种跨城市联邦迁移模型的训练方法的流程图，本实施例可适用于对缺失指标数据的目标城市执行城市画像任务的情况，该方法可以由跨城市联邦迁移模型的训练装置来执行，该跨城市联邦迁移模型的训练装置可以采用硬件和/或软件的形式实现，该跨城市联邦迁移模型的训练装置可配置于源城市客户端中。其中，源城市客户端可以表征与指标数据资源丰富的城市对应的客户端。如图1所示，该方法包括：Fig. 1 is a flow chart of a training method for a cross-city federated migration model provided by an embodiment of the present invention. This embodiment is applicable to the case of performing a city portrait task for a target city with missing index data. The training device of the city federation migration model can be implemented in the form of hardware and/or software, and the training device of the cross-city federation migration model can be configured in the source city client. Wherein, the source city client may represent a client corresponding to a city with abundant index data resources. As shown in Figure 1, the method includes:

S110、响应于获取到训练指标数据集，基于初始跨城市联邦迁移模型和训练指标数据集中的训练指标数据，确定预测指标数据和预测城市标签。S110. In response to obtaining the training index data set, determine predictive index data and predicted city labels based on the initial cross-city federation transfer model and the training index data in the training index data set.

其中，示例性的，训练指标数据可用于表征源城市在发展过程中产生的数据，如训练指标数据可以是人口数据、交通数据、税务数据等等。Wherein, for example, the training index data can be used to represent the data generated in the development process of the source city, for example, the training index data can be population data, traffic data, tax data and so on.

其中，示例性的，预测城市标签可以为消费指数、交通便捷程度和空气质量指数等等。其中，消费指数可以为后续的商圈规划提供参考依据，交通便捷程度，可以为后续的交通调度提供参考依据，空气质量指数可以为后续的环境治理方案的制定和实施提供参考依据。Wherein, for example, the predicted city labels may be consumption index, traffic convenience degree, air quality index and so on. Among them, the consumption index can provide a reference basis for subsequent business district planning, the degree of traffic convenience can provide a reference basis for subsequent traffic scheduling, and the air quality index can provide a reference basis for the formulation and implementation of subsequent environmental governance plans.

在一个实施例中，具体的，训练指标数据包括兴趣点区域数据、道路网络数据和人口数据，其中，兴趣点区域数据包括与至少一个兴趣点类型分别对应的兴趣点数量、兴趣点总数量和兴趣点熵值中至少一种，道路网络数据包括与至少一种道路类型分别对应的道路数量，人口数据包括工作人口数量和/或居住人口数量，相应的，预测指标数据为消费人口数量，预测城市标签为商业流行度的类别。In one embodiment, specifically, the training indicator data includes point-of-interest area data, road network data, and population data, wherein the point-of-interest area data includes the number of points of interest, the total number of points of interest, and the total number of points of interest corresponding to at least one type of point of interest. At least one of the entropy values of the points of interest, the road network data includes the number of roads corresponding to at least one road type, the population data includes the number of working population and/or the number of living population, correspondingly, the predictive index data is the number of consumption population, and the predicted City labels are categories for commercial popularity.

其中，具体的，兴趣点区域数据可以用于描述城市中不同区域的功能或属性，特别是，兴趣点(Point of Interest，POI)的数量和类型可以反映一个区域的受欢迎程度。其中，示例性的，兴趣点类型包括但不限于餐饮服务、风景名胜、公共设施、购物、交通设施服务、金融保险服务、科教文化服务、商务住宅、生活服务、体育服务、医疗保健服务、政府机构及社会团体、住宿服务和绿地公园等等，此处对数据点类型不作限定。Among them, specifically, POI area data can be used to describe the functions or attributes of different areas in the city, especially, the number and type of POI (Point of Interest, POI) can reflect the popularity of an area. Among them, for example, the types of POIs include but are not limited to catering services, scenic spots, public facilities, shopping, transportation facilities services, financial insurance services, science, education and cultural services, business residences, life services, sports services, healthcare services, government Institutions and social groups, accommodation services and green parks, etc., there is no limitation on the types of data points here.

在一个实施例中，具体的，兴趣点区域数据包括各兴趣点类型分别对应的兴趣点数量d^pf、兴趣点总数量d^pn和兴趣点熵值d^pe，其中，兴趣点熵值d^pe满足公式：In one embodiment, specifically, the point-of-interest area data includes the number of points of interest d ^pf corresponding to each type of point of interest, the total number of points of interest d ^pn and the entropy value of points of interest d ^pe , wherein the point-of-interest entropy value d ^pe satisfies formula:

其中，

表示第i中兴趣点类型对应的兴趣点数量。其中，兴趣点熵值d^pe可反映一个区域的功能多样性。in,

Indicates the number of interest points corresponding to the type of interest point in i. Among them, the interest point entropy value d ^pe can reflect the functional diversity of a region.

其中，具体的，道路网络数据可以反映城市的交通便捷程度。其中，示例性的，道路类型包括但不限于高速公路、快速路、生活街道、林业道路和竞赛道路等等，至少一种道路类型分别对应的道路数量可以用d^rf表示。Among them, concretely, the road network data can reflect the traffic convenience of the city. Wherein, for example, the road types include but not limited to expressways, expressways, living streets, forestry roads, competition roads, etc., and the number of roads corresponding to at least one road type can be represented by d ^rf .

其中，具体的，人口数据可以反映城市中不同区域的不同人口类型的分布，示例性的，人群类型包括但不限于工作人口、居住人口和消费人口等等。在一个实施例中，具体的，人口数据包括工作人口数量d^wp和居住人口数量d^rp。Specifically, the population data may reflect the distribution of different population types in different regions of the city. Exemplarily, the population types include but are not limited to working population, resident population, and consumption population. In one embodiment, specifically, the population data includes the number of working population d ^wp and the number of resident population d ^rp .

在一个实施例中，具体的，基于初始跨城市联邦迁移模型和训练指标数据集中的训练指标数据，确定预测指标数据和预测城市标签，包括：将训练指标数据集中的训练指标数据分别输入到初始跨城市联邦迁移模型中的初始数据生成模块和初始标签生成模块中；通过初始数据生成模块，基于输入的训练指标数据，输出预测指标数据；通过初始标签生成模块，基于输入的训练指标数据以及初始数据生成模块输出的预测指标数据，输出预测城市标签。In one embodiment, specifically, based on the initial cross-city federation migration model and the training indicator data in the training indicator dataset, determining the predictive indicator data and the predicted city label includes: inputting the training indicator data in the training indicator dataset into the initial In the initial data generation module and initial label generation module in the cross-city federated migration model; through the initial data generation module, based on the input training indicator data, output prediction indicator data; through the initial label generation module, based on the input training indicator data and the initial The predictive indicator data output by the data generation module outputs the predicted city label.

图2为本发明一个实施例所提供的一种不同城市的不同指标数据之间的关系图。具体的，图2针对4个不同城市(分别为城市A、城市B、城市C和城市D)，反映出了归一化后的POI熵值与归一化后的消费人口数量之间的关系。从图2可以得到：不同指标数据之间的关系知识在不同城市之间存在相似性。由于不同城市的发展水平或发展方向不同，导致不同城市产生的同一类型的指标数据之间通常是存在比较大的差异的。但不同城市的不同类型的指标数据之间的关系知识却存在相似性(如图2)，因此，基于源城市的训练指标数据训练初始跨城市联邦迁移模型中的初始数据生成模块，可以使得初始数据生成模块学习到不同指标数据之间的关系知识，进而可以保证初始数据生成模块的可训练性以及输出的预测指标数据的准确度。FIG. 2 is a relationship diagram between different index data of different cities provided by an embodiment of the present invention. Specifically, Figure 2 reflects the relationship between the normalized POI entropy value and the normalized consumer population for four different cities (city A, city B, city C, and city D respectively). . From Figure 2, it can be concluded that the relationship knowledge between different index data is similar among different cities. Due to the different development levels or development directions of different cities, there are usually relatively large differences between the same type of indicator data produced by different cities. However, there are similarities in relational knowledge between different types of index data in different cities (as shown in Figure 2). Therefore, training the initial data generation module in the initial cross-city federation transfer model based on the training index data of the source city can make the initial The data generation module learns the relationship knowledge between different index data, which can ensure the trainability of the initial data generation module and the accuracy of the output prediction index data.

在一个实施例中，具体的，初始数据生成模块包括特征提取器、数据回归器和域分类器，相应的，基于输入的训练指标数据，输出预测指标数据，包括：将训练指标数据输入到特征提取器中，得到输出的训练指标特征；通过数据回归器，基于特征提取器输出的训练指标特征，输出预测指标数据，以及通过域分类器，基于特征提取器输出的训练指标特征，输出预测城市分类结果。In one embodiment, specifically, the initial data generation module includes a feature extractor, a data regressor, and a domain classifier. Correspondingly, based on the input training indicator data, the output prediction indicator data includes: inputting the training indicator data to the feature In the extractor, the output training indicator features are obtained; through the data regressor, based on the training indicator features output by the feature extractor, output prediction indicator data, and through the domain classifier, based on the training indicator features output by the feature extractor, output the predicted city classification results.

其中，具体的，特征提取器可学习提取训练指标数据与预测指标数据之间的关系特征。示例性的，特征提取器可包含两个全连接层。此处对特征提取器的网络架构不作限定。Specifically, the feature extractor can learn to extract the relationship features between the training index data and the prediction index data. Exemplarily, the feature extractor may contain two fully connected layers. The network architecture of the feature extractor is not limited here.

在一个实施例中，具体的，预测指标数据为消费人口数量。具体的，数据回归器数据输出的预测指标数据为预测得到的消费人口数量。其中，示例性的，数据回归器可包含两个全连接层和一个回归层。此处对特征提取器的网络架构不作限定。In one embodiment, specifically, the predictive indicator data is the consumption population. Specifically, the predictive index data output by the data regressor is the predicted consumption population. Wherein, for example, the data regressor may include two fully connected layers and one regression layer. The network architecture of the feature extractor is not limited here.

其中，具体的，仅使用特征提取器和数据回归器可能会存在域漂移的问题，为了解决该问题，本实施例中的初始数据生成模块包括域分类器。其中，示例性的，域分类器可包含两个全连接层和一个分类层。此处对域分类器的网络架构不作限定。Specifically, only using the feature extractor and the data regressor may cause domain drift. To solve this problem, the initial data generation module in this embodiment includes a domain classifier. Wherein, for example, the domain classifier may include two fully connected layers and one classification layer. The network architecture of the domain classifier is not limited here.

其中，具体的，预测城市分类结果可用于表征训练特征数据来自哪个源城市客户端。Specifically, the predicted city classification result can be used to represent which source city client the training feature data comes from.

其中，具体的，准确的商业流行度的类别预测可为商圈规划方案的制定和实施提供有力的数据支撑。示例性的，商业流行度的类别对应的标签数量可以为5个，分别为非常高、高、中等、低和非常低。Among them, specific and accurate category predictions of commercial popularity can provide strong data support for the formulation and implementation of commercial district planning schemes. Exemplarily, the number of tags corresponding to the category of commercial popularity may be 5, which are respectively very high, high, medium, low and very low.

S120、基于与预测指标数据对应的标准指标数据以及与预测城市标签对应的标准城市标签，对初始跨城市联邦迁移模型的模型参数进行调整，得到训练完成的目标跨城市联邦迁移模型。S120. Based on the standard index data corresponding to the predictive index data and the standard city labels corresponding to the predicted city labels, adjust the model parameters of the initial inter-city federated transfer model to obtain a trained target inter-city federated transfer model.

在本实施例中，具体的，训练指标数据集中包含与至少两个源城市客户端分别对应的训练指标数据，相应的，基于与预测指标数据对应的标准指标数据以及与预测城市标签对应的标准城市标签，对初始跨城市联邦迁移模型的模型参数进行调整，得到训练完成的目标跨城市联邦迁移模型，包括：针对初始跨城市联邦迁移模型的每次迭代过程，基于与预测指标数据对应的标准指标数据以及与预测城市标签对应的标准城市标签，确定目标损失函数；基于目标损失函数，确定当前迭代的初始跨城市联邦迁移模型对应的当前模型参数，直到目标损失函数收敛时，将当前迭代的初始跨城市联邦迁移模型作为训练完成的目标跨城市联邦迁移模型。In this embodiment, specifically, the training index data set contains training index data corresponding to at least two source city clients respectively, and correspondingly, based on the standard index data corresponding to the predictive index data and the standard index data corresponding to the predicted city label City label, adjust the model parameters of the initial cross-city federated migration model to obtain the trained target cross-city federated migration model, including: for each iteration process of the initial cross-city federated migration model, based on the standards corresponding to the predictive index data The index data and the standard city labels corresponding to the predicted city labels determine the target loss function; based on the target loss function, determine the current model parameters corresponding to the initial cross-city federation migration model of the current iteration until the target loss function converges. The initial cross-city federated migration model is used as the target cross-city federated migration model after training.

其中，具体的，获取至少两个源城市客户端分别对应的训练指标数据，将至少两个训练指标数据分别输入到初始跨城市联邦迁移模型中，得到输出的预测指标数据和预测城市标签，基于与预测指标数据对应的标准指标数据以及与预测城市标签对应的标准城市标签，确定目标损失函数，以及基于目标损失函数，确定当前迭代的初始跨城市联邦迁移模型对应的当前模型参数，直到目标损失函数收敛时，将当前迭代的初始跨城市联邦迁移模型作为训练完成的目标跨城市联邦迁移模型。其中，至少两个源城市客户端中包括当前源城市客户端。Among them, specifically, the training index data corresponding to at least two source city clients are obtained, and at least two training index data are respectively input into the initial cross-city federation transfer model to obtain the output predictive index data and predicted city labels, based on The standard indicator data corresponding to the predicted indicator data and the standard city label corresponding to the predicted city label determine the target loss function, and based on the target loss function, determine the current model parameters corresponding to the initial cross-city federated migration model of the current iteration until the target loss When the function converges, the initial cross-city federated migration model of the current iteration is used as the target cross-city federated migration model after training. Wherein, at least two source city clients include the current source city client.

需要说明的是，本发明实施例中所涉及的源城市的训练指标数据的获取、存储和应用等，均符合相关法律法规的规定。It should be noted that the acquisition, storage, and application of the training index data of the source cities involved in the embodiments of the present invention all comply with relevant laws and regulations.

S130、将目标跨城市联邦迁移模型的模型参数发送给目标城市客户端，以使目标城市客户端基于接收到的模型参数构建标准跨城市联邦迁移模型，并基于标准跨城市联邦迁移模型确定目标城市标签。S130. Send the model parameters of the target cross-city federation migration model to the target city client, so that the target city client builds a standard cross-city federation migration model based on the received model parameters, and determines the target city based on the standard cross-city federation migration model Label.

其中，具体的，目标城市客户端可基于接收到的模型参数以及初始跨城市联邦迁移模型的模型架构，构建标准跨城市联邦迁移模型，将与训练指标数据对应的测试指标数据输入到标准跨城市联邦迁移模型中，标准跨城市联邦迁移模型中的标准数据生成模块基于输入的测试指标数据输出缺失指标数据，标准跨城市联邦迁移模型中的标准标签模块基于输入的测试指标数据和标准数据生成模块输出的缺失指标数据，输出目标城市标签。其中，测试指标数据可以是目标城市中已有的与训练指标数据的指标类型相同的数据，如测试指标数据包括目标城市的兴趣点区域数据、道路网络数据和人口数据。Specifically, the target city client can build a standard cross-city federated migration model based on the received model parameters and the model architecture of the initial cross-city federated migration model, and input the test indicator data corresponding to the training indicator data into the standard cross-city federated migration model. In the federated migration model, the standard data generation module in the standard cross-city federated migration model outputs missing indicator data based on the input test index data, and the standard label module in the standard cross-city federated migration model generates modules based on the input test index data and standard data Output the missing indicator data, and output the target city label. Wherein, the test index data may be data of the same index type as the training index data existing in the target city, for example, the test index data includes POI data, road network data and population data of the target city.

图3为本发明一个实施例所提供的一种跨城市联邦迁移模型的训练方法的具体实例的流程图。具体的，多个源城市(图3中的城市A和城市B)包含丰度的指标数据资源，如指标数据资源可用于表征城市中不同区域的指标数据。基于多个源城市的训练指标数据对初始跨城市联邦迁移模型进行训练，如训练指标数据可以是税务数据、监控数据和轨迹数据。具体的，初始跨城市联邦迁移模型中包含初始数据生成模块和初始标签生成模块，其中，初始数据生成模块可用于学习指标数据之间的关系知识以及基于学习到的关系知识生成预测指标数据，初始标签生成模块基于训练指标数据和预测指标数据，生成预测城市标签。示例性的，预测指标数据可以为人口数据，预测城市标签可以为商业流行度的类别，具体的，预测城市标签为非常高、高、中等、低或非常低。在得到训练完成的目标跨城市联邦迁移模型后，将目标跨城市联邦迁移模型迁移到目标城市(图3中的城市C)中，得到标准跨城市联邦迁移模型。城市C将自身的税务数据、监控数据和轨迹数据输入到标准跨城市联邦迁移模型中，标准跨城市联邦迁移模型中的标准数据生成模块基于输入的税务数据、监控数据和轨迹数据输出预测的人口数据，标准跨城市联邦迁移模型中的标准标签生成模块基于输入的税务数据、监控数据和轨迹数据和预测的人口数据，输出城市C的商业流行度的类别。FIG. 3 is a flow chart of a specific example of a training method for a cross-city federated migration model provided by an embodiment of the present invention. Specifically, multiple source cities (city A and city B in FIG. 3 ) contain abundant index data resources, for example, the index data resources can be used to represent the index data of different regions in the city. The initial cross-city federation migration model is trained based on the training indicator data of multiple source cities. For example, the training indicator data can be tax data, monitoring data, and trajectory data. Specifically, the initial cross-city federated migration model includes an initial data generation module and an initial label generation module, wherein the initial data generation module can be used to learn the relationship knowledge between indicator data and generate predictive indicator data based on the learned relationship knowledge, the initial The label generation module generates predicted city labels based on the training indicator data and the prediction indicator data. Exemplarily, the predictive index data may be population data, and the predicted city label may be a category of commercial popularity, specifically, the predicted city label is very high, high, medium, low or very low. After the trained target cross-city federated migration model is obtained, the target cross-city federated migration model is migrated to the target city (city C in Figure 3) to obtain a standard cross-city federated migration model. City C inputs its own tax data, monitoring data and trajectory data into the standard cross-city federal migration model, and the standard data generation module in the standard cross-city federal migration model outputs the predicted population based on the input tax data, monitoring data and trajectory data Data, the standard label generation module in the standard cross-city federated migration model outputs the category of commercial popularity of city C based on the input tax data, monitoring data and trajectory data and predicted population data.

其中，具体的，标准跨城市联邦迁移模型中的标准数据生成模块包括训练完成的特征提取器和数据回归器。Specifically, the standard data generation module in the standard cross-city federated migration model includes a trained feature extractor and data regressor.

本实施例的技术方案，通过响应于获取到训练指标数据集，基于初始跨城市联邦迁移模型和训练指标数据集中的训练指标数据，确定预测指标数据和预测城市标签，基于与预测指标数据对应的标准指标数据以及与预测城市标签对应的标准城市标签，对初始跨城市联邦迁移模型的模型参数进行调整，得到训练完成的目标跨城市联邦迁移模型，使得训练得到的目标跨城市联邦迁移模型同时具备预测指标数据和预测城市标签的能力，解决了目标城市由于缺失指标数据导致无法完成城市画像任务的问题，扩宽了跨城市迁移方法的应用场景。In the technical solution of this embodiment, by responding to the acquisition of the training index data set, based on the initial cross-city federation transfer model and the training index data in the training index data set, the prediction index data and the prediction city label are determined, based on the corresponding prediction index data The standard index data and the standard city labels corresponding to the predicted city labels are used to adjust the model parameters of the initial cross-city federated migration model to obtain the trained target cross-city federated migration model, so that the trained target cross-city federated migration model has both The ability to predict index data and predict city labels solves the problem that the target city cannot complete the city portrait task due to lack of index data, and broadens the application scenarios of the cross-city migration method.

图4为本发明一个实施例所提供的另一种跨城市联邦迁移模型的训练方法的流程图，本实施例对上述实施例中的“基于与预测指标数据对应的标准指标数据以及与预测城市标签对应的标准城市标签，对初始跨城市联邦迁移模型的模型参数进行调整，得到训练完成的目标跨城市联邦迁移模型”技术特征进行进一步细化。如图4所示，该方法包括：Fig. 4 is a flow chart of another cross-city federation migration model training method provided by an embodiment of the present invention. The standard city label corresponding to the label, the model parameters of the initial cross-city federation migration model are adjusted, and the technical characteristics of the trained target cross-city federation migration model are further refined. As shown in Figure 4, the method includes:

S210、响应于获取到训练指标数据集，基于初始跨城市联邦迁移模型和训练指标数据集中的训练指标数据，确定预测指标数据和预测城市标签。S210. In response to obtaining the training indicator data set, determine predictive indicator data and predicted city labels based on the initial cross-city federation transfer model and the training indicator data in the training indicator data set.

在本实施例中，训练指标数据集中包含当前源城市客户端对应的训练指标数据。In this embodiment, the training index data set includes the training index data corresponding to the current source city client.

S220、针对初始跨城市联邦迁移模型的每次迭代过程，基于与预测指标数据对应的标准指标数据以及与预测城市标签对应的标准城市标签，确定目标损失函数。S220. For each iterative process of the initial inter-city federated migration model, determine a target loss function based on the standard index data corresponding to the predictive index data and the standard city label corresponding to the predicted city label.

在一个实施例中，具体的，目标损失函数包括与初始数据生成模块对应的第一目标损失函数以及与初始标签生成模块对应的第二目标损失函数，相应的，基于与预测指标数据对应的标准指标数据以及与预测城市标签对应的标准城市标签，确定目标损失函数，包括：基于预测指标数据和标准指标数据，构建第一损失函数，以及基于预测城市分类结果和标准城市分类结果，构建第二损失函数；基于第一损失函数和第二损失函数，确定与初始数据生成模块对应的第一目标损失函数；基于预测城市标签和标准城市标签，构建与初始标签生成模块对应的第二目标损失函数。In one embodiment, specifically, the target loss function includes a first target loss function corresponding to the initial data generation module and a second target loss function corresponding to the initial label generation module, correspondingly, based on the standard corresponding to the predictive index data The index data and the standard city labels corresponding to the predicted city labels determine the target loss function, including: constructing the first loss function based on the predicted index data and the standard index data, and constructing the second loss function based on the predicted city classification results and the standard city classification results. Loss function; based on the first loss function and the second loss function, determine the first target loss function corresponding to the initial data generation module; based on the predicted city label and the standard city label, construct the second target loss function corresponding to the initial label generation module .

其中，示例性的，可采用最小平方误差算法，基于预测指标数据和标准指标数据，构建第一损失函数，具体的，第一损失函数L_DR满足公式：Among them, for example, the least square error algorithm can be used to construct the first loss function based on the predictive index data and the standard index data. Specifically, the first loss function L _DR satisfies the formula:

其中，a_i表示与第i个训练指标数据对应的标准指标数据，

与第i个训练指标数据对应的预测指标数据，N表示训练指标数据的训练样本量。Among them, a _i represents the standard index data corresponding to the i-th training index data,

The predictive index data corresponding to the i-th training index data, N represents the training sample size of the training index data.

其中，示例性的，可采用最小化交叉熵算法，基于预测城市分类结果和标准城市分类结果，构建第二损失函数，具体的，第二损失函数L_DC满足公式：Among them, for example, the cross-entropy minimization algorithm can be used to construct the second loss function based on the predicted city classification results and the standard city classification results. Specifically, the second loss function L _DC satisfies the formula:

其中，d_ij表示第j个源城市客户端对应的第i个训练指标数据所属的标准城市分类结果，

表示第j个源城市客户端对应的第i个训练指标数据所属的预测城市分类结果，|C|表示源城市客户端的数量。Among them, d _ij represents the standard city classification result to which the i-th training indicator data corresponding to the j-th source city client belongs,

Indicates the predicted city classification result to which the i-th training indicator data corresponding to the j-th source city client belongs, and |C| represents the number of source city clients.

在上述实施例的基础上，具体的，初始数据生成模块还包括梯度反转层，梯度反转层设置在特征提取器与域分类器之间，梯度反转层用于在初始数据生成模块的反向传播过程中，将输入到梯度反转层中的第二损失函数的函数梯度乘以预设负数，相应的，第一目标损失函数中第二损失函数的系数为预设负数。On the basis of the above embodiments, specifically, the initial data generation module further includes a gradient inversion layer, the gradient inversion layer is set between the feature extractor and the domain classifier, and the gradient inversion layer is used in the initial data generation module In the backpropagation process, the function gradient of the second loss function input into the gradient inversion layer is multiplied by a preset negative number, and correspondingly, the coefficient of the second loss function in the first target loss function is a preset negative number.

其中，具体的，第一目标损失函数L₁满足公式：Among them, specifically, the first objective loss function L ₁ satisfies the formula:

L₁＝L_DR-λL_DC L ₁ =L _DR -λL _DC

这样设置的好处在于，由于不同城市的训练指标数据对应的训练指标特征具有很大的差异，因此，特征提取器应该学习不同城市的训练指标数据之间的通用特征表示，而不是区别特征表示，为此，特征提取器的训练目的应该最大化第二损失函数L_DC。而特征反转层(GRL)的目的是在初始数据生成模块的正向传播过程中，对训练指标特征没有影响，但在反向传播的过程中，特征反转层会将第二损失函数的函数梯度乘以预设负数，本实施例通过采用监督对抗策略，可以实现特征提取器在训练过程中学习提取不同城市的训练指标数据之间的通用特征表示的目的。The advantage of this setting is that since the training index features corresponding to the training index data of different cities are very different, the feature extractor should learn the common feature representation between the training index data of different cities, rather than distinguishing feature representation. For this reason, the training objective of the feature extractor should maximize the second loss function L _DC . The purpose of the feature inversion layer (GRL) is to have no effect on the training index features during the forward propagation of the initial data generation module, but in the process of backpropagation, the feature inversion layer will take the second loss function The function gradient is multiplied by a preset negative number. In this embodiment, by adopting a supervised confrontation strategy, the feature extractor can learn to extract common feature representations between training index data of different cities during the training process.

其中，示例性的，可采用最小化交叉熵算法，基于预测城市标签和标准城市标签，确定第二目标损失函数，具体的，第二目标损失函数L₂满足公式：Wherein, for example, the minimum cross-entropy algorithm can be used to determine the second target loss function based on the predicted city label and the standard city label. Specifically, the second target loss function L ₂ satisfies the formula:

其中，y_ij表示第j种预测城市标签对应的第i个训练指标数据对应的标准城市标签，

表示第j种预测城市标签对应的第i个训练指标数据对应的预测城市标签，|K|表示预测城市标签对应的标签数量。Among them, y _ij represents the standard city label corresponding to the i-th training index data corresponding to the j-th predicted city label,

Indicates the predicted city label corresponding to the i-th training index data corresponding to the j-th predicted city label, and |K| indicates the number of labels corresponding to the predicted city label.

S230、基于目标损失函数，确定当前迭代的初始跨城市联邦迁移模型对应的当前模型参数，并将当前模型参数发送给中央服务器，以使中央服务器基于接收到的至少两个源城市客户端分别发送的当前模型参数，确定聚合模型参数，并将聚合模型参数分别发送给各源城市客户端。S230. Based on the target loss function, determine the current model parameters corresponding to the initial cross-city federated migration model of the current iteration, and send the current model parameters to the central server, so that the central server can send the current model parameters based on the received at least two source city clients. , determine the aggregated model parameters, and send the aggregated model parameters to the clients of each source city respectively.

在本实施例中，至少两个源城市客户端包含当前源城市客户端。In this embodiment, at least two source city clients include the current source city client.

其中，具体的，各源城市客户端分别基于本地的训练指标数据，对初始跨城市联邦迁移模型进行训练，并将每次迭代更新的当前模型参数发送给中央处理器。Specifically, each source city client trains the initial cross-city federation transfer model based on local training index data, and sends the current model parameters updated for each iteration to the central processing unit.

其中，示例性的，中央处理器可以采用求均值或加权平均聚合算法，基于至少两个当前模型参数，确定聚合模型参数。Wherein, for example, the central processing unit may determine the aggregated model parameters based on at least two current model parameters by using an averaging or weighted average aggregation algorithm.

S240、将接收到的中央服务器发送的聚合模型参数作为当前迭代的初始跨城市联邦迁移模型的模型参数，直到目标损失函数收敛时，将当前迭代的初始跨城市联邦迁移模型作为训练完成的目标跨城市联邦迁移模型。S240. Use the aggregated model parameters received from the central server as the model parameters of the initial cross-city federated migration model of the current iteration, until the target loss function converges, use the initial cross-city federated migration model of the current iteration as the target cross-city federation model after training. Urban Federation Migration Model.

S250、将目标跨城市联邦迁移模型的模型参数发送给目标城市客户端，以使目标城市客户端基于接收到的模型参数构建标准跨城市联邦迁移模型，并基于标准跨城市联邦迁移模型确定目标城市标签。S250. Send the model parameters of the target cross-city federation migration model to the target city client, so that the target city client builds a standard cross-city federation migration model based on the received model parameters, and determines the target city based on the standard cross-city federation migration model Label.

在一个实施例中，具体的，将目标跨城市联邦迁移模型的模型参数发送给中央服务器，以使中央服务器基于接收到的至少两个源城市客户端分别发送的目标跨城市联邦迁移模型的模型参数，确定目标模型参数，并将目标模型参数发送给目标城市客户端。In one embodiment, specifically, the model parameters of the target cross-city federation migration model are sent to the central server, so that the central server receives the model parameters of the target cross-city federation migration model respectively sent by at least two source city clients parameters, determine the target model parameters, and send the target model parameters to the target city client.

其中，示例性的，中央处理器可以采用求均值或加权平均聚合算法，基于至少两个源城市客户端分别发送的目标跨城市联邦迁移模型的模型参数，确定目标模型参数。Wherein, for example, the central processor may use an average or a weighted average aggregation algorithm to determine the target model parameters based on the model parameters of the target cross-city federation migration models respectively sent by at least two source city clients.

受到法律或现实的约束，各源城市客户端分别对应的训练指标数据可能会涉及到隐私保护的问题，不能对外公开。因此，可能会存在不能实现采用集中建模训练初始跨城市联邦迁移模型的情况。本实施例的技术方案，通过采用多个源城市客户端分别同时训练各自的初始跨城市联邦迁移模型，针对每个源城市客户端，在当前源城市客户端中的初始跨城市联邦迁移模型的每次迭代过程中，基于与预测指标数据对应的标准指标数据以及与预测城市标签对应的标准城市标签，确定目标损失函数，基于目标损失函数，确定当前迭代的初始跨城市联邦迁移模型对应的当前模型参数，并将当前模型参数发送给中央服务器，以及接收中央服务器对各当前模型参数执行聚合操作后发送的聚合模型参数，将聚合模型参数作为当前迭代的初始跨城市联邦迁移模型的模型参数，直到目标损失函数收敛时，将当前迭代的初始跨城市联邦迁移模型作为训练完成的目标跨城市联邦迁移模型，解决了多个源城市客户端进行训练指标数据交互时存在的隐私保护的问题，满足了跨城市迁移过程中对数据进行隐私保护的需求。Constrained by law or reality, the training index data corresponding to the clients in each source city may involve privacy protection issues and cannot be disclosed to the public. Therefore, there may be situations where it is not possible to train the initial inter-city federated transfer model using centralized modeling. In the technical solution of this embodiment, by using multiple source city clients to train their respective initial cross-city federation migration models at the same time, for each source city client, the initial cross-city federation migration model in the current source city client In each iteration, the target loss function is determined based on the standard indicator data corresponding to the predicted indicator data and the standard city label corresponding to the predicted city label. Model parameters, and send the current model parameters to the central server, and receive the aggregated model parameters sent by the central server after performing an aggregation operation on each current model parameter, and use the aggregated model parameters as the model parameters of the initial cross-city federation migration model of the current iteration, Until the target loss function converges, the initial cross-city federated migration model of the current iteration is used as the target cross-city federated migration model after training, which solves the privacy protection problem that exists when multiple source city clients interact with training index data, and satisfies It meets the demand for privacy protection of data in the process of cross-city migration.

图5为本发明一个实施例所提供的一种跨城市联邦迁移模型的训练方法的流程图，本实施例可适用于对缺失指标数据的目标城市执行城市画像任务的情况，该方法可以由跨城市联邦迁移模型的训练装置来执行，该跨城市联邦迁移模型的训练装置可以采用硬件和/或软件的形式实现，该跨城市联邦迁移模型的训练装置可配置于目标城市客户端中。其中，目标城市客户端可以表征与指标数据资源匮乏的城市对应的客户端。如图5所示，该方法包括：Fig. 5 is a flow chart of a training method for a cross-city federated migration model provided by an embodiment of the present invention. This embodiment is applicable to the case of performing a city portrait task on a target city with missing index data. The training device of the city federation migration model can be implemented by the training device of the cross-city federation migration model. The training device of the cross-city federation migration model can be implemented in the form of hardware and/or software. The training device of the cross-city federation migration model can be configured in the target city client. Wherein, the target city client may represent a client corresponding to a city with scarce index data resources. As shown in Figure 5, the method includes:

S310、接收源城市客户端发送的训练完成的目标跨城市联邦迁移模型的模型参数，并基于模型参数，构建标准跨城市联邦迁移模型。S310. Receive the model parameters of the trained target cross-city federation migration model sent by the source city client, and build a standard cross-city federation migration model based on the model parameters.

在本实施例中，目标跨城市联邦迁移模型是源城市客户端基于初始跨城市联邦迁移模型和训练指标数据集中的训练指标数据，确定预测指标数据和预测城市标签，以及基于与预测指标数据对应的标准指标数据以及与预测城市标签对应的标准城市标签对初始跨城市联邦迁移模型执行训练操作得到的。In this embodiment, the target cross-city federated migration model is determined by the source city client based on the initial cross-city federated migration model and the training index data in the training index data set to determine the predictive index data and predicted city labels, and based on the corresponding prediction index data The standard indicator data and the standard city labels corresponding to the predicted city labels are obtained by performing the training operation on the initial cross-city federated transfer model.

在一个实施例中，具体的，初始跨城市联邦迁移模型包括初始数据生成模块和初始标签生成模块，其中，初始数据生成模块，用于基于输入的训练指标数据集中的训练指标数据，输出预测指标数据，初始标签生成模块，用于基于输入的训练指标数据集中的训练指标数据以及初始数据生成模块输出的预测指标数据，输出预测城市标签。In one embodiment, specifically, the initial cross-city federated migration model includes an initial data generation module and an initial label generation module, wherein the initial data generation module is used to output the prediction index based on the training index data in the input training index data set The data, the initial label generation module, is used to output the predicted city label based on the training indicator data in the input training indicator data set and the prediction indicator data output by the initial data generation module.

在一个实施例中，具体的，模型参数包括目标跨城市联邦迁移模型中的目标数据生成模块对应的数据模型参数和目标标签生成模块对应的标签模型参数，相应的，基于模型参数，构建标准跨城市联邦迁移模型，包括：基于数据模型参数，构建标准跨城市联邦迁移模型中的标准数据生成模块，并将参考指标数据输入到标准数据生成模块中，得到输出的缺失指标数据；基于标签模型参数，构建参考标签生成模块，并基于参考指标数据和缺失指标数据，对参考标签生成模块进行训练，得到标准跨城市联邦迁移模型中的标准标签生成模块。In one embodiment, specifically, the model parameters include the data model parameters corresponding to the target data generation module in the target cross-city federation migration model and the label model parameters corresponding to the target label generation module. Correspondingly, based on the model parameters, the standard cross The city federation migration model, including: based on the data model parameters, constructing the standard data generation module in the standard cross-city federation migration model, and inputting the reference index data into the standard data generation module to obtain the output missing index data; based on the label model parameters , build a reference label generation module, and train the reference label generation module based on the reference index data and missing index data, and obtain the standard label generation module in the standard cross-city federation transfer model.

在一个实施例中，具体的，假设训练指标数据包括源城市客户端的各兴趣点类型分别对应的兴趣点数量d^pf、兴趣点总数量d^pn、兴趣点熵值d^pe、各道路类型分别对应的道路数量d^rf、工作人口数量d^wp和居住人口数量d^rp，相应的，参考指标数据包括目标城市客户端的各兴趣点类型分别对应的兴趣点数量b^pf、兴趣点总数量b^pn、兴趣点熵值b^pe、各道路类型分别对应的道路数量b^rf、工作人口数量b^wp和居住人口数量b^rp，将参考指标数据X_t输入到标准跨城市联邦迁移模型中的标准数据生成模块中，得到输出的缺失指标数据

其中，缺失指标数据

满足如下关系：In one embodiment, specifically, it is assumed that the training index data includes the number of points of interest d ^pf corresponding to the types of points of interest of the source city client, the total number of points of interest d ^pn , the entropy value of points of interest d ^pe , and the number of points of interest corresponding to each type of road. The number of roads d ^rf , the number of working population d ^wp , and the number of resident populations d ^rp . Correspondingly, the reference index data includes the number of points of interest b ^pf , the total number of points of interest b ^pn , and the number of interest points corresponding to each type of point of interest in the target city client. The point entropy value b ^pe , the number of roads b ^rf corresponding to each road type, the number of working population b ^wp and the number of resident population b ^rp , input the reference index data X _t into the standard data generation module in the standard cross-city federal migration model , to get the missing indicator data for the output

Among them, the missing indicator data

Satisfy the following relationship:

其中，Θ_RK表示数据模型参数。Among them, Θ _RK represents the data model parameters.

具体的，将参考指标数据X_t和缺失指标数据

输入到构建的参考标签生成模块中，得到输出的目标城市的预测城市标签

基于目标城市的预测城市标签和目标城市的标准城市标签，对参考标签生成模块进行训练，得到标准跨城市联邦迁移模型中的标准标签生成模块。其中，目标城市的预测城市标签

满足如下关系：Specifically, refer to the index data X _t and the missing index data

Input to the constructed reference label generation module to get the predicted city label of the output target city

Based on the predicted city labels of the target city and the standard city labels of the target city, the reference label generation module is trained to obtain the standard label generation module in the standard cross-city federated transfer model. Among them, the predicted city label of the target city

Satisfy the following relationship:

其中，Θ_task表示标签模型参数。Among them, Θ _task represents the label model parameters.

其中，示例性的，可采用最小化交叉熵算法，构建参考标签生成模块对应的损失函数，具体的，该损失函数L_t满足公式：Wherein, for example, the minimum cross-entropy algorithm can be used to construct the loss function corresponding to the reference label generation module. Specifically, the loss function L _t satisfies the formula:

其中，

表示第j种预测城市标签对应的第i个参考指标数据对应的标准城市标签，

表示第j种预测城市标签对应的第i个参考指标数据对应的预测城市标签，|K|表示预测城市标签对应的标签数量，M表示参考指标数据的数据量。in,

Indicates the standard city label corresponding to the i-th reference index data corresponding to the j-th predicted city label,

Indicates the predicted city label corresponding to the i-th reference indicator data corresponding to the j-th predicted city label, |K| indicates the number of labels corresponding to the predicted city label, and M indicates the data volume of the reference indicator data.

这样设置的好处在于，在跨城市迁移场景中，由于目标城市缺失指标数据，且城市标签在不同城市之间仍然存在差异。因此，可直接将数据模型参数构建的标准数据生成模块冻结在目标城市中，用来生成目标城市缺失的指标数据。然后，基于目标城市中的少量标准标签数据对基于标签模型参数构建的参考标签生成模块进行微调训练，得到目标城市的标准标签生成模块。从而进一步提高了目标城市对应的标准跨城市联邦迁移模型输出的目标城市标签的准确度。The advantage of this setting is that in the cross-city migration scenario, due to the lack of indicator data in the target city, and the city labels still have differences between different cities. Therefore, the standard data generation module constructed by the data model parameters can be directly frozen in the target city to generate the missing index data of the target city. Then, based on a small amount of standard label data in the target city, the reference label generation module constructed based on the label model parameters is fine-tuned and trained to obtain the standard label generation module of the target city. This further improves the accuracy of the target city label output by the standard inter-city federated migration model corresponding to the target city.

S320、将测试指标数据输入到标准跨城市联邦迁移模型中，得到输出的目标城市标签。S320. Input the test index data into the standard inter-city federation migration model to obtain the output target city label.

将与训练指标数据对应的测试指标数据输入到标准跨城市联邦迁移模型中，标准跨城市联邦迁移模型中的标准数据生成模块基于输入的测试指标数据输出缺失指标数据，标准跨城市联邦迁移模型中的标准标签模块基于输入的测试指标数据和标准数据生成模块输出的缺失指标数据，输出目标城市标签。其中，测试指标数据可以是目标城市中已有的与训练指标数据的指标类型相同的数据，如测试指标数据包括目标城市的兴趣点区域数据、道路网络数据和人口数据。Input the test indicator data corresponding to the training indicator data into the standard cross-city federated migration model, the standard data generation module in the standard cross-city federated migration model outputs missing indicator data based on the input test indicator data, and the standard cross-city federated migration model The standard label module of , based on the input test indicator data and the missing indicator data output by the standard data generation module, outputs the target city label. Wherein, the test index data may be data of the same index type as the training index data existing in the target city, for example, the test index data includes POI data, road network data and population data of the target city.

本实施例的技术方案，通过接收源城市客户端发送的训练完成的目标跨城市联邦迁移模型的模型参数，并基于模型参数，构建标准跨城市联邦迁移模型；将测试指标数据输入到标准跨城市联邦迁移模型中，得到输出的目标城市标签，其中，目标跨城市联邦迁移模型是源城市客户端基于初始跨城市联邦迁移模型和训练指标数据集中的训练指标数据，确定预测指标数据和预测城市标签，以及基于与预测指标数据对应的标准指标数据以及与预测城市标签对应的标准城市标签对初始跨城市联邦迁移模型执行训练操作得到的，解决了目标城市由于缺失指标数据导致无法完成城市画像任务的问题，扩宽了跨城市迁移方法的应用场景。In the technical solution of this embodiment, the model parameters of the target cross-city federation migration model sent by the source city client are received, and based on the model parameters, a standard cross-city federation migration model is constructed; the test index data is input into the standard cross-city federation migration model. In the federated migration model, the output target city labels are obtained, where the target cross-city federated migration model is the source city client determines the predictive indicator data and predicted city labels based on the initial cross-city federated migration model and the training indicator data in the training indicator data set , and based on the standard index data corresponding to the predictive index data and the standard city labels corresponding to the predicted city labels to perform training operations on the initial cross-city federation transfer model, which solves the problem that the target city cannot complete the city portrait task due to lack of index data problem, broadening the application scenarios of the cross-city migration method.

图6为本发明一个实施例所提供的一种跨城市联邦迁移模型的训练装置的结构示意图，可配置与源城市客户端中。如图6所示，该装置包括：预测城市标签确定模块410、目标跨城市联邦迁移模型确定模块420和模型参数发送模块430。FIG. 6 is a schematic structural diagram of a training device for a cross-city federated migration model provided by an embodiment of the present invention, which can be configured in a source city client. As shown in FIG. 6 , the device includes: a predicted city label determination module 410 , a target cross-city federation migration model determination module 420 and a model parameter sending module 430 .

其中，预测城市标签确定模块410，用于响应于获取到训练指标数据集，基于初始跨城市联邦迁移模型和训练指标数据集中的训练指标数据，确定预测指标数据和预测城市标签；Among them, the predicted city label determination module 410 is used to determine the predicted indicator data and the predicted city label based on the initial cross-city federation transfer model and the training indicator data in the training indicator dataset in response to the acquisition of the training indicator data set;

目标跨城市联邦迁移模型确定模块420，用于基于与预测指标数据对应的标准指标数据以及与预测城市标签对应的标准城市标签，对初始跨城市联邦迁移模型的模型参数进行调整，得到训练完成的目标跨城市联邦迁移模型；The target cross-city federation migration model determination module 420 is used to adjust the model parameters of the initial cross-city federation migration model based on the standard index data corresponding to the predictive index data and the standard city label corresponding to the predicted city label, and obtain the trained Target cross-city federated migration model;

模型参数发送模块430，用于将目标跨城市联邦迁移模型的模型参数发送给目标城市客户端，以使目标城市客户端基于接收到的模型参数构建标准跨城市联邦迁移模型，并基于标准跨城市联邦迁移模型确定目标城市标签。The model parameter sending module 430 is used to send the model parameters of the target cross-city federation migration model to the target city client, so that the target city client builds a standard cross-city federation migration model based on the received model parameters, and based on the standard cross-city federation A federated migration model determines target city labels.

在上述实施例的基础上，具体的，预测城市标签确定模块410，包括：On the basis of the above-mentioned embodiments, specifically, the predicted city label determination module 410 includes:

训练指标数据输入单元，用于将训练指标数据集中的训练指标数据分别输入到初始跨城市联邦迁移模型中的初始数据生成模块和初始标签生成模块中；The training indicator data input unit is used to input the training indicator data in the training indicator data set into the initial data generation module and the initial label generation module in the initial cross-city federation migration model;

预测指标数据输出单元，用于通过初始数据生成模块，基于输入的训练指标数据，输出预测指标数据；The predictive index data output unit is used to output the predictive index data based on the input training index data through the initial data generation module;

预测城市标签输出单元，用于通过初始标签生成模块，基于输入的训练指标数据以及初始数据生成模块输出的预测指标数据，输出预测城市标签。The predicted city label output unit is used to output the predicted city label through the initial label generation module based on the input training indicator data and the prediction indicator data output by the initial data generation module.

在上述实施例的基础上，具体的，初始数据生成模块包括特征提取器、数据回归器和域分类器，预测指标数据输出单元，具体用于：On the basis of the above embodiments, specifically, the initial data generation module includes a feature extractor, a data regressor, a domain classifier, and a predictive index data output unit, specifically for:

将训练指标数据输入到特征提取器中，得到输出的训练指标特征；Input the training index data into the feature extractor to obtain the output training index features;

通过数据回归器，基于特征提取器输出的训练指标特征，输出预测指标数据，以及通过域分类器，基于特征提取器输出的训练指标特征，输出预测城市分类结果。Through the data regressor, based on the training indicator features output by the feature extractor, the predictive indicator data is output, and through the domain classifier, based on the training indicator features output by the feature extractor, the predicted city classification results are output.

在上述实施例的基础上，具体的，训练指标数据集中包含当前源城市客户端对应的训练指标数据，目标跨城市联邦迁移模型确定模块420包括：On the basis of the above-mentioned embodiments, specifically, the training index data set contains the training index data corresponding to the current source city client, and the target cross-city federation migration model determination module 420 includes:

目标损失函数确定单元，用于针对初始跨城市联邦迁移模型的每次迭代过程，基于与预测指标数据对应的标准指标数据以及与预测城市标签对应的标准城市标签，确定目标损失函数；The target loss function determination unit is used to determine the target loss function based on the standard indicator data corresponding to the predictive indicator data and the standard city label corresponding to the predicted city label for each iteration process of the initial cross-city federated migration model;

当前模型参数发送单元，用于基于目标损失函数，确定当前迭代的初始跨城市联邦迁移模型对应的当前模型参数，并将当前模型参数发送给中央服务器，以使中央服务器基于接收到的至少两个源城市客户端分别发送的当前模型参数，确定聚合模型参数，并将聚合模型参数分别发送给各源城市客户端；其中，至少两个源城市客户端中包含当前源城市客户端；The current model parameter sending unit is used to determine the current model parameters corresponding to the initial cross-city federation migration model of the current iteration based on the target loss function, and send the current model parameters to the central server, so that the central server is based on the received at least two The current model parameters sent by the source city clients respectively, determine the aggregation model parameters, and send the aggregation model parameters to each source city client respectively; wherein, at least two source city clients include the current source city client;

第一目标跨城市联邦迁移模型确定单元，用于将接收到的中央服务器发送的聚合模型参数作为当前迭代的初始跨城市联邦迁移模型的模型参数，直到目标损失函数收敛时，将当前迭代的初始跨城市联邦迁移模型作为训练完成的目标跨城市联邦迁移模型。The first target cross-city federated migration model determination unit is used to use the received aggregation model parameters sent by the central server as the model parameters of the initial cross-city federated migration model of the current iteration, until the target loss function converges, the initial The cross-city federated migration model is used as the target cross-city federated migration model after training.

在上述实施例的基础上，具体的，模型参数发送模块430，具体用于：On the basis of the above embodiments, specifically, the model parameter sending module 430 is specifically used for:

将目标跨城市联邦迁移模型的模型参数发送给中央服务器，以使中央服务器基于接收到的至少两个源城市客户端分别发送的目标跨城市联邦迁移模型的模型参数，确定目标模型参数，并将目标模型参数发送给目标城市客户端。Send the model parameters of the target cross-city federation migration model to the central server, so that the central server determines the target model parameters based on the received model parameters of the target cross-city federation migration model sent by at least two source city clients, and sends The target model parameters are sent to the target city client.

在上述实施例的基础上，具体的，训练指标数据集中包含与至少两个源城市客户端分别对应的训练指标数据，目标跨城市联邦迁移模型确定模块420包括：On the basis of the above-mentioned embodiments, specifically, the training index data set contains training index data corresponding to at least two source city clients respectively, and the target inter-city federation migration model determination module 420 includes:

第二目标跨城市联邦迁移模型确定单元，用于基于目标损失函数，确定当前迭代的初始跨城市联邦迁移模型对应的当前模型参数，直到目标损失函数收敛时，将当前迭代的初始跨城市联邦迁移模型作为训练完成的目标跨城市联邦迁移模型。The second target cross-city federated migration model determination unit is used to determine the current model parameters corresponding to the initial cross-city federated migration model of the current iteration based on the target loss function, until the target loss function converges, the initial cross-city federated migration of the current iteration The model is used as the target cross-city federation migration model after training.

在上述实施例的基础上，具体的，目标损失函数包括与初始数据生成模块对应的第一目标损失函数以及与初始标签生成模块对应的第二目标损失函数，目标损失函数确定单元，具体用于：On the basis of the above embodiments, specifically, the target loss function includes a first target loss function corresponding to the initial data generation module and a second target loss function corresponding to the initial label generation module, and the target loss function determination unit is specifically used for :

基于预测指标数据和标准指标数据，构建第一损失函数，以及基于预测城市分类结果和标准城市分类结果，构建第二损失函数；Construct a first loss function based on the predictive index data and standard index data, and construct a second loss function based on the predicted city classification results and standard city classification results;

基于第一损失函数和第二损失函数，确定与初始数据生成模块对应的第一目标损失函数；Based on the first loss function and the second loss function, determine a first target loss function corresponding to the initial data generation module;

基于预测城市标签和标准城市标签，构建与初始标签生成模块对应的第二目标损失函数。Based on the predicted city labels and standard city labels, a second objective loss function corresponding to the initial label generation module is constructed.

在上述实施例的基础上，具体的，训练指标数据包括兴趣点区域数据、道路网络数据和人口数据，其中，兴趣点区域数据包括与至少一个兴趣点类型分别对应的兴趣点数量、兴趣点总数量和兴趣点熵值中至少一种，道路网络数据包括与至少一种道路类型分别对应的道路数量，人口数据包括工作人口数量和/或居住人口数量，相应的，预测指标数据为消费人口数量，预测城市标签为商业流行度的类别。On the basis of the above-mentioned embodiments, specifically, the training index data includes point-of-interest area data, road network data, and population data, wherein the point-of-interest area data includes the number of points of interest, the total number of points of interest corresponding to at least one type of point of interest, respectively. At least one of quantity and interest point entropy value, the road network data includes the number of roads corresponding to at least one road type, the population data includes the number of working population and/or the number of living population, and correspondingly, the predictive index data is the number of consumption population , predicting the city label as a category of commercial popularity.

图7为本发明一个实施例所提供的一种跨城市联邦迁移模型的训练装置的结构示意图，可配置于目标城市客户端中。如图7所示，该装置包括：标准跨城市联邦迁移模型构建模块510和目标城市标签确定模块520。FIG. 7 is a schematic structural diagram of a training device for a cross-city federation transfer model provided by an embodiment of the present invention, which can be configured in a target city client. As shown in FIG. 7 , the device includes: a standard cross-city federation migration model building module 510 and a target city label determination module 520 .

其中，标准跨城市联邦迁移模型构建模块510，用于接收源城市客户端发送的训练完成的目标跨城市联邦迁移模型的模型参数，并基于模型参数，构建标准跨城市联邦迁移模型；Wherein, the standard cross-city federated migration model building module 510 is used to receive the model parameters of the trained target cross-city federated migration model sent by the source city client, and build a standard cross-city federated migration model based on the model parameters;

目标城市标签确定模块520，用于将测试指标数据输入到标准跨城市联邦迁移模型中，得到输出的目标城市标签；The target city label determination module 520 is used to input the test indicator data into the standard cross-city federation migration model to obtain the output target city label;

其中，目标跨城市联邦迁移模型是源城市客户端基于初始跨城市联邦迁移模型和训练指标数据集中的训练指标数据，确定预测指标数据和预测城市标签，以及基于与预测指标数据对应的标准指标数据以及与预测城市标签对应的标准城市标签对初始跨城市联邦迁移模型执行训练操作得到的。Among them, the target cross-city federated migration model is based on the initial cross-city federated migration model and the training index data in the training index data set by the source city client to determine the predictive index data and predicted city labels, and based on the standard index data corresponding to the predictive index data And the standard city labels corresponding to the predicted city labels are obtained by performing the training operation on the initial cross-city federated transfer model.

本实施例的技术方案，通过接收源城市客户端发送的训练完成的目标跨城市联邦迁移模型的模型参数，并基于模型参数，构建标准跨城市联邦迁移模型；将测试指标数据输入到标准跨城市联邦迁移模型中，得到输出的目标城市标签，其中，目标跨城市联邦迁移模型是源城市客户端基于初始跨城市联邦迁移模型和训练指标数据集中的训练指标数据，确定预测指标数据和预测城市标签，以及基于与预测指标数据对应的标准指标数据以及与预测城市标签对应的标准城市标签对初始跨城市联邦迁移模型执行训练操作得到的，解决了目标城市由于缺失指标数据导致无法完成城市画像任务的问题，扩宽了跨城市迁移方法的应用场景。In the technical solution of this embodiment, the model parameters of the target cross-city federation migration model sent by the source city client are received, and based on the model parameters, a standard cross-city federation migration model is constructed; the test index data is input into the standard cross-city federation migration model. In the federated migration model, the output target city labels are obtained, where the target cross-city federated migration model is the source city client determines the predictive indicator data and predicted city labels based on the initial cross-city federated migration model and the training indicator data in the training indicator data set , and based on the standard index data corresponding to the predictive index data and the standard city labels corresponding to the predicted city labels to perform training operations on the initial cross-city federated migration model, which solves the problem that the target city cannot complete the city portrait task due to lack of index data problem, broadening the application scenarios of the cross-city migration method.

在上述实施例的基础上，具体的，模型参数包括目标跨城市联邦迁移模型中的目标数据生成模块对应的数据模型参数和目标标签生成模块对应的标签模型参数，标准跨城市联邦迁移模型构建模块510，具体用于：On the basis of the above embodiments, specifically, the model parameters include the data model parameters corresponding to the target data generation module in the target cross-city federation migration model and the label model parameters corresponding to the target label generation module, and the standard cross-city federation migration model construction module 510, specifically for:

基于数据模型参数，构建标准跨城市联邦迁移模型中的标准数据生成模块，并将参考指标数据输入到标准数据生成模块中，得到输出的缺失指标数据；Based on the data model parameters, construct the standard data generation module in the standard cross-city federation migration model, and input the reference index data into the standard data generation module to obtain the output missing index data;

基于标签模型参数，构建参考标签生成模块，并基于参考指标数据和缺失指标数据，对参考标签生成模块进行训练，得到标准跨城市联邦迁移模型中的标准标签生成模块。Based on the parameters of the label model, a reference label generation module is constructed, and based on the reference index data and missing index data, the reference label generation module is trained to obtain the standard label generation module in the standard cross-city federation transfer model.

本发明上述实施例所提供的跨城市联邦迁移模型的训练装置可执行本发明上述实施例所提供的跨城市联邦迁移模型的训练方法，具备执行方法相应的功能模块和有益效果。The training device for the cross-city federated migration model provided by the above-mentioned embodiments of the present invention can execute the training method for the cross-city federated migration model provided by the above-mentioned embodiments of the present invention, and has corresponding functional modules and beneficial effects for executing the method.

图8为本发明一个实施例所提供的一种跨城市联邦迁移模型的训练系统的结构示意图，该跨城市联邦迁移模型的训练系统可为上述实施例中的跨城市联邦迁移模型的训练方法提供服务。Fig. 8 is a schematic structural diagram of a training system for a cross-city federated migration model provided by an embodiment of the present invention. Serve.

如图8所示，该跨城市联邦迁移模型的训练系统600包括：源城市客户端610和目标城市客户端620；其中，源城市客户端610，用于响应于获取到训练指标数据集，基于初始跨城市联邦迁移模型和训练指标数据集中的训练指标数据，确定预测指标数据和预测城市标签，以及基于与预测指标数据对应的标准指标数据以及与预测城市标签对应的标准城市标签，对初始跨城市联邦迁移模型的模型参数进行调整，得到训练完成的目标跨城市联邦迁移模型，并将目标跨城市联邦迁移模型的模型参数发送给目标城市客户端620；目标城市客户端620，用于基于接收到的模型参数，构建标准跨城市联邦迁移模型，并基于标准跨城市联邦迁移模型，确定目标城市标签。As shown in FIG. 8 , the training system 600 of the cross-city federated migration model includes: a source city client 610 and a target city client 620; wherein, the source city client 610 is used to respond to the acquisition of the training index data set, based on The initial cross-city federation migration model and the training index data in the training index data set determine the predictive index data and predicted city labels, and based on the standard index data corresponding to the predictive index data and the standard city labels corresponding to the predicted city labels, the initial inter-city The model parameters of the city federation migration model are adjusted to obtain the trained target cross-city federation migration model, and the model parameters of the target cross-city federation migration model are sent to the target city client 620; the target city client 620 is used to receive Based on the obtained model parameters, a standard cross-city federation migration model is constructed, and based on the standard cross-city federation migration model, the target city label is determined.

在一个实施例中，具体的，源城市客户端610，具体用于:将训练指标数据集中的训练指标数据分别输入到初始跨城市联邦迁移模型中的初始数据生成模块和初始标签生成模块中；通过初始数据生成模块，基于输入的训练指标数据，输出预测指标数据；通过初始标签生成模块，基于输入的训练指标数据以及初始数据生成模块输出的预测指标数据，输出预测城市标签。In one embodiment, specifically, the source city client 610 is specifically configured to: respectively input the training indicator data in the training indicator data set into the initial data generation module and the initial label generation module in the initial cross-city federation migration model; Through the initial data generation module, output prediction index data based on the input training index data; through the initial label generation module, based on the input training index data and the prediction index data output by the initial data generation module, output the predicted city label.

在一个实施例中，具体的，源城市客户端610，具体用于：将训练指标数据集中的训练指标数据分别输入到初始跨城市联邦迁移模型中的初始数据生成模块和初始标签生成模块中；通过初始数据生成模块，基于输入的训练指标数据，输出预测指标数据；通过初始标签生成模块，基于输入的训练指标数据以及初始数据生成模块输出的预测指标数据，输出预测城市标签。In one embodiment, specifically, the source city client 610 is specifically configured to: respectively input the training indicator data in the training indicator data set into the initial data generation module and the initial label generation module in the initial cross-city federation migration model; Through the initial data generation module, output prediction index data based on the input training index data; through the initial label generation module, based on the input training index data and the prediction index data output by the initial data generation module, output the predicted city label.

在一个实施例中，具体的，训练指标数据集中包含当前源城市客户端610对应的训练指标数据，相应的，源城市客户端610，具体用于：针对初始跨城市联邦迁移模型的每次迭代过程，基于与预测指标数据对应的标准指标数据以及与预测城市标签对应的标准城市标签，确定目标损失函数；基于目标损失函数，确定当前迭代的初始跨城市联邦迁移模型对应的当前模型参数，并将当前模型参数发送给中央服务器，以使中央服务器基于接收到的至少两个源城市客户端610分别发送的当前模型参数，确定聚合模型参数，并将聚合模型参数分别发送给各源城市客户端610；其中，至少两个源城市客户端610中包含当前源城市客户端610；将接收到的中央服务器发送的聚合模型参数作为当前迭代的初始跨城市联邦迁移模型的模型参数，直到目标损失函数收敛时，将当前迭代的初始跨城市联邦迁移模型作为训练完成的目标跨城市联邦迁移模型。In one embodiment, specifically, the training index data set contains the training index data corresponding to the current source city client 610, and correspondingly, the source city client 610 is specifically used for: each iteration of the initial cross-city federation migration model The process is to determine the target loss function based on the standard indicator data corresponding to the predicted indicator data and the standard city label corresponding to the predicted city label; based on the target loss function, determine the current model parameters corresponding to the initial cross-city federation migration model of the current iteration, and Sending the current model parameters to the central server, so that the central server determines the aggregation model parameters based on the received current model parameters sent by at least two source city clients 610, and sends the aggregation model parameters to the source city clients respectively 610; where at least two source city clients 610 include the current source city client 610; the received aggregation model parameters sent by the central server are used as the model parameters of the initial cross-city federation migration model of the current iteration until the target loss function When converging, the initial cross-city federated migration model of the current iteration is used as the target cross-city federated migration model after training.

在另一个实施例中，具体的，源城市客户端610的数量为至少两个，相应的，跨城市联邦迁移模型的训练系统600还包括中央服务器，中央服务器用于基于接收到的至少两个源城市客户端610分别发送的当前模型参数，确定聚合模型参数，并将聚合模型参数分别发送给各源城市客户端610；源城市客户端610，具体用于：针对初始跨城市联邦迁移模型的每次迭代过程，基于与预测指标数据对应的标准指标数据以及与预测城市标签对应的标准城市标签，确定目标损失函数，以及基于目标损失函数，确定当前迭代的初始跨城市联邦迁移模型对应的当前模型参数，并将当前模型参数发送给中央服务器；将接收到的聚合模型参数作为当前迭代的初始跨城市联邦迁移模型的模型参数，直到目标损失函数收敛时，将当前迭代的初始跨城市联邦迁移模型作为训练完成的目标跨城市联邦迁移模型。In another embodiment, specifically, the number of source city clients 610 is at least two. Correspondingly, the training system 600 of the cross-city federation transfer model further includes a central server, and the central server is used to receive at least two The current model parameters sent by the source city client 610 respectively, determine the aggregation model parameters, and send the aggregation model parameters to each source city client 610 respectively; the source city client 610 is specifically used for: for the initial cross-city federation migration model In each iteration process, based on the standard indicator data corresponding to the predicted indicator data and the standard city label corresponding to the predicted city label, the target loss function is determined, and based on the target loss function, the current iteration corresponding to the initial cross-city federation migration model is determined. model parameters, and send the current model parameters to the central server; use the received aggregated model parameters as the model parameters of the current iteration's initial cross-city federation migration model, until the target loss function converges, the current iteration's initial cross-city federation migration model The model is used as the target cross-city federation migration model after training.

在一个实施例中，具体的，模型参数包括目标跨城市联邦迁移模型中的目标数据生成模块对应的数据模型参数和目标标签生成模块对应的标签模型参数，目标城市客户端620，具体用于：基于数据模型参数，构建标准跨城市联邦迁移模型中的标准数据生成模块，并将参考指标数据输入到标准数据生成模块中，得到输出的缺失指标数据；基于标签模型参数，构建参考标签生成模块，并基于参考指标数据和缺失指标数据，对参考标签生成模块进行训练，得到标准跨城市联邦迁移模型中的标准标签生成模块。In one embodiment, specifically, the model parameters include the data model parameters corresponding to the target data generation module in the target cross-city federation migration model and the label model parameters corresponding to the target label generation module. The target city client 620 is specifically used for: Based on the data model parameters, construct the standard data generation module in the standard cross-city federation migration model, and input the reference index data into the standard data generation module to obtain the output missing index data; based on the label model parameters, construct the reference label generation module, And based on the reference index data and missing index data, the reference label generation module is trained to obtain the standard label generation module in the standard cross-city federation transfer model.

图9为本发明一个实施例所提供的一种跨城市联邦迁移模型的训练系统的系统原理图。具体的，中央处理器将初始跨城市联邦迁移模型的模型架构分别发送给源城市-1客户端、源城市-2客户端…源城市-J客户端，并对各源城市客户端610中的初始跨城市联邦迁移模型的模型参数进行初始化。源城市-1客户端、源城市-2客户端…源城市-J客户端分别基于本地的训练指标数据，对初始跨城市联邦迁移模型在本源城市客户端610中进行本地训练。针对每个源城市客户端610，在初始跨城市联邦迁移模型的每次迭代的过程中，基于与预测指标数据对应的标准指标数据以及与预测城市标签对应的标准城市标签，确定初始跨城市联邦迁移模型的当前模型参数，并将当前模型参数发送给中央处理器，中央处理器对各源城市客户端610分别发送的当前模型参数执行聚合操作，得到全局模型，将全局模型中的聚合模型参数分别发送给源城市-1客户端、源城市-2客户端…源城市-J客户端，以使各源城市客户端610基于接收到的聚合模型参数继续进行训练。FIG. 9 is a system schematic diagram of a training system for a cross-city federation migration model provided by an embodiment of the present invention. Specifically, the central processor sends the model architecture of the initial cross-city federation migration model to the source city-1 client, source city-2 client... The model parameters of the initial cross-city federated migration model are initialized. The source city-1 client, the source city-2 client...the source city-J client performs local training on the initial cross-city federation transfer model in the source city client 610 based on the local training index data respectively. For each source city client 610, during each iteration of the initial cross-city federation migration model, an initial cross-city federation is determined based on the standard indicator data corresponding to the predicted indicator data and the standard city label corresponding to the predicted city label. The current model parameters of the migration model are sent to the central processor, and the central processor performs an aggregation operation on the current model parameters sent by the client 610 of each source city to obtain a global model, and the aggregated model parameters in the global model They are respectively sent to the source city-1 client, source city-2 client...source city-J client, so that each source city client 610 continues training based on the received aggregation model parameters.

其中，初始跨城市联邦迁移模型包括初始数据生成模块和初始标签生成模块，其中，初始数据生成模块中包括特征提取器、数据回归器、域分类器和特征反转层，基于数据回归器对应的第一损失函数L_DR和域分类器对应的第二损失函数L_DC，对初始数据生成模块的模型参数进行调整。其中，通过初始标签生成模块，基于训练训练指标数据和初始数据生成模块输出的预测指标数据，输出预测城市标签，基于预测城市标签和标准城市标签构建的第二目标损失函数L₂对初始标签生成模块的模型参数进行调整。Among them, the initial cross-city federation migration model includes an initial data generation module and an initial label generation module, wherein the initial data generation module includes a feature extractor, a data regressor, a domain classifier and a feature inversion layer, based on the corresponding The first loss function L _DR and the second loss function L _DC corresponding to the domain classifier adjust the model parameters of the initial data generation module. Among them, through the initial label generation module, based on the training index data and the prediction index data output by the initial data generation module, the predicted city label is output, and the second target loss function L ₂ constructed based on the predicted city label and the standard city label generates the initial label The model parameters of the module are adjusted.

其中，具体的，中央服务器基于接收到的至少两个源城市客户端610分别发送的目标跨城市联邦迁移模型的模型参数，确定目标模型参数，并将目标模型参数发送给目标城市客户端620。目标城市客户端620冻结基于数据模型参数构建的标准数据生成模块，并基于参考指标数据以其对应的标准城市标签对基于标签模型参数构建的参考标签生成模块进行微调训练，得到标准标签生成模块。Wherein, specifically, the central server determines the target model parameters based on the received model parameters of the target inter-city federation migration model sent by at least two source city clients 610 , and sends the target model parameters to the target city client 620 . The target city client 620 freezes the standard data generation module built based on the data model parameters, and fine-tunes and trains the reference tag generation module built based on the tag model parameters with its corresponding standard city tags based on the reference index data to obtain a standard tag generation module.

本实施例的技术方案，通过跨城市联邦迁移模型的训练系统中的源城市客户端，响应于获取到训练指标数据集，基于初始跨城市联邦迁移模型和训练指标数据集中的训练指标数据，确定预测指标数据和预测城市标签，基于与预测指标数据对应的标准指标数据以及与预测城市标签对应的标准城市标签，对初始跨城市联邦迁移模型的模型参数进行调整，得到训练完成的目标跨城市联邦迁移模型，使得训练得到的目标跨城市联邦迁移模型同时具备预测指标数据和预测城市标签的能力，解决了目标城市由于缺失指标数据导致无法完成城市画像任务的问题，扩宽了跨城市迁移方法的应用场景。In the technical solution of this embodiment, through the source city client in the training system of the cross-city federated migration model, in response to the acquisition of the training index data set, based on the initial cross-city federated migration model and the training index data in the training index data set, determine Prediction index data and prediction city labels, based on the standard index data corresponding to the prediction index data and the standard city labels corresponding to the prediction city labels, adjust the model parameters of the initial cross-city federation migration model to obtain the trained target cross-city federation The migration model enables the trained target cross-city federation migration model to have the ability to predict index data and city labels at the same time, which solves the problem that the target city cannot complete the city portrait task due to lack of index data, and broadens the cross-city migration method. Application scenarios.

图10为本发明一个实施例所提供的一种终端设备的结构示意图。终端设备10旨在表示各种形式的数字计算机，诸如，膝上型计算机、台式计算机、工作台、个人数字助理、服务器、刀片式服务器、大型计算机、和其它适合的计算机。终端设备还可以表示各种形式的移动装置，诸如，个人数字处理、蜂窝电话、智能电话、可穿戴设备(如头盔、眼镜、手表等)和其它类似的计算装置。本文所示的部件、它们的连接和关系、以及它们的功能仅仅作为示例，并且不意在限制本文中描述的和/或者要求的本发明的实现。Fig. 10 is a schematic structural diagram of a terminal device provided by an embodiment of the present invention. Terminal device 10 is intended to represent various forms of digital computers, such as laptops, desktops, workstations, personal digital assistants, servers, blade servers, mainframes, and other suitable computers. Terminal devices may also represent various forms of mobile devices, such as personal digital processing, cellular phones, smartphones, wearable devices (eg, helmets, glasses, watches, etc.), and other similar computing devices. The components shown herein, their connections and relationships, and their functions, are by way of example only, and are not intended to limit implementations of the inventions described and/or claimed herein.

在本实施例中，终端设备10为源城市客户端或目标城市客户端。In this embodiment, the terminal device 10 is a source city client or a target city client.

如图10所示，终端设备10包括至少一个处理器11，以及与至少一个处理器11通信连接的存储器，如只读存储器(ROM)12、随机访问存储器(RAM)13等，其中，存储器存储有可被至少一个处理器执行的计算机程序，处理器11可以根据存储在只读存储器(ROM)12中的计算机程序或者从存储单元18加载到随机访问存储器(RAM)13中的计算机程序，来执行各种适当的动作和处理。在RAM 13中，还可存储终端设备10操作所需的各种程序和数据。处理器11、ROM 12以及RAM 13通过总线14彼此相连。输入/输出(I/O)接口15也连接至总线14。As shown in FIG. 10 , the terminal device 10 includes at least one processor 11, and a memory communicatively connected with the at least one processor 11, such as a read-only memory (ROM) 12, a random access memory (RAM) 13, etc., wherein the memory stores There is a computer program executable by at least one processor, and the processor 11 can operate according to a computer program stored in a read-only memory (ROM) 12 or loaded from a storage unit 18 into a random access memory (RAM) 13. Various appropriate actions and processes are performed. In the RAM 13, various programs and data necessary for the operation of the terminal device 10 can also be stored. The processor 11, ROM 12, and RAM 13 are connected to each other through a bus 14. An input/output (I/O) interface 15 is also connected to the bus 14 .

终端设备10中的多个部件连接至I/O接口15，包括：输入单元16，例如键盘、鼠标等；输出单元17，例如各种类型的显示器、扬声器等；存储单元18，例如磁盘、光盘等；以及通信单元19，例如网卡、调制解调器、无线通信收发机等。通信单元19允许终端设备10通过诸如因特网的计算机网络和/或各种电信网络与其他设备交换信息/数据。Multiple components in the terminal device 10 are connected to the I/O interface 15, including: an input unit 16, such as a keyboard, a mouse, etc.; an output unit 17, such as various types of displays, speakers, etc.; a storage unit 18, such as a magnetic disk, an optical disk etc.; and a communication unit 19, such as a network card, a modem, a wireless communication transceiver, and the like. The communication unit 19 allows the terminal device 10 to exchange information/data with other devices through a computer network such as the Internet and/or various telecommunication networks.

处理器11可以是各种具有处理和计算能力的通用和/或专用处理组件。处理器11的一些示例包括但不限于中央处理单元(CPU)、图形处理单元(GPU)、各种专用的人工智能(AI)计算芯片、各种运行机器学习模型算法的处理器、数字信号处理器(DSP)、以及任何适当的处理器、控制器、微控制器等。处理器11执行上文所描述的各个方法和处理，例如跨城市联邦迁移模型的训练方法。Processor 11 may be various general and/or special purpose processing components having processing and computing capabilities. Some examples of processor 11 include, but are not limited to, central processing units (CPUs), graphics processing units (GPUs), various dedicated artificial intelligence (AI) computing chips, various processors that run machine learning model algorithms, digital signal processing processor (DSP), and any suitable processor, controller, microcontroller, etc. The processor 11 executes the various methods and processes described above, for example, the training method of the cross-city federation migration model.

在一些实施例中，跨城市联邦迁移模型的训练方法可被实现为计算机程序，其被有形地包含于计算机可读存储介质，例如存储单元18。在一些实施例中，计算机程序的部分或者全部可以经由ROM 12和/或通信单元19而被载入和/或安装到终端设备10上。当计算机程序加载到RAM 13并由处理器11执行时，可以执行上文描述的跨城市联邦迁移模型的训练方法的一个或多个步骤。备选地，在其他实施例中，处理器11可以通过其他任何适当的方式(例如，借助于固件)而被配置为执行跨城市联邦迁移模型的训练方法。In some embodiments, the method for training a federated migration model across cities can be implemented as a computer program tangibly embodied in a computer-readable storage medium, such as the storage unit 18 . In some embodiments, part or all of the computer program can be loaded and/or installed on the terminal device 10 via the ROM 12 and/or the communication unit 19. When the computer program is loaded into the RAM 13 and executed by the processor 11, one or more steps of the training method of the cross-city federation migration model described above can be performed. Alternatively, in other embodiments, the processor 11 may be configured in any other appropriate manner (for example, by means of firmware) to execute a method for training a cross-city federation migration model.

本文中以上描述的系统和技术的各种实施方式可以在数字电子电路系统、集成电路系统、场可编程门阵列(FPGA)、专用集成电路(ASIC)、专用标准产品(ASSP)、芯片上系统的系统(SOC)、负载可编程逻辑设备(CPLD)、计算机硬件、固件、软件、和/或它们的组合中实现。这些各种实施方式可以包括：实施在一个或者多个计算机程序中，该一个或者多个计算机程序可在包括至少一个可编程处理器的可编程系统上执行和/或解释，该可编程处理器可以是专用或者通用可编程处理器，可以从存储系统、至少一个输入装置、和至少一个输出装置接收数据和指令，并且将数据和指令传输至该存储系统、该至少一个输入装置、和该至少一个输出装置。Various implementations of the systems and techniques described above herein can be implemented in digital electronic circuit systems, integrated circuit systems, field programmable gate arrays (FPGAs), application specific integrated circuits (ASICs), application specific standard products (ASSPs), systems on chips Implemented in a system of systems (SOC), load programmable logic device (CPLD), computer hardware, firmware, software, and/or combinations thereof. These various embodiments may include being implemented in one or more computer programs executable and/or interpreted on a programmable system including at least one programmable processor, the programmable processor Can be special-purpose or general-purpose programmable processor, can receive data and instruction from storage system, at least one input device, and at least one output device, and transmit data and instruction to this storage system, this at least one input device, and this at least one output device an output device.

用于实施本发明的跨城市联邦迁移模型的训练方法的计算机程序可以采用一个或多个编程语言的任何组合来编写。这些计算机程序可以提供给通用计算机、专用计算机或其他可编程数据处理装置的处理器，使得计算机程序当由处理器执行时使流程图和/或框图中所规定的功能/操作被实施。计算机程序可以完全在机器上执行、部分地在机器上执行，作为独立软件包部分地在机器上执行且部分地在远程机器上执行或完全在远程机器或服务器上执行。The computer program for implementing the training method of the cross-city federated migration model of the present invention can be written in any combination of one or more programming languages. These computer programs can be provided to a processor of a general-purpose computer, a special-purpose computer, or other programmable data processing apparatus, so that the computer program causes the functions/operations specified in the flowcharts and/or block diagrams to be implemented when executed by the processor. A computer program may execute entirely on the machine, partly on the machine, as a stand-alone software package partly on the machine and partly on a remote machine or entirely on the remote machine or server.

本发明一个实施例还提供了一种计算机可读存储介质，计算机可读存储介质存储有计算机指令，计算机指令用于使处理器执行一种跨城市联邦迁移模型的训练方法，该方法包括：An embodiment of the present invention also provides a computer-readable storage medium, the computer-readable storage medium stores computer instructions, and the computer instructions are used to make a processor execute a training method for a cross-city federation migration model, the method comprising:

响应于获取到训练指标数据集，基于初始跨城市联邦迁移模型和训练指标数据集中的训练指标数据，确定预测指标数据和预测城市标签；In response to obtaining the training indicator data set, based on the initial cross-city federation transfer model and the training indicator data in the training indicator data set, determine the predictive indicator data and the predicted city label;

基于与预测指标数据对应的标准指标数据以及与预测城市标签对应的标准城市标签，对初始跨城市联邦迁移模型的模型参数进行调整，得到训练完成的目标跨城市联邦迁移模型；Based on the standard index data corresponding to the predicted index data and the standard city label corresponding to the predicted city label, the model parameters of the initial cross-city federation migration model are adjusted to obtain the trained target cross-city federation migration model;

将目标跨城市联邦迁移模型的模型参数发送给目标城市客户端，以使目标城市客户端基于接收到的模型参数构建标准跨城市联邦迁移模型，并基于标准跨城市联邦迁移模型确定目标城市标签。Send the model parameters of the target cross-city federation migration model to the target city client, so that the target city client builds a standard cross-city federation migration model based on the received model parameters, and determines the target city label based on the standard cross-city federation migration model.

或者，计算机指令用于使处理器执行另一种跨城市联邦迁移模型的训练方法，该方法包括：Alternatively, the computer instructions are used to cause the processor to execute another cross-city federation transfer model training method, the method comprising:

接收源城市客户端发送的训练完成的目标跨城市联邦迁移模型的模型参数，并基于模型参数，构建标准跨城市联邦迁移模型；Receive the model parameters of the trained target cross-city federation migration model sent by the source city client, and build a standard cross-city federation migration model based on the model parameters;

将测试指标数据输入到标准跨城市联邦迁移模型中，得到输出的目标城市标签；Input the test indicator data into the standard cross-city federation migration model to get the output target city label;

在本发明的上下文中，计算机可读存储介质可以是有形的介质，其可以包含或存储以供指令执行系统、装置或设备使用或与指令执行系统、装置或设备结合地使用的计算机程序。计算机可读存储介质可以包括但不限于电子的、磁性的、光学的、电磁的、红外的、或半导体系统、装置或设备，或者上述内容的任何合适组合。备选地，计算机可读存储介质可以是机器可读信号介质。机器可读存储介质的更具体示例会包括基于一个或多个线的电气连接、便携式计算机盘、硬盘、随机存取存储器(RAM)、只读存储器(ROM)、可擦除可编程只读存储器(EPROM或快闪存储器)、光纤、便捷式紧凑盘只读存储器(CD-ROM)、光学储存设备、磁储存设备、或上述内容的任何合适组合。In the context of the present invention, a computer readable storage medium may be a tangible medium that may contain or store a computer program for use by or in conjunction with an instruction execution system, apparatus or device. A computer readable storage medium may include, but is not limited to, electronic, magnetic, optical, electromagnetic, infrared, or semiconductor systems, apparatus, or devices, or any suitable combination of the foregoing. Alternatively, a computer readable storage medium may be a machine readable signal medium. More specific examples of machine-readable storage media would include one or more wire-based electrical connections, portable computer discs, hard drives, random access memory (RAM), read only memory (ROM), erasable programmable read only memory (EPROM or flash memory), optical fiber, compact disk read only memory (CD-ROM), optical storage, magnetic storage, or any suitable combination of the foregoing.

为了提供与用户的交互，可以在终端设备上实施此处描述的系统和技术，该终端设备具有：用于向用户显示信息的显示装置(例如，CRT(阴极射线管)或者LCD(液晶显示器)监视器)；以及键盘和指向装置(例如，鼠标或者轨迹球)，用户可以通过该键盘和该指向装置来将输入提供给终端设备。其它种类的装置还可以用于提供与用户的交互；例如，提供给用户的反馈可以是任何形式的传感反馈(例如，视觉反馈、听觉反馈、或者触觉反馈)；并且可以用任何形式(包括声输入、语音输入或者、触觉输入)来接收来自用户的输入。To provide for interaction with the user, the systems and techniques described herein can be implemented on a terminal device having a display device (e.g., a CRT (cathode ray tube) or LCD (liquid crystal display) display device for displaying information to the user. monitor); and a keyboard and a pointing device (for example, a mouse or a trackball), through which the user can provide input to the terminal device. Other kinds of devices can also be used to provide interaction with the user; for example, the feedback provided to the user can be any form of sensory feedback (e.g., visual feedback, auditory feedback, or tactile feedback); and can be in any form (including Acoustic input, speech input or, tactile input) to receive input from the user.

可以将此处描述的系统和技术实施在包括后台部件的计算系统(例如，作为数据服务器)、或者包括中间件部件的计算系统(例如，应用服务器)、或者包括前端部件的计算系统(例如，具有图形用户界面或者网络浏览器的用户计算机，用户可以通过该图形用户界面或者该网络浏览器来与此处描述的系统和技术的实施方式交互)、或者包括这种后台部件、中间件部件、或者前端部件的任何组合的计算系统中。可以通过任何形式或者介质的数字数据通信(例如，通信网络)来将系统的部件相互连接。通信网络的示例包括：局域网(LAN)、广域网(WAN)、区块链网络和互联网。The systems and techniques described herein can be implemented in a computing system that includes back-end components (e.g., as a data server), or a computing system that includes middleware components (e.g., an application server), or a computing system that includes front-end components (e.g., as a a user computer having a graphical user interface or web browser through which a user can interact with embodiments of the systems and techniques described herein), or including such backend components, middleware components, Or any combination of front-end components in a computing system. The components of the system can be interconnected by any form or medium of digital data communication, eg, a communication network. Examples of communication networks include: local area networks (LANs), wide area networks (WANs), blockchain networks, and the Internet.

计算系统可以包括客户端和服务器。客户端和服务器一般远离彼此并且通常通过通信网络进行交互。通过在相应的计算机上运行并且彼此具有客户端-服务器关系的计算机程序来产生客户端和服务器的关系。服务器可以是云服务器，又称为云计算服务器或云主机，是云计算服务体系中的一项主机产品，以解决了传统物理主机与VPS服务中，存在的管理难度大，业务扩展性弱的缺陷。A computing system can include clients and servers. Clients and servers are generally remote from each other and typically interact through a communication network. The relationship of client and server arises by computer programs running on the respective computers and having a client-server relationship to each other. The server can be a cloud server, also known as a cloud computing server or a cloud host. It is a host product in the cloud computing service system to solve the problems of difficult management and weak business expansion in traditional physical hosts and VPS services. defect.

应该理解，可以使用上面所示的各种形式的流程，重新排序、增加或删除步骤。例如，本发明中记载的各步骤可以并行地执行也可以顺序地执行也可以不同的次序执行，只要能够实现本发明的技术方案所期望的结果，本文在此不进行限制。It should be understood that steps may be reordered, added or deleted using the various forms of flow shown above. For example, each step described in the present invention may be executed in parallel, sequentially, or in a different order, as long as the desired result of the technical solution of the present invention can be achieved, there is no limitation herein.

上述具体实施方式，并不构成对本发明保护范围的限制。本领域技术人员应该明白的是，根据设计要求和其他因素，可以进行各种修改、组合、子组合和替代。任何在本发明的精神和原则之内所作的修改、等同替换和改进等，均应包含在本发明保护范围之内。The above specific implementation methods do not constitute a limitation to the protection scope of the present invention. It should be apparent to those skilled in the art that various modifications, combinations, sub-combinations and substitutions may be made depending on design requirements and other factors. Any modifications, equivalent replacements and improvements made within the spirit and principles of the present invention shall be included within the protection scope of the present invention.

Claims

1. A training method of a cross-city federal migration model is applied to a source city client and comprises the following steps:

in response to the acquisition of a training index data set, determining prediction index data and prediction city labels based on an initial cross-city federal migration model and the training index data in the training index data set;

adjusting model parameters of the initial cross-city federal migration model based on standard index data corresponding to the prediction index data and standard city labels corresponding to the prediction city labels to obtain a trained target cross-city federal migration model;

and sending the model parameters of the target cross-city federal migration model to a target city client, so that the target city client builds a standard cross-city federal migration model based on the received model parameters, and determines a target city label based on the standard cross-city federal migration model.

2. The method of claim 1, wherein determining predictive index data and predictive city labels based on an initial cross-city federated migration model and training index data in the training index dataset comprises:

respectively inputting the training index data in the training index data set into an initial data generation module and an initial label generation module in an initial cross-city federal migration model;

outputting, by the initial data generation module, prediction index data based on the input training index data;

and outputting a predicted city label based on the input training index data and the predicted index data output by the initial data generation module through the initial label generation module.

3. The method of claim 2, wherein the initial data generation module comprises a feature extractor, a data regressor and a domain classifier, and wherein outputting prediction index data based on the input training index data comprises:

inputting the training index data into the feature extractor to obtain an output training index feature;

outputting, by the data regressor, predictive index data based on the training index features output by the feature extractor, and outputting, by the domain classifier, a predictive city classification result based on the training index features output by the feature extractor.

4. The method according to claim 3, wherein the training index dataset includes training index data corresponding to a current source city client, and accordingly, the adjusting the model parameters of the initial cross-city federation migration model based on standard index data corresponding to the prediction index data and standard city labels corresponding to the prediction city labels to obtain a trained target cross-city federation migration model includes:

determining a target loss function based on standard index data corresponding to the prediction index data and a standard city label corresponding to the prediction city label for each iteration process of the initial cross-city federal migration model;

determining current model parameters corresponding to an initial cross-city federal migration model of current iteration based on the target loss function, and sending the current model parameters to a central server, so that the central server determines aggregation model parameters based on the received current model parameters respectively sent by at least two source city clients, and sends the aggregation model parameters to each source city client; wherein the at least two source city clients comprise the current source city client;

and taking the received aggregation model parameters sent by the central server as model parameters of the initial cross-city federal migration model of the current iteration until the target loss function is converged, and taking the initial cross-city federal migration model of the current iteration as a trained target cross-city federal migration model.

5. The method of claim 4, wherein sending model parameters of the target cross-city federated migration model to a target city client comprises:

sending the model parameters of the target cross-city federal migration model to the central server, so that the central server determines target model parameters based on the received model parameters of the target cross-city federal migration model, which are sent by the at least two source city clients respectively, and sends the target model parameters to the target city clients.

6. The method according to claim 3, wherein the training index dataset includes training index data corresponding to at least two source city clients, and accordingly, the adjusting the model parameters of the initial cross-city federal migration model based on the standard index data corresponding to the prediction index data and the standard city labels corresponding to the prediction city labels to obtain a trained target cross-city federal migration model includes:

and determining current model parameters corresponding to the initial cross-city federal migration model of the current iteration based on the target loss function, and taking the initial cross-city federal migration model of the current iteration as a trained target cross-city federal migration model until the target loss function is converged.

7. The method according to claim 4 or 6, wherein the objective loss function comprises a first objective loss function corresponding to the initial data generation module and a second objective loss function corresponding to the initial label generation module, and wherein determining the objective loss function based on the standard index data corresponding to the prediction index data and the standard city label corresponding to the prediction city label comprises:

constructing a first loss function based on the prediction index data and the standard index data, and constructing a second loss function based on the prediction city classification result and the standard city classification result;

determining a first target loss function corresponding to the initial data generation module based on the first loss function and the second loss function;

and constructing a second target loss function corresponding to the initial label generation module based on the predicted city label and the standard city label.

8. The method according to claim 7, wherein the initial data generation module further comprises a gradient inversion layer disposed between the feature extractor and the domain classifier, the gradient inversion layer is configured to multiply a function gradient of the second loss function input into the gradient inversion layer by a preset negative number during back propagation of the initial data generation module, and accordingly, a coefficient of the second loss function in the first target loss function is the preset negative number.

9. The method of claim 1, wherein the training index data comprises point of interest area data, road network data and population data, wherein the point of interest area data comprises at least one of a number of points of interest, a total number of points of interest and a point of interest entropy value corresponding to at least one type of point of interest, wherein the road network data comprises a number of roads corresponding to at least one type of roads, wherein the population data comprises a number of working populations and/or a number of residential populations, and wherein the prediction index data is a number of consuming populations and wherein the prediction city label is a category of business popularity.

10. A training method of a cross-city federal migration model is applied to a target city client, and comprises the following steps:

receiving model parameters of a trained target cross-city federal migration model sent by a source city client, and constructing a standard cross-city federal migration model based on the model parameters;

inputting the test index data into the standard trans-city federal migration model to obtain an output target city label;

the target cross-city federal migration model is obtained by the source city client determining prediction index data and prediction city labels based on an initial cross-city federal migration model and training index data in a training index data set, and executing training operation on the initial cross-city federal migration model based on standard index data corresponding to the prediction index data and standard city labels corresponding to the prediction city labels.

11. The method according to claim 10, wherein the model parameters include data model parameters corresponding to a target data generation module and tag model parameters corresponding to a target tag generation module in the target cross-city federal migration model, and accordingly, the constructing a standard cross-city federal migration model based on the model parameters includes:

constructing a standard data generation module in a standard cross-city federal migration model based on the data model parameters, and inputting reference index data into the standard data generation module to obtain output missing index data;

and constructing a reference label generation module based on the label model parameters, and training the reference label generation module based on the reference index data and the missing index data to obtain a standard label generation module in a standard trans-city federal migration model.

12. A training device of a cross-city federal migration model is applied to a source city client, and comprises:

the prediction city label determining module is used for responding to the obtained training index data set, and determining prediction index data and prediction city labels based on an initial trans-city federal migration model and the training index data in the training index data set;

a target cross-city federal migration model determination module, configured to adjust model parameters of the initial cross-city federal migration model based on standard index data corresponding to the prediction index data and standard city labels corresponding to the prediction city labels, so as to obtain a trained target cross-city federal migration model;

and the model parameter sending module is used for sending the model parameters of the target cross-city federal migration model to the target city client, so that the target city client builds a standard cross-city federal migration model based on the received model parameters, and determines a target city label based on the standard cross-city federal migration model.

13. The utility model provides a training device of cross city federal migration model which is characterized in that, is applied to target city client, includes:

the standard trans-city federal migration model building module is used for receiving model parameters of a trained target trans-city federal migration model sent by a source city client, and building a standard trans-city federal migration model based on the model parameters;

the target city label determining module is used for inputting the test index data into the standard trans-city federal migration model to obtain an output target city label;

14. A training system for a cross-city federal migration model, comprising: a source city client and a target city client;

the source city client is used for responding to the obtained training index data set, determining prediction index data and prediction city labels based on an initial cross-city federal migration model and training index data in the training index data set, adjusting model parameters of the initial cross-city federal migration model based on standard index data corresponding to the prediction index data and standard city labels corresponding to the prediction city labels to obtain a trained target cross-city federal migration model, and sending the model parameters of the target cross-city federal migration model to a target city client;

the target city client is used for constructing a standard cross-city federal migration model based on the received model parameters and determining a target city label based on the standard cross-city federal migration model.

15. The system according to claim 14, wherein the number of the source city clients is at least two, and accordingly, the system further comprises a central server, and the central server is configured to determine an aggregation model parameter based on the received current model parameters respectively sent by the at least two source city clients, and send the aggregation model parameter to each of the source city clients;

the source city client is specifically configured to: aiming at each iteration process of the initial cross-city federal migration model, determining a target loss function based on standard index data corresponding to the prediction index data and a standard city label corresponding to the prediction city label, determining current model parameters corresponding to the initial cross-city federal migration model of the current iteration based on the target loss function, and sending the current model parameters to a central server;

and taking the received aggregation model parameters as model parameters of the initial cross-city federal migration model of the current iteration until the target loss function is converged, and taking the initial cross-city federal migration model of the current iteration as a trained target cross-city federal migration model.

16. A terminal device, characterized in that the terminal device comprises:

at least one processor; and

a memory communicatively coupled to the at least one processor; wherein,

the memory stores a computer program executable by the at least one processor to enable the at least one processor to perform the method of training a cross-city federal migration model as defined in any of claims 1-9 or the method of training a cross-city federal migration model as defined in any of claims 10-11.

17. A computer-readable storage medium storing computer instructions for causing a processor to implement the method for training across-city federal migration model of any of claims 1-9 or the method for training across-city federal migration model of any of claims 10-11 when executed.