WO2022043741A1 - Network training method and apparatus, person re-identification method and apparatus, storage medium, and computer program - Google Patents

Network training method and apparatus, person re-identification method and apparatus, storage medium, and computer program

Info

Publication number
WO2022043741A1
WO2022043741A1 (PCT/IB2020/060047)
Authority
WO
WIPO (PCT)
Prior art keywords
network
pedestrian
identification
identification network
person
Prior art date
Application number
PCT/IB2020/060047
Other languages
French (fr)
Chinese (zh)
Inventor
庄伟铭
张学森
张帅
伊帅
Original Assignee
商汤国际私人有限公司 (SenseTime International Pte. Ltd.)
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 商汤国际私人有限公司 (SenseTime International Pte. Ltd.)
Publication of WO2022043741A1 publication Critical patent/WO2022043741A1/en


Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V40/00Recognition of biometric, human-related or animal-related patterns in image or video data
    • G06V40/10Human or animal bodies, e.g. vehicle occupants or pedestrians; Body parts, e.g. hands
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/21Design or setup of recognition systems or techniques; Extraction of features in feature space; Blind source separation
    • G06F18/214Generating training patterns; Bootstrap methods, e.g. bagging or boosting
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F21/00Security arrangements for protecting computers, components thereof, programs or data against unauthorised activity
    • G06F21/60Protecting data
    • G06F21/62Protecting access to data via a platform, e.g. using keys or access control rules
    • G06F21/6218Protecting access to data via a platform, e.g. using keys or access control rules to a system of files or objects, e.g. local or distributed file system or database
    • G06F21/6245Protecting personal data, e.g. for financial or medical purposes

Definitions

  • Pedestrian re-identification, also known as person re-identification, is a technology that uses computer vision to determine whether a specific pedestrian appears in an image or a video sequence.
  • Pedestrian re-identification technology has been widely applied in many fields and industries, such as intelligent video surveillance and intelligent security.
  • SUMMARY Embodiments of the present disclosure propose a network training method and apparatus, a pedestrian re-identification method and apparatus, a storage medium, and a computer program. According to an aspect of the embodiments of the present disclosure, a network training method is provided.
  • The method is applied to a cloud server that includes a first pedestrian re-identification network. The method includes: sending first network parameters corresponding to the first pedestrian re-identification network to multiple edge servers; receiving second network parameters returned by the multiple edge servers, wherein, for any of the edge servers, the edge server includes a second pedestrian re-identification network, an identity classification network and a local image dataset, the second pedestrian re-identification network and the first pedestrian re-identification network have the same network structure, and the second network parameters are obtained after the edge server trains the second pedestrian re-identification network according to the local image dataset, the identity classification network and the first network parameters; and updating, according to the second network parameters returned by the multiple edge servers, the first pedestrian re-identification network to obtain the updated first pedestrian re-identification network.
  • Updating the first pedestrian re-identification network according to the second network parameters returned by the multiple edge servers to obtain the updated first pedestrian re-identification network includes: receiving weights corresponding to the second network parameters returned by the multiple edge servers, wherein, for any of the edge servers, the weight corresponding to its second network parameters is determined by the edge server according to the second pedestrian re-identification network before training and the second pedestrian re-identification network after training; performing, according to those weights, a weighted average on the second network parameters returned by the multiple edge servers to obtain updated first network parameters; and updating the first pedestrian re-identification network according to the updated first network parameters to obtain the updated first pedestrian re-identification network.
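  • The weighted-average update described above can be sketched as follows. Parameters are shown as plain floats for brevity (in practice they would be tensors), and the function name is illustrative, not from the disclosure:

```python
def aggregate_params(edge_params, edge_weights):
    """Weighted average of the second network parameters returned by the
    edge servers, producing the updated first network parameters.

    edge_params: one parameter dict per edge server; all dicts share the
    same keys because every second pedestrian re-identification network
    has the same structure as the first.
    edge_weights: the weight each edge server reported for its parameters.
    """
    total = sum(edge_weights)
    return {
        key: sum((w / total) * params[key]
                 for params, w in zip(edge_params, edge_weights))
        for key in edge_params[0]
    }
```

  • For example, with weights (1, 3), a parameter valued 0.0 on one edge server and 4.0 on another averages to 3.0, so the better-weighted server dominates the update.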
  • The method further includes: sending a shared image dataset to the multiple edge servers; receiving pseudo-labels returned by the multiple edge servers, wherein, for any of the edge servers, the pseudo-label is generated by the edge server according to the shared image dataset and the trained second pedestrian re-identification network; and training the updated first pedestrian re-identification network according to the shared image dataset and the pseudo-labels returned by the multiple edge servers to obtain the trained first pedestrian re-identification network.
  • Training the updated first pedestrian re-identification network according to the shared image dataset and the pseudo-labels returned by the multiple edge servers includes: determining an average pseudo-label according to the pseudo-labels returned by the multiple edge servers; and training the updated first pedestrian re-identification network according to the shared image dataset and the average pseudo-label to obtain the trained first pedestrian re-identification network.
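  • Determining the average pseudo-label can be sketched as below; soft labels are represented as lists of class probabilities, and all names are illustrative:

```python
def average_pseudo_label(edge_pseudo_labels):
    """Element-wise mean of the soft pseudo-labels that the edge servers
    returned for one shared image. Each pseudo-label is a list of class
    probabilities produced by one trained second (teacher) network."""
    n = len(edge_pseudo_labels)
    dims = len(edge_pseudo_labels[0])
    return [sum(label[d] for label in edge_pseudo_labels) / n
            for d in range(dims)]
```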
  • According to another aspect, a network training method is provided. The method is applied to an edge server that includes a second pedestrian re-identification network, an identity classification network and a local image dataset. The method includes: receiving first network parameters corresponding to a first pedestrian re-identification network sent by a cloud server, wherein the first pedestrian re-identification network and the second pedestrian re-identification network have the same network structure; training the second pedestrian re-identification network according to the local image dataset, the identity classification network and the first network parameters to obtain the trained second pedestrian re-identification network, wherein the trained second pedestrian re-identification network corresponds to second network parameters; and sending the second network parameters to the cloud server.
  • Training the second pedestrian re-identification network according to the local image dataset, the identity classification network and the first network parameters includes: training the second pedestrian re-identification network and the identity classification network according to the local image dataset and the first network parameters to obtain the trained second pedestrian re-identification network and the trained identity classification network.
  • The method further includes: storing the trained identity classification network in the edge server.
  • the local image data set includes image data corresponding to multiple identities; and the dimension of the identity classification network is related to the number of the multiple identities.
  • The method further includes: receiving a shared image dataset sent by the cloud server; generating a pseudo-label according to the shared image dataset and the trained second pedestrian re-identification network; and sending the pseudo-label to the cloud server.
  • The method further includes: determining a first feature vector according to the second pedestrian re-identification network before training and the local image dataset; determining a second feature vector according to the second pedestrian re-identification network after training and the local image dataset; determining the cosine distance between the first feature vector and the second feature vector; determining, according to the cosine distance, the weight corresponding to the second network parameters; and sending the weight corresponding to the second network parameters to the cloud server.
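  • A minimal sketch of this feature-vector comparison on the edge server follows. The disclosure only states that the weight is determined from the cosine distance between the pre-training and post-training feature vectors, so the final mapping from distance to weight is an assumption:

```python
import math

def cosine_distance(u, v):
    """Cosine distance between the feature vector from the second network
    before training (u) and after training (v)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return 1.0 - dot / (norm_u * norm_v)

def weight_from_distance(distance):
    # Assumed mapping: a larger change in features (larger distance) is
    # read here as a larger training effect and hence a larger weight.
    return distance
```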
  • the edge server is an image acquisition device; and the local image data set is acquired according to the image acquisition device.
  • The edge server is connected to at least one image acquisition device, and the edge server and the at least one image acquisition device are located in the same geographical area; the local image dataset is obtained by the edge server from the at least one image acquisition device.
  • According to another aspect, a pedestrian re-identification method is provided, including: performing pedestrian re-identification processing on at least one frame of images to be identified obtained within a target geographical area through a target pedestrian re-identification network, and determining the pedestrian re-identification result; wherein the target pedestrian re-identification network is trained by the above network training method.
  • The target pedestrian re-identification network is the updated first pedestrian re-identification network or the trained first pedestrian re-identification network.
  • Alternatively, the target pedestrian re-identification network is the trained second pedestrian re-identification network.
  • According to another aspect, a network training apparatus is provided. The apparatus is applied to a cloud server that includes a first pedestrian re-identification network. The apparatus includes: a sending part configured to send first network parameters corresponding to the first pedestrian re-identification network to multiple edge servers; a receiving part configured to receive second network parameters returned by the multiple edge servers, wherein, for any of the edge servers, the edge server includes a second pedestrian re-identification network, an identity classification network and a local image dataset, the second pedestrian re-identification network and the first pedestrian re-identification network have the same network structure, and the second network parameters are obtained after the edge server trains the second pedestrian re-identification network according to the local image dataset, the identity classification network and the first network parameters; and an updating part configured to update the first pedestrian re-identification network according to the second network parameters returned by the multiple edge servers to obtain the updated first pedestrian re-identification network.
  • According to another aspect, a network training apparatus is provided. The apparatus is applied to an edge server that includes a second pedestrian re-identification network, an identity classification network and a local image dataset. The apparatus includes: a receiving part configured to receive first network parameters corresponding to a first pedestrian re-identification network sent by a cloud server, wherein the first pedestrian re-identification network and the second pedestrian re-identification network have the same network structure; a network training part configured to train the second pedestrian re-identification network according to the local image dataset, the identity classification network and the first network parameters to obtain the trained second pedestrian re-identification network, wherein the trained second pedestrian re-identification network corresponds to second network parameters; and a sending part configured to send the second network parameters to the cloud server.
  • According to another aspect, a pedestrian re-identification apparatus is provided, including: a pedestrian re-identification part configured to perform pedestrian re-identification processing on at least one frame of images to be identified obtained within a target geographical area through a target pedestrian re-identification network, and determine the pedestrian re-identification result; wherein the target pedestrian re-identification network is trained by the above network training method.
  • an electronic device comprising: a processor; a memory configured to store instructions executable by the processor; wherein the processor is configured to invoke the instructions stored in the memory, to perform the above method.
  • a computer-readable storage medium having computer program instructions stored thereon, the computer program instructions implementing the foregoing method when executed by a processor.
  • According to another aspect, a computer program is provided, including computer-readable code; when the computer-readable code runs in an electronic device, a processor in the electronic device executes the code to implement the foregoing method.
  • In the embodiments of the present disclosure, the cloud server including the first pedestrian re-identification network sends the first network parameters corresponding to the first pedestrian re-identification network to multiple edge servers and receives the second network parameters returned by the multiple edge servers, wherein, for any edge server, the edge server includes a second pedestrian re-identification network having the same network structure as the first pedestrian re-identification network, an identity classification network, and a local image dataset, and the second network parameters are obtained after the edge server trains the second pedestrian re-identification network according to the local image dataset, the identity classification network and the first network parameters.
  • The cloud server then updates the first pedestrian re-identification network according to the second network parameters returned by the multiple edge servers to obtain the updated first pedestrian re-identification network.
  • In this way, the cloud server combines multiple edge servers to train the pedestrian re-identification network.
  • FIG. 1 shows a flowchart of a network training method according to an embodiment of the present disclosure;
  • FIG. 2 shows a structural diagram of exemplary network training provided according to an embodiment of the present disclosure;
  • FIG. 3 shows a flowchart of a network training method according to an embodiment of the present disclosure;
  • FIG. 4 shows an exemplary schematic diagram of determining the weight of second network parameters provided according to an embodiment of the present disclosure;
  • FIG. 5 shows an exemplary schematic diagram provided according to an embodiment of the present disclosure;
  • FIG. 6 shows an exemplary network structure diagram of a cloud server, edge servers and terminal devices provided according to an embodiment of the present disclosure;
  • FIG. 7 shows a structural diagram of exemplary network training provided according to an embodiment of the present disclosure;
  • FIG. 8 shows a block diagram of a network training apparatus according to an embodiment of the present disclosure;
  • FIG. 9 shows a block diagram of a network training apparatus according to an embodiment of the present disclosure;
  • FIG. 11 shows a block diagram of an electronic device according to an embodiment of the present disclosure.
  • "Exemplary" is used herein to mean "serving as an example, embodiment, or illustration." Any embodiment described herein as "exemplary" is not necessarily to be construed as preferred or advantageous over other embodiments.
  • The term "and/or" herein merely describes an association relationship between associated objects and indicates that three relationships may exist; for example, A and/or B may mean: A exists alone, A and B exist simultaneously, or B exists alone.
  • "At least one" herein refers to any one of multiple items or any combination of at least two of multiple items; for example, including at least one of A, B and C may mean including any one or more elements selected from the set consisting of A, B and C.
  • FIG. 1 shows a flowchart of a network training method according to an embodiment of the present disclosure.
  • the network training method can be executed by a cloud server, and the cloud server includes a first pedestrian re-identification network.
  • The network training method may be implemented by the cloud server invoking computer-readable instructions stored in a memory.
  • As shown in FIG. 1, the method may include: in step S11, sending first network parameters corresponding to the first pedestrian re-identification network to multiple edge servers.
  • Second network parameters returned by the multiple edge servers are received, wherein, for any edge server, the edge server includes a second pedestrian re-identification network, an identity classification network and a local image dataset, the second pedestrian re-identification network and the first pedestrian re-identification network have the same network structure, and the second network parameters are obtained after the edge server trains the second pedestrian re-identification network according to the local image dataset, the identity classification network and the first network parameters.
  • the first pedestrian re-identification network is updated according to the second network parameters returned by the multiple edge servers to obtain an updated first pedestrian re-identification network.
  • In this way, the cloud server combines multiple edge servers to train the pedestrian re-identification network.
  • During training, each image dataset remains stored in its edge server and does not need to be uploaded to the cloud server, so that the pedestrian re-identification network can be effectively trained while protecting data privacy.
  • In addition, communication bandwidth can be effectively saved.
  • The cloud server can combine multiple edge servers to perform network training based on a federated learning algorithm.
  • For example, a pedestrian re-identification network is jointly trained by multiple communities, and each community is provided with an edge server.
  • Each community's image dataset (collected by image acquisition devices set up in or near the community) remains stored in the community (on the local edge server) and does not need to be uploaded to other communities (other edge servers), thus protecting data privacy.
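  • One communication round of the federated scheme described above might look like the following sketch, where each edge server is represented by a callable that trains locally and returns its parameters and weight (all names are illustrative, not from the disclosure):

```python
def federated_round(first_params, edge_train_fns):
    """One round: the cloud sends the first network parameters to every
    edge server, each edge server trains its second network locally and
    returns (second_params, weight), and the cloud takes the weighted
    average to update the first network parameters."""
    results = [train(first_params) for train in edge_train_fns]
    weights = [w for _, w in results]
    total = sum(weights)
    return {
        key: sum((w / total) * params[key] for params, w in results)
        for key in results[0][0]
    }
```

  • Note that the raw image data never appears in this exchange: only parameters and weights cross the network, which is what preserves data privacy.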
  • Different edge servers have different data sizes.
  • The data between edge servers is heterogeneous.
  • When the traditional federated learning algorithm uses multiple edge servers for network training, the weight of the second network parameters obtained by network training in each edge server is set according to the amount of data in that edge server.
  • If the cloud server updates the first pedestrian re-identification network using weights determined in this way, the accuracy of the updated first pedestrian re-identification network will be lower.
  • In the embodiments of the present disclosure, updating the first pedestrian re-identification network according to the second network parameters returned by multiple edge servers includes: receiving the weights corresponding to the second network parameters returned by the multiple edge servers, wherein, for any edge server, the weight corresponding to its second network parameters is determined by the edge server according to the second pedestrian re-identification network before training and the second pedestrian re-identification network after training.
  • A weighted average is performed on the second network parameters returned by the multiple edge servers to obtain updated first network parameters, and the first pedestrian re-identification network is updated according to the updated first network parameters to obtain the updated first pedestrian re-identification network.
  • The weight of the second network parameters sent by an edge server is determined by that edge server according to the second pedestrian re-identification network before and after training, that is, according to the training effect on the edge server. The cloud server therefore performs a weighted average of the second network parameters returned by the multiple edge servers according to these weights to obtain higher-precision updated first network parameters, and, after the first pedestrian re-identification network is updated according to the updated first network parameters, the accuracy of the updated first pedestrian re-identification network is effectively improved.
  • Since the local image datasets in different edge servers are collected in different scenarios (lighting, angle), the data between different edge servers is heterogeneous. As a result, the trained second pedestrian re-identification network obtained by training on the local image dataset, the identity classification network and the first network parameters may perform better than the updated first pedestrian re-identification network obtained by the cloud server training in conjunction with multiple edge servers.
  • Therefore, the trained second pedestrian re-identification network in each edge server can be used as a teacher network,
  • the updated first pedestrian re-identification network in the cloud server can be used as a student network,
  • and the teacher networks can be used to train the student network (that is, the trained second pedestrian re-identification networks are used to train the updated first pedestrian re-identification network) to improve the stability and convergence of the training process of the first pedestrian re-identification network.
  • The method further includes: sending a shared image dataset to multiple edge servers; receiving pseudo-labels returned by the multiple edge servers, wherein, for any edge server, the pseudo-label is generated by the edge server according to the shared image dataset and the trained second pedestrian re-identification network; and training the updated first pedestrian re-identification network according to the shared image dataset and the pseudo-labels returned by the multiple edge servers to obtain the trained first pedestrian re-identification network.
  • The cloud server receives the pseudo-label returned by each edge server. Since a pseudo-label is generated by an edge server according to the shared image dataset and the trained second pedestrian re-identification network, the pseudo-label can be used to represent the network characteristics of that trained second pedestrian re-identification network.
  • Therefore, training the updated first pedestrian re-identification network according to the shared image dataset and the pseudo-labels returned by multiple edge servers is equivalent to synthesizing the network characteristics of each edge server to train the updated first pedestrian re-identification network, so that the stability and convergence of the training process can be effectively improved.
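  • The teacher-to-student training step can be sketched as a soft-label cross-entropy on the shared images; the disclosure does not fix the exact loss, so this particular objective is an assumption:

```python
import math

def distillation_loss(student_probs, avg_pseudo_label):
    """Cross-entropy of the student (updated first network) prediction
    against the average pseudo-label produced by the teacher (trained
    second) networks on one shared image."""
    eps = 1e-12  # avoids log(0)
    return -sum(t * math.log(s + eps)
                for t, s in zip(avg_pseudo_label, student_probs))
```

  • The loss is near zero when the student already matches the averaged teacher label, and grows as the distributions diverge, which is what drives the student toward the synthesized network characteristics.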
  • The shared image dataset refers to an image dataset that both the cloud server and each edge server can use for network training.
  • FIG. 2 shows a structural diagram of exemplary network training provided according to an embodiment of the present disclosure.
  • As shown in FIG. 2, the trained second pedestrian re-identification networks in the multiple edge servers constitute teacher network 1, teacher network 2, ..., teacher network N, where N is the number of edge servers and N>1.
  • The updated first pedestrian re-identification network in the cloud server constitutes the student network.
  • Teacher network 1 uses the shared image dataset to generate pseudo-labels and sends them to the cloud server; teacher network 2 uses the shared image dataset to generate pseudo-labels and sends them to the cloud server; ...; teacher network N uses the shared image dataset to generate pseudo-labels and sends them to the cloud server. The second pedestrian re-identification network is trained to obtain a trained second pedestrian re-identification network corresponding to the second network parameters, and then the second network parameters are sent to the cloud server.
  • The edge server is connected to at least one image acquisition device, and the edge server and the at least one image acquisition device are located in the same geographical area; the local image dataset is obtained by the edge server from the at least one image acquisition device.
  • An edge server may be provided within the geographical area; in this case, the at least one image acquisition device does not need to have storage capability or computing power.
  • The edge server is connected to each image acquisition device and acquires images from each image acquisition device to construct the local image dataset.
  • The edge server receives the first network parameters corresponding to the first pedestrian re-identification network sent by the cloud server, trains the second pedestrian re-identification network according to the local image dataset and the first network parameters, obtains the trained second pedestrian re-identification network corresponding to the second network parameters, and then sends the second network parameters to the cloud server.
  • The second pedestrian re-identification network is trained according to the local image dataset, the identity classification network and the first network parameters to obtain the trained second pedestrian re-identification network, including: training the second pedestrian re-identification network and the identity classification network according to the local image dataset and the first network parameters to obtain the trained second pedestrian re-identification network and the trained identity classification network.
  • The local image dataset includes image data corresponding to multiple identities; the dimension of the identity classification network is related to the number of those identities.
  • The trained pedestrian re-identification network is a network for identifying images. Therefore, in the process of training the pedestrian re-identification network, a local image dataset including image data corresponding to multiple identities needs to be used together with the identity classification network, whose dimension is related to the number of identities included in the local image dataset. For example, if the local image dataset includes image data corresponding to 100 identities, the dimension of the identity classification network is 100; that is, the identity classification network includes 100 different identity classes.
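  • The dependency between the local dataset and the classifier dimension can be illustrated as follows (a toy stand-in for a real classification layer; the function name is illustrative):

```python
def identity_classifier_dim(local_identity_labels):
    """The identity classification network's output dimension equals the
    number of distinct identities in the edge server's local image
    dataset, e.g. 100 identities -> a 100-way classifier."""
    return len(set(local_identity_labels))
```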
  • The edge server constructs the local second pedestrian re-identification network and the identity classification network as a combined network, trains the combined network using the first network parameters received from the cloud server and the local image dataset, and obtains the trained combined network, wherein the trained combined network includes the trained second pedestrian re-identification network and the trained identity classification network, and the trained second pedestrian re-identification network corresponds to the second network parameters. Further, the edge server sends the second network parameters to the cloud server. Since the first pedestrian re-identification network and the second pedestrian re-identification network have the same network structure, the first pedestrian re-identification network can be updated using the second network parameters.
  • The method further includes: storing the trained identity classification network in the edge server. Since the first pedestrian re-identification network trained in the cloud server does not need to use the classifier network in the actual pedestrian re-identification process, in order to save communication bandwidth and ensure that the network structures of the cloud server and the edge servers remain consistent during joint training based on the federated learning algorithm, the edge server only sends the second network parameters corresponding to the trained second pedestrian re-identification network to the cloud server and stores the trained identity classification network locally on the edge server.
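  • Keeping the classifier local while uploading only the re-identification parameters can be sketched as a split of the combined network's parameter dict; the "classifier." key prefix is an assumed naming convention, not from the disclosure:

```python
def split_for_upload(combined_params, classifier_prefix="classifier."):
    """Partition the trained combined network's parameters: the second
    pedestrian re-identification network's parameters are sent to the
    cloud server, while the trained identity classification network's
    parameters stay on the edge server."""
    upload = {k: v for k, v in combined_params.items()
              if not k.startswith(classifier_prefix)}
    keep_local = {k: v for k, v in combined_params.items()
                  if k.startswith(classifier_prefix)}
    return upload, keep_local
```

  • Besides saving bandwidth, this split keeps the uploaded structure identical to the first pedestrian re-identification network, which is what makes the parameter swap on the cloud side valid.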
  • The method further includes: receiving the shared image dataset sent by the cloud server; generating a pseudo-label according to the shared image dataset and the trained second pedestrian re-identification network; and sending the pseudo-label to the cloud server.
  • Still taking FIG. 2 as an example,
  • the edge server receives the shared image dataset sent by the cloud server and uses the shared image dataset and the locally trained second pedestrian re-identification network to generate pseudo-labels.
  • The edge server then sends the pseudo-labels to the cloud server. Because a pseudo-label can represent the network characteristics of the trained second pedestrian re-identification network, the cloud server can perform network training on the updated first pedestrian re-identification network according to the pseudo-labels
  • to obtain the trained first pedestrian re-identification network, whose network performance is thereby closer to that of the trained second pedestrian re-identification networks in the edge servers.
  • the edge server A obtains images from the image acquisition device 1 and the image acquisition device 2 respectively to construct a local image dataset.
  • the side server B is connected to the terminal device 3, the terminal device 4 and the terminal device 5, and the terminal device 3, the terminal device 4 and the terminal device 5 are image acquisition devices (the image acquisition device 3, the image acquisition device 4 and the image acquisition device 5, for example , the image acquisition device is a camera), the edge server B, the image acquisition device 3, the image acquisition device 4, and the image acquisition device 5 are set in the same geographical area (for example, the same community, or the same company), and the edge server B is Image capture device 3, image capture device 4, and image capture device 5 acquire images to construct a local image dataset.
• the cloud server cooperates with the two edge servers (edge server A and edge server B) to train the pedestrian re-identification network. During training, each image data set remains stored locally on its edge server and is not uploaded to the cloud server.
• In a cloud-edge architecture, the cloud server communicates directly with the smart cameras, and the cloud coordinates multiple smart cameras to train simultaneously.
  • the smart camera caches pictures on the edge, and deletes and cleans them regularly to reduce the storage pressure on the edge server.
  • this architecture requires smart cameras to have certain computing power, storage and communication capabilities.
• the edge gateway (that is, the above-mentioned edge server) is connected to multiple smart cameras, and the cloud server is connected to multiple edge gateways. Pedestrian re-identification training images are transmitted from the smart cameras to the edge gateway and cached there, and the edge gateway performs federated learning training with the cloud server.
  • the data remains at the edge gateway, and data privacy can still be protected.
• A typical application scenario is multiple communities jointly training a pedestrian re-identification model: each community has an edge gateway connected to multiple smart cameras, and through federated learning the data remains within the community and is not transmitted to other communities or to the cloud server, thereby protecting data privacy.
• The multiple edge servers may also be mixed: some may be image acquisition devices (for example, smart cameras) that communicate directly with the cloud server, and some may be edge servers each connected to at least one image acquisition device; this is not specifically limited in the embodiments of the present disclosure.
• FIG. 7 shows a structural diagram of exemplary network training provided according to an embodiment of the present disclosure. As shown in FIG. 7, the cloud server can communicate with multiple edge servers (edge server 1, edge server 2, ..., edge server N), and the first person re-identification network included in the cloud server and the second person re-identification network included in each edge server have the same network structure.
  • Each edge server also includes a local image dataset and an identity classification network.
• The cloud server sends the first network parameters corresponding to the first pedestrian re-identification network to the multiple edge servers. After receiving the first network parameters, each edge server uses its local image data set and identity classification network to train the second pedestrian re-identification network, obtaining a trained second pedestrian re-identification network corresponding to the second network parameters, and a trained identity classification network.
  • each side server only sends the second network parameters corresponding to the trained second person re-identification network to the cloud server.
• the cloud server updates the first pedestrian re-identification network according to the second network parameters returned by the multiple edge servers, obtaining the updated first pedestrian re-identification network. It then sends the first network parameters corresponding to the updated first pedestrian re-identification network to the multiple edge servers for another round of training. Training ends when the recognition accuracy of the updated first pedestrian re-identification network in the cloud server reaches a threshold, or when the number of training rounds reaches a preset number.
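The cyclic training described above can be sketched as a simple federated loop. Everything here is a stand-in under stated assumptions: parameters are plain lists, a round of local "training" is simulated by nudging parameters toward a hypothetical local optimum, and the stopping rule is a fixed round budget rather than an accuracy threshold.

```python
# Minimal sketch of the cloud/edge training cycle (simulated training).
# Cloud sends first network parameters; each edge "trains" locally and
# returns second network parameters; the cloud averages them; repeat.

def local_train(params, local_optimum, lr=0.5):
    # Stand-in for training the second re-identification network locally.
    return [p + lr * (t - p) for p, t in zip(params, local_optimum)]

def average(param_lists):
    n = len(param_lists)
    return [sum(ps) / n for ps in zip(*param_lists)]

def federated_rounds(first_params, edge_optima, max_rounds=10):
    for _ in range(max_rounds):
        second_params = [local_train(first_params, opt) for opt in edge_optima]
        first_params = average(second_params)  # update the first network
    return first_params

cloud = federated_rounds([0.0, 0.0], [[1.0, 2.0], [3.0, 4.0]])
print(cloud)  # converges toward the mean of the local optima, [2.0, 3.0]
```

In the disclosed scheme the local data never leaves the edge server; only the parameter lists travel in each direction, which is what the loop above mimics.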
  • the general federated learning algorithm (Federated Averaging, FedAvg) requires that the multi-party models (person re-identification deep learning model, ie the above-mentioned person re-identification network) to be synchronized must be exactly the same.
• the classifier layer of the deep learning model for person re-identification depends on how many distinct pedestrians each party's data contains, so the classifier layers of the multi-party models participating in training may differ. As a result, the models of the parties participating in federated learning may be partially different, and the FedAvg algorithm is therefore not applicable in the above application scenarios.
• the federated learning algorithm is improved in the present disclosure: the models of the parties participating in federated learning are allowed to be partially different, so federated learning can be better applied to training for person re-identification.
• each edge server can use a weight determination method based on the training effect to determine the weight of the second network parameters corresponding to its trained second pedestrian re-identification network, so that when the cloud server combines the second network parameters returned by each edge server to update the first pedestrian re-identification network, it obtains an updated first pedestrian re-identification network with higher accuracy.
• The specific steps of the weight determination method based on the training effect are as described in the relevant parts of the foregoing embodiments and are not repeated here. Since the local image datasets in different edge servers are collected in different scenes (lighting, angle), the data across edge servers is heterogeneous; as a result, the trained second person re-identification network that each edge server obtains by training with its local image dataset and the first network parameters outperforms, on its own local scene, the updated first person re-identification network trained by the cloud server jointly with multiple edge servers.
• the knowledge distillation algorithm can be used to train the updated first person re-identification network in the cloud server, based on the trained second person re-identification networks in the edge servers and the shared image data set, thereby effectively improving the stability and convergence of the training process of the first person re-identification network.
• the specific training process based on the knowledge distillation algorithm is described in the relevant part of the above-mentioned embodiments, and details are not repeated here.
• the present disclosure proposes a knowledge distillation method: the local models of the multiple parties participating in federated learning serve as teacher models, the model of the cloud server serves as the student model, and knowledge distillation is used to better transfer the knowledge of the teacher models to the student model, thereby improving the stability and convergence of model training.
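The teacher/student transfer above can be sketched with a soft-label distillation loss. This is illustrative only: the temperature, the choice of cross-entropy against the averaged teacher distribution, and all names are assumptions, since the disclosure does not fix the loss.

```python
import math

# Sketch of the distillation idea: the edge models (teachers) produce class
# probabilities on a shared image; the cloud model (student) is trained to
# match their average distribution.

def softmax(logits, temperature=2.0):
    exps = [math.exp(l / temperature) for l in logits]
    s = sum(exps)
    return [e / s for e in exps]

def average_teacher_distribution(teacher_logits):
    dists = [softmax(l) for l in teacher_logits]
    n = len(dists)
    return [sum(ps) / n for ps in zip(*dists)]

def distillation_loss(student_logits, teacher_logits):
    """Cross-entropy between the averaged teacher distribution and the student."""
    target = average_teacher_distribution(teacher_logits)
    student = softmax(student_logits)
    return -sum(t * math.log(s) for t, s in zip(target, student))

teachers = [[2.0, 0.0, 0.0], [1.5, 0.5, 0.0]]  # two edge servers, 3 classes
loss_far = distillation_loss([0.0, 0.0, 2.0], teachers)
loss_near = distillation_loss([2.0, 0.3, 0.0], teachers)
print(loss_near < loss_far)  # True: agreeing with the teachers lowers the loss
```

Minimising this loss over the shared image data set is what pulls the student (cloud) model toward the teachers (edge models) without any local images leaving the edge.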
• The weight determination method based on the training effect and the network training based on the knowledge distillation algorithm can be used separately or in combination; the embodiments of the present disclosure impose no specific limitation on this.
• the pedestrian re-identification network can be jointly trained based on the network structure shown in FIG. 7: multiple companies or institutions communicate directly with the cloud server, and during training the data remains stored locally and does not need to be uploaded to the cloud server, so that an effective pedestrian re-identification network can be trained in the cloud server while protecting the data privacy of the multiple companies or institutions.
• For example, company A provides company B with a pedestrian re-identification network training service, and each image acquisition device in company B is, for example, a smart camera. The two companies can jointly train the pedestrian re-identification network based on the network structure shown in FIG. 7: company A acts as the cloud server, and the image acquisition devices in company B act as the multiple edge servers. The data remains stored locally in company B and does not need to be uploaded to company A, so that an effective pedestrian re-identification network can be trained while protecting the data privacy of company B.
  • Embodiments of the present disclosure also provide a pedestrian re-identification method.
• the pedestrian re-identification method can be executed by a terminal device or another processing device, wherein the terminal device can be an image acquisition device (for example, a smart camera), user equipment (UE), a mobile device, a user terminal, a terminal, a cellular phone, a cordless phone, a personal digital assistant (PDA), a handheld device, a computing device, an in-vehicle device, a wearable device, or the like.
  • Other processing devices may be servers or cloud servers, or the like.
  • the pedestrian re-identification method may be implemented by the processor calling computer-readable instructions stored in the memory.
• the method may include: performing, through a target pedestrian re-identification network, pedestrian re-identification processing on at least one frame of images to be identified obtained within a target geographical area, and determining a pedestrian re-identification result; wherein the target pedestrian re-identification network is trained using the network training method of the foregoing embodiments.
• the target pedestrian re-identification network may perform pedestrian re-identification processing on at least one frame of the to-be-identified image within the target geographic area, and determine whether a specific pedestrian is present in the at least one frame of the to-be-identified image.
• the target person re-identification network is the updated first person re-identification network or the trained first person re-identification network.
• the updated first person re-identification network or the trained first person re-identification network in the cloud server is universal, that is, it can be applied to any application scenario. Therefore, the updated first pedestrian re-identification network or the trained first pedestrian re-identification network in the cloud server can be used to perform pedestrian re-identification processing on at least one frame of the image to be recognized obtained within the target geographic area, so as to obtain the pedestrian re-identification result.
• In the case that an edge server is included within the target geographic area and the edge server includes a trained second pedestrian re-identification network, the target pedestrian re-identification network is the trained second pedestrian re-identification network.
• From the network training method for the cloud server and the edge servers, it can be seen that since the local image data sets in different edge servers are collected in different scenarios (lighting, angle), the data across edge servers is heterogeneous.
• The trained second person re-identification network obtained by each edge server from its local image data set is personalized and better suited to the local scene, so its performance on that scene is better than that of the updated first person re-identification network trained by the cloud server jointly with multiple edge servers.
• Therefore, the trained second pedestrian re-identification network, which is better suited to the local scene of the target geographical area, can be used to perform pedestrian re-identification processing on at least one frame of the image to be identified, so as to improve the accuracy of the processing result.
• In addition, further training iterations can be performed based on the data generated by the edge server, so that the model can be continuously updated at low cost.
  • FIG. 8 shows a block diagram of a network training apparatus according to an embodiment of the present disclosure.
• the network training device is applied to a cloud server, and the cloud server includes a first pedestrian re-identification network. As shown in FIG. 8,
• the apparatus 80 includes: a sending part 81, configured to send first network parameters corresponding to the first pedestrian re-identification network to a plurality of edge servers; a receiving part 82, configured to receive second network parameters returned by the plurality of edge servers, wherein, for any edge server, the edge server includes a second pedestrian re-identification network, an identity classification network, and a local image dataset, the second pedestrian re-identification network and the first pedestrian re-identification network have the same network structure, and the second network parameters are obtained by the edge server after training the second pedestrian re-identification network according to the local image data set, the identity classification network, and the first network parameters; and an updating part 83, configured to update the first pedestrian re-identification network according to the second network parameters returned by the multiple edge servers, to obtain the updated first pedestrian re-identification network.
• the updating part 83 includes: a receiving sub-part, configured to receive weights corresponding to the second network parameters returned by the multiple edge servers, wherein, for any edge server, the weight corresponding to its second network parameters is determined by the edge server according to the second pedestrian re-identification network before training and the second pedestrian re-identification network after training; a first update sub-part, configured to perform a weighted average of the second network parameters returned by the multiple edge servers according to the corresponding weights, to obtain updated first network parameters; and a second update sub-part, configured to update the first pedestrian re-identification network according to the updated first network parameters, to obtain the updated first pedestrian re-identification network.
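The weighted average performed by the update sub-part can be sketched as follows; parameters are flattened into plain lists here for clarity, whereas a real network would average each weight tensor separately.

```python
# Sketch of the weighted parameter average over edge servers' returned
# second network parameters (flat lists stand in for network weights).

def weighted_average(second_params, weights):
    """Weighted average of the edge servers' second network parameters."""
    total = sum(weights)
    norm = [w / total for w in weights]  # normalise the returned weights
    return [sum(w * p for w, p in zip(norm, column))
            for column in zip(*second_params)]

edge_params = [[1.0, 3.0], [3.0, 5.0]]  # parameters from two edge servers
weights = [1.0, 3.0]                    # e.g. derived from the training effect
updated = weighted_average(edge_params, weights)
print(updated)  # [2.5, 4.5]
```

The updated list plays the role of the updated first network parameters, which are then loaded into the first pedestrian re-identification network.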
• the sending part 81 is further configured to send the shared image data set to the multiple edge servers; the receiving part 82 is further configured to receive pseudo-labels returned by the multiple edge servers, wherein, for any edge server, the pseudo-labels are generated by the edge server according to the shared image data set and the trained second person re-identification network; and the apparatus 80 further includes: a network training part, configured to train the updated first pedestrian re-identification network according to the pseudo-labels returned by the multiple edge servers, to obtain the trained first pedestrian re-identification network.
• the network training part is further configured to: determine an average pseudo-label according to the pseudo-labels returned by the multiple edge servers; and train the updated first person re-identification network according to the average pseudo-label, to obtain the trained first person re-identification network.
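The averaging step can be sketched as below. One assumption to flag: pseudo-labels are treated here as per-image soft label vectors, which the disclosure does not mandate; hard labels would need to be one-hot encoded first.

```python
# Sketch: averaging per-image pseudo-labels returned by multiple edge
# servers (soft label vectors assumed; names are illustrative).

def average_pseudo_labels(per_edge_labels):
    n_edges = len(per_edge_labels)
    averaged = []
    for per_image in zip(*per_edge_labels):  # iterate over shared images
        averaged.append([sum(vals) / n_edges for vals in zip(*per_image)])
    return averaged

edge_a = [[1.0, 0.0], [0.0, 1.0]]  # labels for two images from edge server A
edge_b = [[0.8, 0.2], [0.4, 0.6]]  # labels for the same images from edge server B
avg = average_pseudo_labels([edge_a, edge_b])
print(avg)  # [[0.9, 0.1], [0.2, 0.8]]
```

The averaged labels then serve as the training target for the updated first person re-identification network on the shared image data set.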
  • FIG. 9 shows a block diagram of a network training apparatus according to an embodiment of the present disclosure.
• the network training device is applied to an edge server, and the edge server includes a second person re-identification network, an identity classification network, and a local image data set. As shown in FIG. 9,
• the apparatus 90 includes: a receiving part 91, configured to receive the first network parameters corresponding to the first pedestrian re-identification network sent by the cloud server, wherein the first pedestrian re-identification network and the second pedestrian re-identification network have the same network structure; a network training part 92, configured to train the second person re-identification network according to the local image data set, the identity classification network, and the first network parameters, to obtain a trained second person re-identification network, wherein the trained second person re-identification network corresponds to the second network parameters; and a sending part 93, configured to send the second network parameters to the cloud server.
• the network training part 92 is further configured to: train the second person re-identification network and the identity classification network according to the local image data set and the first network parameters, to obtain a trained second person re-identification network and a trained identity classification network; and the apparatus 90 further includes: a storage part, configured to store the trained identity classification network in the edge server.
  • the local image dataset includes image data corresponding to multiple identities; the dimension of the identity classification network is related to the number of multiple identities.
• the receiving part 91 is further configured to receive the shared image data set sent by the cloud server; the apparatus 90 further includes: a pseudo-label generating part, configured to generate pseudo-labels according to the shared image data set and the trained second person re-identification network; and the sending part 93 is further configured to send the pseudo-labels to the cloud server.
• the apparatus 90 further includes: a first determining part, configured to determine a first feature vector according to the second person re-identification network before training and the local image data set, and to determine a second feature vector according to the trained second person re-identification network and the local image data set; a second determining part, configured to determine a cosine distance between the first feature vector and the second feature vector; and a third determining part, configured to determine, according to the cosine distance, the weight corresponding to the second network parameters; the sending part 93 is further configured to send the weight corresponding to the second network parameters to the cloud server.
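The first/second feature-vector comparison can be sketched as follows. The mapping from cosine distance to weight is an assumption (here the weight simply equals the distance, so a larger training-induced change yields a larger weight); the disclosure leaves the exact mapping to the embodiments, and the toy "models" are hypothetical stand-ins.

```python
# Sketch of the training-effect weight: compare features of the same local
# images produced by the second network before and after training, then map
# the cosine distance to a weight (illustrative mapping: weight = distance).

def cosine_distance(a, b):
    dot = sum(x * y for x, y in zip(a, b))
    na = sum(x * x for x in a) ** 0.5
    nb = sum(x * x for x in b) ** 0.5
    return 1.0 - dot / (na * nb)

def weight_from_training_effect(model_before, model_after, local_images):
    # First/second feature vectors: features concatenated over the dataset.
    feat_before = [x for img in local_images for x in model_before(img)]
    feat_after = [x for img in local_images for x in model_after(img)]
    return cosine_distance(feat_before, feat_after)

# Toy models: "training" shifted the feature direction slightly.
before = lambda img: [img[0], img[1]]
after = lambda img: [img[0] + 0.5 * img[1], img[1]]
w = weight_from_training_effect(before, after, [[1.0, 0.0], [0.0, 1.0]])
print(0.0 < w < 1.0)  # True: some, but not total, change
```

The resulting `w` is what the sending part 93 would forward to the cloud server alongside the second network parameters.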
  • the side server is an image acquisition device; the local image data set is acquired according to the image acquisition device.
• the edge server is connected to at least one image acquisition device, and the edge server and the at least one image acquisition device are located in the same geographical area; the local image data set is obtained by the edge server from the at least one image acquisition device.
  • a pedestrian re-identification device including: a pedestrian re-identification part configured to perform pedestrian re-identification processing on at least one frame of an image to be identified obtained within a target geographic area through a target pedestrian re-identification network , and determine the pedestrian re-identification result; wherein, the target pedestrian re-identification network is trained by the above-mentioned network training method.
  • the target person re-identification network is an updated first person re-identification network or a trained first person re-identification network.
  • the target pedestrian re-identification network is the trained second pedestrian re-identification network.
• In some embodiments, the functions of, or the parts included in, the network training/pedestrian re-identification apparatus provided by the embodiments of the present disclosure may be configured to execute the methods described in the above method embodiments; for specific implementation, reference may be made to the descriptions of the above method embodiments, which are not repeated here for brevity.
• a "part" may be a part of a circuit, a part of a processor, a part of a program or software, and the like; it may also be a unit or a module, or take a non-modular form.
  • Embodiments of the present disclosure further provide a computer-readable storage medium, on which computer program instructions are stored, and when the computer program instructions are executed by a processor, the foregoing method is implemented.
  • the computer-readable storage medium may be a non-volatile computer-readable storage medium.
  • An embodiment of the present disclosure further provides an electronic device, including: a processor; a memory configured to store instructions executable by the processor; wherein the processor is configured to invoke the instructions stored in the memory to execute the above method.
• Embodiments of the present disclosure also provide a computer program product, including computer-readable code; when the computer-readable code runs on a device, a processor in the device executes instructions for implementing the network training/pedestrian re-identification method provided in any of the above embodiments.
  • Embodiments of the present disclosure further provide another computer program product configured to store computer-readable instructions, which, when executed, cause the computer to perform the operations of the network training/person re-identification method provided by any of the foregoing embodiments.
  • the electronic device may be provided as a terminal, server or other form of device.
• FIG. 10 shows a block diagram of an electronic device according to an embodiment of the present disclosure.
• the electronic device 800 may be an image capture device (for example, a smart camera), a mobile phone, a computer, a digital broadcasting terminal, a messaging device, a game console, a tablet device, a medical device, a fitness device, a personal digital assistant, or another such terminal. As shown in FIG. 10, the electronic device 800 may include one or more of the following components: a processing component 802, a memory 804, a power supply component 806, a multimedia component 808, an audio component 810, an input/output (I/O) interface 812, a sensor component 814, and a communication component 816.
  • the processing component 802 generally controls the overall operation of the electronic device 800, such as operations associated with display, phone calls, data communications, camera operations, and recording operations.
  • the processing component 802 may include one or more processors 820 to execute instructions to perform all or part of the steps of the methods described above. Additionally, processing component 802 may include one or more modules to facilitate interaction between processing component 802 and other components. For example, processing component 802 may include a multimedia module to facilitate interaction between multimedia component 808 and processing component 802. Memory 804 is configured to store various types of data to support operation at electronic device 800 . Examples of such data include instructions for any application or method operating on electronic device 800, contact data, phonebook data, messages, pictures, videos, and the like.
• Memory 804 may be implemented by any type of volatile or non-volatile storage device, or a combination thereof, such as static random access memory (SRAM), electrically erasable programmable read-only memory (EEPROM), erasable programmable read-only memory (EPROM), programmable read-only memory (PROM), read-only memory (ROM), magnetic memory, flash memory, a magnetic disk, or an optical disk.
  • Power supply assembly 806 provides power to various components of electronic device 800 .
  • Power supply components 806 may include a power management system, one or more power supplies, and other components associated with generating, managing, and distributing power to electronic device 800 .
  • Multimedia component 808 includes a screen that provides an output interface between the electronic device 800 and the user.
  • the screen may include a liquid crystal display (LCD) and a touch panel (TP). If the screen includes a touch panel, the screen may be implemented as a touch screen to receive input signals from a user.
  • the touch panel includes one or more touch sensors to sense touches, swipes, and gestures on the touch panel. The touch sensor may not only sense the boundaries of a touch or swipe action, but also detect the duration and pressure associated with the touch or swipe action.
  • the multimedia component 808 includes a front-facing camera and/or a rear-facing camera. When the electronic device 800 is in an operation mode, such as a shooting mode or a video mode, the front camera and/or the rear camera may receive external multimedia data.
  • Audio component 810 is configured to output and/or input audio signals.
  • the audio component 810 includes a microphone (MIC) that is configured to receive external audio signals when the electronic device 800 is in an operating mode, such as a calling mode, a recording mode, and a voice recognition mode.
  • the received audio signal may be further stored in memory 804 or transmitted via communication component 816 .
  • audio component 810 also includes a speaker for outputting audio signals.
  • the I/O interface 812 provides an interface between the processing component 802 and a peripheral interface module, and the above-mentioned peripheral interface module may be a keyboard, a click wheel, a button, and the like. These buttons may include, but are not limited to: home button, volume buttons, start button, and lock button.
  • Sensor assembly 814 includes one or more sensors for providing status assessments of various aspects of electronic device 800 .
• the sensor component 814 can detect the open/closed state of the electronic device 800 and the relative positioning of components (for example, the display and keypad of the electronic device 800); the sensor component 814 can also detect a change in position of the electronic device 800 or of a component of the electronic device 800, the presence or absence of user contact with the electronic device 800, the orientation or acceleration/deceleration of the electronic device 800, and a change in the temperature of the electronic device 800.
  • Sensor assembly 814 may include a proximity sensor configured to detect the presence of nearby objects in the absence of any physical contact.
  • Sensor assembly 814 may also include a light sensor, such as a complementary metal oxide semiconductor (CMOS) or charge coupled device (CCD) image sensor, for use in imaging applications.
  • the sensor component 814 may further include an acceleration sensor, a gyroscope sensor, a magnetic sensor, a pressure sensor or a temperature sensor.
  • Communication component 816 is configured to facilitate wired or wireless communication between electronic device 800 and other devices.
  • the electronic device 800 may access a wireless network based on a communication standard, such as a wireless network (WiFi), a second generation mobile communication technology (2G) or a third generation mobile communication technology (3G), or a combination thereof.
  • the communication component 816 receives broadcast signals or broadcast related information from an external broadcast management system via a broadcast channel.
  • the communication component 816 also includes a near field communication (NFC) module to facilitate short-range communication.
  • the NFC module may be implemented based on radio frequency identification (RFID) technology, infrared data association (IrDA) technology, ultra-wideband (UWB) technology, Bluetooth (BT) technology and other technologies.
• In exemplary embodiments, the electronic device 800 may be implemented by one or more application-specific integrated circuits (ASICs), digital signal processors (DSPs), digital signal processing devices (DSPDs), programmable logic devices (PLDs), field-programmable gate arrays (FPGAs), controllers, microcontrollers, microprocessors, or other electronic components, to perform the above method.
  • FIG. 11 shows a block diagram of an electronic device according to an embodiment of the present disclosure.
  • the electronic device 1900 may be provided as a server.
  • the electronic device 1900 includes a processing component 1922, which further includes one or more processors, and memory resources represented by memory 1932 for storing instructions executable by the processing component 1922, such as application programs.
  • An application program stored in memory 1932 may include one or more modules, each corresponding to a set of instructions.
  • the processing component 1922 is configured to execute instructions to perform the above-described methods.
  • the electronic device 1900 may also include a power supply assembly 1926 configured to perform power management of the electronic device 1900, a wired or wireless network interface 1950 configured to connect the electronic device 1900 to a network, and an input output (I/O) interface 1958 .
• Electronic device 1900 may operate based on an operating system stored in memory 1932, such as Microsoft's server operating system (Windows ServerTM), Apple's graphical-user-interface-based operating system (Mac OS XTM), a multi-user, multi-process computer operating system (UnixTM), a free and open-source Unix-like operating system (LinuxTM), an open-source Unix-like operating system (FreeBSDTM), or similar.
  • a non-volatile computer-readable storage medium such as a memory 1932 comprising computer program instructions executable by the processing component 1922 of the electronic device 1900 to accomplish the above method.
  • Embodiments of the present disclosure may be systems, methods and/or computer program products.
  • the computer program product may include a computer-readable storage medium having computer-readable program instructions loaded thereon for causing a processor to implement various aspects of the embodiments of the present disclosure.
  • a computer-readable storage medium may be a tangible device that can hold and store instructions for use by the instruction execution device.
  • the computer-readable storage medium may be, for example, but not limited to, an electrical storage device, a magnetic storage device, an optical storage device, an electromagnetic storage device, a semiconductor storage device, or any suitable combination of the above.
• A non-exhaustive list of computer-readable storage media includes: a portable computer disk, a hard disk, random access memory (RAM), read-only memory (ROM), erasable programmable read-only memory (EPROM or flash memory), static random access memory (SRAM), portable compact disk read-only memory (CD-ROM), digital versatile disk (DVD), a memory stick, a floppy disk, a mechanically encoded device such as a punch card or raised structures in grooves on which instructions are stored, and any suitable combination of the above.
  • Computer-readable storage media are not to be construed as transient signals per se, such as radio waves or other freely propagating electromagnetic waves, electromagnetic waves propagating through waveguides or other transmission media (e.g., light pulses through fiber optic cables), or electrical signals transmitted through electrical wires.
  • the computer-readable program instructions described herein can be downloaded to respective computing/processing devices from a computer-readable storage medium, or downloaded to an external computer or external storage device over a network such as the Internet, a local area network, a wide area network, and/or a wireless network.
  • the network may include copper transmission cables, fiber optic transmission, wireless transmission, routers, firewalls, switches, gateway computers and/or edge servers.
  • the network adapter card or network interface in each computing/processing device receives computer-readable program instructions from the network and forwards them for storage in the computer-readable storage medium in each computing/processing device.
  • Computer program instructions for performing the operations of embodiments of the present disclosure may be assembly instructions, instruction set architecture (ISA) instructions, machine instructions, machine-dependent instructions, microcode, firmware instructions, state-setting data, or source code or object code written in any combination of one or more programming languages, including object-oriented programming languages such as Smalltalk and C++, and conventional procedural programming languages such as the "C" language or similar programming languages.
  • the computer-readable program instructions may execute entirely on the user's computer, partly on the user's computer, as a stand-alone software package, partly on the user's computer and partly on a remote computer, or entirely on the remote computer or server.
  • the remote computer may be connected to the user's computer through any kind of network, including a local area network (LAN) or a wide area network (WAN), or may be connected to an external computer (e.g., through the Internet using an Internet service provider).
  • an electronic circuit, such as a programmable logic circuit, a field-programmable gate array (FPGA), or a programmable logic array (PLA), may be personalized by utilizing state information of the computer-readable program instructions, and the electronic circuit may execute the computer-readable program instructions to implement various aspects of the embodiments of the present disclosure. Aspects of embodiments of the present disclosure are described herein with reference to flowchart illustrations and/or block diagrams of methods, apparatus (systems), and computer program products according to embodiments of the disclosure. It will be understood that each block of the flowchart illustrations and/or block diagrams, and combinations of blocks in the flowchart illustrations and/or block diagrams, can be implemented by computer-readable program instructions.
  • These computer-readable program instructions can be provided to the processor of a general-purpose computer, a special-purpose computer, or another programmable data processing apparatus to produce a machine, so that the instructions, when executed by the processor of the computer or other programmable data processing apparatus, create means for implementing the functions/acts specified in one or more blocks of the flowcharts and/or block diagrams.
  • These computer-readable program instructions can also be stored in a computer-readable storage medium, and these instructions cause a computer, a programmable data processing apparatus, and/or other equipment to work in a specific manner, so that the computer-readable medium storing the instructions comprises an article of manufacture that includes instructions implementing various aspects of the functions/acts specified in one or more blocks of the flowcharts and/or block diagrams.
  • Computer-readable program instructions can also be loaded onto a computer, other programmable data processing apparatus, or other equipment, so that a series of operational steps are performed on the computer, other programmable apparatus, or other equipment to produce a computer-implemented process, thereby causing the instructions executed on the computer, other programmable apparatus, or other device to implement the functions/acts specified in one or more blocks of the flowcharts and/or block diagrams.
  • the flowchart and block diagrams in the Figures illustrate the architecture, functionality, and operation of possible implementations of systems, methods and computer program products according to various embodiments of the present disclosure.
  • each block in the flowchart or block diagrams may represent a module, program segment, or part of an instruction, which comprises one or more executable instructions for implementing the specified logical function.
  • the functions noted in the blocks may occur out of the order noted in the figures. For example, two blocks in succession may, in fact, be executed substantially concurrently, or the blocks may sometimes be executed in the reverse order, depending upon the functionality involved.
  • each block of the block diagrams and/or flowchart illustrations, and combinations of blocks in the block diagrams and/or flowchart illustrations, can be implemented by dedicated hardware-based systems that perform the specified functions or actions, or can be implemented using a combination of dedicated hardware and computer instructions.
  • the computer program product can be specifically implemented by hardware, software or a combination thereof.
  • the computer program product is embodied as a computer storage medium in one optional embodiment, and in another optional embodiment, the computer program product is embodied as a software product, such as a software development kit (SDK) or the like.
  • Various embodiments of the present disclosure have been described above; the above description is exemplary, not exhaustive, and is not limited to the disclosed embodiments. Numerous modifications and variations will be apparent to those of ordinary skill in the art without departing from the scope and spirit of the described embodiments. The terminology used herein is chosen to best explain the principles of the various embodiments, the practical application, or improvements over technologies in the market, or to enable others of ordinary skill in the art to understand the various embodiments disclosed herein.
  • the embodiments of the present disclosure relate to a network training method and apparatus, a person re-identification method and apparatus, a storage medium, and a computer program.
  • the cloud server includes a first person re-identification network, and the method includes: sending, to a plurality of edge servers, first network parameters corresponding to the first person re-identification network; and receiving second network parameters returned by the multiple edge servers, wherein, for any edge server, the edge server includes a second person re-identification network, an identity classification network, and a local image dataset, the second person re-identification network and the first person re-identification network have the same network structure, and the second network parameters are obtained after the edge server trains the second person re-identification network according to the local image dataset, the identity classification network, and the first network parameters.
  • the first person re-identification network is updated to obtain the updated first person re-identification network. Because the cloud server trains the person re-identification network jointly with multiple edge servers, the image datasets remain stored in the edge servers during training and do not need to be uploaded to the cloud server, so the person re-identification network can be trained effectively while data privacy is protected.
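The cloud-side update summarized above — merging the second network parameters returned by the edge servers into new first network parameters — can be sketched as a weighted average. This is an illustrative sketch only, not the patent's implementation: the function name, the flat-list parameter representation, and the normalization of the weights are assumptions.

```python
def aggregate_second_params(second_params, weights):
    """Weighted average of the second network parameters returned by the
    edge servers, producing updated first network parameters.

    second_params: one dict per edge server, mapping a parameter name to a
    flat list of floats; all servers share the same structure, since the
    first and second re-identification networks have identical shapes.
    weights: one non-negative weight per edge server.
    """
    total = float(sum(weights))
    if total <= 0:
        raise ValueError("at least one weight must be positive")
    norm = [w / total for w in weights]
    updated = {}
    for name in second_params[0]:
        length = len(second_params[0][name])
        updated[name] = [
            sum(n * p[name][i] for n, p in zip(norm, second_params))
            for i in range(length)
        ]
    return updated
```

For example, two edge servers returning {"w": [1.0, 2.0]} and {"w": [3.0, 4.0]} with weights 1 and 3 merge into {"w": [2.5, 3.5]}.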

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Data Mining & Analysis (AREA)
  • General Health & Medical Sciences (AREA)
  • Bioethics (AREA)
  • Health & Medical Sciences (AREA)
  • General Engineering & Computer Science (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Medical Informatics (AREA)
  • Evolutionary Biology (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Artificial Intelligence (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Evolutionary Computation (AREA)
  • Databases & Information Systems (AREA)
  • Computer Hardware Design (AREA)
  • Computer Security & Cryptography (AREA)
  • Software Systems (AREA)
  • Human Computer Interaction (AREA)
  • Multimedia (AREA)
  • Image Analysis (AREA)

Abstract

The embodiments of the present disclosure relate to a network training method and apparatus, a person re-identification method and apparatus, a storage medium, and a computer program. A cloud server comprises a first person re-identification network. The method comprises: sending, to a plurality of edge servers, a first network parameter corresponding to a first person re-identification network; receiving second network parameters returned by the plurality of edge servers, wherein each second network parameter is obtained by each edge server by means of training a second person re-identification network included in each edge server according to a local image data set, an identity classification network and the first network parameter; and updating the first person re-identification network according to the second network parameters returned by the plurality of edge servers, so as to obtain an updated first person re-identification network.
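One communication round of the protocol in the abstract — the cloud sends its parameters to the edge servers, each edge server trains locally on its private data and returns updated parameters, and the cloud merges them — can be sketched as below. The callable-based interface and the equal-weight merge are assumptions made for illustration; the disclosure's preferred variant instead weights each server by its training effect.

```python
def federated_round(first_params, edge_train_fns):
    """Run one cloud/edge round. Each element of edge_train_fns stands in
    for an edge server: it takes the current first network parameters and
    returns second network parameters after local training, so the private
    image dataset never leaves the edge server.
    """
    returned = [train(dict(first_params)) for train in edge_train_fns]
    n = len(returned)
    # Merge by plain averaging (equal weights) to obtain the updated
    # first network parameters.
    return {
        name: sum(params[name] for params in returned) / n
        for name in first_params
    }
```

In practice this round would be repeated until the cloud model's recognition accuracy reaches a threshold or a preset number of rounds is exhausted.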

Description

Network training and person re-identification method and apparatus, storage medium, and computer program

CROSS-REFERENCE TO RELATED APPLICATIONS: The present disclosure is filed on the basis of, and claims priority to, Chinese patent application No. 202010864291.4, filed on August 25, 2020, the entire content of which is incorporated herein by reference.

TECHNICAL FIELD: The present disclosure relates to the field of computer technology, and in particular to a network training method and apparatus, a person re-identification method and apparatus, a storage medium, and a computer program.

BACKGROUND: Person re-identification, also known as pedestrian re-identification, is a technology that uses computer vision to determine whether a specific person is present in an image or a video sequence. At present, person re-identification technology has been widely applied in many fields and industries, such as intelligent video detection and intelligent security. Because person re-identification involves private data such as faces, human bodies, and personal identities when processing images or sequences of video frames, a person re-identification method that can avoid the leakage of private data is urgently needed.

SUMMARY: Embodiments of the present disclosure propose technical solutions for a network training method and apparatus, a person re-identification method and apparatus, a storage medium, and a computer program.

According to an aspect of the embodiments of the present disclosure, a network training method is provided. The method is applied to a cloud server that includes a first person re-identification network, and the method includes: sending, to a plurality of edge servers, first network parameters corresponding to the first person re-identification network; receiving second network parameters returned by the multiple edge servers, where, for any edge server, the edge server includes a second person re-identification network, an identity classification network, and a local image dataset, the second person re-identification network and the first person re-identification network have the same network structure, and the second network parameters are obtained after the edge server trains the second person re-identification network according to the local image dataset, the identity classification network, and the first network parameters; and updating the first person re-identification network according to the second network parameters returned by the multiple edge servers, to obtain an updated first person re-identification network.

In a possible implementation manner, updating the first person re-identification network according to the second network parameters returned by the multiple edge servers includes: receiving weights corresponding to the second network parameters returned by the multiple edge servers, where, for any edge server, the weight corresponding to its second network parameters is determined by the edge server according to the second person re-identification network before training and the second person re-identification network after training; performing a weighted average on the second network parameters returned by the multiple edge servers according to the corresponding weights, to obtain updated first network parameters; and updating the first person re-identification network according to the updated first network parameters, to obtain the updated first person re-identification network.

In a possible implementation manner, the method further includes: sending a shared image dataset to the multiple edge servers; receiving pseudo-labels returned by the multiple edge servers, where, for any edge server, the pseudo-label is generated by the edge server according to the shared image dataset and the trained second person re-identification network; and training the updated first person re-identification network according to the shared image dataset and the pseudo-labels returned by the multiple edge servers, to obtain a trained first person re-identification network.

In a possible implementation manner, this training includes: determining an average pseudo-label according to the pseudo-labels returned by the multiple edge servers; and training the updated first person re-identification network according to the shared image dataset and the average pseudo-label, to obtain the trained first person re-identification network.

According to an aspect of the embodiments of the present disclosure, a network training method applied to an edge server is provided, where the edge server includes a second person re-identification network, an identity classification network, and a local image dataset. The method includes: receiving first network parameters corresponding to a first person re-identification network sent by a cloud server, where the first person re-identification network and the second person re-identification network have the same network structure; training the second person re-identification network according to the local image dataset, the identity classification network, and the first network parameters, to obtain a trained second person re-identification network, where the second person re-identification network corresponds to second network parameters; and sending the second network parameters to the cloud server.

In a possible implementation manner, the training includes: training the second person re-identification network and the identity classification network according to the local image dataset and the first network parameters, to obtain the trained second person re-identification network and a trained identity classification network.

In a possible implementation manner, the method further includes: storing the trained identity classification network in the edge server.

In a possible implementation manner, the local image dataset includes image data corresponding to multiple identities, and the dimension of the identity classification network is related to the number of the identities.

In a possible implementation manner, the method further includes: receiving a shared image dataset sent by the cloud server; generating a pseudo-label according to the shared image dataset and the trained second person re-identification network; and sending the pseudo-label to the cloud server.

In a possible implementation manner, the method further includes: determining a first feature vector according to the second person re-identification network before training and the local image dataset, and determining a second feature vector according to the second person re-identification network after training and the local image dataset; determining the cosine distance between the first feature vector and the second feature vector; determining, according to the cosine distance, the weight corresponding to the second network parameters; and sending the weight corresponding to the second network parameters to the cloud server.

In a possible implementation manner, the edge server is an image acquisition device, and the local image dataset is acquired by the image acquisition device.

In a possible implementation manner, the edge server is connected to at least one image acquisition device, and the edge server and the at least one image acquisition device are located in the same geographic area; the local image dataset is obtained by the edge server from the at least one image acquisition device.

According to an aspect of the embodiments of the present disclosure, a person re-identification method is provided, including: performing person re-identification processing, through a target person re-identification network, on at least one frame of a to-be-identified image acquired within a target geographic area, to determine a person re-identification result, where the target person re-identification network is trained with the network training method described above.

In a possible implementation manner, the target person re-identification network is the updated first person re-identification network or the trained first person re-identification network.

In a possible implementation manner, in a case where an edge server is included within the target geographic area and the edge server includes a trained second person re-identification network, the target person re-identification network is the trained second person re-identification network.

According to an aspect of the embodiments of the present disclosure, a network training apparatus applied to a cloud server including a first person re-identification network is provided. The apparatus includes: a sending part configured to send, to a plurality of edge servers, first network parameters corresponding to the first person re-identification network; a receiving part configured to receive second network parameters returned by the multiple edge servers, where, for any edge server, the edge server includes a second person re-identification network, an identity classification network, and a local image dataset, the second person re-identification network and the first person re-identification network have the same network structure, and the second network parameters are obtained after the edge server trains the second person re-identification network according to the local image dataset, the identity classification network, and the first network parameters; and an updating part configured to update the first person re-identification network according to the second network parameters returned by the multiple edge servers, to obtain an updated first person re-identification network.

According to an aspect of the embodiments of the present disclosure, a network training apparatus applied to an edge server is provided, where the edge server includes a second person re-identification network, an identity classification network, and a local image dataset. The apparatus includes: a receiving part configured to receive first network parameters corresponding to a first person re-identification network sent by a cloud server, where the first and second person re-identification networks have the same network structure; a network training part configured to train the second person re-identification network according to the local image dataset, the identity classification network, and the first network parameters, to obtain a trained second person re-identification network corresponding to second network parameters; and a sending part configured to send the second network parameters to the cloud server.

According to an aspect of the embodiments of the present disclosure, a person re-identification apparatus is provided, including a person re-identification part configured to perform person re-identification processing, through a target person re-identification network, on at least one frame of a to-be-identified image acquired within a target geographic area, to determine a person re-identification result, where the target person re-identification network is trained with the network training method described above.

According to an aspect of the embodiments of the present disclosure, an electronic device is provided, including: a processor; and a memory configured to store processor-executable instructions, where the processor is configured to invoke the instructions stored in the memory to perform the above method.

According to an aspect of the embodiments of the present disclosure, a computer-readable storage medium is provided, on which computer program instructions are stored; the computer program instructions, when executed by a processor, implement the above method.

According to an aspect of the embodiments of the present disclosure, a computer program is provided, including computer-readable code; when the computer-readable code runs in an electronic device, a processor in the electronic device implements the above method when executing the code.

In the embodiments of the present disclosure, in a cloud server including a first person re-identification network, first network parameters corresponding to the first person re-identification network are sent to multiple edge servers, and second network parameters returned by the multiple edge servers are received, where, for any edge server, the edge server includes a second person re-identification network having the same network structure as the first person re-identification network, an identity classification network, and a local image dataset, and the second network parameters are obtained after the edge server trains the second person re-identification network according to the local image dataset, the identity classification network, and the first network parameters; the first person re-identification network is then updated according to the second network parameters returned by the multiple edge servers, to obtain an updated first person re-identification network. Because the cloud server trains the person re-identification network jointly with multiple edge servers, the image datasets remain stored in the edge servers during training and do not need to be uploaded to the cloud server, so the person re-identification network can be trained effectively while data privacy is protected.

It should be understood that the above general description and the following detailed description are exemplary and explanatory only and do not limit the embodiments of the present disclosure. Other features and aspects of the embodiments of the present disclosure will become clear from the following detailed description of exemplary embodiments with reference to the accompanying drawings.

BRIEF DESCRIPTION OF THE DRAWINGS: The accompanying drawings, which are incorporated in and constitute a part of this specification, illustrate embodiments consistent with the present disclosure and, together with the specification, serve to explain the technical solutions of the embodiments. Fig. 1 shows a flowchart of a network training method according to an embodiment of the present disclosure; Fig. 2 shows an exemplary structural diagram of network training according to an embodiment of the present disclosure; Fig. 3 shows a flowchart of a network training method according to an embodiment of the present disclosure; Fig. 4 shows an exemplary schematic diagram of determining the weight of the second network parameters according to an embodiment of the present disclosure; Fig. 5 shows an exemplary cloud server/edge server network structure diagram according to an embodiment of the present disclosure; Fig. 6 shows an exemplary cloud server/edge server/terminal device network structure diagram according to an embodiment of the present disclosure; Fig. 7 shows an exemplary structural diagram of network training according to an embodiment of the present disclosure; Fig. 8 shows a block diagram of a network training apparatus according to an embodiment of the present disclosure; Fig. 9 shows a block diagram of a network training apparatus according to an embodiment of the present disclosure; Fig. 10 shows a block diagram of an electronic device according to an embodiment of the present disclosure; Fig. 11 shows a block diagram of an electronic device according to an embodiment of the present disclosure.

DETAILED DESCRIPTION: Various exemplary embodiments, features, and aspects of the present disclosure are described in detail below with reference to the accompanying drawings. The same reference numerals in the drawings denote elements with the same or similar functions. Although various aspects of the embodiments are shown in the drawings, the drawings are not necessarily drawn to scale unless specifically noted. The word "exemplary" as used herein means "serving as an example, embodiment, or illustration"; any embodiment described as "exemplary" is not necessarily to be construed as superior to or better than other embodiments. The term "and/or" herein merely describes an association relationship between associated objects and indicates that three relationships may exist; for example, "A and/or B" may indicate the three cases of A alone, both A and B, and B alone. The term "at least one" herein indicates any one of a plurality, or any combination of at least two of a plurality; for example, including at least one of A, B, and C may indicate including any one or more elements selected from the set consisting of A, B, and C. In addition, numerous specific details are given in the following detailed description in order to better explain the embodiments of the present disclosure. Those skilled in the art should understand that the embodiments of the present disclosure can also be implemented without certain specific details. In some instances, methods, means, elements, and circuits well known to those skilled in the art are not described in detail, so as to highlight the gist of the embodiments of the present disclosure.

Fig. 1 shows a flowchart of a network training method according to an embodiment of the present disclosure. The network training method may be performed by a cloud server that includes a first person re-identification network. In some possible implementations, the method may be implemented by the cloud server invoking computer-readable instructions stored in a memory. As shown in Fig. 1, the method may include: in step S11, sending, to multiple edge servers, first network parameters corresponding to the first person re-identification network; in step S12, receiving second network parameters returned by the multiple edge servers, where, for any edge server, the edge server includes a second person re-identification network, an identity classification network, and a local image dataset, the second person re-identification network has the same network structure as the first, and the second network parameters are obtained after the edge server trains the second person re-identification network according to the local image dataset, the identity classification network, and the first network parameters; and in step S13, updating the first person re-identification network according to the second network parameters returned by the multiple edge servers, to obtain an updated first person re-identification network.

The cloud server trains the person re-identification network jointly with multiple edge servers; during training, the image datasets remain stored in the edge servers and do not need to be uploaded to the cloud server, so the person re-identification network can be trained effectively while data privacy is protected. In addition, because the image datasets do not need to be uploaded to the cloud server, communication bandwidth can be saved effectively. When the cloud server trains the person re-identification network jointly with multiple edge servers, it may perform the training jointly with the multiple edge servers on the basis of a federated learning algorithm. For example, when multiple communities jointly train one person re-identification network, each community sets up an edge server, and through the federated learning algorithm, the image dataset (collected by image acquisition devices set up in or near the community) is still stored within the community (the local edge server) and does not need to be uploaded to other communities (other edge servers), thereby protecting data privacy.

In practical applications, because the amounts of data in the local image datasets of different edge servers differ, the data of different edge servers is heterogeneous. When using multiple edge servers for network training, a conventional federated learning algorithm sets the weight of the second network parameters obtained by network training at each edge server according to the amount of data in that edge server. However, because the amount of data does not directly reflect the training effect, updating the first person re-identification network with weights obtained by this weight determination method leads to lower accuracy of the updated first person re-identification network.

In a possible implementation manner, updating the first person re-identification network according to the second network parameters returned by the multiple edge servers includes: receiving weights corresponding to the second network parameters returned by the multiple edge servers, where, for any edge server, the weight is determined by the edge server according to the second person re-identification network before training and the second person re-identification network after training; performing a weighted average on the second network parameters returned by the multiple edge servers according to the corresponding weights, to obtain updated first network parameters; and updating the first person re-identification network according to the updated first network parameters.

Because the weight of the second network parameters sent by an edge server is determined by the edge server according to the second person re-identification network before and after training — that is, according to the training effect at that edge server — the cloud server obtains updated first network parameters of higher accuracy by performing a weighted average on the second network parameters returned by the multiple edge servers according to these weights, and after the first person re-identification network is updated with the updated first network parameters, the accuracy of the updated first person re-identification network is effectively improved.

In practical applications, because the local image datasets of different edge servers are collected in different scenes (illumination, angle), the data of different edge servers is heterogeneous, and consequently the performance of the trained second person re-identification network obtained by each edge server according to its local image dataset, the identity classification network, and the first network parameters is better than that of the updated first person re-identification network obtained by the cloud server jointly with multiple edge servers. Therefore, a knowledge distillation algorithm can be used: the trained second person re-identification network in each edge server serves as a teacher network, the updated first person re-identification network in the cloud server serves as a student network, and the teacher networks are used to train the student network (i.e., the updated second person re-identification networks are used to train the updated first person re-identification network), so as to improve the stability and convergence of the training process of the first person re-identification network.

In a possible implementation manner, the method further includes: sending a shared image dataset to the multiple edge servers; receiving pseudo-labels returned by the multiple edge servers, where, for any edge server, the pseudo-label is generated by the edge server according to the shared image dataset and the trained second person re-identification network; and training the updated first person re-identification network according to the shared image dataset and the pseudo-labels returned by the multiple edge servers, to obtain a trained first person re-identification network.

The cloud server receives the pseudo-labels returned by the edge servers. Because each pseudo-label is generated by an edge server according to the shared image dataset and its trained second person re-identification network, the pseudo-label can represent the network characteristics of that trained second person re-identification network. Training the updated first person re-identification network according to the shared image dataset and the pseudo-labels returned by multiple edge servers is therefore equivalent to training it with the combined network characteristics of the edge servers, which effectively improves the stability and convergence of the training process. The shared image dataset refers to an image dataset that both the cloud server and the edge servers can use for network training.

Fig. 2 shows an exemplary structural diagram of network training according to an embodiment of the present disclosure. As shown in Fig. 2, the trained second person re-identification networks in the multiple edge servers constitute teacher network 1, teacher network 2, ..., teacher network N, where N is the number of edge servers and N > 1; the updated first person re-identification network in the cloud server constitutes the student network. Teacher network 1 generates a pseudo-label from the shared image dataset and sends the pseudo-label to the cloud server; teacher network 2 generates a pseudo-label from the shared image dataset and sends it to the cloud server; ...; and teacher network N generates a pseudo-label from the shared image dataset and sends it to the cloud server.

In a possible implementation manner, the edge server is an image acquisition device, and the local image dataset is acquired by the image acquisition device. The edge server receives the first network parameters corresponding to the first person re-identification network sent by the cloud server and, according to the local image dataset and the first network parameters, the second person re-identification network is trained to obtain a trained second person re-identification network corresponding to second network parameters, and the second network parameters are then sent to the cloud server.

In a possible implementation manner, the edge server is connected to at least one image acquisition device, the edge server and the at least one image acquisition device are located in the same geographic area, and the local image dataset is obtained by the edge server from the at least one image acquisition device. In a case where at least one image acquisition device is set up within the same geographic area, an edge server can be set up within that area; in this case, the at least one image acquisition device does not need to have storage capability or computing power. The edge server is connected to the image acquisition devices and obtains images from them to build the local image dataset. The edge server receives the first network parameters sent by the cloud server, trains the second person re-identification network according to the local image dataset and the first network parameters to obtain a trained second person re-identification network corresponding to second network parameters, and then sends the second network parameters to the cloud server.

In a possible implementation manner, training the second person re-identification network according to the local image dataset, the identity classification network, and the first network parameters includes: training the second person re-identification network and the identity classification network according to the local image dataset and the first network parameters, to obtain the trained second person re-identification network and a trained identity classification network.

In a possible implementation manner, the local image dataset includes image data corresponding to multiple identities, and the dimension of the identity classification network is related to the number of identities. Because the trained person re-identification network identifies identities in images, training the person re-identification network uses a local image dataset that includes image data corresponding to multiple identities, together with an identity classification network whose dimension is related to the number of identities included in the local image dataset. For example, if the local image dataset includes image data corresponding to 100 identities, the dimension of the identity classification network is 100; that is, the identity classification network includes 100 different identity classes.

The edge server builds the local second person re-identification network and the identity classification network into a combined network and trains the combined network with the first network parameters received from the cloud server and the local image dataset, thereby obtaining a trained combined network that includes a trained second person re-identification network (corresponding to second network parameters) and a trained identity classification network. The edge server then sends the second network parameters to the cloud server. Because the first and second person re-identification networks have the same network structure, the second network parameters can be used to update the first person re-identification network.

In a possible implementation manner, the method further includes: storing the trained identity classification network in the edge server. Because the first person re-identification network trained in the cloud server does not need a classifier network when actually performing person re-identification processing, in order to save communication bandwidth and to ensure the consistency of the network structures in the cloud server and the edge servers during joint training based on the federated learning algorithm, the edge server sends only the second network parameters corresponding to the trained second person re-identification network to the cloud server, and stores the trained identity classification network locally in the edge server.

In a possible implementation manner, the method further includes: receiving the shared image dataset sent by the cloud server; generating a pseudo-label according to the shared image dataset and the trained second person re-identification network; and sending the pseudo-label to the cloud server. Still taking Fig. 2 as an example, the edge server receives the shared image dataset sent by the cloud server, generates a pseudo-label with the shared image dataset and the locally trained second person re-identification network, and then sends the pseudo-label to the cloud server. Because the pseudo-label can represent the network characteristics of the trained second person re-identification network, the trained first person re-identification network obtained by the cloud server after training the updated first person re-identification network according to the pseudo-label has network performance closer to that of the trained second person re-identification network in the edge server, which effectively improves the stability and convergence of the training process of the first person re-identification network.
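The distillation step used in this disclosure averages the per-teacher pseudo-labels generated on the shared image dataset before training the cloud-side student network. A minimal sketch of that averaging, assuming soft pseudo-labels represented as per-image probability vectors (the disclosure does not fix the representation):

```python
def average_pseudo_labels(teacher_labels):
    """Element-wise mean of the pseudo-labels returned by the N edge
    servers on the shared image dataset.

    teacher_labels: list of N entries; each entry is a list of per-image
    probability vectors (lists of floats) produced by one trained second
    person re-identification network acting as a teacher.
    """
    n = len(teacher_labels)
    num_images = len(teacher_labels[0])
    return [
        [sum(t[i][j] for t in teacher_labels) / n
         for j in range(len(teacher_labels[0][i]))]
        for i in range(num_images)
    ]
```

The averaged labels then serve as the supervision signal when the cloud server trains the updated first person re-identification network on the shared image dataset.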
Edge server A is connected to image acquisition device 1 and image acquisition device 2, and they are set up in the same geographic area (for example, the same community or the same company); edge server A obtains images from image acquisition device 1 and image acquisition device 2 to build a local image dataset. Edge server B is connected to terminal device 3, terminal device 4, and terminal device 5, which are image acquisition devices (image acquisition devices 3, 4, and 5 — for example, cameras); edge server B, image acquisition device 3, image acquisition device 4, and image acquisition device 5 are set up in the same geographic area (for example, the same community or the same company), and edge server B obtains images from image acquisition devices 3, 4, and 5 to build a local image dataset. The cloud server trains the person re-identification network jointly with the two edge servers (edge server A and edge server B); during training, the image datasets remain stored locally at each edge server and do not need to be uploaded to the cloud server, so the person re-identification network can be trained effectively while data privacy is protected.

From the above discussion, the present disclosure proposes two training architectures that combine federated learning and person re-identification: a cloud-edge architecture and an end-edge-cloud architecture.

Cloud-edge architecture: the cloud server communicates directly with smart cameras, and the cloud coordinates multiple smart cameras to train simultaneously. The smart cameras cache images at the edge and periodically delete them to reduce the storage pressure on the edge side. This architecture requires the smart cameras to have certain computing, storage, and communication capabilities.

Cloud-edge-end architecture: an edge gateway (i.e., the edge server described above) is connected to multiple smart cameras, and the cloud server is connected to multiple edge gateways. Person re-identification training images are passed from the smart cameras to the edge gateway and cached there, and the edge gateways perform federated learning training with the cloud server. In this process, the data remains at the edge gateways, and data privacy is still protected. A typical application scenario: multiple communities jointly train one person re-identification model; each community has an edge gateway connected to multiple smart cameras, and through federated learning the data remains within the community and is not transmitted to other communities or the cloud server, thereby protecting data privacy.

In a possible implementation manner, when the cloud server trains the person re-identification network jointly with multiple edge servers, some of the edge servers may also be image acquisition devices (for example, smart cameras) that communicate directly with the cloud server, while others are edge servers connected to at least one image acquisition device; the embodiments of the present disclosure do not specifically limit this.

Fig. 7 shows an exemplary structural diagram of network training according to an embodiment of the present disclosure. As shown in Fig. 7, the cloud server can communicate with multiple edge servers (edge server 1, edge server 2, ..., edge server N), and the first person re-identification network included in the cloud server and the second person re-identification network included in each edge server have the same network structure. Each edge server also includes a local image dataset and an identity classification network. The cloud server sends the first network parameters corresponding to the first person re-identification network to the multiple edge servers; after receiving the first network parameters, each edge server trains its second person re-identification network with the local image dataset and the identity classification network, obtaining a trained second person re-identification network corresponding to second network parameters as well as a trained identity classification network. To ensure that the network structures jointly trained by the cloud server and the edge servers are consistent, each edge server sends only the second network parameters corresponding to the trained second person re-identification network to the cloud server. The cloud server updates the first person re-identification network according to the second network parameters returned by the multiple edge servers to obtain an updated first person re-identification network, and then sends the first network parameters corresponding to the updated first person re-identification network to the multiple edge servers for cyclic training, until the recognition accuracy of the updated first person re-identification network in the cloud server reaches a threshold or the number of training cycles reaches a preset number, at which point training ends.

The general federated learning algorithm (Federated Averaging, FedAvg) requires that the models of the synchronized parties (the person re-identification deep learning models, i.e., the person re-identification networks described above) be exactly the same. The classifier layer of a person re-identification deep learning model (i.e., the identity classification network described above) depends on how many different persons each party's data contains, so the classifier layers of the parties participating in training may differ, and therefore the models of the parties participating in federated learning may differ; consequently, the FedAvg algorithm is not applicable in the above application scenario. As described above, the present disclosure improves the federated learning algorithm by allowing the models of the parties participating in federated learning to differ in part, so that federated learning can be better applied to person re-identification training.

Because the amounts of data in the local image datasets of different edge servers differ, the data of different edge servers is heterogeneous. When the first person re-identification network is trained jointly with multiple edge servers, in order to reduce the influence of data heterogeneity on the accuracy of the updated first person re-identification network, each edge server can use a training-effect-based weight determination method to determine the weight of the second network parameters corresponding to its trained second person re-identification network, so that the cloud server obtains an updated first person re-identification network of higher accuracy after updating the first person re-identification network jointly with the second network parameters returned by the edge servers. The specific steps of the training-effect-based weight determination method are as described in the relevant parts of the above embodiments and are not repeated here.

Because the local image datasets of different edge servers are collected in different scenes (illumination, angle), the data of different edge servers is heterogeneous, and consequently the performance of the trained second person re-identification network obtained by each edge server according to its local image dataset and the first network parameters is better than that of the updated first person re-identification network obtained by the cloud server jointly with multiple edge servers. To improve the stability and convergence of the training process of the first person re-identification network, a knowledge distillation algorithm can be used to train the updated first person re-identification network in the cloud server on the basis of the updated second person re-identification networks in the edge servers and the shared image dataset, which effectively improves the stability and convergence of the training process. The specific training process based on the knowledge distillation algorithm is as described in the relevant parts of the above embodiments and is not repeated here.

On the basis of the above, the present disclosure proposes using knowledge distillation to treat the local models of the parties participating in federated learning as teacher models and the model of the cloud server as the student model, so as to better transfer the knowledge of the teacher models to the student model, thereby improving the stability and convergence of model training. In the process of training the person re-identification network based on the network structure shown in Fig. 7, the training-effect-based weight determination method and the knowledge-distillation-based network training can each be used separately or in combination; the embodiments of the present disclosure do not specifically limit this.

In one application scenario, for example, multiple companies or institutions want to jointly train a person re-identification network to improve the robustness of the trained network. To avoid the data privacy leakage caused by aggregating the data of multiple parties onto the same server, the person re-identification network can be jointly trained based on the network structure shown in Fig. 7, where the multiple companies or institutions act as edge servers and communicate directly with the same cloud server. During training, the data remains local and does not need to be uploaded to the cloud server, so the person re-identification network can be obtained through effective training in the cloud server while the data privacy of the companies or institutions is protected.

In another application scenario, for example, company A provides a person re-identification network training service for company B. If the image data of all of company B's image acquisition devices (for example, smart cameras) were uploaded to company A, a data privacy leakage problem would arise. In this case, company A can jointly train the person re-identification network based on the network structure shown in Fig. 7, with company A acting as the cloud server and company B's image acquisition devices acting as multiple edge servers; during training, the data remains local at company B and does not need to be uploaded to company A, so the person re-identification network can be obtained through effective training while company B's data privacy is protected.

Embodiments of the present disclosure further provide a person re-identification method. The person re-identification method may be performed by a terminal device or other processing device, where the terminal device may be an image acquisition device (for example, a smart camera), user equipment (UE), a mobile device, a user terminal, a terminal, a cellular phone, a cordless phone, a personal digital assistant (PDA), a handheld device, a computing device, a vehicle-mounted device, a wearable device, or the like; the other processing device may be a server, a cloud server, or the like. In some possible implementations, the person re-identification method may be implemented by a processor invoking computer-readable instructions stored in a memory. The method may include: performing person re-identification processing, through a target person re-identification network, on at least one frame of a to-be-identified image acquired within a target geographic area, to determine a person re-identification result, where the target person re-identification network is trained with the network training method of the foregoing embodiments. The target person re-identification network can perform person re-identification processing on the at least one frame of to-be-identified image acquired within the target geographic area and determine whether a specific person is present in it.

In a possible implementation manner, the target person re-identification network is the updated first person re-identification network or the trained first person re-identification network. Because the updated first person re-identification network or the trained first person re-identification network in the cloud server is universal, i.e., can be applied to any application scenario, it can be used to perform person re-identification processing on the at least one frame of to-be-identified image acquired within the target geographic area, to obtain a person re-identification result.

In a possible implementation manner, in a case where an edge server is included within the target geographic area and the edge server includes a trained second person re-identification network, the target person re-identification network is the trained second person re-identification network. From the above network training method embodiments of the cloud server and the edge servers, because the local image datasets of different edge servers are collected in different scenes (illumination, angle), the data of different edge servers is heterogeneous; the trained second person re-identification network obtained by each edge server from its local image dataset is personalized and better adapted to the local scene, and its performance is therefore better than that of the updated first person re-identification network obtained by the cloud server jointly with multiple edge servers. Accordingly, when an edge server with a trained second person re-identification network is included within the target geographic area, the trained second person re-identification network, which is better adapted to the local scene of the target geographic area, can be used to perform person re-identification processing on the at least one frame of to-be-identified image, to improve the accuracy of the processing result.

In the present disclosure, after one deployment, further training iterations can be performed according to the data generated by the edge servers, so that low-cost continuous model updating and iteration can be achieved.

It can be understood that the above method embodiments mentioned in the embodiments of the present disclosure can be combined with each other to form combined embodiments without departing from principles and logic; due to space limitations, details are not repeated here. Those skilled in the art can understand that in the above methods of the detailed description, the specific execution order of the steps should be determined by their functions and possible internal logic.

In addition, the embodiments of the present disclosure further provide a network training / person re-identification apparatus, an electronic device, a computer-readable storage medium, and a program, all of which can be used to implement any network training / person re-identification method provided by the embodiments of the present disclosure; for the corresponding technical solutions and descriptions, refer to the corresponding records in the method section, which are not repeated here.

Fig. 8 shows a block diagram of a network training apparatus according to an embodiment of the present disclosure. The network training apparatus is applied to a cloud server that includes a first person re-identification network. As shown in Fig. 8, the apparatus 80 includes: a sending part 81 configured to send, to multiple edge servers, first network parameters corresponding to the first person re-identification network; a receiving part 82 configured to receive second network parameters returned by the multiple edge servers, where, for any edge server, the edge server includes a second person re-identification network, an identity classification network, and a local image dataset, the second person re-identification network and the first person re-identification network have the same network structure, and the second network parameters are obtained after the edge server trains the second person re-identification network according to the local image dataset, the identity classification network, and the first network parameters; and an updating part 83 configured to update the first person re-identification network according to the second network parameters returned by the multiple edge servers, to obtain an updated first person re-identification network.

In a possible implementation manner, the updating part 83 includes: a receiving sub-part configured to receive the weights corresponding to the second network parameters returned by the multiple edge servers, where, for any edge server, the weight is determined by the edge server according to the second person re-identification network before training and the second person re-identification network after training; a first updating sub-part configured to perform a weighted average on the second network parameters returned by the multiple edge servers according to the corresponding weights, to obtain updated first network parameters; and a second updating sub-part configured to update the first person re-identification network according to the updated first network parameters, to obtain the updated first person re-identification network.

In a possible implementation manner, the sending part 81 is further configured to send a shared image dataset to the multiple edge servers; the receiving part 82 is further configured to receive pseudo-labels returned by the multiple edge servers, where, for any edge server, the pseudo-label is generated by the edge server according to the shared image dataset and the trained second person re-identification network; and the apparatus 80 further includes a network training part configured to train the updated first person re-identification network according to the shared image dataset and the pseudo-labels returned by the multiple edge servers, to obtain a trained first person re-identification network.

In a possible implementation manner, the network training part is further configured to: determine an average pseudo-label according to the pseudo-labels returned by the multiple edge servers; and train the updated first person re-identification network according to the shared image dataset and the average pseudo-label, to obtain the trained first person re-identification network.

Fig. 9 shows a block diagram of a network training apparatus according to an embodiment of the present disclosure. The network training apparatus is applied to an edge server that includes a second person re-identification network, an identity classification network, and a local image dataset. As shown in Fig. 9, the apparatus 90 includes: a receiving part 91 configured to receive first network parameters corresponding to a first person re-identification network sent by a cloud server, where the first and second person re-identification networks have the same network structure; a network training part 92 configured to train the second person re-identification network according to the local image dataset, the identity classification network, and the first network parameters, to obtain a trained second person re-identification network corresponding to second network parameters; and a sending part 93 configured to send the second network parameters to the cloud server.

In a possible implementation manner, the network training part 92 is further configured to: train the second person re-identification network and the identity classification network according to the local image dataset and the first network parameters, to obtain the trained second person re-identification network and a trained identity classification network.

In a possible manner, the apparatus 90 further includes a storage part configured to store the trained identity classification network in the edge server.

In a possible manner, the local image dataset includes image data corresponding to multiple identities, and the dimension of the identity classification network is related to the number of identities.

In a possible manner, the receiving part 91 is further configured to receive a shared image dataset sent by the cloud server; the apparatus 90 further includes a pseudo-label generation part configured to generate a pseudo-label according to the shared image dataset and the trained second person re-identification network; and the sending part 93 is further configured to send the pseudo-label to the cloud server.

In a possible manner, the apparatus 90 further includes: a first determining part configured to determine a first feature vector according to the second person re-identification network before training and the local image dataset, and to determine a second feature vector according to the second person re-identification network after training and the local image dataset; a second determining part configured to determine the cosine distance between the first feature vector and the second feature vector; and a third determining part configured to determine, according to the cosine distance, the weight corresponding to the second network parameters; the sending part 93 is further configured to send the weight corresponding to the second network parameters to the cloud server.

In a possible manner, the edge server is an image acquisition device, and the local image dataset is acquired by the image acquisition device. In a possible manner, the edge server is connected to at least one image acquisition device, the edge server and the at least one image acquisition device are located in the same geographic area, and the local image dataset is obtained by the edge server from the at least one image acquisition device.

Embodiments of the present disclosure further provide a person re-identification apparatus, including a person re-identification part configured to perform person re-identification processing, through a target person re-identification network, on at least one frame of a to-be-identified image acquired within a target geographic area, to determine a person re-identification result, where the target person re-identification network is trained with the above network training method. In a possible implementation manner, the target person re-identification network is the updated first person re-identification network or the trained first person re-identification network. In a possible implementation manner, in a case where an edge server is included within the target geographic area and the edge server includes a trained second person re-identification network, the target person re-identification network is the trained second person re-identification network.

In some embodiments, the functions of, or the parts included in, the network training / person re-identification apparatus provided by the embodiments of the present disclosure can be configured to perform the methods described in the above method embodiments; for their specific implementation, refer to the descriptions of the above method embodiments, which are not repeated here for brevity. In the embodiments of the present disclosure and other embodiments, a "part" may be part of a circuit, part of a processor, part of a program or software, and the like; it may of course also be a unit, and may be a module or non-modular.

Embodiments of the present disclosure further propose a computer-readable storage medium on which computer program instructions are stored; the computer program instructions, when executed by a processor, implement the above method. The computer-readable storage medium may be a non-volatile computer-readable storage medium.

Embodiments of the present disclosure further propose an electronic device, including: a processor; and a memory configured to store processor-executable instructions, where the processor is configured to invoke the instructions stored in the memory to perform the above method.

Embodiments of the present disclosure further provide a computer program product, including computer-readable code; when the computer-readable code runs on a device, a processor in the device executes instructions for implementing the network training / person re-identification method provided by any of the above embodiments. Embodiments of the present disclosure further provide another computer program product configured to store computer-readable instructions; when the instructions are executed, the computer performs the operations of the network training / person re-identification method provided by any of the above embodiments.

The electronic device may be provided as a terminal, a server, or a device in another form.

Fig. 10 shows a block diagram of an electronic device according to an embodiment of the present disclosure. As shown in Fig. 10, the electronic device 800 may be a terminal such as an image acquisition device (for example, a smart camera), a mobile phone, a computer, a digital broadcast terminal, a messaging device, a game console, a tablet device, a medical device, a fitness device, or a personal digital assistant. Referring to Fig. 10, the electronic device 800 may include one or more of the following components: a processing component 802, a memory 804, a power component 806, a multimedia component 808, an audio component 810, an input/output (I/O) interface 812, a sensor component 814, and a communication component 816.

The processing component 802 generally controls the overall operations of the electronic device 800, such as operations associated with display, telephone calls, data communication, camera operations, and recording operations. The processing component 802 may include one or more processors 820 to execute instructions, to complete all or some of the steps of the above method. In addition, the processing component 802 may include one or more modules to facilitate interaction between the processing component 802 and other components; for example, the processing component 802 may include a multimedia module to facilitate interaction between the multimedia component 808 and the processing component 802.

The memory 804 is configured to store various types of data to support operation at the electronic device 800. Examples of such data include instructions for any application or method operating on the electronic device 800, contact data, phone book data, messages, pictures, videos, and the like. The memory 804 may be implemented by any type of volatile or non-volatile storage device or a combination thereof, such as static random access memory (SRAM), electrically erasable programmable read-only memory (EEPROM), erasable programmable read-only memory (EPROM), programmable read-only memory (PROM), read-only memory (ROM), magnetic memory, flash memory, a magnetic disk, or an optical disc.

The power component 806 provides power for the various components of the electronic device 800. The power component 806 may include a power management system, one or more power supplies, and other components associated with generating, managing, and distributing power for the electronic device 800.

The multimedia component 808 includes a screen that provides an output interface between the electronic device 800 and the user. In some embodiments, the screen may include a liquid crystal display (LCD) and a touch panel (TP). If the screen includes a touch panel, the screen may be implemented as a touch screen to receive input signals from the user. The touch panel includes one or more touch sensors to sense touches, swipes, and gestures on the touch panel. The touch sensors may not only sense the boundary of a touch or swipe action, but also detect the duration and pressure associated with the touch or swipe operation. In some embodiments, the multimedia component 808 includes a front camera and/or a rear camera. When the electronic device 800 is in an operating mode, such as a shooting mode or a video mode, the front camera and/or the rear camera can receive external multimedia data. Each of the front camera and the rear camera may be a fixed optical lens system or have focal length and optical zoom capability.

The audio component 810 is configured to output and/or input audio signals. For example, the audio component 810 includes a microphone (MIC), and the microphone is configured to receive external audio signals when the electronic device 800 is in an operating mode, such as a call mode, a recording mode, or a speech recognition mode. The received audio signals may be further stored in the memory 804 or sent via the communication component 816. In some embodiments, the audio component 810 further includes a speaker for outputting audio signals.
CROSS-REFERENCE TO RELATED APPLICATIONS: The present disclosure is based on, and claims priority to, Chinese patent application No. 202010864291.4, filed on August 25, 2020, the entire content of which is incorporated herein by reference. FIELD OF THE DISCLOSURE: The present disclosure relates to the field of computer technology, and in particular, to a method and device for network training and person re-identification, a storage medium, and a computer program. 2. Description of the Related Art: Person re-identification, also known as pedestrian re-identification, is a technology that uses computer vision to determine whether a specific person is present in an image or a video sequence. At present, person re-identification technology has been widely used in many fields and industries, such as intelligent video detection, intelligent security, and so on. Since person re-identification technology involves private data such as faces, human bodies, and personal identities in the process of processing images or video frame sequences, there is an urgent need for a person re-identification method that can avoid leakage of private data. SUMMARY: Embodiments of the present disclosure propose technical solutions for a network training and person re-identification method and device, a storage medium, and a computer program. According to an aspect of the embodiments of the present disclosure, a network training method is provided.
The method is applied to a cloud server, where the cloud server includes a first person re-identification network, and the method includes: sending, to a plurality of edge servers, first network parameters corresponding to the first person re-identification network; receiving second network parameters returned by the multiple edge servers, wherein, for any of the edge servers, the edge server includes a second person re-identification network, an identity classification network, and a local image dataset, the second person re-identification network and the first person re-identification network have the same network structure, and the second network parameters are obtained after the edge server trains the second person re-identification network according to the local image dataset, the identity classification network, and the first network parameters; and updating the first person re-identification network according to the second network parameters to obtain the updated first person re-identification network. In a possible implementation manner, updating the first person re-identification network according to the second network parameters returned by the multiple edge servers to obtain the updated first person re-identification network includes: receiving weights corresponding to the second network parameters returned by the multiple edge servers, wherein, for any of the edge servers, the weight corresponding to the second network parameters is determined by the edge server according to the second person re-identification network before training and the second person re-identification network after training; performing a weighted average on the second network parameters returned by the multiple edge servers according to the corresponding weights, to obtain updated first network parameters; and, according to the updated first network parameters, the first person re-identification network
is updated to obtain the updated first person re-identification network. In a possible implementation manner, the method further includes: sending a shared image dataset to the multiple edge servers; receiving pseudo-labels returned by the multiple edge servers, wherein, for any of the edge servers, the pseudo-label is generated by the edge server according to the shared image dataset and the trained second person re-identification network; and training the updated first person re-identification network according to the shared image dataset and the pseudo-labels returned by the multiple edge servers, to obtain the trained first person re-identification network. In a possible implementation manner, training the updated first person re-identification network according to the shared image dataset and the pseudo-labels returned by the multiple edge servers to obtain the trained first person re-identification network includes: determining, according to the pseudo-labels returned by the multiple edge servers,
According to an aspect of the embodiments of the present disclosure, a network training method is provided. The method is applied to an edge server, the edge server including a second person re-identification network, an identity classification network and a local image dataset, and the method includes: receiving first network parameters corresponding to a first person re-identification network sent by a cloud server, wherein the first person re-identification network and the second person re-identification network have the same network structure; training the second person re-identification network according to the local image dataset, the identity classification network and the first network parameters to obtain a trained second person re-identification network, wherein the trained second person re-identification network corresponds to second network parameters; and sending the second network parameters to the cloud server.

In a possible implementation manner, training the second person re-identification network according to the local image dataset, the identity classification network and the first network parameters to obtain the trained second person re-identification network includes: training the second person re-identification network and the identity classification network according to the local image dataset and the first network parameters to obtain the trained second person re-identification network and a trained identity classification network.

In a possible implementation manner, the method further includes: storing the trained identity classification network in the edge server.

In a possible implementation manner, the local image dataset includes image data corresponding to multiple identities, and the dimension of the identity classification network is related to the number of the multiple identities.
In a possible implementation manner, the method further includes: receiving a shared image dataset sent by the cloud server; generating a pseudo-label according to the shared image dataset and the trained second person re-identification network; and sending the pseudo-label to the cloud server.

In a possible implementation manner, the method further includes: determining a first feature vector according to the second person re-identification network before training and the local image dataset; determining a second feature vector according to the second person re-identification network after training and the local image dataset; determining the cosine distance between the first feature vector and the second feature vector; determining the weight corresponding to the second network parameters according to the cosine distance; and sending the weight corresponding to the second network parameters to the cloud server.

In a possible implementation manner, the edge server is an image acquisition device, and the local image dataset is acquired by the image acquisition device.

In a possible implementation manner, the edge server is connected to at least one image acquisition device, the edge server and the at least one image acquisition device are located in the same geographical area, and the local image dataset is obtained by the edge server from the at least one image acquisition device.
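The cosine-distance-based weighting in this implementation can be sketched as follows. The mapping from cosine distance to aggregation weight is an assumption (the embodiment states only that the weight is determined according to the cosine distance); here a larger before/after feature change yields a larger weight:

```python
import numpy as np

def cosine_distance(u, v):
    """1 minus the cosine similarity between two feature vectors."""
    return 1.0 - np.dot(u, v) / (np.linalg.norm(u) * np.linalg.norm(v))

def weights_from_distances(distances):
    """Normalize per-edge cosine distances into aggregation weights (assumed mapping)."""
    d = np.asarray(distances, dtype=float)
    return d / d.sum()

# Feature vector of the same local image before and after edge-side training.
before = np.array([1.0, 0.0])
after = np.array([0.6, 0.8])  # unit vector rotated away from `before`
dist = cosine_distance(before, after)
```

The cloud server would then use the normalized weights when averaging the second network parameters returned by the edge servers.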
According to an aspect of the embodiments of the present disclosure, a person re-identification method is provided, including: performing person re-identification processing, through a target person re-identification network, on at least one frame of images to be identified obtained within a target geographical area, and determining a person re-identification result, wherein the target person re-identification network is trained by the above network training method.

In a possible implementation manner, the target person re-identification network is the updated first person re-identification network or the trained first person re-identification network.

In a possible implementation manner, in a case where an edge server is included within the target geographical area and the edge server includes a trained second person re-identification network, the target person re-identification network is the trained second person re-identification network.

According to an aspect of the embodiments of the present disclosure, a network training apparatus is provided. The apparatus is applied to a cloud server, the cloud server including a first person re-identification network, and the apparatus includes: a sending part configured to send first network parameters corresponding to the first person re-identification network to multiple edge servers; a receiving part configured to receive second network parameters returned by the multiple edge servers, wherein, for any one of the edge servers, the edge server includes a second person re-identification network, an identity classification network and a local image dataset, the second person re-identification network and the first person re-identification network have the same network structure, and the second network parameters are obtained after the edge server trains the second person re-identification network according to the local image dataset, the identity classification network and the first network parameters; and an updating part configured to update the first person re-identification network according to the second network parameters returned by the multiple edge servers to obtain an updated first person re-identification network.

According to an aspect of the embodiments of the present disclosure, a network training apparatus is provided. The apparatus is applied to an edge server, the edge server including a second person re-identification network, an identity classification network and a local image dataset, and the apparatus includes: a receiving part configured to receive first network parameters corresponding to a first person re-identification network sent by a cloud server, wherein the first person re-identification network and the second person re-identification network have the same network structure; a network training part configured to train the second person re-identification network according to the local image dataset, the identity classification network and the first network parameters to obtain a trained second person re-identification network, wherein the trained second person re-identification network corresponds to second network parameters; and a sending part configured to send the second network parameters to the cloud server.
According to an aspect of the embodiments of the present disclosure, a person re-identification apparatus is provided, comprising: a person re-identification part configured to perform person re-identification processing, through a target person re-identification network, on at least one frame of images to be identified obtained within a target geographical area, and to determine a person re-identification result, wherein the target person re-identification network is trained by the above network training method.

According to an aspect of the embodiments of the present disclosure, an electronic device is provided, comprising: a processor; and a memory configured to store instructions executable by the processor, wherein the processor is configured to invoke the instructions stored in the memory to perform the above method.

According to an aspect of the embodiments of the present disclosure, a computer-readable storage medium is provided, having computer program instructions stored thereon, the computer program instructions implementing the foregoing method when executed by a processor.

According to an aspect of the embodiments of the present disclosure, a computer program is provided, including computer-readable code, and when the computer-readable code runs in an electronic device, a processor in the electronic device executes the foregoing method.

In the embodiments of the present disclosure, a cloud server including a first person re-identification network sends first network parameters corresponding to the first person re-identification network to multiple edge servers and receives second network parameters returned by the multiple edge servers, wherein, for any one of the edge servers, the edge server includes a second person re-identification network having the same network structure as the first person re-identification network, an identity classification network and a local image dataset, and the second network parameters are obtained after the edge server trains the second person re-identification network according to the local image dataset, the identity classification network and the first network parameters; the cloud server then updates the first person re-identification network according to the second network parameters returned by the multiple edge servers to obtain an updated first person re-identification network. The cloud server thus trains the person re-identification network jointly with multiple edge servers. During training, each image dataset remains on its edge server and does not need to be uploaded to the cloud server, so the person re-identification network can be trained effectively while protecting data privacy.

It should be understood that the foregoing general description and the following detailed description are exemplary and explanatory only and are not limiting of the embodiments of the present disclosure. Other features and aspects of the embodiments of the present disclosure will become apparent from the following detailed description of exemplary embodiments with reference to the accompanying drawings.

BRIEF DESCRIPTION OF THE DRAWINGS

The accompanying drawings, which are incorporated into and constitute a part of the specification, illustrate embodiments consistent with the present disclosure and, together with the description, serve to explain the technical solutions of the embodiments of the present disclosure. Fig. 1 shows a flowchart of a network training method according to an embodiment of the present disclosure; Fig. 2 shows a structural diagram of exemplary network training provided according to an embodiment of the present disclosure; Fig. 3 shows a flowchart of a network training method according to an embodiment of the present disclosure; Fig. 4 shows an exemplary schematic diagram of determining the weight of second network parameters provided according to an embodiment of the present disclosure; Fig. 5 shows an exemplary cloud server-edge server network structure diagram provided according to an embodiment of the present disclosure; Fig. 6 shows an exemplary cloud server-edge server-terminal device network structure diagram provided according to an embodiment of the present disclosure; Fig. 7 shows a structural diagram of exemplary network training provided according to an embodiment of the present disclosure; Fig. 8 shows a block diagram of a network training apparatus according to an embodiment of the present disclosure; Fig. 9 shows a block diagram of a network training apparatus according to an embodiment of the present disclosure; Fig. 10 shows a block diagram of an electronic device according to an embodiment of the present disclosure; Fig. 11 shows a block diagram of an electronic device according to an embodiment of the present disclosure.

DETAILED DESCRIPTION

Various exemplary embodiments, features and aspects of the present disclosure will be described in detail below with reference to the accompanying drawings. The same reference numbers in the figures denote elements having the same or similar functions. Although various aspects of the embodiments are shown in the drawings, the drawings are not necessarily drawn to scale unless otherwise indicated. The word "exemplary" is used herein to mean "serving as an example, embodiment, or illustration." Any embodiment described herein as "exemplary" is not necessarily to be construed as preferred or advantageous over other embodiments.
The term "and/or" herein merely describes an association relationship between associated objects, indicating that three relationships may exist; for example, A and/or B may mean: A exists alone, A and B exist simultaneously, or B exists alone. In addition, the term "at least one" herein means any one of multiple items or any combination of at least two of multiple items; for example, including at least one of A, B and C may mean including any one or more elements selected from the set consisting of A, B and C.

In addition, numerous specific details are given in the following detailed description in order to better illustrate the embodiments of the present disclosure. Those skilled in the art will understand that the embodiments of the present disclosure can also be implemented without certain specific details. In some instances, methods, means, components and circuits well known to those skilled in the art are not described in detail in order to highlight the gist of the embodiments of the present disclosure.

FIG. 1 shows a flowchart of a network training method according to an embodiment of the present disclosure. The network training method can be executed by a cloud server, and the cloud server includes a first person re-identification network. In some possible implementations, the network training method may be implemented by the cloud server invoking computer-readable instructions stored in a memory. As shown in FIG. 1, the method may include the following steps.

In step S11, first network parameters corresponding to the first person re-identification network are sent to multiple edge servers.
In step S12, second network parameters returned by the multiple edge servers are received, wherein, for any one of the edge servers, the edge server includes a second person re-identification network, an identity classification network and a local image dataset, the second person re-identification network and the first person re-identification network have the same network structure, and the second network parameters are obtained after the edge server trains the second person re-identification network according to the local image dataset, the identity classification network and the first network parameters.

In step S13, the first person re-identification network is updated according to the second network parameters returned by the multiple edge servers to obtain an updated first person re-identification network.

The cloud server trains the person re-identification network jointly with multiple edge servers. During training, each image dataset remains on its edge server and does not need to be uploaded to the cloud server, so the person re-identification network can be trained effectively while protecting data privacy. In addition, since the image datasets do not need to be uploaded to the cloud server, communication bandwidth can be saved. When the cloud server trains the person re-identification network jointly with multiple edge servers, it can perform the network training based on a federated learning algorithm. For example, a person re-identification network may be jointly trained by multiple communities, each community being provided with an edge server.
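Steps S11–S13 can be sketched as one federated round from the cloud server's perspective. This is an illustrative reduction with unweighted averaging; the edge-side training is stubbed out, and all names are hypothetical:

```python
import numpy as np

def federated_round(global_params, edge_train_fns):
    """One cloud-side round: broadcast parameters (S11), collect edge
    updates (S12), and update the global network by averaging (S13).

    `edge_train_fns` stand in for edge servers: each maps the broadcast
    parameters to its locally trained parameters.
    """
    returned = [train(global_params) for train in edge_train_fns]  # S11 + S12
    return np.mean(returned, axis=0)                               # S13

# Toy edges that each nudge the parameters toward a local optimum.
edges = [lambda p: p + 1.0, lambda p: p + 3.0]
updated = federated_round(np.zeros(2), edges)
```

In the disclosed method this round is repeated cyclically, with the averaged result re-broadcast as the new first network parameters.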
Through the federated learning algorithm, each image dataset (the images collected by the image acquisition devices installed in or near a community) remains stored in that community (on the local edge server) and does not need to be uploaded to other communities (other edge servers), thereby protecting data privacy.

In practical applications, because the amounts of data in the local image datasets of different edge servers differ, the data across edge servers is heterogeneous. When a traditional federated learning algorithm uses multiple edge servers for network training, the weights of the second network parameters obtained by training on the edge servers are set according to the amounts of data on the different edge servers. However, since the amount of data does not directly reflect the training effect, a cloud server that updates the first person re-identification network using weights determined in this way obtains an updated first person re-identification network with lower accuracy.

In a possible implementation manner, updating the first person re-identification network according to the second network parameters returned by the multiple edge servers to obtain the updated first person re-identification network includes: receiving weights corresponding to the second network parameters returned by the multiple edge servers, wherein, for any one of the edge servers, the weight corresponding to its second network parameters is determined by that edge server according to the second person re-identification network before training and the second person re-identification network after training; performing a weighted average of the second network parameters returned by the multiple edge servers according to the corresponding weights to obtain updated first network parameters; and updating the first person re-identification network according to the updated first network parameters to obtain the updated first person re-identification network.

Since the weight of the second network parameters sent by an edge server is determined by that edge server according to the second person re-identification network before training and the second person re-identification network after training, the weight reflects the training effect on that edge server. The cloud server therefore obtains higher-precision updated first network parameters by weighted-averaging the second network parameters returned by the multiple edge servers according to these weights, and after the first person re-identification network is updated according to the updated first network parameters, the accuracy of the updated first person re-identification network is effectively improved.

In practical applications, because the local image datasets of different edge servers are collected under different conditions (lighting, viewing angle), the data across edge servers is heterogeneous. As a result, the trained second person re-identification network obtained on an edge server from its local image dataset, its identity classification network and the first network parameters can outperform the updated first person re-identification network obtained by the cloud server training jointly with multiple edge servers.
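The weighted average of the second network parameters can be sketched as follows, assuming each edge server's parameters arrive as a name-to-array dictionary and the training-effect weights are normalized before aggregation (a sketch under these assumptions, not the claimed implementation):

```python
import numpy as np

def weighted_average_params(param_dicts, weights):
    """Weighted average of per-edge parameter dictionaries, key by key.

    `weights` are the training-effect weights returned by the edge servers;
    they are normalized here so they sum to one.
    """
    w = np.asarray(weights, dtype=float)
    w = w / w.sum()
    keys = param_dicts[0].keys()
    return {k: sum(wi * d[k] for wi, d in zip(w, param_dicts)) for k in keys}

# Hypothetical parameters from two edge servers, with the first weighted 3:1.
edge_1 = {"conv.weight": np.array([1.0, 1.0])}
edge_2 = {"conv.weight": np.array([3.0, 5.0])}
merged = weighted_average_params([edge_1, edge_2], weights=[3.0, 1.0])
```

The resulting dictionary plays the role of the updated first network parameters loaded into the first person re-identification network.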
Therefore, based on a knowledge distillation algorithm, the trained second person re-identification network on each edge server can be used as a teacher network, the updated first person re-identification network on the cloud server can be used as a student network, and the teacher networks can be used to train the student network (that is, the trained second person re-identification networks are used to train the updated first person re-identification network) to improve the stability and convergence of the training process of the first person re-identification network.

In a possible implementation manner, the method further includes: sending a shared image dataset to the multiple edge servers; receiving pseudo-labels returned by the multiple edge servers, wherein, for any one of the edge servers, the pseudo-label is generated by that edge server according to the shared image dataset and its trained second person re-identification network; and training the updated first person re-identification network according to the shared image dataset and the pseudo-labels returned by the multiple edge servers to obtain a trained first person re-identification network.

The cloud server receives the pseudo-label returned by each edge server. Since each pseudo-label is generated by an edge server according to the shared image dataset and its trained second person re-identification network, the pseudo-label can represent the network characteristics of that trained second person re-identification network. Training the updated first person re-identification network according to the shared image dataset and the pseudo-labels returned by the multiple edge servers is therefore equivalent to combining the network characteristics of all the edge servers to train the updated first person re-identification network, which effectively improves the stability and convergence of the training process. The shared image dataset refers to an image dataset that both the cloud server and every edge server can use for network training.

FIG. 2 shows a structural diagram of exemplary network training provided according to an embodiment of the present disclosure. As shown in FIG. 2, the trained second person re-identification networks on the multiple edge servers constitute teacher network 1, teacher network 2, ..., teacher network N, where N is the number of edge servers and N > 1. The updated first person re-identification network on the cloud server constitutes the student network. Teacher network 1 uses the shared image dataset to generate pseudo-labels and sends them to the cloud server; teacher network 2 does the same; and so on up to teacher network N, which likewise generates pseudo-labels from the shared image dataset and sends them to the cloud server.
In each edge server, the second person re-identification network is trained to obtain a trained second person re-identification network corresponding to second network parameters, and the second network parameters are then sent to the cloud server.

In a possible implementation manner, the edge server is connected to at least one image acquisition device, and the edge server and the at least one image acquisition device are located in the same geographical area; the local image dataset is obtained by the edge server from the at least one image acquisition device. In a case where at least one image acquisition device is provided within the same geographical area, an edge server may be provided within that geographical area; in this case, the image acquisition devices do not need to have storage or computing capability. The edge server is connected to each image acquisition device and acquires images from each image acquisition device to construct the local image dataset. The edge server receives the first network parameters corresponding to the first person re-identification network sent by the cloud server, trains the second person re-identification network according to the local image dataset and the first network parameters to obtain a trained second person re-identification network corresponding to second network parameters, and then sends the second network parameters to the cloud server.
In a possible implementation manner, training the second person re-identification network according to the local image dataset, the identity classification network and the first network parameters to obtain the trained second person re-identification network includes: training the second person re-identification network and the identity classification network according to the local image dataset and the first network parameters to obtain the trained second person re-identification network and a trained identity classification network.

In a possible implementation manner, the local image dataset includes image data corresponding to multiple identities, and the dimension of the identity classification network is related to the number of the multiple identities. Since the trained person re-identification network is a network for identifying persons in images, the training process needs a local image dataset that includes image data corresponding to multiple identities, together with an identity classification network whose dimension is related to the number of identities included in the local image dataset. For example, if the local image dataset includes image data corresponding to 100 identities, the dimension of the identity classification network is 100; that is, the identity classification network distinguishes 100 different identity classes.
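The relationship between the local identity count and the classifier dimension, together with the fact that only the re-identification backbone parameters are shared with the cloud, can be sketched as follows (the parameter names and the 128-dimensional feature size are illustrative assumptions, not taken from the disclosure):

```python
import numpy as np

FEATURE_DIM = 128  # illustrative re-ID feature dimension

def build_combined_network(num_identities, feature_dim=FEATURE_DIM):
    """Second re-ID network (backbone) plus an identity-classification head
    whose output dimension equals the number of identities in the local dataset."""
    rng = np.random.default_rng(0)
    return {
        "backbone.weight": rng.standard_normal((feature_dim, feature_dim)),
        "classifier.weight": rng.standard_normal((num_identities, feature_dim)),
    }

def shareable_params(params):
    """Only the re-ID backbone is sent to the cloud; the classifier stays local."""
    return {k: v for k, v in params.items() if not k.startswith("classifier.")}

# An edge server whose local dataset covers 100 identities.
net = build_combined_network(num_identities=100)
```

Filtering out the classifier keeps the structures exchanged between cloud and edge identical even when the edge classifiers differ in dimension.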
The edge server combines the local second person re-identification network and the identity classification network into a combined network, trains the combined network using the first network parameters received from the cloud server and the local image dataset, and obtains a trained combined network, wherein the trained combined network includes a trained second person re-identification network and a trained identity classification network, and the trained second person re-identification network corresponds to second network parameters. The edge server then sends the second network parameters to the cloud server. Since the first person re-identification network and the second person re-identification network have the same network structure, the first person re-identification network can be updated using the second network parameters.

In a possible implementation manner, the method further includes: storing the trained identity classification network in the edge server. Since the first person re-identification network trained on the cloud server does not need a classifier network during actual person re-identification, and in order to save communication bandwidth and keep the network structures of the cloud server and the edge servers consistent during the joint training based on the federated learning algorithm, the edge server sends only the second network parameters corresponding to the trained second person re-identification network to the cloud server and stores the trained identity classification network locally.

In a possible implementation manner, the method further includes: receiving the shared image dataset sent by the cloud server; generating a pseudo-label according to the shared image dataset and the trained second person re-identification network; and sending the pseudo-label to the cloud server. Still taking FIG. 2 as an example, the edge server receives the shared image dataset sent by the cloud server and uses the shared image dataset and the locally trained second person re-identification network to generate pseudo-labels, which it then sends to the cloud server. Because the pseudo-labels can represent the network characteristics of the trained second person re-identification networks, the cloud server can train the updated first person re-identification network according to the pseudo-labels to obtain the trained first person re-identification network, whose network performance is thereby brought closer to that of the trained second person re-identification networks on the edge servers. The stability and convergence of the training process of the first person re-identification network are thus effectively improved.
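Edge-side pseudo-label generation can be sketched as follows: the trained second re-identification network (the teacher) scores the shared images, and its soft predictions are returned as pseudo-labels. The teacher here is a stand-in function; the real network and its output format are not specified by the disclosure:

```python
import numpy as np

def softmax(z):
    """Numerically stable row-wise softmax."""
    z = z - z.max(axis=1, keepdims=True)
    e = np.exp(z)
    return e / e.sum(axis=1, keepdims=True)

def generate_pseudo_labels(teacher_logits_fn, shared_images):
    """Run the trained second re-ID network (teacher) on the shared image set
    and return its soft predictions as pseudo-labels (hypothetical interface)."""
    logits = teacher_logits_fn(shared_images)
    return softmax(logits)

# A stand-in teacher that maps every image to the same fixed logits.
images = np.zeros((2, 4))
pseudo = generate_pseudo_labels(lambda x: np.tile([2.0, 0.0], (len(x), 1)), images)
```

Only these pseudo-labels travel to the cloud server; the shared images themselves are already available on both sides, so no local data leaves the edge.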
The edge server A is connected to image acquisition device 1 and image acquisition device 2, which are located in the same geographical area (for example, the same community or the same company); edge server A obtains images from image acquisition device 1 and image acquisition device 2 to construct a local image dataset. The edge server B is connected to terminal device 3, terminal device 4 and terminal device 5, which are image acquisition devices (image acquisition device 3, image acquisition device 4 and image acquisition device 5; for example, cameras). Edge server B, image acquisition device 3, image acquisition device 4 and image acquisition device 5 are set in the same geographical area (for example, the same community or the same company), and edge server B acquires images from image acquisition devices 3, 4 and 5 to construct a local image dataset. The cloud server trains the person re-identification network jointly with the two edge servers (edge server A and edge server B). During training, each image dataset remains stored locally on its edge server without being uploaded to the cloud server, so data privacy is preserved while the person re-identification network is trained effectively.

Based on the above, the present disclosure proposes two training architectures combining federated learning and person re-identification: a cloud-edge architecture and a cloud-edge-end architecture.

Cloud-edge architecture: the cloud server communicates directly with smart cameras, and the cloud coordinates multiple smart cameras to train simultaneously. Each smart camera caches images at the edge and deletes them periodically to reduce storage pressure at the edge. This architecture requires the smart cameras to have certain computing, storage and communication capabilities.
Cloud-edge-end architecture: an edge gateway (that is, the above edge server) is connected to multiple smart cameras, and the cloud server is connected to multiple edge gateways. Person re-identification training images are transmitted from the smart cameras to the edge gateway and cached there, and the edge gateway performs federated learning training with the cloud server. During this process, the data remains at the edge gateway, so data privacy is still protected. A typical application scenario is multiple communities jointly training a person re-identification model: each community has an edge gateway connected to multiple smart cameras, and through federated learning the data remains within the community, not transmitted to other communities or the cloud server, thereby protecting data privacy.

In a possible implementation manner, when the cloud server trains the person re-identification network jointly with multiple edge servers, some of the edge servers may be image acquisition devices (for example, smart cameras) that communicate directly with the cloud server, while others may be edge servers connected to at least one image acquisition device; this is not specifically limited in the embodiments of the present disclosure.

FIG. 7 shows a structural diagram of exemplary network training provided according to an embodiment of the present disclosure. As shown in FIG. 7, the cloud server can communicate with multiple edge servers (edge server 1, edge server 2, ..., edge server N), and the first person re-identification network included in the cloud server and the second person re-identification network included in each edge server have the same network structure. Each edge server also includes a local image dataset and an identity classification network.
The cloud server sends the first network parameters corresponding to the first pedestrian re-identification network to the multiple edge servers. After receiving the first network parameters, each edge server trains its second pedestrian re-identification network using the local image dataset and the identity classification network, obtaining a trained second pedestrian re-identification network with corresponding second network parameters, and a trained identity classification network. To ensure that the network structure jointly trained by the cloud server and the edge servers remains consistent, each edge server sends only the second network parameters corresponding to the trained second pedestrian re-identification network to the cloud server. The cloud server updates the first pedestrian re-identification network according to the second network parameters returned by the multiple edge servers, obtaining an updated first pedestrian re-identification network. It then sends the first network parameters corresponding to the updated first pedestrian re-identification network to the multiple edge servers for another round of training. Training ends when the recognition accuracy of the updated first pedestrian re-identification network in the cloud server reaches a threshold, or when the number of training rounds reaches a preset number. The general federated learning algorithm, Federated Averaging (FedAvg), requires the multi-party models to be synchronized (the person re-identification deep learning models, i.e. the above-mentioned person re-identification networks) to be exactly the same. 
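The cloud/edge round-trip described above can be sketched as follows. This is a minimal illustration only: `train_on_edge` and its toy update rule are hypothetical stand-ins for an edge server's real gradient-based training, and only the parameter flow mirrors the text.

```python
import copy

def train_on_edge(params, local_dataset):
    """Hypothetical stand-in for one edge server's local training step.
    It returns updated parameters for the second pedestrian re-identification
    network; the identity classification network and the images themselves
    never leave the edge server."""
    data_mean = sum(local_dataset) / len(local_dataset)
    # Toy update rule for illustration only.
    return {k: 0.9 * v + 0.1 * data_mean for k, v in params.items()}

def federated_round(first_net_params, edge_datasets):
    """One cloud/edge round: broadcast, local training, aggregation."""
    returned = []
    for dataset in edge_datasets:
        # The cloud sends the first network parameters; each edge trains
        # its structurally identical second network on its private data.
        returned.append(train_on_edge(copy.deepcopy(first_net_params), dataset))
    # The cloud updates the first network from the returned second
    # parameters (plain averaging here; weighted updates are also allowed).
    return {k: sum(p[k] for p in returned) / len(returned)
            for k in first_net_params}

def train(first_net_params, edge_datasets, max_rounds=5):
    # In the disclosure the loop also stops early once the recognition
    # accuracy of the updated first network reaches a threshold.
    for _ in range(max_rounds):
        first_net_params = federated_round(first_net_params, edge_datasets)
    return first_net_params
```

Note that only parameters cross the cloud/edge boundary in `federated_round`; the local datasets stay inside each `train_on_edge` call, which is the privacy property the text emphasizes.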
The classifier layer of each party's person re-identification deep learning model (that is, the above-mentioned identity classification network) depends on how many distinct pedestrians that party's data contains, so the classifier layers of the models participating in training may differ, meaning the multi-party models participating in federated learning may differ; the standard FedAvg algorithm is therefore not applicable in the above application scenarios. As can be seen from the above, the present disclosure improves the federated learning algorithm by allowing the models of the multiple parties participating in federated learning to be partially different, so that federated learning can be better applied to person re-identification training. Since the amounts of data in the local image datasets of different edge servers differ, the data across edge servers is heterogeneous. When multiple edge servers are combined to train the first person re-identification network, in order to reduce the impact of data heterogeneity on the accuracy of the updated first person re-identification network, each edge server can use a weight determination method based on the training effect to determine the weight of the second network parameters corresponding to its trained second pedestrian re-identification network. The cloud server then combines the second network parameters returned by the edge servers according to these weights when updating the first pedestrian re-identification network, obtaining an updated first person re-identification network with higher accuracy. The specific steps of the weight determination method based on the training effect are as described in the relevant parts of the foregoing embodiments and are not repeated here. 
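The weighted update mentioned above can be sketched as a per-parameter weighted average. Normalizing the weights to sum to one is an assumption here, since the exact combination rule is deferred to earlier embodiments not shown in this excerpt:

```python
def weighted_aggregate(second_params_list, weights):
    """Weighted average of the second network parameters returned by the
    edge servers, using the per-edge weights that reflect each server's
    training effect. Weights are normalized to sum to 1 (an assumption)."""
    total = sum(weights)
    norm = [w / total for w in weights]
    keys = second_params_list[0].keys()
    return {k: sum(w * p[k] for w, p in zip(norm, second_params_list))
            for k in keys}
```

An edge server whose local training changed its model more (a larger "training effect") thus pulls the aggregated first network further toward its parameters than a server whose model barely moved.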
Since the local image datasets of different edge servers are collected under different scenes (lighting, angle), the data across edge servers is heterogeneous. As a result, the trained second person re-identification network that each edge server obtains by training on its local image dataset with the first network parameters performs better locally than the updated first person re-identification network that the cloud server trains in conjunction with the multiple edge servers. In order to improve the stability and convergence of the training process of the first person re-identification network, a knowledge distillation algorithm can be used: based on the trained second person re-identification network and a shared image dataset in each edge server, the updated first person re-identification network in the cloud server is further trained, thereby effectively improving the stability and convergence of the training process of the first person re-identification network. The specific training process based on the knowledge distillation algorithm is described in the relevant part of the above-mentioned embodiments and is not repeated here. Based on the above, it can be seen that the present disclosure proposes a knowledge distillation method that takes the local models of the multiple parties participating in federated learning as teacher models and the model of the cloud server as the student model, so that the knowledge of the teacher models is better passed to the student model, thereby improving the stability and convergence of model training. In the process of training the pedestrian re-identification network based on the network structure shown in FIG. 7, the weight determination method based on the training effect and the network training based on the knowledge distillation algorithm can be used separately or in combination. 
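As a sketch of how teacher pseudo-labels and the student objective fit together in standard knowledge distillation (the temperature value and the exact loss are assumptions; the disclosure defers the specifics to earlier embodiments):

```python
import math

def soft_labels(teacher_logits, temperature=2.0):
    """Temperature-scaled softmax: the soft pseudo-label a trained second
    (teacher) network emits for one image in the shared dataset."""
    scaled = [z / temperature for z in teacher_logits]
    m = max(scaled)  # subtract the max for numerical stability
    exps = [math.exp(z - m) for z in scaled]
    total = sum(exps)
    return [e / total for e in exps]

def distillation_loss(student_probs, pseudo_label):
    """Cross-entropy pushing the cloud (student) first network's prediction
    toward the teacher's pseudo-label on a shared image."""
    eps = 1e-12  # guard against log(0)
    return -sum(t * math.log(p + eps)
                for t, p in zip(pseudo_label, student_probs))
```

Minimizing `distillation_loss` over the shared image dataset is what transfers the personalized edge models' knowledge into the shared cloud model without moving any private images.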
The embodiments of the present disclosure do not specifically limit this. In one application scenario, multiple companies or institutions may want to jointly train a person re-identification network to improve its robustness. To avoid the data privacy leakage that would result from aggregating the data of multiple parties onto the same server, the pedestrian re-identification network can be jointly trained based on the network structure shown in FIG. 7: each party communicates directly with the cloud server, the data is still stored locally during training and never uploaded to the cloud server, and the pedestrian re-identification network in the cloud server is trained effectively while the data privacy of the multiple companies or institutions is protected. In another application scenario, company A provides company B with a pedestrian re-identification network training service. If the image data of each image acquisition device (for example, a smart camera) of company B were uploaded to company A, data privacy would be leaked. In this case, the pedestrian re-identification network can be jointly trained based on the network structure shown in FIG. 7, with company A acting as the cloud server and the image acquisition devices of company B acting as the multiple edge servers. The data remains stored locally at company B and does not need to be uploaded to company A, so that the person re-identification network can be trained effectively while protecting company B's data privacy. Embodiments of the present disclosure also provide a pedestrian re-identification method. 
The pedestrian re-identification method can be executed by a terminal device or another processing device, where the terminal device can be an image acquisition device (for example, a smart camera), user equipment (UE), a mobile device, a user terminal, a terminal, a cellular phone, a cordless phone, a personal digital assistant (PDA), a handheld device, a computing device, an in-vehicle device, a wearable device, or the like, and the other processing device can be a server or a cloud server, or the like. In some possible implementations, the pedestrian re-identification method may be implemented by a processor calling computer-readable instructions stored in a memory. The method may include: performing pedestrian re-identification processing, through a target pedestrian re-identification network, on at least one frame of images to be identified obtained within a target geographical area, and determining a pedestrian re-identification result; wherein the target pedestrian re-identification network is trained using the network training method of the foregoing embodiments. The target pedestrian re-identification network may perform pedestrian re-identification processing on the at least one frame of images to be identified within the target geographic area and determine whether a specific pedestrian is present in the at least one frame. In a possible implementation, the target person re-identification network is the updated first person re-identification network or the trained first person re-identification network. Since the updated first person re-identification network or the trained first person re-identification network in the cloud server is universal, that is, it can be applied to any application scenario, the updated first pedestrian re-identification network or the trained first pedestrian re-identification network in the cloud server can be used to perform pedestrian re-identification processing on the at least one frame of images to be identified obtained within the target geographic area, so as to obtain the pedestrian re-identification result. In a possible implementation manner, in the case that an edge server is included within the target geographic area and the edge server includes a trained second pedestrian re-identification network, the target pedestrian re-identification network is the trained second pedestrian re-identification network. Combining the above embodiments of the network training method for the cloud server and the edge servers, it can be seen that, since the local image datasets of different edge servers are collected under different scenes (lighting, angle), the data across edge servers is heterogeneous. The trained second person re-identification network that each edge server obtains from its local image dataset is personalized and better suited to the local scene, so the trained second person re-identification network in each edge server performs better locally than the updated first person re-identification network trained by the cloud server in conjunction with the multiple edge servers. Therefore, when an edge server is included in the target geographical area and the edge server includes a trained second pedestrian re-identification network, that trained second pedestrian re-identification network, being better suited to the local scene of the target geographical area, can be used to perform pedestrian re-identification processing on the at least one frame of images to be identified, so as to improve the accuracy of the processing result. 
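A minimal sketch of the inference step described above, assuming the target network maps a frame to a feature vector and that matching is done by thresholded cosine similarity (the threshold value and the matching rule are illustrative assumptions, not fixed by the text):

```python
import math

def cosine_similarity(u, v):
    """Cosine similarity between two feature vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    return dot / (math.sqrt(sum(a * a for a in u)) *
                  math.sqrt(sum(b * b for b in v)))

def re_identify(target_network, frames, query_feature, threshold=0.7):
    """Run the target pedestrian re-identification network over the frames
    captured within the target geographic area and flag, per frame, whether
    the queried pedestrian appears (feature similarity above a threshold)."""
    results = []
    for frame in frames:
        feature = target_network(frame)  # extract the frame's feature vector
        results.append(cosine_similarity(feature, query_feature) >= threshold)
    return results
```

Here `target_network` would be either the cloud server's universal first network or, where a local edge server exists, its personalized trained second network, per the discussion above.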
In the present disclosure, after a single deployment, further training iterations can be performed according to the data generated by the edge servers, and the model can be continuously iterated and updated at low cost. It can be understood that the above method embodiments mentioned in the embodiments of the present disclosure can be combined with each other to form combined embodiments without violating their principle and logic. Those skilled in the art can understand that, in the above methods of the specific implementation manner, the specific execution order of the steps should be determined by their functions and possible internal logic. In addition, the embodiments of the present disclosure also provide a network training/pedestrian re-identification apparatus, an electronic device, a computer-readable storage medium, and a program, all of which can be used to implement any of the network training/pedestrian re-identification methods provided by the embodiments of the present disclosure; for the corresponding technical solutions and descriptions, refer to the corresponding records in the method section, which are not repeated here. FIG. 8 shows a block diagram of a network training apparatus according to an embodiment of the present disclosure. The network training apparatus is applied to a cloud server, and the cloud server includes a first pedestrian re-identification network. As shown in FIG. 
8, the apparatus 80 includes: a sending part 81, configured to send first network parameters corresponding to the first pedestrian re-identification network to multiple edge servers; a receiving part 82, configured to receive second network parameters returned by the multiple edge servers, wherein, for any edge server, the edge server includes a second pedestrian re-identification network, an identity classification network and a local image dataset, the second pedestrian re-identification network and the first pedestrian re-identification network have the same network structure, and the second network parameters are obtained by the edge server after training the second pedestrian re-identification network according to the local image dataset, the identity classification network and the first network parameters; and an updating part 83, configured to update the first pedestrian re-identification network according to the second network parameters returned by the multiple edge servers, to obtain an updated first pedestrian re-identification network. In a possible implementation manner, the updating part 83 includes: a receiving sub-part, configured to receive weights corresponding to the second network parameters returned by the multiple edge servers, wherein, for any edge server, the weight corresponding to its second network parameters is determined by the edge server according to the second pedestrian re-identification network before training and the second pedestrian re-identification network after training; a first update sub-part, configured to perform a weighted average of the second network parameters returned by the multiple edge servers according to the weights corresponding to the second network parameters, to obtain updated first network parameters; and a second update sub-part, configured to update the first pedestrian re-identification network according to the updated first network parameters, to obtain the updated first pedestrian re-identification network. 
In a possible implementation manner, the sending part 81 is further configured to send a shared image dataset to the multiple edge servers; the receiving part 82 is further configured to receive pseudo-labels returned by the multiple edge servers, wherein, for any edge server, the pseudo-labels are generated by the edge server according to the shared image dataset and the trained second person re-identification network; and the apparatus 80 further includes: a network training part, configured to train the updated first pedestrian re-identification network according to the pseudo-labels returned by the multiple edge servers, to obtain a trained first pedestrian re-identification network. In a possible implementation manner, the network training part is further configured to: determine an average pseudo-label according to the pseudo-labels returned by the multiple edge servers; and train the updated first person re-identification network according to the average pseudo-label, to obtain the trained first person re-identification network. FIG. 9 shows a block diagram of a network training apparatus according to an embodiment of the present disclosure. The network training apparatus is applied to an edge server, and the edge server includes a second person re-identification network, an identity classification network and a local image dataset. As shown in FIG. 
9, the apparatus 90 includes: a receiving part 91, configured to receive first network parameters corresponding to the first pedestrian re-identification network sent by the cloud server, wherein the first pedestrian re-identification network and the second pedestrian re-identification network have the same network structure; a network training part 92, configured to train the second person re-identification network according to the local image dataset, the identity classification network and the first network parameters, to obtain a trained second person re-identification network, wherein the trained second pedestrian re-identification network corresponds to second network parameters; and a sending part 93, configured to send the second network parameters to the cloud server. In a possible implementation manner, the network training part 92 is further configured to: train the second person re-identification network and the identity classification network according to the local image dataset and the first network parameters, to obtain the trained second person re-identification network and a trained identity classification network. In a possible implementation manner, the apparatus 90 further includes: a storage part, configured to store the trained identity classification network in the edge server. In a possible implementation manner, the local image dataset includes image data corresponding to multiple identities, and the dimension of the identity classification network is related to the number of the multiple identities. In a possible implementation manner, the receiving part 91 is further configured to receive a shared image dataset sent by the cloud server; the apparatus 90 further includes: a pseudo-label generating part, configured to generate pseudo-labels according to the shared image dataset and the trained second person re-identification network; and the sending part 93 is further configured to send the pseudo-labels to the cloud server. 
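The average pseudo-label computation used by the cloud-side network training part can be sketched directly, assuming each pseudo-label is a probability vector over the same classes (the text leaves the exact label format to earlier embodiments):

```python
def average_pseudo_label(per_edge_labels):
    """Element-wise mean of the pseudo-labels returned by the edge servers
    for one image of the shared dataset; the updated first pedestrian
    re-identification network is then trained toward this averaged target."""
    n = len(per_edge_labels)
    length = len(per_edge_labels[0])
    return [sum(lbl[i] for lbl in per_edge_labels) / n for i in range(length)]
```

Averaging the per-edge pseudo-labels gives the student a single consensus target per shared image, smoothing out disagreements between the personalized teacher models.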
In a possible implementation manner, the apparatus 90 further includes: a first determining part, configured to determine a first feature vector according to the second person re-identification network before training and the local image dataset, and to determine a second feature vector according to the trained second pedestrian re-identification network and the local image dataset; a second determining part, configured to determine a cosine distance between the first feature vector and the second feature vector; and a third determining part, configured to determine the weight corresponding to the second network parameters according to the cosine distance; the sending part 93 is further configured to send the weight corresponding to the second network parameters to the cloud server. In a possible implementation manner, the edge server is an image acquisition device, and the local image dataset is acquired by the image acquisition device. In a possible implementation manner, the edge server is connected to at least one image acquisition device, the edge server and the at least one image acquisition device are located in the same geographical area, and the local image dataset is obtained by the edge server from the at least one image acquisition device. Embodiments of the present disclosure further provide a pedestrian re-identification apparatus, including: a pedestrian re-identification part, configured to perform pedestrian re-identification processing, through a target pedestrian re-identification network, on at least one frame of images to be identified obtained within a target geographic area, and determine a pedestrian re-identification result; wherein the target pedestrian re-identification network is trained by the above-mentioned network training method. In a possible implementation manner, the target person re-identification network is the updated first person re-identification network or the trained first person re-identification network. 
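The three determining parts above (features before and after training, cosine distance, weight) can be sketched as follows. Mapping the cosine distance directly to the weight is an assumption: the text only says the weight is determined "according to the cosine distance", so the identity mapping here is purely illustrative.

```python
import math

def cosine_distance(u, v):
    """1 minus cosine similarity between the first and second feature
    vectors extracted from the local image dataset."""
    dot = sum(a * b for a, b in zip(u, v))
    nu = math.sqrt(sum(a * a for a in u))
    nv = math.sqrt(sum(b * b for b in v))
    return 1.0 - dot / (nu * nv)

def weight_from_training_effect(feat_before, feat_after):
    """A larger change between the features produced before and after local
    training is read as a larger training effect, hence a larger weight.
    Using the cosine distance itself as the weight is an assumption."""
    return cosine_distance(feat_before, feat_after)
```

An edge server whose local training left its features essentially unchanged (distance near 0) would thus contribute little to the weighted aggregation on the cloud server.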
In a possible implementation manner, in the case that an edge server is included within the target geographic area and the edge server includes a trained second pedestrian re-identification network, the target pedestrian re-identification network is the trained second pedestrian re-identification network. In some embodiments, the functions of, or the parts included in, the network training/person re-identification apparatus provided by the embodiments of the present disclosure may be configured to execute the methods described in the above method embodiments; for specific implementations, refer to the descriptions of the above method embodiments, which, for brevity, are not repeated here. In the embodiments of the present disclosure and other embodiments, a "part" may be a part of a circuit, a part of a processor, a part of a program or software, and so on; it may of course also be a unit, a module, or non-modular. Embodiments of the present disclosure further provide a computer-readable storage medium on which computer program instructions are stored, and when the computer program instructions are executed by a processor, the foregoing methods are implemented. The computer-readable storage medium may be a non-volatile computer-readable storage medium. Embodiments of the present disclosure further provide an electronic device, including: a processor; and a memory configured to store instructions executable by the processor; wherein the processor is configured to invoke the instructions stored in the memory to execute the above methods. Embodiments of the present disclosure also provide a computer program product, including computer-readable code; when the computer-readable code runs on a device, a processor in the device executes instructions for implementing the network training/pedestrian re-identification methods provided in any of the above embodiments. 
Embodiments of the present disclosure further provide another computer program product configured to store computer-readable instructions which, when executed, cause a computer to perform the operations of the network training/person re-identification method provided by any of the foregoing embodiments. The electronic device may be provided as a terminal, a server, or a device in another form. FIG. 10 shows a block diagram of an electronic device according to an embodiment of the present disclosure. As shown in FIG. 10, the electronic device 800 may be a terminal such as an image capture device (for example, a smart camera), a mobile phone, a computer, a digital broadcasting terminal, a messaging device, a game console, a tablet device, a medical device, a fitness device, or a personal digital assistant. Referring to FIG. 10, the electronic device 800 may include one or more of the following components: a processing component 802, a memory 804, a power supply component 806, a multimedia component 808, an audio component 810, an input/output (I/O) interface 812, a sensor component 814, and a communication component 816. The processing component 802 generally controls the overall operation of the electronic device 800, such as operations associated with display, phone calls, data communications, camera operations, and recording operations. The processing component 802 may include one or more processors 820 to execute instructions to perform all or part of the steps of the methods described above. Additionally, the processing component 802 may include one or more modules to facilitate interaction between the processing component 802 and other components. For example, the processing component 802 may include a multimedia module to facilitate interaction between the multimedia component 808 and the processing component 802. The memory 804 is configured to store various types of data to support operation at the electronic device 800. 
Examples of such data include instructions for any application or method operating on the electronic device 800, contact data, phonebook data, messages, pictures, videos, and the like. The memory 804 may be implemented by any type of volatile or non-volatile storage device or a combination thereof, such as static random access memory (SRAM), electrically erasable programmable read-only memory (EEPROM), erasable programmable read-only memory (EPROM), programmable read-only memory (PROM), read-only memory (ROM), magnetic memory, flash memory, magnetic disk, or optical disk. The power supply component 806 provides power to the various components of the electronic device 800, and may include a power management system, one or more power supplies, and other components associated with generating, managing, and distributing power for the electronic device 800. The multimedia component 808 includes a screen that provides an output interface between the electronic device 800 and the user. In some embodiments, the screen may include a liquid crystal display (LCD) and a touch panel (TP). If the screen includes a touch panel, the screen may be implemented as a touch screen to receive input signals from the user. The touch panel includes one or more touch sensors to sense touches, swipes, and gestures on the touch panel. The touch sensors may not only sense the boundary of a touch or swipe action but also detect the duration and pressure associated with the touch or swipe action. In some embodiments, the multimedia component 808 includes a front-facing camera and/or a rear-facing camera. When the electronic device 800 is in an operation mode, such as a shooting mode or a video mode, the front camera and/or the rear camera may receive external multimedia data. Each of the front and rear cameras may be a fixed optical lens system or have focal length and optical zoom capability. The audio component 810 is configured to output and/or input audio signals. 
For example, the audio component 810 includes a microphone (MIC) that is configured to receive external audio signals when the electronic device 800 is in an operating mode, such as a calling mode, a recording mode, and a voice recognition mode. The received audio signal may be further stored in memory 804 or transmitted via communication component 816 . In some embodiments, audio component 810 also includes a speaker for outputting audio signals.
1/ O接口 812为处理组件 802和外围接口模块之间提供接口, 上述外围接口模块可以是 键盘, 点击轮, 按钮等。 这些按钮可包括但不限于: 主页按钮、 音量按钮、 启动按钮和锁 定按钮。 传感 器组件 814包括一个或多个传感器, 用于为电子设备 800提供各个方面的状态评估。 例如, 传感器组件 814可以检测到电子设备 800的打开 /关闭状态, 组件的相对定位, 例如所 述组件为电子设备 800的显示器和小键盘, 传感器组件 814还可以检测电子设备 800或电子 设备 800 —个组件的位置改变, 用户与电子设备 800接触的存在或不存在, 电子设备 800方 位或加速 /减速和电子设备 800的温度变化。 传感器组件 814可以包括接近传感器, 被配置用 来在没有任何的物理接触时检测附近物体的存在 。 传感器组件 814还可以包括光传感器, 如互补金属氧化物半导体 (CMOS)或电荷耦合装置 (CCD) 图像传感器, 用于在成像应 用中使用。在一些实施例中, 该传感器组件 814还可以包括加速度传感器, 陀螺仪传感器, 磁传感器, 压力传感器或温度传感器。 通信组件 816被配置为便于电子设备 800和其他设备之间有线或无线方式的通信。 电子 设备 800可以接入基于通信标准的无线网络,如无线网络( WiFi),第二代移动通信技术( 2G) 或第三代移动通信技术 (3G), 或它们的组合。 在一个示例性实施例中, 通信组件 816经由 广播信道接收来自外部广播管理系统的广播信号或广播相关信息。 在一个示例性实施例中, 所述通信组件 816还包括近场通信(NFC)模块, 以促进短程通信。 例如, 在 NFC模块可基 于射频识别( RFID)技札 红外数据协会( IrDA)技札超宽带( UWB)技札蓝牙( BT) 技术和其他技术来实现。 在 示例性实施例中, 电子设备 800可以被一个或多个应用专用集成电路( ASIC)、 数字 信号处理器 (DSP)、 数字信号处理设备(DSPD)、 可编程逻辑器件 (PLD)、 现场可编程 门阵列(FPGA)、控制器、微控制器、微处理器或其他电子元件实现, 用于执行上述方法。 在 示例性实施例中, 还提供了一种非易失性计算机可读存储介质, 例如包括计算机程 序指令的存储器 804 , 上述计算机程序指令可由电子设备 800的处理器 820执行以完成上述 方法。 图 11示出根据本公开实施例的一种电子设备的框图。 如图 11所示, 电子设备 1900可以 被提供为一服务器。 参照图 11 , 电子设备 1900包括处理组件 1922, 其进一步包括一个或多 个处理器, 以及由存储器 1932所代表的存储器资源, 用于存储可由处理组件 1922的执行的 指令, 例如应用程序。 存储器 1932中存储的应用程序可以包括一个或一个以上的每一个对 应于一组指令的模块。 此外, 处理组件 1922被配置为执行指令, 以执行上述方法。 电子设备 1900还可以包括一个电源组件 1926被配置为执行电子设备 1900的电源管理, 一个有线或无线网络接口 1950被配置为将电子设备 1900连接到网络,和一个输入输出( I/O) 接口 1958。 电子设备 1900可以操作基于存储在存储器 1932的操作系统, 例如微软月艮务器操 作系统 (Windows Server™), 苹果公司推出的基于图形用户界面操作系统(Mac OS X™), 多用户多进程的计算机操作系统( Unix™),自由和开放原代码的类 Unix操作系统( Linux™), 开放原代码的类 Unix操作系统( FreeBSD™)或类似。 在 示例性实施例中, 还提供了一种非易失性计算机可读存储介质, 例如包括计算机程 序指令的存储器 1932, 上述计算机程序指令可由电子设备 1900的处理组件 1922执行以完成 上述方法。 本公开 实施例可以是系统、 方法和 /或计算机程序产品。 计算机程序产品可以包括计算 机可读存储介质, 其上载有用于使处理器实现本公开实施例的各个方面的计算机可读程序 指令。 计 算机可读存储介质可以是可以保持和存储由指令执行设备使用的指令的有形设备。 计算机可读存储介质例如可以是 (但不限于) 电存储设备、 磁存储设备、 光存储设备、 电 磁存储设备、 半导体存储设备或者上述的任意合适的组合。 计算机可读存储介质的更具体 的例子 (非穷举的列表) 包括: 便携式计算机盘、 硬盘、 随机存取存储器 (RAM)、 只读 存储器( ROM)、可擦式可编程只读存储器( EPROM或闪存)、静态随机存取存储器( SRAM)、 便携式压缩盘只读存储器 (CD-ROM)、 数字多功能盘(DVD)、 记忆棒、 
软盘、 机械编码 设备、 例如其上存储有指令的打孔卡或凹槽内凸起结构、 以及上述的任意合适的组合。 这 里所使用的计算机可读存储介质不被解释为瞬时信号本身, 诸如无线电波或者其他自由传 播的电磁波、 通过波导或其他传输媒介传播的电磁波(例如, 通过光纤电缆的光脉冲)、 或者通过电线传输的电信号。 这里 所描述的计算机可读程序指令可以从计算机可读存储介质下载到各个计算 /处理 设备, 或者通过网络、 例如因特网、 局域网、 广域网和 /或无线网下载到外部计算机或外部 存储设备。 网络可以包括铜传输电缆、 光纤传输、 无线传输、 路由器、 防火墙、 交换机、 网关计算机和 /或边缘服务器。 每个计算 /处理设备中的网络适配卡或者网络接口从网络接 收计算机可读程序指令, 并转发该计算机可读程序指令, 以供存储在各个计算 /处理设备中 的计算机可读存储介质中。 用 于执行本公开实施例操作的计算机程序指令可以是汇编指令、 指令集架构 (ISA) 指令、 机器指令、 机器相关指令、 微代码、 固件指令、 状态设置数据、 或者以一种或多种 编程语言的任意组合编写的源代码或目标代码, 所述编程语言包括面向对象的编程语言一 诸如 Smalltalk. C++等, 以及常规的过程式编程语言一诸如 “C”语言或类似的编程语言。 计 算机可读程序指令可以完全地在用户计算机上执行、 部分地在用户计算机上执行、 作为一 个独立的软件包执行、 部分在用户计算机上部分在远程计算机上执行、 或者完全在远程计 算机或服务器上执行。 在涉及远程计算机的情形中, 远程计算机可以通过任意种类的网络 — 包括局域网(LAN)或广域网(WAN)一连接到用户计算机, 或者, 可以连接到外部计算机 (例如利用因特网服务提供商来通过因特网连接)。 在一些实施例中, 通过利用计算机可 读程序指令的状态信息来个性化定制电子电路, 例如可编程逻辑电路、 现场可编程门阵列 ( FPGA)或可编程逻辑阵列 (PLA), 该电子电路可以执行计算机可读程序指令, 从而实 现本公开实施例的各个方面。 这里参照根据本公开实施例的方法、 装置(系统)和计算机程序产品的流程图和 /或框 图描述了本公开实施例的各个方面。 应当理解, 流程图和 /或框图的每个方框以及流程图和 /或框图中各方框的组合, 都可以由计算机可读程序指令实现。 这些计算机可读程序指令可以提供给通用计算机、 专用计算机或其它可编程数据处理 装置的处理器, 从而生产出一种机器, 使得这些指令在通过计算机或其它可编程数据处理 装置的处理器执行时, 产生了实现流程图和 /或框图中的一个或多个方框中规定的功能 /动 作的装置。 也可以把这些计算机可读程序指令存储在计算机可读存储介质中, 这些指令使 得计算机、 可编程数据处理装置和 /或其他设备以特定方式工作, 从而, 存储有指令的计算 机可读介质则包括一个制造品,其包括实现流程图和 /或框图中的一个或多个方框中规定的 功能 /动作的各个方面的指令。 也可 以把计算机可读程序指令加载到计算机、 其它可编程数据处理装置、 或其它设备 上, 使得在计算机、 其它可编程数据处理装置或其它设备上执行一系列操作步骤, 以产生 计算机实现的过程, 从而使得在计算机、 其它可编程数据处理装置、 或其它设备上执行的 指令实现流程图和 /或框图中的一个或多个方框中规定的功能 /动作。 附图中的流程图和框图显示了根据本公开实施例的多个实施例的系统、 方法和计算机 程序产品的可能实现的体系架构、 功能和操作。 在这点上, 流程图或框图中的每个方框可 以代表一个模块、 程序段或指令的一部分, 所述模块、 程序段或指令的一部分包含一个或 多个用于实现规定的逻辑功能的可执行指令。 在有些作为替换的实现中, 方框中所标注的 功能也可以以不同于附图中所标注的顺序发生。 例如, 两个连续的方框实际上可以基本并 行地执行, 它们有时也可以按相反的顺序执行, 这依所涉及的功能而定。 也要注意的是, 框图和 /或流程图中的每个方框、 以及框图和 /或流程图中的方框的组合, 可以用执行规定 的功能或动作的专用的基于硬件的系统来实现, 或者可以用专用硬件与计算机指令的组合 来实现。 该计算机程序产品可以具体通过硬件、 软件或其结合的方式实现。 在一个可选实施例 中, 所述计算机程序产品具体体现为计算机存储介质, 在另一个可选实施例中, 计算机程 序产品具体体现为软件产品, 例如软件开发 & (Software Development Kit, SDK)等等。 以上已经描述了本公开的各实施例, 上述说明是示例性的, 并非穷尽性的, 并且也不 限于所披露的各实施例。 在不偏离所说明的各实施例的范围和精神的情况下, 对于本技术 
领域的普通技术人员来说许多修改和变更都是显而易见的。 本文中所用术语的选择, 旨在 最好地解释各实施例的原理、 实际应用或对市场中的技术的改进, 或者使本技术领域的其 它普通技术人员能理解本文披露的各实施例。 工业实用性 本公 开实施例涉及一种网络训练、 行人重识别方法及装置、 存储介质、 计算机程序, 云端服务器中包括第一行人重识别网络, 所述方法包括: 向多个边端服务器发送第一行人 重识别网络对应的第一网络参数; 接收多个边端服务器返回的第二网络参数, 其中, 针对 任一边端服务器, 边端服务器中包括第二行人重识别网络、 身份分类网络和本地图像数据 集, 第二行人重识别网络和第一行人重识别网络具有相同的网络结构, 第二网络参数是边 端服务器根据本地图像数据集、 身份分类网络和第一网络参数对第二行人重识别网络进行 训练之后得到的; 根据多个边端服务器返回的第二网络参数, 对第一行人重识别网络进行 更新, 得到更新后的第一行人重识别网络。 由于云端服务器联合多个边端服务器对行人重 识别网络进行训练, 训练过程中图像数据集仍然保存在边端服务器中, 无需上传至云端服 务器, 从而可以在有效训练行人重识别网络的同时保护数据隐私性。 The I/O interface 812 provides an interface between the processing component 802 and a peripheral interface module, and the above-mentioned peripheral interface module may be a keyboard, a click wheel, a button, and the like. These buttons may include, but are not limited to: home button, volume buttons, start button, and lock button. Sensor assembly 814 includes one or more sensors for providing status assessments of various aspects of electronic device 800 . For example, the sensor component 814 can detect the open/closed state of the electronic device 800, the relative positioning of the components, for example, the components are the display and the keypad of the electronic device 800, and the sensor component 814 can also detect the electronic device 800 or the electronic device 800— Changes in the positions of components, presence or absence of user contact with the electronic device 800 , orientation or acceleration/deceleration of the electronic device 800 , and changes in the temperature of the electronic device 800 . Sensor assembly 814 may include a proximity sensor configured to detect the presence of nearby objects in the absence of any physical contact. Sensor assembly 814 may also include a light sensor, such as a complementary metal oxide semiconductor (CMOS) or charge coupled device (CCD) image sensor, for use in imaging applications. 
In some embodiments, the sensor component 814 may further include an acceleration sensor, a gyroscope sensor, a magnetic sensor, a pressure sensor or a temperature sensor. Communication component 816 is configured to facilitate wired or wireless communication between electronic device 800 and other devices. The electronic device 800 may access a wireless network based on a communication standard, such as a wireless network (WiFi), a second generation mobile communication technology (2G) or a third generation mobile communication technology (3G), or a combination thereof. In one exemplary embodiment, the communication component 816 receives broadcast signals or broadcast related information from an external broadcast management system via a broadcast channel. In an exemplary embodiment, the communication component 816 also includes a near field communication (NFC) module to facilitate short-range communication. For example, the NFC module may be implemented based on radio frequency identification (RFID) technology, infrared data association (IrDA) technology, ultra-wideband (UWB) technology, Bluetooth (BT) technology and other technologies. In an exemplary embodiment, electronic device 800 may be implemented by one or more application specific integrated circuits (ASICs), digital signal processors (DSPs), digital signal processing devices (DSPDs), programmable logic devices (PLDs), field programmable A programmed gate array (FPGA), controller, microcontroller, microprocessor or other electronic component implementation is used to perform the above method. In an exemplary embodiment, a non-volatile computer-readable storage medium is also provided, such as a memory 804 including computer program instructions that can be executed by the processor 820 of the electronic device 800 to complete the above method. FIG. 11 shows a block diagram of an electronic device according to an embodiment of the present disclosure. As shown in FIG. 
11, the electronic device 1900 may be provided as a server. The electronic device 1900 includes a processing component 1922, which further includes one or more processors, and memory resources represented by a memory 1932 for storing instructions executable by the processing component 1922, such as application programs. An application program stored in the memory 1932 may include one or more modules, each corresponding to a set of instructions. Additionally, the processing component 1922 is configured to execute instructions to perform the above-described methods. The electronic device 1900 may also include a power supply assembly 1926 configured to perform power management of the electronic device 1900, a wired or wireless network interface 1950 configured to connect the electronic device 1900 to a network, and an input/output (I/O) interface 1958. The electronic device 1900 may operate based on an operating system stored in the memory 1932, such as Microsoft's server operating system (Windows Server™), Apple's graphical user interface-based operating system (Mac OS X™), a multi-user, multi-process computer operating system (Unix™), a free and open-source Unix-like operating system (Linux™), an open-source Unix-like operating system (FreeBSD™) or the like. In an exemplary embodiment, a non-volatile computer-readable storage medium is also provided, such as a memory 1932 comprising computer program instructions executable by the processing component 1922 of the electronic device 1900 to accomplish the above method. Embodiments of the present disclosure may be systems, methods and/or computer program products. The computer program product may include a computer-readable storage medium having computer-readable program instructions loaded thereon for causing a processor to implement various aspects of the embodiments of the present disclosure. 
A computer-readable storage medium may be a tangible device that can hold and store instructions for use by an instruction execution device. The computer-readable storage medium may be, for example, but not limited to, an electrical storage device, a magnetic storage device, an optical storage device, an electromagnetic storage device, a semiconductor storage device, or any suitable combination of the above. More specific examples (a non-exhaustive list) of computer-readable storage media include: a portable computer disk, a hard disk, a random access memory (RAM), a read only memory (ROM), an erasable programmable read only memory (EPROM or flash memory), a static random access memory (SRAM), a portable compact disk read only memory (CD-ROM), a digital versatile disk (DVD), a memory stick, a floppy disk, a mechanically encoded device such as a punch card or a raised structure in a groove on which instructions are stored, and any suitable combination of the above. Computer-readable storage media, as used herein, are not to be construed as transient signals per se, such as radio waves or other freely propagating electromagnetic waves, electromagnetic waves propagating through waveguides or other transmission media (for example, light pulses through fiber optic cables), or electrical signals transmitted through electrical wires. The computer-readable program instructions described herein can be downloaded to various computing/processing devices from a computer-readable storage medium, or to an external computer or external storage device over a network such as the Internet, a local area network, a wide area network, and/or a wireless network. The network may include copper transmission cables, fiber optic transmission, wireless transmission, routers, firewalls, switches, gateway computers and/or edge servers. 
The network adapter card or network interface in each computing/processing device receives computer-readable program instructions from the network, and forwards the computer-readable program instructions for storage in a computer-readable storage medium in the respective computing/processing device. Computer program instructions for performing the operations of embodiments of the present disclosure may be assembly instructions, instruction set architecture (ISA) instructions, machine instructions, machine-dependent instructions, microcode, firmware instructions, state-setting data, or source code or object code written in any combination of one or more programming languages, including object-oriented programming languages such as Smalltalk, C++ and the like, and conventional procedural programming languages such as the "C" language or similar programming languages. The computer-readable program instructions may execute entirely on the user's computer, partly on the user's computer, as a stand-alone software package, partly on the user's computer and partly on a remote computer, or entirely on the remote computer or server. In the case of a remote computer, the remote computer may be connected to the user's computer through any kind of network, including a local area network (LAN) or a wide area network (WAN), or may be connected to an external computer (for example, through the Internet using an Internet service provider). In some embodiments, an electronic circuit, such as a programmable logic circuit, a field programmable gate array (FPGA), or a programmable logic array (PLA), is personalized by utilizing state information of the computer-readable program instructions, and the electronic circuit may execute the computer-readable program instructions to implement various aspects of the embodiments of the present disclosure. 
Aspects of embodiments of the present disclosure are described herein with reference to flowchart illustrations and/or block diagrams of methods, apparatus (systems) and computer program products according to embodiments of the disclosure. It will be understood that each block of the flowchart illustrations and/or block diagrams, and combinations of blocks in the flowchart illustrations and/or block diagrams, can be implemented by computer-readable program instructions. These computer-readable program instructions can be provided to the processor of a general purpose computer, special purpose computer or other programmable data processing apparatus to produce a machine, such that the instructions, when executed by the processor of the computer or other programmable data processing apparatus, create means for implementing the functions/acts specified in one or more blocks of the flowcharts and/or block diagrams. These computer-readable program instructions can also be stored in a computer-readable storage medium, and these instructions cause a computer, a programmable data processing apparatus and/or other equipment to work in a specific manner, so that the computer-readable medium storing the instructions comprises an article of manufacture including instructions which implement various aspects of the functions/acts specified in one or more blocks of the flowcharts and/or block diagrams. Computer-readable program instructions can also be loaded onto a computer, other programmable data processing apparatus, or other equipment, so that a series of operational steps are performed on the computer, other programmable data processing apparatus, or other equipment to produce a computer-implemented process, thereby causing the instructions executed on the computer, other programmable data processing apparatus, or other device to implement the functions/acts specified in one or more blocks of the flowcharts and/or block diagrams. 
The flowcharts and block diagrams in the figures illustrate the architecture, functionality, and operation of possible implementations of systems, methods and computer program products according to various embodiments of the present disclosure. In this regard, each block in a flowchart or block diagram may represent a module, program segment or part of an instruction, which comprises one or more executable instructions for implementing the specified logical functions. In some alternative implementations, the functions noted in the blocks may occur out of the order noted in the figures. For example, two blocks shown in succession may, in fact, be executed substantially concurrently, or the blocks may sometimes be executed in the reverse order, depending upon the functionality involved. It is also noted that each block of the block diagrams and/or flowchart illustrations, and combinations of blocks in the block diagrams and/or flowchart illustrations, can be implemented by dedicated hardware-based systems that perform the specified functions or acts, or can be implemented using a combination of dedicated hardware and computer instructions. The computer program product can be implemented by hardware, software or a combination thereof. In an optional embodiment, the computer program product is embodied as a computer storage medium, and in another optional embodiment, the computer program product is embodied as a software product, such as a software development kit (SDK) and the like. Various embodiments of the present disclosure have been described above; the above description is exemplary, not exhaustive, and not limited to the disclosed embodiments. Numerous modifications and variations will be apparent to those of ordinary skill in the art without departing from the scope and spirit of the described embodiments. 
The terminology used herein is chosen to best explain the principles of the various embodiments, their practical application or improvement over technologies in the market, or to enable others of ordinary skill in the art to understand the various embodiments disclosed herein. Industrial Applicability The embodiments of the present disclosure relate to a network training method and apparatus, a pedestrian re-identification method and apparatus, a storage medium, and a computer program. A cloud server includes a first pedestrian re-identification network, and the method includes: sending first network parameters corresponding to the first pedestrian re-identification network to a plurality of edge servers; receiving second network parameters returned by the plurality of edge servers, wherein, for any edge server, the edge server includes a second pedestrian re-identification network, an identity classification network and a local image data set, the second pedestrian re-identification network and the first pedestrian re-identification network have the same network structure, and the second network parameters are obtained after the edge server trains the second pedestrian re-identification network according to the local image data set, the identity classification network and the first network parameters; and updating the first pedestrian re-identification network according to the second network parameters returned by the plurality of edge servers to obtain an updated first pedestrian re-identification network. Since the cloud server trains the pedestrian re-identification network jointly with the plurality of edge servers, the image data sets remain on the edge servers during training and do not need to be uploaded to the cloud server, so that the pedestrian re-identification network can be trained effectively while data privacy is protected.
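The aggregation step summarized above amounts to a weighted average of the parameters returned by the edge servers. The following is a minimal illustrative sketch only, not part of the disclosed embodiments: it assumes each network's parameters can be flattened to a list of floats, and the helper name `weighted_average` is our own.

```python
def weighted_average(param_sets, weights):
    """Aggregate the second network parameters returned by the edge servers.

    param_sets: one flat list of parameter values per edge server (assumption:
                parameters are represented as plain floats for illustration)
    weights: the weight each edge server returned alongside its parameters
    """
    total = sum(weights)
    return [
        sum(w * params[i] for w, params in zip(weights, param_sets)) / total
        for i in range(len(param_sets[0]))
    ]

# One communication round, cloud side: the first network's parameters are
# replaced by the weighted average of the edge servers' second network
# parameters; the local image data sets never leave the edge servers.
updated_first_net = weighted_average(
    [[0.0, 2.0], [2.0, 4.0]],  # parameters returned by two edge servers
    [1.0, 3.0],                # their reported weights
)
# updated_first_net == [1.5, 3.5]
```

With weights 1 and 3, the second edge server's parameters dominate the update, which matches the intent of weighting each contribution by how much that server's local training is trusted.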

Claims

权利要求书 Claims
1、 一种网络训练方法, 所述方法应用于云端服务器, 所述云端服务器中包括第一行人重识别网络, 所述方法包括: 向多个边端服务器发送所述第一行人重识别网络对应的第一网络参数; 接收所述多个边端服务器返回的第二网络参数, 其中, 针对任一所述边端服务器, 所述边端服务器中包括第二行人重识别网络、 身份分类网络和本地图像数据集, 所述第二行人重识别网络和所述第一行人重识别网络具有相同的网络结构, 所述第二网络参数是所述边端服务器根据所述本地图像数据集、 所述身份分类网络和所述第一网络参数对所述第二行人重识别网络进行训练之后得到的; 根据所述多个边端服务器返回的所述第二网络参数, 对所述第一行人重识别网络进行更新, 得到更新后的所述第一行人重识别网络。 1. A network training method, wherein the method is applied to a cloud server, the cloud server includes a first pedestrian re-identification network, and the method comprises: sending first network parameters corresponding to the first pedestrian re-identification network to a plurality of edge servers; receiving second network parameters returned by the plurality of edge servers, wherein, for any one of the edge servers, the edge server includes a second pedestrian re-identification network, an identity classification network and a local image data set, the second pedestrian re-identification network and the first pedestrian re-identification network have the same network structure, and the second network parameters are obtained after the edge server trains the second pedestrian re-identification network according to the local image data set, the identity classification network and the first network parameters; and updating the first pedestrian re-identification network according to the second network parameters returned by the plurality of edge servers to obtain an updated first pedestrian re-identification network.
2、 根据权利要求 1所述的方法, 其中, 所述根据所述多个边端服务器返回的所述第二网络参数, 对所述第一行人重识别网络进行更新, 得到更新后的所述第一行人重识别网络, 包括: 接收所述多个边端服务器返回的所述第二网络参数对应的权重, 其中, 针对任一所述边端服务器, 所述第二网络参数对应的权重是所述边端服务器根据训练前的所述第二行人重识别网络和训练后的所述第二行人重识别网络确定得到的; 根据所述多个边端服务器返回的所述第二网络参数对应的权重, 对所述多个边端服务器返回的所述第二网络参数进行加权平均, 得到更新后的所述第一网络参数; 根据更新后的所述第一网络参数, 对所述第一行人重识别网络进行更新, 得到更新后的所述第一行人重识别网络。 2. The method according to claim 1, wherein updating the first pedestrian re-identification network according to the second network parameters returned by the plurality of edge servers to obtain the updated first pedestrian re-identification network comprises: receiving weights corresponding to the second network parameters returned by the plurality of edge servers, wherein, for any one of the edge servers, the weight corresponding to the second network parameters is determined by the edge server according to the second pedestrian re-identification network before training and the second pedestrian re-identification network after training; performing a weighted average on the second network parameters returned by the plurality of edge servers according to the weights corresponding to the second network parameters returned by the plurality of edge servers to obtain updated first network parameters; and updating the first pedestrian re-identification network according to the updated first network parameters to obtain the updated first pedestrian re-identification network.
3、 根据权利要求 1或 2所述的方法, 其中, 所述方法还包括: 向所述多个边端服务器发送共享图像数据集; 接收所述多个边端服务器返回的伪标签, 其中, 针对任一所述边端服务器, 所述伪标签是所述边端服务器根据所述共享图像数据集以及训练后的所述第二行人重识别网络生成的; 根据所述共享图像数据集和所述多个边端服务器返回的伪标签, 对更新后的所述第一行人重识别网络进行训练, 得到训练后的所述第一行人重识别网络。 3. The method according to claim 1 or 2, wherein the method further comprises: sending a shared image data set to the plurality of edge servers; receiving pseudo labels returned by the plurality of edge servers, wherein, for any one of the edge servers, the pseudo labels are generated by the edge server according to the shared image data set and the trained second pedestrian re-identification network; and training the updated first pedestrian re-identification network according to the shared image data set and the pseudo labels returned by the plurality of edge servers to obtain the trained first pedestrian re-identification network.
4、 根据权利要求 3所述的方法, 其中, 所述根据所述共享图像数据集和所述多个边端服务器返回的伪标签, 对更新后的所述第一行人重识别网络进行训练, 得到训练后的所述第一行人重识别网络, 包括: 根据所述多个边端服务器返回的伪标签, 确定平均伪标签; 根据所述共享图像数据集和所述平均伪标签, 对更新后的所述第一行人重识别网络进行训练, 得到训练后的所述第一行人重识别网络。 4. The method according to claim 3, wherein training the updated first pedestrian re-identification network according to the shared image data set and the pseudo labels returned by the plurality of edge servers to obtain the trained first pedestrian re-identification network comprises: determining average pseudo labels according to the pseudo labels returned by the plurality of edge servers; and training the updated first pedestrian re-identification network according to the shared image data set and the average pseudo labels to obtain the trained first pedestrian re-identification network.
5、 一种网络训练方法, 其中, 所述方法应用于边端服务器, 所述边端服务器中包括第二行人重识别网络、 身份分类网络和本地图像数据集, 所述方法包括: 接收云端服务器发送的第一行人重识别网络对应的第一网络参数, 其中, 所述第一行人重识别网络和所述第二行人重识别网络具有相同的网络结构; 根据所述本地图像数据集、 所述身份分类网络和所述第一网络参数, 对所述第二行人重识别网络进行训练, 得到训练后的所述第二行人重识别网络, 其中, 所述第二行人重识别网络对应第二网络参数; 向所述云端服务器发送所述第二网络参数。 5. A network training method, wherein the method is applied to an edge server, the edge server includes a second pedestrian re-identification network, an identity classification network and a local image data set, and the method comprises: receiving first network parameters corresponding to a first pedestrian re-identification network sent by a cloud server, wherein the first pedestrian re-identification network and the second pedestrian re-identification network have the same network structure; training the second pedestrian re-identification network according to the local image data set, the identity classification network and the first network parameters to obtain the trained second pedestrian re-identification network, wherein the second pedestrian re-identification network corresponds to second network parameters; and sending the second network parameters to the cloud server.
6、 根据权利要求 5所述的方法, 其中, 所述根据所述本地图像数据集、 所述身份分类网络和所述第一网络参数, 对所述第二行人重识别网络进行训练, 得到训练后的所述第二行人重识别网络, 包括: 根据所述本地图像数据集和所述第一网络参数, 对所述第二行人重识别网络和所述身份分类网络进行训练, 得到训练后的所述第二行人重识别网络和训练后的所述身份分类网络。 6. The method according to claim 5, wherein training the second pedestrian re-identification network according to the local image data set, the identity classification network and the first network parameters to obtain the trained second pedestrian re-identification network comprises: training the second pedestrian re-identification network and the identity classification network according to the local image data set and the first network parameters to obtain the trained second pedestrian re-identification network and the trained identity classification network.
7、 根据权利要求 6所述的方法, 其中, 所述方法还包括: 将训练后的所述身份分类网络存储在所述边端服务器中。 7. The method according to claim 6, wherein the method further comprises: storing the trained identity classification network in the edge server.
8、 根据权利要求 6或 7所述的方法, 其中, 所述本地图像数据集中包括多个身份对应 的图像数据; 所述身份分类网络的维度与所述多个身份的个数相关。 8. The method according to claim 6 or 7, wherein the local image data set includes image data corresponding to multiple identities; the dimension of the identity classification network is related to the number of the multiple identities.
9、 根据权利要求 5至 8中任一项所述的方法, 其中, 所述方法还包括: 接收所述云端服务器发送的共享图像数据集; 根据所述共享图像数据集和训练后的所述第二行人重识别网络, 生成伪标签; 向所述云端服务器发送所述伪标签。 9. The method according to any one of claims 5 to 8, wherein the method further comprises: receiving a shared image data set sent by the cloud server; generating pseudo labels according to the shared image data set and the trained second pedestrian re-identification network; and sending the pseudo labels to the cloud server.
10、 根据权利要求 5至 9任一项所述的方法, 其中, 所述方法还包括: 根据训练前的所述第二行人重识别网络和所述本地图像数据集确定第一特征向量, 以及根据训练后的所述第二行人重识别网络和所述本地图像数据集, 确定第二特征向量; 确定所述第一特征向量和所述第二特征向量之间的余弦距离; 根据所述余弦距离, 确定所述第二网络参数对应的权重; 向所述云端服务器发送所述第二网络参数对应的权重。 10. The method according to any one of claims 5 to 9, wherein the method further comprises: determining a first feature vector according to the second pedestrian re-identification network before training and the local image data set, and determining a second feature vector according to the trained second pedestrian re-identification network and the local image data set; determining a cosine distance between the first feature vector and the second feature vector; determining a weight corresponding to the second network parameters according to the cosine distance; and sending the weight corresponding to the second network parameters to the cloud server.
11、根据权利要求 5至 10中任一项所述的方法,其中,所述边端服务器为图像采集设备; 所述本地图像数据集是根据所述图像采集设备采集得到的。 11. The method according to any one of claims 5 to 10, wherein the edge server is an image acquisition device; and the local image data set is acquired according to the image acquisition device.
12、 根据权利要求 5至 10中任一项所述的方法, 其中, 所述边端服务器与至少一个图 像采集设备连接, 所述边端服务器和所述至少一个图像采集设备位于相同地理区域范围; 所述本地图像数据集是所述边端服务器从所述至少一个图像采集设备中获取得到的。 12. The method according to any one of claims 5 to 10, wherein the edge server is connected to at least one image acquisition device, and the edge server and the at least one image acquisition device are located in the same geographical area ; the local image data set is obtained by the edge server from the at least one image acquisition device.
13、 一种行人重识别方法, 包括: 通过目标行人重识别网络对在目标地理区域范围内获取到的至少一帧待识别图像进行行人重识别处理, 确定行人重识别结果; 其中, 所述目标行人重识别网络采用权利要求 1至 12中任一项所述的网络训练方法训练得到。 13. A pedestrian re-identification method, comprising: performing pedestrian re-identification processing on at least one frame of image to be identified acquired within a target geographic area through a target pedestrian re-identification network to determine a pedestrian re-identification result; wherein the target pedestrian re-identification network is trained by using the network training method according to any one of claims 1 to 12.
14、 根据权利要求 13所述的方法, 其中, 所述目标行人重识别网络为更新后的第一行 人重识别网络或训练后的第一行人重识别网络。 14. The method according to claim 13, wherein the target pedestrian re-identification network is an updated first pedestrian re-identification network or a trained first pedestrian re-identification network.
15、 根据权利要求 13所述的方法, 其中, 在所述目标地理区域范围内包括边端服务器, 且所述边端服务器中包括训练后的第二行人重识别网络的情况下, 所述目标行人重识别网络为训练后的第二行人重识别网络。 15. The method according to claim 13, wherein, in a case where an edge server is included within the target geographic area and the edge server includes a trained second pedestrian re-identification network, the target pedestrian re-identification network is the trained second pedestrian re-identification network.
16、 一种网络训练装置, 所述装置应用于云端服务器, 所述云端服务器中包括第一行人重识别网络, 所述装置包括: 发送部分, 被配置为向多个边端服务器发送所述第一行人重识别网络对应的第一网络参数; 接收部分, 被配置为接收所述多个边端服务器返回的第二网络参数, 其中, 针对任一所述边端服务器, 所述边端服务器中包括第二行人重识别网络、 身份分类网络和本地图像数据集, 所述第二行人重识别网络和所述第一行人重识别网络具有相同的网络结构, 所述第二网络参数是所述边端服务器根据所述本地图像数据集、 所述身份分类网络和所述第一网络参数对所述第二行人重识别网络进行训练之后得到的; 更新部分, 被配置为根据所述多个边端服务器返回的所述第二网络参数, 对所述第一行人重识别网络进行更新, 得到更新后的所述第一行人重识别网络。 16. A network training apparatus, wherein the apparatus is applied to a cloud server, the cloud server includes a first pedestrian re-identification network, and the apparatus comprises: a sending part configured to send first network parameters corresponding to the first pedestrian re-identification network to a plurality of edge servers; a receiving part configured to receive second network parameters returned by the plurality of edge servers, wherein, for any one of the edge servers, the edge server includes a second pedestrian re-identification network, an identity classification network and a local image data set, the second pedestrian re-identification network and the first pedestrian re-identification network have the same network structure, and the second network parameters are obtained after the edge server trains the second pedestrian re-identification network according to the local image data set, the identity classification network and the first network parameters; and an updating part configured to update the first pedestrian re-identification network according to the second network parameters returned by the plurality of edge servers to obtain an updated first pedestrian re-identification network.
17、 一种网络训练装置, 所述装置应用于边端服务器, 所述边端服务器中包括第二行人重识别网络、 身份分类网络和本地图像数据集, 所述装置包括: 接收部分, 被配置为接收云端服务器发送的第一行人重识别网络对应的第一网络参数, 其中, 所述第一行人重识别网络和所述第二行人重识别网络具有相同的网络结构; 网络训练部分, 被配置为根据所述本地图像数据集、 所述身份分类网络和所述第一网络参数, 对所述第二行人重识别网络进行训练, 得到训练后的所述第二行人重识别网络, 其中, 所述第二行人重识别网络对应第二网络参数; 发送部分, 被配置为向所述云端服务器发送所述第二网络参数。 17. A network training apparatus, wherein the apparatus is applied to an edge server, the edge server includes a second pedestrian re-identification network, an identity classification network and a local image data set, and the apparatus comprises: a receiving part configured to receive first network parameters corresponding to a first pedestrian re-identification network sent by a cloud server, wherein the first pedestrian re-identification network and the second pedestrian re-identification network have the same network structure; a network training part configured to train the second pedestrian re-identification network according to the local image data set, the identity classification network and the first network parameters to obtain the trained second pedestrian re-identification network, wherein the second pedestrian re-identification network corresponds to second network parameters; and a sending part configured to send the second network parameters to the cloud server.
18、 一种行人重识别装置, 包括: 行人重识别部分, 被配置为通过目标行人重识别网络对在目标地理区域范围内获取到的至少一帧待识别图像进行行人重识别处理, 确定行人重识别结果; 其中, 所述目标行人重识别网络采用权利要求 1至 12中任一项所述的网络训练方法训练得到。 18. A pedestrian re-identification apparatus, comprising: a pedestrian re-identification part configured to perform pedestrian re-identification processing on at least one frame of image to be identified acquired within a target geographic area through a target pedestrian re-identification network to determine a pedestrian re-identification result; wherein the target pedestrian re-identification network is trained by using the network training method according to any one of claims 1 to 12.
19、 一种电子设备, 包括: 处理器; 被配置为存储处理器可执行指令的存储器; 其中, 所述处理器被配置为调用所述存储器存储的指令, 以执行权利要求 1至 15中任意一项所述的方法。 19. An electronic device, comprising: a processor; and a memory configured to store processor-executable instructions; wherein the processor is configured to invoke the instructions stored in the memory to perform the method according to any one of claims 1 to 15.
20、 一种计算机可读存储介质, 其上存储有计算机程序指令, 所述计算机程序指令被 处理器执行时实现权利要求 1至 15中任意一项所述的方法。 20. A computer-readable storage medium having computer program instructions stored thereon, the computer program instructions implementing the method of any one of claims 1 to 15 when executed by a processor.
21、 一种计算机程序, 包括计算机可读代码, 当所述计算机可读代码在电子设备中运行时, 所述电子设备中的处理器执行权利要求 1至 15中任意一项所述的方法。 21. A computer program, comprising computer-readable code, wherein when the computer-readable code runs in an electronic device, a processor in the electronic device executes the method according to any one of claims 1 to 15.
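On the edge-server side, the claims describe two small computations: determining average pseudo labels from those returned by the edge servers (claim 4), and deriving an aggregation weight from the cosine distance between feature vectors extracted before and after local training (claim 10). The sketch below is illustrative only: it assumes features and pseudo labels are plain float vectors, and using the raw cosine distance itself as the weight is one possible choice (the claims only require that the weight be determined from that distance, without fixing a formula).

```python
import math

def cosine_distance(u, v):
    # 1 - cos(u, v); a larger value means the features drifted more
    # during local training on the edge server.
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return 1.0 - dot / (norm_u * norm_v)

def weight_from_features(features_before, features_after):
    # Claim 10: the weight for this edge server's second network parameters
    # is determined from the cosine distance between the first and second
    # feature vectors (identity mapping chosen here for illustration).
    return cosine_distance(features_before, features_after)

def average_pseudo_labels(label_sets):
    # Claim 4: element-wise mean over the pseudo labels returned by the
    # edge servers for the shared image data set.
    n = len(label_sets)
    return [sum(labels[i] for labels in label_sets) / n
            for i in range(len(label_sets[0]))]
```

For example, identical before/after features give a cosine distance (and thus an illustrative weight) of 0, while orthogonal features give 1; the averaged pseudo labels of `[0.0, 1.0]` and `[1.0, 0.0]` are `[0.5, 0.5]`.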
PCT/IB2020/060047 2020-08-25 2020-10-27 Network training method and apparatus, person re-identification method and apparatus, storage medium, and computer program WO2022043741A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN202010864291.4 2020-08-25
CN202010864291.4A CN112001321B (en) 2020-08-25 2020-08-25 Network training method, pedestrian re-identification method, device, electronic equipment and storage medium

Publications (1)

Publication Number Publication Date
WO2022043741A1 true WO2022043741A1 (en) 2022-03-03

Family

ID=73471079

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/IB2020/060047 WO2022043741A1 (en) 2020-08-25 2020-10-27 Network training method and apparatus, person re-identification method and apparatus, storage medium, and computer program

Country Status (3)

Country Link
CN (1) CN112001321B (en)
TW (1) TWI769635B (en)
WO (1) WO2022043741A1 (en)

Cited By (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN114743243A (en) * 2022-04-06 2022-07-12 平安科技(深圳)有限公司 Human face recognition method, device, equipment and storage medium based on artificial intelligence
CN115022316A (en) * 2022-05-20 2022-09-06 阿里巴巴(中国)有限公司 End cloud cooperative data processing system, method, equipment and computer storage medium
CN115310130A (en) * 2022-08-15 2022-11-08 南京航空航天大学 Multi-site medical data analysis method and system based on federal learning
CN115601791A (en) * 2022-11-10 2023-01-13 江南大学(Cn) Unsupervised pedestrian re-identification method based on Multiformer and outlier sample re-distribution
CN117851838A (en) * 2024-03-07 2024-04-09 广州大学 Identification method of heterogeneous data sources in collaborative learning process

Families Citing this family (9)

Publication number Priority date Publication date Assignee Title
CN112507893A (en) * 2020-12-14 2021-03-16 华南理工大学 Distributed unsupervised pedestrian re-identification method based on edge calculation
CN112906857B (en) * 2021-01-21 2024-03-19 商汤国际私人有限公司 Network training method and device, electronic equipment and storage medium
CN112861695B (en) * 2021-02-02 2023-10-24 北京大学 Pedestrian identity re-identification method and device, electronic equipment and storage medium
CN112906677B (en) * 2021-05-06 2021-08-03 南京信息工程大学 Pedestrian target detection and re-identification method based on improved SSD (solid State disk) network
CN113205863B (en) * 2021-06-04 2022-03-25 广西师范大学 Training method of individualized model based on distillation semi-supervised federal learning
CN113326938A (en) * 2021-06-21 2021-08-31 商汤国际私人有限公司 Network training method, pedestrian re-identification method, network training device, pedestrian re-identification device, electronic equipment and storage medium
CN113326939A (en) * 2021-06-21 2021-08-31 商汤国际私人有限公司 Network training method, pedestrian re-identification method, network training device, pedestrian re-identification device, electronic equipment and storage medium
CN113792606B (en) * 2021-08-18 2024-04-26 清华大学 Low-cost self-supervision pedestrian re-identification model construction method based on multi-target tracking
CN113807369A (en) * 2021-09-26 2021-12-17 北京市商汤科技开发有限公司 Target re-identification method and device, electronic equipment and storage medium

Citations (6)

Publication number Priority date Publication date Assignee Title
CN107851213A (en) * 2015-07-22 2018-03-27 高通股份有限公司 Shift learning in neutral net
CN108230296A (en) * 2017-11-30 2018-06-29 腾讯科技(深圳)有限公司 The recognition methods of characteristics of image and device, storage medium, electronic device
CN109993300A (en) * 2017-12-29 2019-07-09 华为技术有限公司 A kind of training method and device of neural network model
EP3528179A1 (en) * 2018-02-15 2019-08-21 Koninklijke Philips N.V. Training a neural network
CN110490058A (en) * 2019-07-09 2019-11-22 北京迈格威科技有限公司 Training method, device, system and the computer-readable medium of pedestrian detection model
CN111126108A (en) * 2018-10-31 2020-05-08 北京市商汤科技开发有限公司 Training method and device of image detection model and image detection method and device

Family Cites Families (10)

Publication number Priority date Publication date Assignee Title
US20090112455A1 (en) * 2007-10-24 2009-04-30 Yahoo! Inc. Method and system for rendering simplified point finding maps
WO2017024242A1 (en) * 2015-08-05 2017-02-09 Equifax Inc. Model integration tool
CN107563327B (en) * 2017-08-31 2021-07-20 武汉大学 Pedestrian re-identification method and system based on self-walking feedback
CN110795477A (en) * 2019-09-20 2020-02-14 平安科技(深圳)有限公司 Data training method, device and system
CN110825900A (en) * 2019-11-07 2020-02-21 重庆紫光华山智安科技有限公司 Training method of feature reconstruction layer, reconstruction method of image features and related device
CN110956202B (en) * 2019-11-13 2023-08-01 重庆大学 Image training method, system, medium and intelligent device based on distributed learning
CN111291611A (en) * 2019-12-20 2020-06-16 长沙千视通智能科技有限公司 Pedestrian re-identification method and device based on Bayesian query expansion
CN111107094B (en) * 2019-12-25 2022-05-20 青岛大学 Lightweight ground-oriented medical Internet of things big data sharing system
CN111241580B (en) * 2020-01-09 2022-08-09 广州大学 Trusted execution environment-based federated learning method
CN111401281B (en) * 2020-03-23 2022-06-21 山东师范大学 Unsupervised pedestrian re-identification method and system based on deep clustering and sample learning

Patent Citations (6)

Publication number Priority date Publication date Assignee Title
CN107851213A (en) * 2015-07-22 2018-03-27 高通股份有限公司 Shift learning in neutral net
CN108230296A (en) * 2017-11-30 2018-06-29 腾讯科技(深圳)有限公司 The recognition methods of characteristics of image and device, storage medium, electronic device
CN109993300A (en) * 2017-12-29 2019-07-09 华为技术有限公司 A kind of training method and device of neural network model
EP3528179A1 (en) * 2018-02-15 2019-08-21 Koninklijke Philips N.V. Training a neural network
CN111126108A (en) * 2018-10-31 2020-05-08 北京市商汤科技开发有限公司 Training method and device of image detection model and image detection method and device
CN110490058A (en) * 2019-07-09 2019-11-22 北京迈格威科技有限公司 Training method, device and system for pedestrian detection model, and computer-readable medium

Cited By (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN114743243A (en) * 2022-04-06 2022-07-12 平安科技(深圳)有限公司 Face recognition method, device, equipment and storage medium based on artificial intelligence
CN114743243B (en) * 2022-04-06 2024-05-31 平安科技(深圳)有限公司 Face recognition method, device, equipment and storage medium based on artificial intelligence
CN115022316A (en) * 2022-05-20 2022-09-06 阿里巴巴(中国)有限公司 Device-cloud collaborative data processing system, method, equipment and computer storage medium
CN115022316B (en) * 2022-05-20 2023-08-11 阿里巴巴(中国)有限公司 Device-cloud collaborative data processing system, method, equipment and computer storage medium
CN115310130A (en) * 2022-08-15 2022-11-08 南京航空航天大学 Multi-site medical data analysis method and system based on federated learning
CN115310130B (en) * 2022-08-15 2023-11-17 南京航空航天大学 Multi-site medical data analysis method and system based on federated learning
CN115601791A (en) * 2022-11-10 2023-01-13 江南大学 Unsupervised pedestrian re-identification method based on Multiformer and outlier sample re-distribution
CN115601791B (en) * 2022-11-10 2023-05-02 江南大学 Unsupervised pedestrian re-identification method based on Multiformer and outlier sample re-distribution
CN117851838A (en) * 2024-03-07 2024-04-09 广州大学 Identification method of heterogeneous data sources in collaborative learning process

Also Published As

Publication number Publication date
CN112001321A (en) 2020-11-27
TW202209151A (en) 2022-03-01
TWI769635B (en) 2022-07-01
CN112001321B (en) 2024-06-14

Similar Documents

Publication Publication Date Title
WO2022043741A1 (en) Network training method and apparatus, person re-identification method and apparatus, storage medium, and computer program
WO2021196401A1 (en) Image reconstruction method and apparatus, electronic device and storage medium
WO2020135127A1 (en) Pedestrian recognition method and device
WO2021031609A1 (en) Living body detection method and device, electronic apparatus and storage medium
TWI767596B (en) Scene depth and camera motion prediction method, electronic equipment and computer readable storage medium
WO2022188305A1 (en) Information presentation method and apparatus, and electronic device, storage medium and computer program
TWI767217B (en) Coordinate system alignment method and apparatus, electronic device and computer-readable storage medium
WO2021036382A1 (en) Image processing method and apparatus, electronic device and storage medium
CN111241887B (en) Target object key point identification method and device, electronic equipment and storage medium
TWI778313B (en) Method and electronic equipment for image processing and storage medium thereof
TWI757668B (en) Network optimization method and device, image processing method and device, storage medium
CN111563138B (en) Positioning method and device, electronic equipment and storage medium
WO2022151686A1 (en) Scene image display method and apparatus, device, storage medium, program and product
WO2022134475A1 (en) Point cloud map construction method and apparatus, electronic device, storage medium and program
WO2022179013A1 (en) Object positioning method and apparatus, electronic device, storage medium, and program
WO2022193507A1 (en) Image processing method and apparatus, device, storage medium, program, and program product
TW202205127A (en) Target detection method, electronic equipment and computer-readable storage medium
WO2023024439A1 (en) Behavior recognition method and apparatus, electronic device and storage medium
CN113506229B (en) Neural network training and image generating method and device
TWI770531B (en) Face recognition method, electronic device and storage medium thereof
TWI751593B (en) Network training method and device, image processing method and device, electronic equipment, computer readable storage medium and computer program
CN113283343A (en) Crowd positioning method and device, electronic equipment and storage medium
CN113506324B (en) Image processing method and device, electronic equipment and storage medium
US20210383580A1 (en) Method, apparatus and system for anchor sharing, electronic device and storage medium
WO2022110801A1 (en) Data processing method and apparatus, electronic device, and storage medium

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 20951311

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 20951311

Country of ref document: EP

Kind code of ref document: A1

WWE Wipo information: entry into national phase

Ref document number: 522432273

Country of ref document: SA