WO2022252843A1 - Procédé et appareil d'entraînement de modèle de traitement de données spatio-temporelles, dispositif, et support de stockage - Google Patents

Procédé et appareil d'entraînement de modèle de traitement de données spatio-temporelles, dispositif, et support de stockage Download PDF

Info

Publication number
WO2022252843A1
WO2022252843A1 PCT/CN2022/086925 CN2022086925W WO2022252843A1 WO 2022252843 A1 WO2022252843 A1 WO 2022252843A1 CN 2022086925 W CN2022086925 W CN 2022086925W WO 2022252843 A1 WO2022252843 A1 WO 2022252843A1
Authority
WO
WIPO (PCT)
Prior art keywords
spatio
data
region
temporal
temporal data
Prior art date
Application number
PCT/CN2022/086925
Other languages
English (en)
Chinese (zh)
Inventor
李婷
包锴楠
张钧波
郑宇�
Original Assignee
京东城市(北京)数字科技有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 京东城市(北京)数字科技有限公司 filed Critical 京东城市(北京)数字科技有限公司
Publication of WO2022252843A1 publication Critical patent/WO2022252843A1/fr

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/08Learning methods
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/25Fusion techniques
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • G06N3/045Combinations of networks
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • G06N3/048Activation functions
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N5/00Computing arrangements using knowledge-based models
    • G06N5/04Inference or reasoning models
    • G06N5/041Abduction
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q50/00Information and communication technology [ICT] specially adapted for implementation of business processes of specific business sectors, e.g. utilities or tourism
    • G06Q50/10Services
    • G06Q50/26Government or public services

Definitions

  • the present disclosure relates to the field of computer technology, in particular to the field of artificial intelligence technology such as deep learning and big data, and in particular to a training method, device, equipment and storage medium for a spatio-temporal data processing model.
  • spatio-temporal data is generated all the time, such as positioning data, mobile phone signaling data, wireless network data, etc.
  • accurate and high-quality spatio-temporal data are usually a small amount, and it is difficult and expensive to obtain.
  • Accurate spatiotemporal data restoration is a very basic function in smart city applications. It can provide support for fine-grained passenger flow inference in scenic spots, offline intelligent marketing, intelligent traffic optimization scheduling, crowd gathering warning, and intelligent location selection. In related technologies, directly using a small amount of high-quality data to train the model may easily lead to over-fitting of the model, which cannot accurately reflect the real data situation. Therefore, how to improve the accuracy and reliability of spatio-temporal data is an urgent problem to be solved.
  • the disclosure provides a training method, device, equipment and storage medium of a spatio-temporal data processing model.
  • a training method for a spatio-temporal data processing model including:
  • a first degree of difference between the first data representation and the second data representation is less than a first threshold, and a second degree of difference between the first data representation and the third data representation is greater than a second threshold case, end the training process of the spatio-temporal data processing model.
  • a training device for a spatio-temporal data processing model including:
  • An extraction module configured to extract target spatio-temporal data associated with any region from the training data set, wherein the training data set includes spatio-temporal data associated with each region and attribute information of each region;
  • the first acquisition module is configured to acquire, from the training data set, first reference space-time data associated with a first reference area that is the same as the attribute information of any area, and a first reference space-time data that is different from the attribute information of any area.
  • the second acquisition module is configured to respectively input the target spatio-temporal data, the first reference spatio-temporal data and the second reference spatio-temporal data into the spatio-temporal data processing model to be trained, so as to respectively acquire the first data representation, the second reference spatio-temporal data Two data representations and a third data representation;
  • a judging module configured to have a first degree of difference between the first data representation and the second data representation less than a first threshold, and a second degree of difference between the first data representation and the third data representation If it is greater than the second threshold, the training process of the spatio-temporal data processing model ends.
  • an electronic device including:
  • the memory stores instructions executable by the at least one processor, and the instructions are executed by the at least one processor, so that the at least one processor can execute the method described in the above one aspect embodiment.
  • a non-transitory computer-readable storage medium storing computer instructions, the computer instructions are used to make the computer execute the method described in the above-mentioned one embodiment.
  • a computer program product including a computer program, when the computer program is executed by a processor, the method described in the above one embodiment is implemented.
  • a computer program includes computer program code, and when the computer program code is run on a computer, the computer executes the method described in the above one embodiment.
  • the training method, device, equipment and storage medium of the spatio-temporal data processing model provided by the present disclosure at least have the following
  • the device first extracts the target spatio-temporal data associated with any region from the training data set, wherein the training data set includes the spatio-temporal data associated with each region and the attribute information of each region, and then obtains the target spatio-temporal data associated with any region from the training data set
  • the first reference spatio-temporal data associated with the first reference area with the same attribute information of the area, and the second reference spatio-temporal data associated with the second reference area with different attribute information of any area and then the target spatio-temporal data, the first reference spatio-temporal data
  • the data and the second reference spatio-temporal data are respectively input into the spatio-temporal data processing model to be trained, so as to obtain the first data representation, the second data representation and the third data representation respectively.
  • the spatio-temporal data processing ends The training process of the model. Therefore, by using comparative self-supervised learning of spatio-temporal data, a spatio-temporal data processing model can be generated, so that spatio-temporal data can be processed to obtain accurate and reliable spatio-temporal data representations, and to improve The accuracy of spatiotemporal prediction provides the conditions.
  • FIG. 1 is a schematic flowchart of a training method for a spatio-temporal data processing model provided according to an embodiment of the present disclosure
  • FIG. 2 is a schematic flowchart of a training method for a spatio-temporal data processing model provided according to another embodiment of the present disclosure
  • Fig. 3 is a schematic flowchart of a training method for a spatio-temporal data processing model provided according to another embodiment of the present disclosure
  • FIG. 4 is a framework diagram of model pre-training of a spatio-temporal data processing model training method provided according to an embodiment of the present disclosure
  • FIG. 5 is a framework diagram of population inference for multi-source data fusion provided by an embodiment of the present disclosure
  • FIG. 6 is a structural block diagram of a training device for a spatio-temporal data processing model provided by an embodiment of the present disclosure
  • Fig. 7 is a structural block diagram of an electronic device provided by an embodiment of the present disclosure.
  • the training method of the spatio-temporal data processing model proposed in this disclosure can be executed by the training device based on the spatio-temporal data processing model provided by this disclosure, and can also be executed by the electronic equipment provided by this disclosure, wherein the electronic equipment can include but not limited to desktop computers, tablet computers and other terminal equipment, which can also be a server.
  • the training device for a spatio-temporal data processing model provided by the present disclosure is used to execute the training method of a spatio-temporal data processing model provided by the present disclosure, which is not a limitation of the present disclosure, hereinafter referred to as for "device".
  • FIG. 1 is a schematic flowchart of a training method for a spatio-temporal data processing model according to an embodiment of the present disclosure.
  • the training method of the spatio-temporal data processing model may include the following steps:
  • Step S101 extracting target spatio-temporal data associated with any region from the training data set, wherein the training data set includes spatio-temporal data associated with each region and attribute information of each region.
  • spatio-temporal data has temporal features and spatial features.
  • Temporal features represent the features of spatio-temporal data in the time dimension
  • spatial features represent the features of spatio-temporal data in spatial features.
  • spatio-temporal data may be real-time positioning data, communication data, real-time population distribution data in an area, etc., which are not limited here.
  • the target spatio-temporal data is the spatio-temporal data associated with any area, where any area may be a scenic area, a school, a shopping mall, etc., which is not limited here.
  • the attribute information of each area may be the location, area, flow of people, population data, traffic conditions, type, etc. of each area, which is not limited here.
  • Step S102 acquiring first reference space-time data associated with a first reference area with the same attribute information of any area and second reference space-time data associated with a second reference area with different attribute information of any area from the training data set .
  • the device may obtain from the training dataset the first reference spatio-temporal data associated with the first reference region that has the same attribute information of any region, and acquire the attribute information of any region from the training dataset The second reference spatio-temporal data associated with the second reference area that are different in all of them.
  • the first reference spatio-temporal data associated with the first reference region that is the same as at least one attribute information of the attribute information of any region can also be obtained from the training data set, and the attribute information of any region can be obtained from the training data set
  • the second reference spatio-temporal data associated with the second reference area in which each attribute information is different which is not limited.
  • the apparatus may determine the first threshold according to a first matching degree between the attribute information of any region and the attribute information of the first reference region. It can be understood that since any region has the same attribute information as the first reference region, that is, any region and the first reference region should be the same or similar, so the first matching degree between any region and the first reference region is relatively high, The degree of difference is small, so the first threshold is small.
  • the apparatus may determine the second threshold according to a second matching degree between the attribute information of any region and the attribute information of the second reference region. It can be understood that, since the attribute information of any area is different from that of the second reference area, that is, the difference between any area and the first reference area should be different or relatively large, so the second matching degree comparison between any area and the second reference area Low, the degree of difference is large, so the second threshold is large.
  • Step S103 input the target spatio-temporal data, the first reference spatio-temporal data and the second reference spatio-temporal data respectively into the spatio-temporal data processing model to be trained to obtain the first data representation, the second data representation and the third data representation respectively.
  • the first data representation may represent features of the target spatio-temporal data, which may be vectors, matrices, etc., which is not limited in the present disclosure.
  • the second data representation may represent the first reference spatio-temporal data
  • the third data representation may represent the second reference spatio-temporal data.
  • Step S104 when the first degree of difference between the first data representation and the second data representation is smaller than the first threshold, and the second difference between the first data representation and the third data representation is greater than the second threshold, end the spatio-temporal The training process of the data processing model.
  • the device may calculate the degree of difference between the first data representation and the second data representation, and between the first data representation and the third data representation.
  • the apparatus may determine the first degree of difference by comparing the first data representation with the second data representation, and determine the second degree of difference by comparing the first data representation with the third data representation.
  • the first threshold is the threshold of the first degree of difference determined according to the first degree of matching
  • the second threshold is the threshold of the second degree of difference determined according to the second degree of matching
  • the first reference area is the same area as the attribute information of any area, the corresponding data representations of the two will be relatively similar.
  • the second reference area is an area different from the attribute information of any area.
  • the corresponding data representations of the two will be quite different.
  • the first difference degree is less than the first threshold value, it means that the difference between the first reference area and any area is relatively small; if the second difference degree is greater than the second threshold value, it means that the difference between any area and the second reference area The difference is relatively large.
  • the data representation obtained by using the current spatio-temporal data processing model is reasonable and reliable, that is, it can meet the needs of the device for processing any spatio-temporal region, so the training process of the spatio-temporal data processing model can be ended.
  • the device first extracts the target spatio-temporal data associated with any region from the training data set, wherein the training data set includes the spatio-temporal data associated with each region and the attribute information of each region, and then obtains the target spatio-temporal data associated with any region from the training data set
  • the first reference spatio-temporal data associated with the first reference area with the same attribute information of the area, and the second reference spatio-temporal data associated with the second reference area with different attribute information of any area and then the target spatio-temporal data, the first reference spatio-temporal data
  • the data and the second reference spatio-temporal data are respectively input into the spatio-temporal data processing model to be trained, so as to obtain the first data representation, the second data representation and the third data representation respectively.
  • the spatio-temporal data processing ends The training process of the model. Therefore, by using comparative self-supervised learning of spatio-temporal data, a spatio-temporal data processing model can be generated, so that spatio-temporal data can be processed to obtain accurate and reliable spatio-temporal data representations, and to improve The accuracy of spatiotemporal prediction provides the conditions.
  • Fig. 2 is a schematic flowchart of a training method for a spatio-temporal data processing model according to another embodiment of the present disclosure.
  • the training method of the spatio-temporal data processing model may include the following steps:
  • Step S201 extracting target spatio-temporal data associated with any region from the training data set, wherein the training data set includes spatio-temporal data associated with each region and attribute information of each region.
  • step S201 reference may be made to the detailed description of any embodiment of the present disclosure, and details are not repeated here.
  • Step S202 acquiring first reference spatio-temporal data associated with a first reference region that is identical to at least one attribute information of any region's attribute information from the training data set.
  • At least one attribute information of the attribute information of the first reference area is the same as that of any area.
  • the first reference spatio-temporal data associated with the area in some embodiments, the device can also acquire the first reference spatio-temporal data associated with the first reference area that has the same area, traffic status, and population gender distribution in any area, where Not limited.
  • Step S203 acquiring second reference spatio-temporal data associated with a second reference region whose attribute information of any region is different from each attribute information from the training data set.
  • the apparatus may use an area that is different from each attribute information in the attribute information of any area as the second reference area.
  • the first reference spatio-temporal data associated with the first reference region that has the same attribute information of any region can also be obtained from the training data set; Second reference spatio-temporal data associated with a second reference area with at least one attribute information different among each attribute information of an area.
  • Step S204 input the target spatio-temporal data, the first reference spatio-temporal data and the second reference spatio-temporal data respectively into the spatio-temporal data processing model to be trained to obtain the first data representation, the second data representation and the third data representation respectively.
  • step S204 can refer to the above-mentioned embodiment, which will not be repeated here.
  • Step S205 the first degree of difference between the first data representation and the second data representation is greater than or equal to the first threshold, or the second difference between the first data representation and the third data representation is less than or equal to the second threshold
  • the training process of the spatio-temporal data processing model ends.
  • the first degree of difference between the first data representation and the second data representation is greater than or equal to the first threshold, it means that the spatio-temporal data representation of any region obtained after processing by the spatio-temporal data processing model and the first reference region The difference between them is relatively large, and since the attribute information of any region is the same as that of the first reference region, that is, the corresponding spatiotemporal data representations of the two should be the same or similar, which means that the spatiotemporal data processing model at this time is inaccurate and cannot If the condition for ending the training of the neural network training model is met, the device will continue to return to the operation of extracting target spatio-temporal data from the training data set.
  • the second degree of difference between the first data representation and the third data representation is less than or equal to the second threshold, it means that the spatio-temporal data representation of any region obtained after processing by the spatio-temporal data processing model and the second reference region The difference between them is small, and because the attribute information of any region is different from the second reference region, that is, the corresponding spatiotemporal data representations of the two should be different or have a large difference, which can explain that the spatiotemporal data processing model at this time is inaccurate , the condition for ending the training of the neural network training model cannot be met, then the device will continue to return to the operation of extracting the target spatiotemporal data from the training data set.
  • the training process of the spatio-temporal data processing model may end.
  • the device first extracts the target spatio-temporal data associated with any region from the training data set, wherein the training data set includes the spatio-temporal data associated with each region and the attribute information of each region, and then obtains the target spatio-temporal data associated with any region from the training data set
  • the first reference spatio-temporal data associated with at least one of the attribute information of the region is the same as the first reference region, and the first reference region associated with the second reference region that is different in the attribute information of any region is obtained from the training data set.
  • Reference spatio-temporal data and then input the target spatio-temporal data, the first reference spatio-temporal data and the second reference spatio-temporal data into the spatio-temporal data processing model to be trained to obtain the first data representation, the second data representation and the third data respectively representation, finally the first degree of difference between the first data representation and the second data representation is greater than or equal to the first threshold, or the second difference between the first data representation and the third data representation is less than or equal to the second threshold
  • the training process of the spatio-temporal data processing model ends.
  • a large amount of low-quality data can be used to assist in training the spatio-temporal data processing model, so that the time data processing model generated by training can be used to process low-quality spatio-temporal data to obtain more accurate and reliable spatiotemporal data representation.
  • Fig. 3 is a schematic flowchart of a training method for a spatio-temporal data processing model according to yet another embodiment of the present disclosure.
  • the training method of the spatio-temporal data processing model may include the following steps:
  • Step S301 extracting target spatio-temporal data associated with any region from the training data set, wherein the training data set includes spatio-temporal data associated with each region and attribute information of each region.
  • the spatio-temporal data associated with each area may include a communication data sequence, attribute map, population attribute, and positioning data sequence corresponding to each area, which is not limited.
  • the attribute map can be constructed by the device based on the known attribute information of the region.
  • the region can be used as a node
  • the distance of the road network can be used as the side length
  • the attribute information of the region such as house price, traffic flow
  • the population attribute may be obtained by the device based on the analysis of the acquired positioning data set and communication data, for example, it may be population age distribution, population gender distribution, and the like. This is not limited.
  • the training data set may also include labels corresponding to each area, such as the traffic conditions of each area, whether an aggregation event occurs, etc., which is not limited.
  • Step S302 acquiring first reference spatio-temporal data associated with a first reference area with the same attribute information of any area and second reference spatio-temporal data associated with a second reference area with different attribute information of any area from the training data set .
  • step S302 may refer to any of the foregoing embodiments, and details are not described here.
  • Step S303 use the knowledge fusion network to be trained to fuse the communication data sequence, attribute map, population attribute and positioning data sequence corresponding to any region, the first reference region and the second reference region respectively, so as to obtain the fused target Spatio-temporal data, fused first reference spatio-temporal data and second reference spatio-temporal data.
  • the knowledge fusion network to be trained can be used to use the knowledge to be trained to use the data such as the communication data sequence, attribute map, population attribute, and positioning data sequence corresponding to any area, the first reference area, and the second reference area.
  • the fusion network performs fusion, so that the fused target spatio-temporal data, the first reference spatio-temporal data, and the second reference spatio-temporal data have spatio-temporal data and attribute information of corresponding regions.
  • the Swapped Contrastive Loss mechanism can be used to predict the spatiotemporal data representation of another region from the spatiotemporal data representation of one region, so that smaller batch data can be used To train the network and reduce the memory requirements during model training.
  • the frame diagram of model pre-training shown in FIG. 4 can be adopted in the present disclosure, and the model can be trained by using the exchange contrast loss mechanism.
  • the training framework of the spatio-temporal data processing model can be divided into two types of networks, one is the prototype network (prototype) network, which is a full-connection network (full-connection) network, which mainly undertakes the task of representation calculation , and share network parameters when computing target data and augmented data.
  • the other is the backbone network, which mainly completes the tasks of multi-source data fusion and feature extraction.
  • backbone network corresponds to the spatio-temporal knowledge fusion network in Figure 4
  • prototype network corresponds to the encoding network in Figure 4.
  • swapped contrastive loss can be used to optimize the parameters of the network, and its expression can be:
  • o t and o s can be the output of the backbone network
  • q t and q s can be the output of the prototype network (prototype)
  • L is the cross-entropy loss function
  • L(o t ,q s ) can be calculated by the following formula:
  • is the temperature parameter
  • k is the number of protonet networks.
  • each P a contains the area, traffic conditions and location of the region
  • each P c contains the regional population portrait, such as age and gender, and the reporting point data of dynamic mobile phone positioning
  • the flow of people in the reaction area is not limited.
  • representation learning can be performed using the Message Passing Neural Network (MPNN), where the edge network can be expressed as:
  • w ij is the input of the edge network
  • ⁇ e is the parameter to be learned
  • the message passing network can be defined as:
  • ⁇ g is the parameter of the readout (reading) network.
  • network parameters of the knowledge fusion network to be trained can be optimized, thereby obtaining more accurate target spatio-temporal data, first reference spatio-temporal data, and second reference spatio-temporal data.
  • Step S304 respectively input the fused target spatio-temporal data, fused first reference spatio-temporal data and second reference spatio-temporal data into the spatio-temporal data processing model to be trained, so as to respectively obtain the first data representation, the second data representation Representation and third data representation.
  • Step S305 when the first degree of difference between the first data representation and the second data representation is smaller than the first threshold, and the second difference between the first data representation and the third data representation is greater than the second threshold, end the spatio-temporal The training process of the data processing model.
  • Step S306 input the spatio-temporal data associated with each region into the spatio-temporal data processing model generated by training, so as to obtain the data representation corresponding to each region.
  • the device can input the spatio-temporal data associated with each region into the spatio-temporal data processing model generated by training, so as to obtain the data representation corresponding to each region.
  • the data representation may represent the spatio-temporal data associated with each region in the form of a static graph structure or in the form of a vector, which is not limited.
  • Figure 5 shows a population inference frame diagram of multi-source data fusion.
  • spatio-temporal data such as communication data sequences, attribute maps, population attributes, and positioning data sequences corresponding to each region.
  • various types of data can be e-commerce data, travel data, rental data, road network data, and point of interest (POI) data.
  • POI point of interest
  • the communication data sequence, attribute map, and population corresponding to any area, the first reference area, and the second reference area are respectively
  • the attributes and positioning data sequences are fused, and then the above-mentioned spatio-temporal data processing model is used to take each fused data as input to obtain each data representation.
  • the device can input each data representation into the space-time predictor, so as to obtain the predicted flow of people in each area, that is, the inferred value.
  • Step S307 input the data representation corresponding to each region into the prediction model to be trained, so as to obtain the prediction label corresponding to each region.
  • the prediction label can be a prediction label for the attribute information of each area, such as the prediction of the flow of people, the prediction of the traffic condition, the prediction of the gender distribution, etc., or it can also be the prediction of the crowd gathering situation, marketing decision, traffic Scheduling decisions and other information are not limited.
  • the corresponding prediction labels obtained by the device through the prediction model may also be different. flow, which is not limited.
  • step S308 the prediction model and the spatio-temporal data processing model generated by training are corrected respectively according to the difference between the predicted label and the labeled label.
  • the device can compare the predicted label with the labeled label to determine the difference between the predicted label and the labeled label. For example, gradient descent, stochastic gradient descent, etc. can be used to determine the corrected gradient.
  • the prediction model and the spatio-temporal data processing model generated by training are respectively corrected, and the method of determining the correction gradient is not limited in the present disclosure.
  • the regression network can also be used to modify the parameters of the spatio-temporal data processing model through the cross loss of the knowledge fusion network.
  • the previously trained backbone network and another regression network to be trained can be used to fine-tune and infer specific tasks, and the specific formula is as follows:
  • o t is the output of the backbone network
  • ⁇ o is the parameter of the regression network f o
  • is the sigmoid activation function.
  • ⁇ b ⁇ a , ⁇ c , ⁇ e , ⁇ g , ⁇ n , ⁇ s ⁇
  • L is the loss function
  • is the learning rate
  • is the decay coefficient
  • the device first extracts the target spatio-temporal data associated with any region from the training data set, wherein the training data set includes the spatio-temporal data associated with each region and the attribute information of each region, and then obtains the target spatio-temporal data associated with any region from the training data set
  • the first reference space-time data associated with the first reference area with the same attribute information of the area, and the second reference space-time data associated with the second reference area with different attribute information of any area and then using the knowledge fusion network to be trained, the The communication data sequence, attribute map, population attribute and positioning data sequence respectively corresponding to any region, the first reference region and the second reference region are fused to obtain the fused target spatio-temporal data, the fused first reference spatio-temporal data and Second reference spatio-temporal data, when the first degree of difference between the first data representation and the second data representation is less than the first threshold, and the second difference between the first data representation and the third data representation is greater than the second threshold , end the training process of the spatio
  • the accuracy of the prediction model can be improved, and the fine-tuning of the model can be realized while optimizing the model, realizing the purpose of training with a small amount of data and achieving the effect of approximating the effect of full training data.
  • FIG. 6 is a structural block diagram of a training device for a spatio-temporal data processing model provided by an embodiment of the present disclosure.
  • the training device for the spatio-temporal data processing model includes: an extraction module 610 , a first acquisition module 620 , a second acquisition module 630 and a judgment module 640 .
  • An extraction module configured to extract target spatio-temporal data associated with any region from the training data set, wherein the training data set includes spatio-temporal data associated with each region and attribute information of each region;
  • the first acquisition module is configured to acquire, from the training data set, first reference space-time data associated with a first reference area that is the same as the attribute information of any area, and a first reference space-time data that is different from the attribute information of any area.
  • the second acquisition module is configured to respectively input the target spatio-temporal data, the first reference spatio-temporal data and the second reference spatio-temporal data into the spatio-temporal data processing model to be trained, so as to respectively acquire the first data representation, the second reference spatio-temporal data Two data representations and a third data representation;
  • a judging module configured to have a first degree of difference between the first data representation and the second data representation less than a first threshold, and a second degree of difference between the first data representation and the third data representation If it is greater than the second threshold, the training process of the spatio-temporal data processing model ends.
  • the attribute information of the region includes the area, population and type of the region, and the first acquisition module is specifically used for:
  • first reference spatio-temporal data associated with a first reference region that is identical to at least one attribute information in the attribute information of any region from the training data set;
  • the second reference spatio-temporal data associated with the second reference region whose attribute information is different in the attribute information of any region is obtained from the training data set.
  • the attribute information of the region includes the area, population and type of the region, and the first acquisition module is specifically used for:
  • Second reference spatio-temporal data associated with a second reference region that is different from at least one attribute information of each attribute information of any region is acquired from the training data set.
  • the device further includes:
  • a first determining module configured to determine the first threshold according to a first degree of matching between the attribute information of any region and the attribute information of the first reference region;
  • the second determining module is configured to determine the second threshold according to a second degree of matching between the attribute information of any region and the attribute information of the second reference region.
  • the judging module is also used for:
  • the first degree of difference between the first data representation and the second data representation is greater than or equal to a first threshold, or the second degree of difference between the first data representation and the third data representation is less than or If it is equal to the second threshold, return to the operation of extracting the target spatio-temporal data from the training data set until the gap between the first data representation newly generated by the spatio-temporal data processing model to be trained and the newly generated second data representation
  • the training process of the spatio-temporal data processing model ends.
  • the training data set also includes annotated labels corresponding to each of the regions, and the device further includes:
  • the third acquisition module is used to input the spatio-temporal data associated with each of the regions into the spatio-temporal data processing model generated by training, so as to obtain the data representation corresponding to each of the regions;
  • the fourth acquisition module is used to input the data representation corresponding to each of the regions into the prediction model to be trained, so as to obtain the prediction labels corresponding to each of the regions;
  • the correction module is configured to correct the prediction model and the spatio-temporal data processing model generated by the training according to the difference between the prediction label and the label label.
  • the spatio-temporal data associated with each of the regions includes communication data sequences, attribute maps, population attributes and positioning data sequences corresponding to each region, and the second acquisition module is also used to:
  • the device first extracts the target spatio-temporal data associated with any region from the training data set, wherein the training data set includes the spatio-temporal data associated with each region and the attribute information of each region, and then obtains the target spatio-temporal data associated with any region from the training data set
  • the first reference spatio-temporal data associated with the first reference area with the same attribute information of the area, and the second reference spatio-temporal data associated with the second reference area with different attribute information of any area and then the target spatio-temporal data, the first reference spatio-temporal data
  • the data and the second reference spatio-temporal data are respectively input into the spatio-temporal data processing model to be trained, so as to obtain the first data representation, the second data representation and the third data representation respectively.
  • the spatio-temporal data processing ends The training process of the model. Therefore, by using comparative self-supervised learning of spatio-temporal data, a spatio-temporal data processing model can be generated, so that spatio-temporal data can be processed to obtain accurate and reliable spatio-temporal data representations, and to improve The accuracy of spatiotemporal prediction provides the conditions.
  • the present disclosure also provides an electronic device, a readable storage medium, a computer program product, and a computer program.
  • an electronic device includes:
  • the memory stores instructions that can be executed by the at least one processor, and the instructions are executed by the at least one processor, so that the at least one processor can execute the spatiotemporal data processing model described in the above-mentioned embodiments. training method.
  • the readable storage medium is a non-transitory computer-readable storage medium.
  • the non-transitory computer-readable storage medium stores computer instructions, and the computer instructions are used to make the computer execute the spatio-temporal data processing model training method described in the above-mentioned embodiments.
  • the computer program product includes a computer program, and when the computer program is executed by a processor, the method for training the spatio-temporal data processing model described in the above embodiments is implemented.
  • FIG. 7 shows a schematic block diagram of an example electronic device 700 that may be used to implement embodiments of the present disclosure.
  • Electronic device is intended to represent various forms of digital computers, such as laptops, desktops, workstations, personal digital assistants, servers, blade servers, mainframes, and other suitable computers.
  • Electronic devices may also represent various forms of mobile devices, such as personal digital processing, cellular telephones, smart phones, wearable devices, and other similar computing devices.
  • the components shown herein, their connections and relationships, and their functions, are by way of example only, and are not intended to limit implementations of the disclosure described and/or claimed herein.
  • the device 700 includes a computing unit 701 that can execute according to a computer program stored in a read-only memory (ROM) 702 or loaded from a storage unit 708 into a random-access memory (RAM) 703. Various appropriate actions and treatments. In the RAM 703, various programs and data necessary for the operation of the device 700 can also be stored.
  • the computing unit 701, ROM 702, and RAM 703 are connected to each other through a bus 704.
  • An input/output (I/O) interface 705 is also connected to the bus 704 .
  • the I/O interface 705 includes: an input unit 706, such as a keyboard, a mouse, etc.; an output unit 707, such as various types of displays, speakers, etc.; a storage unit 708, such as a magnetic disk, an optical disk, etc. ; and a communication unit 709, such as a network card, a modem, a wireless communication transceiver, and the like.
  • the communication unit 709 allows the device 700 to exchange information/data with other devices over a computer network such as the Internet and/or various telecommunication networks.
  • the computing unit 701 may be various general-purpose and/or special-purpose processing components having processing and computing capabilities. Some examples of computing units 701 include, but are not limited to, central processing units (CPUs), graphics processing units (GPUs), various dedicated artificial intelligence (AI) computing chips, various computing units that run machine learning model algorithms, digital signal processing processor (DSP), and any suitable processor, controller, microcontroller, etc.
  • the calculation unit 701 executes various methods and processes described above, such as a training method of a spatio-temporal data processing model.
  • the method for training a spatio-temporal data processing model may be implemented as a computer software program tangibly embodied on a machine-readable medium, such as storage unit 708 .
  • part or all of the computer program may be loaded and/or installed on the device 700 via the ROM 702 and/or the communication unit 709.
  • the computer program When the computer program is loaded into the RAM 703 and executed by the computing unit 701, one or more steps of the training method of the spatio-temporal data processing model described above can be performed.
  • the computing unit 701 may be configured in any other appropriate way (for example, by means of firmware) to execute the training method of the spatio-temporal data processing model.
  • Various implementations of the systems and techniques described above herein can be implemented in digital electronic circuit systems, integrated circuit systems, field programmable gate arrays (FPGAs), application specific integrated circuits (ASICs), application specific standard products (ASSPs), systems on chips Implemented in a system of systems (SOC), load programmable logic device (CPLD), computer hardware, firmware, software, and/or combinations thereof.
  • FPGAs field programmable gate arrays
  • ASICs application specific integrated circuits
  • ASSPs application specific standard products
  • SOC system of systems
  • CPLD load programmable logic device
  • computer hardware firmware, software, and/or combinations thereof.
  • programmable processor can be special-purpose or general-purpose programmable processor, can receive data and instruction from storage system, at least one input device, and at least one output device, and transmit data and instruction to this storage system, this at least one input device, and this at least one output device an output device.
  • Program codes for implementing the methods of the present disclosure may be written in any combination of one or more programming languages. These program codes may be provided to a processor or controller of a general-purpose computer, a special purpose computer, or other programmable data processing devices, so that the program codes, when executed by the processor or controller, make the functions/functions specified in the flow diagrams and/or block diagrams Action is implemented.
  • the program code may execute entirely on the machine, partly on the machine, as a stand-alone software package partly on the machine and partly on a remote machine or entirely on the remote machine or server.
  • a machine-readable medium may be a tangible medium that may contain or store a program for use by or in conjunction with an instruction execution system, apparatus, or device.
  • a machine-readable medium may be a machine-readable signal medium or a machine-readable storage medium.
  • a machine-readable medium may include, but is not limited to, electronic, magnetic, optical, electromagnetic, infrared, or semiconductor systems, apparatus, or devices, or any suitable combination of the foregoing.
  • machine-readable storage media would include one or more wire-based electrical connections, portable computer discs, hard drives, random access memory (RAM), read only memory (ROM), erasable programmable read only memory (EPROM or flash memory), optical fiber, compact disk read only memory (CD-ROM), optical storage, magnetic storage, or any suitable combination of the foregoing.
  • RAM random access memory
  • ROM read only memory
  • EPROM or flash memory erasable programmable read only memory
  • CD-ROM compact disk read only memory
  • magnetic storage or any suitable combination of the foregoing.
  • the systems and techniques described herein can be implemented on a computer having a display device (e.g., a CRT (cathode ray tube) or LCD (liquid crystal display) monitor) for displaying information to the user. ); and a keyboard and pointing device (eg, a mouse or a trackball) through which a user can provide input to the computer.
  • a display device e.g., a CRT (cathode ray tube) or LCD (liquid crystal display) monitor
  • a keyboard and pointing device eg, a mouse or a trackball
  • Other kinds of devices can also be used to provide interaction with the user; for example, the feedback provided to the user can be any form of sensory feedback (e.g., visual feedback, auditory feedback, or tactile feedback); and can be in any form (including Acoustic input, speech input or, tactile input) to receive input from the user.
  • the systems and techniques described herein can be implemented in a computing system that includes back-end components (e.g., as a data server), or a computing system that includes middleware components (e.g., an application server), or a computing system that includes front-end components (e.g., as a a user computer having a graphical user interface or web browser through which a user can interact with embodiments of the systems and techniques described herein), or including such backend components, middleware components, Or any combination of front-end components in a computing system.
  • the components of the system can be interconnected by any form or medium of digital data communication, eg, a communication network. Examples of communication networks include: local area networks (LANs), wide area networks (WANs), the Internet, and blockchain networks.
  • a computer system may include clients and servers.
  • Clients and servers are generally remote from each other and typically interact through a communication network.
  • the relationship of client and server arises by computer programs running on the respective computers and having a client-server relationship to each other.
  • the server can be a cloud server, also known as cloud computing server or cloud host, which is a host product in the cloud computing service system to solve the problem of traditional physical host and VPS service ("Virtual Private Server", or "VPS”) Among them, there are defects such as difficult management and weak business scalability.
  • the server can also be a server of a distributed system, or a server combined with a blockchain.
  • the device first extracts the target spatio-temporal data associated with any region from the training data set, wherein the training data set includes the spatio-temporal data associated with each region and the attribute information of each region, and then obtains the target spatio-temporal data associated with any region from the training data set
  • the first reference spatio-temporal data associated with the first reference area with the same attribute information of the area, and the second reference spatio-temporal data associated with the second reference area with different attribute information of any area and then the target spatio-temporal data, the first reference spatio-temporal data
  • the data and the second reference spatio-temporal data are respectively input into the spatio-temporal data processing model to be trained, so as to obtain the first data representation, the second data representation and the third data representation respectively.
  • the spatio-temporal data processing ends The training process of the model. Therefore, by using comparative self-supervised learning of spatio-temporal data, a spatio-temporal data processing model can be generated, so that spatio-temporal data can be processed to obtain accurate and reliable spatio-temporal data representations, and to improve The accuracy of spatiotemporal prediction provides the conditions.
  • steps may be reordered, added or deleted using the various forms of flow shown above.
  • each step described in the present disclosure may be executed in parallel, sequentially, or in a different order, as long as the desired result of the technical solution disclosed in the present disclosure can be achieved, no limitation is imposed herein.

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Data Mining & Analysis (AREA)
  • General Physics & Mathematics (AREA)
  • Evolutionary Computation (AREA)
  • Artificial Intelligence (AREA)
  • General Engineering & Computer Science (AREA)
  • Computational Linguistics (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Business, Economics & Management (AREA)
  • Software Systems (AREA)
  • Mathematical Physics (AREA)
  • Health & Medical Sciences (AREA)
  • Computing Systems (AREA)
  • General Health & Medical Sciences (AREA)
  • Biophysics (AREA)
  • Molecular Biology (AREA)
  • Biomedical Technology (AREA)
  • Tourism & Hospitality (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Human Resources & Organizations (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Development Economics (AREA)
  • Educational Administration (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Economics (AREA)
  • Evolutionary Biology (AREA)
  • Marketing (AREA)
  • Primary Health Care (AREA)
  • Strategic Management (AREA)
  • General Business, Economics & Management (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)

Abstract

Sont divulgués un procédé et un appareil d'entraînement d'un modèle de traitement de données spatio-temporelles, un dispositif, et un support de stockage. Le procédé consiste à : extraire des données spatio-temporelles cibles d'une zone quelconque à partir d'un ensemble de données d'entraînement, l'ensemble de données d'entraînement comprenant des données spatio-temporelles et des attributs de chaque zone ; acquérir des premières données spatio-temporelles de référence d'une première zone de référence, qui a le même attribut que n'importe quelle zone, et des secondes données spatio-temporelles de référence d'une seconde zone de référence, qui a un attribut différent de celui de n'importe quelle zone ; entrer les données spatio-temporelles cibles, les premières données spatio-temporelles de référence et les secondes données spatio-temporelles de référence dans un modèle à entraîner, de manière à acquérir une première représentation de données, une deuxième représentation de données et une troisième représentation de données ; et si la différence entre la première représentation de données et la deuxième représentation de données est inférieure à une première valeur seuil et la différence entre la première représentation de données et la troisième représentation de données est supérieure à une seconde valeur seuil, mettre fin à l'entraînement du modèle.
PCT/CN2022/086925 2021-06-02 2022-04-14 Procédé et appareil d'entraînement de modèle de traitement de données spatio-temporelles, dispositif, et support de stockage WO2022252843A1 (fr)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN202110612995.7 2021-06-02
CN202110612995.7A CN113723607A (zh) 2021-06-02 2021-06-02 时空数据处理模型的训练方法、装置、设备及存储介质

Publications (1)

Publication Number Publication Date
WO2022252843A1 true WO2022252843A1 (fr) 2022-12-08

Family

ID=78672815

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2022/086925 WO2022252843A1 (fr) 2021-06-02 2022-04-14 Procédé et appareil d'entraînement de modèle de traitement de données spatio-temporelles, dispositif, et support de stockage

Country Status (2)

Country Link
CN (1) CN113723607A (fr)
WO (1) WO2022252843A1 (fr)

Families Citing this family (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113723607A (zh) * 2021-06-02 2021-11-30 京东城市(北京)数字科技有限公司 时空数据处理模型的训练方法、装置、设备及存储介质
CN116934557B (zh) * 2023-09-15 2023-12-01 中关村科学城城市大脑股份有限公司 行为预测信息生成方法、装置、电子设备和可读介质

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20200184252A1 (en) * 2018-12-10 2020-06-11 International Business Machines Corporation Deep Learning Network for Salient Region Identification in Images
CN112016682A (zh) * 2020-08-04 2020-12-01 杰创智能科技股份有限公司 视频表征学习、预训练方法及装置、电子设备、存储介质
US20210103815A1 (en) * 2019-10-07 2021-04-08 Deepmind Technologies Limited Domain adaptation for robotic control using self-supervised learning
CN112800053A (zh) * 2021-01-05 2021-05-14 深圳索信达数据技术有限公司 数据模型的生成方法、调用方法、装置、设备及存储介质
CN113723607A (zh) * 2021-06-02 2021-11-30 京东城市(北京)数字科技有限公司 时空数据处理模型的训练方法、装置、设备及存储介质

Family Cites Families (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111782709B (zh) * 2020-06-30 2024-07-02 北京百度网讯科技有限公司 异常数据确定方法和装置

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20200184252A1 (en) * 2018-12-10 2020-06-11 International Business Machines Corporation Deep Learning Network for Salient Region Identification in Images
US20210103815A1 (en) * 2019-10-07 2021-04-08 Deepmind Technologies Limited Domain adaptation for robotic control using self-supervised learning
CN112016682A (zh) * 2020-08-04 2020-12-01 杰创智能科技股份有限公司 视频表征学习、预训练方法及装置、电子设备、存储介质
CN112800053A (zh) * 2021-01-05 2021-05-14 深圳索信达数据技术有限公司 数据模型的生成方法、调用方法、装置、设备及存储介质
CN113723607A (zh) * 2021-06-02 2021-11-30 京东城市(北京)数字科技有限公司 时空数据处理模型的训练方法、装置、设备及存储介质

Also Published As

Publication number Publication date
CN113723607A (zh) 2021-11-30

Similar Documents

Publication Publication Date Title
JP7331171B2 (ja) 画像認識モデルをトレーニングするための方法および装置、画像を認識するための方法および装置、電子機器、記憶媒体、並びにコンピュータプログラム
WO2023015941A1 (fr) Procédé et appareil d'apprentissage de modèle de détection de texte, procédé de détection de texte et dispositif
WO2022134071A1 (fr) Procédé et appareil d'extraction de texte, support de stockage lisible par ordinateur et dispositif électronique
WO2023020045A1 (fr) Procédé d'apprentissage pour modèle de reconnaissance de texte, et procédé et appareil de reconnaissance de texte
WO2022252843A1 (fr) Procédé et appareil d'entraînement de modèle de traitement de données spatio-temporelles, dispositif, et support de stockage
WO2022227769A1 (fr) Procédé et appareil d'entraînement pour modèle de détection de ligne de voie, dispositif électronique et support de stockage
CN113033622B (zh) 跨模态检索模型的训练方法、装置、设备和存储介质
WO2022134480A1 (fr) Procédé d'établissement d'un modèle de prédiction de risque, procédé de prédiction de risque régional et appareil correspondant
CN113222916A (zh) 采用目标检测模型检测图像的方法、装置、设备和介质
WO2023016007A1 (fr) Procédé et appareil de formation de modèle de reconnaissance faciale, et produit de programme informatique
CN115688920B (zh) 知识抽取方法、模型的训练方法、装置、设备和介质
CN114820871B (zh) 字体生成方法、模型的训练方法、装置、设备和介质
CN114065864B (zh) 联邦学习方法、联邦学习装置、电子设备以及存储介质
EP4123595A2 (fr) Procédé et appareil de redressement d'image de texte, procédé et appareil d'apprentissage, dispositif électronique et support
CN114494784A (zh) 深度学习模型的训练方法、图像处理方法和对象识别方法
EP4120181A2 (fr) Procédé et appareil de fusion d'image, et procédé d'apprentissage de modèle de fusion d'image
US20230215203A1 (en) Character recognition model training method and apparatus, character recognition method and apparatus, device and storage medium
CN113963186A (zh) 目标检测模型的训练方法、目标检测方法及相关装置
WO2023173633A1 (fr) Procédé et appareil de prédiction d'état de route, procédé et appareil d'apprentissage de modèle correspondants, et dispositif et support
CN113627536A (zh) 模型训练、视频分类方法,装置,设备以及存储介质
CN115796310A (zh) 信息推荐及模型训练方法、装置、设备和存储介质
CN110633716A (zh) 一种目标对象的检测方法和装置
CN113657596B (zh) 训练模型和图像识别的方法和装置
CN113435523B (zh) 预测内容点击率的方法、装置、电子设备以及存储介质
CN114882333B (zh) 数据处理模型的训练方法、装置、电子设备及存储介质

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 22814890

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

32PN Ep: public notification in the ep bulletin as address of the adressee cannot be established

Free format text: NOTING OF LOSS OF RIGHTS PURSUANT TO RULE 112(1) EPC (EPO FORM 1205A DATED 260324)