WO2020151725A1 - Method and device for location prediction - Google Patents

Method and device for location prediction Download PDF

Info

Publication number
WO2020151725A1
WO2020151725A1 PCT/CN2020/073652 CN2020073652W WO2020151725A1 WO 2020151725 A1 WO2020151725 A1 WO 2020151725A1 CN 2020073652 W CN2020073652 W CN 2020073652W WO 2020151725 A1 WO2020151725 A1 WO 2020151725A1
Authority
WO
WIPO (PCT)
Prior art keywords
user
predicted
sample
feature value
historical behavior
Prior art date
Application number
PCT/CN2020/073652
Other languages
French (fr)
Chinese (zh)
Inventor
谢君
卓呈祥
Original Assignee
北京嘀嘀无限科技发展有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 北京嘀嘀无限科技发展有限公司 filed Critical 北京嘀嘀无限科技发展有限公司
Publication of WO2020151725A1 publication Critical patent/WO2020151725A1/en

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q10/00Administration; Management
    • G06Q10/04Forecasting or optimisation specially adapted for administrative or management purposes, e.g. linear programming or "cutting stock problem"
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q30/00Commerce
    • G06Q30/02Marketing; Price estimation or determination; Fundraising
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q30/00Commerce
    • G06Q30/02Marketing; Price estimation or determination; Fundraising
    • G06Q30/0201Market modelling; Market analysis; Collecting market data
    • G06Q30/0202Market predictions or forecasting for commercial activities
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q30/00Commerce
    • G06Q30/06Buying, selling or leasing transactions
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q30/00Commerce
    • G06Q30/06Buying, selling or leasing transactions
    • G06Q30/0601Electronic shopping [e-shopping]
    • G06Q30/0607Regulated
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q30/00Commerce
    • G06Q30/06Buying, selling or leasing transactions
    • G06Q30/0601Electronic shopping [e-shopping]
    • G06Q30/0633Lists, e.g. purchase orders, compilation or processing
    • G06Q30/0635Processing of requisition or of purchase orders
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q30/00Commerce
    • G06Q30/06Buying, selling or leasing transactions
    • G06Q30/0601Electronic shopping [e-shopping]
    • G06Q30/0639Item locations

Definitions

  • This application relates to the field of machine learning technology, and in particular to a position prediction method and device.
  • the user's location location information can be obtained in real time, but the method of obtaining the location location in real time has poor timeliness, which is not conducive to early resource allocation and the configuration of related service strategies.
  • the purpose of the embodiments of the present application is to provide a location prediction method and device, which can determine the location of the user to be predicted within a preset time period in the future with higher accuracy.
  • a position prediction method may include: acquiring features associated with the historical behavior of the user to be predicted, the features including at least location information related to the historical behavior of the user to be predicted; at least based on the features associated with the historical behavior of the user to be predicted, through pre-trained
  • the location prediction model obtains the location prediction result of the user to be predicted in the future preset time period.
  • the feature associated with the historical behavior of the user to be predicted may include the first feature value of the user to be predicted under at least one historical behavior feature, and the user to be predicted within a preset time period in the future The second feature value under the target time feature and the third feature value under the target location feature.
  • the location prediction result of the user to be predicted in the future preset time period may include the probability that the user to be predicted will go to the area to be predicted in the future preset time period;
  • the first feature reflects the The distribution information of the at least one historical behavior in different regions, the distribution information of the at least one historical behavior in time, the association information between the at least one historical behavior and the future preset time period, the at least one historical behavior and One or a combination of one or more of the associated information of the region to be predicted;
  • the second feature reflects the attribute information of the future preset time period;
  • the third feature reflects the attribute information of the region to be predicted.
  • the acquiring the first feature value of the user to be predicted under at least one historical behavior feature includes: extracting the user to be predicted from the at least one historical behavior information of the user to be predicted Regional historical behavior information of the region, and extracting the regional time historical behavior information of the user to be predicted in the region to be predicted in each historical time period among the multiple historical time periods; according to the at least one user to be predicted
  • the first feature value under the first historical behavior feature of the user to be predicted is determined based on the historical behavior information, the regional historical behavior information, and the regional time historical behavior information; This kind of historical behavior information determines the first feature value under the second historical behavior feature of the user to be predicted.
  • the first historical behavior feature may include one or more of the following: the number of occurrences of historical behaviors corresponding to multiple POI classifications of points of interest, and the number of occurrences of historical behaviors per day within a preset time period. , The number of occurrences of historical behaviors on working days, the number of occurrences of historical behaviors on non-working days, and the number of occurrences corresponding to different historical behaviors.
  • the second historical behavior feature may include one or more of the following: whether the area where the last historical behavior occurred is the area to be predicted, and the time between the last historical behavior and the time to be predicted The time interval, the POI classification of the destination of the last historical behavior, the POI classification of the departure place of the last historical behavior, and the number of areas reached by the user to be predicted.
  • the historical behavior information may include one or more of historical bubbling behavior information, historical billing behavior information, and historical billing behavior information.
  • the method may further include: obtaining the user attribute characteristics of the user to be predicted; and obtaining the user attribute characteristics of the user to be predicted through a pre-trained location prediction model based at least on the characteristics associated with the historical behavior of the user to be predicted
  • the position prediction result of the user in the future preset time period includes: obtaining the to-be-predicted through a pre-trained position prediction model based on the characteristics associated with the historical behavior of the user to be predicted and the user attribute characteristics of the user to be predicted The location prediction result of the user in the future preset time period.
  • the user attribute characteristics may include one or more of the following: the number of orders, whether it is a business person, whether it is a tourist, the area where the user's home is located, and the area where the user company is located.
  • the following method may be used to train the position prediction model: obtaining the first sample feature value of a plurality of sample users under at least one historical behavior characteristic, and the historical preset time of each sample user In the segment, the second sample feature value under the target time feature and the third sample feature value under the target location feature; the sample users include positive sample users and negative sample users; based on the first sample of each sample user This feature value, the second sample feature value, and the third sample feature value are used to train the position prediction model.
  • the second sample feature value and the third sample feature value under the target location feature include: for each sample region to be predicted in the plurality of sample regions to be predicted, obtaining a sample user corresponding to the sample region to be predicted in at least one
  • the training of the location prediction model based on the first sample feature value, the second sample feature value, and the third sample feature value of each sample user includes: based on each sample area to be predicted
  • the first sample feature value, the second sample feature value, and the third sample feature value of the corresponding sample user are trained to train the position prediction model.
  • the training of the location prediction model based on the first sample feature value of each sample user and the second sample feature value includes: based on the first sample feature value of each sample user This feature value, the second sample feature value, and the third sample feature value construct multiple sub-decision trees; determine the multiple sub-decision trees as the position prediction model.
  • constructing multiple sub-decision trees based on the first sample feature value, the second sample feature value, and the third sample feature value of each sample user includes: Among the historical behavior characteristics, the target time characteristics, and the target area characteristics, multiple target characteristics are randomly determined; based on the characteristic values of the sample users under the target characteristics, a sub-decision tree of the current iteration cycle is constructed ; The sub-decision tree based on the current iteration cycle and the sub-decision tree of the historical iteration cycle constitute the current decision tree set, and determine the loss of the current decision tree set; if the loss is greater than the preset loss threshold, complete the current iteration Cycle, and return to the step of randomly determining multiple target characteristics among the historical behavior characteristics, the target time characteristics, the target area characteristics, and the user attribute characteristics; if the loss is not greater than the preset loss threshold In this case, the current decision tree set is determined as the position prediction model.
  • the determining the loss of the current decision tree set includes: comparing the first test sample feature value of a plurality of test sample users under the historical behavior characteristics, and the time to be predicted for each test sample user The feature and the feature value of the second test sample under the feature of the region to be predicted are input into the current decision tree set to obtain the location prediction result corresponding to each test sample user; based on the corresponding test sample user The location prediction result, and the corresponding actual location, determine the loss of the current decision tree set.
  • the method further includes: obtaining a fourth sample feature value of each of the sample users under multiple user attribute features;
  • the training of the position prediction model based on the first sample feature value, the second sample feature value, and the third sample feature value of each sample user includes: the The first sample feature value, the second sample feature value, the third sample feature value, and the fourth sample feature value are used to train the position prediction model.
  • the first feature value, the second feature value, and the third feature value are input into a pre-trained location prediction model to obtain the user to be predicted at the preset time in the future
  • the location prediction result of the segment includes: inputting the first feature value, the second feature value, and the third feature value into a pre-trained location prediction model to obtain the sub-prediction results of each sub-decision tree; The sub-prediction results of each sub-decision tree are weighted and summed to determine the location prediction result of the user to be predicted in the future time period.
  • the feature associated with the historical behavior of the user to be predicted reflects one or more historical ride-hailing information of the user to be predicted.
  • the historical ride-hailing information includes one or more of the following information: departure place, destination, departure place attributes, destination attributes, and time information.
  • it further includes acquiring the user attribute characteristics of the user to be predicted and the area to be predicted; the location prediction result of the user to be predicted in the future preset time period includes the user to be predicted at a preset time in the future The probability of going to the area to be predicted within a segment; said at least based on the features associated with the historical behavior of the user to be predicted, obtaining the location prediction result of the user to be predicted in the future preset time period through a pre-trained location prediction model , Including: based on the features associated with the historical behavior of the user to be predicted, the user attribute feature of the user to be predicted, the future preset time period, and the area to be predicted, obtaining the to-be-predicted location through a pre-trained location prediction model Predict the location prediction result of the user in the future preset time period.
  • the location prediction is based on the features associated with the historical behavior of the user to be predicted, the user attribute features of the user to be predicted, the future preset time period, and the area to be predicted.
  • the model obtains the location prediction result of the user to be predicted in the future preset time period, including, through the location prediction model: processing the user attribute characteristics of the user to be predicted, the future preset time period, and the The region to be predicted obtains the first vector representation; the user attribute characteristics are respectively fused with each historical taxi information to obtain the vector representation corresponding to each historical taxi information; the first vector representation and each time are processed based on the attention mechanism
  • the vector representation corresponding to the historical taxi information obtains the second vector representation; based on the second vector representation, the probability of the user to be predicted going to the area to be predicted in the future preset time period is determined.
  • a position prediction system includes: at least one storage medium including a set of instructions for position prediction; and at least one processor in communication with the storage medium, wherein when the set of instructions is executed, the at least one processor uses ⁇ : Obtaining features associated with the historical behavior of the user to be predicted; the features include at least location information related to the historical behavior of the user to be predicted; at least based on the features associated with the historical behavior of the user to be predicted, through a pre-trained location prediction model Obtain the location prediction result of the user to be predicted in the future preset time period.
  • a position prediction device includes: an acquisition module, configured to acquire features associated with the historical behavior of the user to be predicted; the features include at least location information related to the historical behavior of the user to be predicted; For the associated features, the position prediction result of the user to be predicted in the future preset time period is obtained through a pre-trained position prediction model.
  • a computer-readable storage medium characterized in that a computer program is stored on the computer-readable storage medium, and the computer program is executed when the processor is running: obtaining and predicting user history Behavior-related features; the features include at least location information related to the historical behavior of the user to be predicted; at least based on the features associated with the historical behavior of the user to be predicted, the location of the user to be predicted is acquired through a pre-trained location prediction model Describe the location prediction result for a preset time period in the future.
  • a position prediction method includes: obtaining a first feature value of a user to be predicted under at least one historical behavior feature, and a second feature value of the user to be predicted under a target time feature and a target location within a preset time period in the future.
  • the third feature value under the feature; the first feature value, the second feature value, and the third feature value are input into a pre-trained location prediction model to obtain the future prediction of the user to be predicted Set the position prediction result of the time period.
  • a location prediction device includes: an acquisition module for acquiring the first characteristic value of the user to be predicted under at least one historical behavior characteristic, and the second characteristic of the user to be predicted under the target time characteristic in a preset time period in the future Value and the third feature value under the target location feature; a prediction module for inputting the first feature value, the second feature value, and the third feature value into a pre-trained location prediction model to obtain The location prediction result of the user to be predicted in the future preset time period.
  • Fig. 1 is a schematic diagram of an exemplary online platform of an exemplary on-demand service system according to some embodiments of the present application
  • Figure 2A is a schematic diagram of exemplary hardware and/or software components of a computing device according to some embodiments of the present application
  • 2B is a schematic diagram of exemplary hardware and/or software components of a mobile device according to some embodiments of the present application.
  • FIG. 3 shows a flowchart of an exemplary process for predicting a position according to some embodiments of the present application
  • FIG. 4 shows a flowchart of a process of acquiring a first characteristic value under a first historical behavior characteristic and a first characteristic value under a second historical behavior characteristic according to some embodiments of the present application;
  • Fig. 5 shows a flowchart of a process of training a position prediction model according to some embodiments of the present application
  • Fig. 6 shows a flowchart of a process for training a position prediction model according to some embodiments of the present application
  • FIG. 7A shows a flowchart of an exemplary process for predicting a position according to some embodiments of the present application
  • Fig. 7B shows a flowchart of a prediction process of a position prediction model according to some embodiments of the present application.
  • Fig. 8 shows an exemplary block diagram of a position prediction apparatus according to some embodiments of the present application.
  • the means of transportation of the transportation system can include taxis, private cars, downwinds, buses, trains, bullet trains, high-speed railways, subways, ships, airplanes, spacecraft, hot air balloons, or unmanned vehicles, etc., or any of them combination.
  • This application may also include any service system for providing services to users based on the Internet, for example, a system for sending and/or receiving express delivery, and a service system for transactions between buyers and sellers.
  • the application of the system or method of the present application may include web pages, browser plug-ins, client terminals, customized systems, internal analysis systems, or artificial intelligence robots, etc., or any combination thereof.
  • This application relates to a location prediction method.
  • the method can input the features associated with the historical behavior of the user to be predicted into a pre-trained location prediction model, and predict whether the user to be predicted will appear at the target location in a preset time period in the future, so that the user location can be predicted in advance Information is conducive to the deployment of resources in advance and the configuration of related service strategies.
  • the features associated with the historical behavior of the user to be predicted include at least location information related to the historical behavior to be predicted, and the location information may be a certain city, administrative division, POI, latitude and longitude, etc.
  • the feature associated with the historical behavior of the user to be predicted may include the first feature value under at least one historical behavior feature of the user to be predicted, and the feature of the user to be predicted at the target time within a preset time period in the future.
  • the feature associated with the historical behavior of the user to be predicted may also be one or more historical ride-hailing information of the user to be predicted. The historical ride-hailing information may be obtained based on the user's historical ride-hailing orders.
  • the prediction of the user's location can also be based on the statistics of the user to be predicted, but the prediction accuracy of this method is low, and the location prediction method provided in this application can treat the prediction with higher accuracy.
  • the user’s location is predicted.
  • Fig. 1 is a schematic diagram of an exemplary online platform of an exemplary on-demand service system 100 according to some embodiments of the present application.
  • the on-demand service system 100 may be an online transportation service platform implemented in a network environment, which has a positioning system for providing transportation services.
  • the on-demand service system 100 can provide at least two services. Exemplary on-demand services may include small passenger car sharing services, medium passenger car sharing services, taxi services, driver services, express services, bus services, driver hire services, and shuttle services.
  • the on-demand service system 100 may include a server 110, a network 120, a service requester terminal 130, a service provider terminal 140, a vehicle 150, a storage 160, and a positioning device 170.
  • the server 110 may be a computer server.
  • the server 110 may communicate with the service requester terminal 130 and/or the service provider terminal 140 to provide various functions of online on-demand services.
  • the server 110 may be a single server or a group of servers.
  • the server group may be a centralized server group connected to the network 120 via an access point, or a distributed server group connected to the network 120 via one or more access points.
  • the server 110 may be locally connected to the network 120 or remotely connected to the network 120.
  • the server 110 may access information and/or data stored in the service requester terminal 130, the service provider terminal 140, and/or the storage 160 via the network 120.
  • the storage 160 may be used as a back-end storage of the server 110.
  • the server 110 may be implemented on a cloud platform.
  • the cloud platform may include private clouds, public clouds, hybrid clouds, community clouds, distributed clouds, intermediate clouds, multiple clouds, etc., or any combination thereof.
  • the server 110 may include a processing device 112.
  • the processing device 112 may process information and/or data related to performing one or more functions described in this application.
  • the processing device 112 may obtain historical behavior information of the user.
  • the processing device 112 may determine the first characteristic value, the second characteristic value, and the third characteristic value under the historical behavior characteristic according to the historical behavior information.
  • the processing device 112 may train a position prediction model.
  • the processing device 112 may input the first characteristic value, the second characteristic value, and the third characteristic value under the historical behavior characteristic into the position prediction model, and the prediction model may output the position of the user to be predicted in a preset time period in the future.
  • the processing device 112 may include one or more processing units (for example, a single-core processing engine or a multi-core processing engine).
  • the processing device 112 may include a central processing unit (CPU), an application specific integrated circuit (ASIC), an application specific instruction set processor (ASIP), a graphics processing unit (GPU), a physical processing unit (PPU), and a digital signal processor. (DSP), field programmable gate array (FPGA), programmable logic device (PLD), controller, microcontroller unit, reduced instruction set computer (RISC), microprocessor, etc., or any combination thereof.
  • CPU central processing unit
  • ASIC application specific integrated circuit
  • ASIP application specific instruction set processor
  • GPU graphics processing unit
  • PPU physical processing unit
  • DSP digital signal processor
  • FPGA field programmable gate array
  • PLD programmable logic device
  • controller microcontroller unit
  • RISC reduced instruction set computer
  • the network 120 may facilitate the exchange of information and/or data.
  • one or more components of the on-demand service system 100 can be sent to the on-demand service system 100 via the network 120.
  • Other components in the service system 100 send information and/or data.
  • the server 110 may access and/or obtain at least two pieces of historical behavior information from the storage 160 via the network 120.
  • the network 120 may be any type of wired or wireless network or combination thereof.
  • the network 120 may include a cable network, a wired network, an optical fiber network, a telecommunications network, an intranet, the Internet, a local area network (LAN), a wide area network (WAN), a wireless local area network (WLAN), a metropolitan area network (MAN), and a public telephone. Switched Network (PSTN), Bluetooth network, ZigBee network, Near Field Communication (NFC) network, etc., or any combination thereof.
  • the network 120 may include one or more network access points.
  • the network 120 may include wired or wireless network access points, such as base stations and/or Internet exchange points 120-1, 120-2, and so on.
  • One or more components of the on-demand service system 100 may be connected to the network 120 through a network access point to exchange data and/or information.
  • the passenger or user may be the holder of the service requester terminal 130.
  • the holder of the service requester terminal 130 may be someone other than the passenger.
  • the holder A of the service requester terminal 130 may use the service requester terminal 130 to send a service request for the passenger B, and/or receive service confirmation and/or information or instructions from the server 110.
  • the driver may be a user of the service provider terminal 140.
  • the user of the service provider terminal 140 may be someone other than the driver.
  • the user C of the service provider terminal 140 may use the service provider terminal 140 to receive service requests for the driver D, and/or information or instructions from the server 110.
  • the driver may be designated to use at least one of the service provider terminals 140 and/or 150 types of vehicles for a period of time, for example, one day, one week, one month, or one year. In some other embodiments, the driver may be designated to randomly use one of the service provider terminals 140 and/or one of the vehicles 150. For example, when a driver is available to provide on-demand services, he/she can be assigned to use the driver terminal that receives the earliest request and the vehicle that recommends the on-demand service. In some embodiments, "passenger” and “terminal device” can be used interchangeably, and “driver” and “driver device” can be used interchangeably. In some embodiments, the driver device may be associated with one or more drivers (e.g., night shift drivers, day shift drivers, or randomly moving drivers).
  • drivers e.g., night shift drivers, day shift drivers, or randomly moving drivers.
  • the service requester terminal 130 may include a mobile device 130-1, a tablet computer 130-2, a laptop computer 130-3, a built-in device in a vehicle 130-4, etc., or any combination thereof.
  • the mobile device 130-1 may include a smart home device, a wearable device, a smart mobile device, a virtual reality device, an augmented reality device, etc., or any combination thereof.
  • smart home devices may include smart lighting devices, control devices of smart electrical devices, smart monitoring devices, smart TVs, smart cameras, walkie-talkies, etc., or any combination thereof.
  • the wearable device may include a smart bracelet, smart footwear, smart glasses, smart helmets, smart watches, smart clothing, smart backpacks, smart accessories, etc., or any combination thereof.
  • smart mobile devices may include smart phones, personal digital assistants (PDAs), gaming devices, navigation devices, point of sale (POS) devices, etc., or any combination thereof.
  • the virtual reality device and/or the augmented reality device may include a virtual reality helmet, virtual reality glasses, virtual reality goggles, augmented reality helmets, augmented reality glasses, augmented reality goggles, etc., or any combination thereof.
  • the virtual reality device and/or the augmented reality device may include Google Glass TM , Oculus Rift TM , Hololens TM , Gear VR TM and the like.
  • the built-in devices in the vehicle 130-4 may include a built-in computer, an on-board built-in TV, a built-in tablet computer, and the like.
  • the service requester terminal 130 may include a signal transmitter and a signal receiver, and the signal receiver is configured to communicate with the positioning device 170 to locate the position of the passenger and/or the service requester terminal 130.
  • the service provider terminal 140 may include at least two service provider terminals 140-1, 140-2, ..., 140-n. In some embodiments, the service provider terminal 140 may be similar to or the same as the service requester terminal 130. In some embodiments, the service provider terminal 140 can be customized to implement online transportation services. In some embodiments, the service provider terminal 140 and the service requester terminal 130 may be configured with a signal transmitter and a signal receiver to receive location information of the service provider terminal 140 and the service requester terminal 130 from the positioning device 170. In some embodiments, the service requester terminal 130 and/or the service provider terminal 140 may communicate with other positioning devices to determine the location of the passenger, the service requester terminal 130, the driver, and/or the service provider terminal 140.
  • the service requester terminal 130 and/or the service provider terminal 140 may periodically send the positioning information to the server 110.
  • the service provider terminal 140 may also send the availability status to the server 110 periodically.
  • the availability status may indicate whether the vehicle 150 associated with the service provider terminal 140 is available to transport passengers.
  • the service requester terminal 130 may send positioning information to the server 110 every 30 minutes.
  • the service provider terminal 140 may send the availability status to the server every 30 minutes and/or when the on-demand service is completed.
  • the service requester terminal 130 may send location information to the server 110 every time a user logs in to a mobile application associated with an online on-demand service.
  • the service provider terminal 140 may correspond to one or more vehicles 150.
  • the vehicle 150 can carry passengers and travel to the destination.
  • the vehicle 150 may include at least two vehicles 150-1, 150-2,..., 150-n.
  • One of at least two vehicles may correspond to one order type.
  • Order types can include taxi orders, luxury car orders, express orders, bus orders, shuttle orders, etc.
  • the service may be any online service, such as ordering food, shopping, etc., or a combination thereof.
  • the memory 160 may store data and/or instructions.
  • the memory 160 may store data obtained from the service requester terminal 130 and/or the service provider terminal 140.
  • the memory 160 may store log information associated with the service requester terminal 130.
  • the memory may store bubbling behaviors, billing behaviors, bill-finishing behaviors, etc. that occur through the service requester terminal 130 by the user.
  • the memory 160 may store data and/or instructions that can be executed by the server 110 to provide the on-demand service described in this application.
  • the memory 160 may include mass memory, removable memory, volatile read-write memory, read-only memory (ROM), etc., or any combination thereof. Exemplary mass storage devices may include magnetic disks, optical disks, solid state drives, and the like.
  • Exemplary removable storage may include flash drives, floppy disks, optical disks, memory cards, compact disks, magnetic tapes, and the like.
  • An exemplary volatile read-write memory may include random access memory (RAM).
  • Exemplary RAM may include dynamic RAM (DRAM), double data rate synchronous dynamic RAM (DDR SDRAM), static RAM (SRAM), thyristor RAM (T-RAM), zero capacitor RAM (Z-RAM), and the like.
  • Exemplary ROMs may include mask ROM (MROM), programmable ROM (PROM), erasable programmable ROM (EPROM), electrically erasable programmable ROM (EEPROM), compact disk ROM (CD-ROM), and digital Multi-function disk ROM, etc.
  • the storage 160 may be implemented on a cloud platform.
  • the cloud platform may include private clouds, public clouds, hybrid clouds, community clouds, distributed clouds, intermediate clouds, multiple clouds, etc., or any combination thereof.
  • the positioning device 170 may determine information associated with the object, for example, one or more of the service requester terminal 130, the service provider terminal 140, the vehicle 150, and the like. For example, the positioning device 170 may determine the current time and location of the passenger or driver through the service requester terminal 130. In some embodiments, the positioning device 170 may be Global Positioning System (GPS), Global Navigation Satellite System (GLONASS), Compass Navigation System (COMPASS), Beidou Navigation Satellite System, Galileo Positioning System, Quasi-Zenith Satellite System (QZSS) Wait. This information may include the position, height, speed or acceleration of the object, and/or the current time. The position can be in the form of coordinates, such as latitude coordinates and longitude coordinates.
  • the positioning device 170 may include one or more satellites, for example, a satellite 170-1, a satellite 170-2, and a satellite 170-3.
  • the satellites 170-1 to 170-3 can determine the above-mentioned information independently or collectively.
  • the positioning system 170 may transmit the above-mentioned information to the service requester terminal 130, the service provider terminal 140, or the vehicle 150 via the network 120.
  • one or more components in the on-demand service system 100 can access data or instructions stored in the memory 160 via the network 120.
  • the storage 160 may be directly connected to the server 110 as a back-end storage.
  • one or more components in the on-demand service system 100 may have permission to access the memory 160.
  • one or more components in the on-demand service system 100 can read and/or modify information related to passengers, drivers, and/or vehicles.
  • the server 110 may read and/or modify the user characteristics of one or more passengers after completing the on-demand service order.
  • the information exchange between one or more components of the on-demand service system 100 can be initiated by launching a mobile application of the on-demand service on the terminal device to request the service.
  • the object of the service request can be any product.
  • the product may include food, medicine, commodities, chemical products, electrical appliances, clothes, automobiles, household goods, luxury goods, etc., or any combination thereof.
  • products may include service products, financial products, knowledge products, Internet products, etc., or any combination thereof.
  • Internet products may include personal host products, network products, mobile Internet products, commercial host products, embedded products, etc., or any combination thereof.
  • Mobile Internet products can be used in mobile terminals, programs, systems, etc. or any combination of software.
  • the mobile terminal may include a tablet computer, a laptop computer, a mobile phone, a personal digital assistant (PDA), a smart watch, a point of sale (POS) device, a vehicle-mounted computer, a vehicle-mounted TV, a wearable device, etc., or any combination thereof.
  • the product can be any software and/or application used in a computer or mobile phone.
  • the software and/or application program may involve social networking, shopping, transportation, entertainment, learning, investment, etc., or any combination thereof.
  • transportation-related software and/or applications may include travel software and/or applications, vehicle scheduling software and/or applications, mapping software and/or applications, and the like.
  • vehicles can include horses, carriages, rickshaws (for example, unicycles, bicycles, tricycles, etc.), automobiles (for example, taxis, buses, private cars, etc.), trains, subways, boats, and aircraft (For example, airplane, helicopter, space shuttle, rocket, hot air balloon, etc.), etc., or any combination thereof.
  • rickshaws for example, unicycles, bicycles, tricycles, etc.
  • automobiles for example, taxis, buses, private cars, etc.
  • trains subways, boats, and aircraft (For example, airplane, helicopter, space shuttle, rocket, hot air balloon, etc.), etc., or any combination thereof.
  • the element when an element of the on-demand service system 100 operates, the element may operate through an electrical signal and/or an electromagnetic signal.
  • the service requester terminal 130 may operate a logic circuit in its processor to process such a task.
  • the processor of the service requester terminal 130 may generate an electrical signal encoding the order.
  • the processor of the service requester terminal 130 may transmit the electric signal to the output port. If the service requester terminal 130 communicates with the server 110 via a wired network, the output port may be physically connected to a cable, and the cable also transmits an electric signal to the input port of the server 110.
  • the output port of the terminal 130 may be one or more antennas, which convert electrical signals into electromagnetic signals.
  • the service provider terminal 140 may process tasks through the operation of logic circuits in its processor, and receive instructions and/or service commands from the server 110 via electrical or electromagnetic signals.
  • the processor processes instructions, issues instructions, and/or executes actions, the instructions and/or actions are performed through electrical signals.
  • the processor when the processor retrieves data (for example, the user's historical order information, the user's bubbling behavior information, etc.) from the storage medium (for example, the memory 160), it may send an electrical signal to the reading device of the storage medium, which Can read structured data in storage media.
  • the structured data can be sent to the processor in the form of electrical signals via the bus of the electronic device.
  • the electrical signal may refer to one electrical signal, a series of electrical signals, and/or at least two discrete electrical signals.
  • FIG. 2A is a schematic diagram of exemplary hardware and/or software components of a computing device 200A according to some embodiments of the present application.
  • the computing device 200A may be a general-purpose computer or a special-purpose computer. Both can be used to realize the functions of this application.
  • the computing device 200A can be used to implement any component of the service as described herein.
  • the processing device 112 of the server may be implemented on the computing device 200A through its hardware, software program, firmware, or a combination thereof.
  • the service-related computer functions described herein can be implemented in a distributed manner on at least two similar platforms to distribute the processing load.
  • the computing device 200A may include a communication port 250a connected to a network (e.g., network 120) connected to it to facilitate data communication.
  • the computing device 200A may also include one or more processors 220a for executing program instructions.
  • An exemplary computer platform may include an internal communication bus 210a, various forms of program memory and data storage for various data files processed and/or transmitted by the computer, such as a disk 270a, a read only memory (ROM) 230a, or a random access memory ( RAM) 240a.
  • the exemplary computer platform may also include program instructions stored in ROM 230a, RAM 240a, and/or other types of non-transitory storage media to be executed by CPU 220a.
  • the methods and/or processes of the present application can be implemented as program instructions.
  • the computing device 200A also includes an input/output (I/O) component 260a, which supports input/output between the computer, the user, and other components herein.
  • the computing device 200A can also receive programs and data through
  • the computing device 200A in this application may also include multiple CPUs and/or processors, so the operations and/or method steps performed by one CPU and/or processor described in this application may also be Multiple CPUs and/or processors execute jointly or individually.
  • the CPU and/or the processor of the computing device 200A may perform step A and step B.
  • step A and step B can also be performed jointly or separately by two different CPUs and/or processors in the computing device 200A (for example, the first processor performs step A and the second processor performs Step B, or the first and second processors jointly perform steps A and B).
  • FIG. 2B is a schematic diagram of exemplary hardware and/or software components of a mobile device 200B according to some embodiments of the present application.
  • the mobile device 200B may include a communication module 210b, a display 220b, a graphics processing unit (GPU) 230b, a central processing unit (CPU) 240b, an input/output (I/O) 250b, a memory 260b, and a memory 290b.
  • any other suitable components including but not limited to a system bus or a controller (not shown), may also be included in the mobile device 200B.
  • the mobile operating system 270b eg, iOS TM , Android TM , Windows Phone TM, etc.
  • one or more application programs 280b may be loaded from the memory 290b into the memory 260b so as to be executed by the CPU 240b.
  • the application program 280b may include a browser or any other suitable mobile application program for sending, receiving, and presenting information related to on-demand services (for example, sending car-hailing orders) from the processing device 112 and/or the memory 160.
  • the interaction between the user and the information flow can be implemented through the I/O 250b, and provided to the processing device 112 and/or other components of the on-demand service system 100 through the network 120.
  • Fig. 3 shows a flowchart of an exemplary process for predicting a position according to some embodiments of the present application.
  • the process 300 may include steps S301 to S302.
  • the process 300 may be performed by the processing device 112.
  • the process 300 may be implemented with a set of instructions (for example, an application program) stored in the memory 160.
  • the server 110, the CPU 220a, and/or the CPU 240b can execute the set of instructions, and thus can instruct to execute the process 300.
  • S301 Obtain the first feature value of the user to be predicted under at least one historical behavior feature, and the second feature value of the user to be predicted under the target time feature and the second feature value under the target location feature within a preset time period in the future.
  • the third characteristic value Obtain the first feature value of the user to be predicted under at least one historical behavior feature, and the second feature value of the user to be predicted under the target time feature and the second feature value under the target location feature within a preset time period in the future.
  • the users to be predicted may be all users of an online transportation service platform (for example, a car-hailing platform).
  • an online transportation service platform for example, a car-hailing platform.
  • the location of each user is determined, and the amount of calculation is very large. Therefore, in order to be more targeted, It can also be a user selected from all users of the online transportation service platform based on certain conditions. For example, the location of each user during bubbling can be calculated periodically, and if the bubbling location of any user during the period belongs to at least two cities or a preset area, the user is determined as the user to be predicted. For another example, it is possible to periodically calculate the place of departure from which each user places an order.
  • the user is determined to be a waiting area.
  • Forecast users For another example, it may be determined that a user whose historical order quantity is greater than the threshold order quantity in a threshold time period is a user to be predicted.
  • the threshold time period can be the past day, week, month, year, etc.
  • the methods of obtaining the third characteristic value are described separately:
  • FIG. 4 shows a flowchart of a process of obtaining a first characteristic value under a first historical behavior characteristic and a first characteristic value under a second historical behavior characteristic according to some embodiments of the present application.
  • the process 400 may include steps S401 to S403.
  • the process 400 may be performed by the processing device 112.
  • the process 400 may be implemented with a set of instructions (for example, an application program) stored in the memory 160.
  • the server 110, the CPU 220a, and/or the CPU 240b can execute the set of instructions, and therefore can instruct to execute the process 400.
  • S401 Extracting regional historical behavior information of the user to be predicted in different regions (including but not limited to the region to be predicted) from at least one type of historical behavior information of the user to be predicted, and extracting multiple historical behaviors of the user to be predicted Regional time historical behavior information in different regions within each historical time period in the time period.
  • the area may be a province, city, county, town, district, etc., in a municipal plan.
  • the area may be a grid divided according to latitude and longitude.
  • the area may be an area in which longitude and latitude are between 90 degrees to 91 degrees east longitude and 30 degrees to 31 degrees north latitude, respectively.
  • the area may also be an arbitrarily divided area of interest (AOI).
  • AOI area of interest
  • the area may be a school, a shopping mall, a scenic spot, a hospital, etc.
  • the area may be a user-defined area. For example, an area with a certain point as the center and a certain distance as a radius.
  • the historical behavior of the user to be predicted may include any operation behavior of the user in the online transportation service platform.
  • the historical behavior may include login behavior, bubbling behavior, billing behavior, order completion behavior, evaluation behavior, consulting behavior, subscription behavior, etc., or any combination thereof.
  • the historical behavior may be a historical behavior in a certain period of time in the past (for example, the previous day, the previous week, the previous month, the last year, etc.). Each type of historical behavior can correspond to a type of historical behavior information.
  • bubbling means that the service requester enters the home page of the service software of the online transportation service platform, and sends the travel starting point and travel destination of the service requesting end to the online transportation service platform.
  • the online transportation service platform receives the travel starting point and travel end point sent by the service requester, and determines to monitor bubbling behavior on the service requesting end.
  • the user can log in to the service software through the user terminal (for example, the service requester terminal 130), the user terminal can upload the location information of the user terminal through the positioning system (for example, the positioning device 170) as the starting point of travel, and the user can manually input or drag
  • the travel destination is input by dragging the map.
  • the on-demand service system 100 may store the user's behavior as a bubbling behavior and store it in the memory 160.
  • the online transportation service platform After the online transportation service platform obtains the travel starting point and travel end point sent by the service requesting terminal, it determines the estimated order price for the service requesting terminal for this trip, and sends the estimated order price to the service requesting terminal. After receiving the estimated order price, the service requester generates order information based on the passenger's trigger, and sends the order information to the online transportation service platform. This behavior is the billing behavior of the service requester.
  • the online transportation service platform After the online transportation service platform receives the order information sent by the service requester, it generates an order and pushes the order to multiple qualified service providers, and matches the service provider for the service requester based on the feedback of the service provider.
  • the service provider matched with the service requester After the service provider matched with the service requester completes the service, it will feedback the service completion information to the online transportation service platform.
  • the online transportation service platform initiates fee payment to the service requester. After the service requester completes the payment, it is deemed that the service requester has completed the order.
  • the billing behavior is not the same as the billing behavior. Only when the order is completed and the payment is completed, the user behavior is considered to be the order completion behavior. If in the middle of the service process, the service provider, service requester, or online ride-hailing party cancels the service, the order completion behavior corresponding to the order will no longer occur.
  • Each historical behavior can include corresponding historical behavior information.
  • the historical behavior information corresponding to the bubbling behavior includes historical bubbling behavior information;
  • the historical behavior information corresponding to the billing behavior includes historical billing behavior information;
  • the historical behavior information corresponding to the billing behavior includes historical billing behavior information.
  • Each type of historical behavior information may include the time when the historical behavior occurred and the place where the historical behavior occurred. For example, when a user has a historical behavior through the user terminal, the user terminal may send the historical behavior occurrence time and the historical behavior occurrence location when the historical behavior occurred to the memory 160 for storage. As described here, the time when the historical behavior occurred and the place where the historical behavior occurred respectively refer to the time and place when the historical behavior was completed. For example, a user can log in to the service software.
  • the online transportation service platform can record the user's behavior as a bubbling behavior, and input the time and place of the trip start and/or end of the trip as a historical behavior Occurrence time and location of the historical behavior; in response to the user triggering the generation of order information, the online transportation service platform can update the user’s behavior as an ordering behavior, and determine the time and place that triggered the generation of the order information as the historical behavior time and history Where the act occurred.
  • the at least one type of historical behavior information of the user to be predicted includes historical behavior information of at least one of the foregoing historical behaviors.
  • the following methods may be used to obtain at least one type of historical behavior information, regional historical behavior information, and regional time historical behavior information of the user to be predicted.
  • the i-th historical behavior information is recorded as x i .
  • the historical behavior information X of the user to be predicted may be obtained from the memory 160 or other storage devices (not shown).
  • each historical behavior information X determines the regional historical behavior information X city of the user to be predicted in different regions.
  • the j-th regional historical behavior information is recorded as .
  • some or all of the different regions may be regions to be predicted.
  • the regional historical behavior information of the user to be predicted in different regions the historical behavior occurrence time of X city and the preset multiple historical time periods, it is determined that the user to be predicted is in a different historical time period in each of the multiple historical time periods.
  • the regional time history behavior information of the region X city,t The number of groups of the regional time historical behavior information X city, t may be equal to the number of segments N of the multiple historical time periods ⁇ the number of regions M.
  • the n-th regional historical time behavior information of the m-th group is recorded as
  • multiple historical time periods can be specifically set according to actual needs.
  • multiple historical time periods can be divided according to the distance from the current time, for example, 0-2 days from the current, 2-5 days from the current, and The current 5-10 days, the current 10-20 days, the current 20-50 days, the current 50-100 days, and the current 100 days or more.
  • the corresponding regional historical time behavior information can be 7 groups.
  • S402 Determine the first feature under the first historical behavior feature of the user to be predicted according to the at least one type of historical behavior information of the user to be predicted, the regional historical behavior information, and the regional time historical behavior information value.
  • S403 Determine a first characteristic value under a second historical behavior characteristic of the user to be predicted according to the at least one historical behavior information of the user to be predicted.
  • the first historical behavior feature includes but is not limited to one or more of the following a1 to a5:
  • a1 The number of occurrences of historical behaviors corresponding to multiple points of interest (POI) categories.
  • POI classification is a classification determined in advance for different POIs. For example: airports, entertainment venues, shopping malls, schools, etc.
  • the number of occurrences of historical behaviors per day in a preset time period For example, the number of occurrences of historical behavior from Monday to Sunday.
  • the number of occurrences of historical behavior on working days refers to the number of bubbling behaviors and billing behaviors on working days.
  • the corresponding occurrence times of different historical behaviors include the corresponding occurrence times of bubbling behavior and the occurrence times corresponding to billing behavior.
  • the second historical behavior feature includes but is not limited to one or more of the following b1 to b5:
  • b1 Whether the area where the last historical behavior occurred is the area; where the last historical behavior refers to the current time (the current time can refer to any preset time, such as model training time, model prediction time or other specified Time) the most recent historical act.
  • the time interval between the time when the last historical behavior occurred and the time to be predicted for example, if the time when the last historical behavior occurred is 2 days from the current time, and the time to be predicted is 1 day in the future, then the time interval For 3 days. For another example, if the occurrence time of the last historical behavior is 5 hours from the current time, and the time to be predicted is 5 to 8 hours in the future, the time interval is 10 hours.
  • the number of areas reached by the users to be predicted refers to the area determined by the on-demand service system 100 to locate the user terminal through the positioning device 170 after the user logs in to the service software through the user terminal (for example, the service requester terminal 130). For example, if the on-demand service system 100 uses the positioning device 170 to locate the user who has logged in to the service software in Chengdu, it can determine that the user has arrived in Chengdu.
  • the area and the time to be predicted may be the default settings of the on-demand service system 100, or may be specifically set according to actual application conditions.
  • the area may be an area where the user to be predicted has reached. In some embodiments, the area may also be an area that the user to be predicted has not reached. For example, the area may be a destination to be predicted that the user has searched for.
  • the time to be predicted is a preset time period in the future.
  • the location prediction method provided by the embodiment of the present application can predict the probability that the user to be predicted will appear in the area in a preset time period in the future.
  • the first feature value of the user to be predicted under at least one historical behavior feature may include three parts:
  • the first part According to at least one type of historical behavior information X of the user to be predicted, a feature value under the first historical behavior feature corresponding to the at least one type of historical behavior information X of the user to be predicted is determined.
  • the predetermined time period in the future is the time to be predicted.
  • the second feature value under the target time feature may reflect information related to the time to be predicted.
  • the target time feature may include whether the future preset time period is the day of the week, and whether the future preset time period is a holiday or a working day. In some embodiments, the target time feature may also include the attribute of the future preset time period. For example, whether it is in peak hours or peak off hours.
  • the target time feature can be determined, so that the second feature value of the user to be predicted under the target time feature in a preset time period in the future can be determined.
  • the second feature value may be a digitization of the target time feature.
  • the second feature value may be a vector obtained by encoding the target time feature.
  • the target location feature may include the attributes of the area. For example, whether the area is a tourist city, whether it is a national capital city, the POI classification type near the area, and the distance between the area and the user's home and/or company to be predicted.
  • the corresponding third characteristic value can be determined according to the attributes of the area.
  • the third feature value may be the digitization of the target location feature.
  • the third feature value may be a vector obtained by encoding the target location feature.
  • the process 300 further includes:
  • S302 Input the first feature value, the second feature value, and the third feature value into a pre-trained location prediction model, and obtain the location prediction of the user to be predicted in the future preset time period result.
  • the location prediction result of the user to be predicted in the predetermined time period in the future may be the probability that the user to be predicted will appear at the location to be predicted in the predetermined time period in the future.
  • the probability that the user to be predicted will appear at multiple locations to be predicted within a preset time period in the future can also be predicted.
  • the third feature value corresponding to each area of the user to be predicted is also obtained, and the input of multiple groups of location prediction models is formed , Respectively input into the location prediction model to obtain the probability that the user to be predicted will appear in each area at a preset time period in the future.
  • the on-demand service system 100 may determine the area with the greatest probability as the location where the user to be predicted will appear in the future preset time period.
  • the first feature value a of the user to be predicted the second feature value b under the target time feature in the future preset time period, and obtain the user to be predicted in the future preset time period, and the area A, B and C correspond to the third characteristic values c1, c2, and c3, respectively.
  • three sets of input data are formed: a, b, and c1; a, b, and c2; a, b, and c3.
  • the three sets of input data are sequentially input into the pre-trained location prediction model to obtain the probabilities of the users to be predicted in places A, B, and C in the next twenty-four hours.
  • the user to be predicted after determining the probability that the user to be predicted appears in multiple regions, it can be sorted according to the probability of the user to be predicted appearing in the multiple regions, and the region with the highest probability can be determined as the user to be predicted in the future. Set the most likely location within the time period, so that resource scheduling and related service strategy configuration can be performed in advance based on the prediction result.
  • Fig. 5 shows a flowchart of a process of training a position prediction model according to some embodiments of the present application.
  • the process 500 may include steps S501 to S502.
  • the process 500 may be performed by the processing device 112.
  • the process 500 may be implemented with a set of instructions (for example, an application program) stored in the memory 160.
  • the server 110, the CPU 220a, and/or the CPU 240b can execute the set of instructions, and therefore can instruct to execute the process 500.
  • the process 500 may be performed by other systems or devices other than the on-demand service system 100. For example, implemented by equipment or systems provided by a manufacturing supplier.
  • S501 Acquire a first sample feature value of a plurality of sample users under at least one historical behavior characteristic, and a second sample feature value of each of the sample users under a target time characteristic within a historical preset time period and The third sample feature value under the target location feature; the sample users include positive sample users and negative sample users.
  • S502 Train the position prediction model based on the first sample feature value, the second sample feature value, and the third sample feature value of each sample user.
  • the position prediction model may include xgboost (eXtreme Gradient Boosting), Recurrent Neural Network (RNN), Long Short-Term Memory (LSTM), and Decision Tree (DT). ) Model, gradient boosting decision tree (Gradient Boosting Decision Tree, GBDT) model, k-nearest neighbor algorithm (K-Nearest Neighbor, kNN) model, convolutional neural network (Convolutional Neural Networks, CNN), artificial neural network (Artificial Neural) Networks, ANN) models, etc., or any combination thereof.
  • xgboost eXtreme Gradient Boosting
  • RNN Recurrent Neural Network
  • LSTM Long Short-Term Memory
  • DT Decision Tree
  • the historical preset time periods of different sample users may be the same or different. For example, sample user A determines "January 15, 2018” as the historical preset time period, and sample user B determines "October 10, 2018” as the historical preset time period. However, the duration of the historical preset time period of different sample users is the same, and the duration of the aforementioned future preset time period is the same.
  • the sample user appears in the area during the historical preset time period, the sample user is a positive sample user. If the sample user does not appear in the area in the historical preset time period, the sample user is a negative sample user.
  • the probability of their appearance in the area during the historical preset time period can be set to 1; the negative sample user does not appear in the historical preset time period In the predicted area, the probability of appearing in the area in the historical preset time period can be set to zero.
  • the process of training the model After the first sample feature value, second sample feature value, and third sample feature value of each sample user are input into the location prediction model, the output result of the location prediction model should be as close as possible to the The probability of the area.
  • the location prediction model can only predict the probability that the user to be predicted will appear in a certain area within a preset time period in the future, or it may be for the user to be predicted to appear in multiple areas to be preset within a preset time period in the future. Therefore, for these two different situations, the situation of the determined sample users is also different.
  • the location prediction model can only predict the probability of the user to be predicted in a certain area within a preset time period in the future, the following methods can be used to determine the sample users:
  • Filters users related to the area may include users who have visited the area and/or users who have searched the area.
  • the following method can be used to determine the first sample feature value of the sample user:
  • For each sample user extract regional historical behavior information of the sample user in the region from at least one type of historical behavior information of the sample user, and extract each historical time period of the sample user in a plurality of historical time periods Inside, the regional time history behavior information of the area to be tested;
  • a first sample characteristic value under the second historical behavior characteristic of the sample user is determined.
  • the first sample feature value under the first historical behavior feature of the sample user and the first sample feature value under the second historical behavior feature of the sample user are the same as the method for determining the first feature value of the user to be predicted Similar, I won't repeat it here.
  • the first sample feature value of the sample user is constructed based on historical behavior information before the historical preset time period.
  • the location prediction model can predict the probability of the user to be predicted in multiple areas within a preset time period in the future, the following methods can be used to determine the sample users:
  • the following methods can be used to obtain the first sample feature value, the second sample feature value, and the third sample feature value of the sample user:
  • any one of the following methods (1) and (2) may be used to obtain the first sample feature value of each sample user:
  • a first sample characteristic value under the second historical behavior characteristic of the sample user is determined.
  • the regional historical behavior information of the user in at least one region and extracting the regional time historical behavior information corresponding to each region in at least one region in each historical time period of the multiple historical time periods, it is determined that the user is in At least one area corresponds to the first sample feature value of each area under the first historical behavior feature;
  • a first sample feature value under the second historical behavior feature corresponding to each area of the user in at least one area is determined.
  • the regional historical behavior information corresponding to the region M1 can be extracted, and the regional time historical behavior information corresponding to the region M1 can be extracted.
  • a first sample feature value corresponding to the user C and the area M1 is constructed.
  • a first sample feature value corresponding to the user C and the area M2 is constructed.
  • the first sample feature value under the first historical behavior feature of the sample user and the first sample feature value under the second historical behavior feature of the sample user are the same as the method for determining the first feature value of the user to be predicted Similar, I won't repeat it here.
  • sample users in different regions can be the same or different. For example, if user A has appeared in area A and area B, then the user A can be a sample user corresponding to area A or a sample user corresponding to area B.
  • the historical behavior information of A in area A and B is different, when A is a sample user in area A and B, the first sample feature value corresponding to area A is generated, and The feature value of the first sample corresponding to the B area is different. The corresponding characteristic value of the third sample is also different.
  • the position prediction model can be trained in the following way:
  • the user for example only, the user’s historical behavior related data (for example, the first feature value, the second feature value, and the third feature value) can be used as input, and the corresponding analysis result can be used as the output and the user appears in the historical preset time period.
  • the correct standard Ground Truth
  • the model parameters can be adjusted inversely according to the direct difference between the predicted output of the model (for example, the predicted position) and the correct standard.
  • the training process will stop .
  • a certain preset condition for example, the training sample reaches a predetermined number, the prediction accuracy of the model is greater than a predetermined accuracy threshold, or the value of the loss function (Loss Function) is less than a preset value, the training process will stop .
  • the loss function Loss Function
  • the embodiment of the present application also provides a specific method for training the position prediction model based on the first sample feature value and the second sample feature value of each sample user, including: The first sample feature value, the second sample feature value, and the third sample feature value are constructed to construct multiple sub-decision trees; based on the multiple sub-decision trees, it is determined as the position prediction model.
  • Fig. 6 shows a flowchart of a process of training a position prediction model according to some embodiments of the present application.
  • the process 600 may include steps S601 to S605.
  • the process 600 may be performed by the processing device 112.
  • the process 600 may be implemented with a set of instructions (for example, an application program) stored in the memory 160.
  • the server 110, the CPU 220a, and/or the CPU 240b can execute the set of instructions, and therefore can instruct to execute the process 600.
  • the process 600 may be performed by other systems or devices other than the on-demand service system 100. For example, implemented by equipment or systems provided by a manufacturing supplier.
  • S601 randomly determine multiple target characteristics from the historical behavior characteristics, the target time characteristics, and the target area characteristics;
  • S602 Construct a sub-decision tree of the current iteration cycle based on the feature value of the sample user under each of the target features;
  • S603 Based on the sub-decision tree of the current iteration period and the sub-decision tree of the historical iteration period, constitute the current decision tree set, and determine the loss of the current decision tree set;
  • S604 Detect whether the loss of the current decision tree set is greater than the preset loss threshold; if so, jump to S601; the current iteration cycle is completed. If not, skip to S605.
  • S605 Determine the current decision tree set as the position prediction model.
  • multiple target characteristics are randomly determined, based on the multiple randomly determined target characteristics and the characteristic value of each sample user under the corresponding target characteristic, and the relationship with the sample user
  • the corresponding probability of appearing at the position to be predicted is trained on the position prediction model to obtain the decision tree set.
  • the target feature is adjusted according to the loss of the aforementioned decision tree set, and the selection probability prediction model is trained according to the newly re-determined target feature to obtain the current decision tree set.
  • the user verifies the current decision tree set based on the test sample, and determines the loss of the current decision tree set.
  • the location prediction model predicts the location of the user to be predicted, it inputs the first feature value, the second feature value, and the third feature value into a pre-trained location prediction model to obtain each Sub-prediction results of the decision tree;
  • the embodiment of the application obtains the first characteristic value of the user to be predicted under at least one historical behavior characteristic, and the second characteristic value of the user to be predicted under the target time characteristic and the target location characteristic within a preset time period in the future. And input the first feature value, the second feature value, and the third feature value into the pre-trained location prediction model to obtain the location prediction of the user to be predicted in the future preset time period.
  • the pre-trained location prediction model to obtain the location prediction of the user to be predicted in the future preset time period.
  • FIG. 7A shows a flowchart of an exemplary process for predicting a position according to some embodiments of the present application.
  • the process 700A may include steps S701 to S703.
  • the process 700A may be performed by the processing device 112.
  • the process 700A may be implemented with a set of instructions (eg, application programs) stored in the memory 160.
  • the server 110, the CPU 220a, and/or the CPU 240b can execute the set of instructions, and therefore can instruct to execute the process 700A.
  • S701 Obtain the first feature value of the user to be predicted under at least one historical behavior feature, and the second feature value of the user to be predicted under the target time feature and the second feature value under the target location feature within a preset time period in the future The third characteristic value.
  • S701 is similar to the implementation of S301 described above, and will not be repeated here.
  • S702 Obtain a fourth characteristic value of the user to be predicted under multiple user attribute characteristics.
  • the user attribute characteristics include one or more of the following:
  • the number of historical orders of the user whether it is a business person, whether it is a tourist, the area where the user’s home is located, and the area where the user’s company is located.
  • S703 Input the first feature value, the second feature value, the third feature value, and the fourth feature value into a pre-trained location prediction model, and obtain the user to be predicted in the future The location prediction result within a preset time period.
  • S703 is similar to the implementation of S302 described above. I won't repeat them here.
  • the fourth sample feature value of each of the sample users under multiple user attribute features can also be obtained; when the location prediction model is trained, Training the position prediction model based on the first sample feature value, the second sample feature value, the third sample feature value, and the fourth sample feature value of each sample user.
  • the specific training method is similar to the model training method in S302, and will not be repeated here.
  • the historical ride-hailing information of the user to be predicted, the preset time period in the future, and the area to be predicted may be processed separately with user attribute characteristics and then input into the trained location prediction model to determine the Predict the probability that the user will be in the area within a preset time period in the future.
  • the historical taxi information may include the origin, destination, attributes of both, and time information.
  • the attributes can include the city, area, or POI information of the origin or destination.
  • the time information may include the time of issuing the order, the time of departure or the time of arrival, etc.
  • several historical ride-hailing information before the time to be predicted may be input into the position prediction model, such as historical ride-hailing information 100 times, 200 times, 500 times before the time to be predicted, and so on.
  • the position prediction model can be obtained by training the initial neural network model.
  • the initial neural network model may include xgboost model, recurrent neural network, long short-term memory network, k-nearest neighbor (kNN) model, convolutional neural network (Convolutional Neural Networks, CNN) , Artificial Neural Networks (ANN) models, etc., or any combination thereof.
  • Fig. 7B shows a flowchart of a prediction process of a position prediction model according to some embodiments of the present application. As shown in FIG. 7B, the historical ride-hailing information of the user to be predicted, user attribute characteristics, future preset time period, and the area to be predicted can be obtained, and then input into the location prediction model.
  • the vector xt and vector q are processed based on the attention mechanism to obtain the vector e.
  • the position prediction model may forward the vector e, and obtain y after two-layer neural network and activation function processing. The y may reflect the arrival of the user to be predicted at the location to be measured within a preset time period in the future. Probability.
  • training data may be obtained to train the initial neural network model to obtain a position prediction model.
  • the training data may include historical ride-hailing information, target time period, target area, user attribute characteristics, and label data of multiple sample users.
  • the label data reflects whether the sample user has reached the target area during the corresponding target time period. For example, if the sample user has reached the target area during the corresponding target time period, the label can be 1, otherwise, it is 0. For different sample users, the corresponding target time and target area may be different.
  • the prediction result of the model may be a probability value, which reflects the predicted probability of the sample user going to the target area within the target time period.
  • the loss function can be constructed based on the difference between the label data of the training data and the model prediction result, and the parameters of the model can be adjusted based on the loss function. When the value of the loss function is less than a certain preset value, the training process will stop. Otherwise, update the parameters of the initial neural network model and recalculate until the value of the loss function is less than the preset value.
  • the position prediction end-to-end model can be updated periodically or in real time according to the update of the user's behavior on the online transportation service platform. For example, when some users in the platform complete a new order, the on-demand service system 100 can update the user's historical taxi information based on the new order completion behavior, and then generate new training samples based on the updated historical taxi information, and then update the location Forecast end-to-end models.
  • Fig. 8 shows an exemplary block diagram of a position prediction apparatus according to some embodiments of the present application.
  • the position prediction device may include an acquisition module 81, a prediction module 82 and a training module 83.
  • the acquiring module 81 is configured to acquire the first characteristic value of the user to be predicted under at least one historical behavior characteristic, and the second characteristic value of the user to be predicted under the target time characteristic and the second characteristic value of the user to be predicted in the future preset time period.
  • the prediction module 82 is configured to input the first feature value, the second feature value, and the third feature value into a pre-trained position prediction model to obtain the preset time of the user to be predicted in the future The position prediction result of the segment.
  • the acquiring module 81 is configured to acquire the first characteristic value of the user to be predicted under at least one historical behavior characteristic in the following manner:
  • the first feature value under the second historical behavior feature of the user to be predicted is determined.
  • the first historical behavior feature includes one or more of the following:
  • the number of occurrences of historical behaviors corresponding to multiple POI classifications of points of interest The number of occurrences of historical behaviors per day within a preset time period, the number of occurrences of historical behaviors on working days, the number of occurrences of historical behaviors on non-working days, and different histories The number of occurrences corresponding to each behavior.
  • the second historical behavior feature includes one or more of the following:
  • the region where the last historical behavior occurred is the region, the time interval between the time when the last historical behavior occurred and the time to be predicted, the POI classification of the destination of the last historical behavior, and the departure place of the last historical behavior
  • the historical behavior information includes one or more of historical bubbling behavior information, historical billing behavior information, and historical billing behavior information.
  • the acquiring module 81 is further configured to acquire the fourth characteristic value of the user to be predicted under multiple user attribute characteristics
  • the prediction module 82 is configured to obtain the position prediction result of the user to be predicted in the future preset time period in the following manner:
  • the inputting the first feature value, the second feature value, the third feature value, and the fourth feature value into a pre-trained position prediction model, and obtaining the user to be predicted in the future The location prediction result within a preset time period.
  • the user attribute characteristics include one or more of the following: the number of orders, whether it is a business person, whether it is a tourist, the area where the user's home is located, and the area where the user's company is located.
  • the training module 83 is used to train the position prediction model using the following device:
  • the sample users include positive sample users and negative sample users;
  • the training module 83 is configured to obtain the first sample feature value of a plurality of sample users under at least one historical behavior characteristic in the following manner, and the historical preset time period of each sample user , The second sample feature value under the target time feature and the third sample feature value under the target location feature:
  • the training module 83 is configured to train the position prediction model based on the first sample feature value, the second sample feature value, and the third sample feature value of each sample user in the following manner:
  • the training module 83 is configured to train the position prediction model based on the first sample feature value and the second sample feature value of each sample user in the following manner:
  • a plurality of the sub-decision trees are determined as the position prediction model.
  • the training module 83 is configured to construct the first sample feature value, the second sample feature value, and the third sample feature value of each sample user in the following manner Multiple sub-decision trees:
  • the loss is greater than the preset loss threshold, complete the current iteration cycle and return to the historical behavior characteristics, the target time characteristics, the target area characteristics, and the user attribute characteristics. Steps of target characteristics;
  • the current decision tree set is determined as the position prediction model.
  • the training module 83 is used to determine the loss of the current decision tree set in the following manner:
  • the first test sample feature value of multiple test sample users under the historical behavior characteristics, and the second test sample feature value of each test sample user under the to-be-predicted time feature and the regional feature are input to all In the current decision tree set, obtain a location prediction result corresponding to each of the test sample users;
  • the loss of the current decision tree set is determined.
  • the training module 83 is further configured to obtain the fourth sample feature value of each sample user under multiple user attribute features
  • the training module 83 is configured to train the position prediction model based on the first sample feature value, the second sample feature value, and the third sample feature value of each sample user in the following manner:
  • the prediction module 82 is configured to input the first feature value, the second feature value, and the third feature value into a pre-trained position prediction model in the following manner to obtain The location prediction result of the user to be predicted in the future preset time period:
  • the aforementioned modules may be connected or communicate with each other via a wired connection or a wireless connection.
  • Wired connections may include metal cables, optical cables, hybrid cables, etc., or any combination thereof.
  • the wireless connection may include a connection in the form of LAN, WAN, Bluetooth, ZigBee, or NFC, or any combination thereof. Two or more modules can be combined into a single module, and any one module can be divided into two or more units.
  • an embodiment of the present application also provides an electronic device, including: a bus 210a, a processor 220a, a communication port 250a, an input/output 260a, and a storage medium (for example, a magnetic disk 270a, a read-only memory (ROM) 230a , Or random access memory (RAM) 240a), the storage medium stores machine-readable instructions executable by the processor 220a, and when the electronic device is running, the processor 220a communicates with the storage medium through a bus 210a.
  • the processor 220a executes the machine-readable instructions to execute the steps of the position prediction method provided in the embodiments of the present application when executed.
  • the embodiment of the present application also provides a computer-readable storage medium on which a computer program is stored, and the computer program executes the steps of the position prediction method provided in the embodiment of the present application when the computer program is run by a processor.
  • the modules described as separate components may or may not be physically separated, and the components displayed as modules may or may not be physical units, that is, they may be located in one place, or they may be distributed on multiple network units. Some or all of the units may be selected according to actual needs to achieve the objectives of the solutions of the embodiments.
  • the functional units in the various embodiments of the present application may be integrated into one processing unit, or each unit may exist alone physically, or two or more units may be integrated into one unit.
  • the function is implemented in the form of a software functional unit and sold or used as an independent product, it can be stored in a nonvolatile computer readable storage medium executable by a processor.
  • the technical solution of this application essentially or the part that contributes to the existing technology or the part of the technical solution can be embodied in the form of a software product, and the computer software product is stored in a storage medium, including Several instructions are used to make a computer device (which may be a personal computer, a server, or a network device, etc.) execute all or part of the steps of the method described in each embodiment of the present application.
  • the aforementioned storage media include: U disk, mobile hard disk, ROM, RAM, magnetic disk or optical disk and other media that can store program codes.

Abstract

A method and device for location prediction. The method comprises: acquiring a feature associated with historical behavior of a user whose location is to be predicted, the feature at least comprising location information relating to the historical behavior of the user; and using a pre-trained location prediction model to acquire, at least on the basis of the feature associated with the historical behavior of the user, a result of a predicted location of the user within a preset future time period. The method enables determination of a location of a user within a preset future time period, such that location information of the user can be predicted in advance, thereby facilitating scheduling of resources in advance and configuration of related service schemes.

Description

一种位置预测方法以及装置Method and device for position prediction
交叉引用cross reference
本申请要求于2019年01月21日提交的申请号为201910055395.8的中国专利申请的优先权,其内容以引用的方式被包含于此。This application claims the priority of the Chinese patent application with application number 201910055395.8 filed on January 21, 2019, the content of which is included here by reference.
技术领域Technical field
本申请涉及机器学习技术领域,具体而言,涉及一种位置预测方法以及装置。This application relates to the field of machine learning technology, and in particular to a position prediction method and device.
背景技术Background technique
通过获取用户位置信息,可以为用户匹配与用户位置相关的个性化服务、激励策略、以及运力调配策略等。例如,通过预测次日用户所在的城市,可以找到异地旅游或者出差的用户,进行差异化运营,提升用户体验。By obtaining user location information, it is possible to match users with personalized services, incentive strategies, and capacity allocation strategies related to the user's location. For example, by predicting the city where the user is located on the next day, users who travel or travel in different places can be found to perform differentiated operations and improve user experience.
当前在确定用户位置的时候,可以实时获取用户定位位置信息,但是实时获取定位位置的方式及时性较差,不利于提前进行资源调配,及相关服务策略的配置。Currently, when determining the user's location, the user's location location information can be obtained in real time, but the method of obtaining the location location in real time has poor timeliness, which is not conducive to early resource allocation and the configuration of related service strategies.
发明内容Summary of the invention
本申请实施例的目的在于提供一种位置预测方法以及装置,能够以更高的准确率来确定待预测用户在未来预设时间段内的位置。The purpose of the embodiments of the present application is to provide a location prediction method and device, which can determine the location of the user to be predicted within a preset time period in the future with higher accuracy.
根据本申请的第一方面,提供了一种位置预测方法。该方法可以包括:获取与待预测用户历史行为相关联的特征,所述特征至少包括与待预测用户历史行为相关的位置信息;至少基于与待预测用户历史行为相关联的特征,通过预先训练的位置预测模型获取所述待预测用户在所述未来预设时间段的位置预测结果。According to the first aspect of the present application, a position prediction method is provided. The method may include: acquiring features associated with the historical behavior of the user to be predicted, the features including at least location information related to the historical behavior of the user to be predicted; at least based on the features associated with the historical behavior of the user to be predicted, through pre-trained The location prediction model obtains the location prediction result of the user to be predicted in the future preset time period.
在一些实施例中,与待预测用户历史行为相关联的特征可以包括待预测用户在至少一种历史行为特征下的第一特征值,以及所述待预测用户在未来预设时间段内,在目标时间特征下的第二特征值和在目标位置特征下的第三特征值。In some embodiments, the feature associated with the historical behavior of the user to be predicted may include the first feature value of the user to be predicted under at least one historical behavior feature, and the user to be predicted within a preset time period in the future The second feature value under the target time feature and the third feature value under the target location feature.
在一些实施例中,所述待预测用户在所述未来预设时间段的位置预测结果可以包括待预测用户在未来预设时间段内去到待预测区域的概率;所述第一特征反映所述至少一种历史行为在不同区域的分布信息、所述至少一种历史行为在时间上的分布信息、至少一种历史行为与所述未来预设时间段的关联信息、至少一种历史行为与所述待预测区域的关联信息中的一种或多种的组合;所述第二特征反映所述未来预设时间段的属性信息;所述第三特征反映所述待预测区域的属性信息。In some embodiments, the location prediction result of the user to be predicted in the future preset time period may include the probability that the user to be predicted will go to the area to be predicted in the future preset time period; the first feature reflects the The distribution information of the at least one historical behavior in different regions, the distribution information of the at least one historical behavior in time, the association information between the at least one historical behavior and the future preset time period, the at least one historical behavior and One or a combination of one or more of the associated information of the region to be predicted; the second feature reflects the attribute information of the future preset time period; the third feature reflects the attribute information of the region to be predicted.
在一些实施例中,所述获取待预测用户在至少一种历史行为特征下的第一特征值, 包括:从待预测用户的至少一种历史行为信息中,提取所述待预测用户在待预测区域的区域历史行为信息,以及提取所述待预测用户在多个历史时间段中每个历史时间段内,在待预测区域的区域时间历史行为信息;根据所述待预测用户的所述至少一种历史行为信息、所述区域历史行为信息以及所述区域时间历史行为信息,确定所述待预测用户的第一历史行为特征下的第一特征值;根据所述待预测用户的所述至少一种历史行为信息,确定所述待预测用户的第二历史行为特征下的第一特征值。In some embodiments, the acquiring the first feature value of the user to be predicted under at least one historical behavior feature includes: extracting the user to be predicted from the at least one historical behavior information of the user to be predicted Regional historical behavior information of the region, and extracting the regional time historical behavior information of the user to be predicted in the region to be predicted in each historical time period among the multiple historical time periods; according to the at least one user to be predicted The first feature value under the first historical behavior feature of the user to be predicted is determined based on the historical behavior information, the regional historical behavior information, and the regional time historical behavior information; This kind of historical behavior information determines the first feature value under the second historical behavior feature of the user to be predicted.
在一些实施例中,所述第一历史行为特征可以包括如下一种或者多种:与多个兴趣点POI分类分别对应的历史行为的发生次数、历史行为在预设时间段内每天的发生次数、历史行为在工作日的发生次数、历史行为在非工作日的发生次数、不同历史行为分别对应的发生次数。In some embodiments, the first historical behavior feature may include one or more of the following: the number of occurrences of historical behaviors corresponding to multiple POI classifications of points of interest, and the number of occurrences of historical behaviors per day within a preset time period. , The number of occurrences of historical behaviors on working days, the number of occurrences of historical behaviors on non-working days, and the number of occurrences corresponding to different historical behaviors.
在一些实施例中,所述第二历史行为特征可以包括以下一种或者多种:最后一次历史行为发生的区域是否为待预测区域、最后一次历史行为发生的时间与所述待预测时间之间的时间间隔、最后一次历史行为的目的地的POI分类、最后一次历史行为的出发地的POI分类、所述待预测用户到达过的区域数量。In some embodiments, the second historical behavior feature may include one or more of the following: whether the area where the last historical behavior occurred is the area to be predicted, and the time between the last historical behavior and the time to be predicted The time interval, the POI classification of the destination of the last historical behavior, the POI classification of the departure place of the last historical behavior, and the number of areas reached by the user to be predicted.
在一些实施例中,所述历史行为信息可以包括:历史冒泡行为信息、历史发单行为信息以及历史完单行为信息中一种或者多种。In some embodiments, the historical behavior information may include one or more of historical bubbling behavior information, historical billing behavior information, and historical billing behavior information.
在一些实施例中,该方法还可以包括:获取所述待预测用户的用户属性特征;所述至少基于与待预测用户历史行为相关联的特征,通过预先训练的位置预测模型获取所述待预测用户在所述未来预设时间段的位置预测结果,包括:基于与待预测用户历史行为相关联的特征以及所述待预测用户的用户属性特征,通过预先训练的位置预测模型获取所述待预测用户在所述未来预设时间段的位置预测结果。In some embodiments, the method may further include: obtaining the user attribute characteristics of the user to be predicted; and obtaining the user attribute characteristics of the user to be predicted through a pre-trained location prediction model based at least on the characteristics associated with the historical behavior of the user to be predicted The position prediction result of the user in the future preset time period includes: obtaining the to-be-predicted through a pre-trained position prediction model based on the characteristics associated with the historical behavior of the user to be predicted and the user attribute characteristics of the user to be predicted The location prediction result of the user in the future preset time period.
在一些实施例中,所述用户属性特征可以包括如下一种或者多种:订单数量、是否为商务人士、是否为旅游人士、用户家所在的区域、用户公司所在区域。In some embodiments, the user attribute characteristics may include one or more of the following: the number of orders, whether it is a business person, whether it is a tourist, the area where the user's home is located, and the area where the user company is located.
在一些实施例中,可以采用下述方法训练所述位置预测模型:获取多个样本用户在至少一种历史行为特征下的第一样本特征值,以及各个所述样本用户在历史预设时间段内,在目标时间特征下的第二样本特征值和在目标位置特征下的第三样本特征值;所述样本用户包括正样本用户以及负样本用户;基于各个样本用户的所述第一样本特征值、所述第二样本特征值和所述第三样本特征值,训练所述位置预测模型。In some embodiments, the following method may be used to train the position prediction model: obtaining the first sample feature value of a plurality of sample users under at least one historical behavior characteristic, and the historical preset time of each sample user In the segment, the second sample feature value under the target time feature and the third sample feature value under the target location feature; the sample users include positive sample users and negative sample users; based on the first sample of each sample user This feature value, the second sample feature value, and the third sample feature value are used to train the position prediction model.
在一些实施例中,所述获取多个样本用户在至少一种历史行为特征下的第一样本 特征值,以及各个所述样本用户在历史预设时间段内,在目标时间特征下的第二样本特征值和在目标位置特征下的第三样本特征值,包括:针对多个待预测样本区域中的每个待预测样本区域,获取与该待预测样本区域对应的样本用户在在至少一种历史行为特征下的第一样本特征值,以及各个所述样本用户在历史预设时间段内,在目标时间特征下的第二样本特征值和在目标位置特征下的第三样本特征值;所述基于各个样本用户的所述第一样本特征值、所述第二样本特征值和所述第三样本特征值,训练所述位置预测模型,包括:基于各个所述待预测样本区域对应的样本用户的第一样本特征值、第二样本特征值以及第三样本特征值,训练所述位置预测模型。In some embodiments, the acquiring the first sample feature value of a plurality of sample users under at least one historical behavior characteristic, and the first sample feature value of each of the sample users under the target time feature in the historical preset time period The second sample feature value and the third sample feature value under the target location feature include: for each sample region to be predicted in the plurality of sample regions to be predicted, obtaining a sample user corresponding to the sample region to be predicted in at least one The first sample feature value under a historical behavior feature, and the second sample feature value under the target time feature and the third sample feature value under the target location feature of each of the sample users in the historical preset time period The training of the location prediction model based on the first sample feature value, the second sample feature value, and the third sample feature value of each sample user includes: based on each sample area to be predicted The first sample feature value, the second sample feature value, and the third sample feature value of the corresponding sample user are trained to train the position prediction model.
在一些实施例中,所述基于各个样本用户的所述第一样本特征值,以及所述第二样本特征值,训练所述位置预测模型,包括:基于各个样本用户的所述第一样本特征值、所述第二样本特征值以及所述第三样本特征值,构建多棵子决策树;将多棵所述子决策树确定为所述位置预测模型。In some embodiments, the training of the location prediction model based on the first sample feature value of each sample user and the second sample feature value includes: based on the first sample feature value of each sample user This feature value, the second sample feature value, and the third sample feature value construct multiple sub-decision trees; determine the multiple sub-decision trees as the position prediction model.
在一些实施例中,基于所述基于各个样本用户的所述第一样本特征值、所述第二样本特征值以及所述第三样本特征值,构建多棵子决策树,包括:从所述历史行为特征、所述目标时间特征、所述目标区域特征中,随机确定多个目标特征;基于所述样本用户在所述各个所述目标特征下的特征值,构建当前迭代周期的子决策树;基于当前迭代周期的子决策树,以及历史迭代周期的子决策树,构成当前决策树集,并确定当前决策树集的损失;在所述损失大于预设损失阈值的情况下,完成当前迭代周期,并返回至所述历史行为特征、所述目标时间特征、所述目标区域特征以及所述用户属性特征中,随机确定多个目标特征的步骤;在所述损失不大于预设损失阈值的情况下,将所述当前决策树集确定为所述位置预测模型。In some embodiments, constructing multiple sub-decision trees based on the first sample feature value, the second sample feature value, and the third sample feature value of each sample user includes: Among the historical behavior characteristics, the target time characteristics, and the target area characteristics, multiple target characteristics are randomly determined; based on the characteristic values of the sample users under the target characteristics, a sub-decision tree of the current iteration cycle is constructed ; The sub-decision tree based on the current iteration cycle and the sub-decision tree of the historical iteration cycle constitute the current decision tree set, and determine the loss of the current decision tree set; if the loss is greater than the preset loss threshold, complete the current iteration Cycle, and return to the step of randomly determining multiple target characteristics among the historical behavior characteristics, the target time characteristics, the target area characteristics, and the user attribute characteristics; if the loss is not greater than the preset loss threshold In this case, the current decision tree set is determined as the position prediction model.
在一些实施例中,所述确定当前决策树集的损失,包括:将多个测试样本用户在所述历史行为特征下的第一测试样本特征值,以及各个所述测试样本用户在待预测时间特征以及所述待预测区域特征下的第二测试样本特征值,输入至所述当前决策树集中,获取与每个所述测试样本用户对应的位置预测结果;基于各个所述测试样本用户对应的位置预测结果,以及对应的实际位置,确定当前决策树集的损失。In some embodiments, the determining the loss of the current decision tree set includes: comparing the first test sample feature value of a plurality of test sample users under the historical behavior characteristics, and the time to be predicted for each test sample user The feature and the feature value of the second test sample under the feature of the region to be predicted are input into the current decision tree set to obtain the location prediction result corresponding to each test sample user; based on the corresponding test sample user The location prediction result, and the corresponding actual location, determine the loss of the current decision tree set.
在一些实施例中,该方法还包括:获取各个所述样本用户在多个用户属性特征下的第四样本特征值;In some embodiments, the method further includes: obtaining a fourth sample feature value of each of the sample users under multiple user attribute features;
所述基于各个样本用户的所述第一样本特征值、所述第二样本特征值和所述第三 样本特征值,训练所述位置预测模型,包括:所述基于各个样本用户的所述第一样本特征值、所述第二样本特征值、所述第三样本特征值以及所述第四样本特征值,训练所述位置预测模型。The training of the position prediction model based on the first sample feature value, the second sample feature value, and the third sample feature value of each sample user includes: the The first sample feature value, the second sample feature value, the third sample feature value, and the fourth sample feature value are used to train the position prediction model.
在一些实施例中,将所述第一特征值、所述第二特征值和所述第三特征值输入至预先训练的位置预测模型中,获取所述待预测用户在所述未来预设时间段的位置预测结果,包括:将所述第一特征值、所述第二特征值和所述第三特征值输入至预先训练的位置预测模型中,获取各棵子决策树的子预测结果;将各棵子决策树的子预测结果进行加权求和,确定所述待预测用户在所述未来时间段的位置预测结果。In some embodiments, the first feature value, the second feature value, and the third feature value are input into a pre-trained location prediction model to obtain the user to be predicted at the preset time in the future The location prediction result of the segment includes: inputting the first feature value, the second feature value, and the third feature value into a pre-trained location prediction model to obtain the sub-prediction results of each sub-decision tree; The sub-prediction results of each sub-decision tree are weighted and summed to determine the location prediction result of the user to be predicted in the future time period.
在一些实施例中,与待预测用户历史行为相关联的特征反映待预测用户的一次或以上的历史打车信息。In some embodiments, the feature associated with the historical behavior of the user to be predicted reflects one or more historical ride-hailing information of the user to be predicted.
在一些实施例中,所述历史打车信息包括以下信息中的一种或多种:出发地、目的地、出发地属性、目的地属性以及时间信息。In some embodiments, the historical ride-hailing information includes one or more of the following information: departure place, destination, departure place attributes, destination attributes, and time information.
在一些实施例中,还包括获取所述待预测用户的用户属性特征,以及待预测区域;所述待预测用户在所述未来预设时间段的位置预测结果包括待预测用户在未来预设时间段内去到待预测区域的概率;所述至少基于与待预测用户历史行为相关联的特征,通过预先训练的位置预测模型获取所述待预测用户在所述未来预设时间段的位置预测结果,包括:基于与待预测用户历史行为相关联的特征、所述待预测用户的用户属性特征、所述未来预设时间段以及所述待预测区域,通过预先训练的位置预测模型获取所述待预测用户在所述未来预设时间段的位置预测结果。In some embodiments, it further includes acquiring the user attribute characteristics of the user to be predicted and the area to be predicted; the location prediction result of the user to be predicted in the future preset time period includes the user to be predicted at a preset time in the future The probability of going to the area to be predicted within a segment; said at least based on the features associated with the historical behavior of the user to be predicted, obtaining the location prediction result of the user to be predicted in the future preset time period through a pre-trained location prediction model , Including: based on the features associated with the historical behavior of the user to be predicted, the user attribute feature of the user to be predicted, the future preset time period, and the area to be predicted, obtaining the to-be-predicted location through a pre-trained location prediction model Predict the location prediction result of the user in the future preset time period.
在一些实施例中,所述基于与待预测用户历史行为相关联的特征、所述待预测用户的用户属性特征、所述未来预设时间段以及所述待预测区域,通过预先训练的位置预测模型获取所述待预测用户在所述未来预设时间段的位置预测结果,包括,通过所述位置预测模型:处理所述待预测用户的用户属性特征、所述未来预设时间段以及所述待预测区域获得第一向量表示;将所述用户属性特征分别与各次历史打车信息融合,以获得与各次历史打车信息对应的向量表示;基于attention机制处理所述第一向量表示与各次历史打车信息对应的向量表示,得到第二向量表示;基于第二向量表示确定所述待预测用户在未来预设时间段内去到待预测区域的概率。In some embodiments, the location prediction is based on the features associated with the historical behavior of the user to be predicted, the user attribute features of the user to be predicted, the future preset time period, and the area to be predicted. The model obtains the location prediction result of the user to be predicted in the future preset time period, including, through the location prediction model: processing the user attribute characteristics of the user to be predicted, the future preset time period, and the The region to be predicted obtains the first vector representation; the user attribute characteristics are respectively fused with each historical taxi information to obtain the vector representation corresponding to each historical taxi information; the first vector representation and each time are processed based on the attention mechanism The vector representation corresponding to the historical taxi information obtains the second vector representation; based on the second vector representation, the probability of the user to be predicted going to the area to be predicted in the future preset time period is determined.
根据本申请的第二方面,提供了一种位置预测系统。该位置预测系统包括:至少一个存储介质,包括用于位置预测的一组指令;以及至少一个与所述存储介质通信的处理 器,其中当执行所述一组指令时,所述至少一个处理器用于:获取与待预测用户历史行为相关联的特征;所述特征至少包括与待预测用户历史行为相关的位置信息;至少基于与待预测用户历史行为相关联的特征,通过预先训练的位置预测模型获取所述待预测用户在所述未来预设时间段的位置预测结果。According to the second aspect of the present application, a position prediction system is provided. The position prediction system includes: at least one storage medium including a set of instructions for position prediction; and at least one processor in communication with the storage medium, wherein when the set of instructions is executed, the at least one processor uses于: Obtaining features associated with the historical behavior of the user to be predicted; the features include at least location information related to the historical behavior of the user to be predicted; at least based on the features associated with the historical behavior of the user to be predicted, through a pre-trained location prediction model Obtain the location prediction result of the user to be predicted in the future preset time period.
根据本申请的第三方面,提供了一种位置预测装置。该装置包括:获取模块,用于获取与待预测用户历史行为相关联的特征;所述特征至少包括与待预测用户历史行为相关的位置信息;预测模块,用于至少基于与待预测用户历史行为相关联的特征,通过预先训练的位置预测模型获取所述待预测用户在所述未来预设时间段的位置预测结果。According to the third aspect of the present application, a position prediction device is provided. The device includes: an acquisition module, configured to acquire features associated with the historical behavior of the user to be predicted; the features include at least location information related to the historical behavior of the user to be predicted; For the associated features, the position prediction result of the user to be predicted in the future preset time period is obtained through a pre-trained position prediction model.
根据本申请的第四方面,提供了一种计算机可读存储介质,其特征在于,该计算机可读存储介质上存储有计算机程序,该计算机程序被处理器运行时执行:获取与待预测用户历史行为相关联的特征;所述特征至少包括与待预测用户历史行为相关的位置信息;至少基于与待预测用户历史行为相关联的特征,通过预先训练的位置预测模型获取所述待预测用户在所述未来预设时间段的位置预测结果。According to a fourth aspect of the present application, a computer-readable storage medium is provided, characterized in that a computer program is stored on the computer-readable storage medium, and the computer program is executed when the processor is running: obtaining and predicting user history Behavior-related features; the features include at least location information related to the historical behavior of the user to be predicted; at least based on the features associated with the historical behavior of the user to be predicted, the location of the user to be predicted is acquired through a pre-trained location prediction model Describe the location prediction result for a preset time period in the future.
根据本申请的第五方面,提供了一种位置预测方法。该方法包括:获取待预测用户在至少一种历史行为特征下的第一特征值,以及所述待预测用户在未来预设时间段内,在目标时间特征下的第二特征值和在目标位置特征下的第三特征值;将所述第一特征值、所述第二特征值和所述第三特征值输入至预先训练的位置预测模型中,获取所述待预测用户在所述未来预设时间段的位置预测结果。According to the fifth aspect of the present application, a position prediction method is provided. The method includes: obtaining a first feature value of a user to be predicted under at least one historical behavior feature, and a second feature value of the user to be predicted under a target time feature and a target location within a preset time period in the future. The third feature value under the feature; the first feature value, the second feature value, and the third feature value are input into a pre-trained location prediction model to obtain the future prediction of the user to be predicted Set the position prediction result of the time period.
根据本申请的第六方面,提供了一种位置预测装置。该装置包括:获取模块,用于获取待预测用户在至少一种历史行为特征下的第一特征值,以及所述待预测用户在未来预设时间段内,在目标时间特征下的第二特征值和在目标位置特征下的第三特征值;预测模块,用于将所述第一特征值、所述第二特征值和所述第三特征值输入至预先训练的位置预测模型中,获取所述待预测用户在所述未来预设时间段的位置预测结果。According to the sixth aspect of the present application, a location prediction device is provided. The device includes: an acquisition module for acquiring the first characteristic value of the user to be predicted under at least one historical behavior characteristic, and the second characteristic of the user to be predicted under the target time characteristic in a preset time period in the future Value and the third feature value under the target location feature; a prediction module for inputting the first feature value, the second feature value, and the third feature value into a pre-trained location prediction model to obtain The location prediction result of the user to be predicted in the future preset time period.
附图说明Description of the drawings
为了更清楚地说明本申请实施例的技术方案,下面将对实施例中所需要使用的附图作简单地介绍,应当理解,以下附图仅示出了本申请的某些实施例,因此不应被看作是对范围的限定,对于本领域普通技术人员来讲,在不付出创造性劳动的前提下,还可以根据这些附图获得其他相关的附图。In order to explain the technical solutions of the embodiments of the present application more clearly, the following will briefly introduce the drawings needed in the embodiments. It should be understood that the following drawings only show some embodiments of the present application, and therefore do not It should be regarded as a limitation of the scope. For those of ordinary skill in the art, other related drawings can be obtained based on these drawings without creative work.
图1是根据本申请一些实施例的示例性按需服务系统的示例性在线平台的示意 图;Fig. 1 is a schematic diagram of an exemplary online platform of an exemplary on-demand service system according to some embodiments of the present application;
图2A是根据本申请一些实施例的计算设备的示例性硬件和/或软件组件的示意图;Figure 2A is a schematic diagram of exemplary hardware and/or software components of a computing device according to some embodiments of the present application;
图2B是根据本申请一些实施例的移动设备的示例性硬件和/或软件组件的示意图;2B is a schematic diagram of exemplary hardware and/or software components of a mobile device according to some embodiments of the present application;
图3示出了根据本申请一些实施例的用于预测位置的示例性过程的流程图;FIG. 3 shows a flowchart of an exemplary process for predicting a position according to some embodiments of the present application;
图4示出了根据本申请一些实施例的获取第一历史行为特征下的第一特征值和第二历史行为特征下的第一特征值的过程的流程图;FIG. 4 shows a flowchart of a process of acquiring a first characteristic value under a first historical behavior characteristic and a first characteristic value under a second historical behavior characteristic according to some embodiments of the present application;
图5示出了根据本申请一些实施例的训练位置预测模型的过程的流程图;Fig. 5 shows a flowchart of a process of training a position prediction model according to some embodiments of the present application;
图6示出了根据本申请一些实施例的用于训练位置预测模型的过程的流程图;Fig. 6 shows a flowchart of a process for training a position prediction model according to some embodiments of the present application;
图7A示出了根据本申请一些实施例的用于预测位置的示例性过程的流程图;FIG. 7A shows a flowchart of an exemplary process for predicting a position according to some embodiments of the present application;
图7B示出了根据本申请一些实施例的位置预测模型的预测过程的流程图。Fig. 7B shows a flowchart of a prediction process of a position prediction model according to some embodiments of the present application.
图8示出了根据本申请一些实施例的位置预测装置的示例性框图。Fig. 8 shows an exemplary block diagram of a position prediction apparatus according to some embodiments of the present application.
具体实施方式detailed description
为使本申请实施例的目的、技术方案和优点更加清楚,下面将结合本申请实施例中的附图,对本申请实施例中的技术方案进行清楚、完整地描述,应当理解,本申请中附图仅起到说明和描述的目的,并不用于限定本申请的保护范围。另外,应当理解,示意性的附图并未按实物比例绘制。本申请中使用的流程图示出了根据本申请的一些实施例实现的操作。应该理解,流程图的操作可以不按顺序实现,没有逻辑的上下文关系的步骤可以反转顺序或者同时实施。此外,本领域技术人员在本申请内容的指引下,可以向流程图添加一个或多个其他操作,也可以从流程图中移除一个或多个操作。In order to make the purpose, technical solutions and advantages of the embodiments of this application clearer, the technical solutions in the embodiments of this application will be described clearly and completely in conjunction with the drawings in the embodiments of this application. It should be understood that this application is attached The drawings are only for the purpose of illustration and description, and are not used to limit the protection scope of this application. In addition, it should be understood that the schematic drawings are not drawn to scale. The flowchart used in this application shows operations implemented according to some embodiments of this application. It should be understood that the operations of the flowchart may be implemented out of order, and steps without logical context may be reversed in order or implemented at the same time. In addition, under the guidance of the content of this application, those skilled in the art can add one or more other operations to the flowchart, or remove one or more operations from the flowchart.
另外,所描述的实施例仅仅是本申请一部分实施例,而不是全部的实施例。通常在此处附图中描述和示出的本申请实施例的组件可以以各种不同的配置来布置和设计。因此,以下对在附图中提供的本申请的实施例的详细描述并非旨在限制要求保护的本申请的范围,而是仅仅表示本申请的选定实施例。基于本申请的实施例,本领域技术人员在没有做出创造性劳动的前提下所获得的所有其他实施例,都属于本申请保护的范围。In addition, the described embodiments are only a part of the embodiments of the present application, rather than all the embodiments. The components of the embodiments of the present application generally described and shown in the drawings herein may be arranged and designed in various different configurations. Therefore, the following detailed description of the embodiments of the application provided in the drawings is not intended to limit the scope of the claimed application, but merely represents selected embodiments of the application. Based on the embodiments of the present application, all other embodiments obtained by those skilled in the art without creative work shall fall within the protection scope of the present application.
为了使得本领域技术人员能够使用本申请内容,结合示例性应用场景“网约车出行场景”做相关介绍。对于本领域技术人员来说,在不脱离本申请的精神和范围的情况下,可以将这里定义的一般原理应用于其他实施例和应用场景。虽然本申请主要围绕预测网约 车乘客位置进行描述,但是应该理解,这仅是一个示例性实施例。本申请可以应用于任何其他交通运输类型。例如,本申请可以应用于不同的运输系统环境,包括陆地,海洋,或航空等,或其任意组合。运输系统的交通工具可以包括出租车、私家车、顺风车、公共汽车、火车、子弹头列车、高速铁路、地铁、船只、飞机、宇宙飞船、热气球、或无人驾驶车辆等,或其任意组合。本申请还可以包括用于基于互联网为用户提供服务的任何服务系统,例如,用于发送和/或接收快递的系统、用于买卖双方交易的服务系统。本申请的系统或方法的应用可以包括网页、浏览器的插件、客户端终端、定制系统、内部分析系统、或人工智能机器人等,或其任意组合。In order to enable those skilled in the art to use the content of this application, a related introduction is made in conjunction with an exemplary application scenario "online car-hailing travel scenario". For those skilled in the art, without departing from the spirit and scope of the present application, the general principles defined herein can be applied to other embodiments and application scenarios. Although this application mainly focuses on predicting the location of ride-hailing passengers, it should be understood that this is only an exemplary embodiment. This application can be applied to any other types of transportation. For example, this application can be applied to different transportation system environments, including land, sea, or aviation, etc., or any combination thereof. The means of transportation of the transportation system can include taxis, private cars, downwinds, buses, trains, bullet trains, high-speed railways, subways, ships, airplanes, spacecraft, hot air balloons, or unmanned vehicles, etc., or any of them combination. This application may also include any service system for providing services to users based on the Internet, for example, a system for sending and/or receiving express delivery, and a service system for transactions between buyers and sellers. The application of the system or method of the present application may include web pages, browser plug-ins, client terminals, customized systems, internal analysis systems, or artificial intelligence robots, etc., or any combination thereof.
需要说明的是,本申请实施例中将会用到术语“包括”,用于指出其后所声明的特征的存在,但并不排除增加其它的特征。It should be noted that the term "including" will be used in the embodiments of this application to indicate the existence of the features declared later, but it does not exclude the addition of other features.
本申请涉及一种位置预测方法。该方法可以将与待预测用户历史行为相关联的特征,输入至预先训练的位置预测模型中,对待预测用户在未来预设时间段是否出现在目标位置的结果进行预测,从而可以提前预测用户位置信息,有利于提前进行资源调配,及进行相关服务策略的配置。在一些实例中,与待预测用户历史行为相关联的特征至少包括待预测历史行为相关的位置信息,所述位置信息可以是某一个城市、行政划区、POI、经纬度等。在一些实施例中,与待预测用户历史行为相关联的特征可以包括待预测用户至少一种历史行为特征下的第一特征值,以及待预测用户在未来预设时间段内,在目标时间特征下的第二特征值和在目标位置特征下的第三特征值。在一些实施例中,与待预测用户历史行为相关联的特征还可以是待预测用户的一次或多次历史打车信息。所述历史打车信息可以基于用户的历史打车订单获取。This application relates to a location prediction method. The method can input the features associated with the historical behavior of the user to be predicted into a pre-trained location prediction model, and predict whether the user to be predicted will appear at the target location in a preset time period in the future, so that the user location can be predicted in advance Information is conducive to the deployment of resources in advance and the configuration of related service strategies. In some instances, the features associated with the historical behavior of the user to be predicted include at least location information related to the historical behavior to be predicted, and the location information may be a certain city, administrative division, POI, latitude and longitude, etc. In some embodiments, the feature associated with the historical behavior of the user to be predicted may include the first feature value under at least one historical behavior feature of the user to be predicted, and the feature of the user to be predicted at the target time within a preset time period in the future. The second feature value under the target location feature and the third feature value under the target location feature. In some embodiments, the feature associated with the historical behavior of the user to be predicted may also be one or more historical ride-hailing information of the user to be predicted. The historical ride-hailing information may be obtained based on the user's historical ride-hailing orders.
值得注意的是,用户位置的预测也可以是基于待预测用户的统计量来进行,但是这种方式的预测准确率较低,本申请提供的位置预测方法可以以更高的准确率来对待预测用户的位置进行预测。It is worth noting that the prediction of the user's location can also be based on the statistics of the user to be predicted, but the prediction accuracy of this method is low, and the location prediction method provided in this application can treat the prediction with higher accuracy. The user’s location is predicted.
图1是根据本申请一些实施例的示例性按需服务系统100的示例性在线平台的示意图。按需服务系统100可以是在网络环境中实现的在线运输服务平台,其具有用于提供运输服务的定位系统。按需服务系统100可以提供至少两种服务。示例性按需服务可以包括小型客车拼车服务、中型客车拼车服务、出租车服务、司机服务、快车服务、公共汽车服务、司机租用服务和班车服务。按需服务系统100可以包括服务器110、网络120、服务请求者终端130、服务提供者终端140、车辆150、存储器160和定位设备170。Fig. 1 is a schematic diagram of an exemplary online platform of an exemplary on-demand service system 100 according to some embodiments of the present application. The on-demand service system 100 may be an online transportation service platform implemented in a network environment, which has a positioning system for providing transportation services. The on-demand service system 100 can provide at least two services. Exemplary on-demand services may include small passenger car sharing services, medium passenger car sharing services, taxi services, driver services, express services, bus services, driver hire services, and shuttle services. The on-demand service system 100 may include a server 110, a network 120, a service requester terminal 130, a service provider terminal 140, a vehicle 150, a storage 160, and a positioning device 170.
服务器110可以是计算机服务器。服务器110可以与服务请求者终端130和/或服务提供者终端140通信,以提供在线按需服务的各种功能。在一些实施例中,服务器110可以是单个服务器或服务器组。服务器组可以是经由接入点连接到网络120的集中式服务器组,或者经由一个或一个以上的接入点分别连接到网络120的分布式服务器组。在一些实施例中,服务器110可以本地连接到网络120或者与网络120远程连接。例如,服务器110可以经由网络120访问存储在服务请求者终端130、服务提供者终端140和/或存储器160中的信息和/或数据。又如例如,存储器160可以用作服务器110的后端存储器。在一些实施例中,服务器110可以在云平台上实现。仅作为示例,云平台可以包括私有云、公共云、混合云、社区云、分布式云、中间云、多重云等,或其任意组合。The server 110 may be a computer server. The server 110 may communicate with the service requester terminal 130 and/or the service provider terminal 140 to provide various functions of online on-demand services. In some embodiments, the server 110 may be a single server or a group of servers. The server group may be a centralized server group connected to the network 120 via an access point, or a distributed server group connected to the network 120 via one or more access points. In some embodiments, the server 110 may be locally connected to the network 120 or remotely connected to the network 120. For example, the server 110 may access information and/or data stored in the service requester terminal 130, the service provider terminal 140, and/or the storage 160 via the network 120. As another example, the storage 160 may be used as a back-end storage of the server 110. In some embodiments, the server 110 may be implemented on a cloud platform. For example only, the cloud platform may include private clouds, public clouds, hybrid clouds, community clouds, distributed clouds, intermediate clouds, multiple clouds, etc., or any combination thereof.
在一些实施例中,服务器110可以包括处理设备112。处理设备112可以处理与执行本申请中描述的一个或一个以上的功能有关的信息和/或数据。例如,处理设备112可以获取用户的历史行为信息。处理设备112可以根据历史行为信息确定历史行为特征下的第一特征值、第二特征值以及第三特征值。又例如,处理设备112可以训练位置预测模型。处理设备112可以将历史行为特征下的第一特征值、第二特征值以及第三特征值,输入位置预测模型,预测模型可以输出待预测用户在未来预设时间段的位置。在一些实施例中,处理设备112可以包括一个或一个以上的处理单元(例如,单核处理引擎或多核处理引擎)。仅作为示例,处理设备112可以包括中央处理单元(CPU)、专用集成电路(ASIC)、专用指令集处理器(ASIP)、图形处理单元(GPU)、物理处理单元(PPU)、数字信号处理器(DSP)、现场可编程门阵列(FPGA)、可编程逻辑器件(PLD)、控制器、微控制器单元、精简指令集计算机(RISC)、微处理器等,或其任意组合。In some embodiments, the server 110 may include a processing device 112. The processing device 112 may process information and/or data related to performing one or more functions described in this application. For example, the processing device 112 may obtain historical behavior information of the user. The processing device 112 may determine the first characteristic value, the second characteristic value, and the third characteristic value under the historical behavior characteristic according to the historical behavior information. For another example, the processing device 112 may train a position prediction model. The processing device 112 may input the first characteristic value, the second characteristic value, and the third characteristic value under the historical behavior characteristic into the position prediction model, and the prediction model may output the position of the user to be predicted in a preset time period in the future. In some embodiments, the processing device 112 may include one or more processing units (for example, a single-core processing engine or a multi-core processing engine). For example only, the processing device 112 may include a central processing unit (CPU), an application specific integrated circuit (ASIC), an application specific instruction set processor (ASIP), a graphics processing unit (GPU), a physical processing unit (PPU), and a digital signal processor. (DSP), field programmable gate array (FPGA), programmable logic device (PLD), controller, microcontroller unit, reduced instruction set computer (RISC), microprocessor, etc., or any combination thereof.
网络120可以促进信息和/或数据的交换。在一些实施例中,按需服务系统100中的一个或一个以上的组件(例如,服务器110、服务请求者终端130、服务提供者终端140、车辆150、存储器160)可以通过网络120向按需服务系统100中的其他组件发送信息和/或数据。例如,服务器110可以经由网络120从存储器160访问和/或获得至少两个历史行为信息。在一些实施例中,网络120可以是任何类型的有线或无线网络或其组合。仅作为示例,网络120可以包括电缆网络、有线网络、光纤网络、电信网络、内联网、因特网、局域网(LAN)、广域网(WAN)、无线局域网(WLAN)、城域网(MAN)、公共电话交换网(PSTN)、蓝牙网络、ZigBee网络、近场通信(NFC)网络等,或其任意组合。在一些实施例中,网络120可以包括一个或一个以上的网络接入点。例如,网络120 可以包括有线或无线网络接入点,例如基站和/或互联网交换点120-1、120-2等。按需服务系统100的一个或一个以上的组件可以通过网络接入点连接到网络120以交换数据和/或信息。The network 120 may facilitate the exchange of information and/or data. In some embodiments, one or more components of the on-demand service system 100 (for example, the server 110, the service requester terminal 130, the service provider terminal 140, the vehicle 150, and the storage 160) can be sent to the on-demand service system 100 via the network 120. Other components in the service system 100 send information and/or data. For example, the server 110 may access and/or obtain at least two pieces of historical behavior information from the storage 160 via the network 120. In some embodiments, the network 120 may be any type of wired or wireless network or combination thereof. For example only, the network 120 may include a cable network, a wired network, an optical fiber network, a telecommunications network, an intranet, the Internet, a local area network (LAN), a wide area network (WAN), a wireless local area network (WLAN), a metropolitan area network (MAN), and a public telephone. Switched Network (PSTN), Bluetooth network, ZigBee network, Near Field Communication (NFC) network, etc., or any combination thereof. In some embodiments, the network 120 may include one or more network access points. For example, the network 120 may include wired or wireless network access points, such as base stations and/or Internet exchange points 120-1, 120-2, and so on. One or more components of the on-demand service system 100 may be connected to the network 120 through a network access point to exchange data and/or information.
在一些实施例中,乘客或用户可以是服务请求者终端130的持有者。在一些实施例中,服务请求者终端130的持有者可以是除乘客之外的其他人。例如,服务请求者终端130的持有者A可以使用服务请求者终端130为乘客B发送服务请求,和/或从服务器110接收服务确认和/或信息或指令。在一些实施例中,司机可以是服务提供者终端140的用户。在一些实施例中,服务提供者终端140的用户可以是除司机之外的其他人。例如,服务提供者终端140的用户C可以使用服务提供者终端140为司机D接收服务请求,和/或来自服务器110的信息或指令。在一些实施例中,可以指定司机至少使用服务提供者终端140中的一个和/或车辆150种的一个一段时间,例如,一天、一周、一个月或一年等。在一些其他实施例中,可以指定司机随机地使用服务提供者终端140中的一个和/或车辆150中的一个。例如,当司机可用于提供按需服务时,可以指派他/她使用接收最早请求的司机终端和推荐执行按需服务的车辆。在一些实施例中,“乘客”和“终端设备”可以互换使用,“司机”和“司机设备”可以互换使用。在一些实施例中,司机设备可以与一个或一个以上的司机(例如,夜班司机、白班司机或随机移动的司机)相关联。In some embodiments, the passenger or user may be the holder of the service requester terminal 130. In some embodiments, the holder of the service requester terminal 130 may be someone other than the passenger. For example, the holder A of the service requester terminal 130 may use the service requester terminal 130 to send a service request for the passenger B, and/or receive service confirmation and/or information or instructions from the server 110. In some embodiments, the driver may be a user of the service provider terminal 140. In some embodiments, the user of the service provider terminal 140 may be someone other than the driver. For example, the user C of the service provider terminal 140 may use the service provider terminal 140 to receive service requests for the driver D, and/or information or instructions from the server 110. In some embodiments, the driver may be designated to use at least one of the service provider terminals 140 and/or 150 types of vehicles for a period of time, for example, one day, one week, one month, or one year. In some other embodiments, the driver may be designated to randomly use one of the service provider terminals 140 and/or one of the vehicles 150. For example, when a driver is available to provide on-demand services, he/she can be assigned to use the driver terminal that receives the earliest request and the vehicle that recommends the on-demand service. In some embodiments, "passenger" and "terminal device" can be used interchangeably, and "driver" and "driver device" can be used interchangeably. In some embodiments, the driver device may be associated with one or more drivers (e.g., night shift drivers, day shift drivers, or randomly moving drivers).
在一些实施例中,服务请求者终端130可以包括移动设备130-1、平板电脑130-2、膝上型计算机130-3、车辆130-4中的内置设备等,或其任意组合。在一些实施例中,移动设备130-1可以包括智能家居设备、可穿戴设备、智能移动设备、虚拟现实设备、增强现实设备等,或其任意组合。在一些实施例中,智能家居设备可以包括智能照明设备、智能电气设备的控制设备、智能监控设备、智能电视、智能摄像机、对讲机等,或其任意组合。在一些实施例中,可穿戴设备可包括智能手环、智能鞋袜、智能眼镜、智能头盔、智能手表、智能服装、智能背包、智能配件等,或其任意组合。在一些实施例中,智能移动设备可以包括智能手机、个人数字助理(PDA)、游戏设备、导航设备、销售点(POS)设备等,或其任意组合。在一些实施例中,虚拟现实设备和/或增强现实设备可以包括虚拟现实头盔、虚拟现实眼镜、虚拟现实眼罩、增强现实头盔、增强现实眼镜、增强现实眼罩等,或其任意组合。例如,虚拟现实设备和/或增强现实设备可以包括Google Glass TM、Oculus Rift TM、Hololens TM、Gear VR TM等。在一些实施例中,车辆130-4中的内置设备可以包括内置计算机、板载内置电视、内置平板电脑等。在一些实施例中,服务请求者终 端130可以包括信号发送器和信号接收器,信号接收器配置成与定位设备170通信以定位乘客和/或服务请求者终端130的位置。 In some embodiments, the service requester terminal 130 may include a mobile device 130-1, a tablet computer 130-2, a laptop computer 130-3, a built-in device in a vehicle 130-4, etc., or any combination thereof. In some embodiments, the mobile device 130-1 may include a smart home device, a wearable device, a smart mobile device, a virtual reality device, an augmented reality device, etc., or any combination thereof. In some embodiments, smart home devices may include smart lighting devices, control devices of smart electrical devices, smart monitoring devices, smart TVs, smart cameras, walkie-talkies, etc., or any combination thereof. In some embodiments, the wearable device may include a smart bracelet, smart footwear, smart glasses, smart helmets, smart watches, smart clothing, smart backpacks, smart accessories, etc., or any combination thereof. In some embodiments, smart mobile devices may include smart phones, personal digital assistants (PDAs), gaming devices, navigation devices, point of sale (POS) devices, etc., or any combination thereof. In some embodiments, the virtual reality device and/or the augmented reality device may include a virtual reality helmet, virtual reality glasses, virtual reality goggles, augmented reality helmets, augmented reality glasses, augmented reality goggles, etc., or any combination thereof. For example, the virtual reality device and/or the augmented reality device may include Google Glass TM , Oculus Rift TM , Hololens TM , Gear VR TM and the like. In some embodiments, the built-in devices in the vehicle 130-4 may include a built-in computer, an on-board built-in TV, a built-in tablet computer, and the like. In some embodiments, the service requester terminal 130 may include a signal transmitter and a signal receiver, and the signal receiver is configured to communicate with the positioning device 170 to locate the position of the passenger and/or the service requester terminal 130.
服务提供者终端140可包括至少两个服务提供者终端140-1、140-2、……、140-n。在一些实施例中,服务提供者终端140可以与服务请求者终端130类似或相同。在一些实施例中,可以定制服务提供者终端140以实现在线运输服务。在一些实施例中,服务提供者终端140和服务请求者终端130可配置有信号发射器和信号接收器,以从定位设备170接收服务提供者终端140和服务请求者终端130的位置信息。在一些实施例中,服务请求者终端130和/或服务提供者终端140可以与其他定位设备通信以确定乘客、服务请求者终端130、司机和/或服务提供者终端140的位置。在一些实施例中,服务请求者终端130和/或服务提供者终端140可以周期性地将定位信息发送到服务器110。在一些实施例中,服务提供者终端140还可以周期性地向服务器110发送可用性状态。可用性状态可以指示与服务提供者终端140相关联的车辆150是否可用于运送乘客。例如,服务请求者终端130可以每30分钟将定位信息发送到服务器110。又例如,服务提供者终端140可以每30分钟和/或在完成按需服务时向服务器发送可用性状态。又例如,服务请求者终端130可以在每次用户登录到与在线按需服务相关联的移动应用程序时将定位信息发送到服务器110。The service provider terminal 140 may include at least two service provider terminals 140-1, 140-2, ..., 140-n. In some embodiments, the service provider terminal 140 may be similar to or the same as the service requester terminal 130. In some embodiments, the service provider terminal 140 can be customized to implement online transportation services. In some embodiments, the service provider terminal 140 and the service requester terminal 130 may be configured with a signal transmitter and a signal receiver to receive location information of the service provider terminal 140 and the service requester terminal 130 from the positioning device 170. In some embodiments, the service requester terminal 130 and/or the service provider terminal 140 may communicate with other positioning devices to determine the location of the passenger, the service requester terminal 130, the driver, and/or the service provider terminal 140. In some embodiments, the service requester terminal 130 and/or the service provider terminal 140 may periodically send the positioning information to the server 110. In some embodiments, the service provider terminal 140 may also send the availability status to the server 110 periodically. The availability status may indicate whether the vehicle 150 associated with the service provider terminal 140 is available to transport passengers. For example, the service requester terminal 130 may send positioning information to the server 110 every 30 minutes. For another example, the service provider terminal 140 may send the availability status to the server every 30 minutes and/or when the on-demand service is completed. For another example, the service requester terminal 130 may send location information to the server 110 every time a user logs in to a mobile application associated with an online on-demand service.
在一些实施例中,服务提供者终端140可以对应于一个或一个以上的车辆150。车辆150可以携带乘客并前往目的地。车辆150可包括至少两个车辆150-1、150-2、……、150-n。至少两个车辆中的一个可以对应于一种订单类型。订单类型可以包括出租车订单、豪华车订单、快车订单、公交车订单、班车订单等。在一些实施例中,服务可以是任何在线服务,例如订餐、购物等,或其组合。In some embodiments, the service provider terminal 140 may correspond to one or more vehicles 150. The vehicle 150 can carry passengers and travel to the destination. The vehicle 150 may include at least two vehicles 150-1, 150-2,..., 150-n. One of at least two vehicles may correspond to one order type. Order types can include taxi orders, luxury car orders, express orders, bus orders, shuttle orders, etc. In some embodiments, the service may be any online service, such as ordering food, shopping, etc., or a combination thereof.
存储器160可以存储数据和/或指令。在一些实施例中,存储器160可以存储从服务请求者终端130和/或服务提供者终端140获得的数据。例如,存储器160可以存储与服务请求者终端130相关联的日志信息。例如,存储器可以存储用户通过服务请求者终端130发生的冒泡行为、发单行为、完单行为等。在一些实施例中,存储器160可以存储服务器110可以执行的数据和/或指令,以提供本申请中描述的按需服务。在一些实施例中,存储器160可以包括大容量存储器、可移动存储器、易失性读写存储器、只读存储器(ROM)等,或其任意组合。示例性大容量存储器可以包括磁盘、光盘、固态驱动器等。示例性可移动存储器可以包括闪存驱动器、软盘、光盘、存储卡、压缩盘、磁带等。示例 性易失性读写存储器可以包括随机存取存储器(RAM)。示例性的RAM可以包括动态RAM(DRAM)、双倍数据速率同步动态RAM(DDR SDRAM)、静态RAM(SRAM)、晶闸管RAM(T-RAM)和零电容器RAM(Z-RAM)等。示例性的ROM可以包括掩模ROM(MROM)、可编程ROM(PROM)、可擦除可编程ROM(EPROM)、电可擦除可编程ROM(EEPROM)、光盘ROM(CD-ROM)和数字多功能盘ROM等。在一些实施例中,存储器160可以在云平台上实现。仅作为示例,云平台可以包括私有云、公共云、混合云、社区云、分布式云、中间云、多重云等,或其任意组合。The memory 160 may store data and/or instructions. In some embodiments, the memory 160 may store data obtained from the service requester terminal 130 and/or the service provider terminal 140. For example, the memory 160 may store log information associated with the service requester terminal 130. For example, the memory may store bubbling behaviors, billing behaviors, bill-finishing behaviors, etc. that occur through the service requester terminal 130 by the user. In some embodiments, the memory 160 may store data and/or instructions that can be executed by the server 110 to provide the on-demand service described in this application. In some embodiments, the memory 160 may include mass memory, removable memory, volatile read-write memory, read-only memory (ROM), etc., or any combination thereof. Exemplary mass storage devices may include magnetic disks, optical disks, solid state drives, and the like. Exemplary removable storage may include flash drives, floppy disks, optical disks, memory cards, compact disks, magnetic tapes, and the like. An exemplary volatile read-write memory may include random access memory (RAM). Exemplary RAM may include dynamic RAM (DRAM), double data rate synchronous dynamic RAM (DDR SDRAM), static RAM (SRAM), thyristor RAM (T-RAM), zero capacitor RAM (Z-RAM), and the like. Exemplary ROMs may include mask ROM (MROM), programmable ROM (PROM), erasable programmable ROM (EPROM), electrically erasable programmable ROM (EEPROM), compact disk ROM (CD-ROM), and digital Multi-function disk ROM, etc. In some embodiments, the storage 160 may be implemented on a cloud platform. For example only, the cloud platform may include private clouds, public clouds, hybrid clouds, community clouds, distributed clouds, intermediate clouds, multiple clouds, etc., or any combination thereof.
定位设备170可以确定与对象相关联的信息,例如,服务请求者终端130、服务提供者终端140、车辆150等中的一个或一个以上。例如,定位设备170可以通过服务请求者终端130确定乘客或司机的当前时间和位置。在一些实施例中,定位设备170可以是全球定位系统(GPS)、全球导航卫星系统(GLONASS)、罗盘导航系统(COMPASS)、北斗导航卫星系统、伽利略定位系统、准天顶卫星系统(QZSS)等。该信息可包括物体的位置、高度、速度或加速度,和/或当前时间。该位置可以是坐标的形式,例如纬度坐标和经度坐标等。定位设备170可以包括一个或一个以上的卫星,例如,卫星170-1、卫星170-2、以及卫星170-3。卫星170-1至170-3可以独立地或共同地确定上述信息。定位系统170可以经由网络120将上述信息发送到服务请求者终端130、服务提供者终端140或车辆150。The positioning device 170 may determine information associated with the object, for example, one or more of the service requester terminal 130, the service provider terminal 140, the vehicle 150, and the like. For example, the positioning device 170 may determine the current time and location of the passenger or driver through the service requester terminal 130. In some embodiments, the positioning device 170 may be Global Positioning System (GPS), Global Navigation Satellite System (GLONASS), Compass Navigation System (COMPASS), Beidou Navigation Satellite System, Galileo Positioning System, Quasi-Zenith Satellite System (QZSS) Wait. This information may include the position, height, speed or acceleration of the object, and/or the current time. The position can be in the form of coordinates, such as latitude coordinates and longitude coordinates. The positioning device 170 may include one or more satellites, for example, a satellite 170-1, a satellite 170-2, and a satellite 170-3. The satellites 170-1 to 170-3 can determine the above-mentioned information independently or collectively. The positioning system 170 may transmit the above-mentioned information to the service requester terminal 130, the service provider terminal 140, or the vehicle 150 via the network 120.
在一些实施例中,按需服务系统100中的一个或一个以上的组件可以经由网络120访问存储在存储器160中的数据或指令。在一些实施例中,存储器160可以作为后端存储器直接连接到服务器110。In some embodiments, one or more components in the on-demand service system 100 can access data or instructions stored in the memory 160 via the network 120. In some embodiments, the storage 160 may be directly connected to the server 110 as a back-end storage.
在一些实施例中,按需服务系统100中的一个或一个以上的组件(例如,服务器110、服务请求者终端130、服务提供者终端140等)可以具有访问存储器160的许可。在一些实施例中,当满足一个或一个以上的条件时,按需服务系统100中的一个或一个以上的组件可以读取和/或修改与乘客、司机和/或车辆有关的信息。例如,服务器110可以在完成按需服务订单之后读取和/或修改一个或一个以上的乘客的用户特征。In some embodiments, one or more components in the on-demand service system 100 (for example, the server 110, the service requester terminal 130, the service provider terminal 140, etc.) may have permission to access the memory 160. In some embodiments, when one or more conditions are met, one or more components in the on-demand service system 100 can read and/or modify information related to passengers, drivers, and/or vehicles. For example, the server 110 may read and/or modify the user characteristics of one or more passengers after completing the on-demand service order.
在一些实施例中,可以通过在终端设备上启动按需服务的移动应用程序以请求服务来发起在按需服务系统100的一个或一个以上的组件之间的信息交换。服务请求的对象可以是任何产品。在一些实施方案中,产品可包括食品、药品、商品、化学产品、电器、衣服、汽车、家居、奢侈品等,或其任意组合。在一些其他实施例中,产品可以包括服务 产品、金融产品、知识产品、互联网产品等,或其任意组合。互联网产品可以包括个人主机产品、网络产品、移动互联网产品、商业主机产品、嵌入式产品等,或其任意组合。移动互联网产品可以用在移动终端、程序、系统等或其任意组合的软件中。移动终端可以包括平板电脑、膝上型计算机、移动电话、个人数字助理(PDA)、智能手表、销售点(POS)设备、车载计算机、车载电视、可穿戴设备等,或其任意组合。例如,产品可以是计算机或移动电话中使用的任何软件和/或应用程序。该软件和/或应用程序可涉及社交、购物、运输、娱乐、学习、投资等,或其任意组合。在一些实施例中,与运输有关的软件和/或应用程序可以包括旅行软件和/或应用程序、车辆调度软件和/或应用程序、映射软件和/或应用程序等。在车辆调度软件和/或应用中,车辆可以包括马、马车、人力车(例如独轮车、自行车、三轮车等)、汽车(例如,出租车、公共汽车、私家车等)、火车、地铁、船只、航空器(例如,飞机、直升机、航天飞机、火箭、热气球等)等,或其任意组合。In some embodiments, the information exchange between one or more components of the on-demand service system 100 can be initiated by launching a mobile application of the on-demand service on the terminal device to request the service. The object of the service request can be any product. In some embodiments, the product may include food, medicine, commodities, chemical products, electrical appliances, clothes, automobiles, household goods, luxury goods, etc., or any combination thereof. In some other embodiments, products may include service products, financial products, knowledge products, Internet products, etc., or any combination thereof. Internet products may include personal host products, network products, mobile Internet products, commercial host products, embedded products, etc., or any combination thereof. Mobile Internet products can be used in mobile terminals, programs, systems, etc. or any combination of software. The mobile terminal may include a tablet computer, a laptop computer, a mobile phone, a personal digital assistant (PDA), a smart watch, a point of sale (POS) device, a vehicle-mounted computer, a vehicle-mounted TV, a wearable device, etc., or any combination thereof. For example, the product can be any software and/or application used in a computer or mobile phone. The software and/or application program may involve social networking, shopping, transportation, entertainment, learning, investment, etc., or any combination thereof. In some embodiments, transportation-related software and/or applications may include travel software and/or applications, vehicle scheduling software and/or applications, mapping software and/or applications, and the like. In vehicle scheduling software and/or applications, vehicles can include horses, carriages, rickshaws (for example, unicycles, bicycles, tricycles, etc.), automobiles (for example, taxis, buses, private cars, etc.), trains, subways, boats, and aircraft (For example, airplane, helicopter, space shuttle, rocket, hot air balloon, etc.), etc., or any combination thereof.
本领域普通技术人员将理解,当按需服务系统100的元件运行时,该元件可以通过电信号和/或电磁信号运行。例如,当服务请求者终端130处理诸如发送订单的任务时,服务请求者终端130可以在其处理器中操作逻辑电路以处理这样的任务。当服务请求者终端130向服务器110发出订单时,服务请求者终端130的处理器可以生成编码订单的电信号。然后,服务请求者终端130的处理器可以将电信号发送到输出端口。如果服务请求者终端130经由有线网络与服务器110通信,则输出端口可以物理地连接到电缆,电缆还将电信号发送到服务器110的输入端口。如果服务请求者终端130经由无线网络与服务器110通信,则终端130的输出端口可以是一个或一个以上的天线,其将电信号转换为电磁信号。类似地,服务提供者终端140可以通过其处理器中的逻辑电路的操作来处理任务,并且经由电信号或电磁信号从服务器110接收指令和/或服务命令。在诸如服务请求者终端130、服务提供者终端140和/或服务器110的电子设备内,当其处理器处理指令、发出指令,和/或执行动作时,指令和/或动作通过电信号进行。例如,当处理器从存储介质(例如,存储器160)检索数据(例如,用户的历史订单信息、用户的冒泡行为信息等)时,它可以将电信号发送到存储介质的读取设备,其可以读取存储介质中的结构化数据。结构化数据可以经由电子设备的总线以电信号的形式发送到处理器。这里,电信号可以指一个电信号、一系列电信号和/或至少两个离散电信号。A person of ordinary skill in the art will understand that when an element of the on-demand service system 100 operates, the element may operate through an electrical signal and/or an electromagnetic signal. For example, when the service requester terminal 130 processes a task such as sending an order, the service requester terminal 130 may operate a logic circuit in its processor to process such a task. When the service requester terminal 130 sends an order to the server 110, the processor of the service requester terminal 130 may generate an electrical signal encoding the order. Then, the processor of the service requester terminal 130 may transmit the electric signal to the output port. If the service requester terminal 130 communicates with the server 110 via a wired network, the output port may be physically connected to a cable, and the cable also transmits an electric signal to the input port of the server 110. If the service requester terminal 130 communicates with the server 110 via a wireless network, the output port of the terminal 130 may be one or more antennas, which convert electrical signals into electromagnetic signals. Similarly, the service provider terminal 140 may process tasks through the operation of logic circuits in its processor, and receive instructions and/or service commands from the server 110 via electrical or electromagnetic signals. In electronic devices such as the service requester terminal 130, the service provider terminal 140, and/or the server 110, when the processor processes instructions, issues instructions, and/or executes actions, the instructions and/or actions are performed through electrical signals. For example, when the processor retrieves data (for example, the user's historical order information, the user's bubbling behavior information, etc.) from the storage medium (for example, the memory 160), it may send an electrical signal to the reading device of the storage medium, which Can read structured data in storage media. The structured data can be sent to the processor in the form of electrical signals via the bus of the electronic device. Here, the electrical signal may refer to one electrical signal, a series of electrical signals, and/or at least two discrete electrical signals.
图2A是根据本申请一些实施例的计算设备200A的示例性硬件和/或软件组件的示意图。FIG. 2A is a schematic diagram of exemplary hardware and/or software components of a computing device 200A according to some embodiments of the present application.
计算设备200A可以是通用计算机或专用计算机。两者都可以用于实现本申请的功能。计算设备200A可以用于实现如本文所述的服务的任何组件。例如,服务器的处理设备112可以通过其硬件、软件程序、固件或其组合在计算设备200A上实现。尽管为了方便仅示出了一个这样的计算机,但是这里描述的与服务相关的计算机功能可以在至少两个类似平台上以分布式方式实现以分配处理负载。The computing device 200A may be a general-purpose computer or a special-purpose computer. Both can be used to realize the functions of this application. The computing device 200A can be used to implement any component of the service as described herein. For example, the processing device 112 of the server may be implemented on the computing device 200A through its hardware, software program, firmware, or a combination thereof. Although only one such computer is shown for convenience, the service-related computer functions described herein can be implemented in a distributed manner on at least two similar platforms to distribute the processing load.
例如,计算设备200A可以包括连接到与其连接的网络(例如,网络120)的通信端口250a,以促进数据通信。计算设备200A还可以包括一个或一个以上的处理器220a,用于执行程序指令。示例性计算机平台可以包括内部通信总线210a、由计算机处理和/或传输的各种数据文件的不同形式的程序存储器和数据存储器,例如,磁盘270a、只读存储器(ROM)230a,或随机存储器(RAM)240a。示例性计算机平台还可以包括存储在ROM 230a、RAM 240a和/或将由CPU 220a执行的其他类型的非暂时性存储介质中的程序指令。本申请的方法和/或过程可以实现为程序指令。计算设备200A还包括输入/输出(I/O)组件260a,其支持计算机、用户和在此的其他组件之间的输入/输出。计算设备200A也可以通过网络通信接收程序和数据。For example, the computing device 200A may include a communication port 250a connected to a network (e.g., network 120) connected to it to facilitate data communication. The computing device 200A may also include one or more processors 220a for executing program instructions. An exemplary computer platform may include an internal communication bus 210a, various forms of program memory and data storage for various data files processed and/or transmitted by the computer, such as a disk 270a, a read only memory (ROM) 230a, or a random access memory ( RAM) 240a. The exemplary computer platform may also include program instructions stored in ROM 230a, RAM 240a, and/or other types of non-transitory storage media to be executed by CPU 220a. The methods and/or processes of the present application can be implemented as program instructions. The computing device 200A also includes an input/output (I/O) component 260a, which supports input/output between the computer, the user, and other components herein. The computing device 200A can also receive programs and data through network communication.
仅仅为了说明,在计算设备200A中仅描述了一个CPU和/或处理器。然而,应该注意的是,本申请中的计算设备200A还可以包括多个CPU和/或处理器,因此由本申请中描述的一个CPU和/或处理器执行的操作和/或方法步骤也可以由多个CPU和/或处理器联合或单独执行。例如,计算设备200A的CPU和/或处理器可以执行步骤A和步骤B。如在另一个示例中,步骤A和步骤B也可以由计算设备200A中的两个不同的CPU和/或处理器联合或分开执行(例如,第一处理器执行步骤A,第二处理器执行步骤B,或者第一个和第二处理器共同执行步骤A和B)。For illustration only, only one CPU and/or processor is described in the computing device 200A. However, it should be noted that the computing device 200A in this application may also include multiple CPUs and/or processors, so the operations and/or method steps performed by one CPU and/or processor described in this application may also be Multiple CPUs and/or processors execute jointly or individually. For example, the CPU and/or the processor of the computing device 200A may perform step A and step B. As in another example, step A and step B can also be performed jointly or separately by two different CPUs and/or processors in the computing device 200A (for example, the first processor performs step A and the second processor performs Step B, or the first and second processors jointly perform steps A and B).
图2B是根据本申请一些实施例的移动设备200B的示例性硬件和/或软件组件的示意图。FIG. 2B is a schematic diagram of exemplary hardware and/or software components of a mobile device 200B according to some embodiments of the present application.
如图2B所示,移动设备200B可以包括通信模块210b、显示器220b、图形处理单元(GPU)230b、中央处理单元(CPU)240b、输入/输出(I/O)250b、内存260b和存储器290b。在一些实施例中,任何其他合适的组件,包括但不限于系统总线或控制器(未示出),也可以包括在移动设备200B中。在一些实施例中,移动操作系统270b(例如,iOS TM、Android TM、Windows Phone TM等)和一个或一个以上的应用程序280b可以从存储器290b加载到内存260b中,以便由CPU 240b执行。应用程序280b可以包括 浏览器或任何其他合适的移动应用程序,用于从处理设备112和/或存储器160发送、接收和呈现与按需服务(例如,发送约车订单)有关的信息。用户与信息流的交互可以通过I/O 250b实现,并通过网络120提供给处理设备112和/或按需服务系统100的其他组件。 As shown in FIG. 2B, the mobile device 200B may include a communication module 210b, a display 220b, a graphics processing unit (GPU) 230b, a central processing unit (CPU) 240b, an input/output (I/O) 250b, a memory 260b, and a memory 290b. In some embodiments, any other suitable components, including but not limited to a system bus or a controller (not shown), may also be included in the mobile device 200B. In some embodiments, the mobile operating system 270b (eg, iOS , Android , Windows Phone ™, etc.) and one or more application programs 280b may be loaded from the memory 290b into the memory 260b so as to be executed by the CPU 240b. The application program 280b may include a browser or any other suitable mobile application program for sending, receiving, and presenting information related to on-demand services (for example, sending car-hailing orders) from the processing device 112 and/or the memory 160. The interaction between the user and the information flow can be implemented through the I/O 250b, and provided to the processing device 112 and/or other components of the on-demand service system 100 through the network 120.
图3示出了根据本申请一些实施例的用于预测位置的示例性过程的流程图。过程300可以包括步骤S301~S302。在一些实施例中,过程300可以由处理设备112执行。例如,过程300可以以存储在存储器160中的一组指令(例如,应用程序)实现。服务器110、CPU 220a和/或CPU 240b可以执行该组指令,因此可以指示执行该过程300。Fig. 3 shows a flowchart of an exemplary process for predicting a position according to some embodiments of the present application. The process 300 may include steps S301 to S302. In some embodiments, the process 300 may be performed by the processing device 112. For example, the process 300 may be implemented with a set of instructions (for example, an application program) stored in the memory 160. The server 110, the CPU 220a, and/or the CPU 240b can execute the set of instructions, and thus can instruct to execute the process 300.
S301:获取待预测用户在至少一种历史行为特征下的第一特征值,以及所述待预测用户在未来预设时间段内,在目标时间特征下的第二特征值和在目标位置特征下的第三特征值。S301: Obtain the first feature value of the user to be predicted under at least one historical behavior feature, and the second feature value of the user to be predicted under the target time feature and the second feature value under the target location feature within a preset time period in the future. The third characteristic value.
在一些实施例中,待预测用户可以是在线运输服务平台(例如,网约车平台)的全部用户。在一些实施例中,由于网约车的用户数量非常庞大,一般是以千万甚至亿计的,要针对每个用户都确定其位置,计算量是非常庞大的,因此为了更有针对性,也可以是基于一定的条件从在线运输服务平台的全部用户中筛选出来的用户。例如,可以周期性计算各个用户在冒泡时的所在位置,如果任一用户在周期内的冒泡位置归属于至少两个城市或者预设的区域,则将该用户确定为待预测用户。又例如,可以周期性计算各个用户在发出订单的出发地,如果任一用户在周期内的出发地归属于至少两个城市或者归属于至少两个预设的区域,则将该用户确定为待预测用户。再例如,可以确定阈值时间段内其历史订单数量大于阈值订单数量的用户为待预测用户。阈值时间段可以为过去一天、一周、一个月、一年等。In some embodiments, the users to be predicted may be all users of an online transportation service platform (for example, a car-hailing platform). In some embodiments, since the number of online car-hailing users is very large, generally in the tens of millions or even billions, the location of each user is determined, and the amount of calculation is very large. Therefore, in order to be more targeted, It can also be a user selected from all users of the online transportation service platform based on certain conditions. For example, the location of each user during bubbling can be calculated periodically, and if the bubbling location of any user during the period belongs to at least two cities or a preset area, the user is determined as the user to be predicted. For another example, it is possible to periodically calculate the place of departure from which each user places an order. If the place of departure of any user during the period belongs to at least two cities or belongs to at least two preset areas, the user is determined to be a waiting area. Forecast users. For another example, it may be determined that a user whose historical order quantity is greater than the threshold order quantity in a threshold time period is a user to be predicted. The threshold time period can be the past day, week, month, year, etc.
下面通过Ⅰ、Ⅱ以及Ⅲ,对预测用户在至少一种历史行为特征下的第一特征值、在未来预设时间段内,在目标时间特征下的第二特征值和在目标位置特征下的第三特征值的获取方式分别加以描述:The following uses I, II, and III to predict the first feature value of the user under at least one historical behavior feature, the second feature value under the target time feature, and the second feature value under the target location feature within a preset time period in the future. The methods of obtaining the third characteristic value are described separately:
Ⅰ:获取待预测用户在至少一种历史行为特征下的第一特征值:Ⅰ: Obtain the first characteristic value of the user to be predicted under at least one historical behavior characteristic:
参见图4所示,图4示出了根据本申请一些实施例的获取第一历史行为特征下的第一特征值和第二历史行为特征下的第一特征值的过程的流程图。过程400可以包括步骤S401~S403。在一些实施例中,过程400可以由处理设备112执行。例如,过程400 可以以存储在存储器160中的一组指令(例如,应用程序)实现。服务器110、CPU 220a和/或CPU 240b可以执行该组指令,因此可以指示执行该过程400。Referring to FIG. 4, FIG. 4 shows a flowchart of a process of obtaining a first characteristic value under a first historical behavior characteristic and a first characteristic value under a second historical behavior characteristic according to some embodiments of the present application. The process 400 may include steps S401 to S403. In some embodiments, the process 400 may be performed by the processing device 112. For example, the process 400 may be implemented with a set of instructions (for example, an application program) stored in the memory 160. The server 110, the CPU 220a, and/or the CPU 240b can execute the set of instructions, and therefore can instruct to execute the process 400.
S401:从待预测用户的至少一种历史行为信息中,提取所述待预测用户在不同区域(包括但不限于待预测区域)的区域历史行为信息,以及提取所述待预测用户在多个历史时间段中每个历史时间段内,在不同区域的区域时间历史行为信息。S401: Extracting regional historical behavior information of the user to be predicted in different regions (including but not limited to the region to be predicted) from at least one type of historical behavior information of the user to be predicted, and extracting multiple historical behaviors of the user to be predicted Regional time historical behavior information in different regions within each historical time period in the time period.
在一些实施例中,所述区域可以是市政规划的省、市、县、镇、区等。在一些实施例中,所述区域可以是根据经纬度划分的网格。例如,所述区域可以是经度和纬度分别在东经90度至91度、北纬30度至31度之间的区域。在一些实施例中,所述区域还可以是任意划分的感兴趣区域(area of interest,AOI)。例如,所述区域可以是学校、商场、景区、医院等。又例如,所述区域可以是用户自定义的区域。例如,以某点为中心、一定距离为半径确定的区域。In some embodiments, the area may be a province, city, county, town, district, etc., in a municipal plan. In some embodiments, the area may be a grid divided according to latitude and longitude. For example, the area may be an area in which longitude and latitude are between 90 degrees to 91 degrees east longitude and 30 degrees to 31 degrees north latitude, respectively. In some embodiments, the area may also be an arbitrarily divided area of interest (AOI). For example, the area may be a school, a shopping mall, a scenic spot, a hospital, etc. For another example, the area may be a user-defined area. For example, an area with a certain point as the center and a certain distance as a radius.
待预测用户的历史行为可以包括用户在在线运输服务平台中的任意操作行为。例如,历史行为可以包括登录行为、冒泡行为、发单行为、完单行为、评价行为、咨询行为、订阅行为等,或其任意组合。历史行为可以是过去某一段时间内(例如,前一天、前一周、前一个月、去年等)的历史行为。每一种历史行为可以对应一种历史行为信息。The historical behavior of the user to be predicted may include any operation behavior of the user in the online transportation service platform. For example, the historical behavior may include login behavior, bubbling behavior, billing behavior, order completion behavior, evaluation behavior, consulting behavior, subscription behavior, etc., or any combination thereof. The historical behavior may be a historical behavior in a certain period of time in the past (for example, the previous day, the previous week, the previous month, the last year, etc.). Each type of historical behavior can correspond to a type of historical behavior information.
如在这里所述的,冒泡行为,冒泡是指是服务请求端进入在线运输服务平台的服务软件的首页面,并将服务请求端的出行起点和出行终点发送给在线运输服务平台。在线运输服务平台接收到服务请求端发送的出行起点和出行终点,确定监听到服务请求端发生冒泡行为。例如,用户可以通过用户终端(例如,服务请求者终端130)登录服务软件,用户终端可以通过定位系统(例如,定位设备170)上传用户终端的位置信息作为出行起点,用户可以通过手动输入或拖拽地图的方式输入出行终点,响应于用户输入出行起点和出行终点,按需服务系统100可以将用户的行为存储为冒泡行为并存储至存储器160中。As mentioned here, bubbling behavior, bubbling means that the service requester enters the home page of the service software of the online transportation service platform, and sends the travel starting point and travel destination of the service requesting end to the online transportation service platform. The online transportation service platform receives the travel starting point and travel end point sent by the service requester, and determines to monitor bubbling behavior on the service requesting end. For example, the user can log in to the service software through the user terminal (for example, the service requester terminal 130), the user terminal can upload the location information of the user terminal through the positioning system (for example, the positioning device 170) as the starting point of travel, and the user can manually input or drag The travel destination is input by dragging the map. In response to the user inputting the travel starting point and the travel end point, the on-demand service system 100 may store the user's behavior as a bubbling behavior and store it in the memory 160.
在线运输服务平台在获取服务请求端发送的出行起点和出行终点后,为该服务请求端本次出行确定订单预估价格,并将订单预估价格发送给服务请求端。服务请求端在接收到预估订单价格后,基于乘客的触发,生成订单信息,并将订单信息发送给在线运输服务平台。此行为即为服务请求端的发单行为。After the online transportation service platform obtains the travel starting point and travel end point sent by the service requesting terminal, it determines the estimated order price for the service requesting terminal for this trip, and sends the estimated order price to the service requesting terminal. After receiving the estimated order price, the service requester generates order information based on the passenger's trigger, and sends the order information to the online transportation service platform. This behavior is the billing behavior of the service requester.
在线运输服务平台在接收到服务请求端发送的订单信息后,生成订单,并将订单推送至多个符合要求的服务提供端,并基于服务提供端的反馈,为该服务请求端匹配服务提供端。After the online transportation service platform receives the order information sent by the service requester, it generates an order and pushes the order to multiple qualified service providers, and matches the service provider for the service requester based on the feedback of the service provider.
当为服务请求端匹配的服务提供端完成服务后,会向在线运输服务平台反馈服务完成信息。在线运输服务平台向服务请求端发起费用支付。服务请求端完成支付后,则认为该服务请求端发生了完单行为。After the service provider matched with the service requester completes the service, it will feedback the service completion information to the online transportation service platform. The online transportation service platform initiates fee payment to the service requester. After the service requester completes the payment, it is deemed that the service requester has completed the order.
需要注意的是,发单行为并不等同于完单行为。只有当订单被完成并完成支付后,才认为用户行为为完单行为。如果服务过程中途,由服务提供端、服务请求端或者网约车任意一方取消服务,则与该订单对应的完单行为则不再发生。It should be noted that the billing behavior is not the same as the billing behavior. Only when the order is completed and the payment is completed, the user behavior is considered to be the order completion behavior. If in the middle of the service process, the service provider, service requester, or online ride-hailing party cancels the service, the order completion behavior corresponding to the order will no longer occur.
每一种历史行为可以包括对应的历史行为信息。例如,冒泡行为对应的历史行为信息包括历史冒泡行为信息;发单行为对应的历史行为信息包括历史发单行为信息;完单行为对应的历史行为信息包括历史完单行为信息。每种历史行为信息可以包括历史行为发生时间和历史行为发生地点。例如,当用户通过用户终端发生历史行为时,用户终端可以将历史行为发生时的历史行为发生时间和历史行为发生地点发送至存储器160进行存储。如在这里所述的,历史行为发生时间和历史行为发生地点分别指完成历史行为时的发生时间和发生地点。例如,用户可以登录服务软件,当用户将出行起点和出行终点都输入后,在线运输服务平台可以记录该用户的行为为冒泡行为,输入出行起点和/或出行终点的时间和地点为历史行为发生时间和历史行为发生地点;响应于用户触发生成订单信息时,在线运输服务平台可以更新该用户的行为为发单行为,并将触发订单信息生成的时间和地点确定为历史行为发生时间和历史行为发生地点。Each historical behavior can include corresponding historical behavior information. For example, the historical behavior information corresponding to the bubbling behavior includes historical bubbling behavior information; the historical behavior information corresponding to the billing behavior includes historical billing behavior information; the historical behavior information corresponding to the billing behavior includes historical billing behavior information. Each type of historical behavior information may include the time when the historical behavior occurred and the place where the historical behavior occurred. For example, when a user has a historical behavior through the user terminal, the user terminal may send the historical behavior occurrence time and the historical behavior occurrence location when the historical behavior occurred to the memory 160 for storage. As described here, the time when the historical behavior occurred and the place where the historical behavior occurred respectively refer to the time and place when the historical behavior was completed. For example, a user can log in to the service software. After the user enters both the starting point and the end of the trip, the online transportation service platform can record the user's behavior as a bubbling behavior, and input the time and place of the trip start and/or end of the trip as a historical behavior Occurrence time and location of the historical behavior; in response to the user triggering the generation of order information, the online transportation service platform can update the user’s behavior as an ordering behavior, and determine the time and place that triggered the generation of the order information as the historical behavior time and history Where the act occurred.
待预测用户的至少一种历史行为信息,包括上述历史行为中至少一种历史行为的历史行为信息。The at least one type of historical behavior information of the user to be predicted includes historical behavior information of at least one of the foregoing historical behaviors.
示例性的,可以采用下述方式获取待预测用户的至少一种历史行为信息、区域历史行为信息,以及区域时间历史行为信息。Exemplarily, the following methods may be used to obtain at least one type of historical behavior information, regional historical behavior information, and regional time historical behavior information of the user to be predicted.
获取待预测用户的历史行为信息X。其中,第i条历史行为信息记为x i。例如,可以从存储器160或其他存储设备(未示出)获取待预测用户的历史行为信息X。 Obtain historical behavior information X of the user to be predicted. Among them, the i-th historical behavior information is recorded as x i . For example, the historical behavior information X of the user to be predicted may be obtained from the memory 160 or other storage devices (not shown).
根据各历史行为信息X中的历史行为发生地点,从历史行为信息X中,确定待预测用户在不同区域的区域历史行为信息X city。其中,第j条区域历史行为信息记为
Figure PCTCN2020073652-appb-000001
。例如,可以根据历史行为发生地点,确定在不同区域中发生的历史行为,从而确定待预测用户各个区域的区域历史行为信息X city。在一些实施例中,所述不同区域中的部分或全部 可以是待预测区域。
According to the historical behavior occurrence location in each historical behavior information X, from the historical behavior information X, determine the regional historical behavior information X city of the user to be predicted in different regions. Among them, the j-th regional historical behavior information is recorded as
Figure PCTCN2020073652-appb-000001
. For example, it is possible to determine historical behaviors occurring in different regions according to the locations of historical behaviors, thereby determining the regional historical behavior information X city of each region of the user to be predicted. In some embodiments, some or all of the different regions may be regions to be predicted.
根据待预测用户在不同区域的区域历史行为信息X city的历史行为发生时间,以及预设的多个历史时间段,确定待预测用户在多个历史时间段中每个历史时间段内,在不同区域的区域时间历史行为信息X city,t。区域时间历史行为信息X city,t的组数可以等于所述多个历史时间段的段数N×区域个数M。第m组的第n条区域历史时间行为信息记为
Figure PCTCN2020073652-appb-000002
According to the regional historical behavior information of the user to be predicted in different regions, the historical behavior occurrence time of X city and the preset multiple historical time periods, it is determined that the user to be predicted is in a different historical time period in each of the multiple historical time periods. The regional time history behavior information of the region X city,t . The number of groups of the regional time historical behavior information X city, t may be equal to the number of segments N of the multiple historical time periods × the number of regions M. The n-th regional historical time behavior information of the m-th group is recorded as
Figure PCTCN2020073652-appb-000002
需要注意的是,历史行为信息X和区域历史行为信息X city均有一组。 It should be noted that there is a set of historical behavior information X and regional historical behavior information X city .
示例性的,多个历史时间段可以根据实际的需要具体设定,例如,可以按照距离当前的时间划分多个历史时间段,例如,距当前0-2天、距当前2-5天、距当前5-10天、距当前10-20天、距当前20-50天、距当前50-100天、距当前100天以上。则对应的区域历史时间行为信息可以为7组。Exemplarily, multiple historical time periods can be specifically set according to actual needs. For example, multiple historical time periods can be divided according to the distance from the current time, for example, 0-2 days from the current, 2-5 days from the current, and The current 5-10 days, the current 10-20 days, the current 20-50 days, the current 50-100 days, and the current 100 days or more. Then the corresponding regional historical time behavior information can be 7 groups.
S402:根据所述待预测用户的所述至少一种历史行为信息、所述区域历史行为信息以及所述区域时间历史行为信息,确定所述待预测用户的第一历史行为特征下的第一特征值。S402: Determine the first feature under the first historical behavior feature of the user to be predicted according to the at least one type of historical behavior information of the user to be predicted, the regional historical behavior information, and the regional time historical behavior information value.
S403:根据所述待预测用户的所述至少一种历史行为信息,确定所述待预测用户的第二历史行为特征下的第一特征值。S403: Determine a first characteristic value under a second historical behavior characteristic of the user to be predicted according to the at least one historical behavior information of the user to be predicted.
在一些实施例中,第一历史行为特征包括但不限于如下a1~a5一种或者多种:In some embodiments, the first historical behavior feature includes but is not limited to one or more of the following a1 to a5:
a1:与多个兴趣点(Point Of Interest,POI)分类分别对应的历史行为的发生次数。此处,POI分类为预先为不同POI确定的分类。例如:机场、娱乐场所、商场、学校等等。a1: The number of occurrences of historical behaviors corresponding to multiple points of interest (POI) categories. Here, the POI classification is a classification determined in advance for different POIs. For example: airports, entertainment venues, shopping malls, schools, etc.
a2、历史行为在预设时间段内每天的发生次数。例如,历史行为在周一到周日,每天的发生次数。a2. The number of occurrences of historical behaviors per day in a preset time period. For example, the number of occurrences of historical behavior from Monday to Sunday.
a3、历史行为在工作日的发生次数。例如,当历史行为包括冒泡行为以及发单行为时,则历史行为在工作日的发生次数,是指冒泡行为以及发单行为在工作日的发生次数。a3. The number of occurrences of historical behavior on working days. For example, when the historical behavior includes bubbling behavior and billing behavior, the number of occurrences of historical behavior on working days refers to the number of bubbling behaviors and billing behaviors on working days.
a4、历史行为在非工作日(例如,周末和/或法定假日)的发生次数。a4. The number of occurrences of historical behavior on non-working days (for example, weekends and/or statutory holidays).
a5、不同历史行为分别对应的发生次数。例如,当历史行为包括冒泡行为以及发单行为时,则不同历史行为分别对应的发生次数,包括冒泡行为对应的发生次数,以及发单行为对应的发生次数。a5. The corresponding occurrence times of different historical behaviors. For example, when the historical behavior includes bubbling behavior and billing behavior, the corresponding occurrence times of different historical behaviors include the corresponding occurrence times of bubbling behavior and the occurrence times corresponding to billing behavior.
在一些实施例中,第二历史行为特征包括但不限于以下b1~b5中一种或者多种:In some embodiments, the second historical behavior feature includes but is not limited to one or more of the following b1 to b5:
b1:最后一次历史行为发生的区域是否为所述区域;其中,最后一次历史行为是指距当前时间(当前时间可以是指任意预设的时间,如模型训练时间、模型预测时间或其他指定的时间)最近的一次历史行为。b1: Whether the area where the last historical behavior occurred is the area; where the last historical behavior refers to the current time (the current time can refer to any preset time, such as model training time, model prediction time or other specified Time) the most recent historical act.
b2:最后一次历史行为发生的时间与所述待预测时间之间的时间间隔;例如,最后一次历史行为的发生时间为距当前时间2天,待预测时间为未来1天,则所述时间间隔为3天。又例如,最后一次历史行为的发生时间为距当前时间5小时,待预测时间为未来5至8小时,则所述时间间隔为10小时。b2: the time interval between the time when the last historical behavior occurred and the time to be predicted; for example, if the time when the last historical behavior occurred is 2 days from the current time, and the time to be predicted is 1 day in the future, then the time interval For 3 days. For another example, if the occurrence time of the last historical behavior is 5 hours from the current time, and the time to be predicted is 5 to 8 hours in the future, the time interval is 10 hours.
b3:最后一次历史行为的目的地或终点的POI分类;b3: POI classification of the destination or end point of the last historical act;
b4:最后一次历史行为的出发地的POI分类b4: POI classification of the departure place of the last historical act
b5:所述待预测用户到达过的区域数量。待预测用户到达过的区域是指用户通过用户终端(例如,服务请求者终端130)登录服务软件后,按需服务系统100通过定位设备170对用户终端定位确定的区域。例如,按需服务系统100通过定位设备170定位用户在成都登录过服务软件,则可以确定用户到达过成都。b5: The number of areas reached by the users to be predicted. The area to be predicted that the user has reached refers to the area determined by the on-demand service system 100 to locate the user terminal through the positioning device 170 after the user logs in to the service software through the user terminal (for example, the service requester terminal 130). For example, if the on-demand service system 100 uses the positioning device 170 to locate the user who has logged in to the service software in Chengdu, it can determine that the user has arrived in Chengdu.
在一些实施例中,所述区域和待预测时间可以是按需服务系统100的默认设置,或根据实际应用情况具体设定。In some embodiments, the area and the time to be predicted may be the default settings of the on-demand service system 100, or may be specifically set according to actual application conditions.
在一些实施例中,所述区域可以是待预测用户到达过的区域。在一些实施例中,所述区域还可以是待预测用户未到达过的区域。例如,所述区域可以是待预测用户曾经搜索查询过的目的地。待预测时间为未来预设时间段。本申请实施例提供的位置预测方法,可以对待预测用户在未来预设时间段出现在所述区域的概率进行预测。In some embodiments, the area may be an area where the user to be predicted has reached. In some embodiments, the area may also be an area that the user to be predicted has not reached. For example, the area may be a destination to be predicted that the user has searched for. The time to be predicted is a preset time period in the future. The location prediction method provided by the embodiment of the present application can predict the probability that the user to be predicted will appear in the area in a preset time period in the future.
待预测用户在至少一种历史行为特征下的第一特征值可以包括三部分:The first feature value of the user to be predicted under at least one historical behavior feature may include three parts:
第一部分:根据待预测用户的至少一种历史行为信息X,确定与待预测用户的至少一种历史行为信息X对应的在第一历史行为特征下的特征值。The first part: According to at least one type of historical behavior information X of the user to be predicted, a feature value under the first historical behavior feature corresponding to the at least one type of historical behavior information X of the user to be predicted is determined.
第二部分,根据待预测用户在多个历史时间段中每个历史时间段内在不同区域的区域历史行为信息X city,确定与的区域历史行为信息X city对应的在第一历史行为特征下的特征值。 In the second part, according to the regional historical behavior information X city of the user to be predicted in different regions in each historical period of multiple historical time periods, determine the corresponding regional historical behavior information X city under the first historical behavior feature Eigenvalues.
第三部分,根据待预测用户在多个历史时间段中每个历史时间段内,在不同区域的区域时间历史行为信息X city,t,确定每组区域历史时间行为信息X city,t在第一历史行为 特征下的特征值。 In the third part, according to the regional time historical behavior information X city,t of the user to be predicted in each historical time period in multiple historical time periods, determine that each group of regional historical time behavior information X city,t is in the first A characteristic value under a historical behavior characteristic.
Ⅱ:获取待预测用户在未来预设时间段内,在目标时间特征下的第二特征值:Ⅱ: Obtain the second feature value of the user to be predicted in the future preset time period under the target time feature:
在一些实施例中,未来预设时间段,即为待预测时间。在目标时间特征下的第二特征值可以反映与待预测时间相关的信息。In some embodiments, the predetermined time period in the future is the time to be predicted. The second feature value under the target time feature may reflect information related to the time to be predicted.
在一些实施例中,目标时间特征可以包括该未来预设时间段是周几、该未来预设时间段是节假日还是工作日。在一些实施例中,目标时间特征还可以包括该未来预设时间段的属性。例如,是否处于上班高峰期或下班高峰期。In some embodiments, the target time feature may include whether the future preset time period is the day of the week, and whether the future preset time period is a holiday or a working day. In some embodiments, the target time feature may also include the attribute of the future preset time period. For example, whether it is in peak hours or peak off hours.
根据待预测用户的待预测时间对应的时间信息,可以确定目标时间特征,从而可以确定待预测用户在未来预设时间段内,在目标时间特征下的第二特征值。在一些实施例中,第二特征值可以是目标时间特征的数字化,仅作为示例,第二特征值可以是对目标时间特征的编码后得到的向量。According to the time information corresponding to the time to be predicted of the user to be predicted, the target time feature can be determined, so that the second feature value of the user to be predicted under the target time feature in a preset time period in the future can be determined. In some embodiments, the second feature value may be a digitization of the target time feature. For example only, the second feature value may be a vector obtained by encoding the target time feature.
Ⅲ:获取待预测用户在未来预设时间段内,在目标位置特征下的第三特征值;Ⅲ: Obtain the third feature value of the user to be predicted in the future preset time period under the target location feature;
此处,目标位置特征可以包括所述区域的属性。例如,所述区域是否为旅游城市、是否为省会城市、所述区域附近的POI分类种类、所述区域距离待预测用户家和/或公司的距离等。Here, the target location feature may include the attributes of the area. For example, whether the area is a tourist city, whether it is a provincial capital city, the POI classification type near the area, and the distance between the area and the user's home and/or company to be predicted.
在确定了所述区域后,就能够根据该所述区域的属性,从而确定对应的第三特征值。在一些实施例中,第三特征值可以是目标位置特征的数字化,仅作为示例,第三特征值可以是对目标位置特征的编码后得到的向量。After the area is determined, the corresponding third characteristic value can be determined according to the attributes of the area. In some embodiments, the third feature value may be the digitization of the target location feature. For example only, the third feature value may be a vector obtained by encoding the target location feature.
承接上述S301,在获取了待预测用户的第一特征值、第二特征值以及第三特征值后,过程300还包括:Following the above S301, after acquiring the first feature value, the second feature value, and the third feature value of the user to be predicted, the process 300 further includes:
S302:将所述第一特征值、所述第二特征值和所述第三特征值输入至预先训练的位置预测模型中,获取所述待预测用户在所述未来预设时间段的位置预测结果。S302: Input the first feature value, the second feature value, and the third feature value into a pre-trained location prediction model, and obtain the location prediction of the user to be predicted in the future preset time period result.
在一些实施例中,待预测用户在所述未来预设时间段的位置预测结果,可以是待预测用户在该未来预设时间段,在待预测位置出现的概率。另外,在本申请另一实施例中,还可以对待预测用户在未来预设时间段内,在多个待预测位置出现的概率进行预测。在预测的时候,除了获取待预测用户的第一特征值和第二特征值外,还要获取待预测用户针对每一个区域,对应的第三特征值,并构成多个组位置预测模型的输入,分别输入至位置预测模型中,获取待预测用户在未来预设时间段在各个区域出现的概率。按需服务系统100 可以将概率最大的区域确定为待预测用户在该未来预设时间段出现的位置。In some embodiments, the location prediction result of the user to be predicted in the predetermined time period in the future may be the probability that the user to be predicted will appear at the location to be predicted in the predetermined time period in the future. In addition, in another embodiment of the present application, the probability that the user to be predicted will appear at multiple locations to be predicted within a preset time period in the future can also be predicted. When predicting, in addition to obtaining the first feature value and the second feature value of the user to be predicted, the third feature value corresponding to each area of the user to be predicted is also obtained, and the input of multiple groups of location prediction models is formed , Respectively input into the location prediction model to obtain the probability that the user to be predicted will appear in each area at a preset time period in the future. The on-demand service system 100 may determine the area with the greatest probability as the location where the user to be predicted will appear in the future preset time period.
例如,要确定待预测用户在未来的二十四小时内,在A、B、C三个区域中分别出现的概率。For example, it is necessary to determine the probability that the user to be predicted will appear in the three regions A, B, and C in the next twenty-four hours.
获取待预测用户的第一特征值a,在未来预设时间段内,在目标时间特征下的第二特征值b,以及获取待预测用户在未来预设时间段内,与所述区域A、B和C分别对应的第三特征值c1、c2和c3。Obtain the first feature value a of the user to be predicted, the second feature value b under the target time feature in the future preset time period, and obtain the user to be predicted in the future preset time period, and the area A, B and C correspond to the third characteristic values c1, c2, and c3, respectively.
从而,构成三组输入数据,分别为:a、b以及c1;a、b以及c2;a、b以及c3。Thus, three sets of input data are formed: a, b, and c1; a, b, and c2; a, b, and c3.
将三组输入数据依次输入至预先训练的位置预测模型中,获取待预测用户在未来二十四小时内,分别在A、B和C地出现的概率。The three sets of input data are sequentially input into the pre-trained location prediction model to obtain the probabilities of the users to be predicted in places A, B, and C in the next twenty-four hours.
在一些实施例中,在确定了待预测用户在多个区域出现的概率后,能够根据待预测用户在多个区域出现的概率大小进行排序,可以确定概率最大的区域为待预测用户在未来预设时间段内,最可能出现的位置,从而能够基于预测结果,提前进行资源调度以及相关服务策略的配置。In some embodiments, after determining the probability that the user to be predicted appears in multiple regions, it can be sorted according to the probability of the user to be predicted appearing in the multiple regions, and the region with the highest probability can be determined as the user to be predicted in the future. Set the most likely location within the time period, so that resource scheduling and related service strategy configuration can be performed in advance based on the prediction result.
图5示出了根据本申请一些实施例的训练位置预测模型的过程的流程图。过程500可以包括步骤S501~S502。在一些实施例中,过程500可以由处理设备112执行。例如,过程500可以以存储在存储器160中的一组指令(例如,应用程序)实现。服务器110、CPU 220a和/或CPU 240b可以执行该组指令,因此可以指示执行该过程500。在一些实施例中,过程500可以由除了按需服务系统100的其他系统或设备执行。例如,由制造供应商提供的设备或系统执行。Fig. 5 shows a flowchart of a process of training a position prediction model according to some embodiments of the present application. The process 500 may include steps S501 to S502. In some embodiments, the process 500 may be performed by the processing device 112. For example, the process 500 may be implemented with a set of instructions (for example, an application program) stored in the memory 160. The server 110, the CPU 220a, and/or the CPU 240b can execute the set of instructions, and therefore can instruct to execute the process 500. In some embodiments, the process 500 may be performed by other systems or devices other than the on-demand service system 100. For example, implemented by equipment or systems provided by a manufacturing supplier.
S501:获取多个样本用户在在至少一种历史行为特征下的第一样本特征值,以及各个所述样本用户在历史预设时间段内,在目标时间特征下的第二样本特征值和在目标位置特征下的第三样本特征值;所述样本用户包括正样本用户以及负样本用户。S501: Acquire a first sample feature value of a plurality of sample users under at least one historical behavior characteristic, and a second sample feature value of each of the sample users under a target time characteristic within a historical preset time period and The third sample feature value under the target location feature; the sample users include positive sample users and negative sample users.
S502:基于各个样本用户的所述第一样本特征值、所述第二样本特征值和所述第三样本特征值,训练所述位置预测模型。S502: Train the position prediction model based on the first sample feature value, the second sample feature value, and the third sample feature value of each sample user.
在一些实施例中,位置预测模型可以包括xgboost(eXtreme Gradient Boosting)、循环神经网络(Recurrent Neural Network,RNN)、长短期记忆网络(Long Short-Term Memory,LSTM)、决策树(Decision Tree,DT)模型、梯度提升决策树(Gradient Boosting Decision Tree,GBDT)模型、k-最近邻算法(K-Nearest Neighbor,kNN)模型、卷积神经网络(Convolutional Neural Networks,CNN)、人工神经网络(Artificial Neural Networks, ANN)模型等,或其任意组合。In some embodiments, the position prediction model may include xgboost (eXtreme Gradient Boosting), Recurrent Neural Network (RNN), Long Short-Term Memory (LSTM), and Decision Tree (DT). ) Model, gradient boosting decision tree (Gradient Boosting Decision Tree, GBDT) model, k-nearest neighbor algorithm (K-Nearest Neighbor, kNN) model, convolutional neural network (Convolutional Neural Networks, CNN), artificial neural network (Artificial Neural) Networks, ANN) models, etc., or any combination thereof.
在具体实施中,不同样本用户的历史预设时间段可以相同,也可以不同。例如,样本用户A将“2018年11月15日”确定为历史预设时间段,样本用户B将“2018年10月10日”确定为历史预设时间段。但不同的样本用户的历史预设时间段的时长相同,且与上述未来预设时间段的时长相同。In specific implementation, the historical preset time periods of different sample users may be the same or different. For example, sample user A determines "November 15, 2018" as the historical preset time period, and sample user B determines "October 10, 2018" as the historical preset time period. However, the duration of the historical preset time period of different sample users is the same, and the duration of the aforementioned future preset time period is the same.
若样本用户在历史预设时间段出现在所述区域,则该样本用户为正样本用户。若该样本用户在历史预设时间段并未出现在所述区域,则该样本用户为负样本用户。If the sample user appears in the area during the historical preset time period, the sample user is a positive sample user. If the sample user does not appear in the area in the historical preset time period, the sample user is a negative sample user.
其中,由于正样本用户在历史预设时间段出现在所述区域,因此其在历史预设时间段出现在所述区域的概率可以设为1;负样本用户在历史预设时间段并未出现在带预测区域,因此其在历史预设时间段出现在所述区域的概率可以设为0。Among them, since the positive sample user appears in the area during the historical preset time period, the probability of their appearance in the area during the historical preset time period can be set to 1; the negative sample user does not appear in the historical preset time period In the predicted area, the probability of appearing in the area in the historical preset time period can be set to zero.
对模型进行训练的过程,将各个样本用户的第一样本特征值、第二样本特征值、第三样本特征值输入位置预测模型后,位置预测模型输出的结果,要尽可能接近其出现在所述区域的概率。The process of training the model. After the first sample feature value, second sample feature value, and third sample feature value of each sample user are input into the location prediction model, the output result of the location prediction model should be as close as possible to the The probability of the area.
位置预测模型可以是仅仅能够对待预测用户在未来预设时间段内在一个确定的所述区域出现的概率进行预测,也可以是针对待预测用户在未来预设时间段内在多个待预设区域出现的概率进行预测,因此,针对这两种不同的情况,确定的样本用户的情况也不相同。The location prediction model can only predict the probability that the user to be predicted will appear in a certain area within a preset time period in the future, or it may be for the user to be predicted to appear in multiple areas to be preset within a preset time period in the future. Therefore, for these two different situations, the situation of the determined sample users is also different.
一:在位置预测模型仅仅能够对待预测用户在未来预设时间段内在一个确定的区域出现的概率进行预测的时候,可以采用下述方式确定样本用户:One: When the location prediction model can only predict the probability of the user to be predicted in a certain area within a preset time period in the future, the following methods can be used to determine the sample users:
从用户数据库中,筛选与该区域相关的用户,作为样本用户。与该区域相关的用户可以包括到达过该区域的用户和/或搜索过该区域的用户。From the user database, filter users related to the area as sample users. Users related to the area may include users who have visited the area and/or users who have searched the area.
针对该种情况,可以使用下述方式确定样本用户的第一样本特征值:In this case, the following method can be used to determine the first sample feature value of the sample user:
针对每个样本用户,从该样本用户的至少一种历史行为信息中,提取样本用户在所述区域的区域历史行为信息,以及提取所述样本用户在多个历史时间段中每个历史时间段内,在待测区域的区域时间历史行为信息;For each sample user, extract regional historical behavior information of the sample user in the region from at least one type of historical behavior information of the sample user, and extract each historical time period of the sample user in a plurality of historical time periods Inside, the regional time history behavior information of the area to be tested;
根据所述样本用户的所述至少一种历史行为信息、区域历史行为信息以及区域时间历史行为信息,确定所述样本用户的第一历史行为特征下的第一样本特征值;Determine the first sample feature value under the first historical behavior feature of the sample user according to the at least one type of historical behavior information, regional historical behavior information, and regional time historical behavior information of the sample user;
根据所述样本用户的所述至少一种历史行为信息,确定所述样本用户的第二历史行为特征下的第一样本特征值。According to the at least one type of historical behavior information of the sample user, a first sample characteristic value under the second historical behavior characteristic of the sample user is determined.
此处,样本用户的第一历史行为特征下的第一样本特征值和样本用户在第二历史行为特征下的第一样本特征值,与上述待预测用户的第一特征值的确定方式类似,在此不再赘述。Here, the first sample feature value under the first historical behavior feature of the sample user and the first sample feature value under the second historical behavior feature of the sample user are the same as the method for determining the first feature value of the user to be predicted Similar, I won't repeat it here.
需要注意的是,样本用户的第一样本特征值,是基于历史预设时间段之前的历史行为信息构建的。It should be noted that the first sample feature value of the sample user is constructed based on historical behavior information before the historical preset time period.
二:在位置预测模型能够针对待预测用户在未来预设时间段内在多个区域出现的概率进行预测的时候,可以采用下述方式确定样本用户:Two: When the location prediction model can predict the probability of the user to be predicted in multiple areas within a preset time period in the future, the following methods can be used to determine the sample users:
从用户数据库中,筛选与各个区域相关的用户,作为各个区域对应的样本用户。From the user database, filter users related to each area as sample users corresponding to each area.
可以采用下述方式获取样本用户的第一样本特征值、第二样本特征值和第三样本特征值:The following methods can be used to obtain the first sample feature value, the second sample feature value, and the third sample feature value of the sample user:
针对多个待预测样本区域中的每个待预测样本区域,获取与该待预测样本区域对应的样本用户在在至少一种历史行为特征下的第一样本特征值,以及各个所述样本用户在历史预设时间段内,在目标时间特征下的第二样本特征值和在目标位置特征下的第三样本特征值;For each sample region to be predicted in the plurality of sample regions to be predicted, obtain the first sample feature value of the sample user corresponding to the sample region to be predicted under at least one historical behavior characteristic, and each of the sample users In the historical preset time period, the second sample feature value under the target time feature and the third sample feature value under the target location feature;
具体地,可以采用下述方式(1)和(2)中任意一种获取各个样本用户的第一样本特征值:Specifically, any one of the following methods (1) and (2) may be used to obtain the first sample feature value of each sample user:
针对每个区域,并针对该区域的各个样本用户,从该样本用户的至少一种历史行为信息中,提取样本用户在所述区域的区域历史行为信息,以及提取所述样本用户在多个历史时间段中每个历史时间段内,在待测区域的区域时间历史行为信息;For each region and for each sample user in the region, extract the regional historical behavior information of the sample user in the region from at least one type of historical behavior information of the sample user, and extract the sample user’s historical behavior information in the region. In each historical time period in the time period, the regional time historical behavior information of the area to be tested;
根据所述样本用户的所述至少一种历史行为信息、区域历史行为信息以及区域时间历史行为信息,确定所述样本用户的第一历史行为特征下的第一样本特征值;Determine the first sample feature value under the first historical behavior feature of the sample user according to the at least one type of historical behavior information, regional historical behavior information, and regional time historical behavior information of the sample user;
根据所述样本用户的所述至少一种历史行为信息,确定所述样本用户的第二历史行为特征下的第一样本特征值。According to the at least one type of historical behavior information of the sample user, a first sample characteristic value under the second historical behavior characteristic of the sample user is determined.
此种方式,是基于多个区域筛选与每个区域对应的样本用户。In this way, the sample users corresponding to each region are screened based on multiple regions.
针对多个用户,从该用户的至少一种历史行为信息中,提取该用户在至少一个区域的区域历史行为信息,以及提取该用户在多个历史时间段中每个历史时间段内,在至少一个区域中与每个区域对应的区域时间历史行为信息。For multiple users, extract regional historical behavior information of the user in at least one area from at least one type of historical behavior information of the user, and extract the user’s historical time period in each of the multiple historical time periods. Regional time history behavior information corresponding to each region in a region.
根据用户在至少一个区域的区域历史行为信息,以及提取该用户在多个历史时间段中每个历史时间段内,在至少一个区域中与每个区域对应的区域时间历史行为信息,确 定用户在至少一个区域中与每个区域对应第一历史行为特征下的第一样本特征值;According to the regional historical behavior information of the user in at least one region, and extracting the regional time historical behavior information corresponding to each region in at least one region in each historical time period of the multiple historical time periods, it is determined that the user is in At least one area corresponds to the first sample feature value of each area under the first historical behavior feature;
根据所述样本用户的所述至少一种历史行为信息,确定用户在至少一个区域中与每个区域对应第二历史行为特征下的第一样本特征值。According to the at least one type of historical behavior information of the sample user, a first sample feature value under the second historical behavior feature corresponding to each area of the user in at least one area is determined.
例如,用户C在区域M1和M2均出现过。For example, user C has appeared in both areas M1 and M2.
根据用户C的历史行为信息,能够提取与区域M1对应的区域历史行为信息,并提取与所述区域M1对应的区域时间历史行为信息。According to the historical behavior information of the user C, the regional historical behavior information corresponding to the region M1 can be extracted, and the regional time historical behavior information corresponding to the region M1 can be extracted.
然后根据与所述区域M1对应的区域历史行为信息,和与所述区域M1对应的区域时间历史行为信息,构建用户C与所述区域M1对应的第一样本特征值。Then, according to the area historical behavior information corresponding to the area M1 and the area time historical behavior information corresponding to the area M1, a first sample feature value corresponding to the user C and the area M1 is constructed.
并且,根据用户C的历史行为信息,还能够提取与所述区域M2对应的区域历史行为信息,并提取与带预测区域M2对应的区域时间历史行为信息。然后根据与所述区域M2对应的区域历史行为信息,和与所述区域M2对应的区域时间历史行为信息,构建用户C与所述区域M2对应的第一样本特征值。Furthermore, based on the historical behavior information of the user C, it is also possible to extract the regional historical behavior information corresponding to the region M2, and extract the regional temporal historical behavior information corresponding to the predicted region M2. Then, according to the area historical behavior information corresponding to the area M2 and the area time historical behavior information corresponding to the area M2, a first sample feature value corresponding to the user C and the area M2 is constructed.
此处,样本用户的第一历史行为特征下的第一样本特征值和样本用户在第二历史行为特征下的第一样本特征值,与上述待预测用户的第一特征值的确定方式类似,在此不再赘述。Here, the first sample feature value under the first historical behavior feature of the sample user and the first sample feature value under the second historical behavior feature of the sample user are the same as the method for determining the first feature value of the user to be predicted Similar, I won't repeat it here.
需要注意的是,不同区域的样本用户可以相同,也可以不同。例如,用户甲某既在A区域出现过,又在B区域出现过,则该用户甲某既可以是与A区域对应的样本用户,又可以是与B区域对应的样本用户。It should be noted that the sample users in different regions can be the same or different. For example, if user A has appeared in area A and area B, then the user A can be a sample user corresponding to area A or a sample user corresponding to area B.
但是由于甲某在A区域和B区域的区域历史行为信息有所区别,因此在甲某作为A区域和B区域的样本用户时,所生成的与A区域对应的第一样本特征值,和与B区域对应的第一样本特征值不相同。对应的第三样本特征值也不同。However, because the historical behavior information of A in area A and B is different, when A is a sample user in area A and B, the first sample feature value corresponding to area A is generated, and The feature value of the first sample corresponding to the B area is different. The corresponding characteristic value of the third sample is also different.
同时,在该种情况下,可以采用下述方式训练位置预测模型:At the same time, in this case, the position prediction model can be trained in the following way:
基于各个所述待预测样本区域对应的样本用户的第一样本特征值、第二样本特征值以及第三样本特征值,训练所述位置预测模型。仅作为示例,可以以用户的历史行为相关数据(例如,第一特征值、第二特征值以及第三特征值)作为输入,以对应的分析结果作为输出和用户在历史预设时间段内出现的区域作为正确标准(Ground Truth)对模型进行训练。同时可以根据模型预测输出(例如,预测的位置)与正确标准直接的差异反向调整模型参数。当满足某一预设条件时,例如,训练样本达到预定数量、模型的预测正确率大于某一预定正确率阈值,或损失函数(Loss Function)的值小于某一预设值,训练过程 将停止。Training the position prediction model based on the first sample feature value, the second sample feature value, and the third sample feature value of the sample user corresponding to each sample area to be predicted. For example only, the user’s historical behavior related data (for example, the first feature value, the second feature value, and the third feature value) can be used as input, and the corresponding analysis result can be used as the output and the user appears in the historical preset time period. As the correct standard (Ground Truth) to train the model. At the same time, the model parameters can be adjusted inversely according to the direct difference between the predicted output of the model (for example, the predicted position) and the correct standard. When a certain preset condition is met, for example, the training sample reaches a predetermined number, the prediction accuracy of the model is greater than a predetermined accuracy threshold, or the value of the loss function (Loss Function) is less than a preset value, the training process will stop .
本申请实施例还提供一种基于各个样本用户的所述第一样本特征值,以及所述第二样本特征值,训练所述位置预测模型的具体方法,包括:基于各个样本用户的所述第一样本特征值、所述第二样本特征值以及所述第三样本特征值,构建多棵子决策树;基于多棵所述子决策树,确定为所述位置预测模型。The embodiment of the present application also provides a specific method for training the position prediction model based on the first sample feature value and the second sample feature value of each sample user, including: The first sample feature value, the second sample feature value, and the third sample feature value are constructed to construct multiple sub-decision trees; based on the multiple sub-decision trees, it is determined as the position prediction model.
图6示出了根据本申请一些实施例的训练位置预测模型的过程的流程图。过程600可以包括步骤S601~S605。在一些实施例中,过程600可以由处理设备112执行。例如,过程600可以以存储在存储器160中的一组指令(例如,应用程序)实现。服务器110、CPU 220a和/或CPU 240b可以执行该组指令,因此可以指示执行该过程600。在一些实施例中,过程600可以由除了按需服务系统100的其他系统或设备执行。例如,由制造供应商提供的设备或系统执行。Fig. 6 shows a flowchart of a process of training a position prediction model according to some embodiments of the present application. The process 600 may include steps S601 to S605. In some embodiments, the process 600 may be performed by the processing device 112. For example, the process 600 may be implemented with a set of instructions (for example, an application program) stored in the memory 160. The server 110, the CPU 220a, and/or the CPU 240b can execute the set of instructions, and therefore can instruct to execute the process 600. In some embodiments, the process 600 may be performed by other systems or devices other than the on-demand service system 100. For example, implemented by equipment or systems provided by a manufacturing supplier.
S601:从所述历史行为特征、所述目标时间特征、所述目标区域特征中,随机确定多个目标特征;S601: randomly determine multiple target characteristics from the historical behavior characteristics, the target time characteristics, and the target area characteristics;
S602:基于所述样本用户在所述各个所述目标特征下的特征值,构建当前迭代周期的子决策树;S602: Construct a sub-decision tree of the current iteration cycle based on the feature value of the sample user under each of the target features;
S603:基于当前迭代周期的子决策树,以及历史迭代周期的子决策树,构成当前决策树集,并确定当前决策树集的损失;S603: Based on the sub-decision tree of the current iteration period and the sub-decision tree of the historical iteration period, constitute the current decision tree set, and determine the loss of the current decision tree set;
S604:检测当前决策树集的损失是否大于预设损失阈值;如果是,跳转至S601;当前迭代周期完成。如果否,则跳转至S605。S604: Detect whether the loss of the current decision tree set is greater than the preset loss threshold; if so, jump to S601; the current iteration cycle is completed. If not, skip to S605.
S605:将所述当前决策树集确定为所述位置预测模型。S605: Determine the current decision tree set as the position prediction model.
具体地,从历史行为特征、目标时间特征、目标区域特征中,随机确定多个目标特征,根据随机确定的多个目标特征和各个样本用户在对应目标特征下的特征值,以及与该样本用户对应的出现在待预测位置的概率,对位置预测模型进行训练,得到决策树集。Specifically, from historical behavior characteristics, target time characteristics, and target area characteristics, multiple target characteristics are randomly determined, based on the multiple randomly determined target characteristics and the characteristic value of each sample user under the corresponding target characteristic, and the relationship with the sample user The corresponding probability of appearing at the position to be predicted is trained on the position prediction model to obtain the decision tree set.
之后,将多个测试样本用户在所述历史行为特征下的第一测试样本特征值,以及各个所述测试样本用户在待预测时间特征以及所述区域特征下的第二测试样本特征值,输入至所述当前决策树集中,获取与每个所述测试样本用户对应的位置预测结果;基于各个所述测试样本用户对应的位置预测结果,以及对应的实际位置,确定当前决策树集的损失。Afterwards, input the first test sample feature value of multiple test sample users under the historical behavior characteristics, and the second test sample feature value of each test sample user under the to-be-predicted time feature and the regional feature In the current decision tree set, obtain the location prediction result corresponding to each test sample user; determine the loss of the current decision tree set based on the location prediction result corresponding to each test sample user and the corresponding actual location.
根据上述决策树集的损失调整目标特征,并根据新的重新确定的目标特征对选择概率预测模型进行训练,得到当前决策树集。The target feature is adjusted according to the loss of the aforementioned decision tree set, and the selection probability prediction model is trained according to the newly re-determined target feature to obtain the current decision tree set.
之后,基于所述测试样本用户对所述当前决策树集进行验证,确定所述当前决策树集的损失。Afterwards, the user verifies the current decision tree set based on the test sample, and determines the loss of the current decision tree set.
按照上述步骤,不断迭代优化选择概率预测模型,最终通过迭代得到一个包含N个子决策树的决策树集,N为正整数。According to the above steps, iteratively optimize the selection of the probability prediction model, and finally obtain a decision tree set containing N sub-decision trees through iteration, where N is a positive integer.
该位置预测模型在对待预测用户的位置进行预测的时候,是将所述第一特征值、所述第二特征值和所述第三特征值输入至预先训练的位置预测模型中,获取各棵子决策树的子预测结果;When the location prediction model predicts the location of the user to be predicted, it inputs the first feature value, the second feature value, and the third feature value into a pre-trained location prediction model to obtain each Sub-prediction results of the decision tree;
将各棵子决策树的子预测结果进行加权求和,确定所述待预测用户在所述未来时间段的位置预测结果。Perform weighted summation of the sub-prediction results of each sub-decision tree to determine the location prediction result of the user to be predicted in the future time period.
本申请实施例通过获取待预测用户在至少一种历史行为特征下的第一特征值,以及待预测用户在未来预设时间段内,在目标时间特征下的第二特征值和在目标位置特征下的第三特征值;并将第一特征值、第二特征值和所述第三特征值输入至预先训练的位置预测模型中,获取待预测用户在所述未来预设时间段的位置预测结果,能够以更高的准确率来确定待预测用户在未来预设时间段内的位置。The embodiment of the application obtains the first characteristic value of the user to be predicted under at least one historical behavior characteristic, and the second characteristic value of the user to be predicted under the target time characteristic and the target location characteristic within a preset time period in the future. And input the first feature value, the second feature value, and the third feature value into the pre-trained location prediction model to obtain the location prediction of the user to be predicted in the future preset time period As a result, it is possible to determine the position of the user to be predicted in the future preset time period with higher accuracy.
图7A示出了根据本申请一些实施例的用于预测位置的示例性过程的流程图。过程700A可以包括步骤S701~S703。在一些实施例中,过程700A可以由处理设备112执行。例如,过程700A可以以存储在存储器160中的一组指令(例如,应用程序)实现。服务器110、CPU 220a和/或CPU 240b可以执行该组指令,因此可以指示执行该过程700A。FIG. 7A shows a flowchart of an exemplary process for predicting a position according to some embodiments of the present application. The process 700A may include steps S701 to S703. In some embodiments, the process 700A may be performed by the processing device 112. For example, the process 700A may be implemented with a set of instructions (eg, application programs) stored in the memory 160. The server 110, the CPU 220a, and/or the CPU 240b can execute the set of instructions, and therefore can instruct to execute the process 700A.
S701:获取待预测用户在至少一种历史行为特征下的第一特征值,以及所述待预测用户在未来预设时间段内,在目标时间特征下的第二特征值和在目标位置特征下的第三特征值。S701: Obtain the first feature value of the user to be predicted under at least one historical behavior feature, and the second feature value of the user to be predicted under the target time feature and the second feature value under the target location feature within a preset time period in the future The third characteristic value.
其中,S701的实现方式与上述S301的实现方式类似,在此不再赘述。The implementation of S701 is similar to the implementation of S301 described above, and will not be repeated here.
S702:获取所述待预测用户在多个用户属性特征下的第四特征值。S702: Obtain a fourth characteristic value of the user to be predicted under multiple user attribute characteristics.
此处,用户属性特征包括如下一种或者多种:Here, the user attribute characteristics include one or more of the following:
用户的历史订单数量、是否为商务人士、是否为旅游人士、用户家所在的区域、用户公司所在区域。The number of historical orders of the user, whether it is a business person, whether it is a tourist, the area where the user’s home is located, and the area where the user’s company is located.
S703:将所述第一特征值、所述第二特征值、所述第三特征值以及所述第四特征值输入至预先训练的位置预测模型中,获取所述待预测用户在所述未来预设时间段内的位 置预测结果。S703: Input the first feature value, the second feature value, the third feature value, and the fourth feature value into a pre-trained location prediction model, and obtain the user to be predicted in the future The location prediction result within a preset time period.
其中,S703的实现方式与上述S302的实现方式类似。在此不再赘述。The implementation of S703 is similar to the implementation of S302 described above. I won't repeat them here.
需要注意的是,在一些实施例中,在训练位置预测模型时,也可以获取各个所述样本用户在多个用户属性特征下的第四样本特征值;在对位置预测模型进行训练时,所述基于各个样本用户的所述第一样本特征值、所述第二样本特征值、所述第三样本特征值以及所述第四样本特征值,训练所述位置预测模型。It should be noted that in some embodiments, when training the location prediction model, the fourth sample feature value of each of the sample users under multiple user attribute features can also be obtained; when the location prediction model is trained, Training the position prediction model based on the first sample feature value, the second sample feature value, the third sample feature value, and the fourth sample feature value of each sample user.
具体的训练方法与上述S302中的模型训练方法类似,在此不再赘述。The specific training method is similar to the model training method in S302, and will not be repeated here.
在一些实施例中,还可以将待预测用户的历史打车信息、未来预设时间段、以及待预测区域分别与用户属性特征进行处理后一起输入到训练好的位置预测模型,来确定所述待预测用户在未来预设时间段内在所述区域的概率。其中,历史打车信息可以包括出发地、目的地、两者的属性以及时间信息。其中属性可以包括出发地或目的地的所在的城市、区域或POI信息等。所述时间信息可以包括发单时间、出发时间或到达时间等。在一些实施例中,可以将待预测时间以前的若干次历史打车信息输入到位置预测模型中,如待预测时间以前的100次,200次,500次的历史打车信息等。In some embodiments, the historical ride-hailing information of the user to be predicted, the preset time period in the future, and the area to be predicted may be processed separately with user attribute characteristics and then input into the trained location prediction model to determine the Predict the probability that the user will be in the area within a preset time period in the future. Among them, the historical taxi information may include the origin, destination, attributes of both, and time information. The attributes can include the city, area, or POI information of the origin or destination. The time information may include the time of issuing the order, the time of departure or the time of arrival, etc. In some embodiments, several historical ride-hailing information before the time to be predicted may be input into the position prediction model, such as historical ride-hailing information 100 times, 200 times, 500 times before the time to be predicted, and so on.
位置预测模型可以通过训练初始神经网络模型来获得。在一些实施例中,初始神经网络模型可以包括xgboost模型、循环神经网络、长短期记忆网络、k-最近邻算法(K-Nearest Neighbor,kNN)模型、卷积神经网络(Convolutional Neural Networks,CNN)、人工神经网络(Artificial Neural Networks,ANN)模型等,或其任意组合。图7B示出了根据本申请一些实施例的位置预测模型的预测过程的流程图。如图7B所示,可以获取待预测用户的历史打车信息、用户属性特征、未来预设时间段、以及待预测区域,然后输入到位置预测模型中。The position prediction model can be obtained by training the initial neural network model. In some embodiments, the initial neural network model may include xgboost model, recurrent neural network, long short-term memory network, k-nearest neighbor (kNN) model, convolutional neural network (Convolutional Neural Networks, CNN) , Artificial Neural Networks (ANN) models, etc., or any combination thereof. Fig. 7B shows a flowchart of a prediction process of a position prediction model according to some embodiments of the present application. As shown in FIG. 7B, the historical ride-hailing information of the user to be predicted, user attribute characteristics, future preset time period, and the area to be predicted can be obtained, and then input into the location prediction model.
位置预测模型可以将用户属性特征分别与各次历史打车信息进行融合,得到与各次历史打车信息对应的向量表示ht,t=1、2、3…;同时将待测区域、未来预设时间段与用户属性特征处理生成向量q。在一些实施例中,可以利用LSTM网络对用户属性特征分别与各次历史打车信息融合后的信息xt,t=1、2、3…进行处理,得到所述与各次历史打车信息对应的向量表示ht,t=1、2、3…。The location prediction model can fuse user attribute characteristics with each historical taxi information respectively to obtain the vector representation ht corresponding to each historical taxi information, t=1, 2, 3...; meanwhile, the area to be tested and the future preset time Segment and user attribute feature processing generates vector q. In some embodiments, the LSTM network can be used to process the information xt, t=1, 2, 3... after the user attribute characteristics are respectively fused with each historical ride information to obtain the vector corresponding to each historical ride information Represents ht, t=1, 2, 3...
其次,基于attention机制处理向量xt和向量q,获得向量e。在一些实施例中,可以将向量q分别与向量xt,t=1、2、3…,做內积,将各个內积结果拼接成向量e。在一些实施例中,位置预测模型可以将向量e前向传递,经过两层神经网络以及激活函数处理 后得到y,所述y可以反映待预测用户在未来预设时间段内到达待测位置的概率。Second, the vector xt and vector q are processed based on the attention mechanism to obtain the vector e. In some embodiments, the vector q may be combined with the vector xt, t=1, 2, 3... to form an inner product, and each inner product result can be spliced into a vector e. In some embodiments, the position prediction model may forward the vector e, and obtain y after two-layer neural network and activation function processing. The y may reflect the arrival of the user to be predicted at the location to be measured within a preset time period in the future. Probability.
在一些实施例中,还可以对待预测用户所有到达过的区域进行聚类,基于聚类结果中位于各类中心或近中心的多个区域产生候选的判定区域;然后,依次将候选的判定区域作为输入到位置预测模型中的所述待预测区域,连同其他特征进行预测;最后,得到该未来预设时间段待预测用户在各候选判定区域的概率,选取概率最大的区域作为指定未来时间段待预测用户可能出现的位置。In some embodiments, it is also possible to cluster all the regions visited by the user to be predicted, and generate candidate determination regions based on the clustering results at various centers or multiple regions near the center; then, the candidate determination regions are sequentially As the area to be predicted as input to the location prediction model, it is predicted together with other features; finally, the probability of the user to be predicted in each candidate determination area in the future preset time period is obtained, and the area with the highest probability is selected as the designated future time period To be predicted where the user may appear.
在一些实施例中,可以获取训练数据以对初始神经网络模型进行训练以获得位置预测模型。其中训练数据可以包括多个样本用户的历史打车信息、目标时间段、以及目标区域、用户属性特征,及标签数据。其中标签数据反映样本用户是否在其对应的目标时间段到达了目标区域,仅作为示例,若样本用户是否在其对应的目标时间段到达了目标区域,标签可以为1,反之,为0。对于不同的样本用户,其对应的目标时间与目标区域可以不同。In some embodiments, training data may be obtained to train the initial neural network model to obtain a position prediction model. The training data may include historical ride-hailing information, target time period, target area, user attribute characteristics, and label data of multiple sample users. The label data reflects whether the sample user has reached the target area during the corresponding target time period. For example, if the sample user has reached the target area during the corresponding target time period, the label can be 1, otherwise, it is 0. For different sample users, the corresponding target time and target area may be different.
将训练数据中样本用户的历史打车信息、目标时间段、以及目标区域、用户属性特征输入到初始神经网络模型中,得到模型的预测结果。在一些实施例中,模型的预测结果可以是一个概率值,其反映样本用户在目标时间段内去到目标区域的预测概率。可以基于训练数据的标签数据与模型预测结果的差异构造损失函数,并基于损失函数调节模型的参数,当损失函数的值小于某一预设值,训练过程将停止。否则更新初始神经网络模型的参数,重新计算,直到损失函数的值小于该预设值为止。Input the historical taxi information, target time period, target area, and user attribute characteristics of the sample users in the training data into the initial neural network model to obtain the prediction result of the model. In some embodiments, the prediction result of the model may be a probability value, which reflects the predicted probability of the sample user going to the target area within the target time period. The loss function can be constructed based on the difference between the label data of the training data and the model prediction result, and the parameters of the model can be adjusted based on the loss function. When the value of the loss function is less than a certain preset value, the training process will stop. Otherwise, update the parameters of the initial neural network model and recalculate until the value of the loss function is less than the preset value.
在一些实施例中,根据用户在在线运输服务平台的上的行为的更新,可以周期性地或实时更新位置预测端到端模型。例如,当平台内某些用户完成新的订单后,按需服务系统100可以基于新的完单行为更新用户的历史打车信息,然后基于更新后的历史打车信息生成新的训练样本,进而更新位置预测端到端模型。In some embodiments, the position prediction end-to-end model can be updated periodically or in real time according to the update of the user's behavior on the online transportation service platform. For example, when some users in the platform complete a new order, the on-demand service system 100 can update the user's historical taxi information based on the new order completion behavior, and then generate new training samples based on the updated historical taxi information, and then update the location Forecast end-to-end models.
图8示出了根据本申请一些实施例的位置预测装置的示例性框图。如图8所示,所示位置预测装置可以包括获取模块81、预测模块82和训练模块83。Fig. 8 shows an exemplary block diagram of a position prediction apparatus according to some embodiments of the present application. As shown in FIG. 8, the position prediction device may include an acquisition module 81, a prediction module 82 and a training module 83.
获取模块81,用于获取待预测用户在至少一种历史行为特征下的第一特征值,以及所述待预测用户在未来预设时间段内,在目标时间特征下的第二特征值和在目标位置特征下的第三特征值;The acquiring module 81 is configured to acquire the first characteristic value of the user to be predicted under at least one historical behavior characteristic, and the second characteristic value of the user to be predicted under the target time characteristic and the second characteristic value of the user to be predicted in the future preset time period. The third feature value under the target location feature;
预测模块82,用于将所述第一特征值、所述第二特征值和所述第三特征值输入至预先训练的位置预测模型中,获取所述待预测用户在所述未来预设时间段的位置预测结果。The prediction module 82 is configured to input the first feature value, the second feature value, and the third feature value into a pre-trained position prediction model to obtain the preset time of the user to be predicted in the future The position prediction result of the segment.
在一些实施例中,所述获取模块81,用于采用下述方式获取待预测用户在至少一种历史行为特征下的第一特征值:In some embodiments, the acquiring module 81 is configured to acquire the first characteristic value of the user to be predicted under at least one historical behavior characteristic in the following manner:
从待预测用户的至少一种历史行为信息中,提取所述待预测用户在所述区域的区域历史行为信息,以及提取所述待预测用户在多个历史时间段中每个历史时间段内,在所述区域的区域时间历史行为信息;Extracting regional historical behavior information of the user to be predicted in the region from at least one type of historical behavior information of the user to be predicted, and extracting the user to be predicted in each historical time period among multiple historical time periods, Regional time historical behavior information in the area;
根据所述待预测用户的所述至少一种历史行为信息、所述区域历史行为信息以及所述区域时间历史行为信息,确定所述待预测用户的第一历史行为特征下的第一特征值;Determine the first feature value under the first historical behavior feature of the user to be predicted according to the at least one type of historical behavior information of the user to be predicted, the regional historical behavior information, and the regional time historical behavior information;
根据所述待预测用户的所述至少一种历史行为信息,确定所述待预测用户的第二历史行为特征下的第一特征值。According to the at least one type of historical behavior information of the user to be predicted, the first feature value under the second historical behavior feature of the user to be predicted is determined.
在一些实施例中,所述第一历史行为特征包括如下一种或者多种:In some embodiments, the first historical behavior feature includes one or more of the following:
与多个兴趣点POI分类分别对应的历史行为的发生次数、历史行为在预设时间段内每天的发生次数、历史行为在工作日的发生次数、历史行为在非工作日的发生次数、不同历史行为分别对应的发生次数。The number of occurrences of historical behaviors corresponding to multiple POI classifications of points of interest, the number of occurrences of historical behaviors per day within a preset time period, the number of occurrences of historical behaviors on working days, the number of occurrences of historical behaviors on non-working days, and different histories The number of occurrences corresponding to each behavior.
在一些实施例中,所述第二历史行为特征包括以下一种或者多种:In some embodiments, the second historical behavior feature includes one or more of the following:
最后一次历史行为发生的区域是否为所述区域、最后一次历史行为发生的时间与所述待预测时间之间的时间间隔、最后一次历史行为的目的地的POI分类、最后一次历史行为的出发地的POI分类、所述待预测用户到达过的区域数量。Whether the region where the last historical behavior occurred is the region, the time interval between the time when the last historical behavior occurred and the time to be predicted, the POI classification of the destination of the last historical behavior, and the departure place of the last historical behavior The POI classification of the to-be-predicted users and the number of areas visited by the user.
在一些实施例中,所述历史行为信息包括:历史冒泡行为信息、历史发单行为信息以及历史完单行为信息中一种或者多种。In some embodiments, the historical behavior information includes one or more of historical bubbling behavior information, historical billing behavior information, and historical billing behavior information.
在一些实施例中,所述获取模块81,还用于获取所述待预测用户在多个用户属性特征下的第四特征值;In some embodiments, the acquiring module 81 is further configured to acquire the fourth characteristic value of the user to be predicted under multiple user attribute characteristics;
所述预测模块82,用于采用下述方式获取所述待预测用户在所述未来预设时间段的位置预测结果:The prediction module 82 is configured to obtain the position prediction result of the user to be predicted in the future preset time period in the following manner:
所述将所述第一特征值、所述第二特征值、所述第三特征值以及所述第四特征值输入至预先训练的位置预测模型中,获取所述待预测用户在所述未来预设时间段内的位置预测结果。The inputting the first feature value, the second feature value, the third feature value, and the fourth feature value into a pre-trained position prediction model, and obtaining the user to be predicted in the future The location prediction result within a preset time period.
在一些实施例中,所述用户属性特征包括如下一种或者多种:订单数量、是否为商务人士、是否为旅游人士、用户家所在的区域、用户公司所在区域。In some embodiments, the user attribute characteristics include one or more of the following: the number of orders, whether it is a business person, whether it is a tourist, the area where the user's home is located, and the area where the user's company is located.
训练模块83,用于采用下述装置训练所述位置预测模型:The training module 83 is used to train the position prediction model using the following device:
获取多个样本用户在至少一种历史行为特征下的第一样本特征值,以及各个所述样本用户在历史预设时间段内,在目标时间特征下的第二样本特征值和在目标位置特征下的第三样本特征值;所述样本用户包括正样本用户以及负样本用户;Acquire the first sample feature value of a plurality of sample users under at least one historical behavior feature, and the second sample feature value of each sample user under the target time feature and the target location within the historical preset time period The third sample feature value under the feature; the sample users include positive sample users and negative sample users;
基于各个样本用户的所述第一样本特征值、所述第二样本特征值和所述第三样本特征值,训练所述位置预测模型。Training the location prediction model based on the first sample feature value, the second sample feature value, and the third sample feature value of each sample user.
在一些实施例中,所述训练模块83,用于下述方式获取多个样本用户在至少一种历史行为特征下的第一样本特征值,以及各个所述样本用户在历史预设时间段内,在目标时间特征下的第二样本特征值和在目标位置特征下的第三样本特征值:In some embodiments, the training module 83 is configured to obtain the first sample feature value of a plurality of sample users under at least one historical behavior characteristic in the following manner, and the historical preset time period of each sample user , The second sample feature value under the target time feature and the third sample feature value under the target location feature:
针对多个待预测样本区域中的每个待预测样本区域,获取与该待预测样本区域对应的样本用户在在至少一种历史行为特征下的第一样本特征值,以及各个所述样本用户在历史预设时间段内,在目标时间特征下的第二样本特征值和在目标位置特征下的第三样本特征值;For each sample region to be predicted in the plurality of sample regions to be predicted, obtain the first sample feature value of the sample user corresponding to the sample region to be predicted under at least one historical behavior characteristic, and each of the sample users In the historical preset time period, the second sample feature value under the target time feature and the third sample feature value under the target location feature;
所述训练模块83,用于下述方式基于各个样本用户的所述第一样本特征值、所述第二样本特征值和所述第三样本特征值,训练所述位置预测模型:The training module 83 is configured to train the position prediction model based on the first sample feature value, the second sample feature value, and the third sample feature value of each sample user in the following manner:
基于各个所述待预测样本区域对应的样本用户的第一样本特征值、第二样本特征值以及第三样本特征值,训练所述位置预测模型。Training the position prediction model based on the first sample feature value, the second sample feature value, and the third sample feature value of the sample user corresponding to each sample area to be predicted.
在一些实施例中,所述训练模块83,用于下述方式基于各个样本用户的所述第一样本特征值,以及所述第二样本特征值,训练所述位置预测模型:In some embodiments, the training module 83 is configured to train the position prediction model based on the first sample feature value and the second sample feature value of each sample user in the following manner:
基于各个样本用户的所述第一样本特征值、所述第二样本特征值以及所述第三样本特征值,构建多棵子决策树;Constructing multiple sub-decision trees based on the first sample feature value, the second sample feature value, and the third sample feature value of each sample user;
将多棵所述子决策树确定为所述位置预测模型。A plurality of the sub-decision trees are determined as the position prediction model.
在一些实施例中,所述训练模块83,用于下述方式所述基于各个样本用户的所述第一样本特征值、所述第二样本特征值以及所述第三样本特征值,构建多棵子决策树:In some embodiments, the training module 83 is configured to construct the first sample feature value, the second sample feature value, and the third sample feature value of each sample user in the following manner Multiple sub-decision trees:
从所述历史行为特征、所述目标时间特征、所述目标区域特征中,随机确定多个目标特征;Randomly determining multiple target characteristics from the historical behavior characteristics, the target time characteristics, and the target area characteristics;
基于所述样本用户在所述各个所述目标特征下的特征值,构建当前迭代周期的子决策树;Constructing a sub-decision tree of the current iteration cycle based on the feature value of the sample user under each of the target features;
基于当前迭代周期的子决策树,以及历史迭代周期的子决策树,构成当前决策树集,并确定当前决策树集的损失;Based on the sub-decision tree of the current iteration period and the sub-decision tree of the historical iteration period, constitute the current decision tree set, and determine the loss of the current decision tree set;
在所述损失大于预设损失阈值的情况下,完成当前迭代周期,并返回至所述历史行为特征、所述目标时间特征、所述目标区域特征以及所述用户属性特征中,随机确定多个目标特征的步骤;In the case that the loss is greater than the preset loss threshold, complete the current iteration cycle and return to the historical behavior characteristics, the target time characteristics, the target area characteristics, and the user attribute characteristics. Steps of target characteristics;
在所述损失不大于预设损失阈值的情况下,将所述当前决策树集确定为所述位置预测模型。In a case where the loss is not greater than a preset loss threshold, the current decision tree set is determined as the position prediction model.
在一些实施例中,所述训练模块83,用于下述方式确定当前决策树集的损失:In some embodiments, the training module 83 is used to determine the loss of the current decision tree set in the following manner:
将多个测试样本用户在所述历史行为特征下的第一测试样本特征值,以及各个所述测试样本用户在待预测时间特征以及所述区域特征下的第二测试样本特征值,输入至所述当前决策树集中,获取与每个所述测试样本用户对应的位置预测结果;The first test sample feature value of multiple test sample users under the historical behavior characteristics, and the second test sample feature value of each test sample user under the to-be-predicted time feature and the regional feature are input to all In the current decision tree set, obtain a location prediction result corresponding to each of the test sample users;
基于各个所述测试样本用户对应的位置预测结果,以及对应的实际位置,确定当前决策树集的损失。Based on the location prediction results corresponding to each of the test sample users and the corresponding actual locations, the loss of the current decision tree set is determined.
在一些实施例中,所述训练模块83,还用于获取各个所述样本用户在多个用户属性特征下的第四样本特征值;In some embodiments, the training module 83 is further configured to obtain the fourth sample feature value of each sample user under multiple user attribute features;
所述训练模块83,用于下述方式基于各个样本用户的所述第一样本特征值、所述第二样本特征值和所述第三样本特征值,训练所述位置预测模型:The training module 83 is configured to train the position prediction model based on the first sample feature value, the second sample feature value, and the third sample feature value of each sample user in the following manner:
所述基于各个样本用户的所述第一样本特征值、所述第二样本特征值、所述第三样本特征值以及所述第四样本特征值,训练所述位置预测模型。Training the position prediction model based on the first sample feature value, the second sample feature value, the third sample feature value, and the fourth sample feature value of each sample user.
在一些实施例中,所述预测模块82,用于采用下述方式将所述第一特征值、所述第二特征值和所述第三特征值输入至预先训练的位置预测模型中,获取所述待预测用户在所述未来预设时间段的位置预测结果:In some embodiments, the prediction module 82 is configured to input the first feature value, the second feature value, and the third feature value into a pre-trained position prediction model in the following manner to obtain The location prediction result of the user to be predicted in the future preset time period:
将所述第一特征值、所述第二特征值和所述第三特征值输入至预先训练的位置预测模型中,获取各棵子决策树的子预测结果;Inputting the first feature value, the second feature value, and the third feature value into a pre-trained position prediction model to obtain sub-prediction results of each sub-decision tree;
将各棵子决策树的子预测结果进行加权求和,确定所述待预测用户在所述未来时间段的位置预测结果。Perform weighted summation of the sub-prediction results of each sub-decision tree to determine the location prediction result of the user to be predicted in the future time period.
上述模块可以经由有线连接或无线连接彼此连接或通信。有线连接可以包括金属线缆、光缆、混合线缆等,或其任意组合。无线连接可以包括通过LAN、WAN、蓝牙、ZigBee、或NFC等形式的连接,或其任意组合。两个或更多个模块可以组合为单个模块,并且任何一个模块可以分成两个或更多个单元。The aforementioned modules may be connected or communicate with each other via a wired connection or a wireless connection. Wired connections may include metal cables, optical cables, hybrid cables, etc., or any combination thereof. The wireless connection may include a connection in the form of LAN, WAN, Bluetooth, ZigBee, or NFC, or any combination thereof. Two or more modules can be combined into a single module, and any one module can be divided into two or more units.
如图2A所示,本申请实施例还提供一种电子设备,包括:总线210a、处理器 220a、通讯端口250a、输入/输出260a以及存储介质(例如,磁盘270a、只读存储器(ROM)230a,或随机存储器(RAM)240a),所述存储介质存储有所述处理器220a可执行的机器可读指令,当电子设备运行时,所述处理器220a与所述存储介质之间通过总线通信210a,所述处理器220a执行所述机器可读指令,以执行时执行如本申请实施例提供的位置预测方法的步骤。As shown in FIG. 2A, an embodiment of the present application also provides an electronic device, including: a bus 210a, a processor 220a, a communication port 250a, an input/output 260a, and a storage medium (for example, a magnetic disk 270a, a read-only memory (ROM) 230a , Or random access memory (RAM) 240a), the storage medium stores machine-readable instructions executable by the processor 220a, and when the electronic device is running, the processor 220a communicates with the storage medium through a bus 210a. The processor 220a executes the machine-readable instructions to execute the steps of the position prediction method provided in the embodiments of the present application when executed.
本申请实施例还提供一种计算机可读存储介质,该计算机可读存储介质上存储有计算机程序,该计算机程序被处理器运行时执行如本申请实施例提供的位置预测方法的步骤。The embodiment of the present application also provides a computer-readable storage medium on which a computer program is stored, and the computer program executes the steps of the position prediction method provided in the embodiment of the present application when the computer program is run by a processor.
所属领域的技术人员可以清楚地了解到,为描述的方便和简洁,上述描述的系统和装置的具体工作过程,可以参考方法实施例中的对应过程,本申请中不再赘述。在本申请所提供的几个实施例中,应该理解到,所揭露的系统、装置和方法,可以通过其它的方式实现。以上所描述的装置实施例仅仅是示意性的,例如,所述模块的划分,仅仅为一种逻辑功能划分,实际实现时可以有另外的划分方式,又例如,多个模块或组件可以结合或者可以集成到另一个系统,或一些特征可以忽略,或不执行。另一点,所显示或讨论的相互之间的耦合或直接耦合或通信连接可以是通过一些通信接口,装置或模块的间接耦合或通信连接,可以是电性,机械或其它的形式。Those skilled in the art can clearly understand that, for the convenience and conciseness of description, the specific working process of the system and device described above can refer to the corresponding process in the method embodiment, which will not be repeated in this application. In the several embodiments provided in this application, it should be understood that the disclosed system, device, and method may be implemented in other ways. The device embodiments described above are merely illustrative. For example, the division of the modules is only a logical function division, and there may be other divisions in actual implementation. For example, multiple modules or components may be combined or It can be integrated into another system, or some features can be ignored or not implemented. In addition, the displayed or discussed mutual coupling or direct coupling or communication connection may be indirect coupling or communication connection through some communication interfaces, devices or modules, and may be in electrical, mechanical or other forms.
所述作为分离部件说明的模块可以是或者也可以不是物理上分开的,作为模块显示的部件可以是或者也可以不是物理单元,即可以位于一个地方,或者也可以分布到多个网络单元上。可以根据实际的需要选择其中的部分或者全部单元来实现本实施例方案的目的。The modules described as separate components may or may not be physically separated, and the components displayed as modules may or may not be physical units, that is, they may be located in one place, or they may be distributed on multiple network units. Some or all of the units may be selected according to actual needs to achieve the objectives of the solutions of the embodiments.
另外,在本申请各个实施例中的各功能单元可以集成在一个处理单元中,也可以是各个单元单独物理存在,也可以两个或两个以上单元集成在一个单元中。In addition, the functional units in the various embodiments of the present application may be integrated into one processing unit, or each unit may exist alone physically, or two or more units may be integrated into one unit.
所述功能如果以软件功能单元的形式实现并作为独立的产品销售或使用时,可以存储在一个处理器可执行的非易失的计算机可读取存储介质中。基于这样的理解,本申请的技术方案本质上或者说对现有技术做出贡献的部分或者该技术方案的部分可以以软件产品的形式体现出来,该计算机软件产品存储在一个存储介质中,包括若干指令用以使得一台计算机设备(可以是个人计算机,服务器,或者网络设备等)执行本申请各个实施例所述方法的全部或部分步骤。而前述的存储介质包括:U盘、移动硬盘、ROM、RAM、磁碟或者光盘等各种可以存储程序代码的介质。If the function is implemented in the form of a software functional unit and sold or used as an independent product, it can be stored in a nonvolatile computer readable storage medium executable by a processor. Based on this understanding, the technical solution of this application essentially or the part that contributes to the existing technology or the part of the technical solution can be embodied in the form of a software product, and the computer software product is stored in a storage medium, including Several instructions are used to make a computer device (which may be a personal computer, a server, or a network device, etc.) execute all or part of the steps of the method described in each embodiment of the present application. The aforementioned storage media include: U disk, mobile hard disk, ROM, RAM, magnetic disk or optical disk and other media that can store program codes.
以上仅为本申请的具体实施方式,但本申请的保护范围并不局限于此,任何熟悉本技术领域的技术人员在本申请揭露的技术范围内,可轻易想到变化或替换,都应涵盖在本申请的保护范围之内。因此,本申请的保护范围应以权利要求的保护范围为准。The above are only specific implementations of this application, but the protection scope of this application is not limited to this. Any person skilled in the art can easily think of changes or substitutions within the technical scope disclosed in this application, and they should all be covered Within the scope of protection of this application. Therefore, the protection scope of this application should be subject to the protection scope of the claims.

Claims (25)

  1. 一种位置预测方法,其特征在于,该方法包括:A position prediction method, characterized in that the method includes:
    获取与待预测用户历史行为相关联的特征,所述特征至少包括与待预测用户历史行为相关的位置信息;Acquiring a feature associated with the historical behavior of the user to be predicted, the feature including at least location information related to the historical behavior of the user to be predicted;
    至少基于与待预测用户历史行为相关联的特征,通过预先训练的位置预测模型获取所述待预测用户在所述未来预设时间段的位置预测结果。At least based on the features associated with the historical behavior of the user to be predicted, the location prediction result of the user to be predicted in the future preset time period is obtained through a pre-trained location prediction model.
  2. 根据权利要求1所述的方法,其特征在于,与待预测用户历史行为相关联的特征包括待预测用户在至少一种历史行为特征下的第一特征值,以及所述待预测用户在未来预设时间段内,在目标时间特征下的第二特征值和在目标位置特征下的第三特征值。The method according to claim 1, wherein the features associated with the historical behavior of the user to be predicted include a first feature value of the user to be predicted under at least one historical behavior feature, and the future prediction of the user to be predicted Set the second feature value under the target time feature and the third feature value under the target location feature within the time period.
  3. 根据权利要求2所述的方法,其特征在于,所述待预测用户在所述未来预设时间段的位置预测结果包括待预测用户在未来预设时间段内去到待预测区域的概率;The method according to claim 2, wherein the location prediction result of the user to be predicted in the future preset time period includes the probability that the user to be predicted will go to the area to be predicted in the future preset time period;
    所述第一特征反映所述至少一种历史行为在不同区域的分布信息、所述至少一种历史行为在时间上的分布信息、至少一种历史行为与所述未来预设时间段的关联信息、至少一种历史行为与所述待预测区域的关联信息中的一种或多种的组合;The first feature reflects the distribution information of the at least one historical behavior in different regions, the distribution information of the at least one historical behavior in time, and the association information between the at least one historical behavior and the future preset time period , At least one historical behavior and one or more combinations of the associated information of the area to be predicted;
    所述第二特征反映所述未来预设时间段的属性信息;The second feature reflects the attribute information of the future preset time period;
    所述第三特征反映所述待预测区域的属性信息。The third feature reflects the attribute information of the region to be predicted.
  4. 根据权利要求2所述的方法,其特征在于,所述获取待预测用户在至少一种历史行为特征下的第一特征值,包括:The method according to claim 2, wherein said obtaining the first characteristic value of the user to be predicted under at least one historical behavior characteristic comprises:
    从待预测用户的至少一种历史行为信息中,提取所述待预测用户在待预测区域的区域历史行为信息,以及提取所述待预测用户在多个历史时间段中每个历史时间段内,在待预测区域的区域时间历史行为信息;From at least one type of historical behavior information of the user to be predicted, extracting the regional historical behavior information of the user to be predicted in the area to be predicted, and extracting the user to be predicted in each of the multiple historical time periods, Regional time historical behavior information in the area to be predicted;
    根据所述待预测用户的所述至少一种历史行为信息、所述区域历史行为信息以及所述区域时间历史行为信息,确定所述待预测用户的第一历史行为特征下的第一特征值;Determine the first characteristic value under the first historical behavior characteristic of the user to be predicted according to the at least one type of historical behavior information of the user to be predicted, the regional historical behavior information, and the regional time historical behavior information;
    根据所述待预测用户的所述至少一种历史行为信息,确定所述待预测用户的第二历史行为特征下的第一特征值。According to the at least one type of historical behavior information of the user to be predicted, the first feature value under the second historical behavior feature of the user to be predicted is determined.
  5. 根据权利要求4所述的方法,其特征在于,所述第一历史行为特征包括如下一种或者多种:The method according to claim 4, wherein the first historical behavior characteristic includes one or more of the following:
    与多个兴趣点POI分类分别对应的历史行为的发生次数、历史行为在预设时间段内每天的发生次数、历史行为在工作日的发生次数、历史行为在非工作日的发生次数、不同历史行为分别对应的发生次数。The number of occurrences of historical behaviors corresponding to multiple POI classifications of points of interest, the number of occurrences of historical behaviors per day within a preset time period, the number of occurrences of historical behaviors on working days, the number of occurrences of historical behaviors on non-working days, and different histories The number of occurrences corresponding to each behavior.
  6. 根据权利要求4所述的方法,其特征在于,所述第二历史行为特征包括以下一种或者多种:The method according to claim 4, wherein the second historical behavior characteristic comprises one or more of the following:
    最后一次历史行为发生的区域是否为待预测区域、最后一次历史行为发生的时间与所述待预测时间之间的时间间隔、最后一次历史行为的目的地的POI分类、最后一次历史行为的出发地的POI分类、所述待预测用户到达过的区域数量。Whether the area where the last historical behavior occurred is the area to be predicted, the time interval between the time when the last historical behavior occurred and the time to be predicted, the POI classification of the destination of the last historical behavior, and the departure place of the last historical behavior The POI classification of the to-be-predicted users and the number of areas visited by the user.
  7. 根据权利要求4所述的方法,其特征在于,所述历史行为信息包括:历史冒泡行为信息、历史发单行为信息以及历史完单行为信息中一种或者多种。The method according to claim 4, wherein the historical behavior information includes one or more of: historical bubbling behavior information, historical billing behavior information, and historical billing behavior information.
  8. 根据权利要求1所述的方法,其特征在于,该方法还包括:The method of claim 1, wherein the method further comprises:
    获取所述待预测用户的用户属性特征;Acquiring the user attribute characteristics of the user to be predicted;
    所述至少基于与待预测用户历史行为相关联的特征,通过预先训练的位置预测模型获取所述待预测用户在所述未来预设时间段的位置预测结果,包括:The obtaining the location prediction result of the user to be predicted in the future preset time period through a pre-trained location prediction model based at least on the features associated with the historical behavior of the user to be predicted includes:
    基于与待预测用户历史行为相关联的特征以及所述待预测用户的用户属性特征,通过预先训练的位置预测模型获取所述待预测用户在所述未来预设时间段的位置预测结果。Based on the features associated with the historical behavior of the user to be predicted and the user attribute characteristics of the user to be predicted, the location prediction result of the user to be predicted in the future preset time period is obtained through a pre-trained location prediction model.
  9. 根据权利要求8所述的方法,其特征在于,所述用户属性特征包括如下一种或者多种:The method according to claim 8, wherein the user attribute characteristics include one or more of the following:
    订单数量、是否为商务人士、是否为旅游人士、用户家所在的区域、用户公司所在区域。The number of orders, whether it is a business person, whether it is a tourist, the area where the user’s home is located, and the area where the user’s company is located.
  10. 根据权利要求2所述的方法,其特征在于,采用下述方法训练所述位置预测模型:The method according to claim 2, wherein the following method is used to train the position prediction model:
    获取多个样本用户在至少一种历史行为特征下的第一样本特征值,以及各个所述样本 用户在历史预设时间段内,在目标时间特征下的第二样本特征值和在目标位置特征下的第三样本特征值;所述样本用户包括正样本用户以及负样本用户;Acquire the first sample feature value of a plurality of sample users under at least one historical behavior feature, and the second sample feature value of each sample user under the target time feature and the target location within the historical preset time period The third sample feature value under the feature; the sample users include positive sample users and negative sample users;
    基于各个样本用户的所述第一样本特征值、所述第二样本特征值和所述第三样本特征值,训练所述位置预测模型。Training the location prediction model based on the first sample feature value, the second sample feature value, and the third sample feature value of each sample user.
  11. 根据权利要求10所述的方法,其特征在于,所述获取多个样本用户在至少一种历史行为特征下的第一样本特征值,以及各个所述样本用户在历史预设时间段内,在目标时间特征下的第二样本特征值和在目标位置特征下的第三样本特征值,包括:The method according to claim 10, wherein said acquiring a first sample feature value of a plurality of sample users under at least one historical behavior characteristic, and each of the sample users is within a historical preset time period, The second sample feature value under the target time feature and the third sample feature value under the target location feature include:
    针对多个待预测样本区域中的每个待预测样本区域,获取与该待预测样本区域对应的样本用户在在至少一种历史行为特征下的第一样本特征值,以及各个所述样本用户在历史预设时间段内,在目标时间特征下的第二样本特征值和在目标位置特征下的第三样本特征值;For each sample region to be predicted in the plurality of sample regions to be predicted, obtain the first sample feature value of the sample user corresponding to the sample region to be predicted under at least one historical behavior characteristic, and each of the sample users In the historical preset time period, the second sample feature value under the target time feature and the third sample feature value under the target location feature;
    所述基于各个样本用户的所述第一样本特征值、所述第二样本特征值和所述第三样本特征值,训练所述位置预测模型,包括:The training the position prediction model based on the first sample feature value, the second sample feature value, and the third sample feature value of each sample user includes:
    基于各个所述待预测样本区域对应的样本用户的第一样本特征值、第二样本特征值以及第三样本特征值,训练所述位置预测模型。Training the position prediction model based on the first sample feature value, the second sample feature value, and the third sample feature value of the sample user corresponding to each sample area to be predicted.
  12. 根据权利要求10所述的方法,其特征在于,所述基于各个样本用户的所述第一样本特征值,以及所述第二样本特征值,训练所述位置预测模型,包括:The method according to claim 10, wherein the training of the location prediction model based on the first sample feature value and the second sample feature value of each sample user comprises:
    基于各个样本用户的所述第一样本特征值、所述第二样本特征值以及所述第三样本特征值,构建多棵子决策树;Constructing multiple sub-decision trees based on the first sample feature value, the second sample feature value, and the third sample feature value of each sample user;
    将多棵所述子决策树确定为所述位置预测模型。A plurality of the sub-decision trees are determined as the position prediction model.
  13. 根据权利要求12所述的方法,其特征在于,基于所述基于各个样本用户的所述第一样本特征值、所述第二样本特征值以及所述第三样本特征值,构建多棵子决策树,包括:The method according to claim 12, wherein a plurality of sub-decisions are constructed based on the first sample feature value, the second sample feature value, and the third sample feature value of each sample user Tree, including:
    从所述历史行为特征、所述目标时间特征、所述目标区域特征中,随机确定多个目标特征;Randomly determining multiple target characteristics from the historical behavior characteristics, the target time characteristics, and the target area characteristics;
    基于所述样本用户在所述各个所述目标特征下的特征值,构建当前迭代周期的子决策 树;Constructing a sub-decision tree of the current iteration cycle based on the feature value of the sample user under each of the target features;
    基于当前迭代周期的子决策树,以及历史迭代周期的子决策树,构成当前决策树集,并确定当前决策树集的损失;Based on the sub-decision tree of the current iteration period and the sub-decision tree of the historical iteration period, constitute the current decision tree set, and determine the loss of the current decision tree set;
    在所述损失大于预设损失阈值的情况下,完成当前迭代周期,并返回至所述历史行为特征、所述目标时间特征、所述目标区域特征以及所述用户属性特征中,随机确定多个目标特征的步骤;In the case that the loss is greater than the preset loss threshold, complete the current iteration cycle and return to the historical behavior characteristics, the target time characteristics, the target area characteristics, and the user attribute characteristics. Steps of target characteristics;
    在所述损失不大于预设损失阈值的情况下,将所述当前决策树集确定为所述位置预测模型。In a case where the loss is not greater than a preset loss threshold, the current decision tree set is determined as the position prediction model.
  14. 根据权利要求13所述的方法,其特征在于,所述确定当前决策树集的损失,包括:The method according to claim 13, wherein the determining the loss of the current decision tree set comprises:
    将多个测试样本用户在所述历史行为特征下的第一测试样本特征值,以及各个所述测试样本用户在待预测时间特征以及所述待预测区域特征下的第二测试样本特征值,输入至所述当前决策树集中,获取与每个所述测试样本用户对应的位置预测结果;Input the first test sample feature value of multiple test sample users under the historical behavior characteristics, and the second test sample feature value of each test sample user under the to-be-predicted time feature and the to-be-predicted area feature Go to the current decision tree set and obtain a location prediction result corresponding to each of the test sample users;
    基于各个所述测试样本用户对应的位置预测结果,以及对应的实际位置,确定当前决策树集的损失。Based on the location prediction results corresponding to each of the test sample users and the corresponding actual locations, the loss of the current decision tree set is determined.
  15. 根据权利10所述的方法,其特征在于,该方法还包括:获取各个所述样本用户在多个用户属性特征下的第四样本特征值;The method according to claim 10, wherein the method further comprises: obtaining a fourth sample feature value of each of the sample users under multiple user attribute characteristics;
    所述基于各个样本用户的所述第一样本特征值、所述第二样本特征值和所述第三样本特征值,训练所述位置预测模型,包括:The training the position prediction model based on the first sample feature value, the second sample feature value, and the third sample feature value of each sample user includes:
    所述基于各个样本用户的所述第一样本特征值、所述第二样本特征值、所述第三样本特征值以及所述第四样本特征值,训练所述位置预测模型。Training the position prediction model based on the first sample feature value, the second sample feature value, the third sample feature value, and the fourth sample feature value of each sample user.
  16. 根据权利要求12所述的方法,其特征在于,将所述第一特征值、所述第二特征值和所述第三特征值输入至预先训练的位置预测模型中,获取所述待预测用户在所述未来预设时间段的位置预测结果,包括:The method according to claim 12, wherein the first feature value, the second feature value, and the third feature value are input into a pre-trained location prediction model to obtain the user to be predicted The location prediction results in the future preset time period include:
    将所述第一特征值、所述第二特征值和所述第三特征值输入至预先训练的位置预测模型中,获取各棵子决策树的子预测结果;Inputting the first feature value, the second feature value, and the third feature value into a pre-trained position prediction model to obtain sub-prediction results of each sub-decision tree;
    将各棵子决策树的子预测结果进行加权求和,确定所述待预测用户在所述未来时间段的位置预测结果。Perform weighted summation of the sub-prediction results of each sub-decision tree to determine the location prediction result of the user to be predicted in the future time period.
  17. 根据权利要求1所述的方法,其特征在于,与待预测用户历史行为相关联的特征反映待预测用户的一次或以上的历史打车信息。The method according to claim 1, wherein the feature associated with the historical behavior of the user to be predicted reflects one or more historical ride-hailing information of the user to be predicted.
  18. 根据权利要求17所述的方法,其特征在于,所述历史打车信息包括以下信息中的一种或多种:The method according to claim 17, wherein the historical ride-hailing information includes one or more of the following information:
    出发地、目的地、出发地属性、目的地属性以及时间信息。Origin, destination, origin attribute, destination attribute, and time information.
  19. 根据权利要求17所述的方法,其特征在于,还包括获取所述待预测用户的用户属性特征,以及待预测区域;The method according to claim 17, further comprising obtaining the user attribute characteristics of the user to be predicted and the area to be predicted;
    所述待预测用户在所述未来预设时间段的位置预测结果包括待预测用户在未来预设时间段内去到待预测区域的概率;The location prediction result of the user to be predicted in the future preset time period includes the probability that the user to be predicted will go to the area to be predicted in the future preset time period;
    所述至少基于与待预测用户历史行为相关联的特征,通过预先训练的位置预测模型获取所述待预测用户在所述未来预设时间段的位置预测结果,包括:The obtaining the location prediction result of the user to be predicted in the future preset time period through a pre-trained location prediction model based at least on the features associated with the historical behavior of the user to be predicted includes:
    基于与待预测用户历史行为相关联的特征、所述待预测用户的用户属性特征、所述未来预设时间段以及所述待预测区域,通过预先训练的位置预测模型获取所述待预测用户在所述未来预设时间段的位置预测结果。Based on the characteristics associated with the historical behavior of the user to be predicted, the user attribute characteristics of the user to be predicted, the future preset time period, and the area to be predicted, the location of the user to be predicted is acquired through a pre-trained location prediction model. The position prediction result of the preset time period in the future.
  20. 根据权利要求19所述的方法,其特征在于,所述基于与待预测用户历史行为相关联的特征、所述待预测用户的用户属性特征、所述未来预设时间段以及所述待预测区域,通过预先训练的位置预测模型获取所述待预测用户在所述未来预设时间段的位置预测结果,包括,通过所述位置预测模型:The method according to claim 19, characterized in that, based on the characteristics associated with the historical behavior of the user to be predicted, the user attribute characteristics of the user to be predicted, the future preset time period, and the area to be predicted Obtaining the location prediction result of the user to be predicted in the future preset time period through a pre-trained location prediction model includes, through the location prediction model:
    处理所述待预测用户的用户属性特征、所述未来预设时间段以及所述待预测区域获得第一向量表示;Processing the user attribute characteristics of the user to be predicted, the future preset time period, and the area to be predicted to obtain a first vector representation;
    将所述用户属性特征分别与各次历史打车信息融合,以获得与各次历史打车信息对应的向量表示;Fusing the user attribute characteristics with each historical taxi information respectively to obtain a vector representation corresponding to each historical taxi information;
    基于attention机制处理所述第一向量表示与各次历史打车信息对应的向量表示,得 到第二向量表示;Processing the first vector representation based on the attention mechanism to the vector representation corresponding to each historical ride-hailing information to obtain the second vector representation;
    基于第二向量表示确定所述待预测用户在未来预设时间段内去到待预测区域的概率。Based on the second vector, it indicates the probability that the user to be predicted will go to the area to be predicted within a preset time period in the future.
  21. 一种位置预测系统,其特征在于,包括:A position prediction system, characterized in that it comprises:
    至少一个存储介质,包括用于位置预测的一组指令;以及At least one storage medium, including a set of instructions for position prediction; and
    至少一个与所述存储介质通信的处理器,其中当执行所述一组指令时,所述至少一个处理器用于:At least one processor in communication with the storage medium, wherein when executing the set of instructions, the at least one processor is configured to:
    获取与待预测用户历史行为相关联的特征;所述特征至少包括与待预测用户历史行为相关的位置信息;Acquiring a feature associated with the historical behavior of the user to be predicted; the feature includes at least location information related to the historical behavior of the user to be predicted;
    至少基于与待预测用户历史行为相关联的特征,通过预先训练的位置预测模型获取所述待预测用户在所述未来预设时间段的位置预测结果。At least based on the features associated with the historical behavior of the user to be predicted, the location prediction result of the user to be predicted in the future preset time period is obtained through a pre-trained location prediction model.
  22. 一种位置预测装置,其特征在于,该装置包括:A position prediction device, characterized in that the device includes:
    获取模块,用于获取与待预测用户历史行为相关联的特征;所述特征至少包括与待预测用户历史行为相关的位置信息;An acquiring module, configured to acquire features associated with the historical behavior of the user to be predicted; the features include at least location information related to the historical behavior of the user to be predicted;
    预测模块,用于至少基于与待预测用户历史行为相关联的特征,通过预先训练的位置预测模型获取所述待预测用户在所述未来预设时间段的位置预测结果。The prediction module is configured to obtain the location prediction result of the user to be predicted in the future preset time period through a pre-trained location prediction model based at least on the features associated with the historical behavior of the user to be predicted.
  23. 一种计算机可读存储介质,其特征在于,该计算机可读存储介质上存储有计算机程序,该计算机程序被处理器运行时执行:A computer-readable storage medium, characterized in that a computer program is stored on the computer-readable storage medium, and the computer program is executed when a processor is running:
    获取与待预测用户历史行为相关联的特征;所述特征至少包括与待预测用户历史行为相关的位置信息;Acquiring a feature associated with the historical behavior of the user to be predicted; the feature includes at least location information related to the historical behavior of the user to be predicted;
    至少基于与待预测用户历史行为相关联的特征,通过预先训练的位置预测模型获取所述待预测用户在所述未来预设时间段的位置预测结果。At least based on the features associated with the historical behavior of the user to be predicted, the location prediction result of the user to be predicted in the future preset time period is obtained through a pre-trained location prediction model.
  24. 一种位置预测方法,其特征在于,该方法包括:A position prediction method, characterized in that the method includes:
    获取待预测用户在至少一种历史行为特征下的第一特征值,以及所述待预测用户在未来预设时间段内,在目标时间特征下的第二特征值和在目标位置特征下的第三特征值;Obtain the first feature value of the user to be predicted under at least one historical behavior feature, and the second feature value of the user to be predicted under the target time feature and the first feature value under the target location feature within a preset time period in the future. Three eigenvalues;
    将所述第一特征值、所述第二特征值和所述第三特征值输入至预先训练的位置预测模 型中,获取所述待预测用户在所述未来预设时间段的位置预测结果。The first feature value, the second feature value, and the third feature value are input into a pre-trained location prediction model to obtain a location prediction result of the user to be predicted in the future preset time period.
  25. 一种位置预测装置,其特征在于,该装置包括:A position prediction device, characterized in that the device includes:
    获取模块,用于获取待预测用户在至少一种历史行为特征下的第一特征值,以及所述待预测用户在未来预设时间段内,在目标时间特征下的第二特征值和在目标位置特征下的第三特征值;The acquiring module is used to acquire the first characteristic value of the user to be predicted under at least one historical behavior characteristic, and the second characteristic value of the user to be predicted under the target time characteristic and the target time characteristic within a preset time period in the future. The third feature value under the location feature;
    预测模块,用于将所述第一特征值、所述第二特征值和所述第三特征值输入至预先训练的位置预测模型中,获取所述待预测用户在所述未来预设时间段的位置预测结果。A prediction module, configured to input the first feature value, the second feature value, and the third feature value into a pre-trained location prediction model to obtain the user to be predicted in the future preset time period The location prediction result.
PCT/CN2020/073652 2019-01-21 2020-01-21 Method and device for location prediction WO2020151725A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN201910055395.8A CN111461380A (en) 2019-01-21 2019-01-21 Position prediction method and device
CN201910055395.8 2019-01-21

Publications (1)

Publication Number Publication Date
WO2020151725A1 true WO2020151725A1 (en) 2020-07-30

Family

ID=71682148

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2020/073652 WO2020151725A1 (en) 2019-01-21 2020-01-21 Method and device for location prediction

Country Status (2)

Country Link
CN (1) CN111461380A (en)
WO (1) WO2020151725A1 (en)

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20140143187A1 (en) * 2012-11-19 2014-05-22 Ha Yoon Song Method for probabilistically predicting location of object
CN104680250A (en) * 2015-02-11 2015-06-03 北京邮电大学 Position predicting system
CN106202236A (en) * 2016-06-28 2016-12-07 联想(北京)有限公司 A kind of customer location Forecasting Methodology and device
CN106776930A (en) * 2016-12-01 2017-05-31 合肥工业大学 A kind of location recommendation method for incorporating time and geographical location information

Family Cites Families (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108072378B (en) * 2016-11-15 2020-10-23 中国移动通信有限公司研究院 Method and device for predicting destination
CN108288096B (en) * 2017-01-10 2020-08-21 北京嘀嘀无限科技发展有限公司 Method and device for estimating travel time and training model
CN107992530A (en) * 2017-11-14 2018-05-04 北京三快在线科技有限公司 Information recommendation method and electronic equipment
CN108286980B (en) * 2017-12-29 2021-04-02 广州通易科技有限公司 Method for predicting destination and recommending driving route
CN108966148A (en) * 2018-08-10 2018-12-07 深圳北斗应用技术研究院有限公司 Prediction technique, server and the storage medium of trip information

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20140143187A1 (en) * 2012-11-19 2014-05-22 Ha Yoon Song Method for probabilistically predicting location of object
CN104680250A (en) * 2015-02-11 2015-06-03 北京邮电大学 Position predicting system
CN106202236A (en) * 2016-06-28 2016-12-07 联想(北京)有限公司 A kind of customer location Forecasting Methodology and device
CN106776930A (en) * 2016-12-01 2017-05-31 合肥工业大学 A kind of location recommendation method for incorporating time and geographical location information

Also Published As

Publication number Publication date
CN111461380A (en) 2020-07-28

Similar Documents

Publication Publication Date Title
TWI638320B (en) Systems, methods and non-transitory computer-readable storage mediums for recommending an estimated time of arrival
TWI670677B (en) Systems and methods for recommending an estimated time of arrival
TWI673659B (en) Electronic systems and methods for determining an estimated time of arrival and relevant non-transitory computer-readable medium
JP7047096B2 (en) Systems and methods for determining estimated arrival times for online-to-offline services
TWI689744B (en) Systems and methods for determining estimated time of arrival and relevant non-transitory machine-readable storage medium
JP2019527871A (en) System and method for determining estimated arrival time
US20210140774A1 (en) Systems and methods for recommending pick-up locations
JP6503474B2 (en) System and method for determining a path of a mobile device
CN112868036B (en) System and method for location recommendation
AU2018102202A4 (en) Systems and methods for cheat examination
TWI675184B (en) Systems, methods and non-transitory computer readable medium for route planning
CN109791731B (en) Method and system for estimating arrival time
WO2018209551A1 (en) Systems and methods for determining an estimated time of arrival
CN111415024A (en) Arrival time estimation method and estimation device
CN110751531A (en) Track identification method and device and electronic equipment
CN110800030B (en) Method and system for car pooling service
CN111954891A (en) Cross-service shared automobile resource reuse method
CN111260423A (en) Order distribution method and device, electronic equipment and computer readable storage medium
WO2020151725A1 (en) Method and device for location prediction
CN111144968B (en) System and method for distributing service requests
CN113924460B (en) System and method for determining recommendation information for service request
CN114365205A (en) System and method for determining estimated time of arrival in online-to-offline service
CN110832811A (en) System and method for transmitting spatial data

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 20745052

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 20745052

Country of ref document: EP

Kind code of ref document: A1