CN110069482A - A kind of data cleaning method predicted for sharing automobile demand - Google Patents

A kind of data cleaning method predicted for sharing automobile demand Download PDF

Info

Publication number
CN110069482A
CN110069482A CN201910340014.0A CN201910340014A CN110069482A CN 110069482 A CN110069482 A CN 110069482A CN 201910340014 A CN201910340014 A CN 201910340014A CN 110069482 A CN110069482 A CN 110069482A
Authority
CN
China
Prior art keywords
data
interaction
site
car
pick
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201910340014.0A
Other languages
Chinese (zh)
Inventor
袁春雷
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beijing Shouqi Zhixing Technology Co Ltd
Original Assignee
Beijing Shouqi Zhixing Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beijing Shouqi Zhixing Technology Co Ltd filed Critical Beijing Shouqi Zhixing Technology Co Ltd
Priority to CN201910340014.0A priority Critical patent/CN110069482A/en
Publication of CN110069482A publication Critical patent/CN110069482A/en
Pending legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/21Design, administration or maintenance of databases
    • G06F16/215Improving data quality; Data cleansing, e.g. de-duplication, removing invalid entries or correcting typographical errors
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q30/00Commerce
    • G06Q30/02Marketing; Price estimation or determination; Fundraising
    • G06Q30/0201Market modelling; Market analysis; Collecting market data
    • G06Q30/0202Market predictions or forecasting for commercial activities
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q30/00Commerce
    • G06Q30/06Buying, selling or leasing transactions
    • G06Q30/0645Rental transactions; Leasing transactions

Landscapes

  • Engineering & Computer Science (AREA)
  • Business, Economics & Management (AREA)
  • Accounting & Taxation (AREA)
  • Finance (AREA)
  • Theoretical Computer Science (AREA)
  • Development Economics (AREA)
  • Strategic Management (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • General Business, Economics & Management (AREA)
  • Marketing (AREA)
  • Economics (AREA)
  • Entrepreneurship & Innovation (AREA)
  • Data Mining & Analysis (AREA)
  • Databases & Information Systems (AREA)
  • Game Theory and Decision Science (AREA)
  • Quality & Reliability (AREA)
  • General Engineering & Computer Science (AREA)
  • Coin-Freed Apparatuses For Hiring Articles (AREA)

Abstract

The present invention relates to a kind of for sharing the data cleaning method of automobile demand prediction, extracts the data opened with all client applications of once interaction including (1);(2) all sites in interaction 5 kilometer range of spot are obtained and whether there is or not the data wait hire a car for corresponding site when this time interaction occurs;(3) whether there is or not the data for carrying out lower single operation after acquisition this time interaction;(4) the process id of this opening client application is belonged to and is interacted spot apart from nearest site;(5) it obtains and carries out the longest pick-up walking distance that the success of this time user of interaction places an order;(6) all site information in user's longest pick-up walking distance are obtained;(7) determine whether wait hire a car;(8) this data result interacted is belonged to and interacts spot apart from nearest site, cleaned data and finish.The validity of present invention guarantee data;Dirty data will not be adulterated, improves forecasting accuracy, while reducing the model calculation time.

Description

A kind of data cleaning method predicted for sharing automobile demand
Technical field
The invention belongs to share automotive field, and in particular to a kind of for sharing the data cleansing side of automobile demand prediction Method.
Background technique
" Car sharing " comes across the last century 40's earliest, is invented by Swiss.They have organized " self-driving in the whole nation Vehicle cooperative society ", this mountainous region country as Switzerland is very useful, after a people is finished vehicle, just gives car key next People is easier than establishing network in level land country.
The states such as later Japan, Britain fall over each other to imitate, but all not formed scale.Japan is primarily due to automaker and does not prop up Hold this plan, Japanese likes possessing oneself a private car.And Britain is although obtain governmental support, automobile leasing expense It is cheap, to hinder the development of " Car sharing ".
Currently, in the shared automobile for having had already appeared multiple companies in the street of China, but at present for sharing automobile Requirement forecasting be still within research and development and exploratory stage, the requirement forecasting of shared automobile is conducive to maximumlly bring row of hiring a car together For.But the data of requirement forecasting are simply cleaned mostly or are applied without cleaning, and then lead to current share The requirement forecasting of automobile has the following disadvantages:
1. not can guarantee the validity of data;
2. dirty data can be adulterated, forecasting accuracy is influenced.
Summary of the invention
Goal of the invention: the present invention has made improvements in view of the above-mentioned problems of the prior art, i.e., the invention discloses one kind For sharing the data cleaning method of automobile demand prediction, one is made accurately for original data required for requirement forecasting Data cleansing.
Technical solution: a kind of for sharing the data cleaning method of automobile demand prediction, comprising the following steps:
(1), it from the data extracted in the data obtained and opened with all client applications of once interaction are buried, completes After enter step (2);
(2), step (1) extract data in, obtain interaction 5 kilometer range of spot in all sites and this time Corresponding site enters step (3) whether there is or not the data wait hire a car after the completion when interaction occurs;
(3), by open client application process id (sessionid) come obtain this time interaction after whether there is or not place an order The data of operation enter step (4) after the completion;
(4), the process id of this opening client application is belonged to and is interacted spot apart from nearest site, it is complete (5) are entered step after;
(5), it obtains and carries out the longest pick-up walking distance that the success of this time user of interaction places an order, enter step after the completion (6), in which:
Walking distance when longest pick-up walking distance refers to placing an order successfully between user position and pick-up site;
(6), all site information in user's longest pick-up walking distance are obtained, enter step (7) after the completion;
(7) if, step (6) have not less than one in all sites for obtaining wait hire a car, be considered as and need to be hired a car, It is on the contrary then be considered as without wait hire a car, (8) are entered step after the completion;
(8), this data result interacted is belonged to and interacts spot apart from nearest site, it is complete to clean data Finish, in which:
This time data result of interaction include this interaction either with or without place an order, either with or without wait hire a car.
Further, the data that client application is opened include neighbouring site and information of vehicles, facility information, Yong Huxin Whether breath order placement information and clicks each icon information.
Further, step (5) if in the user be new registration user, longest pick-up walking distance is set as 100 meters.
The utility model has the advantages that a kind of data cleaning method for sharing automobile demand prediction disclosed by the invention has with following Beneficial effect:
1, guarantee the validity of data;
2, dirty data will not be adulterated, forecasting accuracy is improved;
3, the model calculation time is reduced after cleaning data.
Detailed description of the invention
Fig. 1 is disclosed by the invention a kind of for sharing the flow chart of the data cleaning method of automobile demand prediction.
Specific embodiment:
Detailed description of specific embodiments of the present invention below.
As shown in Figure 1, a kind of for sharing the data cleaning method of automobile demand prediction, comprising the following steps:
(1), it from the data extracted in the data obtained and opened with all client applications of once interaction are buried, completes After enter step (2);
(2), step (1) extract data in, obtain interaction 5 kilometer range of spot in all sites and this time Corresponding site enters step (3) whether there is or not the data wait hire a car after the completion when interaction occurs;
(3), by open client application process id (sessionid) come obtain this time interaction after whether there is or not place an order The data of operation enter step (4) after the completion;
(4), the process id of this opening client application is belonged to and is interacted spot apart from nearest site, it is complete (5) are entered step after;
(5), it obtains and carries out the longest pick-up walking distance that the success of this time user of interaction places an order, enter step after the completion (6), in which:
Walking distance when longest pick-up walking distance refers to placing an order successfully between user position and pick-up site;
(6), all site information in user's longest pick-up walking distance are obtained, enter step (7) after the completion;
(7), determine whether wait hire a car, (8) are entered step after the completion
If had in all sites that step (6) obtains not less than one wait hire a car, it is considered as and needs to be hired a car, it is on the contrary Then it is considered as without wait hire a car;
(8), this data result interacted is belonged to and interacts spot apart from nearest site, it is complete to clean data Finish, in which:
This time data result of interaction include this interaction either with or without place an order, either with or without wait hire a car.
Further, the data that client application is opened include neighbouring site and information of vehicles, facility information, Yong Huxin Whether breath order placement information and clicks each icon information.
Further, step (5) if in the user be new registration user, longest pick-up walking distance is set as 100 meters.
Embodiments of the present invention are elaborated above.But present invention is not limited to the embodiments described above, Technical field those of ordinary skill within the scope of knowledge, can also do without departing from the purpose of the present invention Various change out.

Claims (3)

1. a kind of for sharing the data cleaning method of automobile demand prediction, which comprises the following steps:
(1), it from the data extracted in the data obtained and opened with all client applications of once interaction are buried, completes laggard Enter step (2);
(2), in the data that step (1) is extracted, all sites in interaction 5 kilometer range of spot and this time interaction are obtained Corresponding site enters step (3) whether there is or not the data wait hire a car after the completion when generation;
(3), it is obtained after this time interaction whether there is or not the data for carrying out lower single operation, is completed by opening the process id of client application After enter step (4);
(4), the process id of this opening client application is belonged to and is interacted spot apart from nearest site, after the completion Enter step (5);
(5), it obtains and carries out the longest pick-up walking distance that the success of this time user of interaction places an order, enter step (6) after the completion, Wherein:
Walking distance when longest pick-up walking distance refers to placing an order successfully between user position and pick-up site;
(6), all site information in user's longest pick-up walking distance are obtained, enter step (7) after the completion;
(7) if, step (6) have not less than one in all sites for obtaining wait hire a car, be considered as and need to be hired a car, it is on the contrary Then it is considered as without wait hire a car, enters step (8) after the completion;
(8), this data result interacted is belonged to and interacts spot apart from nearest site, cleaned data and finish, In:
This time data result of interaction include this interaction either with or without place an order, either with or without wait hire a car.
2. as described in claim 1 a kind of for sharing the data cleaning method of automobile demand prediction, which is characterized in that client The data that end application is opened include neighbouring site and information of vehicles, facility information, user information, whether order placement information and click Each icon information.
3. as described in claim 1 a kind of for sharing the data cleaning method of automobile demand prediction, which is characterized in that step (5) if in the user be new registration user, longest pick-up walking distance is set as 100 meters.
CN201910340014.0A 2019-04-25 2019-04-25 A kind of data cleaning method predicted for sharing automobile demand Pending CN110069482A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201910340014.0A CN110069482A (en) 2019-04-25 2019-04-25 A kind of data cleaning method predicted for sharing automobile demand

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201910340014.0A CN110069482A (en) 2019-04-25 2019-04-25 A kind of data cleaning method predicted for sharing automobile demand

Publications (1)

Publication Number Publication Date
CN110069482A true CN110069482A (en) 2019-07-30

Family

ID=67368870

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201910340014.0A Pending CN110069482A (en) 2019-04-25 2019-04-25 A kind of data cleaning method predicted for sharing automobile demand

Country Status (1)

Country Link
CN (1) CN110069482A (en)

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20160078367A1 (en) * 2014-10-15 2016-03-17 Brighterion, Inc. Data clean-up method for improving predictive model training
CN107196788A (en) * 2017-05-02 2017-09-22 阿里巴巴集团控股有限公司 A kind of processing method for burying point data, device, server and client
CN107871366A (en) * 2017-11-10 2018-04-03 广州市睿仕信息科技有限公司 One kind takes return the car easily automobile timesharing leasing system and its rent method
CN109582664A (en) * 2018-11-23 2019-04-05 温州职业技术学院 A kind of shared bicycle trip characteristics analysis method

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20160078367A1 (en) * 2014-10-15 2016-03-17 Brighterion, Inc. Data clean-up method for improving predictive model training
CN107196788A (en) * 2017-05-02 2017-09-22 阿里巴巴集团控股有限公司 A kind of processing method for burying point data, device, server and client
CN107871366A (en) * 2017-11-10 2018-04-03 广州市睿仕信息科技有限公司 One kind takes return the car easily automobile timesharing leasing system and its rent method
CN109582664A (en) * 2018-11-23 2019-04-05 温州职业技术学院 A kind of shared bicycle trip characteristics analysis method

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
北京首汽智行科技有限公司: "GOFUN出行", 《GOFUN出行 *

Similar Documents

Publication Publication Date Title
US11333796B2 (en) Spatial autocorrelation machine learning-based downscaling method and system of satellite precipitation data
CN108550261B (en) Urban traffic OD (origin-destination) calculation method based on RFID (radio frequency identification) electronic license plate
CN104159189B (en) Resident trip information acquisition method based on smart mobile phone
CN102595323B (en) Method for obtaining resident travel characteristic parameter based on mobile phone positioning data
Lepič Limits to territorial nationalization in election support for an independence-aimed regional nationalism in Catalonia
CN109916413A (en) Road matching method, system, device and storage medium based on grid dividing
CN100476354C (en) Symmetric system sampling technique for estimating area change by different scale remote sensing data
CN105046350A (en) AFC data-based public transport passenger flow OD real-time estimation method
CN109934403A (en) Charge load Analysis prediction technique in electric car resident region based on mathematical model
CN106204937A (en) The car rental cost of a kind of ladder valuation calculates system and method
CN104967733A (en) Traffic data acquiring and analyzing system based on smart phone applications
CN104320789A (en) Internet of vehicles RSU optimal allocation method based on game theory
CN108776868A (en) A kind of rural area village hollowing appraisal procedure and device based on electricity consumption big data
CN111414719A (en) Method and device for extracting peripheral features of subway station and estimating traffic demand
CN110069482A (en) A kind of data cleaning method predicted for sharing automobile demand
CN109064750A (en) Urban road network traffic estimation method and system
Borgnat et al. Studying Lyon's Vélo'v: a statistical cyclic model
CN109582664A (en) A kind of shared bicycle trip characteristics analysis method
Bochner et al. Advances in urban trip generation estimation
CN106781467B (en) A kind of bus passenger based on collaborative filtering is swiped the card site information extracting method
Rieser-Schüssler Capitalising modern data sources for observing and modelling transport behaviour
CN115860520B (en) Method for evaluating dynamic and static space-time reachability of high-speed rail hub and urban public transportation
CN102117510A (en) Unknown transportation mean passenger source predictive method in big activities
CN108596381B (en) Urban parking demand prediction method based on OD data
CN116012029A (en) Big data information acquisition method and system

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication

Application publication date: 20190730

RJ01 Rejection of invention patent application after publication