CN108764375B - Highway goods stock transprovincially matching process and device - Google Patents
Highway goods stock transprovincially matching process and device Download PDFInfo
- Publication number
- CN108764375B CN108764375B CN201810664921.6A CN201810664921A CN108764375B CN 108764375 B CN108764375 B CN 108764375B CN 201810664921 A CN201810664921 A CN 201810664921A CN 108764375 B CN108764375 B CN 108764375B
- Authority
- CN
- China
- Prior art keywords
- data
- outlet
- entrance
- transprovincially
- matching
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Links
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F18/00—Pattern recognition
- G06F18/20—Analysing
- G06F18/22—Matching criteria, e.g. proximity measures
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06Q—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
- G06Q20/00—Payment architectures, schemes or protocols
- G06Q20/02—Payment architectures, schemes or protocols involving a neutral party, e.g. certification authority, notary or trusted third party [TTP]
- G06Q20/023—Payment architectures, schemes or protocols involving a neutral party, e.g. certification authority, notary or trusted third party [TTP] the neutral party being a clearing house
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Data Mining & Analysis (AREA)
- Business, Economics & Management (AREA)
- Physics & Mathematics (AREA)
- Strategic Management (AREA)
- General Business, Economics & Management (AREA)
- Accounting & Taxation (AREA)
- Computer Security & Cryptography (AREA)
- Life Sciences & Earth Sciences (AREA)
- Artificial Intelligence (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Bioinformatics & Computational Biology (AREA)
- Computer Vision & Pattern Recognition (AREA)
- Evolutionary Biology (AREA)
- Evolutionary Computation (AREA)
- General Engineering & Computer Science (AREA)
- Traffic Control Systems (AREA)
Abstract
The present invention relates to communications and transportation statistical technique fields, more particularly to a kind of highway goods stock transprovincially matching process and device, method includes: to obtain the entry data and outlet data of the corresponding goods stock of charge station transprovincially prestored, target algorithm model is obtained based on entry data and outlet data, to in charge station transprovincially entry data to be processed and outlet data to be processed use target algorithm model to be matched to obtain matching result, and similarity calculation is carried out to obtain similarity result to the license plate number in the license plate number and outlet data to be processed in the corresponding entry data to be processed of matching result, and the matching result is optimized based on the similarity result.By the above method with effective guarantee highway goods stock transprovincially matched accuracy, and then effectively solve expressway tol lcollection data segmentation problem and can not be directly by license plate number progress goods stock transprovincially matching problem as caused by Car license recognition is not complete or identification mistake transprovincially.
Description
Technical field
The present invention relates to communications and transportation to count field, transprovincially matches in particular to a kind of highway goods stock
Method and device.
Background technique
Important main line channel of the highway as highway transportation, for support national economic development, push social progress,
Safeguard national security etc. plays an important role.Especially the driving path of vehicle on a highway is reflecting regional warp
The important reference indicator for state of development of helping.
ExpresswayNetwork Toll Collection System is to save cascade network, provincial unified allocation settlement, expressway tol lcollection data at present
Essential record vehicle receives and dispatches situation in the IC card for respectively passing in and out freeway toll station inside the province, when vehicle passes through adjacent province, needs
The highway passage IC card in different provinces is received and dispatched again at the provincial boundaries station of two provinces, carry out pike balance respectively, it is therefore, right
In vehicle transprovincially, traveling record is divided in different provinces.
Inventor it has been investigated that, due to the precision of car license recognition equipment is not high and expressway tol lcollection management require etc.
Reason causes the license plate recognition rate in expressway tol lcollection data not high, directly can not carry out vehicle string transprovincially by license plate number
It connects.Therefore, transprovincially matching of the goods stock on national highway is solved, effectively to obtain goods stock on a highway
Complete driving path is a technical problem to be solved urgently.
Summary of the invention
In view of this, the purpose of the present invention is to provide a kind of highway goods stock transprovincially matching process and device,
Above-mentioned technical problem is effectively relieved.
To achieve the above object, the embodiment of the present invention adopts the following technical scheme that
A kind of highway goods stock transprovincially matching process, which comprises
The entry data and outlet data of the corresponding goods stock of charge station transprovincially prestored are obtained, the outlet number is based on
The matching degree of the license plate number in license plate number and entry data in generates the sample number including matched data and non-matched data
According to:
By in the sample data entry data and the outlet data perform mathematical calculations to obtain overall target, and
The overall target is counted to obtain correlation metric, and the target signature collection is allocated to obtain training spy
Collection and test feature collection;
The correlation metric is handled to obtain candidate characteristic set, and important journey is carried out to the candidate characteristic set
Degree evaluation is to obtain target signature collection;
To the training characteristics collection use a variety of default machine learning algorithms be trained study with it is corresponding obtain it is a variety of across
Matching algorithm model is saved, and chooses a kind of the first algorithm model of conduct from a variety of models of matching algorithm transprovincially;
It uses the test feature collection to assess to obtain assessment result first algorithm model, and is commented according to this
Estimate result to be adjusted to obtain target algorithm model first algorithm model;
To in charge station transprovincially entry data to be processed and outlet data to be processed using the target algorithm model into
Row matching is to obtain matching result;
The license plate in license plate number and outlet data to be processed in entry data to be processed corresponding to the matching result
Number similarity calculation is carried out to obtain similarity result, and the matching result is optimized based on the similarity result.
Optionally, described to obtain the charge station transprovincially prestored in above-mentioned highway goods stock transprovincially matching process
The entry data and outlet data of corresponding goods stock, based on the vehicle in the license plate number and entry data in the outlet data
The matching degree of the trade mark generates the step of matched data and non-matched data and includes:
Charge station transprovincially is obtained according to freeway net topological structure, charge station location and provincial administrative area boundary to believe
Breath, and obtain the goods stock outlet data corresponding with the information of charge station transprovincially and entry data prestored;
Have the corresponding outlet data of complete license plate number as target outlet data in outlet data, using preset algorithm
From searched in the entry data time with the complete license plate number in the target outlet data in consistent and outlet data and
The difference of time in entry data is located at the entry data of a setting time range as target entries data, and by the target
Entry data and with the target outlet data of the target entries Data Matching as matched data, using other data as mismatching
Data.
Optionally, in above-mentioned highway goods stock transprovincially matching process, will there is complete license plate in outlet data
Number corresponding outlet data searches to go out with the target from the entry data as target outlet data using preset algorithm
The difference of time of the complete license plate number unanimously and in time and entry data in outlet data in mouth data is located at a setting
The entry data of time range as target entries data, and by the target entries data and with the target entries Data Matching
Target outlet data include: as matched data, using other data as the step of non-matched data
License plate number L in outlet data is calculated using JaroWinklerDistance algorithmOutletWith the vehicle in entry data
Trade mark LEntranceSimilarity Slicense:
Wherein, m LOutletAnd LEntranceMatched number of characters, t are the number of transposition;
According to the minimum speed of highway must not lower than V kilometers of standard per hour, according to the distance D of charge station transprovincially,
Calculate the running time T from outlet charge station to entrance charge station:
T=(D/V) × 60
Entry data is screened based on the running time T, is subtracted when screening obtains the entry time in entry data
Go the Outlet time of outlet data in time interval [- T, T] range and license plate number similarity SlicenseMore than a setting value
When, determine the corresponding vehicle of corresponding outlet data and the corresponding vehicle of entry data is same vehicle, and by the entry data
And it is included in the matched outlet data of the entry data respectively as the target entries data and target outlet data
With data, other data are included in non-matched data.
Optionally, in above-mentioned highway goods stock transprovincially matching process, the entry data includes entrance charge
It stands coding, entrance license plate number, entry time, entrance vehicle, the total number of axle of entrance vehicle, entrance vehicle goods gross weight and entrance vehicle
Freight weight limit, the outlet data include outlet charge station coding, outlet license plate number, Outlet time, outlet vehicle, outlet vehicle line shaft
Number, outlet vehicle goods gross weight and outlet vehicle weight limitation, by the entry data and outlet data progress in the sample data
Mathematical operation is to obtain overall target, and the step of being counted the overall target to obtain correlation metric includes:
Utilize the entry time T in entry dataEntrance, entrance vehicle CEntrance, the total number of axle A of entrance vehicleEntrance, entrance vehicle goods it is total
Weight WEntranceAnd entrance vehicle weight limitation LWEntranceSubtract the Outlet time T in corresponding outlet dataOutlet, outlet vehicle COutlet, outlet vehicle
Total number of axle AOutlet, outlet vehicle goods gross weight WOutletAnd outlet vehicle weight limitation LWEntranceObtain overall target:
Dtime=TEntrance-TOutlet
Dcar=CEntrance-COutlet
Daxis=AEntrance-AOutlet
Dweight=WEntrance-WOutlet
Dlimitweight=LWEntrance-LWOutlet
Wherein, DtimeFor entrance time difference, DcarFor entrance vehicle is poor, DaxisFor the total number of axle of entrance vehicle it is poor,
DweightFor the total method of double differences of entrance vehicle goods, DlimitweightIt is poor for entrance vehicle weight limitation;
Statistic of classification goes out the entrance time difference Dtime, entrance vehicle difference Dcar, the total number of axle difference D of entrance vehicleaxis、
The total method of double differences D of entrance vehicle goodsweight, entrance vehicle weight limitation difference DlimitweightFeature distribution, and choose judge outlet data and
The whether matched correlation metric of entry data.
Optionally, in above-mentioned highway goods stock transprovincially matching process, the correlation metric is handled
To obtain candidate characteristic set, and significance level evaluation is carried out to obtain target signature Ji Buzhoubao to the candidate characteristic set
It includes:
With nondimensionalization, qualitative features quantification, quantitative characteristic binaryzation and discrete features coding method to discrepancy
Mouth time difference Dtime, entrance vehicle difference Dcar, the total number of axle difference D of entrance vehicleaxis, the total method of double differences D of entrance vehicle goodsweight, enter and leave
Mouth vehicle weight limitation difference DlimitweightIt is handled, forms highway goods stock matching candidate feature set transprovincially;
The significance level of the candidate characteristic set is evaluated to obtain target using correlation coefficient process or variance back-and-forth method
Feature set.
Optionally, quantitative with nondimensionalization, qualitative features in above-mentioned highway goods stock transprovincially matching process
Change, quantitative characteristic binaryzation, One-Hot coding and/or discrete features coding method are to entrance time difference Dtime, entrance vehicle
Type difference Dcar, the total number of axle difference D of entrance vehicleaxis, the total method of double differences D of entrance vehicle goodsweightIt is poor with entrance vehicle weight limitation
DlimitweightHandled, formed highway goods stock transprovincially matching candidate feature set the step of include:
To the entrance time difference Dtime, entrance vehicle difference Dcar, the total number of axle difference D of entrance vehicleaxisIt is respectively adopted
The One-Hot coding is handled, and to the total method of double differences D of the entrance vehicle goodsweightIt is poor with entrance vehicle weight limitation
DlimitweightSection is carried out respectively to zoom in the section of [- 1,1] to obtain highway goods stock matching candidate spy transprovincially
Collection.
Optionally, in above-mentioned highway goods stock transprovincially matching process, to the training characteristics collection using a variety of
Default machine learning algorithm is trained study and obtains a variety of matching algorithm models transprovincially with corresponding, and from it is described it is a variety of transprovincially
Include: with choosing a kind of in algorithm model as the step of the first algorithm model
Value, support vector machines, naive Bayesian, decision tree, random forest and ladder are closed on using logistic regression, K respectively
Degree hoisting machine learning algorithm is trained study to the training characteristics collection and obtains corresponding model, and calculates the standard of each model
True rate score;
Using a corresponding model of accuracy rate highest scoring in each model as the first algorithm model.
Optionally, in above-mentioned highway goods stock transprovincially matching process, using the test feature collection to described
First algorithm model assess and handled first algorithm model to obtain target algorithm mould according to assessment result
The step of type includes:
First algorithm model is tested using test feature collection, and draws learning curve, ROC curve, is calculated
AUC value;
According to learning curve, ROC curve and AUC value, the fitting state of first algorithm model is judged;
It is adjusted according to parameter and characteristic variable of the fitting state to the first algorithm model to obtain target algorithm
Model.
Optionally, in above-mentioned highway goods stock transprovincially matching process, according in the entry data to be processed
License plate number and outlet data to be processed in license plate number carry out similarity calculation to obtain similarity result, and be based on the phase
Include: like the step of result optimizes the matching result is spent
It is the vehicle in the license plate number and outlet data to be processed in the corresponding entry data to be processed of matching to matching result
The trade mark using JaroWinklerDistance algorithm carries out that similarity is calculated;
When the similarity is greater than a preset value, then the matching result is maintained, otherwise, the matching result is carried out
Modification.
The present invention also provides a kind of highway goods stock transprovincially coalignments, comprising:
Module is obtained, for obtaining the entry data and outlet data of the corresponding goods stock of charge station transprovincially prestored,
It includes matched data and not that matching degree based on the license plate number in the license plate number and entry data in the outlet data, which generates,
Sample data with data;
Computing module, for by the sample data entry data and the outlet data perform mathematical calculations with
It is counted to overall target, and to the overall target to obtain correlation metric;
Processing module, for being handled the correlation metric to obtain candidate characteristic set, and to described candidate special
Collection carries out significance level evaluation to obtain target signature collection, and is allocated to obtain training characteristics to the target signature collection
Collection and test feature collection;
Training module, for using a variety of default machine learning algorithms to be trained study with right the training characteristics collection
A variety of matching algorithm models transprovincially should be obtained, and choose a kind of the first algorithm of conduct from a variety of models of matching algorithm transprovincially
Model;
Evaluation module, for using the test feature collection to assess first algorithm model to obtain assessment knot
Fruit, and first algorithm model is adjusted to obtain target algorithm model according to the assessment result;
Matching module, for in charge station transprovincially entry data to be processed and outlet data to be processed use the mesh
Mark algorithm model is matched to obtain matching result;
Optimization module, for the license plate number in the license plate number and outlet data to be processed in the entry data to be processed
Carry out similarity calculation with obtain the license plate number in similarity result entry data to be processed corresponding to the matching result and
License plate number in outlet data to be processed carries out similarity calculation to obtain similarity result, and is based on the similarity result pair
The matching result optimizes.
A kind of highway goods stock provided by the invention transprovincially matching process and device are prestored transprovincially by obtaining
The entry data and outlet data of the corresponding goods stock of charge station, and target algorithm is obtained based on entry data and outlet data
Model, in charge station transprovincially entry data to be processed and outlet data to be processed use target algorithm model to be matched with
Matching result is obtained, and similar with the license plate number progress in outlet data to be processed according to the license plate number in entry data to be processed
Degree is calculated to obtain similarity result, and is optimized based on the similarity result to the matching result.It is set by above-mentioned
It sets with effective guarantee highway goods stock transprovincially matched accuracy, can effectively solve expressway tol lcollection data and transprovincially divide
It cuts problem and goods stock directly can not be carried out transprovincially by license plate number as caused by Car license recognition is not complete or identification mistake
Matching problem, to obtain goods stock complete driving path on a highway.
To enable the above objects, features and advantages of the present invention to be clearer and more comprehensible, preferred embodiment is cited below particularly, and cooperate
Appended attached drawing, is described in detail below.
Detailed description of the invention
Fig. 1 is the structural block diagram of terminal device provided in an embodiment of the present invention.
Fig. 2 is the flow diagram of highway goods stock provided in an embodiment of the present invention transprovincially matching process.
Fig. 3 is the flow diagram of step S110 in Fig. 2.
Fig. 4 is goods stock provided in an embodiment of the present invention matched ROC curve transprovincially.
Fig. 5 is the connection block diagram of highway goods stock provided in an embodiment of the present invention transprovincially coalignment.
Icon: 10- terminal device;12- memory;14- processor;100- highway goods stock transprovincially matches dress
It sets;110- obtains module;120- computing module;130- processing module;140- training module;150- evaluation module;160- matching
Module: 170- optimization module.
Specific embodiment
In order to make the object, technical scheme and advantages of the embodiment of the invention clearer, below in conjunction with the embodiment of the present invention
In attached drawing, technical scheme in the embodiment of the invention is clearly and completely described, it is clear that described embodiment only
It is a part of the embodiments of the present invention, instead of all the embodiments.The present invention being usually described and illustrated herein in the accompanying drawings
The component of embodiment can be arranged and be designed with a variety of different configurations.
As shown in Figure 1, the embodiment of the invention provides a kind of terminal device 10, including memory 12, processor 14 and height
Fast highway freight vehicle transprovincially coalignment 100.Wherein, the terminal device 10 can be but not limited to server, intelligent hand
Machine, PC (personal computer, PC), tablet computer etc. have the electronic equipment of data-handling capacity, herein not
Make specific limit.
In this embodiment, it is directly or indirectly electrically connected between the memory 12 and processor 14, to realize
The transmission or interaction of data.For example, these elements can realize electricity by one or more communication bus or signal wire between each other
Property connection.Transprovincially coalignment 100 can be with software or firmware including at least one for the highway goods stock
(firmware) form is stored in the software function module in the memory 12.The processor 14 is for executing described deposit
The executable module stored in reservoir 12, such as the highway goods stock transprovincially software included by coalignment 100
Functional module and computer program etc., to realize highway goods stock matching process transprovincially.
Wherein, the memory 12 may be, but not limited to, random access memory (Random Access Memory,
RAM), read-only memory (Read Only Memory, ROM), programmable read only memory (Programmable Read-Only
Memory, PROM), erasable read-only memory (Erasable Programmable Read-Only Memory, EPROM),
Electricallyerasable ROM (EEROM) (Electric Erasable Programmable Read-Only Memory, EEPROM) etc..
Wherein, memory 12 is for storing program, and the processor 14 executes described program after receiving and executing instruction.
The processor 14 may be a kind of IC chip, the processing capacity with signal.Above-mentioned processor 14
It can be general processor, including central processing unit (Central Processing Unit, CPU), network processing unit
(Network Processor, NP) etc.;It can also be digital signal processor (DSP), specific integrated circuit (ASIC), scene
Programmable gate array (FPGA) either other programmable logic device, discrete gate or transistor logic, discrete hardware group
Part.It may be implemented or execute disclosed each method, step and the logic diagram in the embodiment of the present invention.General processor can be with
It is that microprocessor or the processor are also possible to any conventional processor etc..
It is appreciated that structure shown in FIG. 1 is only to illustrate, the terminal device 10 may also include more than shown in Fig. 1
Perhaps less component or with the configuration different from shown in Fig. 1.Each component shown in Fig. 1 can use hardware, software
Or combinations thereof realize.
In conjunction with Fig. 2, the embodiment of the present invention also provides a kind of highway shipping that can be applied to above-mentioned terminal device 10
Vehicle transprovincially matching process, the method includes the steps seven steps of S110- step S170.
Step S110: the entry data and outlet data of the corresponding goods stock of charge station transprovincially prestored are obtained, is based on
The matching degree of the license plate number in license plate number and entry data in the outlet data includes matched data and non-matched data
Sample data.
Wherein, the mode of the entry data and outlet data that obtain the corresponding goods stock of charge station transprovincially prestored can be with
It is the corresponding charge station's information in national freeway toll station position and Provincial administrative division boundary inputted according to user
As charge station's information transprovincially, and transprovincially corresponding outlet data and the entry data in charge station's information is obtained, is also possible to
Charge station's information transprovincially is obtained according in the high speeds highway network topological structure such as Amap and Tencent's map, and obtains this and transprovincially receives
Take the entry data and outlet data of the corresponding goods stock of station information.
It is optionally, in the present embodiment, described to obtain the corresponding goods stock of charge station transprovincially prestored incorporated by reference to Fig. 2
Entry data and outlet data, matching degree based on the license plate number in the license plate number and entry data in the outlet data is raw
At including the steps that the sample data of matched data and non-matched data includes:
Step S112: it is obtained transprovincially according to freeway net topological structure, charge station location and provincial administrative area boundary
Charge station's information, and obtain the goods stock outlet data corresponding with the information of charge station transprovincially and entry data prestored.
Step S114: have the corresponding outlet data of complete license plate number as target outlet data in outlet data, adopt
With preset algorithm from the entry data search with the complete license plate number in the target outlet data consistent and outlet data
In time and entry data in the difference of time be located at the entry data of a setting time range as target entries data,
And using the target entries data and with the target outlet data of the target entries Data Matching as matched data, by other data
As non-matched data.
It should be noted that in the present embodiment, the charge station transprovincially can be out inbound fission, it is also possible to enter and leave
Stand one, when it is described go out inbound fission when, the distance between the outlet station of the charge station transprovincially and access station should one compared with
In short distance range, such as in 20,30 or 40 kilometers.
The preset algorithm can be JaroWinkler Distance algorithm, be also possible to Levenshtein
Distance algorithm, is chosen according to actual needs, is not specifically limited herein.
Optionally, in the present embodiment, it is searched from the entry data using preset algorithm and the target outlet number
The difference of time of the complete license plate number in unanimously and in time and entry data in outlet data is located at a setting time
The entry data of range is as target entries data, and by the target entries data and target with the target entries Data Matching
Outlet data includes: as matched data, using other data as the step of non-matched data
License plate number L in outlet data is calculated using JaroWinklerDistance algorithmOutletWith the vehicle in entry data
Trade mark LEntranceSimilarity Slicense:
Slicense=Sj+(lp(1-Sj))
Wherein, m LOutletAnd LEntranceMatched number of characters, t are the number of transposition;
According to the minimum speed of highway must not lower than V kilometers of standard per hour, according to the distance D of charge station transprovincially,
Calculate the running time T from outlet charge station to entrance charge station:
T=(D/V) × 60
Entry data is screened based on the running time T, is subtracted when screening obtains the entry time in entry data
Go the Outlet time of outlet data in time interval [- T, T] range and license plate number similarity SlicenseMore than a setting value
When, determine the corresponding vehicle of corresponding outlet data and the corresponding vehicle of entry data is same vehicle, and by the entry data
And it is included in the matched outlet data of the entry data respectively as the target entries data and target outlet data
With data, other data are included in non-matched data.
Wherein, the number t of transposition be unmatched number of characters, the setting value can be but not limited to 0.8,0.9 or
0.95, and work as the similarity and be greater than the setting value, then determine the license plate in the license plate number and entry data in outlet data
It number is consistent.
It should be noted that national freeway toll station and the spatial data on Provincial administrative division boundary will use
Unified space coordinates, avoid as space coordinates are inconsistent and caused by positional shift, calculating charge station transprovincially
When distance D, national freeway toll station and Provincial administrative division boundary can be superimposed upon in same map window, be adopted
The mode for manually extracting charge station transprovincially calls the tool of GIS platform to calculate the distance between charge station transprovincially.
Step S120: by the sample data entry data and the outlet data perform mathematical calculations it is comprehensive to obtain
Index is closed, and the overall target is counted to obtain correlation metric.
Wherein, the outlet data can include but is not limited to: outlet charge station coding, outlet license plate number, Outlet time,
Export vehicle, the total number of axle of outlet vehicle, outlet vehicle goods gross weight and outlet vehicle weight limitation, the entry data may include but not
Be limited to entrance charge station coding, entrance license plate number, entry time, entrance vehicle, the total number of axle of entrance vehicle, entrance vehicle goods gross weight with
And entrance vehicle weight limitation.
By in the sample data entry data and the outlet data perform mathematical calculations to obtain overall target, and
The step of being counted the overall target to obtain correlation metric include:
Utilize the entry time T in entry dataEntrance, entrance vehicle CEntrance, the total number of axle A of entrance vehicleEntrance, entrance vehicle goods it is total
Weight WEntranceAnd entrance vehicle weight limitation LWEntranceSubtract the Outlet time T in corresponding outlet dataOutlet, outlet vehicle COutlet, outlet vehicle
Total number of axle AOutlet, outlet vehicle goods gross weight WOutletAnd outlet vehicle weight limitation LWEntranceObtain overall target:
Dtime=TEntrance-TOutlet
Dcar=CEntrance-COutlet
Daxis=AEntrance-AOutlet
Dweight=WEntrance-WOutlet
Dlimitweight=LWEntrance-LWOutlet
Wherein, DtimeFor entrance time difference, DcarFor entrance vehicle is poor, DaxisFor the total number of axle of entrance vehicle it is poor,
DweightFor the total method of double differences of entrance vehicle goods, DlimitweightIt is poor for entrance vehicle weight limitation;
Statistic of classification goes out the entrance time difference Dtime, entrance vehicle difference Dcar, the total number of axle difference D of entrance vehicleaxis、
The total method of double differences D of entrance vehicle goodsweight, entrance vehicle weight limitation difference DlimitweightFeature distribution, and choose judge outlet data and
The whether matched correlation metric of entry data.
Step S130: the correlation metric is handled to obtain candidate characteristic set, and to the candidate characteristic set
Significance level evaluation is carried out to obtain target signature collection, and to the target signature collection be allocated to obtain training characteristics collection and
Test feature collection.
Wherein, the mode for obtaining training characteristics collection and test feature collection to the target signature and being allocated is can be with
It is to the target signature and to be allocated to obtain training characteristics collection and test feature collection according to a setting ratio, for example, can be with
It is to the target signature and to be allocated to obtain training characteristics collection and test feature collection according to 8: 2 ratio or 7: 3 ratio.
The correlation metric is handled to obtain candidate characteristic set, and important journey is carried out to the candidate characteristic set
Degree evaluation to include: the step of obtaining target signature collection
With nondimensionalization, qualitative features quantification, quantitative characteristic binaryzation and discrete features coding method to discrepancy
Mouth time difference Dtime, entrance vehicle difference Dcar, the total number of axle difference D of entrance vehicleaxis, the total method of double differences D of entrance vehicle goodsweight, enter and leave
Mouth vehicle weight limitation difference DlimitweightIt is handled, forms highway goods stock matching candidate feature set transprovincially;
The significance level of the candidate characteristic set is evaluated to obtain target using correlation coefficient process or variance back-and-forth method
Feature set.
Specifically, in the present embodiment, to the entrance time difference Dtime, entrance vehicle difference Dcar, entrance vehicle
Total number of axle difference DaxisThe One-Hot coding is respectively adopted to be handled, and to the total method of double differences D of the entrance vehicle goodsweightWith
Entrance vehicle weight limitation difference DlimitweightSection is carried out respectively to zoom in the section of [- 1,1] to obtain highway freight
Matching candidate feature set transprovincially.
Step S140: a variety of default machine learning algorithms are used to be trained study to deserved the training characteristics collection
To a variety of matching algorithm models transprovincially, and a kind of the first algorithm mould of conduct is chosen from a variety of models of matching algorithm transprovincially
Type.
Wherein, a variety of default machine learning algorithms can include but is not limited to logistic regression, K closes on value, support vector machines,
Naive Bayesian, decision tree, random forest and gradient are promoted.
To the training characteristics collection use a variety of default machine learning algorithms be trained study with it is corresponding obtain it is a variety of across
Matching algorithm model is saved, and chooses a kind of the step of being used as the first algorithm model packet from a variety of models of matching algorithm transprovincially
It includes:
Value, support vector machines, naive Bayesian, decision tree, random forest and ladder are closed on using logistic regression, K respectively
Degree hoisting machine learning algorithm is trained study to feature set, and calculates the accuracy rate score of each model.
Using a corresponding model of accuracy rate highest scoring in each model as the first algorithm model.
Step S150: using the test feature collection to assess to obtain assessment result first algorithm model,
And first algorithm model is adjusted to obtain target algorithm model according to the assessment result.
Specifically, the test feature collection is used to assess to obtain assessment result first algorithm model, and
First algorithm model is calibrated to include: the step of obtaining target algorithm model according to the assessment result
First algorithm model is tested using the test feature collection, and draws learning curve, ROC curve,
Calculate AUC value.
According to learning curve, ROC curve and AUC value, the fitting state of first algorithm model is judged.
It is adjusted according to parameter and characteristic variable of the fitting state to the first algorithm model to obtain target algorithm
Model.
Step S160: in charge station transprovincially entry data to be processed and outlet data to be processed using the target calculate
Method model is matched to obtain matching result.
Specifically, by the way that the entry data to be processed and outlet data to be processed are input to the target algorithm mould
Type, so that the target algorithm model matches the outlet data and entry data.
Step S170: license plate number and outlet data to be processed in entry data to be processed corresponding to the matching result
In license plate number carry out similarity calculation to obtain similarity result, and based on the similarity result to the matching result into
Row optimization.
Wherein, in the license plate number and outlet data to be processed in entry data to be processed corresponding to the matching result
License plate number, which carries out similarity calculation, can be in a manner of obtaining similarity result using JaroWinkler Distance algorithm
Similarity calculation is carried out, is also possible to carry out similarity calculation using Levenshtein Distance algorithm, according to practical need
It asks and is chosen, is not specifically limited herein.
Optionally, in the present embodiment, according to the license plate number and outlet data to be processed in the entry data to be processed
In license plate number carry out similarity calculation to obtain similarity result, and based on the similarity result to the matching result into
Row optimization the step of include:
It is the vehicle in the license plate number and outlet data to be processed in the corresponding entry data to be processed of matching to matching result
The trade mark using JaroWinklerDistance algorithm carries out that similarity is calculated.
When the similarity is greater than a preset value, then the matching result is maintained, otherwise, the matching result is carried out
Modification.
Wherein, which can be 0.7,0.75,0.8,0.85 or 0.9, be not specifically limited herein, according to practical need
It asks and is configured.
Highway can be effectively solved by above-mentioned setting effectively to realize the matching transprovincially of highway goods stock
Charge data transprovincially segmentation problem and as Car license recognition is not complete or identification mistake caused by can not directly by license plate number into
Row goods stock transprovincially matching problem, so that complete driving path of the goods stock on national highway is restored, for high speed
The analysis decisions such as highway transportation statistics analysis, economic operation analysis provide base support.
It in the present embodiment, include using Guizhou Xin Zhai charge station as outlet station transprovincially and Guangxi with the charge station transprovincially
Qian Guiliuzhai charge station for access station transprovincially as being illustrated.Goods stock in June, 2017 Guizhou prestored is obtained respectively
Xin Zhai charge station outlet data and charge station, the Guangxi stockaded village Qian Guiliu entry data, and reject license plate number be it is empty, data target is empty
Record, specific entry data include entrance charge station number, entrance charge station name, entrance license plate number, entry time, enter
Mouth vehicle, the total number of axle of entrance vehicle, entrance vehicle goods gross weight and entrance vehicle weight limitation, outlet data include that outlet charge station compiles
Number, outlet charge station name, outlet license plate number, Outlet time, outlet vehicle, the total number of axle of outlet vehicle, outlet vehicle goods gross weight with
And outlet vehicle weight limitation.
Guizhou Xin Zhai charge station in June, 2017 goods stock outbound data is refering to table 1:
Table 1
Charge station, the Guangxi stockaded village Qian Guiliu in June, 2017 goods stock inbound data is refering to table 2:
Table 2
Wherein, license plate number is generally made of Chinese character+letter+number, wherein first Chinese character is the abbreviation in province, second letter
Generally represent city, behind be made of letter or number, license plate number length be 7.According to the coding rule of license plate number, from outlet number
The complete license plate number record for meeting license plate number coding rule is filtered out in.
Calculate the transit time between Guizhou Xin Zhai charge station and charge station, the Guangxi stockaded village Qian Guiliu.It is charged according to Guizhou Xin Zhai
The road network structure stood with the latitude and longitude coordinates of charge station, the stockaded village Guangxi Qian Guiliu and freeway net, can calculate Guizhou Xin Zhai
Charge station is 8.4 kilometers at a distance from charge station, the Guangxi stockaded village Qian Guiliu.Do not considering freeway net topological structure abnormal conditions
Under, 60 kilometers per hour of standard, Guizhou Xin Zhai charge station and Guangxi Guizhou Province osmanthus six must not be lower than according to the minimum speed of highway
The transit time of charge station, stockaded village up to 8.4 minutes.
On the basis of the Outlet time in outlet data, entry time difference is filtered out from entry data in [- 9,9] minute
Interior entry record collection.
The similarity of outlet license plate number and entrance license plate number is calculated using JaroWinklerDistance algorithm.Calculate knot
Fruit is refering to table 3:
Table 3
If license plate number similarity 0.9 or more, determines outlet vehicle and entrance vehicle is same vehicle, be included in coupling number
According to conversely, determining that outlet vehicle and entrance vehicle for non-same vehicle, are included in non-matched data.Finally by matched data and not
The highway goods stock transprovincially matched sample data that matched data is constituted, please refers to table 4:
Table 4
By performing mathematical calculations to sample data middle outlet data and entry data, by entry and exit discretization index
Be converted to can compare, analyzable overall target.By the regularity of distribution of the analysis integrated index of the statistical graphs such as histogram, grasp
To judge, the whether matched correlation metric of entry data.It specifically calculates, the indicator difference of entry data, such as enters and leaves
Mouth time difference Dtime, entrance vehicle difference Dcar, the total number of axle difference D of entrance vehicleaxis, the total method of double differences D of entrance vehicle goodsweight, enter and leave
Mouth vehicle weight limitation difference Dlimitweight.Calculated result is refering to table 5:
Table 5
According to whether matching index, counts entry time difference Dtime, entrance vehicle difference Dcar, entrance vehicle line shaft
Number difference Daxis, the total method of double differences D of entrance vehicle goodsweight, entrance vehicle weight limitation difference DlimitweightData distribution.
Entrance time difference DtimeData distribution refering to table 6:
Table 6
Entrance vehicle difference DcarData distribution refering to table 7:
Table 7
Entrance vehicle is poor | Whether match | Sample size |
-4 | 0 | 84389 |
-3 | 0 | 193407 |
-2 | 0 | 101191 |
-1 | 0 | 129205 |
0 | 0 | 1121537 |
1 | 0 | 143759 |
2 | 0 | 101843 |
3 | 0 | 140178 |
4 | 0 | 145883 |
-4 | 1 | 6 |
-3 | 1 | 1 |
-2 | 1 | 5 |
-1 | 1 | 109 |
0 | 1 | 42986 |
1 | 1 | 2061 |
2 | 1 | 30 |
3 | 1 | 11 |
The total number of axle difference D of entrance vehicleaxisData distribution refering to table 8:
Table 8
Entrance vehicle borrows total method of double differences DweightData distribution refering to table 9:
Table 9
Entrance vehicle weight limitation difference DlimitweightData distribution refering to table 10:
Table 10
According to table 6- table 10: entrance time difference Dtime, entrance vehicle difference Dcar, the total number of axle of entrance vehicle it is poor
Daxis, the total method of double differences D of entrance vehicle goodsweight, entrance vehicle weight limitation difference DlimitweightThere are significant distribution rule for sample data
Rule, such as the entrance time difference D of sample datatimeIt concentrates within the scope of [- 5,7] minute, entrance vehicle difference DcarIt concentrates on
In [- 1,1] range, the total number of axle difference D of entrance vehicleaxisIt concentrates in [- 1,1] range, the total method of double differences D of entrance vehicle goodsweight、
Entrance vehicle weight limitation difference DlimitweightAlso it all concentrates in a certain range.It is thereby possible to select entrance time difference Dtime, go out
Entrance vehicle difference Dcar, the total number of axle difference D of entrance vehicleaxis, the total method of double differences D of entrance vehicle goodsweight, entrance vehicle weight limitation it is poor
DlimitweightAs judging highway goods stock matched correlation metric transprovincially.
For entrance time difference Dtime, entrance vehicle difference Dcar, the total number of axle difference D of entrance vehicleaxisThree indexs are adopted
It is encoded with One-Hot, converts the binary feature comprising multiple mode bits for Discrete Eigenvalue;For the total method of double differences of entrance vehicle goods
Dweight, entrance vehicle weight limitation difference DlimitweightTwo indices carry out section scaling using MinMaxScalar, uniformly zoom to
In the section of [- 1,1], to form highway goods stock matching candidate feature set transprovincially, specifically include
WeightDiffefence, LimitWeightDifference, TD_-14, TD_-13, TD_12, TD_-11,
TD_10, TD_9, TD_8, TD_7, TD_6, TD_5, TD_4, TD_3, TD_2, TD_1, TD_0, TD_1, TD_2, TD_3, TD_4,
TD_5, TD_6, TD_7, TD_8, TD_9, TD_10, TD_11, TD_12, TD_13, TD_14, AD_-4, AD_-3, AD_-2, AD_-
1, AD_0, AD_1, AD_2, AD_3, AD_4, MD_4, MD_-3, MD_-2, MD_-1, MD_0, MD_1, MD_2, MD_3, MD_4 },
Wherein, the feature of TD_ beginning is the entrance time difference using the index generated after One-Hot coding, and the feature of MD_ beginning is
For entrance vehicle difference using the index generated after One-Hot coding, the feature of AD_ beginning is that the total number of axle difference of entrance vehicle uses
The index generated after One-Hot coding.
It is evaluated using significance level of the Random Forest method to candidate characteristic set.Evaluation result please refers to table
11:
Table 11
According to the significance level ranking of feature, before ranking 10 feature is chosen as target signature collection, and according to 0.8 He
0.2 ratio is split target signature collection, and 80% data are for training using as training characteristics collection, 20% data are used
In test to be modeled as test feature collection, and using training characteristics collection.
I.e. to include TD_0, WeightDifference, LimitWeightDifference, MD_0, AD_0, MD_1,
The target signature collection of AD_-2, AD-1, AD_1, MD-3 close on value using logistic regression, K, support vector machines, naive Bayesian, determine
The machine learning algorithms such as plan tree, random forest, gradient promotion are trained, and calculate predictablity rate.Calculated result is refering to table
12:
Table 12
Serial number | Algorithm title | Predictablity rate |
1 | RandomForest | 0.985375 |
2 | LogisticRegression | 0.975025 |
3 | KNN | 0.075936 |
4 | GradientBoosting | 0.986650 |
5 | AdaBoosting | 0.986975 |
6 | DecisionTree | 0.985550 |
7 | GaussianNativeByes | 0.900350 |
8 | SVC | 0.974725 |
As can be seen from Table 12, the predictablity rate of AdaBoosting algorithm is higher compared to other algorithms, therefore selects
First algorithm model of the AdaBoosting algorithm as highway goods stock transprovincially matching problem.
The first algorithm model is tested using test feature collection, test result is refering to table 13:
Table 13
Specifically, calculating the methods of AUC value by drawing learning curve, ROC curve and commenting the first algorithm model
Estimate, master of ceremonies's algorithm model model is optimized accordingly according to assessment result and improves to obtain target algorithm model.
It is 0.9798609318093167 with the AUC value that AdaBoosting algorithm is calculated, ROC curve please refers to
Fig. 4 can be seen that AdaBoosting algorithm from ROC curve and AUC value and be highly suitable for solving highway goods stock
Matching problem transprovincially.
The target algorithm model in charge station, trained Guizhou Xin Zhai charge station to the Guangxi stockaded village Qian Guiliu in June, 2017 is answered
For differentiating whether the highway goods stock in July, 2017, August transprovincially matches, the generalization ability of testing model.Statistics knot
Fruit is shown in Table 14:
Table 14
Can be seen that model integrally from 14 statistical result of table has good generalization ability, but the prediction to matched data
Accuracy rate is relatively low.
Therefore, solve the problems, such as that model is indifferent to exact matching data generaliza-tion using license plate number similarity algorithm, into
One step improves highway goods stock matching precision transprovincially.Calculated result is refering to table 15:
Table 15
If entry and exit license plate number similarity is greater than 0.8, model is maintained to determine as a result, otherwise, then determining to tie by model
Fruit modification is modified.Therefore, indifferent to exact matching data generaliza-tion using license plate number similarity algorithm solution model
Problem further increases highway goods stock matching precision transprovincially.Specifically: to model be judged as it is matched as a result, by
According to license plate number similarity calculating method, the similarity of outlet license plate number and entrance license plate is calculated, if entry and exit license plate number is similar
Degree is greater than a setting value, then maintains model to determine as a result, otherwise, then model judgement result is modified.
Incorporated by reference to Fig. 5, on the basis of the above, the present invention also provides a kind of highway goods stock transprovincially coalignments
100, including obtain module 110, computing module 120, processing module 130, training module 140, evaluation module 150, matching module
160 and optimization module 170.
The entry data and outlet for obtaining module 110 and being used to obtain the corresponding goods stock of charge station transprovincially prestored
Data, matching degree based on the license plate number in the license plate number and entry data in the outlet data generate include matched data and
The sample data of non-matched data.In the present embodiment, the acquisition module 110 can be used for executing step S110 shown in Fig. 2,
The description to step S110 above is referred to about the specific descriptions for obtaining module 110.
The computing module 120 is used for the entry data and outlet data progress mathematics fortune in the sample data
It calculates to obtain overall target, and the overall target is counted to obtain correlation metric.In the present embodiment, the meter
Calculating module 120 can be used for executing step S120 shown in Fig. 2, before the specific descriptions about the computing module 120 are referred to
Description of the text to step S120.
The processing module 130 is used to handle the correlation metric to obtain candidate characteristic set, and to described
Candidate characteristic set carries out significance level evaluation to obtain target signature collection, and is allocated and is instructed to the target signature collection
Practice feature set and test feature collection.In the present embodiment, the processing module 130 can be used for executing step S130 shown in Fig. 2
It is referred to above in the specific descriptions of the processing module 130 to step S130.
The training module 140 is used to be trained using a variety of default machine learning algorithms to the training characteristics collection
It practises and a variety of matching algorithm models transprovincially is obtained with correspondence, and choose from a variety of models of matching algorithm transprovincially and a kind of to be used as the
One algorithm model.In the present embodiment, the training module 140 can be used for executing step S140 shown in Fig. 2, about the instruction
The specific descriptions for practicing module 140 are referred to the description to step S140 above.
The evaluation module 150 is for using the test feature collection to assess to obtain first algorithm model
Assessment result, and first algorithm model is adjusted to obtain target algorithm model according to the assessment result.In this reality
It applies in example, the evaluation module 150 can be used for executing step S150 shown in Fig. 2, about specifically retouching for the evaluation module 150
It states and is referred to the description to step S150 above.
The matching module 160 is used for the entry data to be processed and outlet data to be processed use in charge station transprovincially
The target algorithm model is matched to obtain matching result.In the present embodiment, the matching module 160 can be used for executing
Step S160 shown in Fig. 2, the specific descriptions about the matching module 160 are referred to the description to step S160 above.
The optimization module 170 is used for in the license plate number and outlet data to be processed in the entry data to be processed
License plate number carries out similarity calculation to obtain the vehicle in similarity result entry data to be processed corresponding to the matching result
License plate number in the trade mark and outlet data to be processed carries out similarity calculation to obtain similarity result, and is based on the similarity
As a result the matching result is optimized.In the present embodiment, the optimization module 170 can be used for executing step shown in Fig. 2
Rapid S170, the specific descriptions about the optimization module 170 are referred to the description to step S170 above.
To sum up, a kind of highway goods stock provided by the invention transprovincially matching process and device is prestored by obtaining
The corresponding goods stock of charge station transprovincially entry data and outlet data, target is obtained based on entry data and outlet data
Algorithm model, in charge station transprovincially entry data to be processed and outlet data to be processed using target algorithm model carry out
It is equipped with to obtain matching result, and is carried out according to the license plate number in the license plate number and outlet data to be processed in entry data to be processed
Similarity calculation optimizes the matching result based on the similarity result with obtaining similarity result.By upper
Setting is stated with effective guarantee highway goods stock transprovincially matched accuracy, can effectively solve expressway tol lcollection data across
It saves segmentation problem and goods stock directly can not be carried out by license plate number as caused by Car license recognition is not complete or identification mistake
Matching problem transprovincially.
It, can be with if the function is realized and when sold or used as an independent product in the form of software function module
It is stored in a computer readable storage medium.Based on this understanding, technical solution of the present invention is substantially in other words
The part of the part that contributes to existing technology or the technical solution can be embodied in the form of software products, the meter
Calculation machine software product is stored in a storage medium, including some instructions are used so that a computer equipment (can be a
People's computer, electronic equipment or network equipment etc.) execute all or part of step of each embodiment the method for the present invention
Suddenly.
Claims (10)
1. a kind of highway goods stock transprovincially matching process, which is characterized in that the described method includes:
The entry data and outlet data for obtaining the corresponding goods stock of charge station transprovincially prestored, based in the outlet data
License plate number and entry data in license plate number matching degree generate include matched data and non-matched data sample data;
By in the sample data entry data and the outlet data perform mathematical calculations to obtain overall target, and to institute
It states overall target to be counted to obtain correlation metric, wherein the mathematical operation is subtraction;
The correlation metric is handled to obtain candidate characteristic set, and significance level is carried out to the candidate characteristic set and is commented
Valence is to obtain target signature collection, and is allocated to obtain training characteristics collection and test feature collection to the target signature collection;
To the training characteristics collection use a variety of default machine learning algorithms be trained study with it is corresponding obtain it is a variety of transprovincially
With algorithm model, and a kind of the first algorithm model of conduct is chosen from a variety of models of matching algorithm transprovincially;
The test feature collection is used to assess first algorithm model to obtain assessment result, and according to the assessment knot
Fruit is adjusted to obtain target algorithm model first algorithm model;
To in charge station transprovincially entry data to be processed and outlet data to be processed using the target algorithm model carry out
It is equipped with to obtain matching result;
The license plate number in license plate number and outlet data to be processed in entry data to be processed corresponding to the matching result into
Row similarity calculation optimizes the matching result based on the similarity result with obtaining similarity result.
2. highway goods stock according to claim 1 transprovincially matching process, which is characterized in that the acquisition prestores
The corresponding goods stock of charge station transprovincially entry data and outlet data, based in the outlet data license plate number with enter
The matching degree of license plate number in mouthful data generates the step of matched data and non-matched data and includes:
Charge station's information transprovincially is obtained according to freeway net topological structure, charge station location and provincial administrative area boundary, and
Obtain the goods stock outlet data corresponding with the information of charge station transprovincially and entry data prestored;
There to be the corresponding outlet data of complete license plate number as target outlet data in outlet data, using preset algorithm from institute
State time and the entrance searched in and outlet data consistent with the complete license plate number in the target outlet data in entry data
The difference of time in data is located at the entry data of a setting time range as target entries data, and by the target entries
Data and with the target outlet data of the target entries Data Matching as matched data, using other data as mismatching number
According to.
3. highway goods stock according to claim 2 transprovincially matching process, which is characterized in that will be in outlet data
With the corresponding outlet data of complete license plate number as target outlet data, searched from the entry data using preset algorithm
The difference of the time in time and entry data with the complete license plate number in the target outlet data in consistent and outlet data
The entry data that value is located at a setting time range enters as target entries data, and by the target entries data and with the target
The target outlet data of mouthful Data Matching include: using other data as the step of non-matched data as matched data
License plate number L in outlet data is calculated using JaroWinklerDistance algorithmOutletWith the license plate number in entry data
LEntranceSimilarity Slicense:
Slicense=Sj+(lp(1-Sj))
Wherein, m LOutletAnd LEntranceMatched number of characters, t are the number of transposition;
V kilometers per hour of standard must not be lower than according to the minimum speed of highway, according to the distance D of charge station transprovincially, calculated
From outlet charge station to the running time T of entrance charge station:
T=(D/V) × 60
Entry data is screened based on the running time T, is subtracted out when screening obtains the entry time in entry data
The Outlet time of mouthful data is in time interval [- T, T] range and license plate number similarity SlicenseWhen more than a setting value, sentence
The corresponding vehicle of fixed corresponding outlet data and the corresponding vehicle of entry data are same vehicles, and by the entry data and with
The matched outlet data of the entry data is included in matched data respectively as the target entries data and target outlet data,
Other data are included in non-matched data.
4. highway goods stock according to claim 1 transprovincially matching process, which is characterized in that the entry data
Including entrance charge station coding, entrance license plate number, entry time, entrance vehicle, the total number of axle of entrance vehicle, entrance vehicle goods gross weight with
And entrance vehicle weight limitation, the outlet data include outlet charge station coding, outlet license plate number, Outlet time, outlet vehicle, go out
The total number of axle of mouthful vehicle, outlet vehicle goods gross weight and outlet vehicle weight limitation, by the sample data entry data and it is described go out
Mouth data perform mathematical calculations to obtain overall target, and are counted to the overall target to obtain the step of correlation metric
Suddenly include:
According to the entry time T in the sample data in entry dataEntrance, entrance vehicle CEntrance, the total number of axle A of entrance vehicleEntrance、
Entrance vehicle goods gross weight WEntranceAnd entrance vehicle weight limitation LWEntranceSubtract the Outlet time T in corresponding outlet dataOutlet, outlet vehicle
COutlet, outlet the total number of axle A of vehicleOutlet, outlet vehicle goods gross weight WOutletAnd outlet vehicle weight limitation LWEntranceObtain overall target:
Dtime=TEntrance-TOutlet
Dcar=CEntrance-COutlet
Daxis=AEntrance-AOutlet
Dweight=WEntrance-WOutlet
Dlimitweight=LWEntrance-LWOutlet
Wherein, DtimeFor entrance time difference, DcarFor entrance vehicle is poor, DaxisFor the total number of axle of entrance vehicle is poor, DweightFor
The total method of double differences of entrance vehicle goods, DlimitweightIt is poor for entrance vehicle weight limitation;
Statistic of classification goes out the entrance time difference Dtime, entrance vehicle difference Dcar, the total number of axle difference D of entrance vehicleaxis, enter and leave
The total method of double differences D of mouth vehicle goodsweight, entrance vehicle weight limitation difference DlimitweightFeature distribution, and choose and judge outlet data and entrance
The whether matched correlation metric of data.
5. highway goods stock according to claim 4 transprovincially matching process, which is characterized in that the correlation
Index is handled to obtain candidate characteristic set, and carries out significance level evaluation to the candidate characteristic set to obtain target signature
The step of collection includes:
When with nondimensionalization, qualitative features quantification, quantitative characteristic binaryzation and discrete features coding method to entrance
Between difference Dtime, entrance vehicle difference Dcar, the total number of axle difference D of entrance vehicleaxis, the total method of double differences D of entrance vehicle goodsweight, entrance vehicle
Freight weight limit difference DlimitweightIt is handled, forms highway goods stock matching candidate feature set transprovincially;
The significance level of the candidate characteristic set is evaluated to obtain target signature using correlation coefficient process or variance back-and-forth method
Collection.
6. highway goods stock according to claim 5 transprovincially matching process, which is characterized in that use dimensionless
Change, qualitative features quantification, quantitative characteristic binaryzation, One-Hot coding and/or discrete features coding method are to the entrance time
Poor Dtime, entrance vehicle difference Dcar, the total number of axle difference D of entrance vehicleaxis, the total method of double differences D of entrance vehicle goodsweightWith entrance vehicle
Freight weight limit difference DlimitweightHandled, formed highway goods stock transprovincially matching candidate feature set the step of include:
To the entrance time difference Dtime, entrance vehicle difference Dcar, the total number of axle difference D of entrance vehicleaxisIt is respectively adopted described
One-Hot coding is handled, and to the total method of double differences D of the entrance vehicle goodsweightWith entrance vehicle weight limitation difference Dlimitweight
Section is carried out respectively to zoom in the section of [- 1,1] to obtain highway goods stock matching candidate feature set transprovincially.
7. highway goods stock according to claim 1 transprovincially matching process, which is characterized in that special to the training
Collection use a variety of default machine learning algorithms to be trained study and obtains a variety of matching algorithm models transprovincially with correspondence, and from institute
Stating the step of a kind of the first algorithm model of conduct is chosen in a variety of models of matching algorithm transprovincially includes:
Value, support vector machines, naive Bayesian, decision tree, random forest and gradient is closed on using logistic regression, K respectively to mention
Liter machine learning algorithm is trained study to the training characteristics collection and obtains corresponding model, and calculates the accuracy rate of each model
Score;
Using a corresponding model of accuracy rate highest scoring in each model as the first algorithm model.
8. highway goods stock according to claim 1 transprovincially matching process, which is characterized in that use the test
Feature set carries out assessment to first algorithm model and is handled first algorithm model to obtain according to assessment result
Include: to the step of target algorithm model
First algorithm model is tested using the test feature collection, and draws learning curve, ROC curve, is calculated
AUC value;
According to learning curve, ROC curve and AUC value, the fitting state of first algorithm model is judged;
It is adjusted according to parameter and characteristic variable of the fitting state to the first algorithm model to obtain target algorithm model.
9. highway goods stock according to claim 1 transprovincially matching process, which is characterized in that according to described wait locate
Manage entry data in license plate number and outlet data to be processed in license plate number carry out similarity calculation to obtain similarity result,
And the step of being optimized based on the similarity result to the matching result, includes:
It is the license plate number in the license plate number and outlet data to be processed in the corresponding entry data to be processed of matching to matching result
It carries out that similarity is calculated using JaroWinklerDistance algorithm;
When the similarity is greater than a preset value, then the matching result is maintained, otherwise, the matching result is repaired
Change.
10. a kind of highway goods stock transprovincially coalignment characterized by comprising
Module is obtained to be based on for obtaining the entry data and outlet data of the corresponding goods stock of charge station transprovincially prestored
It includes matched data and mismatch number that the matching degree of the license plate number in license plate number and entry data in the outlet data, which generates,
According to sample data;
Computing module, for by the sample data entry data and the outlet data perform mathematical calculations it is comprehensive to obtain
Index is closed, and the overall target is counted to obtain correlation metric, wherein the mathematical operation is subtraction;
Processing module, for being handled the correlation metric to obtain candidate characteristic set, and to the candidate characteristic set
Significance level evaluation is carried out to obtain target signature collection, and to the target signature collection be allocated to obtain training characteristics collection and
Test feature collection;
Training module, for using a variety of default machine learning algorithms to be trained study to deserved the training characteristics collection
To a variety of matching algorithm models transprovincially, and a kind of the first algorithm mould of conduct is chosen from a variety of models of matching algorithm transprovincially
Type;
Evaluation module, for using the test feature collection to assess to obtain assessment result first algorithm model,
And first algorithm model is adjusted to obtain target algorithm model according to the assessment result;
Matching module, for in charge station transprovincially entry data to be processed and outlet data to be processed using the target calculate
Method model is matched to obtain matching result;
Optimization module, for being carried out to the license plate number in the license plate number and outlet data to be processed in the entry data to be processed
Similarity calculation is to obtain license plate number in similarity result entry data to be processed corresponding to the matching result and wait locate
License plate number in reason outlet data carries out similarity calculation to obtain similarity result, and based on the similarity result to described
Matching result optimizes.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201810664921.6A CN108764375B (en) | 2018-06-25 | 2018-06-25 | Highway goods stock transprovincially matching process and device |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201810664921.6A CN108764375B (en) | 2018-06-25 | 2018-06-25 | Highway goods stock transprovincially matching process and device |
Publications (2)
Publication Number | Publication Date |
---|---|
CN108764375A CN108764375A (en) | 2018-11-06 |
CN108764375B true CN108764375B (en) | 2019-05-03 |
Family
ID=63977506
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201810664921.6A Active CN108764375B (en) | 2018-06-25 | 2018-06-25 | Highway goods stock transprovincially matching process and device |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN108764375B (en) |
Families Citing this family (7)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN109871513B (en) * | 2019-01-28 | 2023-03-31 | 重庆首讯科技股份有限公司 | Method and system for calculating similarity of vehicle passing behaviors on highway |
CN110335473B (en) * | 2019-08-14 | 2021-05-11 | 中国联合网络通信集团有限公司 | Identity recognition method and system for vehicle with license plate covered |
CN112419721B (en) * | 2020-11-18 | 2021-06-08 | 交通运输部科学研究院 | Method and device for calculating key indexes of transportation of vehicles on highway and electronic equipment |
CN115408705A (en) * | 2021-02-20 | 2022-11-29 | 支付宝(杭州)信息技术有限公司 | Vehicle passing data processing method, device, equipment and system |
CN112991134B (en) * | 2021-05-11 | 2021-08-27 | 交通运输部科学研究院 | Driving path reduction measuring and calculating method and device and electronic equipment |
CN114446045A (en) * | 2021-09-14 | 2022-05-06 | 武汉长江通信智联技术有限公司 | Method for studying and judging illegal transportation behaviors of vehicles on highway in epidemic situation period |
CN116821721B (en) * | 2023-07-03 | 2024-04-02 | 上海金润联汇数字科技有限公司 | Method, device, equipment and medium for identifying cross-city network about car |
Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN105844721A (en) * | 2016-01-27 | 2016-08-10 | 吴加强 | Non-stop charging method for driving in and out of expressway |
CN107146292A (en) * | 2017-04-28 | 2017-09-08 | 成都通甲优博科技有限责任公司 | A kind of freeway toll station vehicle management server, system and method |
CN107730640A (en) * | 2016-08-10 | 2018-02-23 | 西安艾润物联网技术服务有限责任公司 | Bill settlement method and device |
Family Cites Families (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN103593879B (en) * | 2013-10-08 | 2016-07-13 | 南京爱沓信息技术有限公司 | A kind of vehicle location integrated intelligent charging administration system and method |
EP3214406A1 (en) * | 2016-03-04 | 2017-09-06 | Volvo Car Corporation | Method and system for utilizing a trip history |
CN107564294A (en) * | 2017-08-25 | 2018-01-09 | 深圳前海华夏智信数据科技有限公司 | Unlicensed car recognition methods and device based on virtual car plate |
-
2018
- 2018-06-25 CN CN201810664921.6A patent/CN108764375B/en active Active
Patent Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN105844721A (en) * | 2016-01-27 | 2016-08-10 | 吴加强 | Non-stop charging method for driving in and out of expressway |
CN107730640A (en) * | 2016-08-10 | 2018-02-23 | 西安艾润物联网技术服务有限责任公司 | Bill settlement method and device |
CN107146292A (en) * | 2017-04-28 | 2017-09-08 | 成都通甲优博科技有限责任公司 | A kind of freeway toll station vehicle management server, system and method |
Also Published As
Publication number | Publication date |
---|---|
CN108764375A (en) | 2018-11-06 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN108764375B (en) | Highway goods stock transprovincially matching process and device | |
CN108777004B (en) | Expressway coach transportation vehicle transprovincially matching process and device | |
CN112700072B (en) | Traffic condition prediction method, electronic device, and storage medium | |
CN107657267B (en) | Product potential user mining method and device | |
WO2017143919A1 (en) | Method and apparatus for establishing data identification model | |
Yang | On feature selection for traffic congestion prediction | |
CN105069470A (en) | Classification model training method and device | |
CN112258093A (en) | Risk level data processing method and device, storage medium and electronic equipment | |
CN106446208A (en) | Method for identifying traveling mode of smart phone by considering integrating degree of road network | |
CN103150900A (en) | Traffic jam event automatic detecting method based on videos | |
CN111028016A (en) | Sales data prediction method and device and related equipment | |
CN116504076A (en) | Expressway traffic flow prediction method based on ETC portal data | |
CN108062674A (en) | Order fraud recognition methods, system, storage medium and electronic equipment based on GPS | |
CN112507624B (en) | Inter-city road trip mode identification model construction and identification method and device | |
CN111090833A (en) | Data processing method, system and related equipment | |
CN107784411A (en) | The detection method and device of key variables in model | |
CN113379334B (en) | Road section bicycle riding quality identification method based on noisy track data | |
CN113191805B (en) | Vehicle owner replacement evaluation method, system, electronic equipment and storage medium | |
CN105930430A (en) | Non-cumulative attribute based real-time fraud detection method and apparatus | |
CN110298743A (en) | Vehicle collateral security status early warning method, apparatus, equipment and storage medium | |
CN113360845A (en) | Vehicle source transaction probability prediction method and device, electronic device and storage medium | |
CN112685610A (en) | False registration account identification method and related device | |
CN117689304B (en) | Method and device for determining transportation route of large highway | |
CN106781482A (en) | Novel intelligent traffic data processing system | |
CN112784789B (en) | Method, device, electronic equipment and medium for identifying traffic flow of road |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant |