CN110059771B - Interactive vehicle data classification method under ordering support - Google Patents

Interactive vehicle data classification method under ordering support Download PDF

Info

Publication number
CN110059771B
CN110059771B CN201910386811.2A CN201910386811A CN110059771B CN 110059771 B CN110059771 B CN 110059771B CN 201910386811 A CN201910386811 A CN 201910386811A CN 110059771 B CN110059771 B CN 110059771B
Authority
CN
China
Prior art keywords
vehicle
vehicle data
data
iteration
classification
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201910386811.2A
Other languages
Chinese (zh)
Other versions
CN110059771A (en
Inventor
罗月童
吴帅
汪涛
闵海
周波
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Hefei University of Technology
Original Assignee
Hefei University of Technology
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Hefei University of Technology filed Critical Hefei University of Technology
Priority to CN201910386811.2A priority Critical patent/CN110059771B/en
Publication of CN110059771A publication Critical patent/CN110059771A/en
Application granted granted Critical
Publication of CN110059771B publication Critical patent/CN110059771B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/21Design or setup of recognition systems or techniques; Extraction of features in feature space; Blind source separation
    • G06F18/214Generating training patterns; Bootstrap methods, e.g. bagging or boosting
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/24Classification techniques
    • G06F18/241Classification techniques relating to the classification model, e.g. parametric or non-parametric approaches
    • G06F18/2411Classification techniques relating to the classification model, e.g. parametric or non-parametric approaches based on the proximity to a decision surface, e.g. support vector machines
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V2201/00Indexing scheme relating to image or video recognition or understanding
    • G06V2201/08Detecting or categorising vehicles

Landscapes

  • Engineering & Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • Theoretical Computer Science (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Artificial Intelligence (AREA)
  • Evolutionary Biology (AREA)
  • Evolutionary Computation (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)

Abstract

The invention discloses an interactive vehicle data classification method under the support of sequencing, which comprises the following steps: 1. obtaining a vehicle data training model; 2. acquiring vehicle recommendation data; 3. optimizing a vehicle classification model and a vehicle sequencing model; 4. evaluating the quality of the vehicle classification model; 5. and observing whether the vehicle classification result is satisfactory or not through related parameters in the interface, if so, accepting the result, and otherwise, returning to the step 1 to continue iteration. The invention can solve the problem that the vehicle classification has no high-quality training sample and the boundary points between the vehicle classification classes are difficult to distinguish, thereby optimizing a vehicle classification model and realizing the accurate classification of the vehicle data set to be classified.

Description

Interactive vehicle data classification method under ordering support
Technical Field
The invention relates to the field of interactive classification, in particular to a vehicle classification interactive task with a sequential relationship among vehicle categories.
Background
In the big data era, classification is one of the most fundamental data analysis techniques and data tasks. Although many automatic vehicle data classification methods have been proposed, none of them is suitable for all application scenarios, and the automatic vehicle classification method in the form of "black box system" also affects its interpretability and credibility, especially for high-dimensional complex data such as images, videos, etc. One of the reasons that affect interpretability and reliability is: users tend to interpret information and measure acquaintance according to high-level features, and the automatic classification algorithm relies on the bottom-level features for classification, so that a semantic gap is formed. Multiple scholars think that users are allowed to participate in the classification process, the domain knowledge of the users can be integrated into the classification algorithm, and the interpretability and the reliability of the classification are improved.
Vehicle classification algorithms based on machine learning, particularly vehicle classification algorithms based on deep learning, have been excellent in many fields in recent years and have become mainstream vehicle classification algorithms. The vehicle classification algorithm based on machine learning learns and constructs vehicle classification rules from training samples, and because the vehicle training samples are generally made by users according to own domain knowledge, the vehicle training samples can be considered to contain the domain knowledge for classification, so that the domain knowledge is indirectly merged into the vehicle classification algorithm through the training algorithm, and the semantic gap problem can be partially overcome. However, the vehicle classification algorithm based on machine learning needs enough high-quality vehicle training samples, a large number of high-quality vehicle training samples are not easily available, and the existing many exploratory vehicle data analyses have no vehicle training samples, so that the user cannot be helped to obtain the high-quality vehicle training samples, and the vehicle classification effect is poor. In addition, it is difficult to distinguish the boundary points between the vehicle classification categories and classify them.
Disclosure of Invention
The invention provides an interactive vehicle data classification method under the support of sequencing, aiming at solving the problems of the prior art, and aiming at solving the problems that the vehicle classification has no high-quality training sample and the boundary points among the vehicle classification classes are difficult to distinguish, so that a vehicle classification model is optimized and the accurate classification of a vehicle data set to be classified is realized.
In order to achieve the purpose, the invention adopts the following technical scheme:
the invention relates to an interactive vehicle data classification method under the support of sequencing, which is characterized by comprising the following steps:
step 1, obtaining a vehicle data training model:
step 1.1, acquiring n pieces of vehicle data from a vehicle data set S to be classified, and setting the vehicle safety category of the ith piece of vehicle data as piAnd p isi∈{L1,L2,…,Lh,…,LHIn which, { L }1,L2,…,Lh,…,LHDenotes a set of vehicle safety categories, LhIndicates the h-th vehicle safety category, and let the h-th vehicle safety category LhIs higher than the h +1 th vehicle safety class Lh+1(ii) a Thus, n pieces of vehicle data with vehicle safety classification are obtained and are marked as { g1,g2,…,gi,…,gn},giIndicating the ith vehicle data with vehicle safety categories, and H indicating the total number of the vehicle safety categories; i is less than or equal to n and H is less than or equal to n;
step 1.2, inputting the n vehicle data with the vehicle safety categories into a support vector machine for training to obtain an SVM vehicle classification model McInputting a vehicle data set S to be classified into the SVM vehicle classification model McIn (2), the classification result of the obtained vehicle data set S is denoted as C ═ { C ═ C1,c2,…,cj…,cNIn which c isjRepresenting the jth vehicle data g in the vehicle data set SjThe classification result of (2); n is the total number of the vehicle data in the vehicle data set S; so as to obtain a vehicle data set S', 1 is equal to or more than j is equal to or less than N with a classification result;
step 1.3, obtaining M pairs of vehicle data from the vehicle data set S' with the classification result, comparing the priority of the vehicle safety category of any M-th pair of vehicle data, and marking the obtained comparison result as QmObtaining a priority set of M pairs of vehicle data consisting of M comparison results; m is more than or equal to 1 and less than or equal to M;
step 1.4, inputting the priority set of the M pairs of vehicle data into a Ranking-SVM for training to obtain a Ranking-SVM vehicle Ranking model MrInputting the vehicle data set S' with the classification result into the Ranking-SVM vehicle Ranking model MrIn (D), the ranking result of the vehicle data set S' is recorded as D ═ D1,d2,…,dj,…,dN},djRepresenting the jth vehicle data g in the vehicle data set SjThe sorting result of (1);
sorting the sorting result D in a descending order to obtain a sorting result D ' ═ D ' after the descending order '1,d′2,…,d′j,…,d′NWherein, d'jRepresents the jth vehicle data g ' in the sorting result D ' after descending 'jSorting results after descending;
step 2, obtaining vehicle recommendation data:
step 2.1, judging two adjacent sequencing results D ' in the sequencing results D ' after descending 'jAnd d'j+1If the priority order of the corresponding vehicle data in the classification result C is larger than that of the corresponding vehicle data in the classification result C, the classification result of the corresponding two vehicle data is normal, otherwise, the classification result of the corresponding two vehicle data is in conflict; thus, all the vehicle data pairs with conflicting classification results form a vehicle recommendation data set;
step 2.2, selecting the ranking corresponding to the vehicle data with the top rank in each pair of vehicle data in the vehicle recommendation data set as the ranking of the corresponding vehicle data pair, thereby obtaining a vehicle recommendation data ranking set, and recording as T ═ T { (T)1,t2,…,tx,…,tX};txRepresenting the ranking of the X-th vehicle data pair, wherein X is more than or equal to 1 and less than or equal to X; x represents the total number of vehicle recommendation data pairs;
step 2.3, based on the assumption of the aggregation effect of the boundary points, obtaining the ranking priority P of the x-th vehicle data pair by using the formula (3)riority(tx):
Figure GDA0002662923640000031
In the formula (3), tyRepresenting the ranking of the y-th vehicle data pair, wherein y is more than or equal to 1 and less than or equal to X, and y is not equal to X;
step 3, optimizing a vehicle classification model and a vehicle sequencing model:
step 3.1, defining the iteration times as b; and initializing b as 1;
step 3.2, obtaining Z except the n pieces of vehicle data in the vehicle recommendation data setbIndividual vehicle data, and setting the vehicle safety class of the z-th vehicle data as pz,pz∈{L1,L2,…,Lh,…,LH},1≤z≤Zb(ii) a And merging the data with the n vehicle data with the vehicle safety categories into a data category set of a b-th iteration
Figure GDA0002662923640000032
Wherein
Figure GDA0002662923640000033
Set of data classes S representing the b-th iterationbKth vehicle data with vehicle safety category; k is more than or equal to 1 and less than or equal to n + Zb
Step 3.3, collecting the data category S of the b-th iterationbInputting the data into a support vector machine for training to obtain a b-th iteration SVM vehicle classification model
Figure GDA0002662923640000034
Inputting the vehicle data set S to be classified into the SVM vehicle classification model of the b-th iteration
Figure GDA0002662923640000035
In (1), the classification result of the vehicle data set S of the b-th iteration is recorded as
Figure GDA0002662923640000036
Figure GDA0002662923640000037
J-th vehicle data g in the vehicle data set S representing the b-th iterationjThe classification result of (2); thereby obtaining a vehicle data set S 'with classification results of the b-th iteration'b
Step 3.4, obtaining the Y of the b-th iteration from the vehicle recommendation data setbComparing the vehicle data and the priorities of the vehicle safety types of any y-th pair of vehicle data to obtain a comparison result QyThereby obtaining YbThe priority set of the b-th iteration formed by the comparison results; y is more than or equal to 1 and less than or equal to Yb(ii) a And combined with the priority set of the M pairs of vehicle data into an overall priority set for the b-th iterationCombining;
step 3.5, inputting the total priority set of the b-th iteration into a Ranking-SVM for training to obtain a Ranking-SVM vehicle Ranking model of the b-th iteration
Figure GDA0002662923640000038
Inputting a vehicle data set S to be classified into the b-th iteration Ranking-SVM vehicle Ranking model
Figure GDA0002662923640000039
In (1), the ordering result of the vehicle data set S of the b-th iteration is recorded as
Figure GDA00026629236400000310
Figure GDA00026629236400000311
J-th vehicle data g in the vehicle data set S representing the b-th iterationjThe sorting result of (1);
step 3.6, sequencing result D of the b-th iterationbSorting in descending order to obtain sorting result after sorting in descending order
Figure GDA0002662923640000041
Wherein the content of the first and second substances,
Figure GDA0002662923640000042
representing the sorting result D 'of the b-th iteration after descending'bJ-th vehicle data g'jThe sorting result of (1);
step 4, evaluating the quality of the vehicle classification model:
step 4.1, judging the descending sorting result D 'of the b-th iteration'bTwo adjacent sorting results
Figure GDA0002662923640000043
And
Figure GDA0002662923640000044
the number of vehicles with classification results of the corresponding vehicle data in the b-th iterationData set S'bIf the former is larger than the latter, the classification result of the corresponding two vehicle data of the b-th iteration is normal, otherwise, the classification result of the corresponding two vehicle data of the b-th iteration is in conflict; obtaining a vehicle recommendation data set of the b-th iteration formed by all the vehicle data pairs with conflicting classification results of the b-th iteration;
step 4.2, selecting the rank corresponding to the vehicle data with the top rank in each pair of vehicle data in the vehicle recommendation data set of the b-th iteration as the rank of the corresponding vehicle data pair of the b-th iteration, thereby obtaining the vehicle recommendation data rank set of the b-th iteration, and recording the rank set as the vehicle recommendation data rank set of the b-th iteration
Figure GDA0002662923640000045
Figure GDA0002662923640000046
Represents the rank of the X 'th vehicle data pair of the b-th iteration, X' is more than or equal to 1 and less than or equal to Xb;XbRepresenting a total number of vehicle recommendation data pairs for the b-th iteration;
step 4.3, obtaining the consistency P of the b-th iteration by using the formula (2)bTo evaluate the SVM vehicle classification model of the b-th iteration
Figure GDA0002662923640000047
Figure GDA0002662923640000048
In the formula (2), Pb∈[0,1];
Step 5, if PbIf < Δ, then b +1 is assigned to b, n + ZbAssigning n to M + YbAnd (3) after assigning the value to the M, returning to the step 3.2 for sequential execution, otherwise, finishing the optimization of the SVM vehicle classification model and realizing the optimal classification of the vehicle data set S to be classified.
Compared with the prior art, the invention has the beneficial effects that:
1. in some vehicle classification applications, there is a sequential relationship between vehicle classes, and the relative sequential relationship between vehicle data is easily perceived. Aiming at the vehicle classification application scene, the method improves the interactive vehicle classification method by means of the cognition of the user on the relative sequence relation among the vehicle data, thereby providing the interactive vehicle classification method supported by sequencing; by using the method, the user can mark the categories of the vehicle data as few as possible, and the sequence information among the data provided by the sequencing model is utilized, so that invalid vehicle data marks are reduced, and the marking efficiency of the vehicle data is greatly improved.
2. The invention further provides a recommendation method based on vehicle data pairs for recommending the vehicle candidate marking data. By adopting the method, the vehicle data with problems can be guaranteed to be recommended certainly, and the scale controllability of the vehicle recommended data is guaranteed.
3. The invention further optimizes the sequence of the vehicle recommendation data; at the initial stage of the recommendation method, the number of vehicle recommendation data pairs is relatively large, and each vehicle recommendation data is judged to generate great burden on a user; therefore, the method based on the aggregation degree of the candidate points is adopted to evaluate each vehicle data pair, the priority of the candidate points with lower aggregation degree is improved, and therefore the user is assisted to make corresponding decisions.
4. The invention also provides a set of new scheme for evaluating the quality of the vehicle classification result. In order to reduce the burden of the user, the user is assisted to decide whether to continuously optimize the model; according to the method, the evaluation strategy of the model consistency is adopted, so that a user has a corresponding basis for judging the quality of the current vehicle classification result, the user can more conveniently understand the result of the model, the interpretability and the reliability of the model are greatly improved, and the trained model is more easily accepted by the user.
Drawings
FIG. 1 is a flow chart of the inventive method;
FIG. 2 is a schematic diagram of a vehicle classification result quality metric view layout strategy according to the present invention.
Detailed Description
In this embodiment, an interactive vehicle data classification method under sequencing support is suitable for a vehicle classification problem with a priority sequence relationship, emphasizes and supports a sequence relationship between vehicle data input by a user and uses the sequence relationship to optimize an interactive classification process, based on this process, not only can category information of the vehicle data be set, but also a relative sequence relationship of the vehicle data can be set, and a vehicle data classification model and a vehicle data sequencing model are compared, so that an interactive classification effect is improved by adjusting the data sequence relationship, and display and evaluation of the vehicle classification model are obtained. Specifically, as shown in fig. 1, the method comprises the following steps:
step 1, obtaining a vehicle data training model:
step 1.1, acquiring n pieces of vehicle data from a vehicle data set S to be classified, and setting the vehicle safety category of the ith piece of vehicle data as piAnd p isi∈{L1,L2,…,Lh,…,LHIn which, { L }1,L2,…,Lh,…,LHDenotes a set of vehicle safety categories, LhIndicates the h-th vehicle safety category, and let the h-th vehicle safety category LhIs higher than the h +1 th vehicle safety class Lh+1(ii) a Thus, n pieces of vehicle data with vehicle safety classification are obtained and are marked as { g1,g2,…,gi,…,gn},giIndicating the ith vehicle data with vehicle safety categories, and H indicating the total number of the vehicle safety categories; i is less than or equal to n and H is less than or equal to n;
step 1.2, for the classification model, because the support vector machine SVM is a small sample learning method and the training and predicting speeds are fast, it is often adopted by the interactive classification method: on one hand, because interactive classification is difficult to require a user to label a large number of samples for training; another aspect is that interactive systems have high speed requirements. The invention also adopts a classification model based on the SVM, inputs n vehicle data with vehicle safety categories into a support vector machine for training to obtain an SVM vehicle classification model McTo be classifiedThe vehicle data set S is input into an SVM vehicle classification model McIn (2), the classification result of the obtained vehicle data set S is denoted as C ═ { C ═ C1,c2,…,cj…,cNIn which c isjRepresenting the jth vehicle data g in the vehicle data set SjThe classification result of (2); n is the total number of the vehicle data in the vehicle data set S; so as to obtain a vehicle data set S', 1 is equal to or more than j is equal to or less than N with a classification result;
step 1.3, obtaining M pairs of vehicle data from the vehicle data set S' with the classification result, comparing the priority of the vehicle safety class of any M-th pair of vehicle data, and marking the obtained comparison result as QmObtaining a priority set of M pairs of vehicle data consisting of M comparison results; m is more than or equal to 1 and less than or equal to M;
and step 1.4, for the sequencing model, the Ranking-SVM intuitively applies the classic two-classification SVM model to the sequencing problem. Inputting the priority set of the M pairs of vehicle data into a Ranking-SVM for training to obtain a Ranking-SVM vehicle Ranking model MrInputting the vehicle data set S' with the classification result into a Ranking-SVM vehicle Ranking model MrIn (D), the ranking result of the vehicle data set S' is recorded as D ═ D1,d2,…,dj,…,dN},djRepresenting the jth vehicle data g in the vehicle data set SjThe sorting result of (1);
sorting the sorting result D in a descending order to obtain a sorting result D ' ═ D ' after the descending order '1,d′2,…,d′j,…,d′NWherein, d'jRepresents the jth vehicle data g ' in the sorting result D ' after descending 'jSorting results after descending;
step 2, obtaining vehicle recommendation data:
and 2.1, if the sequence relation between the data categories obtained through the classification model is inconsistent with the sequence relation of the adjacent data obtained through the sequencing model, the current model can be considered to not process data well, and the data can be considered as vehicle recommendation data. Judging two adjacent sequencing results D ' in the sequencing result D ' after descending 'jAnd d'j+1If the priority order of the corresponding vehicle data in the classification result C is larger than that of the corresponding vehicle data in the classification result C, the classification result of the corresponding two vehicle data is normal, otherwise, the classification result of the corresponding two vehicle data is in conflict; thus, all the vehicle data pairs with conflicting classification results form a vehicle recommendation data set;
step 2.2, selecting the ranking corresponding to the vehicle data with the top rank in each pair of vehicle data in the vehicle recommendation data set as the ranking of the corresponding vehicle data pair, thereby obtaining a vehicle recommendation data ranking set, and recording as T ═ T { (T)1,t2,…,tx,…,tX};txRepresenting the ranking of the X-th vehicle data pair, wherein X is more than or equal to 1 and less than or equal to X; x represents the total number of vehicle recommendation data pairs;
and 2.3, based on the 'boundary point aggregation effect hypothesis', the data far away from the boundary can be processed preferentially, so that the data can be processed rapidly, and the model is iterated. But does not know the true classification boundary, how to measure the distance of the data point to the boundary? According to the "assumption of the aggregation effect of boundary points", if a candidate annotation data is around a boundary, there should be many other candidate annotation data nearby. Based on the recognition, the invention measures the distance from the candidate annotation data to the boundary by using the aggregation degree of the candidate annotation data, and obtains the priority P of the x-th vehicle data pair ranking by using the formula (3)riority(tx):
Figure GDA0002662923640000071
In the formula (3), tyIndicating the ranking of the y-th vehicle data pair, y is more than or equal to 1 and less than or equal to X, and y is not equal to X. Priority(tx) Indicating that the farther any data in a data pair is from the classification boundary, the higher the ranking priority of that data pair. The candidate data set displays all candidate data point pairs from high to low in priority. For example, the problem of carrying out three classifications of 'high-risk vehicles, medium-risk vehicles and low-risk vehicles' is solved. According to the formula (4), the low-risk vehicle-medium-risk vehicle and the low-risk vehicle-high-risk vehicleThe effects of coincidence on P are the same, and in practice their effects do differ. Candidate data points with confusion between different classes are thus presented by a triangular array of (m-1) × (m-1) as shown in FIG. 2, with the abscissa of the triangular array from left to right (c)1,c2,...cm-1) The ordinate is (c) from top to bottom2,c3,...cm) Display of c in the (i, j) th regioniAnd cjWith confusing candidate points appearing in between.
Step 3, optimizing the classification model and the sequencing model:
and 3.1, optimizing the model by further processing the recommended data in the recommended data set. Defining the iteration times as b; and initializing b as 1;
step 3.2, obtaining Z except n pieces of vehicle data in the vehicle recommendation data setbIndividual vehicle data, and setting the vehicle safety class of the z-th vehicle data as pz,pz∈{L1,L2,…,Lh,…,LH},1≤z≤Zb(ii) a And merging the data with n vehicle data with vehicle safety categories into a data category set of a b-th iteration
Figure GDA0002662923640000072
Wherein
Figure GDA0002662923640000073
Set of data classes S representing the b-th iterationbKth vehicle data with vehicle safety category; k is more than or equal to 1 and less than or equal to n + Zb
Step 3.3, the data category of the b-th iteration is collected SbInputting the data into a support vector machine for training to obtain a b-th iteration SVM vehicle classification model
Figure GDA0002662923640000074
Inputting a vehicle data set S to be classified into the SVM vehicle classification model of the b-th iteration
Figure GDA0002662923640000075
Obtaining the vehicle data of the b-th iterationThe classification result of the set S is recorded as
Figure GDA0002662923640000076
Figure GDA0002662923640000077
J-th vehicle data g in the vehicle data set S representing the b-th iterationjThe classification result of (2); thereby obtaining a vehicle data set S 'with classification results of the b-th iteration'b
Step 3.4, obtaining the Y of the b-th iteration from the vehicle recommendation data setbComparing the vehicle data and the priorities of the vehicle safety types of any y-th pair of vehicle data to obtain a comparison result QyThereby obtaining YbThe priority set of the b-th iteration formed by the comparison results; y is more than or equal to 1 and less than or equal to Yb(ii) a And combined with the priority set of M pairs of vehicle data into an overall priority set for the b-th iteration;
step 3.5, inputting the total priority set of the b-th iteration into a Ranking-SVM for training to obtain a Ranking-SVM vehicle Ranking model of the b-th iteration
Figure GDA0002662923640000081
Inputting a vehicle data set S to be classified into a b-th iteration Ranking-SVM vehicle Ranking model
Figure GDA0002662923640000082
In (1), the ordering result of the vehicle data set S of the b-th iteration is recorded as
Figure GDA0002662923640000083
Figure GDA0002662923640000084
J-th vehicle data g in the vehicle data set S representing the b-th iterationjThe sorting result of (1);
step 3.6, sequencing result D of the b-th iterationbSorting in descending order to obtain sorting result after sorting in descending order
Figure GDA0002662923640000085
Wherein the content of the first and second substances,
Figure GDA0002662923640000086
representing the sorting result D 'of the b-th iteration after descending'bJ-th vehicle data g'jThe sorting result of (1);
step 4, evaluating the quality of the classification model:
the interactive data classification method is an interactive drive iterative process, so the current classification result is evaluated through observed information, and further whether continuous iteration is needed or not is determined. Although the classification result can be observed, it is difficult to make an intuitive sense of the quality of the classification result.
Step 4.1, judging the descending sorting result D 'of the b-th iteration'bTwo adjacent sorting results
Figure GDA0002662923640000087
And
Figure GDA0002662923640000088
vehicle data set S 'with classification result of b-th iteration of corresponding vehicle data'bIf the former is larger than the latter, the classification result of the corresponding two vehicle data of the b-th iteration is normal, otherwise, the classification result of the corresponding two vehicle data of the b-th iteration is in conflict; obtaining a vehicle recommendation data set of the b-th iteration formed by all the vehicle data pairs with conflicting classification results of the b-th iteration;
step 4.2, selecting the rank corresponding to the vehicle data with the top rank in each pair of vehicle data in the vehicle recommendation data set of the b-th iteration as the rank of the corresponding vehicle data pair of the b-th iteration, thereby obtaining the vehicle recommendation data rank set of the b-th iteration, and recording the rank set as the vehicle recommendation data rank set of the b-th iteration
Figure GDA0002662923640000089
Figure GDA00026629236400000810
Represents the rank of the X 'th vehicle data pair of the b-th iteration, X' is more than or equal to 1 and less than or equal to Xb;XbRepresenting a total number of vehicle recommendation data pairs for the b-th iteration;
step 4.3, obtaining the consistency P of the b-th iteration by using the formula (2)bTo evaluate the SVM vehicle classification model of the b-th iteration
Figure GDA0002662923640000091
Figure GDA0002662923640000092
In the formula (2), Pb∈[0,1]Degree of agreement of model PbIndicating the degree of coincidence between the classification result and the ranking result, PbThe larger the classification result, the higher the quality of the classification result, and the better the classification model.
Step 5, if PbIf < Δ, then b +1 is assigned to b, n + ZbAssigning n to M + YbAnd (3) after assigning the value to the M, returning to the step 3.2 for sequential execution, otherwise, finishing the optimization of the SVM vehicle classification model and realizing the optimal classification of the vehicle data set S to be classified.

Claims (1)

1. A method for interactive vehicle data classification with sequencing support, comprising the steps of:
step 1, obtaining a vehicle data training model:
step 1.1, acquiring n pieces of vehicle data from a vehicle data set S to be classified, and setting the vehicle safety category of the ith piece of vehicle data as piAnd p isi∈{L1,L2,…,Lh,…,LHIn which, { L }1,L2,…,Lh,…,LHDenotes a set of vehicle safety categories, LhIndicates the h-th vehicle safety category, and let the h-th vehicle safety category LhIs higher than the h +1 th vehicle safety class Lh+1(ii) a Thus, n pieces of vehicle data with vehicle safety classification are obtained and are marked as { g1,g2,…,gi,…,gn},giIndicating the ith vehicle data with vehicle safety categories, and H indicating the total number of the vehicle safety categories; i is less than or equal to n and H is less than or equal to n;
step 1.2, inputting the n vehicle data with the vehicle safety categories into a support vector machine for training to obtain an SVM vehicle classification model McInputting a vehicle data set S to be classified into the SVM vehicle classification model McIn (2), the classification result of the obtained vehicle data set S is denoted as C ═ { C ═ C1,c2,…,cj…,cNIn which c isjRepresenting the jth vehicle data g in the vehicle data set SjThe classification result of (2); n is the total number of the vehicle data in the vehicle data set S; so as to obtain a vehicle data set S', 1 is equal to or more than j is equal to or less than N with a classification result;
step 1.3, obtaining M pairs of vehicle data from the vehicle data set S' with the classification result, comparing the priority of the vehicle safety category of any M-th pair of vehicle data, and marking the obtained comparison result as QmObtaining a priority set of M pairs of vehicle data consisting of M comparison results; m is more than or equal to 1 and less than or equal to M;
step 1.4, inputting the priority set of the M pairs of vehicle data into a Ranking-SVM for training to obtain a Ranking-SVM vehicle Ranking model MrInputting the vehicle data set S' with the classification result into the Ranking-SVM vehicle Ranking model MrIn (D), the ranking result of the vehicle data set S' is recorded as D ═ D1,d2,…,dj,…,dN},djRepresenting the jth vehicle data g in the vehicle data set SjThe sorting result of (1);
sorting the sorting result D in a descending order to obtain a sorting result D ' ═ D ' after the descending order '1,d′2,…,d′j,…,d′NWherein, d'jRepresents the jth vehicle data g ' in the sorting result D ' after descending 'jSorting results after descending;
step 2, obtaining vehicle recommendation data:
step 2.1, judging two adjacent sequencing results in the sequencing results D' after descendingd′jAnd d'j+1If the priority order of the corresponding vehicle data in the classification result C is larger than that of the corresponding vehicle data in the classification result C, the classification result of the corresponding two vehicle data is normal, otherwise, the classification result of the corresponding two vehicle data is in conflict; thus, all the vehicle data pairs with conflicting classification results form a vehicle recommendation data set;
step 2.2, selecting the ranking corresponding to the vehicle data with the top rank in each pair of vehicle data in the vehicle recommendation data set as the ranking of the corresponding vehicle data pair, thereby obtaining a vehicle recommendation data ranking set, and recording as T ═ T { (T)1,t2,…,tx,…,tX};txRepresenting the ranking of the X-th vehicle data pair, wherein X is more than or equal to 1 and less than or equal to X; x represents the total number of vehicle recommendation data pairs;
step 2.3, based on the assumption of the aggregation effect of the boundary points, obtaining the ranking priority P of the x-th vehicle data pair by using the formula (3)riority(tx):
Figure FDA0002662923630000021
In the formula (3), tyRepresenting the ranking of the y-th vehicle data pair, wherein y is more than or equal to 1 and less than or equal to X, and y is not equal to X;
step 3, optimizing a vehicle classification model and a vehicle sequencing model:
step 3.1, defining the iteration times as b; and initializing b as 1;
step 3.2, obtaining Z except the n pieces of vehicle data in the vehicle recommendation data setbIndividual vehicle data, and setting the vehicle safety class of the z-th vehicle data as pz,pz∈{L1,L2,…,Lh,…,LH},1≤z≤Zb(ii) a And merging the data with the n vehicle data with the vehicle safety categories into a data category set of a b-th iteration
Figure FDA0002662923630000022
Wherein
Figure FDA0002662923630000023
Set of data classes S representing the b-th iterationbKth vehicle data with vehicle safety category; k is more than or equal to 1 and less than or equal to n + Zb
Step 3.3, collecting the data category S of the b-th iterationbInputting the data into a support vector machine for training to obtain a b-th iteration SVM vehicle classification model
Figure FDA0002662923630000024
Inputting the vehicle data set S to be classified into the SVM vehicle classification model of the b-th iteration
Figure FDA0002662923630000025
In (1), the classification result of the vehicle data set S of the b-th iteration is recorded as
Figure FDA0002662923630000026
Figure FDA0002662923630000027
J-th vehicle data g in the vehicle data set S representing the b-th iterationjThe classification result of (2); thereby obtaining a vehicle data set S 'with classification results of the b-th iteration'b
Step 3.4, obtaining the Y of the b-th iteration from the vehicle recommendation data setbComparing the vehicle data and the priorities of the vehicle safety types of any y-th pair of vehicle data to obtain a comparison result QyThereby obtaining YbThe priority set of the b-th iteration formed by the comparison results; y is more than or equal to 1 and less than or equal to Yb(ii) a And combined with the priority set of the M pairs of vehicle data into an overall priority set for the b-th iteration;
step 3.5, inputting the total priority set of the b-th iteration into a Ranking-SVM for training to obtain a Ranking-SVM vehicle Ranking model of the b-th iteration
Figure FDA0002662923630000031
Inputting a vehicle data set S to be classified into the b-th iteration Ranking-SVM vehicle Ranking model
Figure FDA0002662923630000032
In (1), the ordering result of the vehicle data set S of the b-th iteration is recorded as
Figure FDA0002662923630000033
Figure FDA0002662923630000034
J-th vehicle data g in the vehicle data set S representing the b-th iterationjThe sorting result of (1);
step 3.6, sequencing result D of the b-th iterationbSorting in descending order to obtain sorting result after sorting in descending order
Figure FDA0002662923630000035
Wherein the content of the first and second substances,
Figure FDA0002662923630000036
representing the sorting result D 'of the b-th iteration after descending'bJ-th vehicle data g'jThe sorting result of (1);
step 4, evaluating the quality of the vehicle classification model:
step 4.1, judging the descending sorting result D 'of the b-th iteration'bTwo adjacent sorting results
Figure FDA0002662923630000037
And
Figure FDA0002662923630000038
vehicle data set S 'with classification result of corresponding vehicle data at the b-th iteration'bIf the former is larger than the latter, the classification result of the corresponding two vehicle data of the b-th iteration is normal, otherwise, the classification result of the corresponding two vehicles of the b-th iteration is normalThe classification results of the data conflict; obtaining a vehicle recommendation data set of the b-th iteration formed by all the vehicle data pairs with conflicting classification results of the b-th iteration;
step 4.2, selecting the rank corresponding to the vehicle data with the top rank in each pair of vehicle data in the vehicle recommendation data set of the b-th iteration as the rank of the corresponding vehicle data pair of the b-th iteration, thereby obtaining the vehicle recommendation data rank set of the b-th iteration, and recording the rank set as the vehicle recommendation data rank set of the b-th iteration
Figure FDA0002662923630000039
Figure FDA00026629236300000310
Represents the rank of the X 'th vehicle data pair of the b-th iteration, X' is more than or equal to 1 and less than or equal to Xb;XbRepresenting a total number of vehicle recommendation data pairs for the b-th iteration;
step 4.3, obtaining the consistency P of the b-th iteration by using the formula (2)bTo evaluate the SVM vehicle classification model of the b-th iteration
Figure FDA00026629236300000311
Figure FDA00026629236300000312
In the formula (2), Pb∈[0,1];
Step 5, if PbIf < Δ, then b +1 is assigned to b, n + ZbAssigning n to M + YbAnd (3) after assigning the value to the M, returning to the step 3.2 for sequential execution, otherwise, finishing the optimization of the SVM vehicle classification model and realizing the optimal classification of the vehicle data set S to be classified.
CN201910386811.2A 2019-05-10 2019-05-10 Interactive vehicle data classification method under ordering support Active CN110059771B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201910386811.2A CN110059771B (en) 2019-05-10 2019-05-10 Interactive vehicle data classification method under ordering support

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201910386811.2A CN110059771B (en) 2019-05-10 2019-05-10 Interactive vehicle data classification method under ordering support

Publications (2)

Publication Number Publication Date
CN110059771A CN110059771A (en) 2019-07-26
CN110059771B true CN110059771B (en) 2021-01-15

Family

ID=67322742

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201910386811.2A Active CN110059771B (en) 2019-05-10 2019-05-10 Interactive vehicle data classification method under ordering support

Country Status (1)

Country Link
CN (1) CN110059771B (en)

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2014037709A1 (en) * 2012-09-06 2014-03-13 The University Of Manchester Image processing apparatus and method for fitting a deformable shape model to an image using random forest regression voting
CN106295541A (en) * 2016-08-03 2017-01-04 乐视控股(北京)有限公司 Vehicle type recognition method and system
CN107194412A (en) * 2017-04-20 2017-09-22 百度在线网络技术(北京)有限公司 A kind of method of processing data, device, equipment and computer-readable storage medium
CN107862341A (en) * 2017-11-27 2018-03-30 南京理工大学 A kind of vehicle checking method
CN109117826A (en) * 2018-09-05 2019-01-01 湖南科技大学 A kind of vehicle identification method of multiple features fusion

Family Cites Families (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20160180194A1 (en) * 2014-12-23 2016-06-23 Qualcomm Incorporated Systems and methods for identifying a region of an image
US11080533B2 (en) * 2017-05-01 2021-08-03 Sensormatic Electronics, LLC Surveillance system with human behavior prediction by human action recognition
US10762788B2 (en) * 2017-08-01 2020-09-01 Swoppz, LLC Method and system for requesting and granting priority between vehicles
CN108108657B (en) * 2017-11-16 2020-10-30 浙江工业大学 Method for correcting locality sensitive Hash vehicle retrieval based on multitask deep learning
CN109344825A (en) * 2018-09-14 2019-02-15 广州麦仑信息科技有限公司 A kind of licence plate recognition method based on convolutional neural networks
CN109635825A (en) * 2018-12-19 2019-04-16 苏州市科远软件技术开发有限公司 Vehicle attribute detection method, device and storage medium

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2014037709A1 (en) * 2012-09-06 2014-03-13 The University Of Manchester Image processing apparatus and method for fitting a deformable shape model to an image using random forest regression voting
CN106295541A (en) * 2016-08-03 2017-01-04 乐视控股(北京)有限公司 Vehicle type recognition method and system
CN107194412A (en) * 2017-04-20 2017-09-22 百度在线网络技术(北京)有限公司 A kind of method of processing data, device, equipment and computer-readable storage medium
CN107862341A (en) * 2017-11-27 2018-03-30 南京理工大学 A kind of vehicle checking method
CN109117826A (en) * 2018-09-05 2019-01-01 湖南科技大学 A kind of vehicle identification method of multiple features fusion

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
Research on armored vehicle classification based on MFCC and SVM;Wenlei Shi等;《2017 3rd IEEE International Conference on Computer and Communications》;20171216;1938-1941 *
基于栈式自编码器特征融合的SAR图像车辆目标识别;康妙等;《雷达学报》;20170430;167-176 *

Also Published As

Publication number Publication date
CN110059771A (en) 2019-07-26

Similar Documents

Publication Publication Date Title
Kang et al. Visually-aware fashion recommendation and design with generative image models
CN107391729B (en) Sort method, electronic equipment and the computer storage media of user comment
Cordeiro et al. A survey on deep learning with noisy labels: How to train your model when you cannot trust on the annotations?
Dodge et al. Visual saliency prediction using a mixture of deep neural networks
CN102262642A (en) Web image search engine and realizing method thereof
CN108228569A (en) A kind of Chinese microblog emotional analysis method based on Cooperative Study under the conditions of loose
McCormack et al. Deep learning of individual aesthetics
Bykov et al. Explaining bayesian neural networks
CN110119448B (en) Semi-supervised cross-domain text classification method based on dual automatic encoders
CN107016416B (en) Data classification prediction method based on neighborhood rough set and PCA fusion
Walsh et al. Automated human cell classification in sparse datasets using few-shot learning
CN111581529A (en) Course recommendation method and device combining student fitness and course collocation degree
CN106897419A (en) The study recommendation method that sorted to level of fusion social information
Badea et al. Identifying transcriptomic correlates of histology using deep learning
JP2021002354A (en) Display control device, display control method, and display control program
Fuda et al. Artificial intelligence in clinical multiparameter flow cytometry and mass cytometry–key tools and progress
Qu et al. Rethinking multiple instance learning for whole slide image classification: A good instance classifier is all you need
Liang et al. Figure-ground image segmentation using feature-based multi-objective genetic programming techniques
CN110059771B (en) Interactive vehicle data classification method under ordering support
CN117371511A (en) Training method, device, equipment and storage medium for image classification model
CN110377841B (en) Similarity calculation method and system applied to collaborative filtering method
Pereira et al. Assessing active learning strategies to improve the quality control of the soybean seed vigor
CN116452895A (en) Small sample image classification method, device and medium based on multi-mode symmetrical enhancement
Bhatia et al. Integrating explainability into deep learning-based models for white blood cells classification
Fan et al. A medical pre-diagnosis system for histopathological image of breast cancer

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant