CN112733874B - Suspicious vehicle discrimination method based on knowledge graph reasoning - Google Patents

Suspicious vehicle discrimination method based on knowledge graph reasoning Download PDF

Info

Publication number
CN112733874B
CN112733874B CN202011144715.6A CN202011144715A CN112733874B CN 112733874 B CN112733874 B CN 112733874B CN 202011144715 A CN202011144715 A CN 202011144715A CN 112733874 B CN112733874 B CN 112733874B
Authority
CN
China
Prior art keywords
entity
triple
path
relation
model
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN202011144715.6A
Other languages
Chinese (zh)
Other versions
CN112733874A (en
Inventor
俞山川
谢耀华
闫禹
周欣
周健
王少飞
涂耘
陈晓利
叶青
陈晨
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
China Merchants Chongqing Communications Research and Design Institute Co Ltd
Original Assignee
China Merchants Chongqing Communications Research and Design Institute Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by China Merchants Chongqing Communications Research and Design Institute Co Ltd filed Critical China Merchants Chongqing Communications Research and Design Institute Co Ltd
Priority to CN202011144715.6A priority Critical patent/CN112733874B/en
Publication of CN112733874A publication Critical patent/CN112733874A/en
Application granted granted Critical
Publication of CN112733874B publication Critical patent/CN112733874B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/22Matching criteria, e.g. proximity measures
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/36Creation of semantic tools, e.g. ontology or thesauri
    • G06F16/367Ontology
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/21Design or setup of recognition systems or techniques; Extraction of features in feature space; Blind source separation
    • G06F18/214Generating training patterns; Bootstrap methods, e.g. bagging or boosting
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y02TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
    • Y02TCLIMATE CHANGE MITIGATION TECHNOLOGIES RELATED TO TRANSPORTATION
    • Y02T10/00Road transport of goods or passengers
    • Y02T10/10Internal combustion engine [ICE] based vehicles
    • Y02T10/40Engine management systems

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • Physics & Mathematics (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • General Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Evolutionary Computation (AREA)
  • Evolutionary Biology (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Artificial Intelligence (AREA)
  • Animal Behavior & Ethology (AREA)
  • Computational Linguistics (AREA)
  • Databases & Information Systems (AREA)
  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)
  • Traffic Control Systems (AREA)

Abstract

The invention discloses a suspicious vehicle distinguishing method based on knowledge graph reasoning, which comprises the following steps: s1: the method comprises the steps of obtaining suspicious vehicle knowledge, representing the suspicious vehicle knowledge in a triple (h, r, t) mode based on a knowledge graph construction mode, and establishing a suspicious vehicle knowledge graph; wherein h represents a head entity, r represents a relationship, and t represents a tail entity; s2: embedding the triple suspicious vehicle knowledge graph into a vector space in a fusion manner to obtain a triple expression vector; s3: constructing a triple relation model, training the triple relation model, and determining the vector similarity between the relation r of the entity h and the tail entity t according to a relation score function; s4: and constructing a triple path model, training the triple path model, and determining the reliability between the relation r of the entity h and the tail entity t according to a path score function of the triple path model, wherein the path with the highest path score is a reasoning result.

Description

Suspicious vehicle discrimination method based on knowledge graph reasoning
Technical Field
The invention relates to a suspicious vehicle distinguishing method based on knowledge graph reasoning.
Background
With the establishment of an intelligent management system/platform of management departments such as an expressway and the like, managers can obviously improve the capacity and dimensionality in the aspects of automatic vehicle feature identification and automatic historical behavior recording, and the establishment of a large number of electronic files for vehicles becomes possible. However, the relevance between vehicles and potential traffic accidents of vehicles is lack of deep mining, and the management department is difficult to perform 'capture' and fine management on suspicious vehicles.
Disclosure of Invention
The invention aims to provide a suspicious vehicle distinguishing method based on knowledge graph reasoning, which aims to solve the problem that the suspicious vehicle is difficult to effectively identify at present.
In order to solve the technical problem, the invention provides a suspicious vehicle discrimination method based on knowledge graph reasoning, which comprises the following steps:
s1: the method comprises the steps of obtaining suspicious vehicle knowledge, representing the suspicious vehicle knowledge in a triple (h, r, t) mode based on a knowledge graph construction mode, and establishing a suspicious vehicle knowledge graph; wherein h represents a head entity, r represents a relationship, and t represents a tail entity;
s2: embedding the triple suspicious vehicle knowledge graph into a vector space in a fusion manner to obtain a triple expression vector;
s3: constructing a triple relation model, training the triple relation model, and determining the vector similarity between the relation r of the entity h and the tail entity t according to a relation score function;
s4: and constructing a triple path model, training the triple path model, and determining the reliability between the relation r of the entity h and the tail entity t according to a path score function of the triple path model, wherein the path with the highest path score is an inference result.
Further, constructing the triplet relation model specifically includes:
and establishing one-to-one, one-to-many, many-to-one or many-to-many relationship between the entity h and the entity t through mapping of the entity space and the relationship space.
Further, according to a relation score function f r (h, t) determining the vector similarity between the relation r of the entity h and the tail entity t, said relation scoring function f r (h, t) is calculated using formula (1):
Figure GDA0004055068540000021
wherein w h 、w t 、w r Mapping functions between entity/relationship representations; i is an identity matrix; each vector satisfies the following constraints:
‖h‖ 2 ≤1,‖r‖ 2 ≤1,‖t‖ 2 ≤1 (2)
Figure GDA0004055068540000022
further, the training objective of the triplet relation model is a minimization loss function, which is expressed as follows:
Figure GDA0004055068540000023
s.t.
Figure GDA0004055068540000024
wherein the content of the first and second substances,
Figure GDA0004055068540000025
representing a set of triples; />
Figure GDA0004055068540000026
A negative sample representing (h, r, t) is obtained by randomly replacing a head entity h or a tail entity t in the training process; and, if +>
Figure GDA0004055068540000027
y hrt =1;/>
Figure GDA0004055068540000028
y hrt =-1。
Further, constructing the triplet path model specifically includes:
according to the difference between the entities h and tGenerates path representation according to the relation between the paths and according to the characteristic value function s of each path h,p(e) Establishing a series of paths, wherein the path set is represented as:
p(h,t)={…,p i (h,t),…}
wherein p is i (h,t)=(h,r 1 ,e 1 ,r 2 ,e 2 ,…,e k-1 ,r k T), k is the path length.
Further, a path score function f is used p (h, t) to determine the reliability between the relation r of the entity h and the tail entity t, the path score function f p (h, t) is calculated by the formula (5):
Figure GDA0004055068540000031
probability of each sample being
Figure GDA0004055068540000032
Further, the training objective of the triplet path model is a minimization loss function, which is expressed as follows:
Figure GDA0004055068540000033
the invention has the beneficial effects that: the invention judges whether a certain vehicle running in a certain area is a suspicious vehicle or not based on knowledge map reasoning, and accordingly, the invention carries out early warning to relevant management departments such as a highway and the like, thereby improving the capturing and emergency handling capabilities of the management departments on the suspicious vehicle.
Detailed Description
A suspicious vehicle discrimination method based on knowledge graph reasoning,
s1: acquiring suspicious vehicle knowledge, representing the suspicious vehicle knowledge in a triple (h, r, t) mode based on a knowledge map construction mode, and establishing a suspicious vehicle knowledge map; wherein h represents a head entity, r represents a relationship, and t represents a tail entity;
wherein the suspicious vehicle knowledge comprises: a vehicle in an area (e.g., a section of highway) meeting one of three conditions: (1) the vehicle type, the license plate and the driver characteristics are not matched with the historical records of the database; (2) have occurred in historical accident scenes; (3) the dangerous driving behaviors such as traffic violation like overspeed and overload or the like changing lanes in lushikim are generated for many times; (4) often co-occurring with a vehicle that has been determined to be suspect, i.e., a companion vehicle to the suspect vehicle.
S2: embedding the triple suspicious vehicle knowledge graph into a vector space in a fusion manner to obtain a triple expression vector;
by using
Figure GDA0004055068540000047
Represents a collection of entities, is selected based on the presence of a particular entity>
Figure GDA0004055068540000041
Representing a collection of relationships. Representing a pair of embedded relationships using a triplet (h, r, t), wherein +>
Figure GDA0004055068540000042
Indicates head entity, is present>
Figure GDA0004055068540000043
Represents a relationship, is>
Figure GDA0004055068540000044
Representing the tail entity. Set of triples is used to +>
Figure GDA0004055068540000045
And (4) showing. Thus, each entity and relationship in the knowledge-graph is represented as a vector. Examples are as follows:
if r is the relationship of vehicle type, then (h, r, t) can be expressed as (Yu A00000, vehicle type is car); if r is the "driver is" relationship, (h, r, t) can be represented as (Yu A00000, driver is, driver ID 1); if r is a "body feature is" relationship, (h, r, t) can be represented as (driver ID 1, body feature is, thin); if r is "occurred at the accident scene", then (h, r, t) can be expressed as (Yu A00000, occurred at the accident scene, accident ID 1); if r is "multiple traffic violations," then (h, r, t) can be expressed as (Yu A00000, multiple traffic violations, 0 or 1); if r is "present at location", then (h, r, t) can be expressed as (Yu A00000, present at location, K1+ 120); if r is "present at time", then (h, r, t) can be expressed as (Yu A00000, present at time, YYYY-MM-DDhh: MM: ss); and so on.
S3: constructing a triple relation model, training the triple relation model, and determining the vector similarity between the relation r of the entity h and the tail entity t according to a relation score function;
1. triple relationship model building
The ranD model establishes one-to-one, one-to-many, many-to-one or many-to-many relationships between h and t through the mapping of the entity space and the relationship space. The vector similarity between the relations r is measured by a score function. Score function f r (h, t) is calculated by the formula (1):
Figure GDA0004055068540000046
wherein, w h 、w t 、w r Mapping functions between entity/relationship representations; and I is an identity matrix. Each vector satisfies the following constraints:
‖h‖ 2 ≤1,‖r‖ 2 ≤1,‖t‖ 2 ≤1 (2)
Figure GDA0004055068540000051
2. triple relationship model training
The training goal of the triplet relation model is to minimize the loss function, which is expressed as follows:
Figure GDA0004055068540000052
s.t.
Figure GDA0004055068540000053
wherein the content of the first and second substances,
Figure GDA0004055068540000054
representing a set of triples; />
Figure GDA0004055068540000055
Negative samples representing (h, r, t) are obtained by randomly replacing a head entity h or a tail entity t in the training process; and, if +>
Figure GDA0004055068540000056
y hrt =1;/>
Figure GDA0004055068540000057
y hrt =-1。
S4: and constructing a triple path model, training the triple path model, and determining the reliability between the relation r of the entity h and the tail entity t according to a path score function of the triple path model, wherein the path with the highest path score is an inference result.
For example, if the suspicious vehicle feature knowledge graph contains relationships (li x, driving, yu a 00000), (yu a00000, affiliated to Chongqing shipping Co., ltd.), then the missing relationships (li x, working at Chongqing shipping Co., ltd.) may be obtained.
The invention calculates the characteristic value function s of each path based on a random walk model h,P(t) Thereby establishing a series of paths. A path P is defined by a series of relationships r 1 ,…,r l ,…,r n Consists of the following components:
Figure GDA0004055068540000058
wherein, T n-1 Is a relation r n Scope of action of, T n-1 While being in relation r n-1 Value range of (i.e. T) n-1 =ran(r n )=dom(r n-1 ). Scope and value of the relationship, i.e. type of entity, T 0 ={h},T n = t. Eigenvalue function s h,P(t) Is the probability that the tail entity t can be reached starting from the head entity h along path P. When the path goes to any intermediate entity e, s h,P(e) The updating method is
Figure GDA0004055068540000061
Wherein, in the initial stage of random walk, if e ∈ P, s h,P(e) =1; otherwise, s h,P(e) =0; i (e', e) is an indicator function if r l (e', e) present, I (r) l (e', e)) =1; otherwise, I (r) l (e′,e))=0;
For the relation r, a series of path characteristics P are obtained through a random walk algorithm r ={P 1 ,…,P n A score function is then built for each training sample (i.e. a combination of head and tail entities) under relation r:
Figure GDA0004055068540000062
the probability for each sample is:
Figure GDA0004055068540000063
the minimization loss function is:
min w k (y k log P k +(1-y k )log(1-P k )) (7)
wherein, y k For training the sample (h) k ,t k ) Whether or not there is a flag of relation r, y k =1, if triplet (h) k ,r,t k ) (ii) present; otherwise y k =0。
The method and the device can well establish the complex relations such as the relations of 1 to N, N to 1, N to N and the like, are simpler and higher in calculation efficiency compared with other methods of the same type, and are suitable for establishing the mass suspicious vehicle data knowledge graph in the management department platforms such as the highway and the like. The model training of the method is based on open world assumption, the effect on the incomplete knowledge graph is better, and in the model training fine tuning, the model based on the open world assumption has better effect.
Finally, the above embodiments are only for illustrating the technical solutions of the present invention and not for limiting, although the present invention has been described in detail with reference to the preferred embodiments, it should be understood by those skilled in the art that modifications or equivalent substitutions may be made to the technical solutions of the present invention without departing from the spirit and scope of the technical solutions of the present invention, and all of them should be covered in the claims of the present invention.

Claims (4)

1. A suspicious vehicle discrimination method based on knowledge graph reasoning is characterized by comprising the following steps:
s1: acquiring suspicious vehicle knowledge, representing the suspicious vehicle knowledge in a triple (h, r, t) mode based on a knowledge map construction mode, and establishing a suspicious vehicle knowledge map; wherein h represents a head entity, r represents a relationship, and t represents a tail entity;
s2: embedding the triple suspicious vehicle knowledge graph into a vector space in a fusion manner to obtain a triple expression vector;
s3: constructing a triple relation model, training the triple relation model, and determining the vector similarity between the relation r of the entity h and the tail entity t according to a relation score function;
s4: constructing a triple path model, training the triple path model, and determining the reliability between the relation r of an entity h and a tail entity t according to a path score function of the triple path model, wherein the path with the highest path score is a reasoning result; step S4 specifically includes:
calculating a characteristic value function s of each path based on a random walk model h,P(t) Thereby establishing a series of paths; a path P is defined by a series of relationships r 1 ,...,r l ,...,r n Consists of the following components:
Figure QLYQS_1
wherein, T n-1 Is a relation r n Scope of action of (1), T n-1 While being in the relationship r n-1 Value range of (i.e. T) n-1 =ran(r n )=dom(r n-1 ) Scope and value of a relationship, i.e. type of entity, T 0 ={h},T n = t }; function of eigenvalues s h,P(t) Is the probability that the tail entity t can be reached starting from the head entity h along path P; when the path goes to any intermediate entity e, s h,P(e) The updating method is
Figure QLYQS_2
Wherein, in the initial stage of random walk, if e ∈ P, s h,P(e) =1; otherwise, s h,P(e) =0; i (e', e) is an indicator function if r l (e', e) present, I (r) l (e', e)) =1; otherwise, I (r) l (e′,e))=0;
For the relation r, a series of path features P are obtained through a random walk algorithm r ={P 1 ,...,P n A score function is built for each training sample under relation r:
Figure QLYQS_3
the probability for each sample is:
Figure QLYQS_4
the training target of the triplet path model is a minimization loss function, which is expressed as follows:
Figure QLYQS_5
2. the method for distinguishing suspicious vehicles based on knowledge graph reasoning according to claim 1, wherein the constructing of the triple relationship model specifically comprises:
and establishing one-to-one, one-to-many, many-to-one or many-to-many relationship between the entity h and the entity t through mapping of the entity space and the relationship space.
3. The method of claim 2, wherein the method of suspect vehicle discrimination based on knowledgegraph reasoning is based on a relational score function f r (h, t) determining the vector similarity between the relation r of the entity h and the tail entity t, the relation score function f r (h, t) is calculated using formula (1):
Figure QLYQS_6
wherein, w h 、w t 、w r Mapping functions between entity/relationship representations; i is an identity matrix; each vector satisfies the following constraints:
||h|| 2 ≤1,||r|| 2 ≤1,||t|| 2 ≤1(2)
Figure QLYQS_7
4. the method of claim 3, wherein the training objective of the triplet relational model is a minimization loss function, the minimization loss function being expressed as follows:
Figure QLYQS_8
s.t.
Figure QLYQS_9
wherein the content of the first and second substances,
Figure QLYQS_10
representing a set of triples; />
Figure QLYQS_11
A negative sample representing (h, r, t) is obtained by randomly replacing a head entity h or a tail entity t in the training process; and, if +>
Figure QLYQS_12
Figure QLYQS_13
y hrt =-1。/>
CN202011144715.6A 2020-10-23 2020-10-23 Suspicious vehicle discrimination method based on knowledge graph reasoning Active CN112733874B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202011144715.6A CN112733874B (en) 2020-10-23 2020-10-23 Suspicious vehicle discrimination method based on knowledge graph reasoning

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202011144715.6A CN112733874B (en) 2020-10-23 2020-10-23 Suspicious vehicle discrimination method based on knowledge graph reasoning

Publications (2)

Publication Number Publication Date
CN112733874A CN112733874A (en) 2021-04-30
CN112733874B true CN112733874B (en) 2023-04-07

Family

ID=75597309

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202011144715.6A Active CN112733874B (en) 2020-10-23 2020-10-23 Suspicious vehicle discrimination method based on knowledge graph reasoning

Country Status (1)

Country Link
CN (1) CN112733874B (en)

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103098111A (en) * 2010-09-24 2013-05-08 丰田自动车株式会社 Track estimation device and program
CN109799477A (en) * 2018-12-06 2019-05-24 北京邮电大学 A kind of sequential vehicle fingerprint localization method and device towards millimeter wave car networking

Family Cites Families (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US9092483B2 (en) * 2010-10-19 2015-07-28 Microsoft Technology Licensing, Llc User query reformulation using random walks
CN110119355B (en) * 2019-04-25 2022-10-28 天津大学 Knowledge graph vectorization reasoning general software defect modeling method
CN110232186A (en) * 2019-05-20 2019-09-13 浙江大学 The knowledge mapping for merging entity description, stratification type and text relation information indicates learning method
CN110795543B (en) * 2019-09-03 2023-09-22 腾讯科技(深圳)有限公司 Unstructured data extraction method, device and storage medium based on deep learning
CN111026875A (en) * 2019-11-26 2020-04-17 中国人民大学 Knowledge graph complementing method based on entity description and relation path
CN110781254A (en) * 2020-01-02 2020-02-11 四川大学 Automatic case knowledge graph construction method, system, equipment and medium
CN111291135A (en) * 2020-01-21 2020-06-16 深圳追一科技有限公司 Knowledge graph construction method and device, server and computer readable storage medium
CN111597350B (en) * 2020-04-30 2023-06-02 西安理工大学 Rail transit event knowledge graph construction method based on deep learning
CN111753101B (en) * 2020-06-30 2022-06-07 华侨大学 Knowledge graph representation learning method integrating entity description and type

Patent Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103098111A (en) * 2010-09-24 2013-05-08 丰田自动车株式会社 Track estimation device and program
CN109799477A (en) * 2018-12-06 2019-05-24 北京邮电大学 A kind of sequential vehicle fingerprint localization method and device towards millimeter wave car networking

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
基于随机游走模型的物体识别;林霄等;《计算机工程与应用》;20131101(第21期);145-151页 *

Also Published As

Publication number Publication date
CN112733874A (en) 2021-04-30

Similar Documents

Publication Publication Date Title
CN110949398B (en) Method for detecting abnormal driving behavior of first-vehicle drivers in vehicle formation driving
CN108769104B (en) Road condition analysis and early warning method based on vehicle-mounted diagnosis system data
CN111401414B (en) Natural driving data-based dangerous scene extraction and classification method
CN110562261B (en) Method for detecting risk level of driver based on Markov model
US20230406357A1 (en) Autonomous driving control device and autonomous driving control method
CN111563555A (en) Driver driving behavior analysis method and system
Xu et al. Utilizing structural equation modeling and segmentation analysis in real-time crash risk assessment on freeways
CN111858719A (en) Specific condition early warning method and system based on vehicle accident data mining analysis
CN112927511A (en) Vehicle early warning method based on driver age and gender identification
WO2021036277A1 (en) Multi-dimensional urban traffic anomaly event recognition method based on ternary gaussian mixture model
JP2017071333A (en) Drive assisting device
Ma et al. Driving style estimation by fusing multiple driving behaviors: a case study of freeway in China
CN112733874B (en) Suspicious vehicle discrimination method based on knowledge graph reasoning
WO2024067135A1 (en) Unmanned aircraft management method and system based on big data identification, and medium
CN116153082B (en) Expressway road condition acquisition, analysis and processing system based on machine vision
CN110705628B (en) Method for detecting risk level of driver based on hidden Markov model
CN116911610A (en) Method and system for monitoring, evaluating and early warning of driving safety risk of transport vehicle
CN114386737B (en) Method for monitoring safety of passengers on internet about car in mobile interconnection era
CN115270400A (en) Test scene construction method for automatic driving simulation and readable storage medium
CN112633163B (en) Detection method for realizing illegal operation vehicle detection based on machine learning algorithm
CN111860383B (en) Group abnormal behavior identification method, device, equipment and storage medium
CN111507380B (en) Picture classification method, system, device and storage medium based on clustering
CN111798091B (en) Expressway lane change scoring model building method based on lane change duration
CN114742293A (en) Method and system for evaluating driver traffic safety risk and analyzing human-vehicle association
CN114611773A (en) Method for disposing tossed vehicles nested with potential risk prediction and multiple punishment mechanisms

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant