CN111814658B - Scene semantic structure diagram retrieval method based on semantics - Google Patents
- Publication number
- CN111814658B (application CN202010644017A)
- Authority
- CN
- China
- Prior art keywords
- structure diagram
- semantic structure
- matching
- scene
- scene semantic
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V20/00—Scenes; Scene-specific elements
- G06V20/40—Scenes; Scene-specific elements in video content
- G06V20/41—Higher-level, semantic clustering, classification or understanding of video scenes, e.g. detection, labelling or Markovian modelling of sport events or news items
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/70—Information retrieval; Database structures therefor; File system structures therefor of video data
- G06F16/78—Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually
- G06F16/783—Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually using metadata automatically derived from the content
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F18/00—Pattern recognition
- G06F18/20—Analysing
- G06F18/22—Matching criteria, e.g. proximity measures
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F18/00—Pattern recognition
- G06F18/20—Analysing
- G06F18/24—Classification techniques
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/04—Architecture, e.g. interconnection topology
- G06N3/045—Combinations of networks
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/08—Learning methods
Abstract
The invention discloses a scene semantic structure diagram retrieval method, which mainly solves the problem of poor retrieval performance in the prior art. The implementation scheme is as follows: 1) Inputting a query scene semantic structure diagram and recalling related results from a scene semantic structure diagram database D to obtain a scene semantic structure diagram candidate set T; 2) Calculating the matching distance of each matching result in the candidate set T, sorting the results by matching distance in ascending order, and keeping the top k results to obtain a reduced candidate set T'; 3) Calculating the similarity S between each scene semantic structure diagram in the reduced candidate set T' and the query scene semantic structure diagram with a graph neural network, and sorting by similarity in descending order to obtain the final retrieval result. The method improves both the efficiency and the precision of scene semantic structure diagram retrieval, and can be used to search for semantically similar scene semantic structure diagrams and to accurately locate a local scene within a global scene.
Description
Technical Field
The invention belongs to the technical field of computer vision, and particularly relates to a scene semantic structure diagram retrieval method which can be used for searching a scene semantic structure diagram with similar semantics and realizing accurate positioning of a local scene in a global scene.
Background
Training computers to interpret and understand the visual world has attracted the attention of many researchers in computer vision over the past decades. For an image or a video, human eyes can easily capture the objects, the background, and the rich semantic information hidden among them; how to represent the rich object and semantic information contained in an image has therefore become a critical research question. The scene semantic structure diagram is a directed graph structure proposed by Johnson for describing a visual scene: its nodes describe the semantic information of objects in the scene, and its edges describe the interaction information between objects. The scene semantic structure diagram not only provides context cues for basic recognition tasks, but also provides powerful support for advanced visual tasks such as semantic-based image retrieval.
In the field of image retrieval, both images and text descriptions can be represented by a scene semantic structure diagram, so the image retrieval problem can be converted into a scene semantic structure diagram retrieval problem, and semantic-based image retrieval can be realized through retrieval of scene semantic structure diagrams.
The retrieval of scene semantic structure diagrams can also be applied to accurately locating a local scene within a global scene: when multiple unmanned aerial vehicles work cooperatively, a high-altitude vehicle obtains a larger viewing angle corresponding to the global scene, while a low-altitude vehicle has a smaller viewing angle corresponding to a local scene.
Because the scene semantic structure diagram is a directed graph structure, its retrieval is closely related to graph retrieval. At home and abroad, scene semantic structure diagram retrieval is mainly carried out by two kinds of method. The first is graph matching, such as the Ullmann algorithm, which uses depth-first search with backtracking and pruning to decide whether a scene semantic structure diagram contains a substructure that exactly matches the query scene semantic structure diagram. The second is the conditional random field method adopted by Johnson, which treats each node in the scene semantic structure diagram as a variable and computes the classification probability of the diagram by propagating probabilities along the edges.
Disclosure of Invention
Aiming at the defects of the prior art, the invention provides a semantics-based scene semantic structure diagram retrieval method. It first performs recall and coarse-ranking filtering on the scene semantic structure diagrams in the database, which greatly improves algorithm efficiency; it then uses a graph neural network to precisely rank the recalled results while jointly considering the similarity of objects, relations, and structure between scene semantic structure diagrams, which greatly improves retrieval precision.
In order to achieve the above purpose, the technical scheme adopted by the invention comprises the following steps:
(1) Inputting a query scene semantic structure diagram and recalling related results from a scene semantic structure diagram database D to obtain a scene semantic structure diagram candidate set T:
(1.1) extracting 5 fixed substructures from the scene semantic structure diagram database D to obtain a substructure database D'; extracting the same 5 fixed substructures from the input query scene semantic structure diagram to obtain a query substructure set Q, wherein each substructure records the name of its scene semantic structure diagram, its substructure type, and its object labels;
(1.2) for each substructure in the query substructure set Q, searching the substructure database D' for substructures that match it, obtaining substructure matching pairs formed by the query substructure and each matched substructure;
(1.3) selecting pairs of substructure matching pairs whose matched substructures belong to the same scene semantic structure diagram and whose two query substructures either share no object, or whose shared objects are mapped to the same matched objects, and connecting an edge between every two such substructure matching pairs to obtain a number of undirected graphs;
(1.4) solving the maximum cliques of these undirected graphs and merging the substructure matching pairs in each maximum clique to obtain the matching results between the query scene semantic structure diagram and the scene semantic structure diagrams in database D; these matching results form the candidate set T;
(2) Calculating the matching distance of each matching result in the candidate set T, sorting the results by matching distance in ascending order, and keeping the top k results to obtain a reduced candidate set T', where k is set according to actual requirements and takes a value of 50 or 100;
(3) Precisely calculating the similarity S between each scene semantic structure diagram in the reduced candidate set T' and the query scene semantic structure diagram with a graph neural network, and sorting by similarity in descending order to obtain the final retrieval result.
Compared with the prior art, the invention has the following advantages:
1) The invention performs recall and coarse ranking of the input query scene semantic structure diagram against the database, which quickly filters out a large number of irrelevant results and greatly improves retrieval efficiency.
2) The method first converts scene semantic structure diagrams into vectors with a graph neural network and then uses the similarity between these vectors to represent the similarity between the diagrams. The complex and difficult problem of computing scene semantic structure diagram similarity is thus converted into the simple problem of computing vector similarity, which greatly improves both the efficiency and the precision of similarity computation, and hence of scene semantic structure diagram retrieval;
3) The invention can also be applied to other fields of graph retrieval.
Drawings
FIG. 1 is a general flow chart of an implementation of the present invention;
FIG. 2 is a scene semantic structure diagram recall sub-flowchart in the present invention;
FIG. 3 is a diagram of an example of a sub-structure of a scene semantic structure used in the present invention;
FIG. 4 is a sub-flowchart of calculating the similarity between scene semantic structure diagrams using the graph neural network in the present invention;
FIG. 5 is a diagram of the information encoding structure in the graph neural network of the present invention;
FIG. 6 is a diagram of the cross-graph information propagation structure in the graph neural network of the present invention;
FIG. 7 is a diagram of the information aggregation structure in the graph neural network of the present invention;
FIG. 8 is a diagram showing an example of the retrieval results of the present invention.
Detailed Description
For the purpose of making the objects, technical solutions and advantages of the embodiments of the present invention more apparent, the technical solutions of the embodiments of the present invention will be clearly described below in conjunction with the illustrations in the embodiments of the present invention, and it is apparent that the described embodiments are some, but not all, embodiments of the present invention. All other embodiments, which can be made by those skilled in the art based on the embodiments of the invention without making any inventive effort, are intended to be within the scope of the invention.
Referring to fig. 1, the implementation steps of this example are as follows:
step 1, recalling a scene semantic structure diagram related to the input scene semantic structure diagram from a database D to form a candidate set T.
Referring to fig. 2, the specific implementation of this step is as follows:
(1.1) Firstly, extracting from the scene semantic structure diagram database D the 5 differently shaped substructures shown in FIG. 3, each formed by points and lines, where points represent objects and lines represent the relationships between objects, obtaining a substructure database D'; then extracting the same 5 fixed substructures of FIG. 3 from the input query scene semantic structure diagram to obtain a query substructure set Q, wherein each substructure records the name of its scene semantic structure diagram, its substructure type, and its object labels;
(1.2) for each substructure in the query substructure set Q, searching the substructure database D' for substructures that match it, obtaining substructure matching pairs of the query substructure and each matched substructure:
(1.2.1) determining whether the query substructure has the same type as the substructure currently traversed in database D': if so, execute (1.2.2); otherwise, execute (1.2.4);
(1.2.2) determining whether the objects of the two substructures satisfy the following constraint:

L2(V(c_i), V(φ(c_i))) ≤ th_o

where th_o and th_a are the manually set object matching threshold and attribute matching threshold, c_i denotes the category of object o_i, V(c_i) the word vector corresponding to that category, φ(c_i) the category of the matched object φ(o_i), and V(φ(c_i)) the word vector corresponding to that category;
if the constraint is satisfied, execute (1.2.3);
otherwise, execute (1.2.4);
(1.2.3) determining whether the relations of the two substructures satisfy the following constraint:

min_q L2(V(E_1(o_i, o_j)_p), V(E_2(φ(o_i), φ(o_j))_q)) ≤ th_r

where th_r is the manually set relation matching threshold, o_i and o_j are two different objects in the query substructure, φ(o_i) and φ(o_j) are two different objects in the database substructure to be matched, E_1(o_i, o_j)_p denotes the p-th relation between objects o_i and o_j in the query substructure, V(E_1(o_i, o_j)_p) the word vector corresponding to the category of the p-th relation, E_2(φ(o_i), φ(o_j))_q the q-th relation between objects φ(o_i) and φ(o_j) in the database substructure to be matched, and V(E_2(φ(o_i), φ(o_j))_q) the word vector corresponding to the category of the q-th relation;
if the constraint is satisfied, a substructure matching pair is obtained and added to the set of substructure matching pairs, then execute (1.2.4);
otherwise, directly execute (1.2.4);
(1.2.4) moving to the next substructure in database D'; when the enumeration is complete, the search for the current query substructure ends; otherwise, return to (1.2.1).
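The type, object, and relation checks of steps (1.2.1) to (1.2.3) can be sketched as follows. This is a minimal illustration, not the patented implementation: the dictionary layout of a substructure, the positional alignment of objects, and the threshold values are all assumptions made for the example.

```python
import numpy as np

def l2(a, b):
    """Euclidean (L2) distance between two word vectors."""
    return float(np.linalg.norm(np.asarray(a, dtype=float) - np.asarray(b, dtype=float)))

def substructures_match(query_sub, cand_sub, th_o=0.5, th_r=0.5):
    """Check whether a query substructure matches a candidate substructure.

    A substructure is assumed to be a dict with:
      'type'     : substructure type id,
      'objects'  : list of word vectors, one per object (aligned by position),
      'relations': list of (i, j, [word vectors of relations from object i to j]).
    th_o / th_r play the roles of the object and relation matching thresholds.
    """
    # step (1.2.1): substructure types must agree
    if query_sub['type'] != cand_sub['type']:
        return False
    # step (1.2.2): every pair of aligned objects must be close in word-vector space
    for v_q, v_c in zip(query_sub['objects'], cand_sub['objects']):
        if l2(v_q, v_c) > th_o:
            return False
    # step (1.2.3): each query relation needs some close candidate relation
    cand_rels = {(i, j): vecs for i, j, vecs in cand_sub['relations']}
    for i, j, q_vecs in query_sub['relations']:
        c_vecs = cand_rels.get((i, j), [])
        for qv in q_vecs:
            if not any(l2(qv, cv) <= th_r for cv in c_vecs):
                return False
    return True
```

A matching pair would then be recorded whenever `substructures_match` returns `True` for a query substructure and a database substructure.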
(1.3) selecting pairs of substructure matching pairs whose matched substructures belong to the same scene semantic structure diagram and whose two query substructures either share no object, or whose shared objects are mapped to the same matched objects, and connecting an edge between every two such substructure matching pairs to obtain a number of undirected graphs;
(1.4) solving a plurality of undirected graphs for a maximum clique:
existing methods for solving the maximum clique of the undirected graph include the Bron-Kerbosch algorithm, the Hochbaum algorithm, and the like, and the present example uses, but is not limited to, the Bron-Kerbosch algorithm, and the solving process is as follows:
(1.4.1) constructing 4 sets R, P, X, M and a counter n recording the node count of the largest maximum clique found so far, where the R set records the points already added to the current clique; the P set records the points that may still be added, i.e. the points connected by an edge to every node in R; the X set records the points already added to some maximal clique; and the M set is the set of maximum cliques finally returned. Initially R, X and M are empty sets, P is the set containing all nodes of the undirected graph, and n is 0; then execute (1.4.2);
(1.4.2) sequentially taking out each node in the P set, adding the currently taken node into the R set on the assumption that the currently taken node is v, updating the P set to be an intersection of the original P set and a node set connected with the node v, and simultaneously updating the X set to be an intersection of the original X set and the node set connected with the node v, and executing (1.4.3);
(1.4.3) determining whether the P set and the X set are both empty: if yes, execute (1.4.4), otherwise execute (1.4.5);
(1.4.4) comparing n with the number of nodes in R:
if n is less than the number of nodes in R, updating n to the number of nodes in R, emptying the M set, adding the R set into the M set, and executing (1.4.5);
if n is equal to the number of nodes in R, adding the R set to the M set, and then executing (1.4.5);
if n is greater than the number of nodes in R, then execute directly (1.4.5);
(1.4.5) delete v node from P while adding v node to X, return (1.4.2).
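The R/P/X/M procedure of steps (1.4.1) to (1.4.5) is a Bron-Kerbosch search that keeps only the largest maximal cliques. A compact sketch, written recursively but with the same bookkeeping (the adjacency-dictionary representation is an assumption for the example):

```python
def largest_maximal_cliques(adj):
    """Return all maximum cliques of an undirected graph, following the
    R/P/X/M bookkeeping of step (1.4).

    adj: dict mapping each node to the set of its neighbours.
    """
    M = []          # maximum cliques found so far
    n = 0           # size of the largest clique found so far

    def expand(R, P, X):
        nonlocal n, M
        if not P and not X:           # step (1.4.3): R is a maximal clique
            if len(R) > n:            # step (1.4.4): keep only the largest
                n = len(R)
                M = [set(R)]
            elif len(R) == n:
                M.append(set(R))
            return
        for v in list(P):             # step (1.4.2): try each candidate node v
            expand(R | {v}, P & adj[v], X & adj[v])
            P.remove(v)               # step (1.4.5): move v from P to X
            X.add(v)

    expand(set(), set(adj), set())
    return M
```

For a triangle {0, 1, 2} with a pendant node 3 attached to node 2, the only maximum clique returned is {0, 1, 2}; the smaller maximal clique {2, 3} is discarded by the size check.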
(1.5) merging the substructure matching pairs in each maximum clique to obtain the matching results between the query scene semantic structure diagram and the scene semantic structure diagrams in database D, and forming the candidate set T from these matching results.
Step 2, coarsely ranking the candidate set T and keeping the top-ranked results to obtain the reduced candidate set T'.
(2.1) calculating a matching distance of the matching result in the candidate set T:
(2.1.1) constructing the scene semantic structure diagram matching distance metric function D_φ(G_1, G_2):
First, according to the main differences between the two scene semantic structure diagrams G_1 and G_2 (object categories, relation categories, object attributes, structure, and the possibility that some objects of G_1 find no matching object in G_2), the matching result is scored by five difference terms: the object category difference D_o, the relation category difference D_r, the object attribute difference D_a, the structure difference D_s, and the penalty D_g for objects of G_1 that find no matching object in G_2. In these terms:
c_i denotes the category of object o_i in G_1, V(c_i) the word vector corresponding to that category, φ(c_i) the category of the object in G_2 matched to o_i, V(φ(c_i)) the word vector corresponding to that category, and dg(o_i) the degree of object o_i in the graph;
E_1(o_i, o_j)_p denotes the p-th relation between objects o_i and o_j in G_1, V(E_1(o_i, o_j)_p) the word vector corresponding to the category of the p-th relation, E_2(φ(o_i), φ(o_j))_q the q-th relation between the matched objects φ(o_i) and φ(o_j) in G_2, and V(E_2(φ(o_i), φ(o_j))_q) the word vector corresponding to the category of the q-th relation;
A_i,p denotes the p-th attribute of object o_i, V(A_i,p) the word vector corresponding to the category of the p-th attribute, φ(A_i)_q the q-th attribute of the matched object φ(o_i), and V(φ(A_i)_q) the word vector corresponding to the category of the q-th attribute;
d(o_i, o_j) denotes the shortest distance between objects o_i and o_j in G_1, and d(φ(o_i), φ(o_j)) the shortest distance between the matched objects φ(o_i) and φ(o_j) in G_2;
o_i ∈ O_1 − O_1' ranges over the objects of G_1 for which no matching object is found in G_2;
Then, the 5 terms are weighted and summed to obtain the scene semantic structure diagram matching distance metric function:

D_φ(G_1, G_2) = w_o D_o + w_r D_r + w_a D_a + w_s D_s + w_g D_g

where G_1 = (O_1, E_1) is the query scene semantic structure diagram with object set O_1 and relation set E_1, G_2 = (O_2, E_2) is the matched scene semantic structure diagram with object set O_2 and relation set E_2, φ denotes the matching result between G_1 and G_2, and w_o, w_r, w_a, w_s, w_g are the weights of the respective terms;
(2.1.2) for each matching result φ in the candidate set T, calculating its matching distance with the scene semantic structure diagram matching distance metric function D_φ(G_1, G_2);
(2.2) sorting the matching results in the candidate set T by matching distance in ascending order and keeping the top k results to obtain the reduced candidate set T', where k is set according to actual requirements and takes a value of 50 or 100; this example takes 100.
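The coarse-ranking step can be sketched as below, assuming the five difference terms have already been computed for each matching result; the dictionary keys and the weight values are illustrative assumptions, not the patent's.

```python
def matching_distance(diffs, weights):
    """Weighted sum D_phi(G1, G2) of the five difference terms of step (2.1):
    object-category, relation-category, attribute, structure, and
    unmatched-object differences."""
    keys = ('object', 'relation', 'attribute', 'structure', 'unmatched')
    return sum(weights[k] * diffs[k] for k in keys)

def coarse_rank(candidates, weights, k=100):
    """Sort matching results by ascending matching distance and keep the
    top k, giving the reduced candidate set T' of step (2.2).

    candidates: list of (graph_id, diffs) pairs, where diffs holds the five
    precomputed difference terms for that matching result."""
    ranked = sorted(candidates,
                    key=lambda c: matching_distance(c[1], weights))
    return ranked[:k]
```

With all weights equal, the candidate whose five difference terms sum to the smallest value is ranked first.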
Step 3, precisely ranking the reduced candidate set T' with the graph neural network to obtain the final retrieval result.
(3.1) calculating the similarity between the query scene semantic structure diagram and each scene semantic structure diagram in the reduced candidate set T' using the graph neural network:
referring to fig. 4, the specific implementation of this step is as follows:
(3.1.1) inputting all object information o_i and relation information rel_i,j of the query scene semantic structure diagram G_1 and of the candidate scene semantic structure diagram G_2 in the reduced candidate set T' into the information encoding structure shown in FIG. 5: the object information o_i is input into a first multi-layer perceptron MLP_object and the relation information rel_i,j into a second multi-layer perceptron MLP_relationship, which output the encoded object hidden-state information h_i^(0) and the encoded relation information e_i,j:

h_i^(0) = MLP_object(o_i), o_i ∈ O_1 ∪ O_2
e_i,j = MLP_relationship(rel_i,j), e_i,j ∈ E_1 ∪ E_2

where o_i denotes the i-th object information in a scene semantic structure diagram and rel_i,j denotes the relation information between objects o_i and o_j;
(3.1.2) letting the hidden-state information of all encoded objects learn the information within its whole graph and the object matching information between the graphs to obtain the final object hidden-state information h_i^(T):
The encoded query scene semantic structure diagram G_1 and candidate scene semantic structure diagram G_2 are input into the cross-graph information propagation structure shown in FIG. 6 and iterated T = 5 times, so that all of the encoded hidden-state information h_i^(t) of both diagrams learns the information within its whole graph and the object matching information between the graphs, yielding the final object hidden-state information h_i^(T).
The cross-graph information propagation structure is iterative and comprises several time steps, each consisting of three parts:
The first part uses a multi-layer perceptron MLP_union to jointly encode each piece of relation information e_i,j with the hidden-state information of the objects at its two ends, obtaining the jointly encoded message

m_j→i = MLP_union(h_i^(t), h_j^(t), e_i,j)

where MLP_union consists of a fully connected layer and a linear rectification function;
The second part calculates the matching information μ_j→i between the hidden-state information of objects in G_1 and the hidden-state information of objects in G_2 with a vector similarity function S_v, which may be the Euclidean similarity function or the cosine similarity function; this example uses, but is not limited to, the Euclidean similarity function;
The third part takes the object hidden-state information h_i^(t), the jointly encoded messages m_j→i, and the matching information μ_j→i between object hidden states as input, and computes the object hidden-state information h_i^(t+1), which incorporates first-order neighborhood information:

h_i^(t+1) = MLP_p(h_i^(t), Σ_j m_j→i, Σ_j' μ_j'→i)

where MLP_p is a multi-layer perceptron consisting of a fully connected layer and a linear rectification function.
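A rough sketch of one propagation time step, in the spirit of the three parts above. The exact forms of the matching information μ and of the state update are not fully reproduced in the text, so the attention-weighted difference used for μ and the concatenation-based update here are assumptions; the tiny randomly initialized perceptrons stand in for the trained MLP_union and MLP_p.

```python
import numpy as np

rng = np.random.default_rng(0)

def mlp(dims):
    """A tiny stand-in perceptron: linear layer, ReLU, linear layer."""
    w1 = rng.standard_normal((dims[0], dims[1])) * 0.1
    w2 = rng.standard_normal((dims[1], dims[2])) * 0.1
    return lambda x: np.maximum(x @ w1, 0.0) @ w2

def cross_graph_step(h1, h2, edges1, e1, mlp_union, mlp_p):
    """One time step of cross-graph propagation for the nodes of G1
    (the update for the nodes of G2 is symmetric).

    h1, h2 : node hidden-state matrices of G1 / G2, shapes (n1, d) / (n2, d)
    edges1 : list of (i, j) index pairs for the edges of G1
    e1     : encoded edge vectors of G1, aligned with edges1
    """
    # part 1: within-graph messages m_{j->i}, jointly encoding (h_i, h_j, e_ij)
    msg = np.zeros_like(h1)
    for (i, j), e in zip(edges1, e1):
        msg[i] += mlp_union(np.concatenate([h1[i], h1[j], e]))
    # part 2: cross-graph matching vectors, here an attention-weighted
    # difference based on Euclidean similarity (an assumed concrete form)
    sim = -np.linalg.norm(h1[:, None, :] - h2[None, :, :], axis=-1)
    att = np.exp(sim) / np.exp(sim).sum(axis=1, keepdims=True)
    mu = h1 - att @ h2
    # part 3: new node states from old state, messages, and matching vectors
    return mlp_p(np.concatenate([h1, msg, mu], axis=1))
```

Iterating such a step T = 5 times, alternating the roles of G_1 and G_2, yields final node states that carry both within-graph and cross-graph information.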
(3.1.3) inputting the hidden-state information h_i^(T) of all objects of the query scene semantic structure diagram G_1 and the hidden-state information of all objects of the candidate scene semantic structure diagram G_2 into the information aggregation structure shown in FIG. 7 to obtain the two vectors V_1 and V_2, where MLP_w, MLP_u and MLP_G are three multi-layer perceptrons, each consisting of a fully connected layer and a linear rectification function, σ is the logistic function, and g may be a summation function or a mean function; this example uses, but is not limited to, the summation function;
(3.1.4) calculating the similarity between the first vector V_1 and the second vector V_2, with a method that depends on the training task:
if the training task is a classification task, the first vector V_1 and the second vector V_2 are spliced together and input into a logistic-function layer of the graph neural network, which calculates the similarity between the two vectors;
if the training task is a regression task, the first vector V_1 and the second vector V_2 are spliced together and input into a fully connected layer of the graph neural network, which calculates the similarity between the two vectors.
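The aggregation and the final similarity computation can be sketched as follows. The gated summation mirrors the description of (3.1.3) (logistic gates σ, g = sum), but the single weight matrices are untrained placeholders for MLP_w, MLP_u and MLP_G, and cosine similarity is substituted for the trained logistic or fully connected head of (3.1.4), which would require learned parameters.

```python
import numpy as np

def sigmoid(x):
    """Logistic function."""
    return 1.0 / (1.0 + np.exp(-x))

def aggregate(h, w_gate, w_emb, w_out):
    """Gated aggregation in the spirit of (3.1.3): a logistic gate
    sigma(h @ w_gate) weights each node embedding h @ w_emb, the gated
    embeddings are summed over nodes (g = sum), and the pooled vector is
    projected by w_out (playing the role of MLP_G)."""
    gates = sigmoid(h @ w_gate)
    pooled = (gates * (h @ w_emb)).sum(axis=0)
    return pooled @ w_out

def similarity(v1, v2):
    """Cosine similarity between the two graph vectors V1 and V2."""
    return float(v1 @ v2 / (np.linalg.norm(v1) * np.linalg.norm(v2)))
```

Applying `aggregate` to the final node states of G_1 and G_2 gives V_1 and V_2, and the candidates in T' are then sorted by `similarity(V_1, V_2)` in descending order.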
And (3.2) sequencing the scene semantic structure images in the simplified candidate set T' according to the similarity value from large to small to obtain a final retrieval result.
The effect of the invention can be further illustrated by the following simulation experiments:
simulation experiment condition
The whole flow of this example is implemented in the Python programming language on a 64-bit Windows platform. The database is Johnson's real scene database containing 5000 scene semantic structure diagrams. Two query scene semantic structure diagrams are selected: the first contains 3 objects, "woman", "man", "bike", and two relations, "woman behind man", "man on bike"; the second contains 3 objects, "woman", "snowboard", "hat", and two relations, "man by snowboard", "man has hat".
Experimental details and results
Experiment 1, input the first query scene semantic structure diagram, use the method of the invention to search in the above-mentioned database, keep the first 4 search results, as shown in figure 8 (a). The right side of the arrow in fig. 8 (a) shows the 4 search results and the corresponding real pictures, and as can be seen from fig. 8 (a), the method of the invention can search the scene semantic structure diagram which is exactly matched with the query scene semantic structure diagram, and can also search the similar scene semantic structure diagram which is different from the query scene semantic structure diagram in object type and relation type.
Experiment 2, input the second query scene semantic structure diagram, use the method of the invention to search in the above-mentioned database, keep the first 4 search results, as shown in figure 8 (b). The right side of the arrow in fig. 8 (b) shows the 4 search results and the corresponding real pictures, and as can be seen from fig. 8 (b), the method of the invention can search the scene semantic structure diagram which is exactly matched with the query scene semantic structure diagram, and can also search the similar scene semantic structure diagram which is structurally different from the query scene semantic structure diagram.
Claims (5)
1. A scene semantic structure diagram retrieval method is characterized by comprising the following steps:
(1) Inputting a query scene semantic structure diagram, recalling a result related to the scene semantic structure diagram in a scene Jing Yuyi structure diagram database D to obtain a scene semantic structure diagram candidate set T:
(1a) Extracting 5 fixed substructures from a scene semantic structure diagram database D to obtain a substructures database D'; extracting the same 5 fixed substructures from the input scene semantic structure drawing to obtain an inquiry substructures set Q, wherein each substructures comprises the name of the scene semantic structure drawing, the substructures type and the object label;
(1b) For each substructure in the query substructure set Q, searching a substructure which can be matched with the substructure in the substructure database D', and obtaining a substructure matching pair formed by the query substructure and the matched substructure;
(1c) Selecting two substructure matching pairs, wherein all the matching substructures belong to the same scene semantic structure diagram, the two inquiry substructures have no identical object or the matching objects corresponding to the identical object are identical, and connecting one edge between the two substructure matching pairs to obtain a plurality of undirected graphs;
(1d) Solving the maximum cliques of the plurality of undirected graphs, and merging the substructure matching pairs in each maximum clique to obtain the matching results between the query scene semantic structure diagram and the scene semantic structure diagrams in the database D; these matching results form the candidate set T;
(2) Calculating the matching distance of each matching result in the candidate set T, sorting the matching results in ascending order of matching distance, and keeping the first k results to obtain a simplified candidate set T', wherein k is set according to actual requirements and takes a value of 50 or 100;
(3) Accurately calculating the similarity S between each scene semantic structure diagram in the simplified candidate set T' and the query scene semantic structure diagram using a graph neural network, and sorting in descending order of similarity to obtain the final retrieval result;
the similarity S between a scene semantic structure diagram in the simplified candidate set T' and the query scene semantic structure diagram is calculated with the graph neural network as follows:
(3a) Using two multi-layer perceptrons, encoding all object information o_i and relationship information rel_{i,j} of the query scene semantic structure diagram G_1 and of each candidate scene semantic structure diagram G_2 in T', obtaining the implicit state information of each object o_i and the encoded information e_{i,j} of each relation rel_{i,j};
(3b) At each time step, letting the implicit state information of each object o_i in G_1 and G_2 learn first-order neighborhood information within its own graph and object matching information between the two graphs, and iterating T = 5 times to obtain new implicit state information for each object o_i; (3c) aggregating the new implicit state information of G_1 and G_2 into two vectors V_1 and V_2;
(3d) Depending on the training task, calculating the similarity between the first vector V_1 and the second vector V_2 with different methods:
if the training task is a classification task, first splicing the first vector V_1 and the second vector V_2 together, then inputting the spliced vector into a logistic regression function layer of the graph neural network to calculate the similarity between the two vectors;
if the training task is a regression task, first splicing the first vector V_1 and the second vector V_2 together, then inputting the spliced vector into a fully-connected layer of the graph neural network to calculate the similarity between the two vectors.
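The two readout heads of step (3d) can be sketched in plain Python as follows. This is a minimal illustration, not the trained network: the weights and bias are hypothetical stand-ins for the learned parameters of the logistic-regression and fully-connected layers.

```python
import math

def splice(v1, v2):
    """Splice the two aggregated graph vectors V_1 and V_2 together."""
    return v1 + v2

def logistic_layer(v, weights, bias):
    """Classification head: a logistic (sigmoid) unit over the spliced
    vector, yielding a similarity score in (0, 1)."""
    z = sum(w * x for w, x in zip(weights, v)) + bias
    return 1.0 / (1.0 + math.exp(-z))

def fully_connected_layer(v, weights, bias):
    """Regression head: a single fully-connected unit producing an
    unbounded similarity score."""
    return sum(w * x for w, x in zip(weights, v)) + bias

# Toy usage -- V1, V2 would come from the aggregation step (3b)/(3c);
# the weights w and bias b here are arbitrary placeholder values.
V1, V2 = [0.2, -0.5, 0.7], [0.1, 0.4, -0.3]
v = splice(V1, V2)
w, b = [0.3, -0.1, 0.5, 0.2, 0.1, -0.4], 0.05
```

In the claimed method both heads consume the same spliced vector; only the final layer (and its training loss) differs between the classification and regression settings.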
2. The method of claim 1, wherein in (2) the matching distance of each matching result in the candidate set T is calculated by the scene semantic structure diagram matching distance metric function D_φ(G_1, G_2):
wherein G_1 = (O_1, E_1) is the query scene semantic structure diagram, O_1 the object set of G_1 and E_1 the relation set of G_1; G_2 = (O_2, E_2) is the matching scene semantic structure diagram, O_2 the object set of G_2 and E_2 the relation set of G_2; φ is a bi-directional mapping function between the objects of G_1 and G_2, representing the matching result between G_1 and G_2; the function D_φ consists of 5 parts weighted respectively by w_o, w_r, w_a, w_s and w_g, where:
the first part represents the difference of the matching result in object category, wherein c_i denotes the category of object o_i in G_1, V(c_i) the word vector of category c_i, φ(c_i) the category of the object matched to o_i, V(φ(c_i)) the word vector of category φ(c_i), and dg(o_i) the degree of object o_i in the graph;
the second part represents the difference of the matching result in relation category, wherein E_1(o_i, o_j)_p denotes the category of the p-th relation between objects o_i and o_j in G_1, V(E_1(o_i, o_j)_p) the word vector of that relation category, E_2(φ(o_i), φ(o_j))_q the category of the q-th relation between objects φ(o_i) and φ(o_j), and V(E_2(φ(o_i), φ(o_j))_q) the word vector of that relation category;
the third part represents the difference of the matching result in object attributes, wherein A_{i,p} denotes the category of the p-th attribute of object o_i, V(A_{i,p}) the word vector of that attribute category, φ(A_i)_q the category of the q-th attribute of the matched object φ(o_i), and V(φ(A_i)_q) the word vector of attribute category φ(A_i)_q;
the fourth part represents the structural difference of the matching result, wherein d(o_i, o_j) denotes the shortest distance between objects o_i and o_j in G_1, and d(φ(o_i), φ(o_j)) the shortest distance between objects φ(o_i) and φ(o_j) in G_2;
the fifth part represents the penalty for unmatched objects in G_1, where o_i ∈ O_1 − O'_1 ranges over the set of objects in G_1 for which no matching object is found in G_2.
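The five parts of D_φ can be sketched as follows. Claim 2 gives the part formulas only descriptively (the original equation images are not reproduced here), so this sketch makes two labeled assumptions: the parts combine as a plain weighted sum with weights w_o, w_r, w_a, w_s, w_g, and the object-category part accumulates degree-weighted L2 distances between category word vectors.

```python
import math

def l2(u, v):
    """Euclidean (L2) distance between two word vectors."""
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(u, v)))

def object_term(matches, word_vec, degree):
    """Object-category part: for each matched pair, the distance between
    the word vectors of the two categories, weighted by the degree
    dg(o_i) of the query object (how the degree enters is an assumption).
    `matches` holds (object_name, category, matched_category) triples."""
    return sum(degree[o] * l2(word_vec[c], word_vec[mc])
               for o, c, mc in matches)

def matching_distance(parts, weights):
    """D_phi(G_1, G_2) as a weighted combination of its five parts
    (object, relation, attribute, structure, unmatched-object); a plain
    weighted sum is assumed here."""
    return sum(w * p for w, p in zip(weights, parts))

# Toy usage with made-up 2-d word vectors and weights.
wv = {"cat": [1.0, 0.0], "dog": [0.0, 0.0]}
d_obj = object_term([("o1", "cat", "dog")], wv, {"o1": 2})
total = matching_distance([d_obj, 0.0, 0.0, 0.0, 0.0], [0.2] * 5)
```

The relation, attribute, structure and unmatched-object parts would be computed analogously from the quantities defined in the claim and passed into `matching_distance` as the remaining entries of `parts`.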
3. The method of claim 1, wherein the 5 substructures in (1a) are different shapes consisting of points and lines, wherein the points represent objects and the lines represent relationships between the objects.
4. The method of claim 1, wherein in (1b) retrieving the substructures in the substructure database D' that match a query substructure is accomplished as follows:
(1b1) Determining whether the query substructure has the same type as the substructure currently traversed in database D': if so, executing (1b2), otherwise executing (1b4);
(1b2) Judging whether the objects of the two substructures satisfy the following constraint:
L_2(V(c_i), V(φ(c_i))) ≤ th_o
wherein th_o and th_a are the manually set object matching threshold and attribute matching threshold, c_i denotes the category of object o_i, V(c_i) the word vector of category c_i, φ(c_i) the category of object φ(o_i), and V(φ(c_i)) the word vector of category φ(c_i);
executing (1b3) if the constraint is satisfied, otherwise executing (1b4);
(1b3) Judging whether the relations of the two substructures satisfy the constraint shown in the following formula:
wherein th_r is the manually set relation matching threshold, o_i and o_j are objects in the query substructure, φ(o_i) and φ(o_j) are objects in the database substructure to be matched, E_1(o_i, o_j)_p denotes the category of the p-th relation between objects o_i and o_j, V(E_1(o_i, o_j)_p) the word vector of that relation category, E_2(φ(o_i), φ(o_j))_q the category of the q-th relation between objects φ(o_i) and φ(o_j), and V(E_2(φ(o_i), φ(o_j))_q) the word vector of that relation category;
if the constraint is satisfied, obtaining a substructure matching pair, adding it to the substructure matching pair set, and executing (1b4); otherwise directly executing (1b4);
(1b4) Enumerating the next substructure in database D': if the traversal is complete, ending the search; otherwise returning to (1b1).
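Steps (1b1)–(1b4) amount to a filtered linear scan of D'. The sketch below assumes a hypothetical substructure representation (dicts with `type`, `objects` and `relations` category labels); the claim does not fix one. The relation test mirrors the object test (L2 distance between category word vectors under a threshold), an assumption where the claim's relation formula is not reproduced.

```python
import math

def l2(u, v):
    """Euclidean distance between two word vectors."""
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(u, v)))

def match_substructure(query, db_subs, word_vec, th_o=0.5, th_r=0.5):
    """Traverse the substructure database D' and collect matching pairs.
    th_o / th_r are the object and relation matching thresholds."""
    pairs = []
    for cand in db_subs:                             # (1b4) enumerate D'
        if cand["type"] != query["type"]:            # (1b1) same type?
            continue
        if any(l2(word_vec[c], word_vec[m]) > th_o   # (1b2) object constraint
               for c, m in zip(query["objects"], cand["objects"])):
            continue
        if any(l2(word_vec[r], word_vec[m]) > th_r   # (1b3) relation constraint
               for r, m in zip(query["relations"], cand["relations"])):
            continue
        pairs.append((query, cand))                  # matching pair found
    return pairs

# Toy usage with made-up word vectors and two database substructures.
wv = {"cat": [1.0, 0.0], "dog": [0.0, 1.0], "on": [0.5, 0.5]}
q = {"type": "chain", "objects": ["cat", "dog"], "relations": ["on"]}
db = [
    {"type": "chain", "objects": ["cat", "dog"], "relations": ["on"]},
    {"type": "star", "objects": ["cat", "dog"], "relations": ["on"]},
]
```

Because word-vector distances (rather than exact label equality) gate the match, semantically close object and relation categories can still form matching pairs, which is what lets the recall stage return structurally similar, non-identical diagrams.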
5. The method of claim 1, wherein the maximum cliques of the undirected graphs in (1d) are obtained by the Bron-Kerbosch algorithm, specifically implemented as follows:
(1d1) Constructing 4 sets R, P, X, M and a counter n recording the number of nodes of the largest clique found so far: the R set records the points currently added to the clique; the P set records the points that may still be added, i.e. the points connected by an edge to every node in R; the X set records the points already added to some maximal clique; the M set is the finally returned set of maximum cliques. Initially R, X and M are empty sets, P is the set of all nodes in the undirected graph, and n is 0; executing (1d2);
(1d2) Taking each node of the P set in turn; assuming the currently taken node is v, adding v to the R set, updating the P set to the intersection of the original P set and the set of nodes connected to v, likewise updating the X set to the intersection of the original X set and the set of nodes connected to v, and executing (1d3);
(1d3) Judging whether both the P set and the X set are empty: if so, executing (1d4); otherwise executing (1d5);
(1d4) Comparing n with the number of nodes in R:
if n is less than the number of nodes in R, updating n to the number of nodes in R, emptying the M set, adding the R set into the M set, and executing (1 d 5);
if n is equal to the number of nodes in R, adding the R set to the M set, and then executing (1 d 5);
if n is greater than the number of nodes in R, then directly executing (1 d 5);
(1d5) Deleting node v from P while adding it to X, and returning to (1d2).
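Steps (1d1)–(1d5) describe Bron-Kerbosch without pivoting, modified to keep only the largest maximal cliques. A minimal sketch, assuming the graph is given as an adjacency dict mapping each node to its neighbor set:

```python
def maximum_cliques(adj):
    """Bron-Kerbosch (no pivoting) returning all maximum cliques, following
    steps (1d1)-(1d5): R = current clique, P = candidate nodes, X = nodes
    already explored, M = result list, n = size of largest clique so far."""
    state = {"n": 0, "M": []}                    # (1d1) counter n and set M

    def expand(R, P, X):
        if not P and not X:                      # (1d3) R is a maximal clique
            if state["n"] < len(R):              # (1d4) strictly larger:
                state["n"] = len(R)              #   reset M to this clique
                state["M"] = [set(R)]
            elif state["n"] == len(R):           # same size: keep it too
                state["M"].append(set(R))
            return                               # (smaller cliques dropped)
        for v in list(P):                        # (1d2) try each candidate v
            expand(R | {v}, P & adj[v], X & adj[v])
            P.remove(v)                          # (1d5) move v from P to X
            X.add(v)

    # (1d1) initially R and X empty, P = all nodes of the undirected graph
    expand(set(), set(adj), set())
    return state["M"]
```

For example, on a triangle {0, 1, 2} with a pendant node 3 attached to node 2, the only maximum clique is {0, 1, 2}; the maximal clique {2, 3} is discarded in step (1d4) because it is smaller than the best found.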
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202010644017.6A CN111814658B (en) | 2020-07-07 | 2020-07-07 | Scene semantic structure diagram retrieval method based on semantics |
Publications (2)
Publication Number | Publication Date |
---|---|
CN111814658A CN111814658A (en) | 2020-10-23 |
CN111814658B true CN111814658B (en) | 2024-02-09 |
Family
ID=72841795
Families Citing this family (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2022130509A1 (en) * | 2020-12-15 | 2022-06-23 | 日本電信電話株式会社 | Object detection device, object detection method, and object detection program |
CN112788239A (en) * | 2021-01-27 | 2021-05-11 | 维沃移动通信(杭州)有限公司 | Shooting method and device and electronic equipment |
CN113034592B (en) * | 2021-03-08 | 2021-08-31 | 西安电子科技大学 | Three-dimensional scene target detection modeling and detection method based on natural language description |
CN113468770B (en) * | 2021-09-02 | 2021-11-12 | 成都新西旺自动化科技有限公司 | Method and system for generating machine vision formula |
Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN102867192A (en) * | 2012-09-04 | 2013-01-09 | 北京航空航天大学 | Scene semantic shift method based on supervised geodesic propagation |
CN105718555A (en) * | 2016-01-19 | 2016-06-29 | 中国人民解放军国防科学技术大学 | Hierarchical semantic description based image retrieving method |
WO2019007041A1 (en) * | 2017-07-06 | 2019-01-10 | 北京大学深圳研究生院 | Bidirectional image-text retrieval method based on multi-view joint embedding space |
CN110188168A (en) * | 2019-05-24 | 2019-08-30 | 北京邮电大学 | Semantic relation recognition methods and device |
CN111291212A (en) * | 2020-01-24 | 2020-06-16 | 复旦大学 | Zero sample sketch image retrieval method and system based on graph convolution neural network |
Family Cites Families (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US8301633B2 (en) * | 2007-10-01 | 2012-10-30 | Palo Alto Research Center Incorporated | System and method for semantic search |
US10452923B2 (en) * | 2017-11-28 | 2019-10-22 | Visual Semantics, Inc. | Method and apparatus for integration of detected object identifiers and semantic scene graph networks for captured visual scene behavior estimation |
Non-Patent Citations (3)
Title |
---|
代具亭; 汤心溢; 刘鹏; 邵保泰. Scene semantic segmentation network based on color-depth images and deep learning. Science Technology and Engineering, 2018, No. 20, full text. *
宋腾义; 汪闽. Multi-factor spatial scene similarity matching model and its applications. Journal of Image and Graphics, 2012, No. 10, full text. *
张玲玉; 尹鸿峰. Research on knowledge graph query based on OAN. Software, 2018, No. 01, full text. *
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN111814658B (en) | Scene semantic structure diagram retrieval method based on semantics | |
CN115934990B (en) | Remote sensing image recommendation method based on content understanding | |
CN113241128B (en) | Molecular property prediction method based on molecular space position coding attention neural network model | |
CN108108657A (en) | A kind of amendment local sensitivity Hash vehicle retrieval method based on multitask deep learning | |
CN114398491A (en) | Semantic segmentation image entity relation reasoning method based on knowledge graph | |
CN111931505A (en) | Cross-language entity alignment method based on subgraph embedding | |
Stumm et al. | Probabilistic place recognition with covisibility maps | |
Rad et al. | Image annotation using multi-view non-negative matrix factorization with different number of basis vectors | |
CN108170823B (en) | Hand-drawn interactive three-dimensional model retrieval method based on high-level semantic attribute understanding | |
CN113032613A (en) | Three-dimensional model retrieval method based on interactive attention convolution neural network | |
CN111400572A (en) | Content safety monitoring system and method for realizing image feature recognition based on convolutional neural network | |
CN112035689A (en) | Zero sample image hash retrieval method based on vision-to-semantic network | |
Qiu et al. | A survey of recent advances in CNN-based fine-grained visual categorization | |
CN116662468A (en) | Urban functional area identification method and system based on geographic object space mode characteristics | |
CN116204673A (en) | Large-scale image retrieval hash method focusing on relationship among image blocks | |
CN113240046A (en) | Knowledge-based multi-mode information fusion method under visual question-answering task | |
Prasomphan | Toward Fine-grained Image Retrieval with Adaptive Deep Learning for Cultural Heritage Image. | |
Nath et al. | Deep learning models for content-based retrieval of construction visual data | |
CN116935329B (en) | Weak supervision text pedestrian retrieval method and system for class-level comparison learning | |
CN115797795B (en) | Remote sensing image question-answer type retrieval system and method based on reinforcement learning | |
CN117475228A (en) | Three-dimensional point cloud classification and segmentation method based on double-domain feature learning | |
CN115934966A (en) | Automatic labeling method based on remote sensing image recommendation information | |
CN116797821A (en) | Generalized zero sample image classification method based on fusion visual information | |
Hsieh et al. | Region-based image retrieval | |
Ranjbar et al. | Scene novelty prediction from unsupervised discriminative feature learning |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||