CN117808084A - Pre-selection method based on graph reduction brief representation and graph neural network - Google Patents
Pre-selection method based on graph reduction brief representation and graph neural network
- Publication number
- CN117808084A CN117808084A CN202311490926.9A CN202311490926A CN117808084A CN 117808084 A CN117808084 A CN 117808084A CN 202311490926 A CN202311490926 A CN 202311490926A CN 117808084 A CN117808084 A CN 117808084A
- Authority
- CN
- China
- Prior art keywords
- graph
- node
- neural network
- information
- candidate
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
- 238000013528 artificial neural network Methods 0.000 title claims abstract description 27
- 238000010187 selection method Methods 0.000 title claims abstract description 16
- 230000009467 reduction Effects 0.000 title claims abstract description 11
- 239000013598 vector Substances 0.000 claims abstract description 44
- 238000003062 neural network model Methods 0.000 claims abstract description 17
- 230000007246 mechanism Effects 0.000 claims abstract description 16
- 238000010586 diagram Methods 0.000 claims abstract description 14
- 238000011176 pooling Methods 0.000 claims abstract description 8
- 230000006870 function Effects 0.000 claims description 19
- 238000000034 method Methods 0.000 claims description 17
- 230000002776 aggregation Effects 0.000 claims description 7
- 238000004220 aggregation Methods 0.000 claims description 7
- 238000012546 transfer Methods 0.000 claims description 5
- 230000004931 aggregating effect Effects 0.000 claims description 3
- 230000008569 process Effects 0.000 claims description 3
- 239000002243 precursor Substances 0.000 claims 3
- 238000013473 artificial intelligence Methods 0.000 abstract description 4
- 125000002015 acyclic group Chemical group 0.000 description 3
- 238000010835 comparative analysis Methods 0.000 description 2
- 230000000694 effects Effects 0.000 description 2
- 239000010410 layer Substances 0.000 description 2
- 238000012360 testing method Methods 0.000 description 2
- 230000003044 adaptive effect Effects 0.000 description 1
- 238000004458 analytical method Methods 0.000 description 1
- 238000013459 approach Methods 0.000 description 1
- 230000009286 beneficial effect Effects 0.000 description 1
- 238000013527 convolutional neural network Methods 0.000 description 1
- 238000012217 deletion Methods 0.000 description 1
- 230000037430 deletion Effects 0.000 description 1
- 238000013461 design Methods 0.000 description 1
- 238000011161 development Methods 0.000 description 1
- 238000005516 engineering process Methods 0.000 description 1
- 238000002474 experimental method Methods 0.000 description 1
- 238000004880 explosion Methods 0.000 description 1
- 238000010801 machine learning Methods 0.000 description 1
- 238000010606 normalization Methods 0.000 description 1
- 230000000306 recurrent effect Effects 0.000 description 1
- 238000011160 research Methods 0.000 description 1
- 230000006403 short-term memory Effects 0.000 description 1
- 239000002356 single layer Substances 0.000 description 1
- 238000012549 training Methods 0.000 description 1
- 238000012795 verification Methods 0.000 description 1
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N5/00—Computing arrangements using knowledge-based models
- G06N5/01—Dynamic search techniques; Heuristics; Dynamic trees; Branch-and-bound
- G06N5/013—Automatic theorem proving
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F18/00—Pattern recognition
- G06F18/20—Analysing
- G06F18/24—Classification techniques
- G06F18/241—Classification techniques relating to the classification model, e.g. parametric or non-parametric approaches
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/04—Architecture, e.g. interconnection topology
- G06N3/042—Knowledge-based neural networks; Logical representations of neural networks
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/04—Architecture, e.g. interconnection topology
- G06N3/045—Combinations of networks
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/04—Architecture, e.g. interconnection topology
- G06N3/0464—Convolutional networks [CNN, ConvNet]
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/08—Learning methods
- G06N3/09—Supervised learning
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Physics & Mathematics (AREA)
- Data Mining & Analysis (AREA)
- Artificial Intelligence (AREA)
- Evolutionary Computation (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Mathematical Physics (AREA)
- Computing Systems (AREA)
- Computational Linguistics (AREA)
- Software Systems (AREA)
- Life Sciences & Earth Sciences (AREA)
- Biophysics (AREA)
- Biomedical Technology (AREA)
- Health & Medical Sciences (AREA)
- General Health & Medical Sciences (AREA)
- Molecular Biology (AREA)
- Evolutionary Biology (AREA)
- Computer Vision & Pattern Recognition (AREA)
- Bioinformatics & Computational Biology (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
Abstract
The invention relates to the technical field of artificial intelligence and discloses a premise selection method based on graph reduction representation and a graph neural network, comprising the following steps. Step 1: obtain a simplified first-order logic formula graph by identifying and deleting consecutive repeated quantifiers. Step 2: based on the simplified formula graph, propose a term-walk graph neural network model with an attention mechanism; following the term-walk pattern, the model aggregates the information of nodes located at the upper, middle and lower positions of term-walk triples, introduces an attention mechanism to compute the term-walk feature weights of each node, combines the weights with the node information to generate new node embedding vectors, and obtains the final formula graph feature vector through global average pooling. Step 3: input the graph feature vectors of the candidate premises and the given conjecture into a binary classifier, thereby classifying the candidate premises. The invention can perform premise selection effectively.
Description
Technical Field
The present invention relates to the field of artificial intelligence, and in particular to a premise selection method based on graph reduction representation and a graph neural network.
Background Art
Automated theorem provers (ATPs) are a core and cutting-edge direction in the field of artificial intelligence. As an important component of artificial intelligence systems, they are widely used in expert systems, circuit design, compilers, software verification and other fields. An ATP first formalizes a conjecture and its premises as logical formulas and then feeds these formulas into the prover, so that the conjecture is deduced automatically from the premises. ATPs prove new problems by iteratively searching all pending clause sets in a problem library, which leads to an exponential explosion of the search space on larger libraries. Premise selection offers a new way to address this problem: before the logical formulas are passed to the ATP, the formulas that are likely to help prove the conclusion of the given problem are identified and selected.
An effective premise selection method can greatly improve the capability of ATPs. Early premise selection methods were mainly hand-designed heuristics based on symbolic comparative analysis: symbolic and structural features such as clause depth and symbol counts were extracted from the input formulas and compared in order to screen, from the premise set, the premises most relevant to the conclusion. Such methods are limited to hand-designed features. With the growth of computing power, machine learning methods have become an effective alternative for premise selection. They naturally cast the problem as classification or ranking and capture the deep characteristics of logical formulas better than the earlier methods; examples include convolutional neural networks, long short-term memory networks and gated recurrent neural networks.
On this basis, because logical formulas can be naturally represented as graphs, and features that incorporate graph topology can fully reflect the characteristics of logical formulas, the combination of graph neural networks and automated theorem proving has become a popular research topic. Although existing premise selection methods based on graph neural networks can improve premise classification to a certain extent, they still have some shortcomings:
(1) Logical formula graphs contain rich syntactic and semantic properties. Most premise selection methods ignore the influence that different graph representations of logical formulas have on the graph neural network model, so the network cannot capture the internal and external information of the formulas well;
(2) Existing graph neural network models usually generate features that retain more formula information by aggregating information from neighbouring or other nodes. These features often contain a large amount of node information, so the generated formula features may be affected by unimportant information on the graph and thus fail to adequately represent the logical formula.
Summary of the Invention
The object of the present invention is to provide a premise selection method based on graph reduction representation and a graph neural network, which can assign different weights to the nodes of the graph and thereby better encode the graph features of first-order logic formulas.
The premise selection method based on graph reduction representation and a graph neural network according to the present invention comprises the following steps:
Step 1: obtain a simplified first-order logic formula graph by identifying and deleting consecutive repeated quantifiers;
Step 2: based on the simplified formula graph, propose a term-walk graph neural network model with an attention mechanism; following the term-walk pattern, the model aggregates the information of the nodes located at the upper, middle and lower positions of term-walk triples, introduces an attention mechanism to compute the term-walk feature weights of each node, combines the weights with the node information to generate new node embedding vectors, and obtains the final formula graph feature vector through global average pooling;
Step 3: input the graph feature vectors of the candidate premises and the given conjecture into a binary classifier, thereby classifying the candidate premises.
Preferably, in Step 1, on the basis of directed acyclic graphs (DAGs), quantifiers that are identical and consecutive are merged, so that a simplified first-order logic formula graph is obtained.
Preferably, in Step 2, the term-walk graph neural network model with an attention mechanism is specified as follows:
(1) Input a graph G = (V, E), where V is the set of all nodes of the graph and E is the set of all edges. Each node v ∈ V is first assigned an initial embedding xv, and the message passing process is then realized through the node state vectors hv(k) generated over k rounds of iteration, where k ∈ {1,…,K}:
hv(0) = FV(xv)
where hv(0) ∈ R^dh is a fixed-size initial state vector, dh is the output dimension of the node embedding vectors, and FV is a lookup table;
(2) Node information is gathered according to the term-walk pattern. For each node v of the input graph G = (V, E), let Tu(v), Tm(v) and Tl(v) denote the sets of term-walk features in which node v is located at the upper, middle and lower position respectively:
Tu(v) = {(v,u,w) | (v,u),(u,w) ∈ E},
Tm(v) = {(u,v,w) | (u,v),(v,w) ∈ E},
Tl(v) = {(u,w,v) | (u,w),(w,v) ∈ E}
where u and w are arbitrary nodes of the graph;
To distinguish occurrences of node v at the different positions of the term-walk features, the state vectors in the triples of Tu(v), Tm(v) and Tl(v) are concatenated, and the model then gathers the information of node v from the different positions with respect to Tu(v), Tm(v) and Tl(v) and the aggregation functions Fu, Fm and Fl, for example for the upper position
tvu(k) = (1 / |Tu(v)|) Σ(v,u,w)∈Tu(v) Fu([hv(k-1); hu(k-1); hw(k-1)])
and analogously tvm(k) and tvl(k) for the middle and lower positions;
where the semicolon in the formula denotes the concatenation of the different node state vectors, and |Tu(v)|, |Tm(v)| and |Tl(v)| denote the numbers of triples in the sets Tu(v), Tm(v) and Tl(v) respectively;
The model introduces an attention mechanism to balance the node state information hv(k-1) with the term-walk feature information tvu(k), tvm(k) and tvl(k) of the node. To reduce the complexity of the model structure, tvu(k), tvm(k) and tvl(k) are used directly as the attention scores of the term-walk feature information with respect to the node information; these scores are normalized with the softmax function to obtain the attention weights αvu, αvm and αvl, and the aggregation functions FU, FM and FL finally yield the balanced node information t'vu(k), t'vm(k) and t'vl(k);
where αvu, αvm and αvl denote the attention weights of the term-walk feature information tvu(k), tvm(k) and tvl(k) with respect to the node information hv(k-1), and vu, vm and vl denote the nodes located at the upper, middle and lower positions of the term-walk triples respectively;
Finally, node v gathers the balanced node information t'vu(k), t'vm(k) and t'vl(k) from the sets Tu(v), Tm(v) and Tl(v), which is summarized into the total node information mv(k);
(3) The node vector hv is transferred and updated using the adjusted total node information mv(k) from the sets Tu(v), Tm(v) and Tl(v) together with the node state vector hv(k-1) of the previous step:
hv(k) = Fsum([hv(k-1); mv(k)])
where Fsum is the node information transfer function;
(4) An average pooling operation AvgPool is applied to all nodes of the logical formula graph, and the final formula graph embedding vector hG is:
hG = AvgPool({hv(K) | v ∈ V})
where AvgPool denotes average pooling.
Preferably, in Step 3, the graph embedding vector pair (hp, hc) of a candidate premise and the given conjecture is input into the classification function Fclass to obtain the usefulness score of the candidate premise under the conjecture:
z = Fclass([hp; hc])
where z ∈ R^2 represents the scores of the candidate premise being useful and useless for the conjecture;
The model normalizes the useful and useless scores of the candidate premise with the softmax function and divides the candidate premises according to the score values:
ŷi = exp(zi) / Σl exp(zl)
where ŷ denotes the normalized useful and useless scores of the premise, zi is the i-th element of z, and zl is the l-th element of z; the useful score and the useless score of a candidate premise correspond to different label attributes, so the label attribute of the candidate premise is determined from the division result, compared with the existing label, and classification is thereby achieved.
The beneficial effects of the present invention are as follows:
1) The simplified first-order logic formula graph representation proposed by the invention, which deletes repeated quantifiers, prevents different graph representations of logical formulas from affecting the graph neural network model, so that the graph neural network can well capture the internal and external information of the logical formulas;
2) The invention proposes a term-walk graph neural network model with an attention mechanism and applies it to the premise selection problem. The model prevents the formula features in the graph neural network from being affected by unimportant information on the graph and assigns different weights to the nodes of the graph, thereby better encoding the graph features of first-order logic formulas.
Brief Description of the Drawings
Figure 1 is a flow chart of a premise selection method based on graph reduction representation and a graph neural network according to the embodiment.
Detailed Description
In order to further understand the content of the present invention, the invention is described in detail with reference to the accompanying drawing and the embodiment. It should be understood that the embodiment only explains the invention and does not limit it.
Embodiment
As shown in Figure 1, this embodiment provides a premise selection method based on graph reduction representation and a graph neural network, comprising the following steps:
Step 1: obtain a simplified first-order logic formula graph by identifying and deleting consecutive repeated quantifiers; the first-order logic formula graphs include first-order logic premise formula graphs and first-order logic conjecture formula graphs;
Most common representations of logical formulas as graphs are extended directed acyclic graphs (DAGs). The general steps are: 1) convert the logical formula into a syntax parse tree similar to that of a programming language; 2) merge identical sub-expressions and leaf nodes of the parse tree; 3) rename the variables in the logical formula.
In order to reduce the size of the graph data and let the DAGs carry more logical properties, the invention proposes directed acyclic graphs based on the deletion of repeated quantifiers (Simplified-DAGs) to represent first-order logic formulas. The Simplified-DAG operation is equivalent to merging, on top of the original DAGs, quantifiers that are identical and consecutive, as illustrated by the sketch below.
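The following is a minimal, illustrative sketch of this quantifier-merging step; the node representation, field names and example formula are assumptions made for illustration and are not taken from the patent itself.

```python
# Illustrative sketch (assumed data structures): a formula tree/DAG node keeps its
# symbol (e.g. '!' for a universal quantifier, '?' for an existential one),
# its bound variables and its children. Identical, consecutive quantifiers
# are merged into a single node that binds the union of the variables.
from dataclasses import dataclass, field
from typing import List

QUANTIFIERS = {"!", "?"}  # universal / existential

@dataclass
class Node:
    symbol: str
    variables: List[str] = field(default_factory=list)
    children: List["Node"] = field(default_factory=list)

def merge_repeated_quantifiers(node: Node) -> Node:
    """Return a simplified graph in which identical, consecutive quantifiers are merged."""
    # Simplify the children first (bottom-up traversal).
    node.children = [merge_repeated_quantifiers(c) for c in node.children]
    if node.symbol in QUANTIFIERS and len(node.children) == 1:
        child = node.children[0]
        if child.symbol == node.symbol:        # same quantifier, directly nested
            node.variables += child.variables  # merge the bound-variable lists
            node.children = child.children     # skip the redundant quantifier node
    return node

# Example: ! [X] : ! [Y] : p(X, Y)  ->  ! [X, Y] : p(X, Y)
formula = Node("!", ["X"], [Node("!", ["Y"], [Node("p", [], [Node("X"), Node("Y")])])])
simplified = merge_repeated_quantifiers(formula)
print(simplified.symbol, simplified.variables)  # ! ['X', 'Y']
```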
Step 2: based on the simplified logical formula graph, a term-walk graph neural network model with an attention mechanism (Attention-TW-GNN) is proposed; following the term-walk pattern, the model aggregates the information of the nodes located at the upper, middle and lower positions of term-walk triples, introduces an attention mechanism to compute the term-walk feature weights of each node, combines the weights with the node information to generate new node embedding vectors, and obtains the final formula graph feature vector through global average pooling;
In Step 2, the term-walk graph neural network model with an attention mechanism follows the workflow of graph neural network models: it iteratively updates the node embedding information and obtains the final first-order logic formula graph vector through a graph node vector initialization stage, a graph node information aggregation stage, a graph node information transfer stage and a graph feature readout stage (graph pooling). Specifically:
(1) Input a graph G = (V, E), where V is the set of all nodes of the graph and E is the set of all edges. Each node v ∈ V is first assigned an initial embedding xv, and the message passing process is then realized through the node state vectors hv(k) generated over k rounds of iteration, where k ∈ {1,…,K}:
hv(0) = FV(xv)
where hv(0) ∈ R^dh is a fixed-size initial state vector, dh is the output dimension of the node embedding vectors, and FV is a lookup table;
(2) Node information is gathered according to the term-walk pattern. For each node v of the input graph G = (V, E), let Tu(v), Tm(v) and Tl(v) denote the sets of term-walk features in which node v is located at the upper, middle and lower position respectively:
Tu(v) = {(v,u,w) | (v,u),(u,w) ∈ E},
Tm(v) = {(u,v,w) | (u,v),(v,w) ∈ E},
Tl(v) = {(u,w,v) | (u,w),(w,v) ∈ E}
where u and w are arbitrary nodes of the graph;
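The three term-walk sets defined above can be enumerated directly from the edge list; the following sketch is only illustrative, and the adjacency-dictionary representation is an assumption.

```python
from collections import defaultdict

def term_walk_sets(nodes, edges):
    """Enumerate T_u(v), T_m(v), T_l(v) for every node v from a directed edge list."""
    succ = defaultdict(list)                     # successors of each node
    for (a, b) in edges:
        succ[a].append(b)
    # all term walks (a, b, c): directed paths of length two
    walks = [(a, b, c) for (a, b) in edges for c in succ[b]]
    T_u = {v: [t for t in walks if t[0] == v] for v in nodes}   # v at the upper position
    T_m = {v: [t for t in walks if t[1] == v] for v in nodes}   # v at the middle position
    T_l = {v: [t for t in walks if t[2] == v] for v in nodes}   # v at the lower position
    return T_u, T_m, T_l

# Example on a tiny graph:  0 -> 1 -> 2
T_u, T_m, T_l = term_walk_sets([0, 1, 2], [(0, 1), (1, 2)])
print(T_u[0], T_m[1], T_l[2])   # [(0, 1, 2)] [(0, 1, 2)] [(0, 1, 2)]
```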
To distinguish occurrences of node v at the different positions of the term-walk features, the state vectors in the triples of Tu(v), Tm(v) and Tl(v) are concatenated, and the model then gathers the information of node v from the different positions with respect to Tu(v), Tm(v) and Tl(v) and the aggregation functions Fu, Fm and Fl, for example for the upper position
tvu(k) = (1 / |Tu(v)|) Σ(v,u,w)∈Tu(v) Fu([hv(k-1); hu(k-1); hw(k-1)])
and analogously tvm(k) and tvl(k) for the middle and lower positions;
where the semicolon in the formula denotes the concatenation of the different node state vectors, and |Tu(v)|, |Tm(v)| and |Tl(v)| denote the numbers of triples in the sets Tu(v), Tm(v) and Tl(v) respectively;
The model introduces an attention mechanism to balance the node state information hv(k-1) with the term-walk feature information tvu(k), tvm(k) and tvl(k) of the node. To reduce the complexity of the model structure, tvu(k), tvm(k) and tvl(k) are used directly as the attention scores (contributions) of the term-walk feature information with respect to the node information; these scores are normalized with the softmax function to obtain the attention weights αvu, αvm and αvl, and the aggregation functions FU, FM and FL finally yield the balanced node information t'vu(k), t'vm(k) and t'vl(k);
where αvu, αvm and αvl denote the attention weights of the term-walk feature information tvu(k), tvm(k) and tvl(k) with respect to the node information hv(k-1), and vu, vm and vl denote the nodes located at the upper, middle and lower positions of the term-walk triples respectively;
Finally, node v gathers the balanced node information t'vu(k), t'vm(k) and t'vl(k) from the sets Tu(v), Tm(v) and Tl(v), which is summarized into the total node information mv(k);
(3) The node vector hv is transferred and updated using the adjusted total node information mv(k) from the sets Tu(v), Tm(v) and Tl(v) together with the node state vector hv(k-1) of the previous step:
hv(k) = Fsum([hv(k-1); mv(k)])
where Fsum is the node information transfer function; Fsum is a simple single-layer MLP.
(4) An average pooling operation (AvgPool) is applied to all nodes of the logical formula graph, and the final formula graph embedding vector hG is:
hG = AvgPool({hv(K) | v ∈ V})
where AvgPool denotes average pooling. A compact sketch of one message-passing round and the readout is given below.
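The following PyTorch sketch puts the aggregation, attention, update and readout stages of one iteration together. It is a rough interpretation of the description above; the layer dimensions, the way the attention scores are reduced to scalars, and all tensor and module names are assumptions rather than details fixed by the patent.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class AttentionTWLayer(nn.Module):
    """One round of term-walk aggregation with attention, followed by the node update."""

    def __init__(self, d_h: int):
        super().__init__()
        # F_u, F_m, F_l: aggregate concatenated triple states (3*d_h) into d_h features.
        self.F_u = nn.Linear(3 * d_h, d_h)
        self.F_m = nn.Linear(3 * d_h, d_h)
        self.F_l = nn.Linear(3 * d_h, d_h)
        # F_U, F_M, F_L: combine the attention-weighted term-walk feature with the node state.
        self.F_U = nn.Linear(2 * d_h, d_h)
        self.F_M = nn.Linear(2 * d_h, d_h)
        self.F_L = nn.Linear(2 * d_h, d_h)
        # F_sum: node information transfer function (single-layer MLP).
        self.F_sum = nn.Linear(2 * d_h, d_h)

    def _position_feature(self, h, triples, pos, agg):
        """Mean of agg(...) over all triples containing each node at position `pos`."""
        t = torch.zeros_like(h)
        count = torch.zeros(h.size(0), 1, device=h.device)
        for (a, b, c) in triples:
            v = (a, b, c)[pos]
            msg = agg(torch.cat([h[a], h[b], h[c]], dim=-1))
            t[v] = t[v] + msg
            count[v] += 1
        return t / count.clamp(min=1)

    def forward(self, h, triples):
        # Term-walk features of every node at the upper / middle / lower position.
        t_u = self._position_feature(h, triples, 0, self.F_u)
        t_m = self._position_feature(h, triples, 1, self.F_m)
        t_l = self._position_feature(h, triples, 2, self.F_l)
        # Attention: use the (scalar-reduced) term-walk features as scores and
        # normalize them across the three positions with softmax.
        scores = torch.stack([t_u.sum(-1), t_m.sum(-1), t_l.sum(-1)], dim=-1)
        alpha = F.softmax(scores, dim=-1)                     # (num_nodes, 3)
        b_u = self.F_U(torch.cat([alpha[:, 0:1] * t_u, h], dim=-1))
        b_m = self.F_M(torch.cat([alpha[:, 1:2] * t_m, h], dim=-1))
        b_l = self.F_L(torch.cat([alpha[:, 2:3] * t_l, h], dim=-1))
        m = b_u + b_m + b_l                                   # total node information m_v
        # Update: transfer the previous state and the total information through F_sum.
        return self.F_sum(torch.cat([h, m], dim=-1))

d_h = 64
layer = AttentionTWLayer(d_h)
h0 = torch.randn(3, d_h)                  # states of 3 nodes
h1 = layer(h0, [(0, 1, 2)])               # one term-walk triple 0 -> 1 -> 2
h_G = h1.mean(dim=0)                      # AvgPool readout: formula graph embedding
print(h_G.shape)                          # torch.Size([64])
```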
Step 3: input the graph feature vectors of the candidate premises and the given conjecture into a binary classifier, thereby classifying the candidate premises.
The graph embedding vector pair (hp, hc) of a candidate premise and the given conjecture is input into the classification function Fclass to obtain the usefulness score of the candidate premise under the conjecture:
z = Fclass([hp; hc])
where z ∈ R^2 represents the scores of the candidate premise being useful and useless for the conjecture;
The model normalizes the useful and useless scores of the candidate premise with the softmax function and divides the candidate premises according to the score values:
ŷi = exp(zi) / Σl exp(zl)
where ŷ denotes the normalized useful and useless scores of the premise, zi is the i-th element of z, and zl is the l-th element of z; the useful score and the useless score of a candidate premise correspond to different label attributes (1 or 0), so the label attribute of the candidate premise is determined from the division result (the larger score), compared with the existing label, and classification is thereby achieved.
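A rough sketch of this binary classification stage follows; the hidden dimension, activation and layer names are assumptions (the embodiment below only specifies that Fclass uses two fully connected layers with batch normalization and softmax).

```python
import torch
import torch.nn as nn

class PremiseClassifier(nn.Module):
    """F_class: scores a (premise, conjecture) embedding pair as useful / useless."""

    def __init__(self, d_h: int, d_hidden: int = 128):
        super().__init__()
        self.fc1 = nn.Linear(2 * d_h, d_hidden)   # first FC layer on [h_p; h_c]
        self.bn = nn.BatchNorm1d(d_hidden)        # batch normalization
        self.fc2 = nn.Linear(d_hidden, 2)         # second FC layer: useful / useless scores

    def forward(self, h_p, h_c):
        z = self.fc2(torch.relu(self.bn(self.fc1(torch.cat([h_p, h_c], dim=-1)))))
        return torch.softmax(z, dim=-1)            # normalized usefulness scores

clf = PremiseClassifier(d_h=64)
h_p, h_c = torch.randn(8, 64), torch.randn(8, 64)  # a batch of 8 premise/conjecture pairs
scores = clf(h_p, h_c)                              # shape: (8, 2)
labels = scores.argmax(dim=-1)                      # index of the larger score (label convention assumed)
```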
Experiments
(1) Datasets
This embodiment builds two datasets based on the MPTP2078 benchmark, an original (MPTP) dataset and a conjunctive normal form (CNF) dataset, to test the predictive classification performance of the model. The MPTP dataset contains the logical formulas of the MPTP2078 benchmark, and the CNF dataset contains the conjunctive normal forms corresponding to those formulas. The benchmark contains 1469 conjectures and 24087 premises used to prove these conjectures.
This embodiment constructs MPTP datasets for training, validation and testing (40996, 13990 and 14068 samples). Each sample has the form of a triple (premise, conjecture, label), where the premise is a candidate premise for the given conjecture and the label is 0 or 1 as in binary classification (1 means the premise is useful, 0 means it is useless); the CNF dataset constructed in this embodiment follows the same distribution as the MPTP dataset.
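Purely as an illustration of this sample layout (the field names, example formulas and label are assumptions, not entries from the actual dataset), one triple can be represented as follows:

```python
from dataclasses import dataclass

@dataclass
class PremiseSelectionSample:
    premise: str     # a first-order formula, e.g. in TPTP-style syntax
    conjecture: str  # the conjecture for which the premise is a candidate
    label: int       # 1 = premise is useful for proving the conjecture, 0 = useless

sample = PremiseSelectionSample(
    premise="! [X] : subset(X, X)",
    conjecture="! [A, B] : (A = B => subset(A, B))",
    label=1,
)
```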
(2) Model settings
According to the datasets, this embodiment converts the logical formulas into simplified logical formula graphs based on the deletion of repeated quantifiers and obtains the graph information of each simplified formula (node id, node name, parent node ids, child node ids). The formula graph is then fed into the graph neural network to obtain the formula graph feature vector. The graph neural network model is configured as follows:
In the graph neural network model of this embodiment, the initial one-hot vector of each node has dv dimensions. FV is an embedding network that embeds the dv-dimensional initial one-hot vector into the dh-dimensional initial node state vector. Fu, Fm and Fl share the same configuration: they are fully connected (FC) layers that map the concatenated node state vectors to dh-dimensional outputs. The configurations of Fsum, FU, FM and FL are similar to that of Fu, differing only in the input dimension. Fclass has two fully connected layers: the first is an FC layer with batch normalization (BN); the second is an FC layer of dimension 2 followed by softmax. Notably, dv is 793, corresponding to 793 node labels, among which variables are uniformly represented as "Var".
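A minimal sketch of how these components could be wired together end to end; apart from dv = 793, every dimension, the number of rounds K and all module names are assumptions, and AttentionTWLayer refers to the illustrative sketch in step (4) above.

```python
import torch
import torch.nn as nn

class AttentionTWGNN(nn.Module):
    """Illustrative assembly: embedding lookup F_V, K message-passing rounds, AvgPool readout."""

    def __init__(self, d_v: int = 793, d_h: int = 64, K: int = 2):
        super().__init__()
        self.F_V = nn.Embedding(d_v, d_h)   # d_v = 793 node labels, variables unified as "Var"
        # AttentionTWLayer: see the sketch after step (4) above.
        self.layers = nn.ModuleList([AttentionTWLayer(d_h) for _ in range(K)])

    def forward(self, node_labels, triples):
        h = self.F_V(node_labels)            # h_v^(0) from the lookup table
        for layer in self.layers:            # K rounds of term-walk message passing
            h = layer(h, triples)
        return h.mean(dim=0)                 # AvgPool readout -> h_G
```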
(3) Experimental settings
The model parameters of this embodiment are set as follows (a configuration sketch is given after the list):
(a) the model is trained with the default settings of the adaptive moment estimation (Adam) optimizer;
(b) the batch size is set to 32;
(c) the regularization parameter is set to 0.0001;
(d) the initial learning rate is set to 0.01;
(e) the model automatically adjusts the learning rate with the ReduceLROnPlateau strategy from the PyTorch library;
(f) the model is trained with the cross-entropy loss function.
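A minimal sketch of this training configuration; the model variable refers to the illustrative sketches above, and interpreting the regularization parameter as Adam weight decay is an assumption.

```python
import torch.nn as nn
import torch.optim as optim

model = AttentionTWGNN()                           # illustrative model from the sketches above
criterion = nn.CrossEntropyLoss()                  # (f) cross-entropy loss
optimizer = optim.Adam(model.parameters(),
                       lr=0.01,                    # (d) initial learning rate
                       weight_decay=1e-4)          # (c) regularization parameter (assumed as weight decay)
scheduler = optim.lr_scheduler.ReduceLROnPlateau(optimizer)  # (e) automatic learning-rate adjustment
batch_size = 32                                    # (b) batch size
```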
(4) Experimental results and analysis
This embodiment evaluates the Attention-TW-GNN-based premise selection model on the MPTP dataset and the CNF dataset and compares it with several mainstream methods. The following can be observed:
The premise selection methods based on graph neural network models all achieve fairly good classification accuracy, outperforming some mainstream graph neural network models. For example, the Accuracy of the GCN, GAT and SGC models on the MPTP dataset is 86.25%, 85.38% and 85.67% respectively. This shows that mainstream graph neural networks only capture the topological structure of the logical formula graph and cannot capture the deep information of the logical formulas.
Among the other baseline methods, the graph neural networks based on hand-designed features, PC-GCN and TW-GNN, are clearly superior to the mainstream graph neural networks. For example, the F1 scores of the PC-GCN and TW-GNN models on the CNF dataset are 83.98% and 83.72% respectively. A reasonable explanation is that, in addition to aggregating information from neighbouring nodes, these graph neural network models also aggregate information from more distant nodes.
The Attention-TW-GNN-based premise selection model of this embodiment exceeds the existing premise selection models based on graph neural networks in classification accuracy in the vast majority of cases. For example, on the MPTP dataset, Attention-TW-GNN improves over the mainstream graph neural networks by at least 2% and over the other baseline methods by 0.5%; on the CNF dataset, Attention-TW-GNN improves over the mainstream graph neural network models by 3%. This shows that a graph neural network adjusted with the attention mechanism can better represent the syntactic and semantic information of first-order logic formulas, and that the graph representation of first-order logic formulas also affects the classification performance of the model to a certain extent.
The invention and its embodiment have been described schematically above, and the description is not restrictive. What is shown in the drawing is only one embodiment of the invention, and the actual structure is not limited thereto. Therefore, if a person of ordinary skill in the art, inspired by the invention, devises structural arrangements and embodiments similar to this technical solution without creative effort and without departing from the spirit of the invention, they shall all fall within the protection scope of the invention.
Claims (4)
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202311490926.9A CN117808084A (en) | 2023-11-09 | 2023-11-09 | Pre-selection method based on graph reduction brief representation and graph neural network |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202311490926.9A CN117808084A (en) | 2023-11-09 | 2023-11-09 | Pre-selection method based on graph reduction brief representation and graph neural network |
Publications (1)
Publication Number | Publication Date |
---|---|
CN117808084A true CN117808084A (en) | 2024-04-02 |
Family
ID=90432464
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202311490926.9A Pending CN117808084A (en) | 2023-11-09 | 2023-11-09 | Pre-selection method based on graph reduction brief representation and graph neural network |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN117808084A (en) |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN118966273A (en) * | 2024-10-16 | 2024-11-15 | 西南交通大学 | A graph neural network and logic fusion method for premise selection |
Citations (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20210150373A1 (en) * | 2019-11-15 | 2021-05-20 | International Business Machines Corporation | Capturing the global structure of logical formulae with graph long short-term memory |
CN115204372A (en) * | 2022-07-20 | 2022-10-18 | 成都飞机工业(集团)有限责任公司 | Precondition selection method and system based on item walking graph neural network |
- 2023-11-09 CN CN202311490926.9A patent/CN117808084A/en active Pending
Patent Citations (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20210150373A1 (en) * | 2019-11-15 | 2021-05-20 | International Business Machines Corporation | Capturing the global structure of logical formulae with graph long short-term memory |
CN115204372A (en) * | 2022-07-20 | 2022-10-18 | 成都飞机工业(集团)有限责任公司 | Precondition selection method and system based on item walking graph neural network |
Non-Patent Citations (1)
Title |
---|
LAN Yongqi et al.: "面向前提选择的新型图约简表示与图神经网络模型" (A new graph reduction representation and graph neural network model for premise selection), Computer Science (《计算机科学》), 26 September 2023 (2023-09-26), pages 193-199 *
Cited By (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN118966273A (en) * | 2024-10-16 | 2024-11-15 | 西南交通大学 | A graph neural network and logic fusion method for premise selection |
CN118966273B (en) * | 2024-10-16 | 2025-01-24 | 西南交通大学 | A graph neural network and logic fusion method for premise selection |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
Wang et al. | Detecting code clones with graph neural network and flow-augmented abstract syntax tree | |
Yang et al. | Learn to explain efficiently via neural logic inductive learning | |
Chen et al. | Towards synthesizing complex programs from input-output examples | |
Wu et al. | Knowledge distillation improves graph structure augmentation for graph neural networks | |
CN112507699B (en) | Remote supervision relation extraction method based on graph convolution network | |
CN112257066A (en) | Malicious behavior identification method, system and storage medium for weighted heterogeneous graph | |
CN107292097B (en) | Chinese medicine principal symptom selection method based on feature group | |
CN113569906A (en) | Method and Device for Extracting Heterogeneous Graph Information Based on Meta-Path Subgraph | |
CN104036023B (en) | Method for creating context fusion tree video semantic indexes | |
CN109857457B (en) | A Method for Learning Function Hierarchical Embedding Representations in Source Codes in Hyperbolic Spaces | |
CN112749757B (en) | Thesis classification model construction method and system based on gating graph annotation force network | |
CN116521882A (en) | Domain Long Text Classification Method and System Based on Knowledge Graph | |
CN110826639A (en) | A zero-sample image classification method using full data training | |
Wang et al. | Detecting code clones with graph neural network and flow-augmented abstract syntax tree | |
CN115828143A (en) | A node classification method for path aggregation of heterogeneous graph elements based on graph convolution and self-attention mechanism | |
CN113988083B (en) | Factual information coding and evaluating method for generating shipping news abstract | |
CN117808084A (en) | Pre-selection method based on graph reduction brief representation and graph neural network | |
CN114897181A (en) | Meta-learning interpretation method based on causal relationship | |
CN117633811A (en) | A code vulnerability detection method based on multi-view feature fusion | |
CN110830291B (en) | A Node Classification Method for Heterogeneous Information Network Based on Meta-Path | |
Li et al. | Lexical attention and aspect-oriented graph convolutional networks for aspect-based sentiment analysis | |
CN115641599A (en) | Entity alignment method for customhouse import and export commodity knowledge map | |
CN115935367A (en) | Static source code vulnerability detection and positioning method based on graph neural network | |
CN114444515A (en) | A relation extraction method based on entity semantic fusion | |
CN115860122A (en) | A knowledge map multi-hop reasoning method based on multi-agent reinforcement learning |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination |