CN111026880A - Joint learning-based judicial knowledge graph construction method - Google Patents


Info

Publication number: CN111026880A (application CN201911254309.2A; granted as CN111026880B)
Authority: CN (China)
Prior art keywords: criminal, neural network, word, judgment, judicial
Legal status: Granted; currently active
Other languages: Chinese (zh)
Inventors: 孙媛媛, 陈彦光, 刘海顺
Original and current assignee: Dalian University of Technology
Application filed by Dalian University of Technology


Classifications

    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/367 Ontology (creation of semantic tools for information retrieval of unstructured textual data)
    • G06F16/355 Class or cluster creation or modification (clustering and classification of unstructured textual data)
    • G06N COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/045 Combinations of networks (neural network architectures)
    • G06Q INFORMATION AND COMMUNICATION TECHNOLOGY SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES
    • G06Q50/18 Legal services (ICT specially adapted for specific business sectors)


Abstract

The invention relates to a judicial knowledge graph construction method, in particular to one based on joint learning, comprising the following steps: (1) constructing the criminal-justice domain ontology, (2) building a Seq2Seq neural network model, (3) extracting crime-circumstance triples, and (4) storing them in a graph database. The method and its ontology format are feasible, clearly structured, and of reference value; it mines the correlation between crime circumstances and sentencing outcomes, can improve the accuracy of sentencing recommendation, and the judicial knowledge graph thus established can be applied to knowledge reasoning over judicial documents and to intelligent retrieval for judicial services.

Description

Joint learning-based judicial knowledge graph construction method
Technical Field
The invention relates to a judicial knowledge graph construction method, in particular to a judicial knowledge graph construction method based on joint learning.
Background
Knowledge graphs are effective tools for describing large numbers of entities, entity attributes, and the relations between entities. In recent years, with the development of the internet, knowledge graphs have attracted wide attention; compared with general-domain knowledge graphs, which have been analyzed and studied extensively in academia and industry, construction methods for vertical-domain knowledge graphs are relatively scarce. The basic unit of a knowledge graph is the entity-relation-entity triple or the entity-attribute-value triple, and entities connected to one another by relations form a net-like graph structure. Expressing knowledge in the form of a knowledge graph allows information that is otherwise hard to grasp to be mined, analyzed, and visualized, making it convenient for users to acquire and understand concepts and their relations. Knowledge graph construction methods are generally divided into bottom-up and top-down approaches. In the bottom-up approach, related entities, attributes, and the relations between entities are first extracted from massive text data to obtain knowledge elements; ambiguity between entities is then eliminated through processes such as entity linking and knowledge merging; finally, a top-level ontology schema is constructed automatically in a data-driven manner to obtain a structured knowledge representation, which is the key to the bottom-up construction method. Most current general-domain knowledge graphs are built bottom-up.
In the top-down approach, the ontology and data schema of the knowledge graph are defined first, and entities and other information are then filled into the knowledge base according to the defined schema. The top-down construction method is mainly used for vertical-domain knowledge graphs: domain knowledge is required to guide the definition of the ontology structure and to delimit the domain scope of the data to be collected, after which the graph is built through information extraction techniques such as named entity recognition and relation extraction. At present, although many construction methods exist for general-domain knowledge graphs, research on constructing knowledge graphs for the criminal-justice domain is still at an exploratory stage. Most existing information extraction methods for the judicial domain obtain the basic information of legal documents through hand-built rules and represent each section of a document structurally; they do not mine the case circumstances in finer detail and do not extract the correlation between crime circumstances and sentencing outcomes, and therefore cannot support applications such as sentencing recommendation and similar-case recommendation.
Disclosure of Invention
In order to overcome the defects of the prior art, the invention aims to provide a judicial knowledge graph construction method based on joint learning. Building on professional knowledge of the judicial field and the text of criminal judgments, the method uses a top-down construction scheme combined with mainstream neural-network deep learning and joint-learning algorithms to extract the triples contained in criminal judgment texts and stores them in a graph database.
In order to achieve the above purpose and solve the problems existing in the prior art, the invention adopts the following technical scheme: a judicial knowledge graph construction method based on joint learning, comprising the following steps:
step 1, constructing the criminal-justice domain ontology: define the domain ontology structure according to the specific contents of criminal judgment documents in drug-related cases, then extract the corresponding contents from the judgment texts and fill the ontology, specifically comprising the following substeps:
(a) defining the judicial case ontology structure according to judicial domain knowledge and the contents set forth in the criminal judgment texts of drug-related cases, wherein the defined ontology comprises 10 parts: document number, judgment title, place of judgment, time of judgment, public prosecution organ, defendant, crime type, crime circumstances, judgment result, and judgment basis;
(b) extracting each part of the information with manually constructed rules according to the defined judicial case ontology structure, supplementing and refining the existing rules by iterative evaluation so that they cover all the information in every criminal judgment document, and then using the rules to extract the contents of each part of the judgment;
(c) expressing the judicial case ontology in XML format, filling in the contents extracted from each part of the criminal judgment, and designing a reading module for the ontology structure so that it can be conveniently called and read later;
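The XML representation and reading module of substep (c) can be sketched as follows. This is a minimal illustration only: the patent does not publish its schema, so the element names for the 10 ontology parts and the function interfaces are assumptions.

```python
import xml.etree.ElementTree as ET

# Hypothetical tag names for the 10 parts of the judicial case ontology;
# the actual schema is not disclosed in the patent.
FIELDS = ["doc_number", "title", "judgment_place", "judgment_time",
          "prosecutor", "defendant", "crime_type", "crime_circumstances",
          "judgment_result", "judgment_basis"]

def build_case_xml(values):
    """Fill the contents extracted from one criminal judgment into the ontology."""
    root = ET.Element("judicial_case")
    for field in FIELDS:
        ET.SubElement(root, field).text = values.get(field, "")
    return root

def read_case_xml(root):
    """Reading module: return the filled ontology as a dict for later calls."""
    return {child.tag: child.text or "" for child in root}

case = build_case_xml({"crime_type": "drug trafficking",
                       "judgment_result": "fixed-term imprisonment of 3 years"})
print(read_case_xml(case)["crime_type"])  # drug trafficking
```

Keeping one XML file per judgment, with one element per ontology part, makes the later triple-extraction step a simple lookup rather than a re-parse of the raw text.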
step 2, building the Seq2Seq neural network model: determine the model structure and initialize its parameters, specifically comprising the following substeps:
(a) determining the overall structure of the Seq2Seq neural network model, wherein the model mainly comprises two parts, an encoding layer based on a convolutional neural network and a decoding layer based on a recurrent neural network, with a softmax classification layer placed after the decoding layer to obtain the final predicted tag sequence;
(b) building the encoding layer based on a convolutional neural network (CNN), wherein the encoding layer comprises two convolutional network structures that encode characters and words respectively, and the encoded text feature representation is obtained by vector splicing; character encoding by the CNN-based encoding layer is described by formulas (1)-(3):
v_j^c = conv(c_j)   (1)
w^c = reshape(v^c)   (2)
w_i = x_i ⊕ w_i^c   (3)
where conv() denotes the convolution operation, c_j is the initialization vector of the j-th character, v_j^c is the convolution result for the j-th character, reshape() converts the matrix shape, turning the character-level representation v^c into the word-level vector w^c, x_i is the initialization vector of the i-th word, w_i^c is the character feature vector obtained by convolving the characters of the i-th word, ⊕ denotes the vector splicing operation, and w_i is the encoding result of the i-th word obtained by encoding its characters;
word encoding by the CNN-based encoding layer is described by formulas (4)-(5):
h_i = conv(w_i)   (4)
h̃_i = w_i ⊕ h_i   (5)
where conv() denotes the convolution operation, w_i is the encoding result of the i-th word, h_i is the word feature vector obtained by convolving the word sequence, ⊕ denotes the vector splicing operation, and h̃_i is the feature vector of the i-th word produced by the CNN-based encoding layer;
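As a rough numerical sketch of formulas (1)-(5) under assumed dimensions (the patent gives none): each word's character matrix is convolved and collapsed to a fixed-size vector (max-pooling stands in here for the reshape() of formula (2), which is an assumption), spliced with the word's initialization vector, and the resulting word sequence is convolved again and spliced with itself.

```python
import numpy as np

rng = np.random.default_rng(0)
# Assumed dimensions; the patent does not give concrete sizes.
d_char, d_word, d_cconv, d_wconv, k = 8, 16, 8, 32, 3

def conv_same(x, W):
    """1-D 'same'-padded convolution; x: (T, d_in), W: (k, d_in, d_out)."""
    pad = W.shape[0] // 2
    xp = np.pad(x, ((pad, pad), (0, 0)))
    return np.stack([np.tensordot(xp[t:t + W.shape[0]], W, axes=([0, 1], [0, 1]))
                     for t in range(x.shape[0])])

Wc = rng.normal(size=(k, d_char, d_cconv))            # character-level kernel, eq. (1)
Ww = rng.normal(size=(k, d_word + d_cconv, d_wconv))  # word-level kernel, eq. (4)

def encode_sentence(char_mats, word_vecs):
    # eqs. (1)-(3): convolve each word's characters, pool to a fixed vector,
    # then splice with the word's own initialization vector
    w = np.stack([np.concatenate([wv, conv_same(cm, Wc).max(axis=0)])
                  for cm, wv in zip(char_mats, word_vecs)])
    h = conv_same(w, Ww)                   # eq. (4): convolve the word sequence
    return np.concatenate([w, h], axis=1)  # eq. (5): splice w_i with h_i

T = 5  # a toy sentence of five words with 2-5 characters each
chars = [rng.normal(size=(int(rng.integers(2, 6)), d_char)) for _ in range(T)]
words = [rng.normal(size=d_word) for _ in range(T)]
feats = encode_sentence(chars, words)  # shape (5, d_word + d_cconv + d_wconv)
```

The spliced output dimension is d_word + d_cconv + d_wconv per word, which is what the decoding layer of substep (c) consumes.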
(c) building the decoding layer based on a recurrent neural network (RNN), wherein the decoding layer uses a unidirectional long short-term memory network (LSTM): the feature vector obtained from the CNN-based encoding layer in substep (b) is taken as input, and an output feature vector is obtained by LSTM decoding, as described by formula (6):
s_i = LSTM(h̃_i)   (6)
where LSTM() denotes computation by the unidirectional long short-term memory network, h̃_i is the feature vector of the i-th word after the CNN-based encoding layer, and s_i is the feature vector of the i-th word after the RNN-based decoding layer;
(d) normalizing the feature vector obtained from the RNN-based decoding layer by a linear mapping operation followed by the softmax function, as described by formula (7):
y_i = Softmax(W·s_i)   (7)
where Softmax() denotes the softmax function, W is the parameter matrix of the linear mapping, s_i is the feature vector of the i-th word after the RNN-based decoding layer, and y_i is the output vector of the i-th word from the Seq2Seq model, each value of which is the probability that the word belongs to the corresponding tag; performing an argmax operation on each output vector y_i yields the final predicted tag sequence;
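The classification step of substep (d), formula (7), can be sketched as follows. The decoder outputs are random stand-ins and the dimensions are assumptions; the point is the linear map, row-wise softmax, and argmax over tags.

```python
import numpy as np

rng = np.random.default_rng(1)
T, d, n_tags = 5, 56, 9  # assumed sizes; n_tags covers the joint entity/relation tags

def softmax(z):
    """Numerically stable row-wise softmax."""
    e = np.exp(z - z.max(axis=-1, keepdims=True))
    return e / e.sum(axis=-1, keepdims=True)

s = rng.normal(size=(T, d))       # decoder outputs s_i from the LSTM layer
W = rng.normal(size=(d, n_tags))  # linear-mapping parameter matrix of eq. (7)
y = softmax(s @ W)                # per-word probability distribution over tags
pred = y.argmax(axis=1)           # final predicted tag sequence
```

Each row of y sums to 1, so each component can be read directly as the probability that the word carries the corresponding tag.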
step 3, extracting crime-circumstance triples: train the Seq2Seq neural network model on the criminal judgment texts extracted in step 1 and extract the crime-circumstance triples, specifically comprising the following substeps:
(a) collecting and annotating the crime-circumstance text: label the entities and the relations between them according to the relation types between the persons and items involved in the case, thereby building the data set required for the experiments, and split the data set into training, validation, and test sets at a ratio of 6:2:2;
(b) preprocessing the crime-circumstance text into the vector representation required by the neural network model: characters are represented by randomly initialized vectors, while words are represented by word2vec word vectors pre-trained on criminal judgment texts; following the joint-learning idea, the tagging strategy lets each tag carry both entity and relation-type information, which avoids identifying redundant entities;
(c) using the data set split in substep (a) of step 3, training the Seq2Seq neural network model built in step 2 by supervised learning and using the trained model to predict tag sequences; for elements whose predicted tag marks an entity, the natural-language form is restored by looking up the vocabulary by index, the relation type is determined from the tag information, and finally the crime circumstances and judgment results in the criminal judgment text are extracted in triple form;
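The joint tagging strategy of substep (b) and the triple recovery of substep (c) can be illustrated with a toy decoder. The tag format "B-Traffic-1" (span position, relation type, argument role) follows a common joint-extraction scheme; the patent does not list its actual tag inventory, so these tags and the relation name are assumptions.

```python
def decode_triples(tokens, tags):
    """Collect (head, relation, tail) triples from a jointly tagged sequence.

    Each non-O tag carries both entity and relation information, e.g.
    "B-Traffic-1": span begin, relation type "Traffic", first argument.
    """
    spans = {}
    for tok, tag in zip(tokens, tags):
        if tag == "O":
            continue
        _, rel, role = tag.split("-")
        spans.setdefault((rel, role), []).append(tok)
    triples = []
    for rel in {r for r, _ in spans}:
        head, tail = spans.get((rel, "1")), spans.get((rel, "2"))
        if head and tail:
            triples.append(("".join(head), rel, "".join(tail)))
    return triples

tokens = ["张", "某", "贩", "卖", "甲基苯丙胺"]
tags   = ["B-Traffic-1", "I-Traffic-1", "O", "O", "B-Traffic-2"]
print(decode_triples(tokens, tags))  # [('张某', 'Traffic', '甲基苯丙胺')]
```

Because each tag already names the relation type, no separate relation classifier is needed, which is exactly the redundancy the joint tagging strategy avoids.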
step 4, storing the crime-circumstance and judgment-result triples in the graph database Neo4j, specifically comprising the following substeps:
(a) reading the information stored in the criminal-justice ontology library and, for each crime circumstance extracted as a triple in step 3, retrieving the judgment result of the same case;
(b) preprocessing the judgment result of the current criminal suspect and dividing it into two parts of judgment content: first, the custodial penalty, i.e., criminal detention, fixed-term imprisonment, life imprisonment, or the death penalty, where a concrete term written in Chinese characters is converted into Arabic numerals expressed as years, months, and days; second, the fine, where the amount written in Chinese characters is converted into Arabic numerals in units of RMB yuan;
(c) processing the two parts of judgment content into triple form and associating them with the crime-circumstance triples through the criminal suspect; the crime-circumstance and judgment-result triples are stored in the graph database Neo4j, where for storage efficiency the triples are first written to csv format and then imported into the graph database, forming a judicial knowledge graph of drug-related cases in the criminal-justice field.
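The csv-then-import route of substep (c) might look like the sketch below. The example triples, file name, and node/relationship labels are illustrative assumptions; the Cypher string shows one standard LOAD CSV pattern for bulk import into Neo4j, not necessarily the statement the inventors used.

```python
import csv
import io

# Hypothetical crime-circumstance and judgment-result triples,
# both keyed on the same criminal suspect so the graph links them.
triples = [
    ("张某", "TRAFFICS", "甲基苯丙胺"),
    ("张某", "SENTENCED_TO", "有期徒刑3年"),
]

buf = io.StringIO()
writer = csv.writer(buf)
writer.writerow(["head", "relation", "tail"])
writer.writerows(triples)
csv_text = buf.getvalue()  # contents of a triples.csv ready for import

# A matching Cypher bulk-load statement (one generic relationship type shown;
# in practice one file per relation type keeps the import simpler):
CYPHER = """
LOAD CSV WITH HEADERS FROM 'file:///triples.csv' AS row
MERGE (h:Entity {name: row.head})
MERGE (t:Entity {name: row.tail})
MERGE (h)-[:REL {type: row.relation}]->(t)
"""
```

Writing the csv first and letting Neo4j stream it in is much faster than issuing one CREATE statement per triple, which is presumably the storage-efficiency consideration the text mentions.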
The invention has the following beneficial effects. The joint-learning-based judicial knowledge graph construction method comprises: (1) constructing the criminal-justice domain ontology, (2) building a Seq2Seq neural network model, (3) extracting crime-circumstance triples, and (4) storing them in a graph database. The method and its ontology format are feasible, clearly structured, and of reference value; it mines the correlation between crime circumstances and sentencing outcomes, can improve the accuracy of sentencing recommendation, and the judicial knowledge graph thus established can support judicial-service applications such as intelligent document retrieval and classified recommendation.
Drawings
FIG. 1 is a flow chart of the method steps of the present invention.
FIG. 2 is a diagram of a Seq2Seq neural network model in accordance with the present invention.
Detailed Description
The invention will be further explained with reference to the drawings.
A judicial knowledge graph construction method based on joint learning comprises the following steps:
step 1, constructing the criminal-justice domain ontology: define the domain ontology structure according to the specific contents of criminal judgment documents in drug-related cases, then extract the corresponding contents from the judgment texts and fill the ontology, specifically comprising the following substeps:
(a) defining the judicial case ontology structure according to judicial domain knowledge and the contents set forth in the criminal judgment texts of drug-related cases, wherein the defined ontology comprises 10 parts: document number, judgment title, place of judgment, time of judgment, public prosecution organ, defendant, crime type, crime circumstances, judgment result, and judgment basis;
(b) extracting each part of the information with manually constructed rules according to the defined judicial case ontology structure, supplementing and refining the existing rules by iterative evaluation so that they cover all the information in every criminal judgment document, and then using the rules to extract the contents of each part of the judgment;
(c) expressing the judicial case ontology in XML format, filling in the contents extracted from each part of the criminal judgment, and designing a reading module for the ontology structure so that it can be conveniently called and read later;
step 2, building the Seq2Seq neural network model: determine the model structure and initialize its parameters, specifically comprising the following substeps:
(a) determining the overall structure of the Seq2Seq neural network model, wherein the model mainly comprises two parts, an encoding layer based on a convolutional neural network and a decoding layer based on a recurrent neural network, with a softmax classification layer placed after the decoding layer to obtain the final predicted tag sequence, as shown in fig. 2;
(b) building the encoding layer based on a convolutional neural network (CNN), wherein the encoding layer comprises two convolutional network structures that encode characters and words respectively, and the encoded text feature representation is obtained by vector splicing; character encoding by the CNN-based encoding layer is described by formulas (1)-(3):
v_j^c = conv(c_j)   (1)
w^c = reshape(v^c)   (2)
w_i = x_i ⊕ w_i^c   (3)
where conv() denotes the convolution operation, c_j is the initialization vector of the j-th character, v_j^c is the convolution result for the j-th character, reshape() converts the matrix shape, turning the character-level representation v^c into the word-level vector w^c, x_i is the initialization vector of the i-th word, w_i^c is the character feature vector obtained by convolving the characters of the i-th word, ⊕ denotes the vector splicing operation, and w_i is the encoding result of the i-th word obtained by encoding its characters;
word encoding by the CNN-based encoding layer is described by formulas (4)-(5):
h_i = conv(w_i)   (4)
h̃_i = w_i ⊕ h_i   (5)
where conv() denotes the convolution operation, w_i is the encoding result of the i-th word, h_i is the word feature vector obtained by convolving the word sequence, ⊕ denotes the vector splicing operation, and h̃_i is the feature vector of the i-th word produced by the CNN-based encoding layer;
(c) building the decoding layer based on a recurrent neural network (RNN), wherein the decoding layer uses a unidirectional long short-term memory network (LSTM): the feature vector obtained from the CNN-based encoding layer in substep (b) is taken as input, and an output feature vector is obtained by LSTM decoding, as described by formula (6):
s_i = LSTM(h̃_i)   (6)
where LSTM() denotes computation by the unidirectional long short-term memory network, h̃_i is the feature vector of the i-th word after the CNN-based encoding layer, and s_i is the feature vector of the i-th word after the RNN-based decoding layer;
(d) normalizing the feature vector obtained from the RNN-based decoding layer by a linear mapping operation followed by the softmax function, as described by formula (7):
y_i = Softmax(W·s_i)   (7)
where Softmax() denotes the softmax function, W is the parameter matrix of the linear mapping, s_i is the feature vector of the i-th word after the RNN-based decoding layer, and y_i is the output vector of the i-th word from the Seq2Seq model, each value of which is the probability that the word belongs to the corresponding tag; performing an argmax operation on each output vector y_i yields the final predicted tag sequence;
step 3, extracting crime-circumstance triples: train the Seq2Seq neural network model on the criminal judgment texts extracted in step 1 and extract the crime-circumstance triples, specifically comprising the following substeps:
(a) collecting and annotating the crime-circumstance text: label the entities and the relations between them according to the relation types between the persons and items involved in the case, thereby building the data set required for the experiments, and split the data set into training, validation, and test sets at a ratio of 6:2:2;
(b) preprocessing the crime-circumstance text into the vector representation required by the neural network model: characters are represented by randomly initialized vectors, while words are represented by word2vec word vectors pre-trained on criminal judgment texts; following the joint-learning idea, the tagging strategy lets each tag carry both entity and relation-type information, which avoids identifying redundant entities;
(c) using the data set split in substep (a) of step 3, training the Seq2Seq neural network model built in step 2 by supervised learning and using the trained model to predict tag sequences; for elements whose predicted tag marks an entity, the natural-language form is restored by looking up the vocabulary by index, the relation type is determined from the tag information, and finally the crime circumstances and judgment results in the criminal judgment text are extracted in triple form;
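The 6:2:2 partition described in substep (a) above amounts to a shuffle followed by two cut points, for example:

```python
import random

def split_622(samples, seed=42):
    """Shuffle and split a data set into train/validation/test at 6:2:2."""
    data = list(samples)
    random.Random(seed).shuffle(data)  # fixed seed keeps the split reproducible
    n = len(data)
    a, b = int(0.6 * n), int(0.8 * n)
    return data[:a], data[a:b], data[b:]

train, dev, test = split_622(range(100))  # 60 / 20 / 20 samples
```

Fixing the random seed matters here: the same split must be reused across training runs so that validation and test scores stay comparable.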
step 4, storing the crime-circumstance and judgment-result triples in the graph database Neo4j, specifically comprising the following substeps:
(a) reading the information stored in the criminal-justice ontology library and, for each crime circumstance extracted as a triple in step 3, retrieving the judgment result of the same case;
(b) preprocessing the judgment result of the current criminal suspect and dividing it into two parts of judgment content: first, the custodial penalty, i.e., criminal detention, fixed-term imprisonment, life imprisonment, or the death penalty, where a concrete term written in Chinese characters is converted into Arabic numerals expressed as years, months, and days; second, the fine, where the amount written in Chinese characters is converted into Arabic numerals in units of RMB yuan;
(c) processing the two parts of judgment content into triple form and associating them with the crime-circumstance triples through the criminal suspect; the crime-circumstance and judgment-result triples are stored in the graph database Neo4j, where for storage efficiency the triples are first written to csv format and then imported into the graph database, forming a judicial knowledge graph of drug-related cases in the criminal-justice field.
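The numeral normalization of substep (b), converting sentence terms written in Chinese characters into Arabic year/month/day numbers, might look like this minimal sketch. Only small numerals (0-99) and the three term units are handled; the rule set is an assumption for illustration, not the patent's actual implementation.

```python
import re

# Minimal Chinese-numeral table; a production rule set would be larger.
DIGITS = {"零": 0, "一": 1, "二": 2, "三": 3, "四": 4, "五": 5,
          "六": 6, "七": 7, "八": 8, "九": 9}

def cn_to_int(s):
    """Convert a small Chinese numeral (0-99) to an int, e.g. '二十三' -> 23."""
    if "十" in s:
        tens, _, ones = s.partition("十")
        return (DIGITS[tens] if tens else 1) * 10 + (DIGITS[ones] if ones else 0)
    return DIGITS[s]

def parse_term(text):
    """Extract a sentence term: '有期徒刑三年六个月' -> years 3, months 6."""
    out = {"years": 0, "months": 0, "days": 0}
    units = {"年": "years", "个月": "months", "日": "days"}
    for num, unit in re.findall(r"([零一二三四五六七八九十]+)(年|个月|日)", text):
        out[units[unit]] = cn_to_int(num)
    return out

print(parse_term("有期徒刑三年六个月"))  # {'years': 3, 'months': 6, 'days': 0}
```

The same digit table can be reused for the fine amounts of substep (b), with units of 万 and 元 instead of term units.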

Claims (1)

1. A judicial knowledge graph construction method based on joint learning is characterized by comprising the following steps:
step 1, constructing the criminal-justice domain ontology: define the domain ontology structure according to the specific contents of criminal judgment documents in drug-related cases, then extract the corresponding contents from the judgment texts and fill the ontology, specifically comprising the following substeps:
(a) defining the judicial case ontology structure according to judicial domain knowledge and the contents set forth in the criminal judgment texts of drug-related cases, wherein the defined ontology comprises 10 parts: document number, judgment title, place of judgment, time of judgment, public prosecution organ, defendant, crime type, crime circumstances, judgment result, and judgment basis;
(b) extracting each part of the information with manually constructed rules according to the defined judicial case ontology structure, supplementing and refining the existing rules by iterative evaluation so that they cover all the information in every criminal judgment document, and then using the rules to extract the contents of each part of the judgment;
(c) expressing the judicial case ontology in XML format, filling in the contents extracted from each part of the criminal judgment, and designing a reading module for the ontology structure so that it can be conveniently called and read later;
step 2, building a Seq2Seq neural network model, determining a model structure of the Seq2Seq neural network model, and initializing each parameter of the Seq2Seq neural network model, specifically comprising the following substeps:
(a) determining the overall structure of a Seq2Seq neural network model, wherein the model mainly comprises two parts, namely an encoding layer based on a convolutional neural network and a decoding layer based on a cyclic neural network, and a softmax classification layer is arranged behind the decoding layer based on the cyclic neural network to obtain the prediction result of a final label sequence;
(b) build the encoding layer based on a convolutional neural network (CNN); this layer contains two convolutional structures that encode characters and words respectively, and the encoded text feature representation is obtained by vector concatenation; the character-encoding process of the CNN-based encoding layer is described by formulas (1)-(3):

v_c^j = conv(c_j)  (1)

w_c = reshape(v_c)  (2)

w_i = e_i ⊕ w_c^i  (3)

where conv() denotes the convolution operation, c_j is the initialization vector of the j-th character, v_c^j is the convolution result of the j-th character, reshape() converts the matrix shape, turning the character-level representation v_c into the word-level vector w_c, e_i is the initialization vector of the i-th word, w_c^i is the character feature vector obtained by convolving the characters of the i-th word, ⊕ denotes vector concatenation, and w_i is the character-enriched encoding result of the i-th word;
the word-encoding process of the CNN-based encoding layer is described by formulas (4)-(5):

h_i = conv(w_i)  (4)

h̃_i = w_i ⊕ h_i  (5)

where conv() denotes the convolution operation, w_i is the character-enriched encoding of the i-th word, h_i is the word feature vector obtained by convolving the word sequence, ⊕ denotes vector concatenation, and h̃_i is the feature vector of the i-th word produced by the CNN-based encoding layer;
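The computation of formulas (1)-(5) can be sketched numerically. This toy version makes two simplifying assumptions: conv() is stood in for by a window-size-1 linear filter, and reshape() is a flatten; all dimensions and weights are illustrative:

```python
import numpy as np

rng = np.random.default_rng(0)
CHAR_DIM, WORD_DIM, FEAT = 8, 16, 8

def conv(x, W):
    """Toy stand-in for conv(): a per-position linear filter (window size 1)."""
    return x @ W

# Formulas (1)-(3): encode the characters of one word, reshape to a word-level
# char vector, then splice with the word's own initialization vector.
Wc = rng.normal(size=(CHAR_DIM, FEAT))
chars = rng.normal(size=(4, CHAR_DIM))   # c_j: init vectors of 4 characters
v_c = conv(chars, Wc)                    # (1) per-character convolution result
w_c = v_c.reshape(-1)                    # (2) reshape to one word-level vector
e_i = rng.normal(size=WORD_DIM)          # initialization vector of the word
w_i = np.concatenate([e_i, w_c])         # (3) spliced word representation

# Formulas (4)-(5): word-level convolution, spliced again with w_i.
Ww = rng.normal(size=(w_i.size, FEAT))
h_i = conv(w_i[None, :], Ww)[0]          # (4) word feature vector
h_tilde = np.concatenate([w_i, h_i])     # (5) encoder output for this word
print(h_tilde.shape)
```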
(c) build the decoding layer based on a recurrent neural network (RNN); the decoding layer uses a unidirectional long short-term memory network (LSTM), takes as input the feature vectors produced by the CNN-based encoding layer in substep (b), and obtains the output feature vectors through LSTM decoding, as described by formula (6):

s_i = LSTM(h̃_i)  (6)

where LSTM() denotes computation by the unidirectional long short-term memory network, h̃_i is the feature vector of the i-th word from the CNN-based encoding layer, and s_i is the feature vector of the i-th word from the RNN-based decoding layer;
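One step of the unidirectional LSTM in formula (6) can be sketched from the standard gate equations; the dimensions and random weights below are illustrative assumptions:

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def lstm_step(x, h, c, W, U, b):
    """One unidirectional LSTM step; gates stacked as [input, forget, cell, output]."""
    d = h.size
    z = W @ x + U @ h + b
    i, f = sigmoid(z[:d]), sigmoid(z[d:2 * d])
    g, o = np.tanh(z[2 * d:3 * d]), sigmoid(z[3 * d:])
    c_new = f * c + i * g          # updated cell state
    h_new = o * np.tanh(c_new)     # emitted hidden state (the decoder feature vector)
    return h_new, c_new

rng = np.random.default_rng(1)
IN, HID = 6, 4                     # toy dimensions
W = rng.normal(size=(4 * HID, IN))
U = rng.normal(size=(4 * HID, HID))
b = np.zeros(4 * HID)
h, c = np.zeros(HID), np.zeros(HID)
for x in rng.normal(size=(3, IN)):  # run over 3 encoder feature vectors
    h, c = lstm_step(x, h, c, W, U, b)
print(h.shape)
```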
(d) normalize the feature vectors produced by the RNN-based decoding layer through a linear mapping followed by the softmax function, as described by formula (7):

y_i = Softmax(W·s_i)  (7)

where Softmax() denotes the softmax function, W is the parameter matrix of the linear mapping, s_i is the feature vector of the i-th word from the RNN-based decoding layer, and y_i is the output vector of the i-th word from the Seq2Seq neural network model; each value of y_i is the probability that the word belongs to the corresponding label, and applying an argmax operation to each output vector y_i yields the final predicted label sequence;
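Formula (7) plus the argmax readout can be sketched as follows; the tag set, dimensions, and random weights are illustrative assumptions:

```python
import numpy as np

def softmax(z):
    z = z - z.max()          # shift for numerical stability
    e = np.exp(z)
    return e / e.sum()

def predict_tags(states, W, tags):
    """Formula (7) per word: linear map + softmax, then argmax to a tag."""
    return [tags[int(np.argmax(softmax(W @ s)))] for s in states]

rng = np.random.default_rng(2)
TAGS = ["O", "B-SUSPECT", "B-DRUG"]   # illustrative tag set
W = rng.normal(size=(len(TAGS), 4))   # linear-mapping parameter matrix
states = rng.normal(size=(5, 4))      # decoder feature vectors for 5 words
print(predict_tags(states, W, TAGS))
```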
step 3, extract crime-episode triples: apply the Seq2Seq neural network model to the criminal judgment texts extracted in step 1 to extract crime-episode triples, specifically comprising the following substeps:
(a) collect and annotate crime-episode texts: label the entities and the relations between them according to the relation types between the persons and items involved in the crime episode, building the data set required for the experiment; split the data set into training, validation, and test sets at a ratio of 6:2:2;
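The 6:2:2 split can be sketched as:

```python
import random

def split_622(samples, seed=42):
    """Shuffle and split a data set into train/validation/test at a 6:2:2 ratio."""
    data = list(samples)
    random.Random(seed).shuffle(data)
    a, b = int(len(data) * 0.6), int(len(data) * 0.8)
    return data[:a], data[a:b], data[b:]

train, dev, test = split_622(range(100))
print(len(train), len(dev), len(test))  # → 60 20 20
```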
(b) preprocess the crime-episode text data into the vector representations the neural network model requires: characters are represented by randomly initialized vectors, and words by word vectors pre-trained with word2vec on criminal judgment texts; the joint-learning idea is incorporated into the tagging strategy so that each label carries both entity and relation-type information, preventing redundant entities from being recognized;
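One possible shape for such joint labels packs boundary position, relation type, and entity role into a single tag, so one sequence-labeling pass yields both entities and relations. The "B-RELATION-role" format and the POSSESS relation below are illustrative assumptions, not the patent's exact label inventory:

```python
# Joint-learning tags: each non-O tag packs boundary position, relation type,
# and entity role. The "B-RELATION-role" format is an illustrative assumption.

def make_tag(position, relation, role):
    return f"{position}-{relation}-{role}"

words = ["张某", "非法持有", "毒品"]
tags = [make_tag("B", "POSSESS", "1"),  # head entity of a POSSESS relation
        "O",                            # not part of any entity
        make_tag("B", "POSSESS", "2")]  # tail entity of the same relation
print(list(zip(words, tags)))
```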
(c) using the data set split in substep (a) of step 3, train the Seq2Seq neural network model built in step 2 by supervised learning and predict label sequences with the trained model; for elements whose predicted label marks an entity, restore the natural-language form of the prediction by index lookup in the vocabulary, determine the relation type from the label information, and finally extract the crime episodes and judgment results in the criminal judgment text in triple form;
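Restoring triples from a predicted joint tag sequence can be sketched as follows; the "B-RELATION-role" tag format is an illustrative assumption:

```python
def decode_triples(words, tags):
    """Group entity words by relation type; pair role 1 (head) with role 2 (tail).
    Assumes the illustrative "B-RELATION-role" joint tag format."""
    by_relation = {}
    for word, tag in zip(words, tags):
        if tag == "O":
            continue
        _, relation, role = tag.split("-")
        by_relation.setdefault(relation, {})[role] = word
    return [(ents["1"], rel, ents["2"])
            for rel, ents in by_relation.items()
            if "1" in ents and "2" in ents]

words = ["张某", "非法持有", "毒品"]
tags = ["B-POSSESS-1", "O", "B-POSSESS-2"]
print(decode_triples(words, tags))  # → [('张某', 'POSSESS', '毒品')]
```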
step 4, store the crime-episode and judgment-result triples in the graph database Neo4j, specifically comprising the following substeps:
(a) read the information stored in the criminal-justice-domain ontology base and extract the judgment result of the case corresponding to each crime episode extracted by the triple extraction in step 3;
(b) preprocess the judgment result for the current criminal suspect, dividing it into two parts of judgment content: first, penalties involving a prison term, namely criminal detention, fixed-term imprisonment, life imprisonment, and the death penalty, where a specific term length written in Chinese numerals is converted to Arabic numerals expressed as years, months, and days; second, penalties involving a fine, where the amount written in Chinese numerals is converted to Arabic numerals in units of RMB yuan;
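The numeral conversion can be sketched with a small converter; it covers common simple forms only (variants such as 两 or compound 万-combinations are out of scope), and the sample strings are illustrative:

```python
import re

CN_DIGITS = {"零": 0, "一": 1, "二": 2, "三": 3, "四": 4,
             "五": 5, "六": 6, "七": 7, "八": 8, "九": 9}
CN_UNITS = {"十": 10, "百": 100, "千": 1000, "万": 10000}

def cn2num(s):
    """Convert a simple Chinese numeral (e.g. 三千五百) to an Arabic integer."""
    total, num = 0, 0
    for ch in s:
        if ch in CN_DIGITS:
            num = CN_DIGITS[ch]
        elif ch in CN_UNITS:
            total += (num or 1) * CN_UNITS[ch]
            num = 0
    return total + num

def parse_term(s):
    """'有期徒刑三年六个月' -> (years, months, days) as Arabic numbers."""
    parts = [re.search(f"([零一二三四五六七八九十]+){unit}", s)
             for unit in ("年", "个月", "日")]
    return tuple(cn2num(m.group(1)) if m else 0 for m in parts)

print(parse_term("有期徒刑三年六个月"))  # → (3, 6, 0)
print(cn2num("三千五百"))                # fine amount in yuan → 3500
```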
(c) process the two parts of judgment content into triple form, and associate each judgment triple with its crime-episode triple through the criminal suspect; store the crime-episode and judgment-result triples in the graph database Neo4j: for import efficiency, first convert the triples to csv format, then import them into the graph database to form a judicial knowledge graph of drug-related cases in the criminal justice field.
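Writing the triples to csv for bulk import into Neo4j can be sketched as follows; the column names, node label, and the Cypher shown in comments are illustrative assumptions:

```python
import csv
import io

def triples_to_csv(triples):
    """Serialize (head, relation, tail) triples into the csv used for bulk import.
    Column names are an assumption; the import query maps them explicitly."""
    buf = io.StringIO()
    writer = csv.writer(buf)
    writer.writerow(["head", "relation", "tail"])
    writer.writerows(triples)
    return buf.getvalue()

csv_text = triples_to_csv([
    ("张某", "POSSESS", "毒品"),
    ("张某", "SENTENCED_TO", "有期徒刑3年"),
])
print(csv_text)

# An illustrative Cypher import (Neo4j LOAD CSV):
# LOAD CSV WITH HEADERS FROM 'file:///triples.csv' AS row
# MERGE (h:Entity {name: row.head})
# MERGE (t:Entity {name: row.tail})
# MERGE (h)-[:REL {type: row.relation}]->(t);
```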
CN201911254309.2A 2019-12-08 2019-12-08 Joint learning-based judicial knowledge graph construction method Active CN111026880B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201911254309.2A CN111026880B (en) 2019-12-08 2019-12-08 Joint learning-based judicial knowledge graph construction method

Publications (2)

Publication Number Publication Date
CN111026880A true CN111026880A (en) 2020-04-17
CN111026880B CN111026880B (en) 2021-06-08

Family

ID=70208228

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201911254309.2A Active CN111026880B (en) 2019-12-08 2019-12-08 Joint learning-based judicial knowledge graph construction method

Country Status (1)

Country Link
CN (1) CN111026880B (en)

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20190332985A1 (en) * 2017-09-22 2019-10-31 1Nteger, Llc Systems and methods for investigating and evaluating financial crime and sanctions-related risks
CN107908671A (en) * 2017-10-25 2018-04-13 南京擎盾信息科技有限公司 Knowledge graph construction method and system based on legal data
CN108875051A (en) * 2018-06-28 2018-11-23 中译语通科技股份有限公司 Automatic knowledge graph construction method and system for massive unstructured text
CN109977237A (en) * 2019-05-27 2019-07-05 南京擎盾信息科技有限公司 Dynamic legal event graph construction method for the legal field

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
GUODONG LI et al.: "Combining Domain Knowledge Extraction with Graph Long Short-Term Memory for Learning Classification of Chinese Legal Documents", IEEE *
CHEN Yanguang et al.: "Knowledge graph construction technology based on criminal cases", Journal of Zhengzhou University (Natural Science Edition) *

Cited By (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111597352A (en) * 2020-05-18 2020-08-28 中国人民解放军国防科技大学 Network space knowledge graph reasoning method and device combining ontology concept and example
CN111597352B (en) * 2020-05-18 2023-05-09 中国人民解放军国防科技大学 Network space knowledge graph reasoning method and device combining ontology concepts and instances
CN112417155A (en) * 2020-11-27 2021-02-26 浙江大学 Court trial query generation method, device and medium based on pointer-generation Seq2Seq model
CN112632223A (en) * 2020-12-29 2021-04-09 天津汇智星源信息技术有限公司 Case and event knowledge graph construction method and related equipment
CN112632223B (en) * 2020-12-29 2023-01-20 天津汇智星源信息技术有限公司 Case and event knowledge graph construction method and related equipment
CN113239208A (en) * 2021-05-06 2021-08-10 广东博维创远科技有限公司 Mark training model based on knowledge graph

Also Published As

Publication number Publication date
CN111026880B (en) 2021-06-08

Similar Documents

Publication Publication Date Title
WO2021147726A1 (en) Information extraction method and apparatus, electronic device and storage medium
CN111026880B (en) Joint learning-based judicial knowledge graph construction method
CN111783394B (en) Training method of event extraction model, event extraction method, system and equipment
CN108182295A Enterprise knowledge graph attribute extraction method and system
CN107871158A Knowledge graph representation learning method and device incorporating sequential text information
CN113722490B (en) Visual rich document information extraction method based on key value matching relation
CN112749562A (en) Named entity identification method, device, storage medium and electronic equipment
CN112464669B (en) Stock entity word disambiguation method, computer device, and storage medium
CN113191148A (en) Rail transit entity identification method based on semi-supervised learning and clustering
CN112182248A (en) Statistical method for key policy of electricity price
CN116383399A (en) Event public opinion risk prediction method and system
CN113742733A (en) Reading understanding vulnerability event trigger word extraction and vulnerability type identification method and device
CN115292568B (en) Civil news event extraction method based on joint model
CN115292490A (en) Analysis algorithm for policy interpretation semantics
CN116245107A (en) Electric power audit text entity identification method, device, equipment and storage medium
CN115934883A (en) Entity relation joint extraction method based on semantic enhancement and multi-feature fusion
CN113505222A (en) Government affair text classification method and system based on text circulation neural network
CN116629258B (en) Structured analysis method and system for judicial document based on complex information item data
CN117033626A (en) Text auditing method, device, equipment and storage medium
CN117216617A (en) Text classification model training method, device, computer equipment and storage medium
CN115964497A (en) Event extraction method integrating attention mechanism and convolutional neural network
Mars et al. Combination of DE-GAN with CNN-LSTM for Arabic OCR on Images with Colorful Backgrounds
CN115204140A (en) Legal provision prediction method based on attention mechanism and knowledge graph
CN114911940A (en) Text emotion recognition method and device, electronic equipment and storage medium
CN115309862A (en) Causal relationship identification method and device based on graph convolution network and contrast learning

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
GR01 Patent grant