CN111639494A - Case affair relation determining method and system - Google Patents

Case affair relation determining method and system Download PDF

Info

Publication number
CN111639494A
CN111639494A CN202010312498.0A CN202010312498A CN111639494A CN 111639494 A CN111639494 A CN 111639494A CN 202010312498 A CN202010312498 A CN 202010312498A CN 111639494 A CN111639494 A CN 111639494A
Authority
CN
China
Prior art keywords
event
case
word segmentation
determining
neural network
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202010312498.0A
Other languages
Chinese (zh)
Inventor
张世琨
李皓辰
乔子乐
李伟平
赵文
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Peking University
Original Assignee
Peking University
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Peking University filed Critical Peking University
Priority to CN202010312498.0A priority Critical patent/CN111639494A/en
Publication of CN111639494A publication Critical patent/CN111639494A/en
Pending legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/20Natural language analysis
    • G06F40/279Recognition of textual entities
    • G06F40/289Phrasal analysis, e.g. finite state techniques or chunking
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/35Clustering; Classification
    • G06F16/355Class or cluster creation or modification
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • G06N3/045Combinations of networks
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • G06N3/049Temporal neural networks, e.g. delay elements, oscillating neurons or pulsed inputs
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q50/00Information and communication technology [ICT] specially adapted for implementation of business processes of specific business sectors, e.g. utilities or tourism
    • G06Q50/10Services
    • G06Q50/18Legal services

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Health & Medical Sciences (AREA)
  • General Engineering & Computer Science (AREA)
  • General Health & Medical Sciences (AREA)
  • Computational Linguistics (AREA)
  • Business, Economics & Management (AREA)
  • Artificial Intelligence (AREA)
  • Data Mining & Analysis (AREA)
  • Evolutionary Computation (AREA)
  • Biophysics (AREA)
  • Molecular Biology (AREA)
  • Computing Systems (AREA)
  • Biomedical Technology (AREA)
  • Mathematical Physics (AREA)
  • Software Systems (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Tourism & Hospitality (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Technology Law (AREA)
  • Economics (AREA)
  • Human Resources & Organizations (AREA)
  • Marketing (AREA)
  • Primary Health Care (AREA)
  • Strategic Management (AREA)
  • General Business, Economics & Management (AREA)
  • Databases & Information Systems (AREA)
  • Machine Translation (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

The embodiment of the invention provides a case affair relation determining method and a case affair relation determining system, wherein the method comprises the steps of collecting a description text of a case, and determining all event types corresponding to the attributes of the case; performing word segmentation processing on the description text to obtain word segmentation results, and extracting entity information in the case from the word segmentation results; extracting event information corresponding to each event in the case from the word segmentation result based on all event types and the entity information; and determining the event relation among all the events in the case based on the event information corresponding to each event. According to the method for the case structure disassembly by taking the event as the core and determining the case affair relation, the case information is fully used, the method can be used for constructing downstream tasks such as event logic chains, meta-event prediction and motivation analysis, and the effects of completing intelligent legal tasks such as related law and law searching and automatic judgment can be further achieved.

Description

Case affair relation determining method and system
Technical Field
The invention relates to the technical field of legal information processing, in particular to a case affair relationship determining method and system.
Background
In the case investigation process, the analysis of the case information is very important. In the prior art, case information is generally analyzed in a structural table form, keyword such as time, case party, location, behavior and the like is used for extracting the case information, a table is constructed, and follow-up tasks such as follow-up case analysis, related method search, automatic judgment and the like are performed in a table searching mode.
However, the method provided in the prior art does not consider the case relationship among a series of events in a case, and splits the events and events under the same case, so that deep analysis such as human motivation analysis and emotion analysis cannot be performed, and a large amount of information is lost when tasks such as related law and law search and automatic judgment are performed.
Therefore, it is urgently needed to provide a case affair relationship determining method and system.
Disclosure of Invention
To overcome the above problems or at least partially solve the above problems, embodiments of the present invention provide a case affairs relationship determining method and system.
In a first aspect, an embodiment of the present invention provides a method for determining a case affair relationship, including:
collecting a description text of a case, and determining all event types corresponding to the attributes of the case;
performing word segmentation processing on the description text to obtain word segmentation results, and extracting entity information in the case from the word segmentation results;
extracting event information corresponding to each event in the case from the word segmentation result based on all event types and the entity information;
and determining the event relation among all the events in the case based on the event information corresponding to each event.
Preferably, the extracting entity information in the case from the word segmentation result specifically includes:
inputting the word segmentation result into a long-short term memory neural network model, and outputting entity information in the case by the long-short term memory neural network model;
the long-short term memory neural network model adopts a feature extractor based on self attention to extract features, and is obtained by training sample words and entity information corresponding to the sample words.
Preferably, the output layer of the long-short term memory neural network model is constructed based on a Softmax function or a conditional random field.
Preferably, the extracting event information corresponding to each event in the case from the word segmentation result based on all event types and the entity information specifically includes:
inputting the word segmentation result into a target neural network model, and outputting event information corresponding to each event in the case by the target neural network model;
the target neural network model is obtained by training events to which sample words belong and event information corresponding to the events to which the sample words belong, and the events to which the sample words belong correspond to all event types and the entity information.
Preferably, the event information includes event trigger, event type and event argument; accordingly, the number of the first and second electrodes,
the inputting the word segmentation result into a target neural network model, and outputting event information corresponding to each event in the case by the target neural network model specifically includes:
the target neural network model is based on a pipeline model, firstly, an event trigger corresponding to each event in the word segmentation result is recognized, then, an event type corresponding to the event trigger is determined, and finally, an event argument corresponding to the event trigger is determined.
Preferably, the target neural network model is a convolutional neural network model and/or a recurrent neural network model.
Preferably, the event information includes event trigger, event type and event argument;
correspondingly, the determining the event relationship among all the events in the case based on the event information corresponding to each event specifically includes:
determining the superior-inferior relation between all events in the case based on the superior-inferior relation of the event types according to the event type corresponding to each event in the case;
and determining the sequential relation among all the events in the case according to the event trigger, the event type and the event argument corresponding to each event in the case.
In a second aspect, an embodiment of the present invention provides a case affairs relationship determining system, including: the system comprises an event type determining module, an entity information extracting module, an event information extracting module and a matter relation determining module. Wherein the content of the first and second substances,
the event type determining module is used for acquiring description texts of cases and determining all event types contained in the cases;
the entity information extraction module is used for carrying out word segmentation processing on the description text to obtain word segmentation results and extracting entity information in the case from the word segmentation results;
the event information extraction module is used for extracting event information corresponding to each event in the case from the word segmentation result based on all event types and the entity information;
and the case relation determining module is used for determining the case relation among all the events in the case based on the event information corresponding to each event.
In a third aspect, an embodiment of the present invention provides an electronic device, including: a memory, a processor and a computer program stored on the memory and executable on the processor, the processor implementing the steps of the case affairs relationship determination method according to the first aspect when executing the program.
In a fourth aspect, an embodiment of the present invention provides a non-transitory computer-readable storage medium, on which a computer program is stored, where the computer program, when executed by a processor, implements the steps of the case affair relationship determination method according to the first aspect.
The embodiment of the invention provides a case affair relation determining method and a case affair relation determining system, wherein the method comprises the steps of collecting a description text of a case, and determining all event types corresponding to the attributes of the case; performing word segmentation processing on the description text to obtain word segmentation results, and extracting entity information in the case from the word segmentation results; extracting event information corresponding to each event in the case from the word segmentation result based on all event types and the entity information; and determining the event relation among all the events in the case based on the event information corresponding to each event. According to the method for the case structure disassembly by taking the event as the core and determining the case affair relation, the case information is fully used, the method can be used for constructing downstream tasks such as event logic chains, meta-event prediction and motivation analysis, and the effects of completing intelligent legal tasks such as related law and law searching and automatic judgment can be further achieved.
Drawings
In order to more clearly illustrate the embodiments of the present invention or the technical solutions in the prior art, the drawings used in the description of the embodiments or the prior art will be briefly introduced below, and it is obvious that the drawings in the following description are some embodiments of the present invention, and for those skilled in the art, other drawings can be obtained according to these drawings without creative efforts.
Fig. 1 is a schematic flow chart of a case affair relationship determining method according to an embodiment of the present invention;
fig. 2 is a schematic diagram of a directed acyclic graph formed in the case affairs relationship determination method according to the embodiment of the present invention;
fig. 3 is a schematic structural diagram of a case affairs relationship determining system according to an embodiment of the present invention;
fig. 4 is a schematic structural diagram of an electronic device according to an embodiment of the present invention.
Detailed Description
In order to make the objects, technical solutions and advantages of the embodiments of the present invention clearer, the technical solutions in the embodiments of the present invention will be clearly and completely described below with reference to the drawings in the embodiments of the present invention, and it is obvious that the described embodiments are some, but not all, embodiments of the present invention. All other embodiments, which can be derived by a person skilled in the art from the embodiments given herein without making any creative effort, shall fall within the protection scope of the present invention.
As shown in fig. 1, an embodiment of the present invention provides a case affairs relationship determining method, including:
s1, collecting the description text of the case, and determining all event types contained in the case;
s2, performing word segmentation processing on the description text to obtain word segmentation results, and extracting entity information in the case from the word segmentation results;
s3, extracting event information corresponding to each event in the case from the word segmentation result based on all event types and the entity information;
s4, determining the event relation among all the events in the case based on the event information corresponding to each event.
Specifically, in the case affair relationship determining method provided in the embodiment of the present invention, the execution subject is a server, specifically, the execution subject may be a local server, or may be a cloud server, and the local server may be a computer.
Step S1 is first executed, a description text of a case is collected, the case refers to a combination including a plurality of events, an attribute of the case refers to a type of the case, and the case may be an administrative penalty case, a criminal case, or another case requiring determination of a physical relationship between events included inside the case, which is not specifically limited in the embodiment of the present invention. Each event contained in the case is composed of event arguments such as event principal, event occurrence time and event occurrence place and event trigger, wherein the event trigger refers to action related to the event. In the embodiment of the invention, the description text of the case refers to the content obtained by recording event arguments, event triggers and the like of the case in a text mode. After the description text of the case is collected, all event types corresponding to the attribute of the case can be determined through the form of expert discussion, for example, if the attribute of the case is an administrative penalty case, all corresponding event types may include an attack event, a theft event, a fraud event, a penalty event, a fine event, a non-repudiation event, and the like. In the embodiment of the invention, after all event types corresponding to the attributes of the case are determined, all event types can form a set, and the set is an event type set. The upper and lower bit relations of all event types in the event type set can be determined to form a tree diagram. The upper and lower relationships of the event types refer to the inclusion or contained relationship between the event types, for example, the penalty event is a higher concept of a fine event, a non-repudiation event, etc., and the penalty event, the non-repudiation event, etc. is a lower concept of the penalty event, that is, the penalty event includes the fine event, the non-repudiation event, etc.
Then, step 2 is executed, sentence separation processing and word separation processing are carried out on the description text, each sentence content in the description text can contain one or more event types, and event discovery and event argument extraction are carried out by using a Chinese legal event extraction technology; the method comprises the steps of segmenting each sentence of content in a description text by using a Chinese segmentation technology of an added legal dictionary, splitting each sentence of content in the description text into a list of a plurality of words, and enabling all single words to form a segmentation result. And classifying each word to determine entity information in the case. The entity information refers to information such as time, place, people, actions and the like related in the whole case, and the process of determining the entity information in the case is the process of determining the category to which each word belongs.
Then, step S3 is executed to extract event information corresponding to each event in the case from the word segmentation result based on all event types and the entity information in the case. According to all event types, extracting a plurality of events in the word segmentation result by taking the events as units, wherein the event information corresponding to each event comprises event trigger, event types, event arguments and the like. In the embodiment of the present invention, the extracted events may also be numbered, that is, the event information corresponding to each event further includes an event number. In the embodiment of the invention, the event information corresponding to each event can be stored, and the storage mode can be that the event number, the event type, the event trigger and the event argument corresponding to the event are jointly stored. It should be noted that event triggers, events and event numbers are in one-to-one correspondence, that is, one event has only one number, and also has only one event trigger, and different event triggers correspond to different events. The same event type may correspond to different event triggers, i.e. to different events. That is, different events may have the same event type.
And finally, executing step S4, and determining the event relation among all the events in the case according to the event information corresponding to each event in the case determined in step S3. The event relationship may include an upper-lower relationship and a sequential relationship, where the upper-lower relationship refers to an inclusion or contained relationship of an event, and the sequential relationship refers to a conditional relationship and a causal relationship of the event.
In the embodiment of the invention, a Directed Acyclic Graph (DAG) between all events can be constructed according to the affair relationship between all events in the case, so that the affair relationship is more intuitive. For example, if it is determined that the case includes an overload event, an impound event, a payment event, a fine event, and a return event by the method provided in the embodiment of the present invention, a directed acyclic graph is constructed as shown in fig. 2.
The case affair relationship determining method provided by the embodiment of the invention comprises the steps of collecting description texts of cases, and determining all event types corresponding to the attributes of the cases; performing word segmentation processing on the description text to obtain word segmentation results, and extracting entity information in the case from the word segmentation results; extracting event information corresponding to each event in the case from the word segmentation result based on all event types and the entity information; and determining the event relation among all the events in the case based on the event information corresponding to each event. According to the method for the case structure disassembly by taking the event as the core and determining the case affair relation, the case information is fully used, the method can be used for constructing downstream tasks such as event logic chains, meta-event prediction and motivation analysis, and the effects of completing intelligent legal tasks such as related law and law searching and automatic judgment can be further achieved.
On the basis of the above embodiment, the case affair relationship determining method provided in the embodiment of the present invention extracts entity information in the case from the word segmentation result, and specifically includes:
inputting the word segmentation result into a long-short term memory neural network model, and outputting entity information in the case by the long-short term memory neural network model;
the long-short term memory neural network model adopts a feature extractor based on self attention to extract features, and is obtained by training sample words and entity information corresponding to the sample words.
Specifically, in the embodiment of the present invention, when the entity information in the case is extracted from the word segmentation result in step S2, the extraction may be specifically implemented by a Long Short-term memory neural network (LSTM) model, the word segmentation result is input to the LSTM model, the LSTM model performs entity extraction, and the entity information is output. The LSTM model may be combined with a self-attention-based feature extractor (Transformer) to perform feature extraction by the Transformer. The output layer of the LSTM model may be implemented by using a Softmax function, or may also be implemented by using a Conditional Random Field (CRF), which is not specifically limited in the embodiment of the present invention.
The case affair relationship determining method provided by the embodiment of the invention adopts the long-short term memory neural network model and combines the feature extractor based on self-attention and the conditional random field, so that the extraction efficiency of the entity information is higher, and the extracted entity information is more accurate.
On the basis of the foregoing embodiment, the case affair relationship determining method provided in the embodiment of the present invention is a case affair relationship determining method that extracts, from the word segmentation result, event information corresponding to each event in the case based on all event types and the entity information, and specifically includes:
inputting the word segmentation result into a target neural network model, and outputting event information corresponding to each event in the case by the target neural network model;
the target neural network model is obtained by training events to which sample words belong and event information corresponding to the events to which the sample words belong, and the events to which the sample words belong correspond to all event types and the entity information.
Specifically, in the embodiment of the present invention, when step S3 is executed, the target neural network model processes the word segmentation result, and determines event information corresponding to each event. The target Neural network model may be a Convolutional Neural Network (CNN) model or a Recurrent Neural Network (RNN) model, or may be a combination of a CNN model and an RNN model. The embodiment of the invention can also adopt a CNN model or an RNN model based on an attention mechanism, or a combination of the CNN model and the RNN model.
On the basis of the above embodiment, in the case affair relationship determining method provided in the embodiment of the present invention, the event information includes event trigger, event type, and event argument; correspondingly, the inputting the word segmentation result into a target neural network model, and outputting event information corresponding to each event in the case by the target neural network model specifically includes:
the target neural network model is based on a pipeline model, firstly, an event trigger corresponding to each event in the word segmentation result is recognized, then, an event type corresponding to the event trigger is determined, and finally, an event argument corresponding to the event trigger is determined.
Specifically, in the embodiment of the present invention, a Pipeline model (Pipeline) may be introduced into the target neural network model, the event trigger corresponding to each event is identified first, then the event type corresponding to the event trigger is determined, and finally the event argument corresponding to the event trigger is extracted according to the characteristics of the event type to form the structured expression of the event in the case.
On the basis of the above embodiment, in the case affair relationship determining method provided in the embodiment of the present invention, the event information includes event trigger, event type, and event argument;
correspondingly, the determining the event relationship among all the events in the case based on the event information corresponding to each event specifically includes:
determining the superior-inferior relation between all events in the case based on the superior-inferior relation of the event types according to the event type corresponding to each event in the case;
and determining the sequential relation among all the events in the case according to the event trigger, the event type and the event argument corresponding to each event in the case.
Specifically, in the embodiment of the present invention, when step S4 is executed, the upper and lower relationships of each event may be determined by comparing the event type with the event type corresponding to each event in the case and combining the upper and lower relationships of the event type, where the upper and lower relationships of the event type are consistent with the upper and lower relationships of the event. And then comparing event types, event triggers, event arguments such as event time, people, places and the like, and determining the cis-bearing relation among all the events in the case. And obtaining the superior-subordinate relation and the sequential relation among all the events in the case, namely determining the affair relation among all the events in the case.
As shown in fig. 3, on the basis of the above embodiment, an embodiment of the present invention provides a case affairs relationship determining system, including: an event type determination module 31, an entity information extraction module 32, an event information extraction module 33, and a case relationship determination module 34.
The event type determining module 31 is configured to collect description texts of cases, and determine all event types included in the cases;
the entity information extraction module 32 is configured to perform word segmentation processing on the description text to obtain word segmentation results, and extract entity information in the case from the word segmentation results;
the event information extraction module 33 is configured to extract event information corresponding to each event in the case from the word segmentation result based on all event types and the entity information;
the case relation determining module 34 is configured to determine a case relation between all events in the case based on the event information corresponding to each event.
Specifically, the functions of the modules in the case affair relationship determination system provided in the embodiment of the present invention correspond to the operation flows of the steps in the method embodiments one to one, and the implementation effects are also consistent.
As shown in fig. 4, on the basis of the above embodiment, an embodiment of the present invention provides an electronic device, including: a processor (processor)401, a memory (memory)402, a communication Interface (Communications Interface)403, and a communication bus 404; wherein the content of the first and second substances,
the processor 401, the memory 402 and the communication interface 403 complete communication with each other through the communication bus 404. The memory 402 stores program instructions executable by the processor 401, and the processor 401 is configured to call the program instructions in the memory 402 to perform the method provided by the above-mentioned embodiments of the method, for example, including: collecting a description text of a case, and determining all event types corresponding to the attributes of the case; performing word segmentation processing on the description text to obtain word segmentation results, and extracting entity information in the case from the word segmentation results; extracting event information corresponding to each event in the case from the word segmentation result based on all event types and the entity information; and determining the event relation among all the events in the case based on the event information corresponding to each event.
It should be noted that, when being implemented specifically, the electronic device in this embodiment may be a server, a PC, or another device, as long as the structure includes the processor 401, the communication interface 403, the memory 402, and the communication bus 404 shown in fig. 4, where the processor 401, the communication interface 403, and the memory 402 complete mutual communication through the communication bus 404, and the processor 401 may call a logic instruction in the memory 402 to execute the above method. The embodiment does not limit the specific implementation form of the electronic device.
The logic instructions in memory 402 may be implemented in software functional units and stored in a computer readable storage medium when sold or used as a stand-alone article of manufacture. Based on such understanding, the technical solution of the present invention may be embodied in the form of a software product, which is stored in a storage medium and includes instructions for causing a computer device (which may be a personal computer, a server, or a network device) to execute all or part of the steps of the method according to the embodiments of the present invention. And the aforementioned storage medium includes: various media capable of storing program codes, such as a usb disk, a removable hard disk, a Read-Only Memory (ROM), a Random Access Memory (RAM), a magnetic disk, or an optical disk.
Further, embodiments of the present invention disclose a computer program product comprising a computer program stored on a non-transitory computer-readable storage medium, the computer program comprising program instructions, which when executed by a computer, the computer is capable of performing the methods provided by the above-mentioned method embodiments, for example, comprising: collecting a description text of a case, and determining all event types corresponding to the attributes of the case; performing word segmentation processing on the description text to obtain word segmentation results, and extracting entity information in the case from the word segmentation results; extracting event information corresponding to each event in the case from the word segmentation result based on all event types and the entity information; and determining the event relation among all the events in the case based on the event information corresponding to each event.
On the basis of the foregoing embodiments, an embodiment of the present invention further provides a non-transitory computer-readable storage medium, on which a computer program is stored, the computer program being implemented to perform the transmission method provided by the foregoing embodiments when executed by a processor, and the method includes: collecting a description text of a case, and determining all event types corresponding to the attributes of the case; performing word segmentation processing on the description text to obtain word segmentation results, and extracting entity information in the case from the word segmentation results; extracting event information corresponding to each event in the case from the word segmentation result based on all event types and the entity information; and determining the event relation among all the events in the case based on the event information corresponding to each event.
The above-described embodiments of the apparatus are merely illustrative, and the units described as separate parts may or may not be physically separate, and parts displayed as units may or may not be physical units, may be located in one place, or may be distributed on a plurality of network units. Some or all of the modules may be selected according to actual needs to achieve the purpose of the solution of the present embodiment. One of ordinary skill in the art can understand and implement it without inventive effort.
Through the above description of the embodiments, those skilled in the art will clearly understand that each embodiment can be implemented by software plus a necessary general hardware platform, and certainly can also be implemented by hardware. With this understanding in mind, the above-described technical solutions may be embodied in the form of a software product, which can be stored in a computer-readable storage medium such as ROM/RAM, magnetic disk, optical disk, etc., and includes instructions for causing a computer device (which may be a personal computer, a server, or a network device, etc.) to execute the methods described in the embodiments or some parts of the embodiments.
Finally, it should be noted that: the above examples are only intended to illustrate the technical solution of the present invention, but not to limit it; although the present invention has been described in detail with reference to the foregoing embodiments, it will be understood by those of ordinary skill in the art that: the technical solutions described in the foregoing embodiments may still be modified, or some technical features may be equivalently replaced; and such modifications or substitutions do not depart from the spirit and scope of the corresponding technical solutions of the embodiments of the present invention.

Claims (10)

1. A case affairs relation determining method is characterized by comprising the following steps:
collecting a description text of a case, and determining all event types corresponding to the attributes of the case;
performing word segmentation processing on the description text to obtain word segmentation results, and extracting entity information in the case from the word segmentation results;
extracting event information corresponding to each event in the case from the word segmentation result based on all event types and the entity information;
and determining the event relation among all the events in the case based on the event information corresponding to each event.
2. A case affairs relationship determination method according to claim 1, wherein the extracting entity information in the case from the word segmentation result specifically includes:
inputting the word segmentation result into a long-short term memory neural network model, and outputting entity information in the case by the long-short term memory neural network model;
the long-short term memory neural network model adopts a feature extractor based on self attention to extract features, and is obtained by training sample words and entity information corresponding to the sample words.
3. A case affairs relationship determination method according to claim 2, wherein an output layer of the long-short term memory neural network model is constructed based on a Softmax function or a conditional random field.
4. The case affairs relationship determining method according to claim 1, wherein the extracting event information corresponding to each event in the case from the word segmentation result based on all event types and the entity information specifically includes:
inputting the word segmentation result into a target neural network model, and outputting event information corresponding to each event in the case by the target neural network model;
the target neural network model is obtained by training events to which sample words belong and event information corresponding to the events to which the sample words belong, and the events to which the sample words belong correspond to all event types and the entity information.
5. A case affairs relationship determination method according to claim 4, wherein the event information includes event trigger, event type and event argument; accordingly, the number of the first and second electrodes,
the inputting the word segmentation result into a target neural network model, and outputting event information corresponding to each event in the case by the target neural network model specifically includes:
the target neural network model is based on a pipeline model, firstly, an event trigger corresponding to each event in the word segmentation result is recognized, then, an event type corresponding to the event trigger is determined, and finally, an event argument corresponding to the event trigger is determined.
6. A case affairs relationship determination method according to claim 4, wherein the target neural network model is specifically a convolutional neural network model and/or a recurrent neural network model.
7. A case affairs relationship determination method according to any one of claims 1-6, wherein the event information includes event trigger, event type and event argument;
correspondingly, the determining the event relationship among all the events in the case based on the event information corresponding to each event specifically includes:
determining the superior-inferior relation between all events in the case based on the superior-inferior relation of the event types according to the event type corresponding to each event in the case;
and determining the sequential relation among all the events in the case according to the event trigger, the event type and the event argument corresponding to each event in the case.
8. A case affairs relationship determination system, comprising:
the event type determining module is used for acquiring description texts of cases and determining all event types contained in the cases;
the entity information extraction module is used for carrying out word segmentation processing on the description text to obtain word segmentation results and extracting entity information in the case from the word segmentation results;
the event information extraction module is used for extracting event information corresponding to each event in the case from the word segmentation result based on all event types and the entity information;
and the case relation determining module is used for determining the case relation among all the events in the case based on the event information corresponding to each event.
9. An electronic device, comprising: memory, processor and computer program stored on the memory and executable on the processor, characterized in that the processor implements the steps of the case affairs relationship determination method according to any of claims 1-7 when executing the program.
10. A non-transitory computer readable storage medium, having stored thereon a computer program, characterized in that the computer program, when being executed by a processor, is adapted to carry out the steps of the case affairs relationship determination method according to any one of claims 1-7.
CN202010312498.0A 2020-04-20 2020-04-20 Case affair relation determining method and system Pending CN111639494A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202010312498.0A CN111639494A (en) 2020-04-20 2020-04-20 Case affair relation determining method and system

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202010312498.0A CN111639494A (en) 2020-04-20 2020-04-20 Case affair relation determining method and system

Publications (1)

Publication Number Publication Date
CN111639494A true CN111639494A (en) 2020-09-08

Family

ID=72330782

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202010312498.0A Pending CN111639494A (en) 2020-04-20 2020-04-20 Case affair relation determining method and system

Country Status (1)

Country Link
CN (1) CN111639494A (en)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113626618A (en) * 2021-10-12 2021-11-09 深圳前海环融联易信息科技服务有限公司 Case series connection method and device based on cross-chapter event extraction and related components
CN114707004A (en) * 2022-05-24 2022-07-05 国网浙江省电力有限公司信息通信分公司 Method and system for extracting and processing case-affair relation based on image model and language model

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20130226844A1 (en) * 2005-12-12 2013-08-29 Qin Zhang Content Summarizing and Search Method and System Using Thinking System
CN110457443A (en) * 2019-08-12 2019-11-15 贵州大学 A kind of criminal offence chain building method based on criminal case
CN110738039A (en) * 2019-09-03 2020-01-31 平安科技(深圳)有限公司 Prompting method, device, storage medium and server for case auxiliary information
CN110858269A (en) * 2018-08-09 2020-03-03 清华大学 Criminal name prediction method and device
CN110969549A (en) * 2018-09-30 2020-04-07 北京国双科技有限公司 Judicial data processing method and system

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20130226844A1 (en) * 2005-12-12 2013-08-29 Qin Zhang Content Summarizing and Search Method and System Using Thinking System
CN110858269A (en) * 2018-08-09 2020-03-03 清华大学 Criminal name prediction method and device
CN110969549A (en) * 2018-09-30 2020-04-07 北京国双科技有限公司 Judicial data processing method and system
CN110457443A (en) * 2019-08-12 2019-11-15 贵州大学 A kind of criminal offence chain building method based on criminal case
CN110738039A (en) * 2019-09-03 2020-01-31 平安科技(深圳)有限公司 Prompting method, device, storage medium and server for case auxiliary information

Cited By (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113626618A (en) * 2021-10-12 2021-11-09 深圳前海环融联易信息科技服务有限公司 Case series connection method and device based on cross-chapter event extraction and related components
CN113626618B (en) * 2021-10-12 2022-02-15 深圳前海环融联易信息科技服务有限公司 Case series connection method and device based on cross-chapter event extraction and related components
WO2023060634A1 (en) * 2021-10-12 2023-04-20 深圳前海环融联易信息科技服务有限公司 Case concatenation method and apparatus based on cross-chapter event extraction, and related component
CN114707004A (en) * 2022-05-24 2022-07-05 国网浙江省电力有限公司信息通信分公司 Method and system for extracting and processing case-affair relation based on image model and language model
CN114707004B (en) * 2022-05-24 2022-08-16 国网浙江省电力有限公司信息通信分公司 Method and system for extracting and processing case-affair relation based on image model and language model

Similar Documents

Publication Publication Date Title
Enayet et al. NileTMRG at SemEval-2017 task 8: Determining rumour and veracity support for rumours on Twitter.
CN108073568A (en) keyword extracting method and device
CN108536681A (en) Intelligent answer method, apparatus, equipment and storage medium based on sentiment analysis
KR20200007969A (en) Information processing methods, terminals, and computer storage media
Udanor et al. Combating the challenges of social media hate speech in a polarized society: A Twitter ego lexalytics approach
JP2021101361A (en) Method, device, apparatus and storage medium for generating event topics
CN110929520A (en) Non-named entity object extraction method and device, electronic equipment and storage medium
CN111178701B (en) Risk control method and device based on feature derivation technology and electronic equipment
CN111581956A (en) Sensitive information identification method and system based on BERT model and K nearest neighbor
CN111639494A (en) Case affair relation determining method and system
EP3477575A1 (en) Computer implemented method and system for detecting a set of relevant communities
CN116340606A (en) Analysis method, device, storage medium and equipment for major emergency
Suhas Bharadwaj et al. A novel multimodal hybrid classifier based cyberbullying detection for social media platform
AlAjlan et al. Machine learning approach for threat detection on social media posts containing Arabic text
CN110069686A (en) User behavior analysis method, apparatus, computer installation and storage medium
CN115033668B (en) Story venation construction method and device, electronic equipment and storage medium
Gupta et al. Real-time sentiment analysis of tweets: A case study of Punjab elections
CN112507214B (en) User name-based data processing method, device, equipment and medium
CN109597879B (en) Service behavior relation extraction method and device based on 'citation relation' data
KR20220105792A (en) AI-based Decision Making Support System utilizing Dynamic Text Sources
CN114117057A (en) Keyword extraction method of product feedback information and terminal equipment
Laaksonen et al. From hate speech recognition to happiness indexing: critical issues in datafication of emotion in text mining
Barrows et al. Sentiment and objectivity in Iranian state-sponsored propaganda on twitter
Sandaka et al. Sentiment Analysis and Time-series Analysis for the COVID-19 vaccine Tweets
CN112487165A (en) Question and answer method, device and medium based on keywords

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication
RJ01 Rejection of invention patent application after publication

Application publication date: 20200908