WO2021051865A1 - Case recommendation method and device, apparatus, and computer readable storage medium - Google Patents

Case recommendation method and device, apparatus, and computer readable storage medium Download PDF

Info

Publication number
WO2021051865A1
WO2021051865A1 PCT/CN2020/093108 CN2020093108W WO2021051865A1 WO 2021051865 A1 WO2021051865 A1 WO 2021051865A1 CN 2020093108 W CN2020093108 W CN 2020093108W WO 2021051865 A1 WO2021051865 A1 WO 2021051865A1
Authority
WO
WIPO (PCT)
Prior art keywords
case
knowledge
information
similarity
candidate
Prior art date
Application number
PCT/CN2020/093108
Other languages
French (fr)
Chinese (zh)
Inventor
邓俊豪
聂宇昕
徐冰
陈晨
汪伟
Original Assignee
平安科技(深圳)有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 平安科技(深圳)有限公司 filed Critical 平安科技(深圳)有限公司
Publication of WO2021051865A1 publication Critical patent/WO2021051865A1/en

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/36Creation of semantic tools, e.g. ontology or thesauri
    • G06F16/367Ontology
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/35Clustering; Classification
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/22Matching criteria, e.g. proximity measures
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q50/00Information and communication technology [ICT] specially adapted for implementation of business processes of specific business sectors, e.g. utilities or tourism
    • G06Q50/10Services
    • G06Q50/18Legal services

Definitions

  • This application relates to the technical field of data analysis in artificial intelligence, and in particular to a case recommendation method, device, equipment, and computer-readable storage medium.
  • the focus of the case can usually represent the entire case. Therefore, the judge needs to determine the focus of the dispute based on the plaintiff’s petition, the court’s defense, and the evidence provided by both parties. The judgment can be completed by resolving the focus of the dispute, and the facts need to be considered when resolving the focus of the dispute. Factors, evidence, laws and regulations, etc., while the facts, evidence, and laws and regulations are complex, there will be situations where the focus of the dispute is the same but the results of the case are different, which is not convenient for the judge to decide the case.
  • judges refer to similar cases that have been tried before to effectively regulate and restrict discretion to ensure that similar cases are basically uniform in the application of law, basically the same criteria for judgment, and basically consistent judgment results.
  • the inventor realizes that there are many cases that have been tried before, and it takes more time to find similar cases, and the judges subjectively determine the logical relationship between the cases, and the judgments of similar cases may not be found. Applicable to the current case, which affects the efficiency of judgment. Therefore, how to accurately determine the same case in the same case, ensure the same case and improve the efficiency of the case is a problem that needs to be solved urgently.
  • the main purpose of this application is to provide a case recommendation method, device, equipment, and computer-readable storage medium, aiming to accurately determine the case of the same case, ensure the same case and improve the efficiency of the case.
  • this application provides a case recommendation method, which includes the following steps:
  • co-judge cases are determined from the set of candidate cases.
  • this application also provides a case recommendation device, the case recommendation device including:
  • the dispute focus determination module is used to obtain the judgment data of the target case, and based on the preset dispute focus determination model, determine the dispute focus of the target case according to the judgment data;
  • the graph construction module is used to construct the first knowledge graph of the target case according to the judgment data and the dispute focus;
  • the candidate case determination module is used to determine the candidate case set to be recommended from the pre-stored judged case database according to the focus of the dispute;
  • a calculation module configured to obtain a second knowledge graph corresponding to each candidate case in the candidate case set, and calculate the similarity between the first knowledge graph and each of the second knowledge graphs;
  • the case determination module is configured to determine the same case and concurrent judgment cases from the set of candidate cases according to the similarity between the first knowledge graph and each of the second knowledge graphs.
  • the present application also provides a computer device that includes a processor, a memory, and a computer program stored on the memory and executable by the processor, wherein the computer program is When the processor executes, a case recommendation method is implemented, where the case recommendation method includes:
  • co-judge cases are determined from the set of candidate cases.
  • this application also provides a computer-readable storage medium with a computer program stored on the computer-readable storage medium, and when the computer program is executed by a processor, a case recommendation method is implemented, wherein:
  • the recommended method for the case includes the following steps:
  • co-judge cases are determined from the set of candidate cases.
  • This application provides a case recommendation method, device, equipment, and computer-readable storage medium, which can accurately determine the same case and the same case.
  • FIG. 1 is a schematic flowchart of a case recommendation method provided by an embodiment of the application
  • Figure 2 is a schematic diagram of the knowledge graph in an embodiment of the application.
  • FIG. 3 is a schematic flowchart of another case recommendation method provided by an embodiment of the application.
  • FIG. 4 is a schematic block diagram of a case recommendation device provided by an embodiment of the application.
  • Fig. 5 is a schematic block diagram of another case recommendation device provided by an embodiment of the application.
  • FIG. 6 is a schematic block diagram of the structure of a computer device related to an embodiment of this application.
  • embodiments of the present application provide a case recommendation method, device, computer equipment, and computer-readable storage medium.
  • the case recommendation method can be applied to a server, and the server can be a single server or a server cluster composed of multiple servers.
  • FIG. 1 is a schematic flowchart of a case recommendation method provided by an embodiment of the application.
  • the case recommendation method includes steps S101 to S105.
  • Step S101 Obtain the judgment data of the target case, and determine the dispute focus of the target case based on the judgment data based on a preset dispute focus determination model.
  • both parties to the case upload the judgment data of the pending case, defense text, and evidence information to the server through the terminal device, or both parties to the case upload the appeal text and defense text of the pending case offline Judgment data such as evidence and evidence information are submitted to the court, and the court staff upload the judgment data submitted by both parties to the case to the server through the terminal device, and the server stores the judgment data of the outstanding cases.
  • the judgment data includes the petition text, defense text and evidence information.
  • the petition text includes the plaintiff's information and the petition's opinions.
  • the defense text includes the court's information and the defense's opinions.
  • the evidence information includes but is not limited to documentary evidence and material evidence. , Audiovisual materials, witness testimony, statements by the parties and conclusions of the appraisal.
  • Evidence attribute refers to the characteristic attribute of the evidence, such as whether the IOU has the signature of the borrower.
  • the trigger mode of the co-case recommendation instruction includes real-time trigger and timing trigger.
  • the real-time trigger is when the co-case recommendation request sent by the terminal device is monitored, the unconfirmed cases that need to be recommended are obtained from the co-case recommendation request of the case.
  • the case number of triggers the recommendation instruction of the same case that contains the case number; timing trigger sets a certain time task for the server, and obtains a case number from the recommendation queue of the same case through the timed task, and triggers the case reasoning instruction containing the case number .
  • the server extracts the plaintiff request information, the lawyer's defense information, and the evidence information from the judgment data; based on the preset dispute focus determination model, the preset number is determined based on the plaintiff's request information, the court's defense information, and the evidence information.
  • the candidate dispute focus and the output probability value of each candidate dispute focus; the candidate dispute focus with the largest output probability value is regarded as the dispute focus of the target case.
  • the output probability value is the probability value that the dispute focus output of the dispute focus determination model is the candidate dispute focus
  • the server stores the first regular expression for extracting the original claim information and the second for extracting the victim’s defense information. Regular expressions, the first regular expression can be used to extract the original claim information from the judgment data, and the second regular expression can be used to extract the defense information from the judgment data.
  • the method of extracting evidence information is specifically: extracting evidence sentences and the evidence items in each evidence sentence from the judgment data through the evidence item extraction model; determining the evidence category of each evidence item and the evidence of each evidence category Attributes, and use each evidence item, the evidence category of each evidence item, and the evidence attributes of each evidence category as evidence information.
  • the method for determining the evidence category is specifically: obtaining the evidence category of the evidence item and each evidence sub-category under the evidence category from a preset evidence classification table, and calculating the evidence item by a similarity formula The similarity between the evidence keywords corresponding to each evidence sub-category under the evidence category, and then the evidence sub-category with the largest similarity is determined as the evidence category of the evidence item.
  • the evidence classification table can be set based on actual conditions, which is not specifically limited in this application.
  • the method for determining the attributes of the evidence is specifically: for each type of evidence, the judgment data is traversed to determine the context information of each type of evidence; and the mapping relationship between the pre-stored evidence attributes and the evidence keywords is searched. , Obtain the target evidence sentence containing the evidence keyword from the context information of each type of evidence, and obtain the evidence attribute group corresponding to the target evidence sentence; after concatenating the target evidence sentence and the evidence attribute group, input the similarity calculation The model calculates the similarity between each evidence attribute in the evidence attribute group and the target evidence sentence, and uses the evidence attribute with the highest similarity as the evidence attribute of the corresponding evidence category to obtain the evidence attribute of each evidence category. It should be noted that the above-mentioned mapping relationship table between the evidence attributes and the evidence keywords can be set based on actual conditions, which is not specifically limited in this application.
  • the above-mentioned evidence item extraction model is through AutoNER (Auto Named Entity Recognition (Auto Named Entity Recognition) is obtained by training based on manually labeled sample data.
  • AutoNER Auto Named Entity Recognition
  • the AutoNER model uses the word vector trained for the ruling document as an embedding. Because the accuracy of the automatic remote labeling data is too low, this module is abandoned and used instead. For training on labeled data, the accuracy of manually labeled sample data is high.
  • the method of data enhancement is used during training, that is, random replacement of no more than 3 words and/or in the evidence sentence Reverse the order of the words in the evidence sentence.
  • the above-mentioned similarity calculation model is obtained by retraining the pre-training model of the BERT model based on legal corpus, reducing the encoder module (encoding module) of the BERT model to 3 layers, and adjusting the sentence length to achieve time optimization.
  • the interference sample data is obtained, and the model is trained based on the interference sample data, and the classification layer is connected to the position of the encoder layer.
  • the AutoNER model is a model that can automatically label data and train named entity recognition without manual labeling.
  • the BERT model is the first deep, bidirectional and unsupervised language representation model.
  • Step S102 Construct a first knowledge graph of the target case according to the judgment data and the dispute focus.
  • the basic case knowledge information and evidence knowledge information are extracted from the judgment data, and the focus of dispute, basic case knowledge information and evidence knowledge information are taken as the case knowledge information of the target case, and then obtained from the knowledge base of preset laws
  • the knowledge of the law that matches the knowledge information of the case, and based on the knowledge information of the case and the knowledge of the law, construct the knowledge map of the target case, which is recorded as the first knowledge map.
  • the method of constructing the knowledge graph is specifically as follows: the law in the knowledge of the law and the plaintiff, lawyer, petition, argument, dispute focus, fact elements, and evidence in the knowledge information of the case are used as the entity nodes of the knowledge graph, And from the legal knowledge and case knowledge information to obtain the relationship and attributes between each entity node (plaintiff, lawyer, claim point of view, argued point of view, dispute focus, factual elements, evidence and specific value of the legal provisions), and then Based on the entity node, the relationship between the entity node and the attribute of the entity node, the knowledge graph of the target case is constructed.
  • Figure 2 is a schematic diagram of the knowledge graph in the embodiment of the application.
  • the entity nodes of the knowledge graph are the plaintiff, the lawyer, the opinion of the appeal, the argued opinion, the focus of the dispute, the factual elements, and the evidence And the law, and the fact elements include small element 1, small element 2, and small element 3.
  • Small element 1 corresponds to law 1
  • small element 2 corresponds to law 2
  • small element 3 corresponds to law 3.
  • Step S103 According to the dispute focus, a candidate case set to be recommended is determined from the pre-stored sentenced case database.
  • the server also determines the candidate case set to be recommended from the pre-stored sentenced case database according to the dispute focus of the target case, that is, it traverses the knowledge map of all the sentenced cases in the stored sentenced case database, and obtains that the knowledge map includes the dispute Focus on the sentenced cases, and collect each of the sentenced cases obtained to form a sentenced case set, and use the sentenced case set as a candidate case set to be recommended.
  • the server stores the knowledge map of the judged case, which is recorded as the second knowledge map.
  • the case knowledge map stores case knowledge information and legal knowledge information.
  • the case knowledge information includes basic case knowledge, dispute focus knowledge, and fact elements Knowledge and evidence knowledge.
  • Basic case knowledge includes but is not limited to litigation figures, litigation companies, litigation participant relationships, plaintiffs, lawyers, petition and defense opinions
  • dispute focus knowledge includes dispute focus
  • evidence knowledge includes evidence items and types of evidence And evidence attributes.
  • Evidence attribute refers to the characteristic attribute of the evidence, such as whether the IOU has the signature of the borrower. There is at least one fact in every plaintiff request, such as "A requests B to repay the money". The fact is "A lent money to B and B does not repay it.”
  • the following takes a sentenced case as an example to explain how to construct the knowledge graph of a sentenced case. Specifically, obtain the judgment document of the judged case, and extract the case knowledge from the judgment document to obtain the structured case knowledge information and legal knowledge information of the judged case, and then according to the case knowledge information and legal knowledge information, Construct a knowledge graph of convicted cases.
  • the ruling document includes the relationship between the characters, the origin of the case, the trial process, the facts, and the reason and basis of the judgment.
  • the character relationship part includes the basic information of the parties, the basic information of the entrusted agents ad litem, and the litigation status of the parties;
  • the origin part of the case includes information about the petition and plea, the part of the trial process is the court record, and the fact part includes the plaintiff’s litigation request, Facts and reasons, the facts and reasons of the court’s defense, the facts found by the court and the evidence on which the case was determined, the reasons and basis of the judgment include the relationship between the evidence and the focus of the dispute, and the relationship between the basis of the judgment and the legal provisions.
  • Step S104 Obtain a second knowledge graph corresponding to each candidate case in the candidate case set, and calculate the similarity between the first knowledge graph and each second knowledge graph.
  • the server After determining the candidate case set, the server obtains the second knowledge graph corresponding to each candidate case in the candidate case set, and calculates the similarity between the first knowledge graph and each second knowledge graph. It should be noted that the similarity between the first knowledge graph and each second knowledge graph is the similarity between the target case and each candidate case.
  • the server obtains the first case knowledge information from the first knowledge graph, and obtains the respective corresponding second case knowledge information from each second knowledge graph; according to the first case knowledge information and each second knowledge graph; Case knowledge information, calculate the similarity between the first knowledge graph and each second knowledge graph.
  • the first case knowledge information includes attribute information and relationship information of each entity node in the first knowledge graph
  • the second knowledge information includes attribute information and relationship information of each entity node in the second knowledge graph.
  • the server calculates the target similarity between each entity node in the first knowledge graph and the corresponding entity node in each second knowledge graph based on the first case knowledge information and each second case knowledge information ; According to the respective preset coefficients corresponding to each entity node in the first knowledge graph and the similarity of each target, the similarity between the first knowledge graph and each second knowledge graph is calculated.
  • a second knowledge graph Take a second knowledge graph as an example to illustrate the calculation of target similarity. Specifically, obtain the attribute information and/or relationship information of each first entity node from the first case knowledge information, and obtain the second case knowledge information To obtain the attribute information and/or relationship information of each second entity node; according to the attribute information of each first entity node and the attribute information of each second entity node, calculate each first entity node and the corresponding second entity node The first degree of similarity between the entity nodes; and/or calculate the relationship between each first entity node and the corresponding second entity node based on the relationship information of each first entity node and the relationship information of each second entity node The second degree of similarity; take the first degree of similarity and/or the second degree of similarity between each first entity node and the corresponding second entity node as each entity node and each second knowledge in the first knowledge graph The target similarity between corresponding entity nodes in the graph.
  • the attribute information is the attributes of the entity nodes in the knowledge graph, such as the specific parameters of the entity nodes such as dispute focus, fact elements, and evidence
  • the relationship information is the relationship information between the entity nodes in the knowledge graph, such as dispute focus and facts
  • the calculation method of the similarity between the first knowledge graph and a second knowledge graph is specifically: multiplying each first similarity by its corresponding After the first weight coefficient of, is accumulated to obtain the first target similarity between the first knowledge graph and the second knowledge graph; each second similarity is multiplied by the corresponding second weight coefficient and then accumulated to obtain the first The second target similarity between a knowledge graph and the second knowledge graph; calculate the sum of the first target similarity and the second target similarity, and use the sum of the first target similarity and the second target similarity as the first The similarity between the knowledge graph and the second knowledge graph.
  • first weight coefficient and second weight coefficient can be set based on actual conditions, the first weight coefficient of each physical node can be the same or different, and the second weight coefficient of each physical node can be the same , Can also be different, and this application is not specifically limited.
  • Step S105 according to the similarity between the first knowledge graph and each of the second knowledge graphs, determine the same case and concurrent judgment cases from the set of candidate cases.
  • the similarity between the first knowledge graph and each second knowledge graph is taken as the similarity between the target case and each candidate case in the candidate case set, and in the order of the magnitude of the similarity, the candidate Each candidate case in the case set is sorted, and the candidate case with the highest ranking is regarded as a co-judgment case, and the candidate case corresponding to the highest similarity is regarded as a co-judgment case.
  • the case recommendation method provided in the above embodiments is based on the dispute focus determination model, and the dispute focus of the target case can be accurately determined according to the judgment data, and the knowledge map of the target case is constructed based on the judgment data and the dispute focus, and according to the dispute focus, Determine the set of candidate cases to be recommended, and then calculate the similarity between the knowledge graph of the target case and the knowledge graph corresponding to each candidate case, and according to the similarity between the knowledge graph of the target case and the knowledge graph corresponding to each candidate case Degree, can accurately determine the same case and the same sentence.
  • FIG. 3 is a schematic flowchart of another case recommendation method provided by an embodiment of the application.
  • the case recommendation method includes steps S201 to 206.
  • Step S201 Obtain the judgment data of the target case, and determine the dispute focus of the target case based on the judgment data based on a preset dispute focus determination model.
  • the triggered co-case recommendation instruction When the triggered co-case recommendation instruction is monitored, according to the co-case recommendation instruction, the unconfirmed cases that need to be recommended are determined, and the unconfirmed cases that need to be recommended as the target cases, and then the judgment data of the target cases are obtained.
  • the server extracts the plaintiff request information, the lawyer's defense information, and the evidence information from the judgment data; based on the preset dispute focus determination model, the preset number is determined based on the plaintiff's request information, the court's defense information, and the evidence information.
  • the candidate dispute focus and the output probability value of each candidate dispute focus; the candidate dispute focus with the largest output probability value is regarded as the dispute focus of the target case.
  • Step S202 Construct a first knowledge graph of the target case according to the judgment data and the dispute focus.
  • the basic case knowledge information and evidence knowledge information are extracted from the judgment data, and the focus of dispute, basic case knowledge information and evidence knowledge information are taken as the case knowledge information of the target case, and then obtained from the knowledge base of preset laws
  • the knowledge of the law that matches the knowledge information of the case, and based on the knowledge information of the case and the knowledge of the law, construct the knowledge map of the target case, which is recorded as the first knowledge map.
  • Step S203 Calculate the similarity between the dispute focus and the dispute focus of each judged case in the prestored judged case database.
  • the server calculates the similarity between the dispute focus of the target case and the dispute focus of each sentenced case in the pre-stored sentenced case database, that is, encodes each word in the dispute focus of the target case to obtain the first vector, and then Each word in the dispute focus of each judged case is coded to obtain the corresponding second vector, and then the cosine similarity between the first vector and each second vector is calculated, and the first vector is compared with each second vector.
  • the cosine similarity between the vectors is taken as the similarity between the dispute focus of the target case and the dispute focus of each judged case.
  • the server counts the total number of cases of judged cases, and determines the number of concurrent threads according to the total number of cases, that is, obtains the mapping relationship table between the number of pre-stored cases and the number of concurrent threads, and queries the mapping relationship table , Get the number of concurrent threads corresponding to the total number of cases; According to the number of concurrent threads, call the corresponding number of idle threads in the preset thread pool to concurrently calculate the similarity between the dispute focus and the dispute focus of each judged case. It should be noted that the mapping relationship table between the number of cases and the number of concurrent threads and the number of threads in the thread pool can be set based on actual conditions, and this application does not specifically limit this. By concurrently calculating the similarity between the dispute focus and the dispute focus of each judged case through multiple threads, the calculation speed of the similarity can be improved.
  • Step S204 According to the similarity between the dispute focus and the dispute focus of each judged case, determine the candidate case set to be recommended from the pre-stored judged case database.
  • each judged case whose similarity is greater than the preset similarity threshold is written into the preset candidate case blank set, To form a set of candidate cases to be recommended.
  • the aforementioned similarity threshold may be set based on actual conditions, which is not specifically limited in this application.
  • Step S205 Obtain a second knowledge graph corresponding to each candidate case in the candidate case set, and calculate the similarity between the first knowledge graph and each second knowledge graph.
  • the server After determining the candidate case set, the server obtains the second knowledge graph corresponding to each candidate case in the candidate case set, and calculates the similarity between the first knowledge graph and each second knowledge graph. It should be noted that the similarity between the first knowledge graph and each second knowledge graph is the similarity between the target case and each candidate case.
  • Step S206 According to the similarity between the first knowledge graph and each of the second knowledge graphs, determine the same case and the same judgment case from the set of candidate cases.
  • the similarity between the first knowledge graph and each second knowledge graph is taken as the similarity between the target case and each candidate case in the candidate case set, and in the order of the magnitude of the similarity, the candidate Each candidate case in the case set is sorted, and the candidate case with the highest ranking is regarded as a co-judgment case, and the candidate case corresponding to the highest similarity is regarded as a co-judgment case.
  • the case recommendation method provided by the above embodiment can determine the candidate case set to be recommended more accurately based on the similarity between the dispute focus of the target case and the dispute focus of each sentenced case, and is based on the knowledge map of the target case and The similarity between the knowledge graphs of each candidate case can further accurately determine cases similar to the target case, ensure the same case and improve the efficiency of judgment.
  • FIG. 4 is a schematic block diagram of a case recommendation apparatus provided by an embodiment of the application.
  • the case recommendation device 300 includes: a dispute focus determination module 301, a graph construction module 302, a candidate case determination module 303, a calculation module 304, and a case determination module 305.
  • the dispute focus determination module 301 is used to obtain the judgment data of the target case, and based on a preset dispute focus determination model, determine the dispute focus of the target case according to the judgment data;
  • the graph construction module 302 is configured to construct a first knowledge graph of the target case according to the judgment data and the dispute focus;
  • the candidate case determination module 303 is configured to determine the candidate case set to be recommended from the pre-stored judged case database according to the focus of the dispute;
  • the calculation module 304 is configured to obtain a second knowledge graph corresponding to each candidate case in the candidate case set, and calculate the similarity between the first knowledge graph and each second knowledge graph;
  • the case determination module 305 is configured to determine the same-sentence cases from the set of candidate cases according to the similarity between the first knowledge graph and each of the second knowledge graphs.
  • the dispute focus determination module 301 is also used to extract the plaintiff information, the court's defense information, and the evidence information from the judgment data; based on the preset dispute focus determination model according to the The plaintiff’s information, the court’s defense information, and the evidence information determine a preset number of candidate dispute focus and the output probability value of each candidate dispute focus; the candidate dispute focus with the largest output probability value is taken as the target The focus of the case.
  • the calculation module 304 is further configured to obtain first case knowledge information from the first knowledge graph, and obtain respective corresponding second case knowledge information from each of the second knowledge graphs ; According to the first case knowledge information and each of the second case knowledge information, the similarity between the first knowledge graph and each of the second knowledge graphs is calculated.
  • the calculation module 304 is further configured to calculate each entity node and each entity in the first knowledge graph according to the first case knowledge information and each of the second case knowledge information. State the target similarity between corresponding entity nodes in the second knowledge graph;
  • the similarity between the first knowledge graph and each of the second knowledge graphs is calculated.
  • the case recommendation device 400 includes: a dispute focus determination module 401, a graph construction module 402, a first calculation module 403, a candidate case determination module 404, a second calculation module 405, and a case determination module 406.
  • the dispute focus determination module 401 is configured to obtain the judgment data of the target case, and based on a preset dispute focus determination model, determine the dispute focus of the target case according to the judgment data;
  • the graph construction module 402 is configured to construct a first knowledge graph of the target case according to the judgment data and the dispute focus;
  • the first calculation module 403 is used to calculate the similarity between the dispute focus and the dispute focus of each judged case in the pre-stored judged case database;
  • the candidate case determination module 404 is configured to determine the candidate case set to be recommended from the pre-stored judged case database according to the similarity between the dispute focus and the dispute focus of each judged case;
  • the second calculation module 405 is further configured to obtain a second knowledge graph corresponding to each candidate case in the candidate case set, and calculate the similarity between the first knowledge graph and each second knowledge graph;
  • the case determination module 406 is configured to determine the same case and concurrent judgment cases from the set of candidate cases according to the similarity between the first knowledge graph and each of the second knowledge graphs.
  • the first calculation module 403 is also used to count the total number of cases that have been judged, and to determine the number of concurrent threads based on the total number of cases; according to the number of concurrent threads, call a preset The corresponding number of idle threads in the thread pool concurrently calculate the similarity between the dispute focus and the dispute focus of each judged case in the pre-stored judged case library.
  • the first calculation module 403 is further configured to obtain a mapping relationship table between the number of pre-stored cases and the number of concurrent threads, and query the mapping relationship table to obtain the concurrency corresponding to the total number of cases. Threads.
  • the apparatus provided in the foregoing embodiment may be implemented in the form of a computer program, and the computer program may run on the computer device as shown in FIG. 6.
  • FIG. 6 is a schematic block diagram of the structure of a computer device provided by an embodiment of the application.
  • the computer device may be a server.
  • the computer device includes a processor, a memory, and a network interface connected through a system bus, where the memory may include a non-volatile storage medium and an internal memory.
  • the non-volatile storage medium can store an operating system and a computer program.
  • the computer program includes program instructions, and when the program instructions are executed, the processor can execute any case recommendation method.
  • the processor is used to provide computing and control capabilities and support the operation of the entire computer equipment.
  • the internal memory provides an environment for the operation of the computer program in the non-volatile storage medium.
  • the processor can make the processor execute the case recommendation method shown in any of the above embodiments, wherein the Recommended methods for the case include:
  • the judgment data and the dispute focus construct the first knowledge graph of the target case; according to the dispute focus, determine the candidate case set to be recommended from the pre-stored judged case database;
  • co-judge cases are determined from the set of candidate cases.
  • the network interface is used for network communication, such as sending assigned tasks.
  • the network interface is used for network communication, such as sending assigned tasks.
  • FIG. 6 is only a block diagram of part of the structure related to the solution of the present application, and does not constitute a limitation on the computer device to which the solution of the present application is applied.
  • the specific computer device may Including more or fewer parts than shown in the figure, or combining some parts, or having a different arrangement of parts.
  • the embodiment of the present application also provides a computer-readable storage medium, the computer-readable storage medium stores a computer program, the computer-readable storage medium may be non-volatile or volatile, and the computer
  • the program includes program instructions, and the method implemented when the program instructions are executed can refer to the various embodiments of the case recommendation method of this application.
  • the case recommendation method includes the following steps:
  • the judgment data and the dispute focus construct the first knowledge graph of the target case; according to the dispute focus, determine the candidate case set to be recommended from the pre-stored judged case database;
  • co-judge cases are determined from the set of candidate cases.
  • the computer-readable storage medium may be the internal storage unit of the computer device described in the foregoing embodiment, for example, the hard disk or memory of the computer device.
  • the computer-readable storage medium may also be an external storage device of the computer device, such as a plug-in hard disk or a smart memory card (Smart Memory Card) equipped on the computer device.
  • Media Card, SMC Secure Digital (Secure Digital, SD) card, flash memory card (Flash Card) and so on.

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Business, Economics & Management (AREA)
  • General Engineering & Computer Science (AREA)
  • Databases & Information Systems (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Tourism & Hospitality (AREA)
  • Economics (AREA)
  • Strategic Management (AREA)
  • Computational Linguistics (AREA)
  • Health & Medical Sciences (AREA)
  • Animal Behavior & Ethology (AREA)
  • General Health & Medical Sciences (AREA)
  • Human Resources & Organizations (AREA)
  • Marketing (AREA)
  • Primary Health Care (AREA)
  • Technology Law (AREA)
  • General Business, Economics & Management (AREA)
  • Artificial Intelligence (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Evolutionary Biology (AREA)
  • Evolutionary Computation (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

The present invention relates to the technical field of data analysis in artificial intelligence. Provided are a case recommendation method and device, an apparatus, and a computer readable storage medium. The method comprises: acquiring judgement data of a target case, and using a preset focus of disputes determination model to determine the focus of disputes of the target case according to the judgement data; constructing a first knowledge graph of the target case according to the judgement data and the focus of disputes; determining, from a pre-stored closed case base, and according to the focus of disputes, a candidate case set to be recommended; acquiring second knowledge graphs corresponding to respective candidate cases in the candidate case set, and performing calculation to obtain levels of similarity between the first knowledge graph and the respective second knowledge graphs; and determining similar cases with similar decisions from the candidate case set according to the levels of similarity between the first knowledge graph and the respective second knowledge graphs. The present invention relates to data analysis and knowledge graphs, and can be used to accurately determine similar cases with similar decisions.

Description

案件推荐方法、装置、设备及计算机可读存储介质Case recommendation method, device, equipment and computer readable storage medium
本申请要求于2019年09月18日提交中国专利局、申请号为201910883521.9,发明名称为“案件推荐方法、装置、设备及计算机可读存储介质”的中国专利申请的优先权,其全部内容通过引用结合在本申请中。This application claims the priority of a Chinese patent application filed with the Chinese Patent Office on September 18, 2019, the application number is 201910883521.9, and the invention title is "case recommendation method, device, equipment, and computer-readable storage medium". The entire content of the Chinese patent application is approved. The reference is incorporated in this application.
技术领域Technical field
本申请涉及人工智能中的数据分析的技术领域,尤其涉及一种案件推荐方法、装置、设备及计算机可读存储介质。This application relates to the technical field of data analysis in artificial intelligence, and in particular to a case recommendation method, device, equipment, and computer-readable storage medium.
背景技术Background technique
目前,案情的争议焦点通常能够代表整个案件,因此法官需要根据原告诉请、被告辩称和双方提供的证据项确定争议焦点,解决这些争议焦点即可完成判案,而解决争议焦点需要考虑事实要素、证据和法律法规等,而事实要素、证据和法律法规复杂,会出现争议焦点相同,但判案结果不同的情况,不便于法官判案。At present, the focus of the case can usually represent the entire case. Therefore, the judge needs to determine the focus of the dispute based on the plaintiff’s petition, the defendant’s defense, and the evidence provided by both parties. The judgment can be completed by resolving the focus of the dispute, and the facts need to be considered when resolving the focus of the dispute. Factors, evidence, laws and regulations, etc., while the facts, evidence, and laws and regulations are complex, there will be situations where the focus of the dispute is the same but the results of the case are different, which is not convenient for the judge to decide the case.
为解决上述问题,法官在审理案件时,参考先前己经审判过的类似案件,以有效规范和限制自由裁量权,确保同类案件法律适用基本统一,裁判尺度基本相同,判案结果基本一致。然而发明人意识到,先前己经审判过的案件较多,需要花费较多的时间才能找到类似案件,且法官主观的确定案件之间的逻辑关系,找到的类似案件的判案情况可能并不适用当前审理的案件,影响判案效率。因此,如何准确的确定案件的同案同判案件,确保同案同判,提高判案效率是目前亟待解决的问题。In order to solve the above-mentioned problems, judges refer to similar cases that have been tried before to effectively regulate and restrict discretion to ensure that similar cases are basically uniform in the application of law, basically the same criteria for judgment, and basically consistent judgment results. However, the inventor realizes that there are many cases that have been tried before, and it takes more time to find similar cases, and the judges subjectively determine the logical relationship between the cases, and the judgments of similar cases may not be found. Applicable to the current case, which affects the efficiency of judgment. Therefore, how to accurately determine the same case in the same case, ensure the same case and improve the efficiency of the case is a problem that needs to be solved urgently.
技术问题technical problem
本申请的主要目的在于提供一种案件推荐方法、装置、设备及计算机可读存储介质,旨在准确的确定案件的同案同判案件,确保同案同判,提高判案效率。The main purpose of this application is to provide a case recommendation method, device, equipment, and computer-readable storage medium, aiming to accurately determine the case of the same case, ensure the same case and improve the efficiency of the case.
技术解决方案Technical solutions
为实现上述目的,第一方面,本申请提供一种案件推荐方法,所述案件推荐方法包括以下步骤:In order to achieve the foregoing objectives, in the first aspect, this application provides a case recommendation method, which includes the following steps:
获取目标案件的判案数据,并基于预设的争议焦点确定模型,根据所述判案数据确定所述目标案件的争议焦点;Obtain the judgment data of the target case, and determine the dispute focus of the target case based on the judgment data based on a preset dispute focus determination model;
根据所述判案数据和所述争议焦点,构建所述目标案件的第一知识图谱;Construct a first knowledge graph of the target case according to the judgment data and the dispute focus;
根据所述争议焦点,从预存的已判案件库中确定待推荐的候选案件集;According to the focus of the dispute, determine the candidate case set to be recommended from the pre-stored judged case database;
获取所述候选案件集中每个候选案件各自对应的第二知识图谱,并计算所述第一知识图谱与每个所述第二知识图谱之间的相似度;Acquiring a second knowledge graph corresponding to each candidate case in the candidate case set, and calculating the similarity between the first knowledge graph and each of the second knowledge graphs;
根据所述第一知识图谱与每个所述第二知识图谱之间的相似度,从所述候选案件集中确定同案同判案件。According to the degree of similarity between the first knowledge graph and each of the second knowledge graphs, co-judge cases are determined from the set of candidate cases.
第二方面,本申请还提供一种案件推荐装置,所述案件推荐装置包括:In the second aspect, this application also provides a case recommendation device, the case recommendation device including:
争议焦点确定模块,用于获取目标案件的判案数据,并基于预设的争议焦点确定模型,根据所述判案数据确定所述目标案件的争议焦点;The dispute focus determination module is used to obtain the judgment data of the target case, and based on the preset dispute focus determination model, determine the dispute focus of the target case according to the judgment data;
图谱构建模块,用于根据所述判案数据和所述争议焦点,构建所述目标案件的第一知识图谱;The graph construction module is used to construct the first knowledge graph of the target case according to the judgment data and the dispute focus;
候选案件确定模块,用于根据所述争议焦点,从预存的已判案件库中确定待推荐的候选案件集;The candidate case determination module is used to determine the candidate case set to be recommended from the pre-stored judged case database according to the focus of the dispute;
计算模块,用于获取所述候选案件集中每个候选案件各自对应的第二知识图谱,并计算所述第一知识图谱与每个所述第二知识图谱之间的相似度;A calculation module, configured to obtain a second knowledge graph corresponding to each candidate case in the candidate case set, and calculate the similarity between the first knowledge graph and each of the second knowledge graphs;
案件确定模块,用于根据所述第一知识图谱与每个所述第二知识图谱之间的相似度,从所述候选案件集中确定同案同判案件。The case determination module is configured to determine the same case and concurrent judgment cases from the set of candidate cases according to the similarity between the first knowledge graph and each of the second knowledge graphs.
第三方面,本申请还提供一种计算机设备,所述计算机设备包括处理器、存储器、以及存储在所述存储器上并可被所述处理器执行的计算机程序,其中所述计算机程序被所述处理器执行时,实现一种案件推荐方法,其中,所述案件推荐方法包括:In a third aspect, the present application also provides a computer device that includes a processor, a memory, and a computer program stored on the memory and executable by the processor, wherein the computer program is When the processor executes, a case recommendation method is implemented, where the case recommendation method includes:
获取目标案件的判案数据,并基于预设的争议焦点确定模型,根据所述判案数据确定所述目标案件的争议焦点;Obtain the judgment data of the target case, and determine the dispute focus of the target case based on the judgment data based on a preset dispute focus determination model;
根据所述判案数据和所述争议焦点,构建所述目标案件的第一知识图谱;Construct a first knowledge graph of the target case according to the judgment data and the dispute focus;
根据所述争议焦点,从预存的已判案件库中确定待推荐的候选案件集;According to the focus of the dispute, determine the candidate case set to be recommended from the pre-stored judged case database;
获取所述候选案件集中每个候选案件各自对应的第二知识图谱,并计算所述第一知识图谱与每个所述第二知识图谱之间的相似度;Acquiring a second knowledge graph corresponding to each candidate case in the candidate case set, and calculating the similarity between the first knowledge graph and each of the second knowledge graphs;
根据所述第一知识图谱与每个所述第二知识图谱之间的相似度,从所述候选案件集中确定同案同判案件。According to the degree of similarity between the first knowledge graph and each of the second knowledge graphs, co-judge cases are determined from the set of candidate cases.
第四方面,本申请还提供一种计算机可读存储介质,所述计算机可读存储介质上存储有计算机程序,其中所述计算机程序被处理器执行时,实现一种案件推荐方法,其中,所述案件推荐方法包括以下步骤:In a fourth aspect, this application also provides a computer-readable storage medium with a computer program stored on the computer-readable storage medium, and when the computer program is executed by a processor, a case recommendation method is implemented, wherein: The recommended method for the case includes the following steps:
获取目标案件的判案数据,并基于预设的争议焦点确定模型,根据所述判案数据确定所述目标案件的争议焦点;Obtain the judgment data of the target case, and determine the dispute focus of the target case based on the judgment data based on a preset dispute focus determination model;
根据所述判案数据和所述争议焦点,构建所述目标案件的第一知识图谱;Construct a first knowledge graph of the target case according to the judgment data and the dispute focus;
根据所述争议焦点,从预存的已判案件库中确定待推荐的候选案件集;According to the focus of the dispute, determine the candidate case set to be recommended from the pre-stored judged case database;
获取所述候选案件集中每个候选案件各自对应的第二知识图谱,并计算所述第一知识图谱与每个所述第二知识图谱之间的相似度;Acquiring a second knowledge graph corresponding to each candidate case in the candidate case set, and calculating the similarity between the first knowledge graph and each of the second knowledge graphs;
根据所述第一知识图谱与每个所述第二知识图谱之间的相似度,从所述候选案件集中确定同案同判案件。According to the degree of similarity between the first knowledge graph and each of the second knowledge graphs, co-judge cases are determined from the set of candidate cases.
有益效果Beneficial effect
本申请提供一种案件推荐方法、装置、设备及计算机可读存储介质,可以准确的确定同案同判案件。This application provides a case recommendation method, device, equipment, and computer-readable storage medium, which can accurately determine the same case and the same case.
附图说明Description of the drawings
图1为本申请实施例提供的一种案件推荐方法的流程示意图;FIG. 1 is a schematic flowchart of a case recommendation method provided by an embodiment of the application;
图2为本申请实施例中知识图谱的一示意图;Figure 2 is a schematic diagram of the knowledge graph in an embodiment of the application;
图3为本申请实施例提供的另一种案件推荐方法的流程示意图;FIG. 3 is a schematic flowchart of another case recommendation method provided by an embodiment of the application;
图4为本申请实施例提供的一种案件推荐装置的示意性框图;FIG. 4 is a schematic block diagram of a case recommendation device provided by an embodiment of the application;
图5为本申请实施例提供的另一种案件推荐装置的示意性框图;Fig. 5 is a schematic block diagram of another case recommendation device provided by an embodiment of the application;
图6为本申请一实施例涉及的计算机设备的结构示意框图。FIG. 6 is a schematic block diagram of the structure of a computer device related to an embodiment of this application.
本发明的最佳实施方式The best mode of the present invention
为了解决上述问题,本申请实施例提供一种案件推荐方法、装置、计算机设备及计算机可读存储介质。其中,该案件推荐方法可应用于于服务器,该服务器可以为单台的服务器,也可以为由多台服务器组成的服务器集群。In order to solve the foregoing problems, embodiments of the present application provide a case recommendation method, device, computer equipment, and computer-readable storage medium. Among them, the case recommendation method can be applied to a server, and the server can be a single server or a server cluster composed of multiple servers.
请参照图1,图1为本申请实施例提供的一种案件推荐方法的流程示意图。Please refer to FIG. 1, which is a schematic flowchart of a case recommendation method provided by an embodiment of the application.
如图1所示,该案件推荐方法包括步骤S101至步骤S105。As shown in Fig. 1, the case recommendation method includes steps S101 to S105.
步骤S101、获取目标案件的判案数据,并基于预设的争议焦点确定模型,根据所述判案数据确定所述目标案件的争议焦点。Step S101: Obtain the judgment data of the target case, and determine the dispute focus of the target case based on the judgment data based on a preset dispute focus determination model.
其中,案件双方当事人通过终端设备将未判案件的诉请文本、辩护文本和证据信息等判案数据上传至服务器,或者案件双方当事人通过线下的方式将未判案件的诉请文本、辩护文本和证据信息等判案数据提交给法院,由法院的工作人员通过终端设备将案件双方当事人提交的判案数据上传至服务器,服务器存储未判案件的判案数据。Among them, both parties to the case upload the judgment data of the pending case, defense text, and evidence information to the server through the terminal device, or both parties to the case upload the appeal text and defense text of the pending case offline Judgment data such as evidence and evidence information are submitted to the court, and the court staff upload the judgment data submitted by both parties to the case to the server through the terminal device, and the server stores the judgment data of the outstanding cases.
该判案数据包括诉请文本、辩护文本和证据信息,该诉请文本包括原告信息和诉请观点,该辩护文本包括被告信息和辩称观点,该证据信息包括但不限于书证信息、物证信息、视听资料、证人证言、当事人陈述和鉴定结论。证据属性是指证据的特征属性,比如欠条这一项证据有没有借款人签名。The judgment data includes the petition text, defense text and evidence information. The petition text includes the plaintiff's information and the petition's opinions. The defense text includes the defendant's information and the defense's opinions. The evidence information includes but is not limited to documentary evidence and material evidence. , Audiovisual materials, witness testimony, statements by the parties and conclusions of the appraisal. Evidence attribute refers to the characteristic attribute of the evidence, such as whether the IOU has the signature of the borrower.
当监测到触发的同案推荐指令时,根据该同案推荐指令,确定需要推荐同案的未判案件,并需要推荐同案的未判案件作为目标案件,然后获取目标案件的判案数据。其中,该同案推荐指令的触发方式包括实时触发和定时触发,实时触发为当监测到终端设备发送的同案推荐请求时,从该案件同案推荐请求中获取需要推荐同案的未判案件的案件编号,触发包含该案件编号的同案推荐指令;定时触发为服务器设置一定时任务,通过该定时任务定时从同案推荐队列中获取一个案件编号,并触发包含该案件编号的案件推理指令。When the triggered co-case recommendation instruction is monitored, according to the co-case recommendation instruction, the unconfirmed cases that need to be recommended are determined, and the unconfirmed cases that need to be recommended as the target cases, and then the judgment data of the target cases are obtained. Among them, the trigger mode of the co-case recommendation instruction includes real-time trigger and timing trigger. The real-time trigger is when the co-case recommendation request sent by the terminal device is monitored, the unconfirmed cases that need to be recommended are obtained from the co-case recommendation request of the case. The case number of, triggers the recommendation instruction of the same case that contains the case number; timing trigger sets a certain time task for the server, and obtains a case number from the recommendation queue of the same case through the timed task, and triggers the case reasoning instruction containing the case number .
具体地,服务器从判案数据中提取出原告诉请信息、被告辩称信息和证据信息;基于预设的争议焦点确定模型根据原告诉请信息、被告辩称信息和证据信息确定预设数量的候选争议焦点和每个候选争议焦点的输出概率值;将输出概率值最大的候选争议焦点作为目标案件的争议焦点。其中,该输出概率值为该争议焦点确定模型输出的争议焦点为候选争议焦点的概率值,服务器存储有用于提取原告诉请信息的第一正则表达式和用于提取被告辩称信息的第二正则表达式,通过第一正则表达式即可从判案数据中提取出原告诉请信息,通过第二正则表达式即可从判案数据中提取出被告辩称信息。Specifically, the server extracts the plaintiff request information, the defendant's defense information, and the evidence information from the judgment data; based on the preset dispute focus determination model, the preset number is determined based on the plaintiff's request information, the defendant's defense information, and the evidence information. The candidate dispute focus and the output probability value of each candidate dispute focus; the candidate dispute focus with the largest output probability value is regarded as the dispute focus of the target case. Wherein, the output probability value is the probability value that the dispute focus output of the dispute focus determination model is the candidate dispute focus, and the server stores the first regular expression for extracting the original claim information and the second for extracting the defendant’s defense information. Regular expressions, the first regular expression can be used to extract the original claim information from the judgment data, and the second regular expression can be used to extract the defendant’s defense information from the judgment data.
其中,证据信息的提取方式具体为:通过证据项提取模型从该判案数据中提取出证据语句以及每个证据语句中的证据项;确定每个证据项的证据类别以及每个证据类别的证据属性,并将每个证据项、每个证据项的证据类别以及每个证据类别的证据属性作为证据信息。Among them, the method of extracting evidence information is specifically: extracting evidence sentences and the evidence items in each evidence sentence from the judgment data through the evidence item extraction model; determining the evidence category of each evidence item and the evidence of each evidence category Attributes, and use each evidence item, the evidence category of each evidence item, and the evidence attributes of each evidence category as evidence information.
在一实施例中,证据类别的确定方式具体为:从预设的证据分类表中获取证据项的证据大类和该证据大类下的每个证据小类,并通过相似度公式计算证据项与该证据大类下的每个证据小类对应的证据关键字之间的相似度,然后将相似度最大的证据小类,确定为该证据项的证据类别。需要说明的是,该证据分类表可基于实际情况进行设置,本申请对此不作具体限定。In one embodiment, the method for determining the evidence category is specifically: obtaining the evidence category of the evidence item and each evidence sub-category under the evidence category from a preset evidence classification table, and calculating the evidence item by a similarity formula The similarity between the evidence keywords corresponding to each evidence sub-category under the evidence category, and then the evidence sub-category with the largest similarity is determined as the evidence category of the evidence item. It should be noted that the evidence classification table can be set based on actual conditions, which is not specifically limited in this application.
在一实施例中,证据属性的确定方式具体为:对于每一类证据,对该判案数据进行遍历,确定每一类证据的上下文信息;查询预存的证据属性与证据关键字的映射关系表,从每一类证据的上下文信息中获取包含有证据关键字的目标证据语句,以及获取该目标证据语句对应的证据属性组;将该目标证据语句和证据属性组进行拼接之后,输入相似度计算模型,计算证据属性组中每个证据属性与目标证据语句之间的相似度,并将该相似度最高的证据属性作为对应证据类别的证据属性,从而得到每个证据类别的证据属性。需要说明的是,上述证据属性与证据关键字的映射关系表可基于实际情况进行设置,本申请对此不作具体限定。In one embodiment, the method for determining the attributes of the evidence is specifically: for each type of evidence, the judgment data is traversed to determine the context information of each type of evidence; and the mapping relationship between the pre-stored evidence attributes and the evidence keywords is searched. , Obtain the target evidence sentence containing the evidence keyword from the context information of each type of evidence, and obtain the evidence attribute group corresponding to the target evidence sentence; after concatenating the target evidence sentence and the evidence attribute group, input the similarity calculation The model calculates the similarity between each evidence attribute in the evidence attribute group and the target evidence sentence, and uses the evidence attribute with the highest similarity as the evidence attribute of the corresponding evidence category to obtain the evidence attribute of each evidence category. It should be noted that the above-mentioned mapping relationship table between the evidence attributes and the evidence keywords can be set based on actual conditions, which is not specifically limited in this application.
其中,上述证据项提取模型是通过AutoNER(Auto Named Entity Recognition,Auto命名实体识别)基于人工标注的样本数据训练得到的,AutoNER模型通过针对裁决文书训练的词向量作为嵌入,由于自动远程标注数据准确率太低,故放弃此模块转而用人工标注的数据进行训练,用人工标注的样本数据的准确性较高,此外,为了防止过拟合,在训练时采用数据增强的方法,即随机替换证据语句中不超过3个的词和/或调换证据语句中词语的顺序。上述相似度计算模型是基于法律语料对BERT模型的预训练模型进行重新训练得到的,并将BERT模型的encoder模块(编码模块)减到3层,且调整句子长度,从而实现时间上的优化,此外,针对训练任务,获取干扰样本数据,并基于干扰样本数据训练模型,且将分类层接到encoder层的位置。AutoNER模型为一个无需人工标注就可以自动标记数据并训练命名实体识别的模型,BERT 模型是第一个深度、双向和无监督的语言表示模型。Among them, the above-mentioned evidence item extraction model is through AutoNER (Auto Named Entity Recognition (Auto Named Entity Recognition) is obtained by training based on manually labeled sample data. The AutoNER model uses the word vector trained for the ruling document as an embedding. Because the accuracy of the automatic remote labeling data is too low, this module is abandoned and used instead. For training on labeled data, the accuracy of manually labeled sample data is high. In addition, in order to prevent over-fitting, the method of data enhancement is used during training, that is, random replacement of no more than 3 words and/or in the evidence sentence Reverse the order of the words in the evidence sentence. The above-mentioned similarity calculation model is obtained by retraining the pre-training model of the BERT model based on legal corpus, reducing the encoder module (encoding module) of the BERT model to 3 layers, and adjusting the sentence length to achieve time optimization. In addition, for the training task, the interference sample data is obtained, and the model is trained based on the interference sample data, and the classification layer is connected to the position of the encoder layer. The AutoNER model is a model that can automatically label data and train named entity recognition without manual labeling. The BERT model is the first deep, bidirectional and unsupervised language representation model.
步骤S102、根据所述判案数据和所述争议焦点,构建所述目标案件的第一知识图谱。Step S102: Construct a first knowledge graph of the target case according to the judgment data and the dispute focus.
具体地,从判案数据中提取出基础案件知识信息和证据知识信息,并将争议焦点、基础案件知识信息和证据知识信息作为目标案件的案件知识信息,然后从预设法条知识库中获取与该案件知识信息匹配的法条知识,并根据该案件知识信息和法条知识,构建目标案件的知识图谱,记为第一知识图谱。Specifically, the basic case knowledge information and evidence knowledge information are extracted from the judgment data, and the focus of dispute, basic case knowledge information and evidence knowledge information are taken as the case knowledge information of the target case, and then obtained from the knowledge base of preset laws The knowledge of the law that matches the knowledge information of the case, and based on the knowledge information of the case and the knowledge of the law, construct the knowledge map of the target case, which is recorded as the first knowledge map.
其中,知识图谱构建方式具体为:将该法条知识中的法条以及案件知识信息中的原告、被告、诉请观点、辩称观点、争议焦点、事实要素和证据作为知识图谱的实体节点,并从法条知识和案件知识信息中获取每个实体节点之间的关系和属性(原告、被告、诉请观点、辩称观点、争议焦点、事实要素、证据和法条的具体值),然后基于实体节点、实体节点之间的关系和实体节点的属性,构建目标案件的知识图谱。Among them, the method of constructing the knowledge graph is specifically as follows: the law in the knowledge of the law and the plaintiff, defendant, petition, argument, dispute focus, fact elements, and evidence in the knowledge information of the case are used as the entity nodes of the knowledge graph, And from the legal knowledge and case knowledge information to obtain the relationship and attributes between each entity node (plaintiff, defendant, claim point of view, argued point of view, dispute focus, factual elements, evidence and specific value of the legal provisions), and then Based on the entity node, the relationship between the entity node and the attribute of the entity node, the knowledge graph of the target case is constructed.
请参阅图2,图2为本申请实施例中知识图谱的一示意图,如图2所示,知识图谱的实体节点为原告、被告、诉请观点、辩称观点、争议焦点、事实要素、证据和法条,且事实要素包括小要素1、小要素2和小要素3,小要素1与法条1对应,小要素2与法条2对应,小要素3与法条3对应。Please refer to Figure 2. Figure 2 is a schematic diagram of the knowledge graph in the embodiment of the application. As shown in Figure 2, the entity nodes of the knowledge graph are the plaintiff, the defendant, the opinion of the appeal, the argued opinion, the focus of the dispute, the factual elements, and the evidence And the law, and the fact elements include small element 1, small element 2, and small element 3. Small element 1 corresponds to law 1, small element 2 corresponds to law 2, and small element 3 corresponds to law 3.
步骤S103、根据所述争议焦点,从预存的已判案件库中确定待推荐的候选案件集。Step S103: According to the dispute focus, a candidate case set to be recommended is determined from the pre-stored sentenced case database.
服务器还根据目标案件的争议焦点,从预存的已判案件库中确定待推荐的候选案件集,即遍历预存的已判案件库中全部已判案件的知识图谱,获取该知识图谱中包括该争议焦点的已判案件,并汇集获取到的每个已判案件,形成已判案件集,且将该已判案件集作为待推荐的候选案件集。The server also determines the candidate case set to be recommended from the pre-stored sentenced case database according to the dispute focus of the target case, that is, it traverses the knowledge map of all the sentenced cases in the stored sentenced case database, and obtains that the knowledge map includes the dispute Focus on the sentenced cases, and collect each of the sentenced cases obtained to form a sentenced case set, and use the sentenced case set as a candidate case set to be recommended.
其中,服务器存储有已判案件的知识图谱,记为第二知识图谱,案件的知识图谱中存储有案件知识信息和法条知识信息,该案件知识信息包括基础案件知识、争议焦点知识、事实要素知识和证据知识,基础案件知识包括但不限于诉讼人物、诉讼公司、诉讼参与人关系、原告、被告、诉请观点和辩称观点,争议焦点知识包括争议焦点,证据知识包括证据项、证据类别和证据属性。证据属性是指证据的特征属性,比如欠条这一项证据有没有借款人签名。每一个原告诉请中都有至少一个事实如“A请求B还钱”的事实是“A借给B钱且B不还”。Among them, the server stores the knowledge map of the judged case, which is recorded as the second knowledge map. The case knowledge map stores case knowledge information and legal knowledge information. The case knowledge information includes basic case knowledge, dispute focus knowledge, and fact elements Knowledge and evidence knowledge. Basic case knowledge includes but is not limited to litigation figures, litigation companies, litigation participant relationships, plaintiffs, defendants, petition and defense opinions, dispute focus knowledge includes dispute focus, and evidence knowledge includes evidence items and types of evidence And evidence attributes. Evidence attribute refers to the characteristic attribute of the evidence, such as whether the IOU has the signature of the borrower. There is at least one fact in every plaintiff request, such as "A requests B to repay the money". The fact is "A lent money to B and B does not repay it."
以下以一个已判案件为例,解释说明已判案件的知识图谱的构建方式。具体地,获取已判案件的裁决文书,并对该裁决文书进行案件知识提取,得到已判案件的结构化的案件知识信息和法条知识信息,然后根据该案件知识信息和法条知识信息,构建已判案件的知识图谱。其中,裁决文书包括人物关系部分、案件由来部分、审理经过部分、事实部分以及判决理由和依据部分。人物关系部分包括当事人的基本情况、委托诉讼代理人的基本情况和当事人的诉讼地位;案件由来部分包括诉请信息和辩称信息,审理经过部分为庭审记录,事实部分包括原告起诉的诉讼请求、事实和理由,被告答辩的事实和理由以及法院认定的事实和据以定案的证据,判决理由和依据部分包括证据与争议焦点之间的关系以及裁判依据与法律条文之间的关系。The following takes a sentenced case as an example to explain how to construct the knowledge graph of a sentenced case. Specifically, obtain the judgment document of the judged case, and extract the case knowledge from the judgment document to obtain the structured case knowledge information and legal knowledge information of the judged case, and then according to the case knowledge information and legal knowledge information, Construct a knowledge graph of convicted cases. Among them, the ruling document includes the relationship between the characters, the origin of the case, the trial process, the facts, and the reason and basis of the judgment. The character relationship part includes the basic information of the parties, the basic information of the entrusted agents ad litem, and the litigation status of the parties; the origin part of the case includes information about the petition and plea, the part of the trial process is the court record, and the fact part includes the plaintiff’s litigation request, Facts and reasons, the facts and reasons of the defendant’s defense, the facts found by the court and the evidence on which the case was determined, the reasons and basis of the judgment include the relationship between the evidence and the focus of the dispute, and the relationship between the basis of the judgment and the legal provisions.
步骤S104、获取所述候选案件集中每个候选案件各自对应的第二知识图谱,并计算所述第一知识图谱与每个所述第二知识图谱之间的相似度。Step S104: Obtain a second knowledge graph corresponding to each candidate case in the candidate case set, and calculate the similarity between the first knowledge graph and each second knowledge graph.
在确定候选案件集之后,服务器获取候选案件集中每个候选案件各自对应的第二知识图谱,并计算第一知识图谱与每个第二知识图谱之间的相似度。需要说明的是,第一知识图谱与每个第二知识图谱之间的相似度即为目标案件与每个候选案件之间的相似度。After determining the candidate case set, the server obtains the second knowledge graph corresponding to each candidate case in the candidate case set, and calculates the similarity between the first knowledge graph and each second knowledge graph. It should be noted that the similarity between the first knowledge graph and each second knowledge graph is the similarity between the target case and each candidate case.
在一实施例中,服务器从第一知识图谱中获取第一案件知识信息,以及从每个第二知识图谱中获取各自对应的第二案件知识信息;根据第一案件知识信息和每个第二案件知识信息,计算第一知识图谱与每个第二知识图谱之间的相似度。其中,第一案件知识信息包括第一知识图谱中每个实体节点的属性信息和关系信息,第二知识信息包括第二知识图谱中每个实体节点的属性信息和关系信息。In an embodiment, the server obtains the first case knowledge information from the first knowledge graph, and obtains the respective corresponding second case knowledge information from each second knowledge graph; according to the first case knowledge information and each second knowledge graph; Case knowledge information, calculate the similarity between the first knowledge graph and each second knowledge graph. The first case knowledge information includes attribute information and relationship information of each entity node in the first knowledge graph, and the second knowledge information includes attribute information and relationship information of each entity node in the second knowledge graph.
在一实施例中,服务器根据第一案件知识信息和每个第二案件知识信息,计算第一知识图谱中的各实体节点与每个第二知识图谱中的对应实体节点之间的目标相似度;根据第一知识图谱中的每个实体节点各自对应的预设系数和每个目标相似度,计算第一知识图谱与每个第二知识图谱之间的相似度。In an embodiment, the server calculates the target similarity between each entity node in the first knowledge graph and the corresponding entity node in each second knowledge graph based on the first case knowledge information and each second case knowledge information ; According to the respective preset coefficients corresponding to each entity node in the first knowledge graph and the similarity of each target, the similarity between the first knowledge graph and each second knowledge graph is calculated.
以一个第二知识图谱为例对目标相似度的计算进行说明,具体地,从第一案件知识信息中获取每个第一实体节点的属性信息和/或关系信息,以及从第二案件知识信息中获取每个第二实体节点的属性信息和/或关系信息;根据每个第一实体节点的属性信息和每个第二实体节点的属性信息,计算每个第一实体节点与对应的第二实体节点之间的第一相似度;和/或根据每个第一实体节点的关系信息和每个第二实体节点的关系信息,计算每个第一实体节点与对应的第二实体节点之间的第二相似度;将每个第一实体节点与对应的第二实体节点之间的第一相似度和/或第二相似度作为第一知识图谱中的各实体节点与每个第二知识图谱中的对应实体节点之间的目标相似度。Take a second knowledge graph as an example to illustrate the calculation of target similarity. Specifically, obtain the attribute information and/or relationship information of each first entity node from the first case knowledge information, and obtain the second case knowledge information To obtain the attribute information and/or relationship information of each second entity node; according to the attribute information of each first entity node and the attribute information of each second entity node, calculate each first entity node and the corresponding second entity node The first degree of similarity between the entity nodes; and/or calculate the relationship between each first entity node and the corresponding second entity node based on the relationship information of each first entity node and the relationship information of each second entity node The second degree of similarity; take the first degree of similarity and/or the second degree of similarity between each first entity node and the corresponding second entity node as each entity node and each second knowledge in the first knowledge graph The target similarity between corresponding entity nodes in the graph.
其中,该属性信息为知识图谱中的实体节点的属性,如争议焦点、事实要素和证据等实体节点的具体参数,该关系信息为知识图谱中各实体节点间的关系信息,如争议焦点与事实要素之间的关系,通过属性信息,可以得到案件之间在语义上的相似度,而通过关系信息可以得到案件之间在逻辑上的相似度。可以提高相似度的可信度和准确度。Among them, the attribute information is the attributes of the entity nodes in the knowledge graph, such as the specific parameters of the entity nodes such as dispute focus, fact elements, and evidence, and the relationship information is the relationship information between the entity nodes in the knowledge graph, such as dispute focus and facts The relationship between the elements, through the attribute information, can get the semantic similarity between the cases, and the logical similarity between the cases can be obtained through the relationship information. The credibility and accuracy of similarity can be improved.
其中,当目标相似度为第一相似度和第二相似度时,第一知识图谱与一个第二知识图谱之间的相似度的计算方式具体为:将每个第一相似度乘以各自对应的第一权重系数之后进行累加,得到第一知识图谱与第二知识图谱之间的第一目标相似度;将每个第二相似度乘以各自对应的第二权重系数之后进行累加,得到第一知识图谱与第二知识图谱之间的第二目标相似度;计算第一目标相似度与第二目标相似度的和,并将第一目标相似度与第二目标相似度的和作为第一知识图谱与第二知识图谱之间的相似度。通过属性信息和关系信息可以综合考虑案件之间在语义和逻辑上的相似度,进一步地提高相似度的可信度和准确度。Wherein, when the target similarity is the first similarity and the second similarity, the calculation method of the similarity between the first knowledge graph and a second knowledge graph is specifically: multiplying each first similarity by its corresponding After the first weight coefficient of, is accumulated to obtain the first target similarity between the first knowledge graph and the second knowledge graph; each second similarity is multiplied by the corresponding second weight coefficient and then accumulated to obtain the first The second target similarity between a knowledge graph and the second knowledge graph; calculate the sum of the first target similarity and the second target similarity, and use the sum of the first target similarity and the second target similarity as the first The similarity between the knowledge graph and the second knowledge graph. Through attribute information and relationship information, the semantic and logical similarity between cases can be comprehensively considered, and the credibility and accuracy of similarity can be further improved.
需要说明的是,上述第一权重系数和第二权重系数可基于实际情况进行设置,每个实体节点的第一权重系数可以相同,也可以不同,且每个实体节点的第二权重系数可以相同,也可以不同,本申请不作具体限定。It should be noted that the above-mentioned first weight coefficient and second weight coefficient can be set based on actual conditions, the first weight coefficient of each physical node can be the same or different, and the second weight coefficient of each physical node can be the same , Can also be different, and this application is not specifically limited.
步骤S105、根据所述第一知识图谱与每个所述第二知识图谱之间的相似度,从所述候选案件集中确定同案同判案件。Step S105, according to the similarity between the first knowledge graph and each of the second knowledge graphs, determine the same case and concurrent judgment cases from the set of candidate cases.
具体地,将第一知识图谱与每个第二知识图谱之间的相似度作为目标案件与该候选案件集中的每个候选案件之间的相似度,并按照相似度的大小顺序,对该候选案件集中的每个候选案件进行排序,将排序最靠前的候选案件作为同案同判案件,也即将相似度最大对应的候选案件作为同案同判案件。Specifically, the similarity between the first knowledge graph and each second knowledge graph is taken as the similarity between the target case and each candidate case in the candidate case set, and in the order of the magnitude of the similarity, the candidate Each candidate case in the case set is sorted, and the candidate case with the highest ranking is regarded as a co-judgment case, and the candidate case corresponding to the highest similarity is regarded as a co-judgment case.
上述实施例提供的案件推荐方法,基于争议焦点确定模型,根据判案数据可以准确的确定目标案件的争议焦点,并根据判案数据和争议焦点,构建目标案件的知识图谱,且根据争议焦点,确定待推荐的候选案件集,然后计算目标案件的知识图谱与每个候选案件对应的知识图谱之间的相似度,并根据目标案件的知识图谱与每个候选案件对应的知识图谱之间的相似度,可以准确的确定同案同判案件。The case recommendation method provided in the above embodiments is based on the dispute focus determination model, and the dispute focus of the target case can be accurately determined according to the judgment data, and the knowledge map of the target case is constructed based on the judgment data and the dispute focus, and according to the dispute focus, Determine the set of candidate cases to be recommended, and then calculate the similarity between the knowledge graph of the target case and the knowledge graph corresponding to each candidate case, and according to the similarity between the knowledge graph of the target case and the knowledge graph corresponding to each candidate case Degree, can accurately determine the same case and the same sentence.
请参照图3,图3为本申请实施例提供的另一种案件推荐方法的流程示意图。如图3所示,该案件推荐方法包括步骤S201至206。Please refer to FIG. 3, which is a schematic flowchart of another case recommendation method provided by an embodiment of the application. As shown in FIG. 3, the case recommendation method includes steps S201 to 206.
步骤S201、获取目标案件的判案数据,并基于预设的争议焦点确定模型,根据所述判案数据确定所述目标案件的争议焦点。Step S201: Obtain the judgment data of the target case, and determine the dispute focus of the target case based on the judgment data based on a preset dispute focus determination model.
当监测到触发的同案推荐指令时,根据该同案推荐指令,确定需要推荐同案的未判案件,并需要推荐同案的未判案件作为目标案件,然后获取目标案件的判案数据。When the triggered co-case recommendation instruction is monitored, according to the co-case recommendation instruction, the unconfirmed cases that need to be recommended are determined, and the unconfirmed cases that need to be recommended as the target cases, and then the judgment data of the target cases are obtained.
具体地,服务器从判案数据中提取出原告诉请信息、被告辩称信息和证据信息;基于预设的争议焦点确定模型根据原告诉请信息、被告辩称信息和证据信息确定预设数量的候选争议焦点和每个候选争议焦点的输出概率值;将输出概率值最大的候选争议焦点作为目标案件的争议焦点。Specifically, the server extracts the plaintiff request information, the defendant's defense information, and the evidence information from the judgment data; based on the preset dispute focus determination model, the preset number is determined based on the plaintiff's request information, the defendant's defense information, and the evidence information. The candidate dispute focus and the output probability value of each candidate dispute focus; the candidate dispute focus with the largest output probability value is regarded as the dispute focus of the target case.
步骤S202、根据所述判案数据和所述争议焦点,构建所述目标案件的第一知识图谱。Step S202: Construct a first knowledge graph of the target case according to the judgment data and the dispute focus.
具体地,从判案数据中提取出基础案件知识信息和证据知识信息,并将争议焦点、基础案件知识信息和证据知识信息作为目标案件的案件知识信息,然后从预设法条知识库中获取与该案件知识信息匹配的法条知识,并根据该案件知识信息和法条知识,构建目标案件的知识图谱,记为第一知识图谱。Specifically, the basic case knowledge information and evidence knowledge information are extracted from the judgment data, and the focus of dispute, basic case knowledge information and evidence knowledge information are taken as the case knowledge information of the target case, and then obtained from the knowledge base of preset laws The knowledge of the law that matches the knowledge information of the case, and based on the knowledge information of the case and the knowledge of the law, construct the knowledge map of the target case, which is recorded as the first knowledge map.
步骤S203、计算所述争议焦点与预存的已判案件库中每个已判案件的争议焦点之间的相似度。Step S203: Calculate the similarity between the dispute focus and the dispute focus of each judged case in the prestored judged case database.
服务器计算目标案件的争议焦点与预存的已判案件库中每个已判案件的争议焦点之间的相似度,即对目标案件的争议焦点中的各字进行编码,得到第一向量,并对每个已判案件的争议焦点中的各字进行编码,得到对应的第二向量,然后计算第一向量与每个第二向量之间的余弦相似度,并将第一向量与每个第二向量之间的余弦相似度作为目标案件的争议焦点与每个已判案件的争议焦点之间的相似度。The server calculates the similarity between the dispute focus of the target case and the dispute focus of each sentenced case in the pre-stored sentenced case database, that is, encodes each word in the dispute focus of the target case to obtain the first vector, and then Each word in the dispute focus of each judged case is coded to obtain the corresponding second vector, and then the cosine similarity between the first vector and each second vector is calculated, and the first vector is compared with each second vector. The cosine similarity between the vectors is taken as the similarity between the dispute focus of the target case and the dispute focus of each judged case.
在一实施例中,服务器统计已判案件的案件总数量,并根据案件总数量,确定并发线程数,即获取预存的案件数量与并发线程数之间的映射关系表,并查询该映射关系表,获取案件总数量对应的并发线程数;根据并发线程数,调用预设的线程池中对应数量的空闲线程并发的计算争议焦点与每个已判案件的争议焦点之间的相似度。需要说明的是,案件数量与并发线程数之间的映射关系表和线程池中的线程个数可基于实际情况进行设置,本申请对此不作具体限定。通过多个线程并发的计算争议焦点与每个已判案件的争议焦点之间的相似度,可以提高相似度的计算速度。In one embodiment, the server counts the total number of cases of judged cases, and determines the number of concurrent threads according to the total number of cases, that is, obtains the mapping relationship table between the number of pre-stored cases and the number of concurrent threads, and queries the mapping relationship table , Get the number of concurrent threads corresponding to the total number of cases; According to the number of concurrent threads, call the corresponding number of idle threads in the preset thread pool to concurrently calculate the similarity between the dispute focus and the dispute focus of each judged case. It should be noted that the mapping relationship table between the number of cases and the number of concurrent threads and the number of threads in the thread pool can be set based on actual conditions, and this application does not specifically limit this. By concurrently calculating the similarity between the dispute focus and the dispute focus of each judged case through multiple threads, the calculation speed of the similarity can be improved.
步骤S204、根据所述争议焦点与每个已判案件的争议焦点之间的相似度,从预存的已判案件库中确定待推荐的候选案件集。Step S204: According to the similarity between the dispute focus and the dispute focus of each judged case, determine the candidate case set to be recommended from the pre-stored judged case database.
在计算得到目标案件的争议焦点与每个已判案件的争议焦点之间的相似度之后,将相似度大于预设的相似度阈值的每个已判案件写入预设的候选案件空白集中,以形成待推荐的候选案件集。需要说明的是,上述相似度阈值可基于实际情况进行设置,本申请对此不作具体限定。After calculating the similarity between the dispute focus of the target case and the dispute focus of each judged case, each judged case whose similarity is greater than the preset similarity threshold is written into the preset candidate case blank set, To form a set of candidate cases to be recommended. It should be noted that the aforementioned similarity threshold may be set based on actual conditions, which is not specifically limited in this application.
步骤S205、获取所述候选案件集中每个候选案件各自对应的第二知识图谱,并计算所述第一知识图谱与每个所述第二知识图谱之间的相似度。Step S205: Obtain a second knowledge graph corresponding to each candidate case in the candidate case set, and calculate the similarity between the first knowledge graph and each second knowledge graph.
在确定候选案件集之后,服务器获取候选案件集中每个候选案件各自对应的第二知识图谱,并计算第一知识图谱与每个第二知识图谱之间的相似度。需要说明的是,第一知识图谱与每个第二知识图谱之间的相似度即为目标案件与每个候选案件之间的相似度。After determining the candidate case set, the server obtains the second knowledge graph corresponding to each candidate case in the candidate case set, and calculates the similarity between the first knowledge graph and each second knowledge graph. It should be noted that the similarity between the first knowledge graph and each second knowledge graph is the similarity between the target case and each candidate case.
步骤S206、根据所述第一知识图谱与每个所述第二知识图谱之间的相似度,从所述候选案件集中确定同案同判案件。Step S206: According to the similarity between the first knowledge graph and each of the second knowledge graphs, determine the same case and the same judgment case from the set of candidate cases.
具体地,将第一知识图谱与每个第二知识图谱之间的相似度作为目标案件与该候选案件集中的每个候选案件之间的相似度,并按照相似度的大小顺序,对该候选案件集中的每个候选案件进行排序,将排序最靠前的候选案件作为同案同判案件,也即将相似度最大对应的候选案件作为同案同判案件。Specifically, the similarity between the first knowledge graph and each second knowledge graph is taken as the similarity between the target case and each candidate case in the candidate case set, and in the order of the magnitude of the similarity, the candidate Each candidate case in the case set is sorted, and the candidate case with the highest ranking is regarded as a co-judgment case, and the candidate case corresponding to the highest similarity is regarded as a co-judgment case.
上述实施例提供的案件推荐方法,通过目标案件的争议焦点与每个已判案件的争议焦点之间的相似度,可以较为准确的确定待推荐的候选案件集,同时基于目标案件的知识图谱与每个候选案件的知识图谱之间的相似度,可以进一步准确的确定与目标案件相似的案件,保证同案同判,提高判案效率。The case recommendation method provided by the above embodiment can determine the candidate case set to be recommended more accurately based on the similarity between the dispute focus of the target case and the dispute focus of each sentenced case, and is based on the knowledge map of the target case and The similarity between the knowledge graphs of each candidate case can further accurately determine cases similar to the target case, ensure the same case and improve the efficiency of judgment.
请参照图4,图4为本申请实施例提供的一种案件推荐装置的示意性框图。Please refer to FIG. 4, which is a schematic block diagram of a case recommendation apparatus provided by an embodiment of the application.
如图4所示,该案件推荐装置300,包括:争议焦点确定模块301、图谱构建模块302、候选案件确定模块303、计算模块304和案件确定模块305。As shown in FIG. 4, the case recommendation device 300 includes: a dispute focus determination module 301, a graph construction module 302, a candidate case determination module 303, a calculation module 304, and a case determination module 305.
争议焦点确定模块301,用于获取目标案件的判案数据,并基于预设的争议焦点确定模型,根据所述判案数据确定所述目标案件的争议焦点;The dispute focus determination module 301 is used to obtain the judgment data of the target case, and based on a preset dispute focus determination model, determine the dispute focus of the target case according to the judgment data;
图谱构建模块302,用于根据所述判案数据和所述争议焦点,构建所述目标案件的第一知识图谱;The graph construction module 302 is configured to construct a first knowledge graph of the target case according to the judgment data and the dispute focus;
候选案件确定模块303,用于根据所述争议焦点,从预存的已判案件库中确定待推荐的候选案件集;The candidate case determination module 303 is configured to determine the candidate case set to be recommended from the pre-stored judged case database according to the focus of the dispute;
计算模块304,用于获取所述候选案件集中每个候选案件各自对应的第二知识图谱,并计算所述第一知识图谱与每个所述第二知识图谱之间的相似度;The calculation module 304 is configured to obtain a second knowledge graph corresponding to each candidate case in the candidate case set, and calculate the similarity between the first knowledge graph and each second knowledge graph;
案件确定模块305,用于根据所述第一知识图谱与每个所述第二知识图谱之间的相似度,从所述候选案件集中确定同案同判案件。The case determination module 305 is configured to determine the same-sentence cases from the set of candidate cases according to the similarity between the first knowledge graph and each of the second knowledge graphs.
在一个实施例中,所述争议焦点确定模块301,还用于从所述判案数据中提取出原告诉请信息、被告辩称信息和证据信息;基于预设的争议焦点确定模型根据所述原告诉请信息、被告辩称信息和证据信息确定预设数量的候选争议焦点和每个所述候选争议焦点的输出概率值;将所述输出概率值最大的所述候选争议焦点作为所述目标案件的争议焦点。In one embodiment, the dispute focus determination module 301 is also used to extract the plaintiff information, the defendant's defense information, and the evidence information from the judgment data; based on the preset dispute focus determination model according to the The plaintiff’s information, the defendant’s defense information, and the evidence information determine a preset number of candidate dispute focus and the output probability value of each candidate dispute focus; the candidate dispute focus with the largest output probability value is taken as the target The focus of the case.
在一个实施例中,所述计算模块304,还用于从所述第一知识图谱中获取第一案件知识信息,以及从每个所述第二知识图谱中获取各自对应的第二案件知识信息;根据所述第一案件知识信息和每个所述第二案件知识信息,计算所述第一知识图谱与每个所述第二知识图谱之间的相似度。In an embodiment, the calculation module 304 is further configured to obtain first case knowledge information from the first knowledge graph, and obtain respective corresponding second case knowledge information from each of the second knowledge graphs ; According to the first case knowledge information and each of the second case knowledge information, the similarity between the first knowledge graph and each of the second knowledge graphs is calculated.
在一个实施例中,所述计算模块304,还用于根据所述第一案件知识信息和每个所述第二案件知识信息,计算所述第一知识图谱中的各实体节点与每个所述第二知识图谱中的对应实体节点之间的目标相似度;In an embodiment, the calculation module 304 is further configured to calculate each entity node and each entity in the first knowledge graph according to the first case knowledge information and each of the second case knowledge information. State the target similarity between corresponding entity nodes in the second knowledge graph;
根据所述第一知识图谱中的每个实体节点各自对应的预设系数和每个所述目标相似度,计算所述第一知识图谱与每个所述第二知识图谱之间的相似度。According to the preset coefficient corresponding to each entity node in the first knowledge graph and the similarity of each target, the similarity between the first knowledge graph and each of the second knowledge graphs is calculated.
请参照图5,图5为本申请实施例提供的另一种案件推荐装置的示意性框图。如图5所示,该案件推荐装置400,包括:争议焦点确定模块401、图谱构建模块402、第一计算模块403、候选案件确定模块404、第二计算模块405和案件确定模块406。Please refer to FIG. 5, which is a schematic block diagram of another case recommendation apparatus provided by an embodiment of the application. As shown in FIG. 5, the case recommendation device 400 includes: a dispute focus determination module 401, a graph construction module 402, a first calculation module 403, a candidate case determination module 404, a second calculation module 405, and a case determination module 406.
争议焦点确定模块401,用于获取目标案件的判案数据,并基于预设的争议焦点确定模型,根据所述判案数据确定所述目标案件的争议焦点;The dispute focus determination module 401 is configured to obtain the judgment data of the target case, and based on a preset dispute focus determination model, determine the dispute focus of the target case according to the judgment data;
图谱构建模块402,用于根据所述判案数据和所述争议焦点,构建所述目标案件的第一知识图谱;The graph construction module 402 is configured to construct a first knowledge graph of the target case according to the judgment data and the dispute focus;
第一计算模块403,用于计算所述争议焦点与预存的已判案件库中每个已判案件的争议焦点之间的相似度;The first calculation module 403 is used to calculate the similarity between the dispute focus and the dispute focus of each judged case in the pre-stored judged case database;
候选案件确定模块404,用于根据所述争议焦点与每个已判案件的争议焦点之间的相似度,从预存的已判案件库中确定待推荐的候选案件集;The candidate case determination module 404 is configured to determine the candidate case set to be recommended from the pre-stored judged case database according to the similarity between the dispute focus and the dispute focus of each judged case;
第二计算模块405,还用于获取所述候选案件集中每个候选案件各自对应的第二知识图谱,并计算所述第一知识图谱与每个所述第二知识图谱之间的相似度;The second calculation module 405 is further configured to obtain a second knowledge graph corresponding to each candidate case in the candidate case set, and calculate the similarity between the first knowledge graph and each second knowledge graph;
案件确定模块406,用于根据所述第一知识图谱与每个所述第二知识图谱之间的相似度,从所述候选案件集中确定同案同判案件。The case determination module 406 is configured to determine the same case and concurrent judgment cases from the set of candidate cases according to the similarity between the first knowledge graph and each of the second knowledge graphs.
在一实施例中,所述第一计算模块403,还用于统计已判案件的案件总数量,并根据所述案件总数量,确定并发线程数;根据所述并发线程数,调用预设的线程池中对应数量的空闲线程并发的计算所述争议焦点与预存的已判案件库中每个已判案件的争议焦点之间的相似度。In one embodiment, the first calculation module 403 is also used to count the total number of cases that have been judged, and to determine the number of concurrent threads based on the total number of cases; according to the number of concurrent threads, call a preset The corresponding number of idle threads in the thread pool concurrently calculate the similarity between the dispute focus and the dispute focus of each judged case in the pre-stored judged case library.
在一实施例中,所述第一计算模块403,还用于获取预存的案件数量与并发线程数之间的映射关系表,并查询所述映射关系表,获取所述案件总数量对应的并发线程数。In an embodiment, the first calculation module 403 is further configured to obtain a mapping relationship table between the number of pre-stored cases and the number of concurrent threads, and query the mapping relationship table to obtain the concurrency corresponding to the total number of cases. Threads.
需要说明的是,所属领域的技术人员可以清楚地了解到,为了描述的方便和简洁,上述描述的装置和各模块及单元的具体工作过程,可以参考前述案件推荐方法实施例中的对应过程,在此不再赘述。It should be noted that those skilled in the art can clearly understand that for the convenience and brevity of description, the specific working process of the above described device and each module and unit can refer to the corresponding process in the foregoing case recommendation method embodiment. I won't repeat them here.
上述实施例提供的装置可以实现为一种计算机程序的形式,该计算机程序可以在如图6所示的计算机设备上运行。请参阅图6,图6为本申请实施例提供的一种计算机设备的结构示意性框图。该计算机设备可以为服务器。The apparatus provided in the foregoing embodiment may be implemented in the form of a computer program, and the computer program may run on the computer device as shown in FIG. 6. Please refer to FIG. 6, which is a schematic block diagram of the structure of a computer device provided by an embodiment of the application. The computer device may be a server.
如图6所示,该计算机设备包括通过系统总线连接的处理器、存储器和网络接口,其中,存储器可以包括非易失性存储介质和内存储器。非易失性存储介质可存储操作系统和计算机程序。该计算机程序包括程序指令,该程序指令被执行时,可使得处理器执行任意一种案件推荐方法。处理器用于提供计算和控制能力,支撑整个计算机设备的运行。内存储器为非易失性存储介质中的计算机程序的运行提供环境,该计算机程序被处理器执行时,可使得处理器执行上述的任一实施例所示出的案件推荐方法,其中,所述案件推荐方法包括:As shown in FIG. 6, the computer device includes a processor, a memory, and a network interface connected through a system bus, where the memory may include a non-volatile storage medium and an internal memory. The non-volatile storage medium can store an operating system and a computer program. The computer program includes program instructions, and when the program instructions are executed, the processor can execute any case recommendation method. The processor is used to provide computing and control capabilities and support the operation of the entire computer equipment. The internal memory provides an environment for the operation of the computer program in the non-volatile storage medium. When the computer program is executed by the processor, the processor can make the processor execute the case recommendation method shown in any of the above embodiments, wherein the Recommended methods for the case include:
获取目标案件的判案数据,并基于预设的争议焦点确定模型,根据所述判案数据确定所述目标案件的争议焦点;Obtain the judgment data of the target case, and determine the dispute focus of the target case based on the judgment data based on a preset dispute focus determination model;
根据所述判案数据和所述争议焦点,构建所述目标案件的第一知识图谱;根据所述争议焦点,从预存的已判案件库中确定待推荐的候选案件集;According to the judgment data and the dispute focus, construct the first knowledge graph of the target case; according to the dispute focus, determine the candidate case set to be recommended from the pre-stored judged case database;
获取所述候选案件集中每个候选案件各自对应的第二知识图谱,并计算所述第一知识图谱与每个所述第二知识图谱之间的相似度;Acquiring a second knowledge graph corresponding to each candidate case in the candidate case set, and calculating the similarity between the first knowledge graph and each of the second knowledge graphs;
根据所述第一知识图谱与每个所述第二知识图谱之间的相似度,从所述候选案件集中确定同案同判案件。According to the degree of similarity between the first knowledge graph and each of the second knowledge graphs, co-judge cases are determined from the set of candidate cases.
该网络接口用于进行网络通信,如发送分配的任务等。本领域技术人员可以理解,图6中示出的结构,仅仅是与本申请方案相关的部分结构的框图,并不构成对本申请方案所应用于其上的计算机设备的限定,具体的计算机设备可以包括比图中所示更多或更少的部件,或者组合某些部件,或者具有不同的部件布置。The network interface is used for network communication, such as sending assigned tasks. Those skilled in the art can understand that the structure shown in FIG. 6 is only a block diagram of part of the structure related to the solution of the present application, and does not constitute a limitation on the computer device to which the solution of the present application is applied. The specific computer device may Including more or fewer parts than shown in the figure, or combining some parts, or having a different arrangement of parts.
需要说明的是,所属领域的技术人员可以清楚地了解到,为了描述的方便和简洁,上述描述的计算机设备的具体工作过程,可以参考前述案件推荐方法实施例中的对应过程,在此不再赘述。It should be noted that those skilled in the art can clearly understand that for the convenience and brevity of the description, the specific working process of the computer device described above can refer to the corresponding process in the above-mentioned case recommendation method embodiment. Go into details.
本申请实施例还提供一种计算机可读存储介质,所述计算机可读存储介质上存储有计算机程序,所述计算机可读存储介质可以是非易失性,也可以是易失性,所述计算机程序中包括程序指令,所述程序指令被执行时所实现的方法可参照本申请案件推荐方法的各个实施例,所述案件推荐方法包括以下步骤:The embodiment of the present application also provides a computer-readable storage medium, the computer-readable storage medium stores a computer program, the computer-readable storage medium may be non-volatile or volatile, and the computer The program includes program instructions, and the method implemented when the program instructions are executed can refer to the various embodiments of the case recommendation method of this application. The case recommendation method includes the following steps:
获取目标案件的判案数据,并基于预设的争议焦点确定模型,根据所述判案数据确定所述目标案件的争议焦点;Obtain the judgment data of the target case, and determine the dispute focus of the target case based on the judgment data based on a preset dispute focus determination model;
根据所述判案数据和所述争议焦点,构建所述目标案件的第一知识图谱;根据所述争议焦点,从预存的已判案件库中确定待推荐的候选案件集;According to the judgment data and the dispute focus, construct the first knowledge graph of the target case; according to the dispute focus, determine the candidate case set to be recommended from the pre-stored judged case database;
获取所述候选案件集中每个候选案件各自对应的第二知识图谱,并计算所述第一知识图谱与每个所述第二知识图谱之间的相似度;Acquiring a second knowledge graph corresponding to each candidate case in the candidate case set, and calculating the similarity between the first knowledge graph and each of the second knowledge graphs;
根据所述第一知识图谱与每个所述第二知识图谱之间的相似度,从所述候选案件集中确定同案同判案件。According to the degree of similarity between the first knowledge graph and each of the second knowledge graphs, co-judge cases are determined from the set of candidate cases.
其中,所述计算机可读存储介质可以是前述实施例所述的计算机设备的内部存储单元,例如所述计算机设备的硬盘或内存。所述计算机可读存储介质也可以是所述计算机设备的外部存储设备,例如所述计算机设备上配备的插接式硬盘,智能存储卡(Smart Media Card,SMC),安全数字(Secure Digital,SD)卡,闪存卡(Flash Card)等。The computer-readable storage medium may be the internal storage unit of the computer device described in the foregoing embodiment, for example, the hard disk or memory of the computer device. The computer-readable storage medium may also be an external storage device of the computer device, such as a plug-in hard disk or a smart memory card (Smart Memory Card) equipped on the computer device. Media Card, SMC), Secure Digital (Secure Digital, SD) card, flash memory card (Flash Card) and so on.

Claims (20)

  1. 一种案件推荐方法,其中,包括:A case recommendation method, which includes:
    获取目标案件的判案数据,并基于预设的争议焦点确定模型,根据所述判案数据确定所述目标案件的争议焦点;Obtain the judgment data of the target case, and determine the dispute focus of the target case based on the judgment data based on a preset dispute focus determination model;
    根据所述判案数据和所述争议焦点,构建所述目标案件的第一知识图谱;Construct a first knowledge graph of the target case according to the judgment data and the dispute focus;
    根据所述争议焦点,从预存的已判案件库中确定待推荐的候选案件集;According to the focus of the dispute, determine the candidate case set to be recommended from the pre-stored judged case database;
    获取所述候选案件集中每个候选案件各自对应的第二知识图谱,并计算所述第一知识图谱与每个所述第二知识图谱之间的相似度;Acquiring a second knowledge graph corresponding to each candidate case in the candidate case set, and calculating the similarity between the first knowledge graph and each of the second knowledge graphs;
    根据所述第一知识图谱与每个所述第二知识图谱之间的相似度,从所述候选案件集中确定同案同判案件。According to the degree of similarity between the first knowledge graph and each of the second knowledge graphs, co-judge cases are determined from the set of candidate cases.
  2. 根据权利要求1所述的案件推荐方法,其中,所述基于预设的争议焦点确定模型,根据所述判案数据确定所述目标案件的争议焦点,包括:The case recommendation method according to claim 1, wherein the determining the focus of dispute of the target case based on the preset dispute focus determination model and determining the focus of dispute of the target case according to the judgment data comprises:
    从所述判案数据中提取出原告诉请信息、被告辩称信息和证据信息;Extract the plaintiff information, the defendant's defense information and the evidence information from the judgment data;
    基于预设的争议焦点确定模型根据所述原告诉请信息、被告辩称信息和证据信息确定预设数量的候选争议焦点和每个所述候选争议焦点的输出概率值;Based on the preset dispute focus determination model, determine a preset number of candidate dispute focus and the output probability value of each candidate dispute focus according to the plaintiff request information, the defendant's defense information, and the evidence information;
    将所述输出概率值最大的所述候选争议焦点作为所述目标案件的争议焦点。The candidate dispute focus with the largest output probability value is taken as the dispute focus of the target case.
  3. 根据权利要求1所述的案件推荐方法,其中,所述计算所述第一知识图谱与每个所述第二知识图谱之间的相似度,包括:The case recommendation method according to claim 1, wherein the calculating the similarity between the first knowledge graph and each of the second knowledge graphs comprises:
    从所述第一知识图谱中获取第一案件知识信息,以及从每个所述第二知识图谱中获取各自对应的第二案件知识信息;Obtaining first case knowledge information from the first knowledge graph, and obtaining respective corresponding second case knowledge information from each of the second knowledge graphs;
    根据所述第一案件知识信息和每个所述第二案件知识信息,计算所述第一知识图谱与每个所述第二知识图谱之间的相似度。According to the first case knowledge information and each of the second case knowledge information, the similarity between the first knowledge graph and each of the second knowledge graphs is calculated.
  4. 根据权利要求3所述的案件推荐方法,其中,所述根据所述第一案件知识信息和每个所述第二案件知识信息,计算所述第一知识图谱与每个所述第二知识图谱之间的相似度,包括:The case recommendation method according to claim 3, wherein the first knowledge graph and each second knowledge graph are calculated based on the first case knowledge information and each of the second case knowledge information The similarity between, including:
    根据所述第一案件知识信息和每个所述第二案件知识信息,计算所述第一知识图谱中的各实体节点与每个所述第二知识图谱中的对应实体节点之间的目标相似度;According to the first case knowledge information and each of the second case knowledge information, the target similarity between each entity node in the first knowledge graph and the corresponding entity node in each second knowledge graph is calculated degree;
    根据所述第一知识图谱中的每个实体节点各自对应的预设系数和每个所述目标相似度,计算所述第一知识图谱与每个所述第二知识图谱之间的相似度。According to the preset coefficient corresponding to each entity node in the first knowledge graph and the similarity of each target, the similarity between the first knowledge graph and each of the second knowledge graphs is calculated.
  5. 根据权利要求1至4中任一项所述的案件推荐方法,其中,所述根据所述争议焦点,从预存的已判案件库中确定待推荐的候选案件集,包括:The case recommendation method according to any one of claims 1 to 4, wherein the determining a candidate case set to be recommended from a pre-stored judged case database according to the focus of the dispute comprises:
    计算所述争议焦点与预存的已判案件库中每个已判案件的争议焦点之间的相似度;Calculate the similarity between the dispute focus and the dispute focus of each judged case in the pre-stored judged case database;
    根据所述争议焦点与每个已判案件的争议焦点之间的相似度,从预存的已判案件库中确定待推荐的候选案件集。According to the similarity between the dispute focus and the dispute focus of each sentenced case, the candidate case set to be recommended is determined from the pre-stored sentenced case database.
  6. 根据权利要求5所述的案件推荐方法,其中,所述计算所述争议焦点与预存的已判案件库中每个已判案件的争议焦点之间的相似度,包括:The case recommendation method according to claim 5, wherein said calculating the similarity between the focus of dispute and the focus of dispute of each judged case in the pre-stored judged case database comprises:
    统计已判案件的案件总数量,并根据所述案件总数量,确定并发线程数;Count the total number of cases that have been judged, and determine the number of concurrent threads based on the total number of cases;
    根据所述并发线程数,调用预设的线程池中对应数量的空闲线程并发的计算所述争议焦点与预存的已判案件库中每个已判案件的争议焦点之间的相似度。According to the number of concurrent threads, a corresponding number of idle threads in the preset thread pool are called to concurrently calculate the similarity between the dispute focus and the dispute focus of each judged case in the stored judged case library.
  7. 根据权利要求6所述的案件推荐方法,其中,所述根据所述总数量,确定并发线程数,包括:The case recommendation method according to claim 6, wherein the determining the number of concurrent threads according to the total number comprises:
    获取预存的案件数量与并发线程数之间的映射关系表,并查询所述映射关系表,获取所述案件总数量对应的并发线程数。Obtain a mapping relationship table between the number of pre-stored cases and the number of concurrent threads, and query the mapping relationship table to obtain the number of concurrent threads corresponding to the total number of cases.
  8. 一种案件推荐装置,其中,所述案件推荐装置包括:A case recommendation device, wherein the case recommendation device includes:
    争议焦点确定模块,用于获取目标案件的判案数据,并基于预设的争议焦点确定模型,根据所述判案数据确定所述目标案件的争议焦点;The dispute focus determination module is used to obtain the judgment data of the target case, and based on the preset dispute focus determination model, determine the dispute focus of the target case according to the judgment data;
    图谱构建模块,用于根据所述判案数据和所述争议焦点,构建所述目标案件的第一知识图谱;The graph construction module is used to construct the first knowledge graph of the target case according to the judgment data and the dispute focus;
    候选案件确定模块,用于根据所述争议焦点,从预存的已判案件库中确定待推荐的候选案件集;The candidate case determination module is used to determine the candidate case set to be recommended from the pre-stored judged case database according to the focus of the dispute;
    计算模块,用于获取所述候选案件集中每个候选案件各自对应的第二知识图谱,并计算所述第一知识图谱与每个所述第二知识图谱之间的相似度;A calculation module, configured to obtain a second knowledge graph corresponding to each candidate case in the candidate case set, and calculate the similarity between the first knowledge graph and each of the second knowledge graphs;
    案件确定模块,用于根据所述第一知识图谱与每个所述第二知识图谱之间的相似度,从所述候选案件集中确定同案同判案件。The case determination module is configured to determine the same case and concurrent judgment cases from the set of candidate cases according to the similarity between the first knowledge graph and each of the second knowledge graphs.
  9. 根据权利要求8所述的案件推荐装置,其中,所述争议焦点确定模块,The case recommendation device according to claim 8, wherein the dispute focus determining module,
    还用于从所述判案数据中提取出原告诉请信息、被告辩称信息和证据信息;It is also used to extract the plaintiff information, the defendant's defense information, and the evidence information from the judgment data;
    基于预设的争议焦点确定模型根据所述原告诉请信息、被告辩称信息和证据信息确定预设数量的候选争议焦点和每个所述候选争议焦点的输出概率值;Based on the preset dispute focus determination model, determine a preset number of candidate dispute focus and the output probability value of each candidate dispute focus according to the plaintiff request information, the defendant's defense information, and the evidence information;
    将所述输出概率值最大的所述候选争议焦点作为所述目标案件的争议焦点。The candidate dispute focus with the largest output probability value is taken as the dispute focus of the target case.
  10. 根据权利要求8所述的案件推荐装置,其中,所述计算模块,The case recommendation device according to claim 8, wherein the calculation module,
    还用于从所述第一知识图谱中获取第一案件知识信息,以及从每个所述第二知识图谱中获取各自对应的第二案件知识信息;It is also used to obtain first case knowledge information from the first knowledge graph, and obtain respective corresponding second case knowledge information from each of the second knowledge graphs;
    根据所述第一案件知识信息和每个所述第二案件知识信息,计算所述第一知识图谱与每个所述第二知识图谱之间的相似度。According to the first case knowledge information and each of the second case knowledge information, the similarity between the first knowledge graph and each of the second knowledge graphs is calculated.
  11. 一种计算机设备,其中,所述计算机设备包括处理器、存储器、以及存储在所述存储器上并可被所述处理器执行的计算机程序,其中所述计算机程序被所述处理器执行时,实现一种案件推荐方法:A computer device, wherein the computer device includes a processor, a memory, and a computer program stored on the memory and executable by the processor, wherein when the computer program is executed by the processor, the A recommended method for a case:
    其中,所述案件推荐方法包括:Among them, the recommended method for the case includes:
    获取目标案件的判案数据,并基于预设的争议焦点确定模型,根据所述判案数据确定所述目标案件的争议焦点;Obtain the judgment data of the target case, and determine the dispute focus of the target case based on the judgment data based on a preset dispute focus determination model;
    根据所述判案数据和所述争议焦点,构建所述目标案件的第一知识图谱;Construct a first knowledge graph of the target case according to the judgment data and the dispute focus;
    根据所述争议焦点,从预存的已判案件库中确定待推荐的候选案件集;According to the focus of the dispute, determine the candidate case set to be recommended from the pre-stored judged case database;
    获取所述候选案件集中每个候选案件各自对应的第二知识图谱,并计算所述第一知识图谱与每个所述第二知识图谱之间的相似度;Acquiring a second knowledge graph corresponding to each candidate case in the candidate case set, and calculating the similarity between the first knowledge graph and each of the second knowledge graphs;
    根据所述第一知识图谱与每个所述第二知识图谱之间的相似度,从所述候选案件集中确定同案同判案件。According to the degree of similarity between the first knowledge graph and each of the second knowledge graphs, co-judge cases are determined from the set of candidate cases.
  12. 根据权利要求11所述的计算机设备,其中,所述基于预设的争议焦点确定模型,根据所述判案数据确定所述目标案件的争议焦点,包括:11. The computer device according to claim 11, wherein the determining the focus of dispute of the target case based on the preset dispute focus determination model according to the judgment data comprises:
    从所述判案数据中提取出原告诉请信息、被告辩称信息和证据信息;Extract the plaintiff information, the defendant's defense information and the evidence information from the judgment data;
    基于预设的争议焦点确定模型根据所述原告诉请信息、被告辩称信息和证据信息确定预设数量的候选争议焦点和每个所述候选争议焦点的输出概率值;Based on the preset dispute focus determination model, determine a preset number of candidate dispute focuses and the output probability value of each candidate dispute focus according to the plaintiff information, the defendant's defense information, and the evidence information;
    将所述输出概率值最大的所述候选争议焦点作为所述目标案件的争议焦点。The candidate dispute focus with the largest output probability value is taken as the dispute focus of the target case.
  13. 根据权利要求11所述的计算机设备,其中,所述计算所述第一知识图谱与每个所述第二知识图谱之间的相似度,包括:The computer device according to claim 11, wherein the calculating the similarity between the first knowledge graph and each of the second knowledge graphs comprises:
    从所述第一知识图谱中获取第一案件知识信息,以及从每个所述第二知识图谱中获取各自对应的第二案件知识信息;Obtaining first case knowledge information from the first knowledge graph, and obtaining respective corresponding second case knowledge information from each of the second knowledge graphs;
    根据所述第一案件知识信息和每个所述第二案件知识信息,计算所述第一知识图谱与每个所述第二知识图谱之间的相似度。According to the first case knowledge information and each of the second case knowledge information, the similarity between the first knowledge graph and each of the second knowledge graphs is calculated.
  14. 根据权利要求13所述的计算机设备,其中,所述根据所述第一案件知识信息和每个所述第二案件知识信息,计算所述第一知识图谱与每个所述第二知识图谱之间的相似度,包括:The computer device according to claim 13, wherein the calculation of the first knowledge graph and each of the second knowledge graphs is based on the first case knowledge information and each of the second case knowledge information The similarity between:
    根据所述第一案件知识信息和每个所述第二案件知识信息,计算所述第一知识图谱中的各实体节点与每个所述第二知识图谱中的对应实体节点之间的目标相似度;According to the first case knowledge information and each of the second case knowledge information, the target similarity between each entity node in the first knowledge graph and the corresponding entity node in each second knowledge graph is calculated degree;
    根据所述第一知识图谱中的每个实体节点各自对应的预设系数和每个所述目标相似度,计算所述第一知识图谱与每个所述第二知识图谱之间的相似度。According to the preset coefficient corresponding to each entity node in the first knowledge graph and the similarity of each target, the similarity between the first knowledge graph and each of the second knowledge graphs is calculated.
  15. 根据权利要求11至14所述的计算机设备,其中,所述根据所述争议焦点,从预存的已判案件库中确定待推荐的候选案件集,包括:The computer device according to claims 11 to 14, wherein the determining the candidate case set to be recommended from the pre-stored judged case database according to the dispute focus includes:
    计算所述争议焦点与预存的已判案件库中每个已判案件的争议焦点之间的相似度;Calculate the similarity between the dispute focus and the dispute focus of each judged case in the pre-stored judged case database;
    根据所述争议焦点与每个已判案件的争议焦点之间的相似度,从预存的已判案件库中确定待推荐的候选案件集。According to the similarity between the dispute focus and the dispute focus of each sentenced case, the candidate case set to be recommended is determined from the pre-stored sentenced case database.
  16. 一种计算机可读存储介质,其中,所述计算机可读存储介质上存储有计算机程序,其中所述计算机程序被处理器执行时,实现一种案件推荐方法,其中,所述案件推荐方法包括以下步骤:A computer-readable storage medium, wherein a computer program is stored on the computer-readable storage medium, and when the computer program is executed by a processor, a case recommendation method is implemented, wherein the case recommendation method includes the following step:
    获取目标案件的判案数据,并基于预设的争议焦点确定模型,根据所述判案数据确定所述目标案件的争议焦点;Obtain the judgment data of the target case, and determine the dispute focus of the target case based on the judgment data based on a preset dispute focus determination model;
    根据所述判案数据和所述争议焦点,构建所述目标案件的第一知识图谱;Construct a first knowledge graph of the target case according to the judgment data and the dispute focus;
    根据所述争议焦点,从预存的已判案件库中确定待推荐的候选案件集;According to the focus of the dispute, determine the candidate case set to be recommended from the pre-stored judged case database;
    获取所述候选案件集中每个候选案件各自对应的第二知识图谱,并计算所述第一知识图谱与每个所述第二知识图谱之间的相似度;Acquiring a second knowledge graph corresponding to each candidate case in the candidate case set, and calculating the similarity between the first knowledge graph and each of the second knowledge graphs;
    根据所述第一知识图谱与每个所述第二知识图谱之间的相似度,从所述候选案件集中确定同案同判案件。According to the degree of similarity between the first knowledge graph and each of the second knowledge graphs, co-judge cases are determined from the set of candidate cases.
  17. 根据权利要求16所述的计算机可读存储介质,其中,所述基于预设的争议焦点确定模型,根据所述判案数据确定所述目标案件的争议焦点,包括:The computer-readable storage medium according to claim 16, wherein the determining the focus of dispute of the target case based on the preset dispute focus determination model according to the judgment data comprises:
    从所述判案数据中提取出原告诉请信息、被告辩称信息和证据信息;Extract the plaintiff information, the defendant's defense information and the evidence information from the judgment data;
    基于预设的争议焦点确定模型根据所述原告诉请信息、被告辩称信息和证据信息确定预设数量的候选争议焦点和每个所述候选争议焦点的输出概率值;Based on the preset dispute focus determination model, determine a preset number of candidate dispute focus and the output probability value of each candidate dispute focus according to the plaintiff request information, the defendant's defense information, and the evidence information;
    将所述输出概率值最大的所述候选争议焦点作为所述目标案件的争议焦点。The candidate dispute focus with the largest output probability value is taken as the dispute focus of the target case.
  18. 根据权利要求16所述的计算机可读存储介质,其中,所述计算所述第一知识图谱与每个所述第二知识图谱之间的相似度,包括:The computer-readable storage medium according to claim 16, wherein the calculating the similarity between the first knowledge graph and each of the second knowledge graphs comprises:
    从所述第一知识图谱中获取第一案件知识信息,以及从每个所述第二知识图谱中获取各自对应的第二案件知识信息;Obtaining first case knowledge information from the first knowledge graph, and obtaining respective corresponding second case knowledge information from each of the second knowledge graphs;
    根据所述第一案件知识信息和每个所述第二案件知识信息,计算所述第一知识图谱与每个所述第二知识图谱之间的相似度。According to the first case knowledge information and each of the second case knowledge information, the similarity between the first knowledge graph and each of the second knowledge graphs is calculated.
  19. 根据权利要求18所述的计算机可读存储介质,其中,所述根据所述第一案件知识信息和每个所述第二案件知识信息,计算所述第一知识图谱与每个所述第二知识图谱之间的相似度,包括:18. The computer-readable storage medium of claim 18, wherein the first knowledge graph and each second case knowledge information are calculated based on the first case knowledge information and each of the second case knowledge information. The similarity between knowledge graphs, including:
    根据所述第一案件知识信息和每个所述第二案件知识信息,计算所述第一知识图谱中的各实体节点与每个所述第二知识图谱中的对应实体节点之间的目标相似度;According to the first case knowledge information and each of the second case knowledge information, the target similarity between each entity node in the first knowledge graph and the corresponding entity node in each second knowledge graph is calculated degree;
    根据所述第一知识图谱中的每个实体节点各自对应的预设系数和每个所述目标相似度,计算所述第一知识图谱与每个所述第二知识图谱之间的相似度。According to the preset coefficient corresponding to each entity node in the first knowledge graph and the similarity of each target, the similarity between the first knowledge graph and each of the second knowledge graphs is calculated.
  20. 根据权利要求16至19所述的计算机可读存储介质,其中,所述根据所述争议焦点,从预存的已判案件库中确定待推荐的候选案件集,包括:The computer-readable storage medium according to claims 16 to 19, wherein the determining the candidate case set to be recommended from the pre-stored judged case database according to the dispute focus includes:
    计算所述争议焦点与预存的已判案件库中每个已判案件的争议焦点之间的相似度;Calculate the similarity between the dispute focus and the dispute focus of each judged case in the pre-stored judged case database;
    根据所述争议焦点与每个已判案件的争议焦点之间的相似度,从预存的已判案件库中确定待推荐的候选案件集。According to the similarity between the dispute focus and the dispute focus of each sentenced case, the candidate case set to be recommended is determined from the pre-stored sentenced case database.
PCT/CN2020/093108 2019-09-18 2020-05-29 Case recommendation method and device, apparatus, and computer readable storage medium WO2021051865A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN201910883521.9 2019-09-18
CN201910883521.9A CN110795566A (en) 2019-09-18 2019-09-18 Case recommendation method, device and equipment and computer-readable storage medium

Publications (1)

Publication Number Publication Date
WO2021051865A1 true WO2021051865A1 (en) 2021-03-25

Family

ID=69427290

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2020/093108 WO2021051865A1 (en) 2019-09-18 2020-05-29 Case recommendation method and device, apparatus, and computer readable storage medium

Country Status (2)

Country Link
CN (1) CN110795566A (en)
WO (1) WO2021051865A1 (en)

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113011789A (en) * 2021-04-23 2021-06-22 集美大学 Overwater dangerous chemical accident emergency aid decision-making method, terminal equipment and storage medium
CN113901781A (en) * 2021-09-15 2022-01-07 昆明理工大学 Similar case matching method for fusing segmented coding and affine mechanism
CN116910277A (en) * 2023-09-13 2023-10-20 之江实验室 Knowledge graph construction method, resource searching method, computer equipment and medium

Families Citing this family (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110019655A (en) * 2017-07-21 2019-07-16 北京国双科技有限公司 Precedent case acquisition methods and device
CN110795566A (en) * 2019-09-18 2020-02-14 平安科技(深圳)有限公司 Case recommendation method, device and equipment and computer-readable storage medium
CN111814477B (en) * 2020-07-06 2022-06-21 重庆邮电大学 Dispute focus discovery method and device based on dispute focus entity and terminal
CN112487146B (en) * 2020-12-02 2022-05-31 重庆邮电大学 Legal case dispute focus acquisition method and device and computer equipment
CN112632224B (en) * 2020-12-29 2023-01-24 天津汇智星源信息技术有限公司 Case recommendation method and device based on case knowledge graph and electronic equipment
CN113468323B (en) * 2021-06-01 2023-07-18 成都数之联科技股份有限公司 Dispute focus category and similarity judging method, system and device and recommending method
CN114428848A (en) * 2021-12-29 2022-05-03 航天科工网络信息发展有限公司 Intelligent class case pushing method and system based on incidence relation

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
KR20010029099A (en) * 1999-09-29 2001-04-06 박용준 Automated intelligent item recommending system
CN107133276A (en) * 2017-04-12 2017-09-05 安徽富驰信息技术有限公司 A kind of judicial class case searching system and its search method
CN108280149A (en) * 2018-01-04 2018-07-13 东南大学 A kind of doctor-patient dispute class case recommendation method based on various dimensions tag along sort
CN108694178A (en) * 2017-04-06 2018-10-23 北京国双科技有限公司 A kind of method and device for recommending judicial cognizance
US20180367856A1 (en) * 2017-06-16 2018-12-20 The Directv Group, Inc. Scalable calculation of the similarity content graph
CN110795566A (en) * 2019-09-18 2020-02-14 平安科技(深圳)有限公司 Case recommendation method, device and equipment and computer-readable storage medium

Family Cites Families (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108241621B (en) * 2016-12-23 2019-12-10 北京国双科技有限公司 legal knowledge retrieval method and device
US10726071B2 (en) * 2017-02-02 2020-07-28 Kensho Technologies, Llc Content search engine
CN108009299A (en) * 2017-12-28 2018-05-08 北京市律典通科技有限公司 Law tries method and device for business processing
CN110209828B (en) * 2018-02-12 2021-08-27 北大方正集团有限公司 Case query method, case query device, computer device and storage medium

Patent Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
KR20010029099A (en) * 1999-09-29 2001-04-06 박용준 Automated intelligent item recommending system
CN108694178A (en) * 2017-04-06 2018-10-23 北京国双科技有限公司 A kind of method and device for recommending judicial cognizance
CN107133276A (en) * 2017-04-12 2017-09-05 安徽富驰信息技术有限公司 A kind of judicial class case searching system and its search method
US20180367856A1 (en) * 2017-06-16 2018-12-20 The Directv Group, Inc. Scalable calculation of the similarity content graph
CN108280149A (en) * 2018-01-04 2018-07-13 东南大学 A kind of doctor-patient dispute class case recommendation method based on various dimensions tag along sort
CN110795566A (en) * 2019-09-18 2020-02-14 平安科技(深圳)有限公司 Case recommendation method, device and equipment and computer-readable storage medium

Cited By (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113011789A (en) * 2021-04-23 2021-06-22 集美大学 Overwater dangerous chemical accident emergency aid decision-making method, terminal equipment and storage medium
CN113901781A (en) * 2021-09-15 2022-01-07 昆明理工大学 Similar case matching method for fusing segmented coding and affine mechanism
CN113901781B (en) * 2021-09-15 2024-04-26 昆明理工大学 Similar case matching method integrating segment coding and affine mechanism
CN116910277A (en) * 2023-09-13 2023-10-20 之江实验室 Knowledge graph construction method, resource searching method, computer equipment and medium
CN116910277B (en) * 2023-09-13 2024-02-27 之江实验室 Knowledge graph construction method, resource searching method, computer equipment and medium

Also Published As

Publication number Publication date
CN110795566A (en) 2020-02-14

Similar Documents

Publication Publication Date Title
WO2021051865A1 (en) Case recommendation method and device, apparatus, and computer readable storage medium
WO2021051866A1 (en) Method and apparatus for determining case judgment result, device, and computer-readable storage medium
US10795922B2 (en) Authorship enhanced corpus ingestion for natural language processing
US9373086B1 (en) Crowdsource reasoning process to facilitate question answering
US10366093B2 (en) Query result bottom retrieval method and apparatus
US9213946B1 (en) Comparing models
US8150827B2 (en) Methods for enhancing efficiency and cost effectiveness of first pass review of documents
WO2021114810A1 (en) Graph structure-based official document recommendation method, apparatus, computer device, and medium
CN107844533A (en) A kind of intelligent Answer System and analysis method
Pervin et al. Fast, scalable, and context-sensitive detection of trending topics in microblog post streams
GB2568233A (en) A computer implemented determination method and system
CN111401077A (en) Language model processing method and device and computer equipment
CN116134432A (en) System and method for providing answers to queries
WO2024036840A1 (en) Open-domain dialogue reply method and system based on topic enhancement
US11475329B2 (en) Systems and methods for adaptive training of a machine learning system processing textual data
CN116304748A (en) Text similarity calculation method, system, equipment and medium
CN115510226A (en) Emotion classification method based on graph neural network
JP2021068448A5 (en)
US20230169361A1 (en) Generating answers to multi-hop constraint-based questions from knowledge graphs
CN113010690B (en) Method for enhancing entity embedding based on text information
CN114491079A (en) Knowledge graph construction and query method, device, equipment and medium
CN110674313A (en) Method for dynamically updating knowledge graph based on user log
WO2022061877A1 (en) Event extraction and extraction model training method, apparatus and device, and medium
GB2572320A (en) Hate speech detection system for online media content
CN112667797A (en) Question-answer matching method, system and storage medium for adaptive transfer learning

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 20865656

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 20865656

Country of ref document: EP

Kind code of ref document: A1