CN108241621B - legal knowledge retrieval method and device - Google Patents


Info

Publication number
CN108241621B
Authority
CN
China
Prior art keywords
dispute focus
dispute
text
focus
legal knowledge
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201611204508.9A
Other languages
Chinese (zh)
Other versions
CN108241621A (en
Inventor
石鹏
贾炜
舒怡
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beijing Gridsum Technology Co Ltd
Original Assignee
Beijing Gridsum Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beijing Gridsum Technology Co Ltd
Priority to CN201611204508.9A
Priority to PCT/CN2017/113804 (WO2018113498A1)
Publication of CN108241621A
Application granted
Publication of CN108241621B
Active
Anticipated expiration

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/33Querying
    • G06F16/3331Query processing
    • G06F16/334Query execution
    • G06F16/3344Query execution using natural language analysis
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/20Natural language analysis
    • G06F40/205Parsing
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/30Semantic analysis

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Computational Linguistics (AREA)
  • Artificial Intelligence (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • General Health & Medical Sciences (AREA)
  • Health & Medical Sciences (AREA)
  • Data Mining & Analysis (AREA)
  • Databases & Information Systems (AREA)
  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)

Abstract

The invention discloses a legal knowledge retrieval method and device, relates to the technical field of data processing, and solves the problems of low retrieval efficiency and low applicability of the conventional legal knowledge. The main technical scheme of the invention is as follows: training a dispute focus data model according to text data corresponding to each dispute focus in the existing case document; the text data is used for explaining the corresponding dispute focus in the existing case document; when receiving an instruction of acquiring legal knowledge points of a target case document, identifying a dispute focus in the target case document through the dispute focus data model; and outputting the legal knowledge points corresponding to the dispute focus through the corresponding relation between the dispute focus and the legal knowledge points. The method is mainly used for retrieving legal knowledge corresponding to the target case document.

Description

Legal knowledge retrieval method and device
Technical Field
The invention relates to the technical field of data processing, and in particular to a legal knowledge retrieval method and device.
Background
A case document is a specialized document produced and used by law-enforcement and judicial bodies (investigation, prosecution, trial, notarization and the like) at each stage of handling a case. Case documents mainly include documents that have legal effect, such as judgments and rulings. They also include documents that do not directly have legal effect but support law enforcement in practice, such as complaints, statements of defense and court hearing transcripts. Judges, lawyers, parties and other persons closely involved in a case retrieve legal knowledge related to the case documents in order to analyze the case better.
At present, a person closely involved in the case, such as a judge, lawyer or party, analyzes the target case, summarizes its legal and factual focus in legal terminology, and then searches for legal knowledge related to that focus by keyword search. This approach requires the user to summarize the legal and factual focus of the target case accurately, which places too high a demand on the user. Summarizing the focus also takes additional manual time. As a result, conventional legal knowledge retrieval has low efficiency and low applicability.
Disclosure of Invention
The present invention has been made in view of the above problems, and aims to provide a legal knowledge retrieval method and device that overcome or at least partially solve them.
To achieve this purpose, the invention mainly provides the following technical solutions:
In one aspect, an embodiment of the present invention provides a legal knowledge retrieval method, including:
training a dispute focus data model according to text data corresponding to each dispute focus in existing case documents, where the text data explains the corresponding dispute focus in the existing case documents;
when an instruction to acquire the legal knowledge points of a target case document is received, identifying a dispute focus in the target case document through the dispute focus data model; and
outputting the legal knowledge points corresponding to the dispute focus through the correspondence between dispute focuses and legal knowledge points.
In another aspect, an embodiment of the present invention provides a legal knowledge retrieval device, including:
a training unit, used for training a dispute focus data model according to text data corresponding to each dispute focus in existing case documents, where the text data explains the corresponding dispute focus in the existing case documents;
an identification unit, used for identifying a dispute focus in a target case document through the dispute focus data model when an instruction to acquire the legal knowledge points of the target case document is received; and
an output unit, used for outputting the legal knowledge points corresponding to the dispute focus through the correspondence between dispute focuses and legal knowledge points.
With the above technical solutions, the embodiments of the invention have at least the following advantages:
The legal knowledge retrieval method and device provided by the embodiments of the invention train a dispute focus data model according to the text data corresponding to each dispute focus in existing case documents. When an instruction to acquire the legal knowledge points of a target case document is received, the dispute focus in the target case document is identified through the dispute focus data model, and the legal knowledge points corresponding to that dispute focus are output through the correspondence between dispute focuses and legal knowledge points. In the prior art, legal knowledge related to a target case is retrieved only after the dispute focus has been summarized manually. By contrast, the embodiments of the invention identify the dispute focus contained in the target case document directly with the dispute focus data model and output the corresponding legal knowledge points. The legal knowledge points for the target case document can therefore be retrieved quickly without manual summarization, which improves the efficiency and applicability of legal knowledge retrieval.
Drawings
Various other advantages and benefits will become apparent to those of ordinary skill in the art upon reading the following detailed description of the preferred embodiments. The drawings are only for purposes of illustrating the preferred embodiments and are not to be construed as limiting the invention. Also, like reference numerals are used to refer to like parts throughout the drawings. In the drawings:
FIG. 1 is a flowchart of a legal knowledge retrieval method according to an embodiment of the present invention;
FIG. 2 is a flow chart of another legal knowledge retrieval method provided by an embodiment of the present invention;
FIG. 3 is a block diagram of a legal knowledge retrieval apparatus according to an embodiment of the present invention;
FIG. 4 is a block diagram of another legal knowledge retrieval apparatus according to an embodiment of the present invention.
Detailed Description
Exemplary embodiments of the present disclosure will be described in more detail below with reference to the accompanying drawings. While exemplary embodiments of the present disclosure are shown in the drawings, it should be understood that the present disclosure may be embodied in various forms and should not be limited to the embodiments set forth herein. Rather, these embodiments are provided so that this disclosure will be thorough and complete, and will fully convey the scope of the disclosure to those skilled in the art.
In order to make the advantages of the technical solutions of the present invention clearer, the present invention is described in detail below with reference to the accompanying drawings and examples.
The embodiment of the invention provides a legal knowledge retrieval method. As shown in FIG. 1, the method comprises the following steps:
101. Train a dispute focus data model according to the text data corresponding to each dispute focus in the existing case documents.
The text data explains the corresponding dispute focus in the existing case documents. The existing case documents may be complaints, statements of defense, court hearing transcripts, judgment documents and the like; the embodiment of the invention is not specifically limited in this respect. In the embodiment of the invention, the dispute focus data model is obtained by training on a large number of existing case documents, that is, on the dispute focuses of those documents together with the text sentences and/or description sentences corresponding to each dispute focus.
It should be noted that dispute focuses are summarized by legal experts from their analysis of existing case documents, and the same dispute may be summarized from different legal angles. Take a trademark infringement dispute as an example: under trademark law, the question may be how the exclusive right to the trademark was infringed; under contract law, the question may be how the terms of a trademark license contract were breached; the dispute may also concern compensation after the infringement. If a case document contains disputes from several angles, all of them need to be summarized.
In the embodiment of the invention, after sorting out the dispute focuses of an existing case document, the legal expert excerpts the text sentences in the document that correspond to each dispute focus and writes, according to a specification, a description sentence describing the dispute focus. The description sentences and the excerpted text sentences together form a data set, the dispute focus data set, from which the dispute focus data model is generated by a machine learning method. The generated dispute focus data model is then used to identify the dispute focuses contained in a target case document.
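As a minimal sketch of how the dispute focus data set described above might be represented, the following Python groups one expert-written description sentence with the excerpted text sentences for each dispute focus, then flattens them into labeled training pairs. The class name `DisputeFocusRecord`, the function `build_dataset` and the sample sentences are assumptions made for illustration; the patent does not prescribe a storage format.

```python
from dataclasses import dataclass, field


@dataclass
class DisputeFocusRecord:
    """One dispute focus with its expert-written description and excerpts."""
    focus: str                      # label summarized by the legal expert
    description: str                # description sentence written to a specification
    excerpts: list[str] = field(default_factory=list)  # sentences taken from case documents


def build_dataset(records):
    """Flatten records into (sentence, dispute focus) training pairs."""
    pairs = []
    for rec in records:
        pairs.append((rec.description, rec.focus))
        pairs.extend((text, rec.focus) for text in rec.excerpts)
    return pairs


# Hypothetical sample data, not taken from any real case document.
records = [
    DisputeFocusRecord(
        focus="trademark infringement",
        description="whether the defendant infringed the exclusive right to the trademark",
        excerpts=["the defendant sold goods bearing the registered trademark"],
    ),
    DisputeFocusRecord(
        focus="license contract breach",
        description="whether the licensee breached the trademark license contract",
    ),
]
dataset = build_dataset(records)
```

Keeping the description sentence and the excerpts under the same label mirrors the text, where both kinds of sentence feed the same model.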
102. When an instruction to acquire the legal knowledge points of a target case document is received, identify the dispute focus in the target case document through the dispute focus data model.
The target case document is the document for which legal knowledge is to be retrieved, that is, the document for which corresponding legal knowledge needs to be recommended. Because the dispute focus data model is obtained by training on the dispute focus data set, when the instruction to acquire the legal knowledge points of the target case document is received, the target case document is fed into the dispute focus data model, and the dispute focus it contains is identified using natural-language semantic analysis.
103. Output the legal knowledge points corresponding to the dispute focus through the correspondence between dispute focuses and legal knowledge points.
In the embodiment of the invention, the legal knowledge system is constructed from judicial practice: the legal rules applicable to a dispute focus are summarized into a legal knowledge point. A legal knowledge point aggregates multiple provisions from multiple laws and regulations and thus reflects how the law is actually applied. Legal knowledge points are linked into the overall system of laws and regulations through legal concepts and rules.
In judicial practice, each dispute focus needs a corresponding knowledge point to help judges, lawyers and others understand the relevant legal concepts, judicial interpretations, practice cases, academic opinions and so on. The legal knowledge point corresponding to a dispute focus is therefore such an aggregation of provisions, connected to the overall system of laws and regulations through legal concepts and rules. The legal knowledge points may be legal concepts, judicial interpretations, practice cases, academic opinions, provisions of laws and regulations, provisions of judicial interpretations, legal books and articles, case documents and the like; the embodiment of the present invention is not specifically limited in this respect.
It should be noted that each legal knowledge point is linked to a set of legal materials, including provisions of judicial interpretations, legal books and articles, case documents and the like, and each legal knowledge point sets out the adjudication rule for resolving one dispute focus of a case. All legal knowledge points are organized in order to form a judicial knowledge case document library, which is a knowledge system organized around the legal knowledge points needed in judicial practice. It combines the specific content of multiple related laws, which reflects the legal grounding of the legal knowledge points.
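The correspondence between dispute focuses and legal knowledge points in step 103 can be pictured as a lookup table. The sketch below is only an illustration under that assumption: the table name `FOCUS_TO_KNOWLEDGE`, the function `knowledge_points_for` and every entry are hypothetical placeholders, not real provisions or the patent's actual data structure.

```python
# Hypothetical correspondence table: dispute focus -> legal knowledge points.
# Every entry is a placeholder, not a real citation.
FOCUS_TO_KNOWLEDGE = {
    "trademark infringement": [
        {"kind": "statutory provision", "title": "placeholder provision A"},
        {"kind": "judicial interpretation", "title": "placeholder interpretation B"},
    ],
    "license contract breach": [
        {"kind": "statutory provision", "title": "placeholder provision C"},
    ],
}


def knowledge_points_for(focus, table=FOCUS_TO_KNOWLEDGE):
    """Return the knowledge points linked to a dispute focus (empty list if unknown)."""
    return list(table.get(focus, []))


points = knowledge_points_for("trademark infringement")
```

Returning an empty list for an unknown focus is a design choice of this sketch; the patent does not say how unmatched focuses are handled.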
The legal knowledge retrieval method provided by the embodiment of the invention trains a dispute focus data model according to the text data corresponding to each dispute focus in existing case documents. When an instruction to acquire the legal knowledge points of a target case document is received, the dispute focus in the target case document is identified through the dispute focus data model, and the legal knowledge points corresponding to that dispute focus are output through the correspondence between dispute focuses and legal knowledge points. In the prior art, legal knowledge related to a target case is retrieved only after the dispute focus has been summarized manually. By contrast, the embodiment of the invention identifies the dispute focus contained in the target case document directly with the dispute focus data model and outputs the corresponding legal knowledge points. The legal knowledge points for the target case document can therefore be retrieved quickly without manual summarization, which improves the efficiency and applicability of legal knowledge retrieval.
The embodiment of the invention provides another legal knowledge retrieval method. As shown in FIG. 2, the method comprises the following steps:
201. Extract from the existing case documents the text sentences corresponding to each dispute focus.
In the embodiment of the invention, dispute focuses are summarized by legal experts from their analysis of existing case documents, and the same dispute may be summarized from different legal angles. If a case document contains disputes from several angles, all of them need to be summarized. In judicial practice, each dispute focus needs a corresponding knowledge point to help judges, lawyers and others understand the relevant legal concepts, judicial interpretations, practice cases, academic opinions and so on.
Legal experts excerpt, from each existing case document, the text sentences that express a specific dispute focus, that is, the text sentences that correspond directly to a legal knowledge point, and record the position of each excerpted sentence in the document. The excerpted text may be a paragraph from the plaintiff's claims, the defendant's response, the court debate, the court's opinion and the like in the existing case document; the embodiment of the invention is not specifically limited in this respect.
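Recording the position of an excerpted sentence can be sketched as storing character offsets into the source document. This is an assumption for illustration: the patent does not say whether positions are offsets, paragraph numbers or something else, and the function name `locate_excerpt` and the sample text are hypothetical.

```python
def locate_excerpt(document, excerpt):
    """Return (start, end) character offsets of excerpt in document, or None if absent."""
    start = document.find(excerpt)
    if start == -1:
        return None
    return start, start + len(excerpt)


# Hypothetical sample text, not from a real case document.
document = "The court holds that the defendant used the mark without a license."
excerpt = "the defendant used the mark without a license"
span = locate_excerpt(document, excerpt)
```

Storing the span rather than a copy of the sentence lets the excerpt be traced back to its place in the original document.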
202. Set, according to the existing case documents, the description sentences corresponding to each dispute focus.
The legal expert writes, according to a specification, a description sentence describing the dispute focus, producing a new document knowledge point that describes the dispute focus in the case document. The document knowledge point has a fixed format and full, detailed content. The resulting description sentences and the text sentences excerpted in step 201 together form a data set, the dispute focus data set. The dispute focus data model can be obtained by training on this data set.
203. Train the dispute focus data model according to the text sentences and description sentences corresponding to the dispute focuses.
In the embodiment of the present invention, training the dispute focus data model according to the text sentences and description sentences corresponding to the dispute focuses includes: acquiring the text sentences and description sentences from the existing case documents that relate to the same dispute focus; performing semantic analysis on the acquired text sentences and description sentences to generate a fact attribute vector corresponding to the dispute focus; and generating the dispute focus data model according to the correspondence between dispute focuses and fact attribute vectors.
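The three training sub-steps can be sketched as follows. The patent does not specify how semantic analysis produces a fact attribute vector, so this example substitutes a plain term-count vector per dispute focus as a stand-in; `tokenize`, `train_focus_model` and the sample sentences are assumptions, and real Chinese text would need a word segmenter rather than whitespace splitting.

```python
from collections import Counter, defaultdict


def tokenize(text):
    """Whitespace tokenizer (an assumption; stands in for real semantic analysis)."""
    return text.lower().split()


def train_focus_model(pairs):
    """Group sentences by dispute focus and build one term-count vector per focus."""
    model = defaultdict(Counter)
    for text, focus in pairs:
        model[focus].update(tokenize(text))
    return dict(model)


# Hypothetical (sentence, dispute focus) pairs.
pairs = [
    ("unauthorized use of a registered trademark", "trademark infringement"),
    ("the defendant sold goods bearing the plaintiff's trademark", "trademark infringement"),
    ("breach of the license contract", "license contract breach"),
]
model = train_focus_model(pairs)
```

The resulting dictionary plays the role of the correspondence between dispute focuses and fact attribute vectors that the text calls the dispute focus data model.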
The computer performs semantic analysis on the dispute focus data set to form multi-dimensional data vectors of legal knowledge and factual attributes. A dispute focus data model is then generated from the dispute focus data set by a machine learning method, and this model is used to analyze all existing case documents and find the documents that match a dispute focus. Dispute focus text sentences are extracted from some of the matched documents, legal experts write the corresponding dispute focus description sentences, and these are merged with the initial dispute focus data set to train a new dispute focus data model by machine learning. These steps are repeated several times, and the final version of the model becomes the dispute focus data model for the legal knowledge points.
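The iterative refinement described above resembles a bootstrapping loop with a human reviewer in it. The sketch below illustrates that loop under strong simplifying assumptions: term counts stand in for semantic analysis, a term-overlap score stands in for matching, and `expert_review` is a hypothetical callback representing the legal expert who confirms or rejects a machine match.

```python
from collections import Counter


def train(pairs):
    """Build one term-count vector per dispute focus (stand-in for machine learning)."""
    model = {}
    for text, focus in pairs:
        model.setdefault(focus, Counter()).update(text.lower().split())
    return model


def match(model, document):
    """Return the focus sharing the most terms with the document, or None if none overlap."""
    words = Counter(document.lower().split())
    best, best_score = None, 0
    for focus, vec in model.items():
        score = sum(min(words[w], c) for w, c in vec.items())
        if score > best_score:
            best, best_score = focus, score
    return best


def refine(seed_pairs, documents, expert_review, rounds=2):
    """Retrain repeatedly, adding expert-approved matches to the data set each round."""
    pairs = list(seed_pairs)
    model = train(pairs)
    for _ in range(rounds):
        for doc in documents:
            focus = match(model, doc)
            if focus is None:
                continue
            reviewed = expert_review(doc, focus)  # (sentence, focus) pair, or None to reject
            if reviewed is not None and reviewed not in pairs:
                pairs.append(reviewed)
        model = train(pairs)
    return model, pairs


def accept_if_mentions_trademark(doc, focus):
    """Hypothetical reviewer: approves a match only if the text mentions a trademark."""
    return (doc, focus) if "trademark" in doc else None


seed = [("unauthorized use of a registered trademark", "trademark infringement")]
docs = ["the defendant sold goods bearing the trademark", "the tenant failed to pay rent"]
model, pairs = refine(seed, docs, accept_if_mentions_trademark)
```

In this run the second document is matched in the second round only through the common word "the", and the reviewer rejects it, which illustrates why the text keeps legal experts in the loop.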
204. When an instruction to acquire the legal knowledge points of a target case document is received, identify the dispute focus in the target case document through the dispute focus data model.
In the embodiment of the present invention, identifying the dispute focus in the target case document through the dispute focus data model includes: generating a text vector corresponding to the target case document; acquiring the fact attribute vector with the highest similarity to the text vector; and determining the dispute focus corresponding to the acquired fact attribute vector as the dispute focus of the target case document.
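The identification step can be sketched as a nearest-vector search. The patent does not name a similarity measure, so cosine similarity over term-count vectors is used here as an assumption; `cosine`, `identify_focus` and the sample model are hypothetical.

```python
import math
from collections import Counter


def cosine(a, b):
    """Cosine similarity between two sparse term-count vectors."""
    dot = sum(a[t] * b[t] for t in a.keys() & b.keys())
    norm = math.sqrt(sum(v * v for v in a.values())) * math.sqrt(sum(v * v for v in b.values()))
    return dot / norm if norm else 0.0


def identify_focus(model, document):
    """Return the dispute focus whose fact attribute vector is most similar to the document."""
    text_vector = Counter(document.lower().split())  # text vector of the target document
    return max(model, key=lambda focus: cosine(text_vector, model[focus]))


# Hypothetical model: dispute focus -> fact attribute vector (term counts).
model = {
    "trademark infringement": Counter("unauthorized use of a registered trademark".split()),
    "license contract breach": Counter("breach of the license contract terms".split()),
}
focus = identify_focus(model, "the defendant made unauthorized use of the trademark")
```

A production system would likely also apply a minimum similarity threshold so that a document with no real match is not forced onto the nearest focus; that threshold is outside what the text describes.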
In the embodiment of the invention, semantic analysis of the dispute focus data set yields multi-dimensional data vectors of legal knowledge points and factual attributes, and a machine learning algorithm generates the dispute focus data model for the legal knowledge points. New dispute focus data sets are then acquired iteratively, and a new dispute focus data model is learned from them. By analyzing the target case document with the dispute focus data model, the computer can obtain the dispute focus in the document and recommend the corresponding legal knowledge points. The embodiment of the invention thereby improves the efficiency of legal knowledge retrieval.
205. Output the legal knowledge points corresponding to the dispute focus through the correspondence between dispute focuses and legal knowledge points.
The legal knowledge points corresponding to a dispute focus aggregate multiple provisions from multiple laws and regulations and reflect how the law is actually applied; they are also connected to the overall system of laws and regulations through legal concepts and rules.
Further, the method also comprises marking the text sentences in the target case document that contain the dispute focus. In the embodiment of the invention, the dispute focus data model makes it possible to push accurate legal knowledge points for the target case document and to mark the sentences in it that correspond to the dispute focus. By analyzing the target case document with the dispute focus data model, the computer identifies the dispute focuses it contains, so that the machine can automatically retrieve and recommend the laws and regulations, legal books and periodicals, typical case documents, related case documents and the like that correspond to the legal knowledge points.
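Marking the sentences that carry the dispute focus can be sketched as scoring each sentence against the terms associated with the identified focus. The patent does not describe the marking procedure, so the term-overlap rule, the `min_overlap` threshold, the function name `mark_focus_sentences` and the sample text below are all assumptions.

```python
import re


def mark_focus_sentences(document, focus_terms, min_overlap=2):
    """Return (sentence, is_marked) pairs; a sentence is marked if it shares enough focus terms."""
    sentences = [s.strip() for s in re.split(r"(?<=[.!?])\s+", document) if s.strip()]
    marked = []
    for sentence in sentences:
        words = set(re.findall(r"\w+", sentence.lower()))
        marked.append((sentence, len(words & focus_terms) >= min_overlap))
    return marked


# Hypothetical target document and focus terms.
document = (
    "The plaintiff registered the mark in 2010. "
    "The defendant used the registered trademark without authorization. "
    "Costs are reserved."
)
focus_terms = {"registered", "trademark", "authorization"}
result = mark_focus_sentences(document, focus_terms)
```

Here only the second sentence is marked, because it is the only one that shares at least two terms with the focus.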
In the prior art, legal knowledge related to a target case is retrieved only after the dispute focus has been summarized manually. By contrast, the legal knowledge retrieval method provided by the embodiment of the invention identifies the dispute focus contained in the target case document directly with the dispute focus data model, outputs the legal knowledge points corresponding to the identified dispute focus, and marks the dispute focus sentences in the target case document, all without manual summarization of the dispute focus. The embodiment of the invention therefore improves the efficiency and applicability of legal knowledge retrieval.
Further, an embodiment of the present invention provides a legal knowledge retrieval device. As shown in FIG. 3, the device includes a training unit 31, an identification unit 32 and an output unit 33.
The training unit 31 is used for training a dispute focus data model according to the text data corresponding to each dispute focus in existing case documents, where the text data explains the corresponding dispute focus in the existing case documents.
The identification unit 32 is used for identifying the dispute focus in a target case document through the dispute focus data model when an instruction to acquire the legal knowledge points of the target case document is received.
The output unit 33 is used for outputting the legal knowledge points corresponding to the dispute focus through the correspondence between dispute focuses and legal knowledge points.
It should be noted that, for other descriptions of the functional units of the legal knowledge retrieval device provided in this embodiment, reference may be made to the corresponding descriptions of the method shown in FIG. 1, which are not repeated here. It should be clear, however, that the device in this embodiment can implement all the contents of the foregoing method embodiment.
The legal knowledge retrieval device provided by the embodiment of the invention trains a dispute focus data model according to the text data corresponding to each dispute focus in existing case documents. When an instruction to acquire the legal knowledge points of a target case document is received, the dispute focus in the target case document is identified through the dispute focus data model, and the legal knowledge points corresponding to that dispute focus are output through the correspondence between dispute focuses and legal knowledge points. In the prior art, legal knowledge related to a target case is retrieved only after the dispute focus has been summarized manually. By contrast, the embodiment of the invention identifies the dispute focus contained in the target case document directly with the dispute focus data model and outputs the corresponding legal knowledge points. The legal knowledge points for the target case document can therefore be retrieved quickly without manual summarization, which improves the efficiency and applicability of legal knowledge retrieval.
Further, another legal knowledge retrieval device is provided in the embodiment of the present invention. As shown in FIG. 4, the device includes a training unit 41, an identification unit 42 and an output unit 43.
The training unit 41 is used for training a dispute focus data model according to the text data corresponding to each dispute focus in existing case documents, where the text data explains the corresponding dispute focus in the existing case documents.
The identification unit 42 is used for identifying the dispute focus in a target case document through the dispute focus data model when an instruction to acquire the legal knowledge points of the target case document is received.
The output unit 43 is used for outputting the legal knowledge points corresponding to the dispute focus through the correspondence between dispute focuses and legal knowledge points.
Specifically, the training unit 41 includes:
an extraction module 411, used for extracting from the existing case documents the text sentences corresponding to each dispute focus;
a setting module 412, used for setting, according to the existing case documents, the description sentences corresponding to each dispute focus; and
a training module 413, used for training the dispute focus data model according to the text sentences and description sentences corresponding to the dispute focuses.
Specifically, the training module 413 includes:
an obtaining submodule, used for obtaining the text sentences and description sentences from the existing case documents that relate to the same dispute focus; and
a generation submodule, used for performing semantic analysis on the obtained text sentences and description sentences to generate a fact attribute vector corresponding to the dispute focus;
the generation submodule is also used for generating the dispute focus data model according to the correspondence between dispute focuses and fact attribute vectors.
Specifically, the identification unit 42 includes:
a generating module 421, used for generating a text vector corresponding to the target case document;
an obtaining module 422, used for obtaining the fact attribute vector with the highest similarity to the text vector; and
a determining module 423, used for determining the dispute focus corresponding to the obtained fact attribute vector as the dispute focus of the target case document.
Further, the device also comprises:
a marking unit 44, used for marking the text sentences in the target case document that contain the dispute focus.
It should be noted that, for other descriptions of the functional units of the legal knowledge retrieval device provided in this embodiment, reference may be made to the corresponding descriptions of the method shown in FIG. 2, which are not repeated here. It should be clear, however, that the device in this embodiment can implement all the contents of the foregoing method embodiment.
In the prior art, legal knowledge related to a target case is retrieved only after the dispute focus has been summarized manually. By contrast, the legal knowledge retrieval device provided by the embodiment of the invention identifies the dispute focus contained in the target case document directly with the dispute focus data model, outputs the legal knowledge points corresponding to the identified dispute focus, and marks the dispute focus sentences in the target case document, all without manual summarization of the dispute focus. The device therefore improves the efficiency and applicability of legal knowledge retrieval.
The legal knowledge retrieval device comprises a processor and a memory. The training unit, the identification unit, the output unit, the marking unit and the like are stored in the memory as program units, and the processor executes the program units stored in the memory to realize the corresponding functions.
The processor contains a kernel, which calls the corresponding program unit from the memory. One or more kernels may be provided, and the problem of low efficiency and applicability in existing legal knowledge retrieval is addressed by adjusting kernel parameters.
The memory may include forms of computer-readable media such as volatile memory, random access memory (RAM), and/or non-volatile memory such as read-only memory (ROM) or flash memory (flash RAM); the memory includes at least one memory chip.
The present application further provides a computer program product adapted to perform program code for initializing the following method steps when executed on a data processing device: training a dispute focus data model according to text data corresponding to each dispute focus in the existing case document; the text data is used for explaining the corresponding dispute focus in the existing case document; when receiving an instruction of acquiring legal knowledge points of a target case document, identifying a dispute focus in the target case document through the dispute focus data model; and outputting the legal knowledge points corresponding to the dispute focus through the corresponding relation between the dispute focus and the legal knowledge points.
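The three method steps (training, identification, output) can be sketched as follows. This is only an illustration under stated assumptions: the patent does not define the semantic analysis, so a lowercase bag-of-words count stands in for it, cosine similarity stands in for the unspecified similarity measure, and the function names, sample sentences and knowledge points are all hypothetical.

```python
import math
import re
from collections import Counter


def vectorize(text):
    """Assumed stand-in for semantic analysis: bag-of-words term counts."""
    return Counter(re.findall(r"[a-z]+", text.lower()))


def cosine(a, b):
    """Cosine similarity between two sparse term-count vectors."""
    dot = sum(a[t] * b[t] for t in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def train(text_data):
    """Step 1: build one fact attribute vector per dispute focus."""
    model = {}
    for focus, sentences in text_data.items():
        vector = Counter()
        for sentence in sentences:
            vector.update(vectorize(sentence))
        model[focus] = vector
    return model


def identify(model, document):
    """Step 2: pick the dispute focus whose vector is most similar to the document."""
    doc_vector = vectorize(document)
    return max(model, key=lambda focus: cosine(model[focus], doc_vector))


def retrieve(model, knowledge_points, document):
    """Step 3: output the legal knowledge points mapped to the identified focus."""
    return knowledge_points[identify(model, document)]


# Hypothetical sample data, not taken from the patent.
TEXT_DATA = {
    "validity of the loan contract": [
        "The parties dispute whether the loan contract is valid.",
        "Whether the loan agreement was validly concluded.",
    ],
    "amount of overdue interest": [
        "The parties dispute the amount of overdue interest owed.",
        "How the overdue interest should be calculated.",
    ],
}
KNOWLEDGE_POINTS = {
    "validity of the loan contract": ["conditions for a valid contract"],
    "amount of overdue interest": ["rules on calculating overdue interest"],
}
TARGET = "The defendant argues that the overdue interest was calculated incorrectly."
```

With this sample data, `retrieve(train(TEXT_DATA), KNOWLEDGE_POINTS, TARGET)` selects the overdue-interest focus and returns its knowledge points. Chinese case documents would additionally need word segmentation before vectorization, which this sketch omits.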
As will be appreciated by one skilled in the art, embodiments of the present application may be provided as a method, system, or computer program product. Accordingly, the present application may take the form of an entirely hardware embodiment, an entirely software embodiment or an embodiment combining software and hardware aspects. Furthermore, the present application may take the form of a computer program product embodied on one or more computer-usable storage media (including, but not limited to, disk storage, CD-ROM, optical storage, and the like) having computer-usable program code embodied therein.
The present application is described with reference to flowchart illustrations and/or block diagrams of methods, apparatus (systems), and computer program products according to embodiments of the application. It will be understood that each flow and/or block of the flow diagrams and/or block diagrams, and combinations of flows and/or blocks in the flow diagrams and/or block diagrams, can be implemented by computer program instructions. These computer program instructions may be provided to a processor of a general purpose computer, special purpose computer, embedded processor, or other programmable data processing apparatus to produce a machine, such that the instructions, which execute via the processor of the computer or other programmable data processing apparatus, create means for implementing the functions specified in the flowchart flow or flows and/or block diagram block or blocks.
These computer program instructions may also be stored in a computer-readable memory that can direct a computer or other programmable data processing apparatus to function in a particular manner, such that the instructions stored in the computer-readable memory produce an article of manufacture including instruction means which implement the function specified in the flowchart flow or flows and/or block diagram block or blocks.
These computer program instructions may also be loaded onto a computer or other programmable data processing apparatus to cause a series of operational steps to be performed on the computer or other programmable apparatus to produce a computer implemented process such that the instructions which execute on the computer or other programmable apparatus provide steps for implementing the functions specified in the flowchart flow or flows and/or block diagram block or blocks.
In a typical configuration, a computing device includes one or more processors (CPUs), input/output interfaces, network interfaces, and memory.
The memory may include volatile memory in a computer-readable medium, such as random access memory (RAM), and/or non-volatile memory, such as read-only memory (ROM) or flash memory (flash RAM). The memory is an example of a computer-readable medium.
Computer-readable media include both permanent and non-permanent, removable and non-removable media, and may implement information storage by any method or technology. The information may be computer-readable instructions, data structures, modules of a program, or other data. Examples of computer storage media include, but are not limited to, phase change memory (PRAM), static random access memory (SRAM), dynamic random access memory (DRAM), other types of random access memory (RAM), read-only memory (ROM), electrically erasable programmable read-only memory (EEPROM), flash memory or other memory technology, compact disc read-only memory (CD-ROM), digital versatile discs (DVD) or other optical storage, magnetic cassettes, magnetic tape, magnetic disk storage or other magnetic storage devices, or any other non-transmission medium that can be used to store information that can be accessed by a computing device. As defined herein, computer-readable media do not include transitory computer-readable media such as modulated data signals and carrier waves.
The above are merely examples of the present application and are not intended to limit the present application. Various modifications and changes may occur to those skilled in the art. Any modification, equivalent replacement, improvement, etc. made within the spirit and principle of the present application should be included in the scope of the claims of the present application.

Claims (8)

1. A legal knowledge retrieval method, comprising:
Training a dispute focus data model according to text data corresponding to each dispute focus in the existing case document; the text data is used for explaining a corresponding dispute focus in the existing case document, the dispute focus is obtained by a legal expert through analysis and summarization of the existing case document, and the text data corresponding to the dispute focus is a text sentence and/or a description sentence corresponding to the dispute focus;
The training of the dispute focus data model according to the text sentences of the dispute focuses in the existing case documents comprises the following steps:
Intercepting text sentences corresponding to the dispute focuses from the existing case documents;
Setting description sentences corresponding to the dispute focuses according to the existing case documents;
Training a dispute focus data model according to the text sentences and the description sentences corresponding to the dispute focuses;
The training of the dispute focus data model according to the text sentences and the description sentences corresponding to the dispute focuses comprises the following steps:
Acquiring text sentences and description sentences which comprise the same dispute focus and correspond to the existing case documents;
Performing semantic analysis on the acquired text sentences and description sentences to generate fact attribute vectors corresponding to the dispute focus;
Generating a dispute focus data model according to the corresponding relation between the dispute focus and the fact attribute vector;
When receiving an instruction of acquiring legal knowledge points of a target case document, identifying a dispute focus in the target case document through the dispute focus data model;
And outputting the legal knowledge points corresponding to the dispute focus through the corresponding relation between the dispute focus and the legal knowledge points.
2. The method of claim 1, wherein identifying the dispute focus in the target case document through the dispute focus data model comprises:
Generating a text vector corresponding to the target case document;
Acquiring a fact attribute vector with the highest similarity to the text vector;
And determining the dispute focus corresponding to the acquired fact attribute vector as the dispute focus of the target case document.
3. The method according to claim 1 or 2, characterized in that the method further comprises:
Marking a text sentence containing the dispute focus in the target case document.
4. An apparatus for retrieving legal knowledge, comprising:
The training unit is used for training a dispute focus data model according to the text data corresponding to each dispute focus in the existing case document; the text data is used for explaining a corresponding dispute focus in the existing case document, the dispute focus is obtained by a legal expert through analysis and summarization of the existing case document, and the text data corresponding to the dispute focus is a text sentence and/or a description sentence corresponding to the dispute focus;
The training unit includes:
The intercepting module is used for intercepting text sentences corresponding to the dispute focuses from the existing case documents;
The setting module is used for setting descriptive sentences corresponding to the dispute focuses respectively according to the existing case documents;
The training module is used for training a dispute focus data model according to the text sentences and the description sentences corresponding to the dispute focuses;
The training module comprises:
The obtaining submodule is used for obtaining text sentences and description sentences which contain the same dispute focus and correspond to the existing case documents;
The generation submodule is used for carrying out semantic analysis on the acquired text sentences and description sentences to generate fact attribute vectors corresponding to the dispute focus;
The generation submodule is further used for generating the dispute focus data model according to the corresponding relation between the dispute focus and the fact attribute vector;
The identification unit is used for identifying a dispute focus in the target case document through the dispute focus data model when receiving an instruction to acquire legal knowledge points of the target case document;
And the output unit is used for outputting the legal knowledge points corresponding to the dispute focus through the corresponding relation between the dispute focus and the legal knowledge points.
5. The apparatus of claim 4, wherein the identification unit comprises:
The generating module is used for generating a text vector corresponding to the target case document;
The obtaining module is used for obtaining a fact attribute vector with the highest similarity to the text vector;
And the determining module is used for determining the dispute focus corresponding to the acquired fact attribute vector as the dispute focus of the target case document.
6. The apparatus of claim 4 or 5, further comprising:
And the marking unit is used for marking the text sentence containing the dispute focus in the target case document.
7. A storage medium, characterized in that the storage medium comprises a stored program, wherein, when the program runs, a device where the storage medium is located is controlled to execute the legal knowledge retrieval method of any one of claims 1 to 3.
8. A processor, characterized in that the processor is configured to execute a program, wherein the program executes the method for retrieving legal knowledge according to any one of claims 1 to 3.
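The training sub-steps recited in claim 1 (intercepting text sentences, setting description sentences, grouping them by the same dispute focus, and generating a fact attribute vector per focus) can be sketched as below. This is an illustrative sketch only: the `Annotation` record, the normalized term-frequency vector used in place of the unspecified semantic analysis, and the sample annotations are all assumptions.

```python
import re
from collections import Counter, defaultdict
from dataclasses import dataclass


@dataclass(frozen=True)
class Annotation:
    """One expert annotation of an existing case document (hypothetical record)."""
    focus: str          # dispute focus summarized by a legal expert
    text_sentence: str  # sentence intercepted from the case document
    description: str    # description sentence set for the dispute focus


def tokens(text):
    """Assumed tokenizer: lowercase alphabetic words."""
    return re.findall(r"[a-z]+", text.lower())


def build_model(annotations):
    """Group sentences by dispute focus and derive one fact attribute vector each."""
    grouped = defaultdict(list)
    for a in annotations:
        # Sentences from different case documents that share a focus are pooled.
        grouped[a.focus].extend([a.text_sentence, a.description])
    model = {}
    for focus, sentences in grouped.items():
        counts = Counter(t for s in sentences for t in tokens(s))
        total = sum(counts.values())
        # Normalized term frequencies stand in for the fact attribute vector.
        model[focus] = {t: c / total for t, c in counts.items()} if total else {}
    return model


# Hypothetical annotations, not taken from the patent.
ANNOTATIONS = [
    Annotation("overdue interest", "The interest rate exceeds the agreed rate.",
               "Dispute over how overdue interest is calculated."),
    Annotation("overdue interest", "The plaintiff claims interest from 2015.",
               "Dispute over the period for which interest accrues."),
    Annotation("contract validity", "The defendant denies signing the contract.",
               "Dispute over whether the contract was validly concluded."),
]
```

`build_model(ANNOTATIONS)` yields the correspondence between each dispute focus and its vector, which is the form of the dispute focus data model described in the claims.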
CN201611204508.9A 2016-12-23 2016-12-23 legal knowledge retrieval method and device Active CN108241621B (en)

Priority Applications (2)

Application Number Priority Date Filing Date Title
CN201611204508.9A CN108241621B (en) 2016-12-23 2016-12-23 legal knowledge retrieval method and device
PCT/CN2017/113804 WO2018113498A1 (en) 2016-12-23 2017-11-30 Method and apparatus for retrieving legal knowledge

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201611204508.9A CN108241621B (en) 2016-12-23 2016-12-23 legal knowledge retrieval method and device

Publications (2)

Publication Number Publication Date
CN108241621A CN108241621A (en) 2018-07-03
CN108241621B true CN108241621B (en) 2019-12-10

Family

ID=62624399

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201611204508.9A Active CN108241621B (en) 2016-12-23 2016-12-23 legal knowledge retrieval method and device

Country Status (2)

Country Link
CN (1) CN108241621B (en)
WO (1) WO2018113498A1 (en)

Families Citing this family (29)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110717609A (en) * 2018-07-12 2020-01-21 北京京东尚科信息技术有限公司 Method and device for predicting claims
CN109241528B (en) * 2018-08-24 2023-09-01 讯飞智元信息科技有限公司 Criminal investigation result prediction method, device, equipment and storage medium
CN109359175B (en) * 2018-09-07 2023-04-07 平安科技(深圳)有限公司 Electronic device, litigation data processing method, and storage medium
CN110969017B (en) * 2018-09-30 2024-02-20 北京国双科技有限公司 Judicial data processing method and system
CN109460468A (en) * 2018-10-23 2019-03-12 出门问问信息科技有限公司 Classifying method, categorization arrangement and the corresponding electronic equipment of law related text
CN109597889B (en) * 2018-11-19 2023-04-07 刘品新 Crime determining method and system based on text classification and deep neural network
CN111367879A (en) * 2018-12-26 2020-07-03 北京国双科技有限公司 Legal document processing method and device
CN111401047A (en) * 2018-12-29 2020-07-10 北京国双科技有限公司 Method and device for generating dispute focus of legal document and computer equipment
CN110532359A (en) * 2019-06-14 2019-12-03 平安科技(深圳)有限公司 Legal provision query method, apparatus, computer equipment and storage medium
CN112329436B (en) * 2019-07-30 2024-08-23 北京国双科技有限公司 Legal document element analysis method and system
CN112395388B (en) * 2019-08-16 2023-12-26 阿里巴巴集团控股有限公司 Information processing method and device
CN110825879B (en) * 2019-09-18 2024-05-07 平安科技(深圳)有限公司 Decide a case result determination method, device, equipment and computer readable storage medium
CN110795566A (en) * 2019-09-18 2020-02-14 平安科技(深圳)有限公司 Case recommendation method, device and equipment and computer-readable storage medium
CN110717041B (en) * 2019-09-19 2023-10-03 太极计算机股份有限公司 Case retrieval method and system
CN112561744A (en) * 2019-09-25 2021-03-26 北京国双科技有限公司 Method and device for generating similar case retrieval report
CN112580338A (en) * 2019-09-27 2021-03-30 北京国双科技有限公司 Method and device for determining dispute focus, storage medium and equipment
CN112579731A (en) * 2019-09-30 2021-03-30 北京国双科技有限公司 Data processing method and device
CN110928987B (en) * 2019-10-18 2023-07-25 平安科技(深圳)有限公司 Legal provision retrieval method and related equipment based on neural network hybrid model
CN110929039B (en) * 2019-10-18 2023-09-29 平安科技(深圳)有限公司 Data processing method, device, equipment and storage medium
CN112784034B (en) * 2019-11-01 2024-05-31 阿里巴巴集团控股有限公司 Digest generation method and device and computer equipment
CN111143550B (en) * 2019-11-27 2022-05-03 浙江大学 Method for automatically identifying dispute focus based on hierarchical attention neural network model
CN111695874B (en) * 2020-06-09 2023-08-11 山东交通学院 Judicial decision auxiliary system, judicial decision auxiliary method, judicial decision auxiliary equipment and storable medium
CN112395409A (en) * 2020-11-30 2021-02-23 重庆工程职业技术学院 Legal knowledge retrieval system and method
CN112487146B (en) * 2020-12-02 2022-05-31 重庆邮电大学 Legal case dispute focus acquisition method and device and computer equipment
CN112800746A (en) * 2021-01-28 2021-05-14 北京华宇元典信息服务有限公司 Text matching-based dispute focus recommendation method and device and electronic equipment
CN112950414B (en) * 2021-02-25 2023-04-18 华东师范大学 Legal text representation method based on decoupling legal elements
CN112989820B (en) * 2021-03-22 2022-12-02 平安国际智慧城市科技股份有限公司 Legal document positioning method, device, equipment and storage medium
CN115017917B (en) * 2022-08-09 2022-10-28 北京肇祺信息科技有限公司 Judgment document dispute focus identification method based on multi-head attention mechanism
CN118485054B (en) * 2024-07-15 2024-09-20 人民法院信息技术服务中心 Bulletin document generation method, device and equipment

Citations (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101408885A (en) * 2007-10-05 2009-04-15 富士通株式会社 Modeling topics using statistical distributions
CN102004774A (en) * 2010-11-16 2011-04-06 清华大学 Personalized user tag modeling and recommendation method based on unified probability model
CN102831558A (en) * 2012-07-20 2012-12-19 桂林电子科技大学 System and method for automatically scoring college English compositions independent of manual pre-scoring
CN104915396A (en) * 2015-05-28 2015-09-16 杭州电子科技大学 Knowledge retrieving method
CN105023214A (en) * 2015-07-17 2015-11-04 蓝舰信息科技南京有限公司 Title knowledge point intelligent recommending method
CN105095229A (en) * 2014-04-29 2015-11-25 国际商业机器公司 Method for training topic model, method for comparing document content and corresponding device
CN105893363A (en) * 2014-09-26 2016-08-24 北大方正集团有限公司 A method and a system for acquiring relevant knowledge points of a knowledge point
CN106095762A (en) * 2016-02-05 2016-11-09 中科鼎富(北京)科技发展有限公司 A kind of news based on ontology model storehouse recommends method and device

Family Cites Families (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101853449A (en) * 2010-06-18 2010-10-06 上海百事通信息技术有限公司 Legal question intelligent diagnosis method and system
CN104111933B (en) * 2013-04-17 2017-08-04 阿里巴巴集团控股有限公司 Obtain business object label, set up the method and device of training pattern
CN103995885B (en) * 2014-05-29 2017-11-17 百度在线网络技术(北京)有限公司 The recognition methods of physical name and device
CN104063427A (en) * 2014-06-06 2014-09-24 北京搜狗科技发展有限公司 Expression input method and device based on semantic understanding
CN104090863A (en) * 2014-07-24 2014-10-08 高德良 Intelligent legal instrument generating method and system
CN105930470B (en) * 2016-04-25 2019-03-26 安徽富驰信息技术有限公司 A kind of document retrieval method based on feature weight analytical technology

Also Published As

Publication number Publication date
CN108241621A (en) 2018-07-03
WO2018113498A1 (en) 2018-06-28

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
CB02 Change of applicant information

Address after: 100083 No. 401, 4th Floor, Haitai Building, 229 North Fourth Ring Road, Haidian District, Beijing

Applicant after: Beijing Guoshuang Technology Co.,Ltd.

Address before: 100086 Cuigong Hotel, 76 Zhichun Road, Shuangyushu District, Haidian District, Beijing

Applicant before: Beijing Guoshuang Technology Co.,Ltd.

GR01 Patent grant