CN111178817A - Method and device for obtaining referee results based on deep learning - Google Patents

Method and device for obtaining referee results based on deep learning Download PDF

Info

Publication number
CN111178817A
CN111178817A CN201811344580.0A CN201811344580A CN111178817A CN 111178817 A CN111178817 A CN 111178817A CN 201811344580 A CN201811344580 A CN 201811344580A CN 111178817 A CN111178817 A CN 111178817A
Authority
CN
China
Prior art keywords
vector
information
law
case
referee
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201811344580.0A
Other languages
Chinese (zh)
Inventor
佟津乐
朱元婧
谢海华
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
New Founder Holdings Development Co ltd
Original Assignee
Pku Founder Information Industry Group Co ltd
Peking University Founder Group Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Pku Founder Information Industry Group Co ltd, Peking University Founder Group Co Ltd filed Critical Pku Founder Information Industry Group Co ltd
Priority to CN201811344580.0A priority Critical patent/CN111178817A/en
Publication of CN111178817A publication Critical patent/CN111178817A/en
Pending legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q10/00Administration; Management
    • G06Q10/10Office automation; Time management
    • G06Q10/101Collaborative creation, e.g. joint development of products or services
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q50/00Information and communication technology [ICT] specially adapted for implementation of business processes of specific business sectors, e.g. utilities or tourism
    • G06Q50/10Services
    • G06Q50/18Legal services
    • G06Q50/182Alternative dispute resolution

Landscapes

  • Business, Economics & Management (AREA)
  • Engineering & Computer Science (AREA)
  • Human Resources & Organizations (AREA)
  • Entrepreneurship & Innovation (AREA)
  • Tourism & Hospitality (AREA)
  • Strategic Management (AREA)
  • General Business, Economics & Management (AREA)
  • General Physics & Mathematics (AREA)
  • Marketing (AREA)
  • Theoretical Computer Science (AREA)
  • Economics (AREA)
  • Physics & Mathematics (AREA)
  • Health & Medical Sciences (AREA)
  • General Health & Medical Sciences (AREA)
  • Primary Health Care (AREA)
  • Technology Law (AREA)
  • Data Mining & Analysis (AREA)
  • Operations Research (AREA)
  • Quality & Reliability (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

The invention provides a method and a device for acquiring a referee result based on deep learning, wherein the method comprises the following steps: acquiring the information of a legal provision, and acquiring the case information of the case to be processed according to the original text of the case to be processed; and processing the law information and the case information of the case to be processed through a judge model to obtain a judge result output by the judge model, wherein the judge model is obtained through deep learning training by taking the law information, the case information of at least one judge document and the judge result as samples. The scheme improves the treatment efficiency of the case.

Description

Judgment result obtaining method and device based on deep learning
Technical Field
The invention relates to the technical field of deep learning, in particular to a method and a device for acquiring a referee result based on deep learning.
Background
With the development of society and the perfection of laws, more cases need to be judged.
In the prior art, for each case needing referee, the referee is usually manually judged in a trial and error manner. However, the manual referee mode will cause the case processing efficiency to be low. Therefore, how to improve the processing efficiency of the case is the next problem to be solved urgently.
Disclosure of Invention
The invention provides a judging result obtaining method and device based on deep learning, and the case processing efficiency is improved.
In a first aspect, the present invention provides a method for obtaining a referee result based on deep learning, including:
acquiring the information of a legal provision, and acquiring the case information of the case to be processed according to the original text of the case to be processed;
and processing the law information and the case information of the case to be processed through a judge model to obtain a judge result output by the judge model, wherein the judge model is obtained through deep learning training by taking the law information, the case information of at least one judge document and the judge result as samples.
Further, before acquiring case information of the case to be processed according to the original text of the case to be processed, the method further includes:
identifying the amount information in the original text, and converting the amount information in an integer format;
further, before acquiring case information of the case to be processed according to the original text of the case to be processed, the method further includes:
identifying name information in the original text, and replacing the name information with the same name identifier;
further, before acquiring case information of the case to be processed according to the original text of the case to be processed, the method further includes:
and identifying the time information in the original text, and replacing the time information with the same time identifier.
Further, the processing the law information and the case information of the case to be processed through a referee model to obtain a referee result output by the referee model includes:
after a corresponding first text word vector matrix is obtained according to the case information of the case to be processed, respectively performing forward interpretation and reverse interpretation on the first text word vector matrix to obtain a first forward interpretation result and a first reverse interpretation result, splicing the first forward interpretation result and the first reverse interpretation result, and performing information extraction on spliced data to obtain a first text content vector;
after a corresponding first normal word vector matrix is obtained according to the normal information, information extraction is carried out on the first normal word vector matrix to obtain a first normal content vector;
fusing the first normal content vector and the first text content vector to obtain a first to-be-output vector;
and extracting a first arbitration vector from the first to-be-output vector, and analyzing the first arbitration vector to obtain the arbitration result.
Further, the extracting information of the first normal word vector matrix to obtain a first normal content vector includes:
processing the first normal word vector matrix through an attention-rnn submodel in the referee model to obtain a second normal content vector output by the attention-rnn submodel, wherein the shape of the second normal content vector is { N, D }, N is the number of normal, and D is the word vector dimension of the first normal word vector matrix;
performing the following operation according to the second normal content vector to extract information of the second normal content vector to obtain the first normal content vector:
Y3=Y2*Y1+(1-Y2)*X
wherein, Y1=relu(XM1+C1),Y2=sigmoid(Y1M2+XM3+C2) Wherein M is1、M2、M3Are all variable matrices of { D, D }, C1、C2Are all variable matrixes with the shape of { N, D }, X is the second normal content vector, Y3Is the first normal content vector.
Further, the fusing the first normal content vector and the first text content vector to obtain a first to-be-output vector includes:
performing the following operation according to the first normal content vector and the first text content vector to fuse the first normal content vector and the first text content vector to obtain the first to-be-output vector;
Y5=Y3Y4
wherein, Y4=M4YbWherein Y isbFor the first text content vector, M4Is a variable matrix of { D, N } shape, Y5The vector to be output is the first vector to be output.
Further, the method further comprises:
acquiring case information and a judge result of the law information and the at least one judge document;
after a corresponding second text word vector matrix is obtained according to case information of the at least one referee document, respectively performing forward interpretation and reverse interpretation on the second text word vector matrix to obtain a second forward interpretation result and a second reverse interpretation result, splicing the second forward interpretation result and the second reverse interpretation result, and performing information extraction on spliced data to obtain a second text content vector;
after a corresponding second normal word vector matrix is obtained according to the normal information, information extraction is carried out on the second normal word vector matrix to obtain a third normal content vector;
fusing the third normal content vector and the second text content vector to obtain a second vector to be output;
extracting a second arbitration vector from the second vector to be output;
calculating according to the second judgment vector and the vector corresponding to the judgment result of the judgment document to obtain a loss error; and carrying out learning training on the referee model by using the loss error until the referee model converges.
In a second aspect, the present invention provides an apparatus for obtaining a referee result based on deep learning, comprising:
the first acquisition unit is used for acquiring the information of the legal provision and acquiring the case information of the case to be processed according to the original text of the case to be processed;
and the second acquisition unit is used for processing the law information and the case information of the case to be processed through a referee model to obtain a referee result output by the referee model, wherein the referee model is obtained through deep learning training by taking the law information, the case information of at least one referee document and the referee result as samples.
Further, the apparatus further comprises: and the first processing unit is used for identifying the amount information in the original text and converting the amount information in an integer format.
Further, the apparatus further comprises: and the second processing unit is used for identifying the name information in the original text and replacing the name information with the same name identifier.
Further, the apparatus further comprises: and the third processing unit is used for identifying the time information in the original text and replacing the time information with the same time identifier.
Further, the second obtaining unit includes:
the first word vector conversion module is used for obtaining a corresponding first text word vector matrix according to the case information of the case to be processed;
the first reading module is used for respectively performing forward reading and reverse reading on the first text word vector matrix after the first word vector conversion module obtains the first text word vector matrix to obtain a first forward reading result and a first reverse reading result;
the first splicing module is used for splicing the first forward interpretation result and the first reverse interpretation result;
the first information extraction module is used for extracting information of the data spliced by the splicing module to obtain a first text content vector;
the second word vector conversion module is used for obtaining a corresponding first normal word vector matrix according to the normal information;
the second information extraction module is used for extracting information of the first normal word vector matrix after the second word vector conversion module obtains the first normal word vector matrix to obtain a first normal content vector;
the first fusion module is used for fusing the first normal content vector and the first text content vector to obtain a first to-be-output vector;
the first extraction module is used for extracting a first arbitration vector from the first vector to be output;
and the first analysis module is used for analyzing the first judgment vector to obtain the judgment result.
Further, the second information extraction module includes:
the first processing submodule is used for processing the first law bar word vector matrix through an attention-rnn submodel in the referee model to obtain a second law bar content vector output by the attention-rnn submodel, wherein the shape of the second law bar content vector is { N, D }, N is the number of law bars, and D is the word vector dimension of the first law bar word vector matrix;
the second processing sub-module is configured to perform the following operation according to the second normal content vector to extract information of the second normal content vector, so as to obtain the first normal content vector:
Y3=Y2*Y1+(1-Y2)*X
wherein, Y1=relu(XM1+C1),Y2=sigmoid(Y1M2+XM3+C2) Wherein M is1、M2、M3Are all variable matrices of { D, D }, C1、C2Are all variable matrixes with the shape of { N, D }, X is the second normal content vector, Y3Is the first normal content vector.
Further, the first fusion module is configured to perform the following operation according to the first normal content vector and the first text content vector, so as to fuse the first normal content vector and the first text content vector, and obtain the first to-be-output vector;
Y5=Y3Y4
wherein, Y4=M4YbWherein Y isbFor the first text content vector, M4Is a variable matrix of { D, N } shape, Y5The vector to be output is the first vector to be output.
Further, the apparatus further comprises: a third obtaining unit and a model training unit, wherein,
the third acquiring unit is used for acquiring the case information and the referee result of the law enforcement information and the at least one referee document;
the model training unit comprises:
the third word vector conversion module is used for obtaining a corresponding second text word vector matrix according to the case information of the at least one referee document;
the second reading module is used for respectively performing forward reading and reverse reading on the second text word vector matrix after the third word vector conversion module obtains the second text word vector matrix to obtain a second forward reading result and a second reverse reading result;
the second splicing module is used for splicing the second forward interpretation result and the second reverse interpretation result;
the third information extraction module is used for extracting information of the data obtained by splicing to obtain a second text content vector;
the fourth word vector conversion module is used for obtaining a corresponding second normal word vector matrix according to the normal information;
the fourth information extraction module is used for extracting information of the second normal word vector matrix after the fourth word vector conversion module obtains the second normal word vector matrix to obtain a third normal content vector;
the second fusion module is used for fusing the third normal content vector and the second text content vector to obtain a second vector to be output;
the second extraction module is used for extracting a second arbitration vector from the second vector to be output;
the optimization module is used for calculating and obtaining loss errors according to the second judging vector and the vector corresponding to the judging result of the judging document; and carrying out learning training on the referee model by using the loss error until the referee model converges.
In a third aspect, the present invention provides a referee result acquisition apparatus based on deep learning, including: a processor, a memory, and a computer program;
wherein the computer program is stored in the memory and configured to be executed by the processor to implement any of the methods of the first aspect.
In a fourth aspect, the invention provides a computer readable storage medium having stored thereon a computer program for execution by a processor to perform any of the methods of the first aspect.
The invention provides a method and a device for acquiring a referee result based on deep learning. According to the scheme, intelligent judgment is realized through a judgment model obtained through deep learning training based on input case information, so that case processing efficiency is effectively improved.
Drawings
The accompanying drawings, which are incorporated in and constitute a part of this specification, illustrate embodiments consistent with the present disclosure and together with the description, serve to explain the principles of the disclosure.
Fig. 1 is a flowchart of a referee result obtaining method based on deep learning according to an embodiment of the present invention;
fig. 2 is a flowchart of a referee result obtaining method based on deep learning according to a second embodiment of the present invention;
fig. 3 is a schematic structural diagram of a referee model according to a second embodiment of the present invention;
fig. 4 is a flowchart of a referee result acquisition method based on deep learning according to a third embodiment of the present invention;
fig. 5 is a schematic structural diagram of a referee result acquisition device based on deep learning according to a fourth embodiment of the present invention;
fig. 6 is a schematic structural diagram of a referee result acquisition device based on deep learning according to a fifth embodiment of the present invention;
fig. 7 is a schematic structural diagram of a referee result acquisition device based on deep learning according to a sixth embodiment of the present invention.
Detailed Description
In order to make the objects, technical solutions and advantages of the embodiments of the present invention clearer, the technical solutions in the embodiments of the present invention will be clearly and completely described below with reference to the drawings in the embodiments of the present invention, and it is obvious that the described embodiments are some, but not all, embodiments of the present invention. All other embodiments, which can be derived by a person skilled in the art from the embodiments given herein without making any creative effort, shall fall within the protection scope of the present invention.
Fig. 1 is a flowchart of a method for obtaining a referee result based on deep learning according to an embodiment of the present invention, which is exemplified by applying the method provided in the embodiment to a referee result obtaining apparatus based on deep learning, and as shown in fig. 1, the method may include:
step 101: and acquiring the information of the legal provision, and acquiring the case information of the case to be processed according to the original text of the case to be processed.
In practical applications, the execution subject of the embodiment may be a referee result acquisition device based on deep learning, and the referee result acquisition device may be program software, or a medium storing a related computer program, such as a usb disk; alternatively, the referee result acquisition device may also be a physical device integrated or installed with a related computer program, for example, a chip, an intelligent terminal, a computer, a server, and the like.
The law information in the scheme is used as the input of the referee model, so that the referee model is subjected to deep learning training or the output referee result is obtained through the processing of the referee model. In practical application, there may be a plurality of methods for acquiring the legal information for different scenes. Optionally, on the basis of any embodiment, for a scene of the 1 st deep learning training, acquiring the legal information may specifically include:
acquiring original information corresponding to the R legal items respectively;
performing word segmentation processing on original information corresponding to each of the R law bars to obtain a plurality of first words to be processed corresponding to each law bar;
according to Word identifications (Word ids) corresponding to different words, Word ids corresponding to a plurality of first words to be processed of each French are obtained and used as French information, and the French information is stored, wherein R is a positive integer.
For the following scenes of training and application of the referee model (for example, in the embodiment, the referee model is used for obtaining the referee result of the case to be processed), when obtaining the law enforcement information, the stored law enforcement information can be directly called without performing a series of processing operations again according to the original text of the law enforcement, so that the implementation mode greatly improves the training efficiency and the acquisition efficiency of the referee result, and improves the user experience.
In addition, in order to improve the processing capacity of the referee model, the method further comprises the following steps:
and taking the law containing the most Word ids as a reference, and adjusting the number of the Word ids respectively corresponding to other laws.
Specifically, for other laws, 0 may be complemented after the Word id corresponding thereto, respectively.
Examples are made in connection with actual scenarios: the original text in the scheme refers to original information related to the case, and the form of the original information is not limited, and the original information may be, for example, a paper case, an electronic case, and the like. Specifically, the original text in the embodiment is used for extracting case information, so that the original text processable by a computer can be obtained in different scenes, and further, the obtaining way can be various, for example, the entry of the original text can be performed according to case materials, and the accuracy of the embodiment is higher; or to perform word recognition on paper's case material to obtain the original text, this embodiment is more efficient.
The case information in the scheme is used as the input of the referee model, so that the referee model is subjected to deep learning training or the output referee result is obtained through the processing of the referee model. In practical application, the case information of a case can be obtained according to the obtained original text, and specific processing methods can be various. Optionally, on the basis of any embodiment, 101 may specifically include:
performing word segmentation processing on an original text of a case to be processed to obtain a plurality of second words to be processed;
and obtaining and using the word identifications corresponding to the second words to be processed as case information of the cases to be processed according to the word identifications corresponding to the different words.
Specifically, a maximum reverse matching method can be adopted for word segmentation, and the word segmentation algorithm is high in accuracy and beneficial to improving the accuracy of output of judgment results.
In addition, when the data volume of the original text is large, the data volume of the case information is also large, so to further improve the processing efficiency, the method further comprises the following steps:
and adjusting the word identification number of the case information to a preset threshold value.
Wherein the threshold value can be determined as required. The way of adjustment includes addition or deletion of word identity. In one case, when the number of word identifications corresponding to the case information is less than the threshold value, the word identifications corresponding to the case information may be supplemented. Alternatively, a preset identifier may be appended to the case information, and the preset identifier may be a predetermined letter identifier, a number identifier, or the like, and may be 0, for example. In practical applications, the word identifiers in the case information may be sorted, for example, the word identifiers may be arranged from front to back according to the obtaining order, and correspondingly, when the word identifier of the case information is appended, at least one 0 may be appended after the last word identifier until the word identifier of the case information is the threshold.
In another case, when the number of word identifications corresponding to the case information is greater than the threshold, the number of word identifications may be decremented to the threshold. Optionally, there may be multiple specific deletion methods, for example, words with smaller information amount (such as, e.g., ones, etc.) may be deleted from the word identifiers of the case information to improve accuracy. For another example, in combination with the foregoing example of sorting word identifiers, when a word identifier is deleted, the word identifier beyond the threshold may be directly deleted, so as to improve efficiency.
It should be noted that the term identifier in the present scheme is used to uniquely identify a single term, and the length of the term is not limited, and the term may be a term formed by at least one word. In addition, the execution sequence for acquiring the legal information and the case information in the scheme is not limited.
Step 102: and processing the law information and the case information of the case to be processed through a judge model to obtain a judge result output by the judge model, wherein the judge model is obtained through deep learning training by taking the law information, the case information of at least one judge document and the judge result as samples.
In practical application, the referee model may adopt different types of models, such as adding an attention-rnn submodel on the basis of the BiBloSAN model. The referee model is obtained by taking the law information, case information of at least one referee document and a referee result as samples and performing deep learning training. The judgment result in the scheme can comprise at least one of the following items: the name of the criminal suspect, whether to death or not, whether to untimely petition or not, the length of the criminal period, the amount of the penalty, the name of the crime and related legal provisions.
The embodiment provides a referee result acquisition method based on deep learning, which is characterized by acquiring case information according to an original text of a case to be processed, and then processing the case information and the acquired legal information through a referee model so as to obtain a referee result. According to the scheme, intelligent judgment is realized through a judgment model obtained through deep learning training based on input case information, so that case processing efficiency is effectively improved. In addition, the method and the device can also avoid the influence of subjective factors in the judging process based on the scheme, provide more objective judging results and are beneficial to improving the accuracy of the judging. In addition, when the model is trained, the law information is fused on the basis of the referee document, so that when the referee model predicts the law related to the case to be processed, the accuracy of law prediction can be improved, and then the accuracy of prediction such as crime name, penalty, criminal term and the like is indirectly improved, and therefore the accuracy of a referee result can be greatly improved.
Fig. 2 is a flowchart of a referee result obtaining method based on deep learning according to a second embodiment of the present invention, and as shown in fig. 2, the method may include:
step 201: identifying the amount information in the original text of the case to be processed, and converting the amount information in an integer format; and/or recognizing name information in the original text, and replacing the name information with the same name identifier; and/or identifying the time information in the original text, and replacing the time information with the same time identifier.
The conversion of the amount information in an integer format can specifically refer to the conversion of the amount information into an integer closest to the amount information; the same NAME identifier may be preset by the user according to actual requirements, for example, the same NAME identifier may be < NAME >; the same-TIME identifier may be preset by the user according to actual needs, for example, the same-TIME identifier may be < TIME >.
This embodiment takes as an example that the above three processes need to be performed simultaneously. For example, the original text of the case to be processed is that a national institute of civil inspection in the C area, B, C, of province A directs the victim to travel to the gate of a supermarket in the C area from 7 months and 4 days in 2016, the victim is stolen by a motorcycle (identified as the valuable RMB 3086 yuan) parked at the gate of the supermarket, and then the motorcycle paid is returned to the victim due to suspicious patterns and stolen by public security personnel. ", specifically, in the treatment, 3086 can be converted to 3000, Zhang III and Li IV are both replaced by < NAME >, and 2016 is replaced by < TIME > on 7/4 th. Thus, after the treatment, the original text of the case to be treated is changed into' instructing the notifier < NAME > to the gate of a supermarket in the C district of B City of A province to control the notifier < TIME > in the gate of the supermarket in the C district, stealing the victim < NAME > in a motorcycle (the value RMB is 3000 yuan after being identified) in the gate of the supermarket, and then, because of suspicious patterns, the motorcycle is stolen and obtained by the public security personnel, and the obtained motorcycle has returned the victim. ".
Step 202: and acquiring the information of the legal provision, and acquiring the case information of the case to be processed according to the original text acquired after processing.
It should be noted that fig. 2 only represents an execution sequence that may exist in the present embodiment, wherein the process of acquiring the legal information may also be executed in step 201, and so on.
On one hand, according to the first embodiment, the stored legal information can be directly called here.
On the other hand, when case information of a case to be processed is acquired, a word segmentation process is first performed on an original text obtained after the process to obtain a plurality of second words to be processed, which are ' a province ', ' B city ', ' C district ', ' people inspection house ', ' finger control ', ' advertiser ', ' < NAME > ', ' payment ', ' motorcycle ', ' hair returning ', ' victim ', ' for processing. '; and then converting each Word to be processed into a corresponding Word id according to the Word ids corresponding to different words, and assuming that the Word ids corresponding to the obtained multiple words to be processed are [79,123,1824,434,1112,4978.,. 65,7,45,236,426,56,21], finally judging whether the number of the obtained Word ids exceeds a preset threshold value, assuming that the threshold value is 30 and the number of the obtained Word ids is 31, deleting the 31 st Word id, namely 21 according to the front-to-back arrangement sequence, thus finally using [79,123,1824,434,1112,4978.,. 65,7,45,236,426,56] as case information of the case to be processed and inputting the case information into a judgment model.
Step 203: after a corresponding first text word vector matrix is obtained according to case information of a case to be processed, forward interpretation and reverse interpretation are respectively carried out on the first text word vector matrix to obtain a first forward interpretation result and a first reverse interpretation result, the first forward interpretation result and the first reverse interpretation result are spliced, information extraction is carried out on data obtained by splicing, and a first text content vector is obtained.
Fig. 3 is a schematic structural diagram of a referee model according to a second embodiment of the present invention, where the present solution is not limited to the referee model shown in fig. 3. In this embodiment, as shown in fig. 3, first, case information of a case to be processed is input to an Embedding layer Embedding of a referee model, each input Word id is converted into a Word vector through the layer, and a first text Word vector matrix is output to a full-connected layer 1 and a full-connected layer 2, on one hand, forward interpretation of the first text Word vector matrix is realized through the full-connected layer 1 and an mBiOSAwith forward mask, and a first forward interpretation result is obtained and output to a splice layer Concatenate; on the other hand, the reverse interpretation of the first text word vector matrix is realized through the full connection layer 2 and the mBiOSA with backward mask, a first reverse interpretation result is obtained and output to the concatemate, the concatemate layer realizes splicing, the spliced data is output to the Source to token self-entry of the information extraction layer, and the extraction of information is realized through the layer, so that the first text content vector is obtained.
Step 204: and after obtaining a corresponding first normal word vector matrix according to the normal information, extracting information of the first normal word vector matrix to obtain a first normal content vector.
As shown in fig. 3, the law information is input to an Embedding layer of the referee model, and similarly, a first law word vector matrix with a shape of { N, L, D } can be obtained, where N is the number of laws (e.g., total number 183), L is the maximum law word number, and D is a word vector dimension adopted by the Embedding layer, that is, a word vector dimension of the first law word vector matrix, and then information extraction is performed on the first law word vector matrix, and one implementation manner of obtaining the first law content vector may be:
processing the first normal word vector matrix through the attention-rnn submodel in fig. 3 to obtain a second normal content vector with { N, D } shape output by the attention-rnn submodel; performing the following operation according to the second normal content vector to extract information of the second normal content vector to obtain a first normal content vector:
Y3=Y2*Y1+(1-Y2)*X
wherein, Y1=relu(XM1+C1),Y2=sigmoid(Y1M2+XM3+C2) Wherein M is1、M2、M3Are all variable matrices of { D, D }, C1、C2Are all variable matrices with the shape of { N, D }, X is a second normal content vector, Y3For the first legal content vector, relu and sigmoid are two existing activation functions.
To the above-mentioned values it is worth mentioning:
first, assume that 183 french terms are adopted in this embodiment, and if after word segmentation, the number of first to-be-processed words obtained for the ith french term is 200 at most, and the numbers corresponding to other french terms are all less than 200, it is determined that the maximum number of french terms L is 200.
Second, before training the model for the 1 st time, five matrices may be preset to be M1、M2、M3、C1、C2And after each training is finished, learning and optimizing the last five matrixes respectively based on the loss errors obtained by the training, so that the five matrixes are all called as variable matrixes.
In addition, the execution sequence between step 203 and step 204 is not limited.
Step 205: and fusing the first normal content vector and the first text content vector to obtain a first to-be-output vector.
Specifically, one way to achieve fusion is: performing the following operation according to the first normal content vector and the first text content vector to fuse the first normal content vector and the first text content vector to obtain a first to-be-output vector;
Y5=Y3Y4
wherein, Y4=M4YbWherein Y isbIs a first text content vector, Y5Is the first vector to be output, M4Is a variable matrix with the shape of { D, N }, and after each training is completed, the last M is paired based on the current loss error, as described in the previous step4And (6) performing learning optimization.
Step 206: and extracting a first arbitration vector from the first vector to be output, and analyzing the first arbitration vector to obtain an arbitration result.
In this embodiment, the first referee vector can be extracted based on a preset referee dimension, wherein the referee dimension can be set according to the requirements of referee results, such as related law articles, criminal term, criminal name, penalty, and the like. In the embodiment, three referee dimensions of law, criminal name and criminal phase are preset as an example.
For example, the first to-be-output vector is {1, D }, the first arbitration vector with the shape of { D, 202} is obtained by multiplying the first to-be-output vector by the vector with the shape of { D, 202}, the first arbitration vector with the shape of {1, 183} is obtained by multiplying the first to-be-output vector by the vector with the shape of { D, 183}, the first arbitration vector with the shape of {1,3} is obtained by multiplying the first to-be-output vector by the vector with the shape of { D, 1}, the first arbitration vector with the shape of {1,1} is obtained by multiplying the first to-be-output vector by the vector with the shape of { D, 1}, wherein the first arbitration vector with the shape of {1,202} is arbitrated for the arbitrated dimension of the crime name, and the first arbitration vector with the shape of {1, 183} is for the arbitrated dimension of the concerned french, the first adjudication vector with the shape of {1,3} is for the criminal phase dimension, and the first adjudication vector with the shape of {1,1} is for the specific criminal phase length, that is, according to three adjudication dimensions, four first adjudication vectors are extracted.
When analyzing the names of the guilties, the first adjudication vector with the shape of {1,202} can be understood as that the vector consists of 1 row and 202 columns, the corresponding numerical value of each column is the predicted value of the corresponding name of the guilty, in addition, a guilty threshold value can be set in advance according to the actual requirement, the guilty threshold value is assumed to be 0.5, and in the analyzing process, if the predicted values corresponding to the 1 st column, the 3 rd column and the 140 th column are respectively 1, and the rest are all less than 0.5, the guilty names corresponding to the cases to be processed are the 1 st, the 3 rd and the 140 th guilty names in the preset guilty list.
The process of the law strip analysis is similar to that of the criminal name analysis, and is not described in detail here.
When analyzing the criminal period, first analyzing the first adjudication vector with the shape of {1,3}, wherein the vector is composed of 1 row and 3 columns, the 3 columns respectively correspond to the dead criminal, the dead criminal and the dead criminal, and the current period, if the dead criminal or the dead criminal is analyzed, directly outputting the adjudication result, and if the current period is analyzed, continuously analyzing the first adjudication vector with the shape of {1,1}, so as to output the time corresponding to the current period.
The embodiment carries out conversion processing of an integer format on the amount information in the original text of the case to be processed, carries out replacement processing on the name information and carries out replacement processing on the time information, so that a referee model can not generate a large-scale word list when processing the case information of the input case to be processed, thereby improving the processing capacity, further being capable of rapidly obtaining a final referee result, greatly reducing the error rate in the processing process, and further greatly improving the accuracy of a final output result. Moreover, in the embodiment, the corresponding first refereeing vector can be extracted through the referee dimension of the referee result, so that the referee result with multiple referee dimensions can be analyzed, and an objective criminal reference mode under multiple referee dimensions is provided for a judge.
Fig. 4 is a flowchart of a referee result obtaining method based on deep learning according to a third embodiment of the present invention, and as shown in fig. 4, the method includes:
step 401: and acquiring case information and a judge result of the law information and at least one judge document.
On one hand, according to the first embodiment, taking the 1 st training and R183 as an example, the obtained original texts of 183 french articles can be shown in table 1 below.
TABLE 1
Figure BDA0001863431390000131
Figure BDA0001863431390000141
The Word id converted from the first to-be-processed words corresponding to each law may be as shown in table 2 below.
TABLE 2
1 1,31,57,128,8876,….431,1,3,0
……
183 53,323,357,1728,3876,….342,41,311,1,3
And finally, filling the number which is less than the maximum number in the table 2, and obtaining and storing the filled legal information.
On the other hand, the form of data to be acquired from the official document may be as shown in the following table 3, but is not limited to table 1.
TABLE 3
Figure BDA0001863431390000142
In table 1, the fact part, that is, the case description part, is a part for specifically describing the corresponding case, for example, what time, what place, what happens, and the like, that is, the fact part corresponds to the case information in the embodiment; and the meta part comprises 7 items, wherein the last 6 items are the judgment results of the corresponding case, such as 3 years of criminal with period and the like.
After obtaining one piece of data from hundreds of thousands of referee documents respectively according to the data form of table 1, preprocessing (money information integer format conversion, word segmentation, etc.) can be performed on all pieces of data in advance as in the second embodiment to obtain and store case information of hundreds of thousands of referee documents, so that at least one case information can be selected at random directly from the stored case information of hundreds of thousands of referee documents each time when the referee model is trained, for example, 64 cases are selected at random this time, and the selected 64 cases are directly used for learning and training, thereby greatly improving the training efficiency of the referee model.
In addition, for the meta part of each piece of data, that is, the referee result part, the loss error is calculated according to the vector corresponding to the referee result, so that the related data of the meta part needs to be converted into a vector, and the data used for training the referee model can be shown in table 4.
TABLE 4
Figure BDA0001863431390000151
Step 402: and after a corresponding second text word vector matrix is obtained according to case information of at least one referee document, respectively performing forward interpretation and reverse interpretation on the second text word vector matrix to obtain a second forward interpretation result and a second reverse interpretation result, splicing the second forward interpretation result and the second reverse interpretation result, and performing information extraction on data obtained by splicing to obtain a second text content vector.
Specifically, reference may be made to the referee model shown in fig. 3, in this embodiment, the first text word vector matrix in fig. 3 should be a second text word vector matrix.
Step 403: and after a corresponding second normal word vector matrix is obtained according to the normal information, extracting information of the second normal word vector matrix to obtain a third normal content vector.
Similarly, in the present embodiment, the first normal word vector matrix in fig. 3 should be the second normal word vector matrix.
Step 404: and fusing the third normal content vector and the second text content vector to obtain a second vector to be output.
Step 405: and extracting a second arbitration vector from the second vector to be output.
Step 406: calculating according to the second judgment vector and the vector corresponding to the judgment result of the judgment document to obtain a loss error; and (4) learning and training the referee model by using the loss error until the referee model converges.
During model training, the second arbitration vector is no longer parsed, but rather the second arbitration vector and the vectors in table 4 are used to calculate the loss error. For example, the referee result includes three referee dimensions of the name of the crime, the law and the criminal phase, the second referee vector aiming at the referee dimension of the name of the crime can be obtained through the steps, the name of the crime loss can be obtained through calculation according to the second referee vector, the vector corresponding to the name of the crime in 64 referee documents and the preset crime loss function, and similarly, the name of the crime loss and the criminal phase loss can be obtained, then calculating the loss error of the judge model according to the weight of the criminal name loss, the law loss, the criminal period loss and each judge dimension, to optimize the referee model by using the loss error, the steps 401 to 406 are repeated until the referee model converges, for example, after repeatedly performing the above steps several times, the fluctuation of the loss error is small, so that the referee model can be determined to be converged, which is only an example, and the determination of whether the referee model is converged includes but is not limited thereto.
Step 407: and acquiring the information of the legal provision, and acquiring the case information of the case to be processed according to the original text of the case to be processed.
Step 408: and processing the law information and the case information of the case to be processed through the judging model to obtain a judging result output by the judging model.
In the embodiment, when the model is trained, the case information and the referee results of the law information and a large number of referee documents are fused, so that the accuracy of law forecast can be improved, and the accuracy of forecast of crime names, penalties, criminal periods and the like is further improved. Therefore, the scheme greatly improves the accuracy of judgment results.
Fig. 5 is a schematic structural diagram of a referee result acquisition device based on deep learning according to a fourth embodiment of the present invention, as shown in fig. 5, including:
the first obtaining unit 501 is configured to obtain the information of the law enforcement, and obtain the case information of the case to be processed according to the original text of the case to be processed;
a second obtaining unit 502, configured to process, by using a referee model, the law information and the case information of the case to be processed, and obtain a referee result output by the referee model, where the referee model is obtained through deep learning training by taking the law information, and the case information and the referee result of at least one referee document as samples.
In this embodiment, the judging result obtaining device based on deep learning of this embodiment can execute the method provided in the first embodiment of the present invention, and the implementation principles thereof are similar, and are not described herein again.
In this embodiment, for a case to be processed, case information is obtained according to an original text of the case to be processed, and then the case information and the obtained legal information are processed through a referee model, so as to obtain a referee result. According to the scheme, intelligent judgment is realized through a judgment model obtained through deep learning training based on input case information, so that case processing efficiency is effectively improved. In addition, the method and the device can also avoid the influence of subjective factors in the judging process based on the scheme, provide more objective judging results and are beneficial to improving the accuracy of the judging. In addition, when the model is trained, the law information is fused on the basis of the referee document, so that when the referee model predicts the law related to the case to be processed, the accuracy of law prediction can be improved, and then the accuracy of prediction such as crime name, penalty, criminal term and the like is indirectly improved, and therefore the accuracy of a referee result can be greatly improved.
Fig. 6 is a schematic structural diagram of a referee result acquisition device based on deep learning according to a fifth embodiment of the present invention, and based on a fourth embodiment, as shown in fig. 6,
the device further comprises: a first processing unit 601, configured to identify amount information in the original text, and perform integer format conversion on the amount information;
further, the apparatus further comprises: a second processing unit 602, configured to identify name information in the original text, and replace the name information with a same name identifier;
further, the apparatus further comprises: a third processing unit 603, configured to identify time information in the original text, and replace the time information with the same time identifier.
The second obtaining unit 502 includes:
a first word vector conversion module 5021, configured to obtain a corresponding first text word vector matrix according to the case information of the case to be processed;
the first interpretation module 5022 is configured to perform forward interpretation and reverse interpretation on the first text word vector matrix respectively after the first word vector conversion module obtains the first text word vector matrix, so as to obtain a first forward interpretation result and a first reverse interpretation result;
a first splicing module 5023, configured to splice the first forward interpretation result and the first reverse interpretation result;
the first information extraction module 5024 is used for extracting information of the data spliced by the splicing module to obtain a first text content vector;
the second word vector conversion module 5025 is used for obtaining a corresponding first normal word vector matrix according to the normal information;
a second information extraction module 5026, configured to extract information of the first normal word vector matrix after the second word vector conversion module obtains the first normal word vector matrix, so as to obtain a first normal content vector;
a first fusion module 5027, configured to fuse the first normal content vector and the first text content vector to obtain a first to-be-output vector;
a first extracting module 5028, configured to extract a first arbitration vector from the first to-be-output vector;
a first parsing module 5029, configured to parse the first arbitration vector to obtain the arbitration result.
Further, the second information extraction module 5026 includes:
the first processing submodule is used for processing the first law bar word vector matrix through an attention-rnn submodel in the referee model to obtain a second law bar content vector output by the attention-rnn submodel, wherein the shape of the second law bar content vector is { N, D }, N is the number of law bars, and D is the word vector dimension of the first law bar word vector matrix;
the second processing sub-module is configured to perform the following operation according to the second normal content vector to extract information of the second normal content vector, so as to obtain the first normal content vector:
Y3=Y2*Y1+(1-Y2)*X
wherein, Y1=relu(XM1+C1),Y2=sigmoid(Y1M2+XM3+C2) Wherein M is1、M2、M3Are all variable matrices of { D, D }, C1、C2Are all variable matrixes with the shape of { N, D }, X is the second normal content vector, Y3Is the first normal content vector.
Further, the first fusion module 5027 is configured to perform the following operation according to the first normal content vector and the first text content vector, so as to fuse the first normal content vector and the first text content vector to obtain the first to-be-output vector;
Y5=Y3Y4
wherein, Y4=M4YbWherein Y isbFor the first text content vector, M4Is a variable matrix of { D, N } shape, Y5The vector to be output is the first vector to be output.
In this embodiment, the referee result acquisition device based on deep learning of this embodiment can execute the method provided in the second embodiment of the present invention, and the implementation principles thereof are similar, and are not described herein again.
The embodiment carries out conversion processing of an integer format on the amount information in the original text of the case to be processed, carries out replacement processing on the name information and carries out replacement processing on the time information, so that a referee model can not generate a large-scale word list when processing the case information of the input case to be processed, thereby improving the processing capacity, further being capable of rapidly obtaining a final referee result, greatly reducing the error rate in the processing process, and further greatly improving the accuracy of a final output result. Moreover, in the embodiment, the corresponding first refereeing vector can be extracted through the referee dimension of the referee result, so that the referee result with multiple referee dimensions can be analyzed, and an objective criminal reference mode under multiple referee dimensions is provided for a judge.
Fig. 7 is a schematic structural diagram of an apparatus for obtaining a referee result based on deep learning according to a sixth embodiment of the present invention, and based on the fourth embodiment, as shown in fig. 7,
the device further comprises: a third acquisition unit 701 and a model training unit 702, wherein,
the third obtaining unit 701 is configured to obtain the case information and the referee result of the law enforcement information and the at least one referee document;
the model training unit 702 includes:
a third word vector conversion module 7021, configured to obtain a corresponding second text word vector matrix according to case information of the at least one referee document;
a second interpretation module 7022, configured to perform forward interpretation and reverse interpretation on the second text word vector matrix respectively after the third word vector conversion module obtains the second text word vector matrix, so as to obtain a second forward interpretation result and a second reverse interpretation result;
a second splicing module 7023, configured to splice the second forward interpretation result and the second backward interpretation result;
a third information extraction module 7024, configured to perform information extraction on the data obtained by splicing to obtain a second text content vector;
a fourth word vector conversion module 7025, configured to obtain a corresponding second normal word vector matrix according to the normal information;
a fourth information extraction module 7026, configured to, after the fourth word vector conversion module obtains the second normal word vector matrix, perform information extraction on the second normal word vector matrix to obtain a third normal content vector;
a second fusion module 7027, configured to fuse the third normal content vector and the second text content vector to obtain a second to-be-output vector;
a second extracting module 7028, configured to extract a second arbitration vector from the second vector to be output;
an optimizing module 7029, configured to obtain a loss error by calculating according to the second referee vector and a vector corresponding to a referee result of the referee document; and carrying out learning training on the referee model by using the loss error until the referee model converges.
In this embodiment, the referee result acquisition device based on deep learning of this embodiment can execute the method provided in the third embodiment of the present invention, and the implementation principles thereof are similar, and are not described herein again.
In the embodiment, when the model is trained, the case information and the referee results of the law information and a large number of referee documents are fused, so that the accuracy of law forecast can be improved, and the accuracy of forecast of crime names, penalties, criminal periods and the like is further improved. Therefore, the scheme greatly improves the accuracy of judgment results.
The invention provides a referee result acquisition device based on deep learning, which comprises: a processor, a memory, and a computer program;
wherein the computer program is stored in the memory and configured to be executed by the processor to implement the method provided by any of the first to third embodiments.
The invention provides a computer-readable storage medium on which a computer program is stored, the computer program being executed by a processor to implement the method provided by any one of the first to third embodiments.
Other embodiments of the disclosure will be apparent to those skilled in the art from consideration of the specification and practice of the disclosure disclosed herein. This application is intended to cover any variations, uses, or adaptations of the disclosure following, in general, the principles of the disclosure and including such departures from the present disclosure as come within known or customary practice within the art to which the disclosure pertains. It is intended that the specification and examples be considered as exemplary only, with a true scope and spirit of the disclosure being indicated by the following claims.
It will be understood that the present disclosure is not limited to the precise arrangements described above and shown in the drawings and that various modifications and changes may be made without departing from the scope thereof. The scope of the present disclosure is limited only by the appended claims.

Claims (10)

1.一种基于深度学习的裁判结果获取方法,其特征在于,包括:1. A method for obtaining referee results based on deep learning, comprising: 获取法条信息,并根据待处理案件的原始文本,获取所述待处理案件的案件信息;Obtain legal information, and obtain the case information of the pending case according to the original text of the pending case; 通过裁判模型对所述法条信息和所述待处理案件的案件信息进行处理,获得所述裁判模型输出的裁判结果,其中所述裁判模型是以法条信息、以及至少一个裁判文书的案件信息和裁判结果为样本,经过深度学习训练获得的。The legal article information and the case information of the pending case are processed through the adjudication model, and the adjudication result output by the adjudication model is obtained, wherein the adjudication model is based on the legal article information and the case information of at least one adjudication document The results of the referee and the referee are samples obtained through deep learning training. 2.根据权利要求1所述的方法,其特征在于,所述根据待处理案件的原始文本,获取所述待处理案件的案件信息之前,还包括:2. The method according to claim 1, wherein before acquiring the case information of the pending case according to the original text of the pending case, the method further comprises: 识别所述原始文本中的金额信息,对所述金额信息进行整数格式的转化;和/或,Identify the amount information in the original text, and convert the amount information to an integer format; and/or, 识别所述原始文本中的姓名信息,将所述姓名信息替换为同一姓名标识;和/或,Identify the name information in the original text, and replace the name information with the same name identification; and/or, 识别所述原始文本中的时间信息,将所述时间信息替换为同一时间标识。Identify the time information in the original text, and replace the time information with the same time stamp. 3.根据权利要求1所述的方法,其特征在于,所述通过裁判模型对所述法条信息和所述待处理案件的案件信息进行处理,获得所述裁判模型输出的裁判结果,包括:3 . The method according to claim 1 , wherein the processing of the legal article information and the case information of the pending case through the adjudication model to obtain the adjudication result output by the adjudication model comprises: 3 . 根据所述待处理案件的案件信息,获得相应的第一文本词向量矩阵后,对所述第一文本词向量矩阵分别进行正向解读和反向解读,获得第一正向解读结果和第一反向解读结果,将所述第一正向解读结果和所述第一反向解读结果进行拼接,对拼接获得的数据进行信息抽取,获得第一文本内容向量;According to the case information of the case to be processed, after obtaining the corresponding first text word vector matrix, perform forward interpretation and reverse interpretation on the first text word vector matrix, respectively, to obtain the first forward interpretation result and the first Reverse interpretation result, splicing the first forward interpretation result and the first reverse interpretation result, extracting information from the data obtained by splicing, and obtaining a first text content vector; 根据所述法条信息,获得相应的第一法条词向量矩阵后,对所述第一法条词向量矩阵进行信息抽取,获得第一法条内容向量;After obtaining the corresponding first legal article word vector matrix according to the legal article information, extract information on the first legal article word vector matrix to obtain the first legal article content vector; 对所述第一法条内容向量和所述第一文本内容向量进行融合,获得第一待输出向量;merging the first law content vector and the first text content vector to obtain the first to-be-output vector; 从所述第一待输出向量中提取出第一裁定向量,对所述第一裁定向量进行解析,获得所述裁判结果。A first adjudication vector is extracted from the first to-be-output vector, and the first adjudication vector is parsed to obtain the adjudication result. 4.根据权利要求3所述的方法,其特征在于,所述对所述第一法条词向量矩阵进行信息抽取,获得第一法条内容向量,包括:4. The method according to claim 3, wherein the information extraction is performed on the first law word vector matrix to obtain the first law content vector, comprising: 通过所述裁判模型中的attention-rnn子模型对所述第一法条词向量矩阵进行处理,获得所述attention-rnn子模型输出的第二法条内容向量,所述第二法条内容向量的形状为{N,D},其中,N为法条的数量,D为所述第一法条词向量矩阵的词向量维度;The first law word vector matrix is processed by the attention-rnn sub-model in the referee model, and the second law content vector output by the attention-rnn sub-model is obtained. The second law content vector The shape of is {N, D}, wherein, N is the number of laws, and D is the word vector dimension of the first law word vector matrix; 根据所述第二法条内容向量作如下运算,以对所述第二法条内容向量进行信息抽取,获得所述第一法条内容向量:Perform the following operations according to the second law content vector to extract information from the second law content vector to obtain the first law content vector: Y3=Y2*Y1+(1-2)*XY 3 =Y 2 *Y 1 +(1- 2 )*X 其中,Y1=relu(XM1+C1),Y2=sigmoid(Y1M2+XM3+C2),其中,M1、M2、M3均为形状为{D,D}的可变矩阵,C1、C2均为形状为{N,D}的可变矩阵,X为所述第二法条内容向量,Y3为所述第一法条内容向量。Wherein, Y 1 =relu(XM 1 +C 1 ), Y 2 =sigmoid(Y 1 M 2 +XM 3 +C 2 ), where M 1 , M 2 , and M 3 are all shaped {D,D} The variable matrix of , C 1 and C 2 are both variable matrices of shape {N, D}, X is the content vector of the second normal, and Y 3 is the content vector of the first normal. 5.根据权利要求4所述的方法,其特征在于,所述对所述第一法条内容向量和所述第一文本内容向量进行融合,获得第一待输出向量,包括:5 . The method according to claim 4 , wherein the fusion of the first law content vector and the first text content vector to obtain the first to-be-output vector comprises: 6 . 根据所述第一法条内容向量和所述第一文本内容向量作如下运算,以对所述第一法条内容向量和所述第一文本内容向量进行融合,获得所述第一待输出向量;Perform the following operations according to the first law content vector and the first text content vector to fuse the first law content vector and the first text content vector to obtain the first to-be-output vector ; Y5=Y3Y4 Y 5 =Y 3 Y 4 其中,Y4=M4Yb,其中,Yb为所述第一文本内容向量,M4为形状为{D,N}的可变矩阵,Y5为所述第一待输出向量。Wherein, Y 4 =M 4 Y b , where Y b is the first text content vector, M 4 is a variable matrix with a shape of {D,N}, and Y 5 is the first vector to be output. 6.根据权利要求1至5任一项所述的方法,其特征在于,所述方法还包括:6. The method according to any one of claims 1 to 5, wherein the method further comprises: 获取所述法条信息和所述至少一个裁判文书的案件信息和裁判结果;Obtain the legal information and the case information and judgment result of the at least one judgment document; 根据所述至少一个裁判文书的案件信息,获得相应的第二文本词向量矩阵后,对所述第二文本词向量矩阵分别进行正向解读和反向解读,获得第二正向解读结果和第二反向解读结果,将所述第二正向解读结果和所述第二反向解读结果进行拼接,对拼接获得的数据进行信息抽取,获得第二文本内容向量;After obtaining the corresponding second text word vector matrix according to the case information of the at least one judgment document, perform forward interpretation and reverse interpretation on the second text word vector matrix, respectively, to obtain the second forward interpretation result and the first 2. Reverse interpretation result, splicing the second forward interpretation result and the second reverse interpretation result, extracting information from the data obtained by splicing, and obtaining a second text content vector; 根据所述法条信息,获得相应的第二法条词向量矩阵后,对所述第二法条词向量矩阵进行信息抽取,获得第三法条内容向量;After obtaining the corresponding second legal article word vector matrix according to the legal article information, perform information extraction on the second legal article word vector matrix to obtain the third legal article content vector; 对所述第三法条内容向量和所述第二文本内容向量进行融合,获得第二待输出向量;merging the third law content vector and the second text content vector to obtain a second to-be-output vector; 从所述第二待输出向量中提取第二裁定向量;extracting a second arbitration vector from the second to-be-output vector; 根据所述第二裁定向量和所述裁判文书的裁判结果对应的向量计算获得损失误差;利用所述损失误差,对所述裁判模型进行学习训练,直至所述裁判模型收敛。A loss error is obtained by calculating according to the second adjudication vector and a vector corresponding to the adjudication result of the adjudication document; using the loss error, the adjudication model is learned and trained until the adjudication model converges. 7.一种基于深度学习的裁判结果获取装置,其特征在于,包括:7. A device for obtaining referee results based on deep learning, comprising: 第一获取单元,用于获取法条信息,并根据待处理案件的原始文本,获取所述待处理案件的案件信息;a first obtaining unit, used for obtaining legal information, and obtaining the case information of the pending case according to the original text of the pending case; 第二获取单元,用于通过裁判模型对所述法条信息和所述待处理案件的案件信息进行处理,获得所述裁判模型输出的裁判结果,其中所述裁判模型是以法条信息、以及至少一个裁判文书的案件信息和裁判结果为样本,经过深度学习训练获得的。The second obtaining unit is configured to process the legal article information and the case information of the pending case through a referee model, and obtain a referee result output by the referee model, wherein the referee model is based on the legal article information, and The case information and adjudication results of at least one adjudication document are samples obtained through deep learning training. 8.根据权利要求7所述的装置,其特征在于,所述第二获取单元,包括:8. The apparatus according to claim 7, wherein the second acquiring unit comprises: 第一词向量转换模块,用于根据所述待处理案件的案件信息,获得相应的第一文本词向量矩阵;a first word vector conversion module, configured to obtain a corresponding first text word vector matrix according to the case information of the case to be processed; 第一解读模块,用于在所述第一词向量转换模块获得所述第一文本词向量矩阵后,对所述第一文本词向量矩阵分别进行正向解读和反向解读,获得第一正向解读结果和第一反向解读结果;The first interpretation module is configured to perform forward interpretation and reverse interpretation on the first text word vector matrix after the first word vector conversion module obtains the first text word vector matrix, and obtain the first positive interpretation. To interpret the result and the first reverse interpret the result; 第一拼接模块,用于将所述第一正向解读结果和所述第一反向解读结果进行拼接;a first splicing module for splicing the first forward interpretation result and the first reverse interpretation result; 第一信息抽取模块,用于对所述拼接模块拼接获得的数据进行信息抽取,获得第一文本内容向量;a first information extraction module, configured to perform information extraction on the data spliced by the splicing module to obtain a first text content vector; 第二词向量转换模块,用于根据所述法条信息,获得相应的第一法条词向量矩阵;The second word vector conversion module is used to obtain the corresponding first law word vector matrix according to the law information; 第二信息抽取模块,用于在所述第二词向量转换模块获得所述第一法条词向量矩阵后,对所述所述第一法条词向量矩阵进行信息抽取,获得第一法条内容向量;The second information extraction module is configured to perform information extraction on the first law word vector matrix after the second word vector conversion module obtains the first law word vector matrix to obtain the first law content vector; 第一融合模块,用于对所述第一法条内容向量和所述第一文本内容向量进行融合,获得第一待输出向量;a first fusion module, configured to fuse the first law content vector and the first text content vector to obtain a first to-be-output vector; 第一提取模块,用于从所述第一待输出向量中提取出第一裁定向量;a first extraction module, configured to extract a first arbitration vector from the first to-be-output vector; 第一解析模块,用于对所述第一裁定向量进行解析,获得所述裁判结果。A first parsing module, configured to parse the first adjudication vector to obtain the adjudication result. 9.根据权利要求8所述的装置,其特征在于,所述第二信息抽取模块,包括:9. The apparatus according to claim 8, wherein the second information extraction module comprises: 第一处理子模块,用于通过所述裁判模型中的attention-rnn子模型对所述第一法条词向量矩阵进行处理,获得所述attention-rnn子模型输出的第二法条内容向量,所述第二法条内容向量的形状为{N,D},其中,N为法条的数量,D为所述第一法条词向量矩阵的词向量维度;The first processing sub-module is used to process the first law word vector matrix through the attention-rnn sub-model in the referee model, and obtain the second law content vector output by the attention-rnn sub-model, The shape of the second law content vector is {N, D}, where N is the number of law, and D is the word vector dimension of the first law word vector matrix; 第二处理子模块,用于根据所述第二法条内容向量作如下运算,以对所述第二法条内容向量进行信息抽取,获得所述第一法条内容向量:The second processing sub-module is configured to perform the following operations according to the second law content vector to extract information from the second law content vector to obtain the first law content vector: Y3=Y2*Y1+(1-Y2)*XY 3 =Y 2 *Y 1 +(1-Y 2 )*X 其中,Y1=relu(XM1+C1),Y2=sigmoid(Y1M2+XM3+C2),其中,M1、M2、M3均为形状为{D,D}的可变矩阵,C1、C2均为形状为{N,D}的可变矩阵,X为所述第二法条内容向量,Y3为所述第一法条内容向量。Wherein, Y 1 =relu(XM 1 +C 1 ), Y 2 =sigmoid(Y 1 M 2 +XM 3 +C 2 ), where M 1 , M 2 , and M 3 are all shaped {D,D} The variable matrix of , C 1 and C 2 are both variable matrices of shape {N, D}, X is the content vector of the second normal, and Y 3 is the content vector of the first normal. 10.根据权利要求8所述的装置,其特征在于,所述第一融合模块,用于根据所述第一法条内容向量和所述第一文本内容向量作如下运算,以对所述第一法条内容向量和所述第一文本内容向量进行融合,获得所述第一待输出向量;10. The apparatus according to claim 8, wherein the first fusion module is configured to perform the following operation according to the first law content vector and the first text content vector, to A rule content vector is fused with the first text content vector to obtain the first to-be-output vector; Y5=Y3Y4 Y 5 =Y 3 Y 4 其中,Y4=M4Yb,其中,Yb为所述第一文本内容向量,M4为形状为{D,N}的可变矩阵,Y5为所述第一待输出向量。Wherein, Y 4 =M 4 Y b , where Y b is the first text content vector, M 4 is a variable matrix with a shape of {D,N}, and Y 5 is the first vector to be output.
CN201811344580.0A 2018-11-13 2018-11-13 Method and device for obtaining referee results based on deep learning Pending CN111178817A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201811344580.0A CN111178817A (en) 2018-11-13 2018-11-13 Method and device for obtaining referee results based on deep learning

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201811344580.0A CN111178817A (en) 2018-11-13 2018-11-13 Method and device for obtaining referee results based on deep learning

Publications (1)

Publication Number Publication Date
CN111178817A true CN111178817A (en) 2020-05-19

Family

ID=70653597

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201811344580.0A Pending CN111178817A (en) 2018-11-13 2018-11-13 Method and device for obtaining referee results based on deep learning

Country Status (1)

Country Link
CN (1) CN111178817A (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111815485A (en) * 2020-06-12 2020-10-23 中国司法大数据研究院有限公司 Sentencing prediction method and device based on deep learning BERT model

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107818138A (en) * 2017-09-28 2018-03-20 银江股份有限公司 A kind of case legal regulation recommends method and system
CN107918921A (en) * 2017-11-21 2018-04-17 南京擎盾信息科技有限公司 Criminal case court verdict measure and system
CN108133436A (en) * 2017-11-23 2018-06-08 科大讯飞股份有限公司 Automatic method and system of deciding a case

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107818138A (en) * 2017-09-28 2018-03-20 银江股份有限公司 A kind of case legal regulation recommends method and system
CN107918921A (en) * 2017-11-21 2018-04-17 南京擎盾信息科技有限公司 Criminal case court verdict measure and system
CN108133436A (en) * 2017-11-23 2018-06-08 科大讯飞股份有限公司 Automatic method and system of deciding a case

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111815485A (en) * 2020-06-12 2020-10-23 中国司法大数据研究院有限公司 Sentencing prediction method and device based on deep learning BERT model

Similar Documents

Publication Publication Date Title
CN109871532B (en) Text theme extraction method and device and storage medium
CN109872162B (en) Wind control classification and identification method and system for processing user complaint information
CN107122375A (en) The recognition methods of image subject based on characteristics of image
CN111222305A (en) Information structuring method and device
WO2021043076A1 (en) Method and apparatus for processing network data to be published, and computer device and storage medium
CN111190997A (en) A Question Answering System Implementation Method Using Neural Networks and Machine Learning Sorting Algorithms
CN108984555B (en) User state mining and information recommendation method, device and equipment
CN111260220B (en) Group control equipment identification method and device, electronic equipment and storage medium
CN110619064A (en) Case studying and judging method and device based on deep learning
CN112347223A (en) Document retrieval method, document retrieval equipment and computer-readable storage medium
CN112507912B (en) Method and device for identifying illegal pictures
CN112801784A (en) Bit currency address mining method and device for digital currency exchange
CN111475648B (en) Text classification model generation method, text classification device and equipment
CN112347254A (en) News text classification method and device, computer equipment and storage medium
CN113761204B (en) Emoji text emotion analysis method and system based on deep learning
CN111178817A (en) Method and device for obtaining referee results based on deep learning
CN118643180A (en) Image retrieval method, system, device and storage medium
CN117951308A (en) Zero sample knowledge graph completion method and device
CN113935387A (en) Text similarity determination method and device and computer readable storage medium
CN117633358A (en) Content recommendation method, content recommendation device, and storage medium
CN110377819A (en) Arbitrator&#39;s recommended method, device and computer equipment based on big data
CN117009528A (en) Business processing method, device, equipment and medium based on natural language processing
CN112686339B (en) Case routing determination method and device based on appeal
CN113051869B (en) Method and system for realizing identification of text difference content by combining semantic recognition
KR102265947B1 (en) Method and apparatus for providing information based on machine learning

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
TA01 Transfer of patent application right
TA01 Transfer of patent application right

Effective date of registration: 20230703

Address after: 3007, Hengqin International Financial Center Building, No. 58 Huajin Street, Hengqin New District, Zhuhai City, Guangdong Province, 519030

Applicant after: New founder holdings development Co.,Ltd.

Address before: 100871, Beijing, Haidian District, Cheng Fu Road, No. 298, Zhongguancun Fangzheng building, 9 floor

Applicant before: PEKING UNIVERSITY FOUNDER GROUP Co.,Ltd.

Applicant before: PKU FOUNDER INFORMATION INDUSTRY GROUP CO.,LTD.

RJ01 Rejection of invention patent application after publication
RJ01 Rejection of invention patent application after publication

Application publication date: 20200519