CN109933789A - A kind of judicial domain Relation extraction method and system neural network based - Google Patents

A kind of judicial domain Relation extraction method and system neural network based Download PDF

Info

Publication number
CN109933789A
CN109933789A CN201910145396.1A CN201910145396A CN109933789A CN 109933789 A CN109933789 A CN 109933789A CN 201910145396 A CN201910145396 A CN 201910145396A CN 109933789 A CN109933789 A CN 109933789A
Authority
CN
China
Prior art keywords
vector
corpus
judicial domain
relation extraction
feature set
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201910145396.1A
Other languages
Chinese (zh)
Other versions
CN109933789B (en
Inventor
梁庆中
王晨威
郑坤
姚宏
刘超
董理君
康晓军
李新川
李圣文
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
China University of Geosciences
Original Assignee
China University of Geosciences
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by China University of Geosciences filed Critical China University of Geosciences
Priority to CN201910145396.1A priority Critical patent/CN109933789B/en
Publication of CN109933789A publication Critical patent/CN109933789A/en
Application granted granted Critical
Publication of CN109933789B publication Critical patent/CN109933789B/en
Expired - Fee Related legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Landscapes

  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
  • Machine Translation (AREA)

Abstract

A kind of judicial domain Relation extraction method and system neural network based, the present invention is in original open neural network Relation extraction frame foundation, construct judicial domain dedicated data set, and form judicial domain charge feature set, the method for improving Relation extraction accuracy rate by optimization neural network: first, it is netted from Chinese judgement document and obtains a large amount of judicial domain correlation non-structured texts, and with Word2Vec model, the vector that the vectors transformation model such as Bert model obtains text is indicated;Secondly, by non-structured text carry out TF-IDF word frequency statistics, obtain different charges and case by feature set, and obtain vector expression;Then optimize OpenNRE model and JointNRE model, obtain the higher JudNRE model of accuracy;Finally, charge feature vector vector is handled using JudNRE model to text vector, judicial domain Relation extraction model is obtained, for carrying out judicial domain Relation extraction to judicial domain unstructured text data to be processed, obtains corresponding entity triple.

Description

A kind of judicial domain Relation extraction method and system neural network based
Technical field
The present invention relates to Relation extraction fields, more specifically to a kind of judicial domain relationship neural network based Abstracting method and system.
Background technique
In recent years, with the rapid development of artificial intelligence technology, more and more research work are put to practical application ?.A kind of strong form of expression of the Relation extraction as natural language processing technique, also along with knowledge mapping research Intimately it is concerned.For practicability, Relation extraction extracts useful information as one from non-structured text, quickly The accurate technology for obtaining structured data information, can effectively mitigate the burden of manual analysis mass data text.
Instantly, it is based primarily upon single language text for the Relation extraction of specific area, and research surrounds English exhibition mostly It opens.Procuratorate handles a case at present, and there is still a need for carry out artificial screening to a large amount of judgement document's text and criminal case folder text and divide Analysis, and merit text and folder text are presented mostly in the form of half structure or pure urtext, and text categories are various, public procurator The process checked takes time and effort and working efficiency is lower, and Relation extraction technology be to aid in user's rapid and convenient obtain information can By medium.Therefore, judicial domain text is realized in conjunction with judicial domain text feature using existing Relation extraction the relevant technologies Relation extraction, handling a case procuratorate's high efficiency high quality has great meaning.
Problems Existing:
Although the process of Relation extraction be obtain text data, text participle, natural language processing, entity to predict, Relation inference (having used remote supervisory mostly), relationship probabilistic forecasting and etc., and training data more standardizes.But by Some judicial domain judgement document texts are analyzed, it is known that it has following features:
1, text includes information multiplicity;
2, text locating is strong;
3, without external data as reference.
Complete judicial domain criminal case folder text is analyzed, it is known that it there are following features:
1, it is complete to cover information for folder;
2, information is interrelated between text in folder;
3, information to be extracted is more, only relies on nlp processing relatively difficult to achieve.
Whether therefore unpredictable judicial domain text data can be handled by conventional Relation extraction mode.
Summary of the invention
The technical problem to be solved in the present invention is that whether for unpredictable in the prior art conventional relationship can be passed through Extraction mode handles the technological deficiency of judicial domain text data, provides a kind of judicial domain relationship neural network based Abstracting method and system.
The present invention solves judicial domain Relation extraction method neural network based used by its technical problem, includes:
S1, judicial domain unstructured text data is obtained, then by Text Pretreatment, obtains the conclusion of each text Part as corpus after participle, and generates the corresponding entity triple of each corpus;
S2, TF-IDF word frequency statistics are carried out to the judicial domain unstructured text data, obtains different charges and case By the feature set constituted together;
S3, it respectively obtains and expects that the vector of collection and feature set indicates, then carry out corpus vector sum feature set vector Splicing, so that obtaining the final sequence vector of corpus indicates;
S4, final sequence vector is indicated to carry out neural metwork training, obtains the Relation extraction model towards judicial domain;
S5, using the Relation extraction model, judicial neck is carried out to judicial domain unstructured text data to be processed Domain Relation extraction obtains corresponding entity triple.
Further, it in the step S1 of judicial domain Relation extraction method neural network based of the invention, obtains Judicial domain unstructured text data, which refers to from Chinese judgement document's net, crawls unstructured text data collection, sentences comprising criminal Certainly two kinds of texts of book and criminal written verdict.
Further, in judicial domain Relation extraction method neural network based of the invention, feature set vector is The term vector * (1+TF-IDF word frequency weight) of each word of corpus.
Further, it in the step S3 of judicial domain Relation extraction method neural network based of the invention, obtains Expect that the method that the vector of collection and feature set indicates is to obtain vector using Word2Vec model or Bert model to indicate.
Further, in judicial domain Relation extraction method neural network based of the invention, corpus vector by Word Embeddings and Position Embeddings are spliced, and Word Embeddings generates every corpus Term vector, dimension are denoted as dw, Position Embeddings generates the position vector of every corpus, and dimension is denoted as dp;Feature Integrate vector as the term vector * (1+TF-IDF word frequency weight) of each word of corpus, dimension is denoted as dti, the vector of feature set vector Sequence is expressed as
The final sequence vector of corpus is expressed as w={ w1,w2,...,wm};
Wherein, wi=Rd, d=dw+dp*2+dti
The present invention is to solve its technical problem, additionally provides a kind of judicial domain Relation extraction neural network based system System includes:
Corpus obtains module, for obtaining judicial domain unstructured text data, then by Text Pretreatment, obtains It takes the conclusion part of each text, as corpus after participle, and generates the corresponding entity triple of each corpus;
Feature set obtains module, for carrying out TF-IDF word frequency statistics to the judicial domain unstructured text data, Different charges and case are obtained by the feature set that constitutes together;
Final sequence vector representation module expects that the vector of collection and feature set indicates for respectively obtaining, then by corpus Collection vector sum feature set vector is spliced, so that obtaining the final sequence vector of corpus indicates;
Model training module carries out neural metwork training for indicating final sequence vector, obtains towards judicial domain Relation extraction model;
Relation extraction module, for utilizing the Relation extraction model, to judicial domain non-structured text to be processed Data carry out judicial domain Relation extraction, obtain corresponding entity triple.
Further, module is obtained in the corpus of judicial domain Relation extraction system neural network based of the invention In, acquisition judicial domain unstructured text data, which refers to from Chinese judgement document's net, crawls unstructured text data collection, wraps Containing two kinds of texts of criminal judgment and criminal written verdict.
Further, in judicial domain Relation extraction system neural network based of the invention, feature set vector is The term vector * (1+TF-IDF word frequency weight) of each word of corpus.
Further, in judicial domain Relation extraction system neural network based of the invention, final sequence vector It obtains expecting that the method that the vector of collection and feature set indicates is to obtain using Word2Vec model or Bert model in representation module It is indicated to vector.
Further, in judicial domain Relation extraction system neural network based of the invention, corpus vector by Word Embeddings and Position Embeddings are spliced, and Word Embeddings generates every corpus Term vector, dimension are denoted as dw, Position Embeddings generates the position vector of every corpus, and dimension is denoted as dp;Feature Integrate vector as the term vector * (1+TF-IDF word frequency weight) of each word of corpus, dimension is denoted as dti, the vector of feature set vector Sequence is expressed as
The final sequence vector of corpus is expressed as w={ w1,w2,...,wm};
Wherein, wi=Rd, d=dw+dp*2+dti
Implement judicial domain Relation extraction method and system neural network based of the invention, has below beneficial to effect Fruit: present invention firstly provides open neural network Relation extraction is applied in judicial domain work, it is intended to pass through analysis department Method field text semantic feature realizes the pumping to key message in extensive judicial domain text by the method for machine learning It takes, meanwhile, charge feature vector is added in the present invention in Relation extraction neural network model for the first time, and multi-angle optimizes Relation extraction Accurate rate.
Detailed description of the invention
Present invention will be further explained below with reference to the attached drawings and examples, in attached drawing:
Fig. 1 is the flow chart of one embodiment of judicial domain Relation extraction method neural network based of the invention;
Fig. 2 is the position vector schematic diagram of corpus;
Fig. 3 is the schematic diagram of one embodiment of judicial domain Relation extraction system neural network based.
Specific embodiment
For a clearer understanding of the technical characteristics, objects and effects of the present invention, now control attached drawing is described in detail A specific embodiment of the invention.
It is the process of one embodiment of judicial domain Relation extraction method neural network based of the invention with reference to Fig. 1, Fig. 1 Figure.The present embodiment judicial domain Relation extraction method neural network based comprises the following steps:
S1, judicial domain unstructured text data is obtained, then by Text Pretreatment, obtains the conclusion of each text Part as corpus after participle, and generates the corresponding entity triple of each corpus.
In this step, frame (Wenshu_Spider) is crawled first with open source judicial domain case document, is cut out from China Sentence document and obtains two kinds of texts of criminal judgment and criminal written verdict on the net, it is main to extract the text containing defendant and the injured party (can according to case by or charge define, common charge such as table 1).
The charge of 1 corresponding relationship containing PER2PER of table
The conclusion part in text is intercepted, using jieba, the participles tool such as KCWS is segmented, and extracts text based on NLP The entity pair of PERSON type in this constitutes entity triple as label in conjunction with charge, and then forms corpus (label form Such as entity triple " defendant-swindle-injured party ", in instances, defendant and the injured party are indicated by specific name, such as " Mr. Wang-swindle-Lee ").
S2, TF-IDF word frequency statistics are carried out to the judicial domain unstructured text data, obtains different charges and case By the feature set constituted together.Feature set mainly include every kind of charge and case by common vocabulary and word-building rule.
In this step, TF-IDF word frequency statistics are carried out to the urtext crawled, assesses the words in text for every Class case by or charge text significance level, by crucial words obtain different charges and case by feature set.
For in the word in a certain specific file, its importance (i.e. word frequency) is indicated are as follows:
Wherein, ni,jIndicate the number that word i occurs in specific file j, ∑knk,jIndicate all in specific file j The total degree that word occurs.
S3, it respectively obtains and expects that the vector of collection and feature set indicates, then carry out corpus vector sum feature set vector Splicing, so that obtaining the final sequence vector of corpus indicates.
In step, model (such as Word2Vec model or Bert model) is generated using vector and obtains corpus and feature set Vector indicate.Wherein Word2Vec model can directly train the vector for the fixed dimension for obtaining each word to indicate, using CBOW Method (one kind of Continuous Bag-of-Words, Word2Vec model) does term vector to the description text of each entity It generates.When model training, the dimension size that each term vector is arranged is 100, and suitable min-count and sliding is arranged (the training input of CBOW model is the corresponding term vector of context-sensitive word of some Feature Words, Min-cout to window value With sliding window value for controlling context length), it obtains indicating term vector.And the training of BERT model is divided into pre-training (Pre-training) and fine tuning (Fine-tuning) two step, the present invention only use pre-training, generate the Chinese words of 768 dimensions to Amount indicates.
Corpus vector sum feature set vector is spliced, obtaining the final sequence vector of corpus indicates, is denoted as w (using Word2Vec model respectively, Bert model generates vector and indicates, compares for experimental result).Wherein, corpus vector It is spliced by Word Embeddings and Position Embeddings, Word Embeddings generates every corpus Term vector, dimension is denoted as dw, Position Embeddings generates the position vector of every corpus, and dimension is denoted as dp, often The relative position of a word and entity pair, such as " president " in Fig. 2, the relative distance with head entity " king AA " is 4, with tail reality The relative position of body " king BB " is -3);Feature set vector is the term vector * (1+TF-IDF word frequency weight) of each word of corpus, Dimension is denoted as dti(sequence vector of feature set is expressed as
The final sequence vector of corpus is expressed as w={ w1,w2,...,wm};
Wherein, wi=Rd, d=dw+dp*2+dti
S4, final sequence vector is indicated to carry out neural metwork training, obtains the Relation extraction model towards judicial domain.
For above-mentioned implementation steps S4, completed by step in detail below:
Feature extraction is carried out using the convolutional layer of convolutional neural networks.Convolutional layer sliding window is denoted as l, sentence Embeddings length is denoted as dc, convolutional layer matrix-vector is expressed asBias vector is expressed as b.I-th of cunning The sequence vector of dynamic window may be expressed as:
qi=wi-l+1:i(1≤i≤m+l-1)
The calculation formula of i-th of filter of convolutional layer are as follows:
pi=[Wq+b]i
Followed by maximum pond:
[x]i=max (pi)
Finally by non-linear layer tanh function and full articulamentum, the output vector o of convolutional layer is obtained.Non-linear layer are as follows: g =tanh ([x]i), full articulamentum are as follows: o=Wg+b.
By the output vector o of convolutional layer by one softmax layers, the relationship probability of entity pair is predicted:
Wherein, nrFor relationship quantity, o is convolutional layer output vector.
In addition, the sequence vector of feature set is carried out the charge prediction of each corpus by attention mechanism.It is first First calculate the attention weight of each charge:The charge possibility distrabtion of each corpus is calculated again:
zi=softmax (Wai+b)
Finally obtain charge prediction probability:
Optimal Parameters.The loss function setup of Relationship Prediction are as follows:
The loss function setup of charge prediction are as follows:
Final loss function setup are as follows:
L=Lrelation+α·Lcharge
Wherein, s is corpus quantity, and θ is all parameters of model, and α is hyper parameter, is set as 1 herein.
S5, using the Relation extraction model, judicial neck is carried out to judicial domain unstructured text data to be processed Domain Relation extraction obtains corresponding entity triple.
With reference to Fig. 3, the present invention is to solve its technical problem, additionally provides a kind of judicial domain relationship neural network based Extraction system obtains module 31 comprising corpus, feature set obtains module 32, final sequence vector representation module 33, model instruction Practice module 34 and Relation extraction module 35.
Corpus obtains module 31 for obtaining judicial domain unstructured text data, then passes through Text Pretreatment, It obtains the conclusion part of each text, as corpus after participle, and generates the corresponding entity triple of each corpus; Feature set obtains module 32 and is used to carry out TF-IDF word frequency statistics to the judicial domain unstructured text data, obtains difference Charge and case are by the feature set that constitutes together;Final sequence vector representation module 33 is for respectively obtaining expectation collection and feature set Vector indicate, then corpus vector sum feature set vector is spliced, to obtain the final sequence vector of corpus It indicates;Model training module 34 is used to indicate final sequence vector to carry out neural metwork training, obtains towards judicial domain Relation extraction model;Relation extraction module 35 is used to utilize the Relation extraction model, non-structural to judicial domain to be processed Change text data and carry out judicial domain Relation extraction, obtains corresponding entity triple.
Wherein, acquisition judicial domain unstructured text data, which refers to from Chinese judgement document's net, crawls non-structured text Data set includes two kinds of texts of criminal judgment and criminal written verdict;Obtain expecting in final sequence vector representation module collection and The method that the vector of feature set indicates is to obtain vector using Word2Vec model or Bert model to indicate.
Corpus vector is spliced by Word Embeddings and Position Embeddings, Word Embeddings generates the term vector of every corpus, and dimension is denoted as dw, Position Embeddings generates every corpus Position vector, dimension is denoted as dp;Feature set vector is the term vector * (1+TF-IDF word frequency weight) of each word of corpus, dimension Degree is denoted as dti, the sequence vector of feature set vector is expressed as
The final sequence vector of corpus is expressed as w={ w1,w2,...,wm};
Wherein, wi=Rd, d=dw+dp*2+dti
The embodiment of the present invention is described with above attached drawing, but the invention is not limited to above-mentioned specific Embodiment, the above mentioned embodiment is only schematical, rather than restrictive, those skilled in the art Under the inspiration of the present invention, without breaking away from the scope protected by the purposes and claims of the present invention, it can also make very much Form, all of these belong to the protection of the present invention.

Claims (10)

1. a kind of judicial domain Relation extraction method neural network based, characterized by comprising:
S1, judicial domain unstructured text data is obtained, then by Text Pretreatment, obtains the conclusion portion of each text Point, as corpus after participle, and generate the corresponding entity triple of each corpus;
S2, TF-IDF word frequency statistics are carried out to the judicial domain unstructured text data, obtains different charges and case by institute The feature set constituted together;
S3, it respectively obtains and expects that the vector of collection and feature set indicates, then splice corpus vector sum feature set vector, It is indicated to obtain the final sequence vector of corpus;
S4, final sequence vector is indicated to carry out neural metwork training, obtains the Relation extraction model towards judicial domain;
S5, using the Relation extraction model, judicial domain pass is carried out to judicial domain unstructured text data to be processed System extracts, and obtains corresponding entity triple.
2. judicial domain Relation extraction method neural network based according to claim 1, which is characterized in that step S1 In, acquisition judicial domain unstructured text data, which refers to from Chinese judgement document's net, crawls unstructured text data collection, wraps Containing two kinds of texts of criminal judgment and criminal written verdict.
3. judicial domain Relation extraction method neural network based according to claim 1, which is characterized in that feature set Vector is the term vector * (1+TF-IDF word frequency weight) of each word of corpus.
4. judicial domain Relation extraction method neural network based according to claim 1, which is characterized in that step S3 In obtain expecting that the method that the vector of collection and feature set indicates is using Word2Vec model or Bert model to obtain vector table Show.
5. judicial domain Relation extraction method neural network based according to claim 1, which is characterized in that corpus Vector is spliced by Word Embeddings and Position Embeddings, and Word Embeddings generates every language Expect the term vector of collection, dimension is denoted as dw, Position Embeddings generates the position vector of every corpus, and dimension is denoted as dp;Feature set vector is the term vector * (1+TF-IDF word frequency weight) of each word of corpus, and dimension is denoted as dti, feature set vector Sequence vector be expressed as
The final sequence vector of corpus is expressed as w={ w1,w2,...,wm};
Wherein, wi=Rd, d=dw+dp*2+dti
6. a kind of judicial domain Relation extraction system neural network based, characterized by comprising:
Corpus obtains module, for obtaining judicial domain unstructured text data, then by Text Pretreatment, obtains every The conclusion part of a text as corpus after participle, and generates the corresponding entity triple of each corpus;
Feature set obtains module, for carrying out TF-IDF word frequency statistics to the judicial domain unstructured text data, obtains Different charges and case are by the feature set that constitutes together;
Final sequence vector representation module expects that the vector of collection and feature set indicates for respectively obtaining, then by corpus to Amount and feature set vector are spliced, so that obtaining the final sequence vector of corpus indicates;
Model training module carries out neural metwork training for indicating final sequence vector, obtains the pass towards judicial domain It is extraction model;
Relation extraction module, for utilizing the Relation extraction model, to judicial domain unstructured text data to be processed Judicial domain Relation extraction is carried out, corresponding entity triple is obtained.
7. judicial domain Relation extraction system neural network based according to claim 6, which is characterized in that corpus It obtains in module, acquisition judicial domain unstructured text data, which refers to from Chinese judgement document's net, crawls non-structured text number It include two kinds of texts of criminal judgment and criminal written verdict according to collection.
8. judicial domain Relation extraction system neural network based according to claim 6, which is characterized in that feature set Vector is the term vector * (1+TF-IDF word frequency weight) of each word of corpus.
9. judicial domain Relation extraction system neural network based according to claim 6, which is characterized in that finally to It obtains expecting that the method that the vector of collection and feature set indicates is using Word2Vec model or Bert in amount sequence representation module Model obtains vector expression.
10. judicial domain Relation extraction system neural network based according to claim 6, which is characterized in that corpus Collection vector is spliced by Word Embeddings and Position Embeddings, and Word Embeddings generates every The term vector of corpus, dimension are denoted as dw, Position Embeddings generates the position vector of every corpus, dimension note For dp;Feature set vector is the term vector * (1+TF-IDF word frequency weight) of each word of corpus, and dimension is denoted as dti, feature set to The sequence vector of amount is expressed as
The final sequence vector of corpus is expressed as w={ w1,w2,...,wm};
Wherein, wi=Rd, d=dw+dp*2+dti
CN201910145396.1A 2019-02-27 2019-02-27 Neural network-based judicial domain relation extraction method and system Expired - Fee Related CN109933789B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201910145396.1A CN109933789B (en) 2019-02-27 2019-02-27 Neural network-based judicial domain relation extraction method and system

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201910145396.1A CN109933789B (en) 2019-02-27 2019-02-27 Neural network-based judicial domain relation extraction method and system

Publications (2)

Publication Number Publication Date
CN109933789A true CN109933789A (en) 2019-06-25
CN109933789B CN109933789B (en) 2021-04-13

Family

ID=66986028

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201910145396.1A Expired - Fee Related CN109933789B (en) 2019-02-27 2019-02-27 Neural network-based judicial domain relation extraction method and system

Country Status (1)

Country Link
CN (1) CN109933789B (en)

Cited By (17)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110570920A (en) * 2019-08-20 2019-12-13 华东理工大学 Entity and relationship joint learning method based on attention focusing model
CN110598846A (en) * 2019-08-15 2019-12-20 北京航空航天大学 Hierarchical recurrent neural network decoder and decoding method
CN110837558A (en) * 2019-11-07 2020-02-25 成都星云律例科技有限责任公司 Judgment document entity relation extraction method and system
CN111260114A (en) * 2020-01-08 2020-06-09 昆明理工大学 Low-frequency confusable criminal name prediction method for integrating case auxiliary sentence
CN111444704A (en) * 2020-03-27 2020-07-24 中南大学 Network security keyword extraction method based on deep neural network
CN111476035A (en) * 2020-05-06 2020-07-31 中国人民解放军国防科技大学 Chinese open relation prediction method and device, computer equipment and storage medium
CN111563374A (en) * 2020-03-23 2020-08-21 北京交通大学 Personnel social relationship extraction method based on judicial official documents
CN111680504A (en) * 2020-08-11 2020-09-18 四川大学 Legal information extraction model, method, system, device and auxiliary system
CN111798847A (en) * 2020-06-22 2020-10-20 广州小鹏车联网科技有限公司 Voice interaction method, server and computer-readable storage medium
CN112347780A (en) * 2020-11-27 2021-02-09 浙江大学 Judicial fact finding generation method, device and medium based on deep neural network
CN112396201A (en) * 2019-07-30 2021-02-23 北京国双科技有限公司 Criminal name prediction method and system
CN112579748A (en) * 2019-09-30 2021-03-30 北京国双科技有限公司 Method and device for extracting specific event relation from inquiry record
CN112764024A (en) * 2020-12-29 2021-05-07 杭州电子科技大学 Radar target identification method based on convolutional neural network and Bert
CN113011185A (en) * 2020-07-17 2021-06-22 上海浦东华宇信息技术有限公司 Legal field text analysis and identification method, system, storage medium and terminal
CN113779260A (en) * 2021-08-12 2021-12-10 华东师范大学 Domain map entity and relationship combined extraction method and system based on pre-training model
CN113901827A (en) * 2021-12-10 2022-01-07 北京邮电大学 Entity identification and relation extraction method and device, electronic equipment and storage medium
CN116304035A (en) * 2023-02-28 2023-06-23 中国司法大数据研究院有限公司 Multi-notice multi-crime name relation extraction method and device in complex case

Citations (11)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20150169563A1 (en) * 2013-12-13 2015-06-18 National Cheng Kung University Method and system for recommending research information news
JP2017091270A (en) * 2015-11-11 2017-05-25 大日本印刷株式会社 Information processing device, information processing system, and program
CN106952193A (en) * 2017-03-23 2017-07-14 北京华宇信息技术有限公司 A kind of criminal case aid decision-making method based on fuzzy depth belief network
CN108073673A (en) * 2017-05-15 2018-05-25 北京华宇元典信息服务有限公司 A kind of legal knowledge map construction method, apparatus, system and medium based on machine learning
CN108280058A (en) * 2018-01-02 2018-07-13 中国科学院自动化研究所 Relation extraction method and apparatus based on intensified learning
CN108763485A (en) * 2018-05-25 2018-11-06 南京大学 A kind of chain of evidence relational model construction method of the judgement document based on text similarity
CN108763483A (en) * 2018-05-25 2018-11-06 南京大学 A kind of Text Information Extraction method towards judgement document
CN109145071A (en) * 2018-08-06 2019-01-04 中国地质大学(武汉) A kind of automated construction method and system towards geophysics field knowledge mapping
CN109241285A (en) * 2018-08-29 2019-01-18 东南大学 A kind of device of the judicial decision in a case of auxiliary based on machine learning
WO2019023412A1 (en) * 2017-07-26 2019-01-31 Siuvo Inc. Semantic classification numerical data in natural language context based on machine learning
CN109376963A (en) * 2018-12-10 2019-02-22 杭州世平信息科技有限公司 A kind of criminal case charge law article unified prediction neural network based

Patent Citations (11)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20150169563A1 (en) * 2013-12-13 2015-06-18 National Cheng Kung University Method and system for recommending research information news
JP2017091270A (en) * 2015-11-11 2017-05-25 大日本印刷株式会社 Information processing device, information processing system, and program
CN106952193A (en) * 2017-03-23 2017-07-14 北京华宇信息技术有限公司 A kind of criminal case aid decision-making method based on fuzzy depth belief network
CN108073673A (en) * 2017-05-15 2018-05-25 北京华宇元典信息服务有限公司 A kind of legal knowledge map construction method, apparatus, system and medium based on machine learning
WO2019023412A1 (en) * 2017-07-26 2019-01-31 Siuvo Inc. Semantic classification numerical data in natural language context based on machine learning
CN108280058A (en) * 2018-01-02 2018-07-13 中国科学院自动化研究所 Relation extraction method and apparatus based on intensified learning
CN108763485A (en) * 2018-05-25 2018-11-06 南京大学 A kind of chain of evidence relational model construction method of the judgement document based on text similarity
CN108763483A (en) * 2018-05-25 2018-11-06 南京大学 A kind of Text Information Extraction method towards judgement document
CN109145071A (en) * 2018-08-06 2019-01-04 中国地质大学(武汉) A kind of automated construction method and system towards geophysics field knowledge mapping
CN109241285A (en) * 2018-08-29 2019-01-18 东南大学 A kind of device of the judicial decision in a case of auxiliary based on machine learning
CN109376963A (en) * 2018-12-10 2019-02-22 杭州世平信息科技有限公司 A kind of criminal case charge law article unified prediction neural network based

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
王林玉 等: "基于卷积神经网络和关键词策略的实体关系抽取方法", 《模式识别与人工智能》 *
高丹 等: "海量法律文书中基于 CNN 的实体关系抽取技术", 《小型微型计算机系统》 *

Cited By (26)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112396201A (en) * 2019-07-30 2021-02-23 北京国双科技有限公司 Criminal name prediction method and system
CN110598846A (en) * 2019-08-15 2019-12-20 北京航空航天大学 Hierarchical recurrent neural network decoder and decoding method
CN110598846B (en) * 2019-08-15 2022-05-03 北京航空航天大学 Hierarchical recurrent neural network decoder and decoding method
CN110570920B (en) * 2019-08-20 2023-07-14 华东理工大学 Entity and relationship joint learning method based on concentration model
CN110570920A (en) * 2019-08-20 2019-12-13 华东理工大学 Entity and relationship joint learning method based on attention focusing model
CN112579748A (en) * 2019-09-30 2021-03-30 北京国双科技有限公司 Method and device for extracting specific event relation from inquiry record
CN110837558A (en) * 2019-11-07 2020-02-25 成都星云律例科技有限责任公司 Judgment document entity relation extraction method and system
CN110837558B (en) * 2019-11-07 2022-04-15 成都星云律例科技有限责任公司 Judgment document entity relation extraction method and system
CN111260114A (en) * 2020-01-08 2020-06-09 昆明理工大学 Low-frequency confusable criminal name prediction method for integrating case auxiliary sentence
CN111260114B (en) * 2020-01-08 2022-06-17 昆明理工大学 Low-frequency confusable criminal name prediction method for integrating case auxiliary sentence
CN111563374A (en) * 2020-03-23 2020-08-21 北京交通大学 Personnel social relationship extraction method based on judicial official documents
CN111444704B (en) * 2020-03-27 2023-09-19 中南大学 Network safety keyword extraction method based on deep neural network
CN111444704A (en) * 2020-03-27 2020-07-24 中南大学 Network security keyword extraction method based on deep neural network
CN111476035A (en) * 2020-05-06 2020-07-31 中国人民解放军国防科技大学 Chinese open relation prediction method and device, computer equipment and storage medium
CN111476035B (en) * 2020-05-06 2023-09-05 中国人民解放军国防科技大学 Chinese open relation prediction method, device, computer equipment and storage medium
CN111798847A (en) * 2020-06-22 2020-10-20 广州小鹏车联网科技有限公司 Voice interaction method, server and computer-readable storage medium
CN113011185A (en) * 2020-07-17 2021-06-22 上海浦东华宇信息技术有限公司 Legal field text analysis and identification method, system, storage medium and terminal
CN111680504A (en) * 2020-08-11 2020-09-18 四川大学 Legal information extraction model, method, system, device and auxiliary system
CN112347780A (en) * 2020-11-27 2021-02-09 浙江大学 Judicial fact finding generation method, device and medium based on deep neural network
CN112347780B (en) * 2020-11-27 2023-09-12 浙江大学 Judicial fact finding generation method, device and medium based on deep neural network
CN112764024A (en) * 2020-12-29 2021-05-07 杭州电子科技大学 Radar target identification method based on convolutional neural network and Bert
CN113779260B (en) * 2021-08-12 2023-07-18 华东师范大学 Pre-training model-based domain map entity and relationship joint extraction method and system
CN113779260A (en) * 2021-08-12 2021-12-10 华东师范大学 Domain map entity and relationship combined extraction method and system based on pre-training model
CN113901827A (en) * 2021-12-10 2022-01-07 北京邮电大学 Entity identification and relation extraction method and device, electronic equipment and storage medium
CN116304035A (en) * 2023-02-28 2023-06-23 中国司法大数据研究院有限公司 Multi-notice multi-crime name relation extraction method and device in complex case
CN116304035B (en) * 2023-02-28 2023-11-03 中国司法大数据研究院有限公司 Multi-notice multi-crime name relation extraction method and device in complex case

Also Published As

Publication number Publication date
CN109933789B (en) 2021-04-13

Similar Documents

Publication Publication Date Title
CN109933789A (en) A kind of judicial domain Relation extraction method and system neural network based
Singh et al. Vectorization of text documents for identifying unifiable news articles
CN107168945B (en) Bidirectional cyclic neural network fine-grained opinion mining method integrating multiple features
WO2018218705A1 (en) Method for recognizing network text named entity based on neural network probability disambiguation
Zou et al. A lexicon-based supervised attention model for neural sentiment analysis
CN107818164A (en) A kind of intelligent answer method and its system
CN107679039A (en) The method and apparatus being intended to for determining sentence
CN110929030A (en) Text abstract and emotion classification combined training method
CN108255805A (en) The analysis of public opinion method and device, storage medium, electronic equipment
CN103631859A (en) Intelligent review expert recommending method for science and technology projects
CN112001186A (en) Emotion classification method using graph convolution neural network and Chinese syntax
CN108563703A (en) A kind of determination method of charge, device and computer equipment, storage medium
CN110597998A (en) Military scenario entity relationship extraction method and device combined with syntactic analysis
CN110851601A (en) Cross-domain emotion classification system and method based on layered attention mechanism
Ji et al. SKGSUM: Abstractive document summarization with semantic knowledge graphs
Feifei et al. Bert-based Siamese network for semantic similarity
Ji et al. A deep learning method for named entity recognition in bidding document
Wu et al. TW-TGNN: Two windows graph-based model for text classification
CN109117471A (en) A kind of calculation method and terminal of the word degree of correlation
CN115359323A (en) Image text information generation method and deep learning model training method
Ji et al. Research on semantic similarity calculation methods in Chinese financial intelligent customer service
Huang et al. Structure graph refined information propagate network for aspect-based sentiment analysis
Li et al. DTGCN: a method combining dependency tree and graph convolutional networks for Chinese long-interval named entity relationship extraction
CN112347251A (en) Emotion analysis system based on attention mechanism and bidirectional LSTM (least squares Transmission technology)
Rodrigues et al. Domain adaptation of POS taggers without handcrafted features

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant
CF01 Termination of patent right due to non-payment of annual fee
CF01 Termination of patent right due to non-payment of annual fee

Granted publication date: 20210413