CN113239130A - Criminal judicial literature-based knowledge graph construction method and device, electronic equipment and storage medium - Google Patents

Criminal judicial literature-based knowledge graph construction method and device, electronic equipment and storage medium Download PDF

Info

Publication number
CN113239130A
CN113239130A CN202110677480.5A CN202110677480A CN113239130A CN 113239130 A CN113239130 A CN 113239130A CN 202110677480 A CN202110677480 A CN 202110677480A CN 113239130 A CN113239130 A CN 113239130A
Authority
CN
China
Prior art keywords
criminal
knowledge
graph
judicial
entity
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202110677480.5A
Other languages
Chinese (zh)
Inventor
王燕玲
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Guangdong Bowei Chuangyuan Technology Co ltd
Original Assignee
Guangdong Bowei Chuangyuan Technology Co ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Guangdong Bowei Chuangyuan Technology Co ltd filed Critical Guangdong Bowei Chuangyuan Technology Co ltd
Priority to CN202110677480.5A priority Critical patent/CN113239130A/en
Publication of CN113239130A publication Critical patent/CN113239130A/en
Pending legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/28Databases characterised by their database models, e.g. relational or object models
    • G06F16/284Relational databases
    • G06F16/288Entity relationship models
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/24Querying
    • G06F16/245Query processing
    • G06F16/2458Special types of queries, e.g. statistical queries, fuzzy queries or distributed queries
    • G06F16/2465Query processing support for facilitating data mining operations in structured databases
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/25Integrating or interfacing systems involving database management systems
    • G06F16/258Data format conversion from or to a database
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/20Natural language analysis
    • G06F40/279Recognition of textual entities
    • G06F40/289Phrasal analysis, e.g. finite state techniques or chunking
    • G06F40/295Named entity recognition
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/30Semantic analysis
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q50/00Systems or methods specially adapted for specific business sectors, e.g. utilities or tourism
    • G06Q50/10Services
    • G06Q50/18Legal services; Handling legal documents

Abstract

The invention discloses a construction method of a knowledge graph based on criminal judicial documents, which comprises the following steps: the method comprises the steps of classifying, processing, summarizing and summarizing various criminal plots specified by law regulations based on a server terminal, establishing a structured semantic knowledge base, forming and establishing a data set by automatically synchronizing data from a criminal referee document network or manually adding document data based on criminal factor map rules, obtaining entity information through entity identification, information extraction, knowledge fusion and knowledge storage, constructing a knowledge map, collecting and studying and analyzing the criminal referee document by adopting an artificial intelligence technical means, continuously iterating and updating, and perfecting the knowledge map. The invention has sufficient legal basis as legal support, and has high numerical accuracy; the method fully considers the difference factor of the criminal penalty amount, can be suitable for different criminal environments, and can be continuously updated and perfected.

Description

Criminal judicial literature-based knowledge graph construction method and device, electronic equipment and storage medium
Technical Field
The invention relates to the technical field of judicial file character recognition, in particular to a construction method and a device of a knowledge map based on criminal judicial documents, an electronic device and a storage medium.
Background
The fact is the iron law which is based on the crime of the Chinese and foreign criminal law to convince and measure the criminal, and the fact corresponding to the criminal treatment mode and degree is the fact reflecting the specific form of the criminal and the agent factors attached to the criminal. Basic crime facts and related criminal cases are factors which have important influence on the degree of criminal fatality, the criminal fatality and the degree of criminal fatality of the same kind of crimes can be reflected, the omission of any factor can cause deviation of final criminal results, and relatively fair criminal results can be obtained only by designing legal compliance, scientific and reasonable case extraction factors.
Any crime has a plurality of basic plots, the plots for crime determination and criminal investigation are the most basic crime forms, the basic conditions of crime behaviors are reflected, the content forms of the crime determination plot and the criminal investigation plot are changeable, the constructed words are standard and complex, a specific character mode is not followed, and the extraction difficulty is high.
Disclosure of Invention
Aiming at the technical defects in the background art, the invention provides a construction method and a device of a knowledge graph based on criminal judicial documents, electronic equipment and a storage medium, which solve the technical problems and meet the actual requirements, and the specific technical scheme is as follows:
a construction method of a knowledge graph based on criminal judicial papers, the construction method comprising:
classifying, processing and summarizing various criminal episodes specified by law specifications of criminal law, criminal instruction and criminal law on the basis of a server terminal;
according to the classification processing and induction summary of various criminal plots, the plots are used as basic variables, legal logic is expanded, a structured semantic knowledge base is established, and a systematic complete criminal factor map rule is formed;
based on the criminal factor map rule, extracting criminal referee document automatic synchronization data from a large amount of document data of a criminal referee document network, or manually adding document data directly on a data set detail page to create a data set;
according to the content of the data set, entity information is obtained and labeled through entity identification, information extraction, knowledge fusion and knowledge storage;
constructing a knowledge graph based on a criminal referee document based on an criminal factor graph rule according to the marked entity information;
the established knowledge graph continuously adopts an artificial intelligence technical means, public legal documents are collected, legal and appropriate plot entity information is extracted from the large amount of criminal referee documents through learning and analyzing, and part of the legal and appropriate plot entity information is added into a knowledge base after manual examination and continuously iterated and updated, so that the knowledge graph is perfected.
As a further scheme of the present invention, the classification processing and induction summarization includes concept functions and features of sentencing, sentencing principles, sentencing related systems, various sentencing elements which need to be considered in the sentencing process according to the sentencing law, sentencing directive opinions, sentencing rules of different sentencing according to the sentencing law enforcement rules, and the present law and law, judicial interpretation and other notice regulations.
As a further scheme of the present invention, the criminal factor map rule is based on legislation or judicial evidence presented in the form of normative legal documents, including "the people's republic of china" law of highest people < criminal guidance opinions on common crimes > "the law of advanced people of Guangdong province < criminal guidance opinions on common crimes >" implementation rules "the law of highest people, the explanation of the highest people's inspection institute on handling several problems of law applicable to stolen criminal cases" (the law of advanced people of Guangdong province, the notice of the inspection institute of Guangdong province on determining the standard of the amount of stolen criminal cases ".
As a further scheme of the invention, the creation of the data set is based on the massive judicial data formed by criminal officials documents published by the Internet, and the data analysis and the data preprocessing are carried out on the judicial data in the criminal officials documents based on the criminal factor map rules, so as to label the given criminal officials documents in sections.
As a further scheme of the invention, the entity identification, the information extraction, the knowledge fusion and the knowledge storage are that according to the mutual relation of different case entities in the criminal factor map rule, the mass judicial text information is automatically, real-timely and definitely analyzed, deduced, extracted and fused through an NLP natural language processing technology, the specific expression and the elements in the criminal referee document are extracted by using a regular expression in combination with a mode matching method, and the type marking is carried out on the extracted entities by using a named entity identification technology.
As a further aspect of the present invention, the type label of the named entity includes a label entity label, a label relationship label and a specific rule, the entity label is obtained based on entity information of the entity identification, information extraction, knowledge fusion and knowledge storage, the relationship label is obtained based on a combination of interrelations of different case entities in the criminal factor map rule, and the specific rule means that the entity and the label are in a matching correspondence and is determined by the criminal factor map rule.
As a further scheme of the present invention, the construction of the knowledge graph specifically includes performing relationship construction on the entity information according to the entity labels and the relationship labels labeled by the entity information through the sentencing factor graph rule, so as to determine the relationship between the entity information and construct the knowledge graph.
A server based on a construction method of a knowledge-graph based on criminal judicial papers, comprising: one or more processors, memory, one or more applications, wherein the one or more applications are stored in the memory and configured to be executed by the one or more processors, the one or more applications configured to: the construction method of the criminal judicial paperwork-based knowledge graph in the above embodiment is performed.
The invention has the beneficial effects that: the criminal referee document-based knowledge graph is constructed in a top-down mode and a bottom-up mode, high-quality data, a body and mode information are extracted from legal and legal contents, the criminal factor graph rule is constructed, the law and the legal are fully followed, and sufficient legal basis is provided as legal support. Then, by means of an artificial intelligence technology, public legal documents are collected, mass criminal referee documents are learned and analyzed, legal and appropriate plot entity information is extracted from the documents, part of the documents are added into a knowledge base after being manually checked, the documents are enough in number, the documents can comprehensively reflect all required materials depending on the structural content of the documents, and the numerical accuracy obtained by intelligent calculation through a big data algorithm is also high enough.
Besides, the knowledge map takes the actual criminal referee document stolen as a material, so that the free referee right of the officers involved in the criminal penalty referee, the regional difference, the economic culture difference and other factors of the officers are taken into consideration, the knowledge map can be suitable for different quantitative environments, and accurate data information is provided to greatly assist the realization of criminal monitoring. Meanwhile, the construction of the knowledge graph can be realized through deep learning, intelligent capture and locking and automatic absorption of the criminal rules established in the latest legislation, judicial interpretation and instructional cases.
Drawings
FIG. 1 is a logic flow diagram of the present invention.
Detailed Description
Embodiments of the present invention will be described below with reference to the accompanying drawings and related embodiments, wherein the following related embodiments are merely preferred embodiments for better illustrating the present invention itself, and the embodiments of the present invention are not limited to the following embodiments, and the present invention relates to the related essential parts in the technical field, which should be regarded as the known technology in the technical field and can be known and grasped by those skilled in the art.
A construction method of a knowledge graph based on criminal judicial papers, the construction method comprising:
classifying, processing and summarizing various criminal episodes specified by law specifications of criminal law, criminal instruction and criminal law on the basis of a server terminal;
according to the classification processing and induction summary of various criminal plots, the plots are used as basic variables, legal logic is expanded, a structured semantic knowledge base is established, and a systematic complete criminal factor map rule is formed;
based on the criminal factor map rule, extracting criminal referee document automatic synchronization data from a large amount of document data of a criminal referee document network, or manually adding document data directly on a data set detail page to create a data set;
according to the content of the data set, entity information is obtained and labeled through entity identification, information extraction, knowledge fusion and knowledge storage;
constructing a knowledge graph based on a criminal referee document based on an criminal factor graph rule according to the marked entity information;
the established knowledge graph continuously adopts an artificial intelligence technical means, public legal documents are collected, legal and appropriate plot entity information is extracted from the large amount of criminal referee documents through learning and analyzing, and part of the legal and appropriate plot entity information is added into a knowledge base after manual examination and continuously iterated and updated, so that the knowledge graph is perfected.
The judicial knowledge map is the basis of judicial intelligent application, and the construction of the stealing knowledge map by using an artificial intelligent technology is the first step for realizing the criminal monitoring model. The method is also a technical guarantee for promoting judicial intelligence, and comprises information such as entities, concepts, attributes, relationships and the like. The construction of a knowledge graph requires that various kinds of knowledge constructed by the knowledge graph are obtained from different forms of data sources through a knowledge extraction technology, and various kinds of unsuitable knowledge are removed through knowledge fusion, so that the quality and the performance of the knowledge graph are improved. For example, from the perspective of blending the knowledge map into a legal situation model, the criminal knowledge map for stealing is to organize a structured semantic knowledge base describing the relationship between case facts and referee results by combing and refining criminal facts for stealing crimes into case elements such as crime-making situation elements and criminal situation elements, and thus objectively reflect the relationship between crimes and criminal penalties. The invention carries out classification processing and induction summarization on various criminal plots specified by law specifications such as criminal law, criminal instruction, criminal law, and the like through the server terminal, takes the plots as basic variables, expands legal logic, establishes a structured semantic knowledge base, and forms a systematic complete criminal factor map rule. For example, in the aspect of a crime scene, the theft amount can be subdivided into a large amount, a huge amount and a special huge amount, and is linked with a specific amount. As another example, the criminal scenario of self-beginning can be subdivided into "self-beginning of active case," self-beginning (light crime) "," self-beginning (actively taking a position after questioning due to suspicious patterns or during forced drug abstinence) "," self-beginning (discovered by the handling organization but not investigating conversation, actively taking a position) "," self-beginning (actively taking a position to a criminal or dissuading by relatives and accompanying with self) "," other types of self-beginning ", and is connected with the penalty relief.
After the criminal factor map rule is established, the artificial intelligence and the big data technology are comprehensively applied, structured and semi-structured text data are extracted from legal regulations and judicial criminal referee documents, the knowledge map of stealing criminal cases is drawn by setting the knowledge elements such as crime-setting case elements and criminal case elements, and the knowledge map of stealing criminal cases is established through continuous iterative updating of links such as 'entity identification, information extraction, knowledge fusion and knowledge storage' so as to ensure standardization and precision of criminals. The construction technology of the knowledge graph is mainly divided into a top-down mode and a bottom-up mode in practice. The realization path of constructing the map from top to bottom mainly refers to the legal and legal contents, extracts high-quality data and body and mode information in the data, such as entity information of a legal conviction and an appraisal, and adds the entity information into a knowledge base. The bottom-up atlas structure is obtained by means of artificial intelligence technology, public legal documents are collected, mass criminal referee documents are studied and analyzed, legal and appropriate plot entity information is extracted from the documents, and part of the documents are added into a knowledge base after manual examination. The knowledge graph can be completely constructed according to laws and regulations, and meanwhile, information receipts are continuously extracted from mass criminal referee document data for learning and perfecting, so that the accurate criminal knowledge graph suitable for the judicial field is constructed, the optimization of a machine model is facilitated, and the recognition accuracy is improved.
As a further scheme of the present invention, the classification processing and induction summarization includes concept functions and features of sentencing, sentencing principles, sentencing related systems, various sentencing elements which need to be considered in the sentencing process according to the sentencing law, sentencing directive opinions, sentencing rules of different sentencing according to the sentencing law enforcement rules, and the present law and law, judicial interpretation and other notice regulations.
As a further scheme of the present invention, the criminal factor map rule is based on legislation or judicial evidence presented in the form of normative legal documents, including "the people's republic of china" law of highest people < criminal guidance opinions on common crimes > "the law of advanced people of Guangdong province < criminal guidance opinions on common crimes >" implementation rules "the law of highest people, the explanation of the highest people's inspection institute on handling several problems of law applicable to stolen criminal cases" (the law of advanced people of Guangdong province, the notice of the inspection institute of Guangdong province on determining the standard of the amount of stolen criminal cases ".
The classification processing and the induction summarization are based on the rules of the criminal law general rule on the basic principle of the sentencing, and specifically include but not limited to the concept function and the characteristics of the sentencing, the sentencing principle, the basic theoretical knowledge of the sentencing relevant system and the like, and are also based on various sentencing plot elements which need to be considered in the sentencing process of the sentencing law, sentencing instruction opinions and sentencing rules of different sentencing rules of each criminal specified by the sentencing implementation rules of the corresponding sentencing of each province.
The sentention factor map rules are based on, but not limited to, the current laws and regulations, judicial explanations and other notice regulations, and specifically include, but not limited to, legislation or judicial bases presented in the form of normative legal documents, such as "national common people's republic of china" highest people's court < sentention instruction comments on common crimes > "," high-level people's court of Guangdong province < sentention instruction comments on common crimes > implementation rules ".
According to certain laws and regulations, the criminal factor map rule takes documents such as highest people's law of the people's republic of China < criminal instruction opinions about common crimes >, (high-grade people's law of Guangdong province < criminal instruction opinions about common crimes >) and the like as a legal basis for constructing the criminal factor map rule, and the factors influencing the criminal are quantitatively analyzed. In conclusion, the knowledge graph construction has enough legal basis for law support for monitoring, massive case big data obtained by big data artificial intelligence application technical means are enough in quantity, documents can comprehensively reflect all materials required by monitoring depending on the structural content of the documents, numerical value accuracy obtained by intelligent calculation of a big data algorithm is also high enough, meanwhile, the criminal monitoring theoretical model takes criminal referees documents as materials and monitoring objects, the criminal free cutting right involved in the criminal referees and the factors such as regional difference and economic and cultural difference of the criminal referees are necessarily considered, and the criminal monitoring theoretical model integrates all positive factors to greatly assist in realizing the criminal monitoring.
As a further scheme of the invention, the creation of the data set is based on the massive judicial data formed by criminal officials documents published by the Internet, and the data analysis and the data preprocessing are carried out on the judicial data in the criminal officials documents based on the criminal factor map rules, so as to label the given criminal officials documents in sections.
The legal documents are the main expression form of case data, and the mass judicial big data formed by criminal referee documents published by people's court on the internet becomes important materials for analyzing criminal law theft and criminal law construction and criminal monitoring models, and mainly comprises the occurrence process of cases, case related personnel information, referee results and the like. The invention carries out data analysis and data preprocessing on judicial data in the criminal official document, labels the given criminal official document in a segmented mode, and carries out information arrangement on the criminal official document from the content structure, such as classified segmentation of a title, a text and a tail. Numbering the name of the judging mechanism and the type of the document in the title part, and the information of the personnel involved in the case, the case generation process and the judgment and identification result in the text part; and (3) labeling the names, the trial time and other contents of the participants at the end, and degrading the information of the criminal referee document from the document level to the label level, so that the next step of entity information processing is facilitated. A criminal referee document is accurately identified, collected, labeled and associated with information in the case, including but not limited to, the name of a crime, the year of the referee, the level of a court, an accreditation program, the nature of the document, the crime episode, the sentencing episode, the president, the postponement of criminals, and the like.
As a further scheme of the invention, the entity identification, the information extraction, the knowledge fusion and the knowledge storage are that according to the mutual relation of different case entities in the criminal factor map rule, the mass judicial text information is automatically, real-timely and definitely analyzed, deduced, extracted and fused through an NLP natural language processing technology, the specific expression and the elements in the criminal referee document are extracted by using a regular expression in combination with a mode matching method, and the type marking is carried out on the extracted entities by using a named entity identification technology.
Extracted in the data set: the contents of the name of the advertiser, the year of the judge, the judging structure, the court level, the treating program, the nature of the document, the chief criminal, the delay criminal and the like are obvious in text form characteristics and follow a certain character pattern, so that the method can adopt a pattern matching mode and carry out directional extraction on the contents by constructing a regular expression model.
Any crime has a plurality of basic plots, the plots for crime and sentencing are the most basic crime forms, the basic conditions of crime behaviors are reflected, the content forms of the crime and sentencing plots are changeable, the constructed words are standard and complex, a specific character mode is not followed, and the extraction difficulty is large. The entity identification, information extraction, knowledge fusion and knowledge storage are used for preprocessing semi-structured and unstructured text data in the field of data mining, extracting information such as specified events and facts from massive criminal referee documents, and forming structured storage in the server terminal. Under the legal situation, the NLP natural language processing technology automatically, real-timely and definitely analyzes, deduces, extracts and fuses massive judicial text information, and can accurately extract corresponding plots from the legal language (one state of natural language) of the legal document. The method has the advantages that based on a designed criminal case body framework, semi-structured and unstructured data in batch criminal referee documents are subjected to semantic annotation and characteristic extraction, tags with good structuring degree and rich semantic information are formed and stored in a case library, and valuable information conversion from 'sleeping' mass data to the operation of a support model is realized.
As a further aspect of the present invention, the type label of the named entity includes a label entity label, a label relationship label and a specific rule, the entity label is obtained based on entity information of the entity identification, information extraction, knowledge fusion and knowledge storage, the relationship label is obtained based on a combination of interrelations of different case entities in the criminal factor map rule, and the specific rule means that the entity and the label are in a matching correspondence and is determined by the criminal factor map rule.
As a further scheme of the present invention, the construction of the knowledge graph specifically includes performing relationship construction on the entity information according to the entity labels and the relationship labels labeled by the entity information through the sentencing factor graph rule, so as to determine the relationship between the entity information and construct the knowledge graph.
And the entity information constructs entity triples through the entity labels and the relationship labels obtained by naming the entities, and the triples are screened and sorted for effectiveness to form entity relationship triples which are most suitable for the rules of the sentencing factor maps, and the obtained entity relationship triples form a complete knowledge language logic system to become the theoretical basis of the knowledge maps. The entity labels and the relation labels define a plurality of relations such as case involvement, type, acceptance and the like in the criminal referee document, the relations among a plurality of entity information can form a complete knowledge base, and a knowledge map based on the criminal referee document is formed through data construction. Then, through the technical means of continuously adopting artificial intelligence, the public legal documents are collected, the legal and appropriate plot entity information is extracted from the large amount of criminal referee documents through learning and analyzing, part of the legal and appropriate plot entity information is added into a knowledge base after manual examination and continuously iterated and updated, and the knowledge map is continuously perfected.
A server based on a construction method of a knowledge-graph based on criminal judicial papers, comprising: one or more processors, memory, one or more applications, wherein the one or more applications are stored in the memory and configured to be executed by the one or more processors, the one or more applications configured to: the construction method of the criminal judicial paperwork-based knowledge graph in the above embodiment is performed.
An embodiment of the present invention provides a computer-readable storage medium, on which a computer program is stored, where the computer program, when executed by a processor, implements the method for predicting the actual criminal period of a crime according to any one of the technical solutions. The computer-readable storage medium includes, but is not limited to, any type of disk including floppy disks, hard disks, optical disks, CD-ROMs, and magnetic-optical disks, ROMs (read-only memories), RAMs (random access memories), EPROMs (erasable programmable read-only memories), EEPROMs (electrically erasable programmable read-only memories), flash memory, magnetic cards, or optical cards. That is, a storage device includes any medium that stores or transmits a message in a form readable by a device (e.g., a computer, a cell phone), and may be a read-only memory, a magnetic or optical disk, or the like.
As one embodiment, the server includes: one or more processors, a memory, one or more applications, wherein the one or more applications are stored in the memory and configured to be executed by the one or more processors, the one or more programs configured to perform the method of criminal law document based knowledge-graph construction in the above embodiments.
The server provided by the embodiment of the invention can realize the embodiment of the construction method of the knowledge graph based on the criminal judicial essay, and for the specific function realization, please refer to the description in the detailed method embodiment, which is not repeated herein.
In the embodiment of the invention, artificial intelligence and big data technology are comprehensively applied, structured and semi-structured text data are extracted from law, regulation and judicial criminal officials documents, the knowledge map based on the criminal officials documents is drawn by setting the knowledge elements such as criminal case elements, criminal case elements and the like, and the knowledge map for stealing criminal cases is constructed through continuous iterative updating of links such as 'entity identification, information extraction, knowledge fusion, knowledge storage' and the like, so as to ensure standardization and precision of criminal officials.
The foregoing is only a preferred embodiment of the present invention, and it should be noted that, for those skilled in the art, various modifications and decorations can be made without departing from the principle of the present invention, and these modifications and decorations should also be regarded as the protection scope of the present invention.

Claims (8)

1. A construction method of a knowledge graph based on criminal judicial documents is characterized by comprising the following steps:
classifying, processing and summarizing various criminal episodes specified by law specifications of criminal law, criminal instruction and criminal law on the basis of a server terminal;
according to the classification processing and induction summary of various criminal plots, the plots are used as basic variables, legal logic is expanded, a structured semantic knowledge base is established, and a systematic complete criminal factor map rule is formed;
based on the criminal factor map rule, extracting criminal referee document automatic synchronization data from a large amount of document data of a criminal referee document network, or manually adding document data directly on a data set detail page to create a data set;
according to the content of the data set, entity information is obtained through entity identification, information extraction, knowledge fusion and knowledge storage, and type marking is carried out;
constructing a knowledge graph based on criminal referee documents based on the criminal factor graph rule according to the marked entity information;
the established knowledge graph continuously adopts an artificial intelligence technical means, public legal documents are collected, legal and appropriate plot entity information is extracted from the large amount of criminal referee documents through learning and analyzing, and part of the legal and appropriate plot entity information is added into a knowledge base after manual examination and continuously iterated and updated, so that the knowledge graph is perfected.
2. The criminal judicial paperwork-based knowledge-graph construction method according to claim 1, wherein said classification and induction summarize comprises concept functions and features of criminal, principles of criminal, related laws of criminal, various elements of criminal plots to be considered in the course of criminal regulation, and different rules of criminal regulation, which are regulated by the corresponding law enforcement rules of each province, as well as current laws and regulations, judicial interpretations and other notice regulations.
3. The construction method of knowledge-graph based on criminal judicial papers according to claim 1, characterized in that the criminal factor-graph rules are based on legislation or judicial evidence presented in the form of normative legal documents, including "people's republic of china" criminal law of the highest people's court < criminal instruction opinions on common crimes > "law of senior people of Guangdong province < criminal instruction opinions on common crimes >" enforcement rules of the highest people's court, the highest people's inspection institute about handling a plurality of problems of applicable law for stealing criminal cases "(notice of senior people's court of Guangdong province, Guangdong province's inspection institute about determining the amount standard of stealing criminal cases).
4. The criminal judicial paperwork-based knowledge-graph construction method according to claim 1, wherein the data set is created according to massive large judicial data formed by criminal officials paperwork published on the internet, and the judicial data in the criminal officials paperwork is subjected to data analysis and data preprocessing based on the criminal factor graph rule, so as to label the given criminal officials paperwork in sections.
5. The criminal judicial paperwork-based knowledge-graph construction method according to claim 1, wherein the entity recognition, information extraction, knowledge fusion and knowledge storage are implemented by analyzing, deducing, extracting and fusing mass judicial text information automatically, in real time and definitely through an NLP (non line language processing) natural language processing technology according to the interrelation of different case entities in the criminal factor graph rule, combining a mode matching method, extracting specific expressions and elements in the criminal judicial paperwork through a regular expression, and performing the type marking on the extracted entities through a named entity recognition technology.
6. The criminal judicial paperwork-based knowledge-graph construction method according to claim 5, wherein the type labels of the named entities comprise labeled entity labels, labeled relationship labels and specific rules, the entity labels are obtained based on entity information of the entity identification, information extraction, knowledge fusion and knowledge storage, the relationship labels are obtained based on the mutual relationship combination of different case entities in the criminal factor graph rules, and the specific rules refer to the fact that the entities and the labels are in matching correspondence and are determined by the criminal factor graph rules.
7. The criminal judicial paperwork-based knowledge-graph construction method according to claim 6, wherein the knowledge-graph is specifically constructed by performing relationship construction on the entity information according to the entity labels marked on the entity information and the relationship labels through the criminal factor graph rule, so as to determine the relationship between the entity information and construct the knowledge-graph.
8. A server, characterized in that it comprises: one or more processors, memory, one or more applications, wherein the one or more applications are stored in the memory and configured to be executed by the one or more processors, the one or more applications configured to: a construction method of a criminal law document based knowledge-graph according to any one of claims 1 to 7 is performed.
CN202110677480.5A 2021-06-18 2021-06-18 Criminal judicial literature-based knowledge graph construction method and device, electronic equipment and storage medium Pending CN113239130A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202110677480.5A CN113239130A (en) 2021-06-18 2021-06-18 Criminal judicial literature-based knowledge graph construction method and device, electronic equipment and storage medium

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202110677480.5A CN113239130A (en) 2021-06-18 2021-06-18 Criminal judicial literature-based knowledge graph construction method and device, electronic equipment and storage medium

Publications (1)

Publication Number Publication Date
CN113239130A true CN113239130A (en) 2021-08-10

Family

ID=77140351

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202110677480.5A Pending CN113239130A (en) 2021-06-18 2021-06-18 Criminal judicial literature-based knowledge graph construction method and device, electronic equipment and storage medium

Country Status (1)

Country Link
CN (1) CN113239130A (en)

Cited By (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN114610954A (en) * 2022-03-09 2022-06-10 上海弘玑信息技术有限公司 Information processing method and device, storage medium and electronic equipment
CN114911879A (en) * 2022-07-19 2022-08-16 南京航天数智科技有限公司 Criminal case knowledge map relation reasoning method based on graph attention network
CN115203439B (en) * 2022-09-15 2022-11-29 天津市道本科技有限公司 Construction method of knowledge graph for compliance supervision, related method and system
CN115687632A (en) * 2022-08-25 2023-02-03 中国司法大数据研究院有限公司 Criminal measuring plot decomposition analysis method and system
CN115858507A (en) * 2022-08-25 2023-03-28 中国司法大数据研究院有限公司 Method and system for automatic assessment, error correction and completion of criminal investigation rule
CN115982388A (en) * 2023-03-06 2023-04-18 共道网络科技有限公司 Case quality control map establishing method, case document quality testing method, case quality control map establishing equipment and storage medium
CN116127977A (en) * 2023-02-08 2023-05-16 中国司法大数据研究院有限公司 Casualties extraction method for referee document

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107122444A (en) * 2017-04-24 2017-09-01 北京科技大学 A kind of legal knowledge collection of illustrative plates method for auto constructing
CN110837564A (en) * 2019-09-25 2020-02-25 中央民族大学 Construction method of knowledge graph of multilingual criminal judgment books
US20200073932A1 (en) * 2018-08-30 2020-03-05 Intelligent Fusion Technology, Inc Method and system for pattern discovery and real-time anomaly detection based on knowledge graph
CN111291161A (en) * 2020-02-20 2020-06-16 平安科技(深圳)有限公司 Legal case knowledge graph query method, device, equipment and storage medium
CN111930856A (en) * 2020-07-06 2020-11-13 北京邮电大学 Method, device and system for constructing domain knowledge graph ontology and data
CN112347270A (en) * 2020-11-30 2021-02-09 重庆工程职业技术学院 Legal knowledge graph construction system and method

Patent Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107122444A (en) * 2017-04-24 2017-09-01 北京科技大学 A kind of legal knowledge collection of illustrative plates method for auto constructing
US20200073932A1 (en) * 2018-08-30 2020-03-05 Intelligent Fusion Technology, Inc Method and system for pattern discovery and real-time anomaly detection based on knowledge graph
CN110837564A (en) * 2019-09-25 2020-02-25 中央民族大学 Construction method of knowledge graph of multilingual criminal judgment books
CN111291161A (en) * 2020-02-20 2020-06-16 平安科技(深圳)有限公司 Legal case knowledge graph query method, device, equipment and storage medium
CN111930856A (en) * 2020-07-06 2020-11-13 北京邮电大学 Method, device and system for constructing domain knowledge graph ontology and data
CN112347270A (en) * 2020-11-30 2021-02-09 重庆工程职业技术学院 Legal knowledge graph construction system and method

Cited By (10)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN114610954A (en) * 2022-03-09 2022-06-10 上海弘玑信息技术有限公司 Information processing method and device, storage medium and electronic equipment
CN114911879A (en) * 2022-07-19 2022-08-16 南京航天数智科技有限公司 Criminal case knowledge map relation reasoning method based on graph attention network
CN115687632A (en) * 2022-08-25 2023-02-03 中国司法大数据研究院有限公司 Criminal measuring plot decomposition analysis method and system
CN115858507A (en) * 2022-08-25 2023-03-28 中国司法大数据研究院有限公司 Method and system for automatic assessment, error correction and completion of criminal investigation rule
CN115687632B (en) * 2022-08-25 2024-04-09 中国司法大数据研究院有限公司 Criminal investigation plot decomposition analysis method and system
CN115203439B (en) * 2022-09-15 2022-11-29 天津市道本科技有限公司 Construction method of knowledge graph for compliance supervision, related method and system
CN116127977A (en) * 2023-02-08 2023-05-16 中国司法大数据研究院有限公司 Casualties extraction method for referee document
CN116127977B (en) * 2023-02-08 2023-10-03 中国司法大数据研究院有限公司 Casualties extraction method for referee document
CN115982388A (en) * 2023-03-06 2023-04-18 共道网络科技有限公司 Case quality control map establishing method, case document quality testing method, case quality control map establishing equipment and storage medium
CN115982388B (en) * 2023-03-06 2024-04-19 共道网络科技有限公司 Case quality control map establishment method, case document quality inspection method, case quality control map establishment equipment and storage medium

Similar Documents

Publication Publication Date Title
CN113239130A (en) Criminal judicial literature-based knowledge graph construction method and device, electronic equipment and storage medium
Sleimi et al. Automated extraction of semantic legal metadata using natural language processing
CN108572967B (en) Method and device for creating enterprise portrait
CN112182246B (en) Method, system, medium, and application for creating an enterprise representation through big data analysis
CN107403375A (en) A kind of listed company&#39;s bulletin classification and abstraction generating method based on deep learning
CN113239208A (en) Mark training model based on knowledge graph
CN108108744B (en) Method and system for radiation image auxiliary analysis
CN108563783B (en) Financial analysis management system and method based on big data
CN109800354B (en) Resume modification intention identification method and system based on block chain storage
CN110147540B (en) Method and system for generating business security requirement document
CN109492097B (en) Enterprise news data risk classification method
CN103176984A (en) Detection method of deceptive rubbish suggestions in user generated contents
CN110770781A (en) Legal information processing system, method and program
CN114860882A (en) Fair competition review auxiliary method based on text classification model
CN112347254A (en) News text classification method and device, computer equipment and storage medium
Bacalu Digital policing tools as social control technologies: data-driven predictive algorithms, automated facial recognition surveillance, and law enforcement biometrics
CN115794798B (en) Market supervision informatization standard management and dynamic maintenance system and method
CN115936932A (en) Method and device for processing judicial documents, electronic equipment and storage medium
CN115618085A (en) Interface data exposure detection method based on dynamic label
Kruse et al. Developing a legal form classification and extraction approach for company entity matching: Benchmark of rule-based and machine learning approaches
CN112988972A (en) Administrative penalty file evaluation and checking method and system based on data model
Watanabe et al. A study on support method of consulting service using text mining
Burley et al. Nlp workflows for computational social science: Understanding triggers of state-led mass killings
KR20220068937A (en) Standard Industrial Classification Based on Machine Learning Approach
CN113157949A (en) Method and device for extracting event information, computer equipment and storage medium

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication
RJ01 Rejection of invention patent application after publication

Application publication date: 20210810