CN107133283A - A method for automatically constructing a legal ontology knowledge base

Info

Publication number
CN107133283A
Authority
CN
China
Prior art keywords
law
word
classification
legal
knowledge base
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201710248747.2A
Other languages
Chinese (zh)
Inventor
齐宇馨
阿孜古丽
张德政
张铮
谢永红
黄彪
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
University of Science and Technology Beijing USTB
Original Assignee
University of Science and Technology Beijing USTB
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by University of Science and Technology Beijing USTB
Priority to CN201710248747.2A
Publication of CN107133283A

Classifications

    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30 Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/36 Creation of semantic tools, e.g. ontology or thesauri
    • G06F16/374 Thesaurus
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00 Handling natural language data
    • G06F40/20 Natural language analysis
    • G06F40/279 Recognition of textual entities
    • G06F40/284 Lexical analysis, e.g. tokenisation or collocates

Abstract

The present invention provides a method for automatically constructing a legal ontology knowledge base, which can build a legal ontology knowledge base from legal trial documents. The method includes: counting the professional high-frequency words in legal trial documents using web legal text data, and building a legal domain vocabulary from the professional high-frequency words obtained; removing stop words from the legal trial documents and segmenting them into words according to the legal domain vocabulary; iteratively classifying the segmented legal trial document data, extracting the topic words of each class in each round of classification, and dividing the topic words of each class according to the order of the classification levels to obtain the hierarchical relations between topic words; and building the legal ontology knowledge base from the extracted topic words and the hierarchical relations between them. The present invention is applicable to the technical field of knowledge engineering.

Description

A method for automatically constructing a legal ontology knowledge base
Technical field
The present invention relates to the technical field of knowledge engineering, and in particular to a method for automatically constructing a legal ontology knowledge base.
Background art
An ontology is an explicit, formal and shareable specification of a conceptual system. An ontology defines the basic terms and relations that make up the vocabulary of a subject area, together with the rules for combining these terms and relations to define extensions of the vocabulary. In knowledge engineering, a knowledge base is a structured, easy-to-operate, easy-to-use, comprehensive and organized cluster of knowledge; it is a set of interrelated knowledge pieces that are stored, organized, managed and used in computer memory with a certain knowledge representation, to meet the needs of problem solving in a given field. An ontology knowledge base expresses information in a form closer to human cognition of the world and offers a better ability to organize, manage and understand the massive information on the internet. Ontology knowledge bases have invigorated semantic search on the internet and have also proved powerful in intelligent question answering and in big data analysis and decision-making; they have become infrastructure for knowledge-based intelligent services on the internet. With the arrival of the big data era, converting big data into knowledge and strengthening the understanding of internet resource content will drive contemporary information processing technology from information services toward knowledge services.
Building a legal domain ontology knowledge base will play an important role in integrating legal knowledge, mining legal hot topics, predicting legal events, building legal expert systems, and so on. In the prior art, however, no legal ontology knowledge base has yet been constructed.
Summary of the invention
The technical problem to be solved by the present invention is to provide a method for automatically constructing a legal ontology knowledge base, so as to solve the lack of a legal ontology knowledge base in the prior art.
To solve the above technical problem, an embodiment of the present invention provides a method for automatically constructing a legal ontology knowledge base, including:
counting the professional high-frequency words in legal trial documents using web legal text data, and building a legal domain vocabulary from the professional high-frequency words obtained;
removing stop words from the legal trial documents and segmenting them into words according to the legal domain vocabulary;
iteratively classifying the segmented legal trial document data, extracting the topic words of each class in each round of classification, and dividing the topic words of each class according to the order of the classification levels to obtain the hierarchical relations between topic words;
building the legal ontology knowledge base from the extracted topic words and the hierarchical relations between them.
Further, counting the professional high-frequency words in legal trial documents using web legal text data, and building a legal domain vocabulary from the professional high-frequency words obtained, includes:
counting the professional high-frequency words of civil, criminal and administrative trial documents using web legal text data;
building the legal domain vocabulary from the counted professional high-frequency words of civil, criminal and administrative trial documents, in combination with a preset input method dictionary.
Further, the professional high-frequency words of civil trial documents include one or more of: civil trial, civil case, civil dispute, civil rights and interests, property, person, citizen, minor, capacity for civil rights, capacity for civil conduct, guardianship, declaration of a missing person, declaration of death, voter qualification, determination that property is ownerless, return of property, self-employed business, rural contracting household, creditor's rights, debt, individual partnership, legal person, enterprise legal person, social organization legal person, joint operation, juristic act, civil rights, civil liability, agency, agency by agreement, statutory agency, designated agency, property ownership, lien, sale, lease, mortgage or transfer, creditor-debtor relationship, unjust enrichment, intellectual property, copyright, patent right, exclusive right to use a trademark, personal rights, right to health, right of name, right of portrait, right of fame, right of discovery, inventor's patent right, right of reputation, marriage, divorce, upbringing, adoption, fostering, estate, succession, will, bequest, legacy-support agreement, real estate, and house leasing.
Further, the professional high-frequency words of criminal trial documents include one or more of: criminal, criminal trial, criminal case, crime, major case, economic crime, juvenile delinquency, analogy in criminal law, criminal responsibility, object of crime, subject of crime, criminal capacity, age of criminal responsibility, corporate crime, justifiable defense, necessity (emergency avoidance of danger), accomplished crime, preparation for a crime, attempted crime, discontinuation of a crime, joint crime, criminal group, punishment, public surveillance, criminal detention, fixed-term imprisonment, life imprisonment, death penalty, fine, deprivation of political rights, confiscation of property, deportation, sentencing, recidivist, voluntary surrender, combined punishment for several crimes, calculation of the term of sentence, offsetting of the term of sentence, suspended sentence, commutation, parole, limitation of prosecution, pardon, counter-revolution, organizing a jailbreak, spy, secret agent, arson, explosion, poisoning, endangering public security, sabotaging vehicles, sabotaging transportation equipment, sabotaging communication equipment, sabotaging electric power equipment, traffic accident, major liability accident, smuggling, speculation and profiteering, illegal arbitrage of foreign exchange, counterfeiting currency, forging valuable securities, forging valuable tickets, tax evasion, refusal to pay taxes, counterfeiting trademarks, Holiday culture, illegal felling of trees, excessive felling of trees, intentional homicide, negligent homicide, intentional injury, negligently causing serious injury, prostitution and whoring, extorting a confession by torture, false accusation, rape, sexual intercourse with an underage girl, prostitution, abducting and selling people, kidnapping women and children, trafficking in women and children, illegal detention, bag job, insult, slander, retaliation and frame-up, perjury, infringing freedom of correspondence, sabotaging elections, property infringement, robbery, snatching, extortion, theft, habitual thief, fraud, habitual swindler, embezzlement, misappropriation of public funds, kidnapping, obstructing official duties, refusal to carry out judgments and rulings, disturbing public order, hooliganism, escape, harboring, shielding, manufacturing and selling counterfeit drugs, swindling by false pretences, gambling, obscene articles, narcotics, harboring stolen goods, selling stolen goods, destroying precious cultural relics, illegally crossing the national border, insulting the national flag or national emblem, teaching criminal methods, harming marriage and family, bigamy, disrupting a soldier's marriage, maltreatment, abandonment, abducting children, dereliction of duty, accepting bribes, offering bribes, introducing bribery, divulging state secrets, neglect of duty, bending the law for personal gain, unauthorized release of criminals, sabotaging posts and telecommunications, servicemen's breach of duty, drug smuggling, drug trafficking, and drug transportation.
Further, the professional high-frequency words of administrative trial documents include one or more of: administrative trial, administrative case, state administrative organ, state public servant, administrative act, administrative liability, disciplinary sanction, administrative penalty, disciplinary management, administrative compensation, administrative reconsideration, public security, customs, commodity inspection, land management, geology and mineral resources, energy, administrative enforcement, traffic inspection, protection of wild fauna and flora, family planning, fishery, salt industry, water resources, salt administration, administrative compulsory measures, road administration, technical supervision, patent, animal husbandry, housing demolition, river course, posts and telecommunications, science and technology, transportation, health, medicine, environmental protection, industry, economy and trade, agriculture, forestry, culture, education, statistics, sports, civil affairs, urban planning, urban and rural construction, metrology, pricing, industry and commerce, labor, cultural relics, finance, audit, taxation, water conservancy, business administration, railway, civil aviation, personnel, press and publication, radio, film and television, tourism, and meteorology.
Further, removing stop words from the legal trial documents and segmenting them into words according to the legal domain vocabulary includes:
customizing a stop-word dictionary and a word segmentation dictionary using the legal domain vocabulary;
removing stop words from the legal trial documents and segmenting them into words with the Java Chinese word segmenter Ansj, according to the customized stop-word dictionary and word segmentation dictionary.
Further, iteratively classifying the segmented legal trial document data, extracting the topic words of each class in each round of classification, and dividing the topic words of each class according to the order of the classification levels to obtain the hierarchical relations between topic words, includes:
using the segmented legal trial document data, performing topic classification on the legal trial documents with an LDA topic model, and extracting the topic words of each class;
continuing to apply the LDA topic model to the legal trial documents under each class to perform topic classification, extracting the topic words of each class, and dividing the topic words of each class according to the order of the classification levels, until a preset iteration termination condition is met, at which point the iteration stops.
Further, performing topic classification on the legal trial documents with the LDA topic model and extracting the topic words of each class includes:
selecting a topic for each legal trial document according to its maximum topic probability;
classifying the legal trial documents by the selected topic, and obtaining the topic words of each class from the LDA topic model.
Further, stopping the iteration once the preset iteration termination condition is met includes:
while the LDA topic model is applied iteratively to classify by topic and extract topic words, judging whether, in the topic classification result, a preset proportion of the data has a maximum topic probability below a preset threshold;
if so, determining that classification by topic cannot continue, and stopping the iteration.
Further, building the legal ontology knowledge base from the extracted topic words and the hierarchical relations between them includes:
taking the extracted topic words as the ontologies of the legal ontology knowledge base and the hierarchical relations obtained between topic words as the child-parent relations of the legal ontology knowledge base, forming triples of the form (ontology, child-parent relation, ontology), and thereby completing the construction of the legal ontology knowledge base.
The beneficial effects of the above technical solution of the present invention are as follows:
In the above solution, the professional high-frequency words in legal trial documents are counted using web legal text data, and a legal domain vocabulary is built from the professional high-frequency words obtained; stop words are removed from the legal trial documents, which are then segmented into words, according to the legal domain vocabulary; the segmented legal trial document data are classified iteratively, the topic words of each class in each round of classification are extracted, and the topic words of each class are divided according to the order of the classification levels to obtain the hierarchical relations between topic words; and the legal ontology knowledge base is built from the extracted topic words and the hierarchical relations between them.
Brief description of the drawings
Fig. 1 is a schematic flowchart of the method for automatically constructing a legal ontology knowledge base provided by an embodiment of the present invention;
Fig. 2 is a schematic diagram of the entity words and their relational hierarchy in the legal ontology knowledge base obtained by the first classification provided by an embodiment of the present invention;
Fig. 3 is a schematic diagram of the entity words and their relational hierarchy obtained by performing topic classification on class 0 of the first classification, provided by an embodiment of the present invention;
Fig. 4 is a schematic diagram of the entity words and their relational hierarchy obtained by performing topic classification on class 1 of the first classification, provided by an embodiment of the present invention;
Fig. 5 is a schematic diagram of the entity words and their relational hierarchy obtained by performing topic classification on class 2 of the first classification, provided by an embodiment of the present invention;
Fig. 6 is a schematic diagram of the entity words and their relational hierarchy obtained by performing topic classification on class 0 of the second classification, provided by an embodiment of the present invention.
Detailed description of the embodiments
To make the technical problem to be solved, the technical solution and the advantages of the present invention clearer, a detailed description is given below with reference to the accompanying drawings and specific embodiments.
In view of the existing lack of a legal ontology knowledge base, the present invention provides a method for automatically constructing a legal ontology knowledge base.
As shown in Fig. 1, the method for automatically constructing a legal ontology knowledge base provided by an embodiment of the present invention includes:
Step 101: counting the professional high-frequency words in legal trial documents using web legal text data, and building a legal domain vocabulary from the professional high-frequency words obtained;
Step 102: removing stop words from the legal trial documents and segmenting them into words according to the legal domain vocabulary;
Step 103: iteratively classifying the segmented legal trial document data, extracting the topic words of each class in each round of classification, and dividing the topic words of each class according to the order of the classification levels to obtain the hierarchical relations between topic words;
Step 104: building the legal ontology knowledge base from the extracted topic words and the hierarchical relations between them.
In the method for automatically constructing a legal ontology knowledge base described in this embodiment of the present invention, the professional high-frequency words in legal trial documents are counted using web legal text data, and a legal domain vocabulary is built from the professional high-frequency words obtained; stop words are removed from the legal trial documents, which are then segmented into words, according to the legal domain vocabulary; the segmented legal trial document data are classified iteratively, the topic words of each class in each round of classification are extracted, and the topic words of each class are divided according to the order of the classification levels to obtain the hierarchical relations between topic words; and the legal ontology knowledge base is built from the extracted topic words and the hierarchical relations between them.
In a specific embodiment of the foregoing method for automatically constructing a legal ontology knowledge base, further, counting the professional high-frequency words in legal trial documents using web legal text data, and building a legal domain vocabulary from the professional high-frequency words obtained, includes:
counting the professional high-frequency words of civil, criminal and administrative trial documents using web legal text data;
building the legal domain vocabulary from the counted professional high-frequency words of civil, criminal and administrative trial documents, in combination with a preset input method dictionary.
In this embodiment, web legal text data are first acquired, and the acquired data are used to count the civil, criminal and administrative professional high-frequency words. A high-frequency word is a word whose number of occurrences or number of uses exceeds a predetermined threshold. The legal domain vocabulary is then built in combination with a preset input method dictionary; preferably, the preset input method dictionary is the Sogou input method dictionary.
In this embodiment, 268 professional high-frequency words of civil, criminal and administrative trial documents were obtained by counting. In practical applications, the professional high-frequency words obtained depend on the web legal text data acquired.
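The counting step described above can be sketched as follows. This is a minimal illustration under stated assumptions, not the patent's implementation: the candidate term list, the threshold value, the sample texts and the function names `count_terms` and `build_legal_vocabulary` are all hypothetical, and occurrences are counted by plain substring matching.

```python
from collections import Counter


def count_terms(texts, candidate_terms):
    """Count how often each candidate term occurs across all texts.

    Assumption: occurrences are counted by non-overlapping substring
    matching, so no word segmentation is needed at this stage.
    """
    counts = Counter()
    for text in texts:
        for term in candidate_terms:
            counts[term] += text.count(term)
    return counts


def build_legal_vocabulary(texts, candidate_terms, min_count, base_dictionary):
    """Keep terms occurring more than min_count times, then merge them
    with a base dictionary (standing in for the input method dictionary)."""
    counts = count_terms(texts, candidate_terms)
    high_freq = {term for term, n in counts.items() if n > min_count}
    return high_freq | set(base_dictionary)


# Toy data: 民事案件 = civil case, 刑事案件 = criminal case, 合同 = contract,
# 原告 = plaintiff, 被告人 = defendant.
texts = ["民事案件 原告 民事案件", "刑事案件 被告人", "民事案件 合同"]
vocab = build_legal_vocabulary(
    texts,
    candidate_terms=["民事案件", "刑事案件", "合同"],
    min_count=1,
    base_dictionary=["原告", "被告人"],
)
print(sorted(vocab))
```

With the threshold set to 1, only the term that occurs three times survives the frequency filter; the base dictionary entries are added regardless of frequency.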
In a specific embodiment of the foregoing method for automatically constructing a legal ontology knowledge base, further, the professional high-frequency words of civil trial documents include one or more of: civil trial, civil case, civil dispute, civil rights and interests, property, person, citizen, minor, capacity for civil rights, capacity for civil conduct, guardianship, declaration of a missing person, declaration of death, voter qualification, determination that property is ownerless, return of property, self-employed business, rural contracting household, creditor's rights, debt, individual partnership, legal person, enterprise legal person, social organization legal person, joint operation, juristic act, civil rights, civil liability, agency, agency by agreement, statutory agency, designated agency, property ownership, lien, sale, lease, mortgage or transfer, creditor-debtor relationship, unjust enrichment, intellectual property, copyright, patent right, exclusive right to use a trademark, personal rights, right to health, right of name, right of portrait, right of fame, right of discovery, inventor's patent right, right of reputation, marriage, divorce, upbringing, adoption, fostering, estate, succession, will, bequest, legacy-support agreement, real estate, and house leasing.
In a specific embodiment of the foregoing method for automatically constructing a legal ontology knowledge base, further, the professional high-frequency words of criminal trial documents include one or more of: criminal, criminal trial, criminal case, crime, major case, economic crime, juvenile delinquency, analogy in criminal law, criminal responsibility, object of crime, subject of crime, criminal capacity, age of criminal responsibility, corporate crime, justifiable defense, necessity (emergency avoidance of danger), accomplished crime, preparation for a crime, attempted crime, discontinuation of a crime, joint crime, criminal group, punishment, public surveillance, criminal detention, fixed-term imprisonment, life imprisonment, death penalty, fine, deprivation of political rights, confiscation of property, deportation, sentencing, recidivist, voluntary surrender, combined punishment for several crimes, calculation of the term of sentence, offsetting of the term of sentence, suspended sentence, commutation, parole, limitation of prosecution, pardon, counter-revolution, organizing a jailbreak, spy, secret agent, arson, explosion, poisoning, endangering public security, sabotaging vehicles, sabotaging transportation equipment, sabotaging communication equipment, sabotaging electric power equipment, traffic accident, major liability accident, smuggling, speculation and profiteering, illegal arbitrage of foreign exchange, counterfeiting currency, forging valuable securities, forging valuable tickets, tax evasion, refusal to pay taxes, counterfeiting trademarks, Holiday culture, illegal felling of trees, excessive felling of trees, intentional homicide, negligent homicide, intentional injury, negligently causing serious injury, prostitution and whoring, extorting a confession by torture, false accusation, rape, sexual intercourse with an underage girl, prostitution, abducting and selling people, kidnapping women and children, trafficking in women and children, illegal detention, bag job, insult, slander, retaliation and frame-up, perjury, infringing freedom of correspondence, sabotaging elections, property infringement, robbery, snatching, extortion, theft, habitual thief, fraud, habitual swindler, embezzlement, misappropriation of public funds, kidnapping, obstructing official duties, refusal to carry out judgments and rulings, disturbing public order, hooliganism, escape, harboring, shielding, manufacturing and selling counterfeit drugs, swindling by false pretences, gambling, obscene articles, narcotics, harboring stolen goods, selling stolen goods, destroying precious cultural relics, illegally crossing the national border, insulting the national flag or national emblem, teaching criminal methods, harming marriage and family, bigamy, disrupting a soldier's marriage, maltreatment, abandonment, abducting children, dereliction of duty, accepting bribes, offering bribes, introducing bribery, divulging state secrets, neglect of duty, bending the law for personal gain, unauthorized release of criminals, sabotaging posts and telecommunications, servicemen's breach of duty, drug smuggling, drug trafficking, and drug transportation.
In a specific embodiment of the foregoing method for automatically constructing a legal ontology knowledge base, further, the professional high-frequency words of administrative trial documents include one or more of: administrative trial, administrative case, state administrative organ, state public servant, administrative act, administrative liability, disciplinary sanction, administrative penalty, disciplinary management, administrative compensation, administrative reconsideration, public security, customs, commodity inspection, land management, geology and mineral resources, energy, administrative enforcement, traffic inspection, protection of wild fauna and flora, family planning, fishery, salt industry, water resources, salt administration, administrative compulsory measures, road administration, technical supervision, patent, animal husbandry, housing demolition, river course, posts and telecommunications, science and technology, transportation, health, medicine, environmental protection, industry, economy and trade, agriculture, forestry, culture, education, statistics, sports, civil affairs, urban planning, urban and rural construction, metrology, pricing, industry and commerce, labor, cultural relics, finance, audit, taxation, water conservancy, business administration, railway, civil aviation, personnel, press and publication, radio, film and television, tourism, and meteorology.
In a specific embodiment of the foregoing method for automatically constructing a legal ontology knowledge base, further, removing stop words from the legal trial documents and segmenting them into words according to the legal domain vocabulary includes:
customizing a stop-word dictionary and a word segmentation dictionary using the legal domain vocabulary;
removing stop words from the legal trial documents and segmenting them into words with the Java Chinese word segmenter Ansj, according to the customized stop-word dictionary and word segmentation dictionary.
In this embodiment, the legal domain vocabulary can be used to customize the stop-word dictionary NewWordFilter.dic and the word segmentation dictionary LegalWord.dic. According to the customized stop-word dictionary NewWordFilter.dic and word segmentation dictionary LegalWord.dic, the Java Chinese word segmenter Ansj removes stop words from the legal trial documents and segments them into words, in preparation for automatically building the legal ontology knowledge base.
In this embodiment, the Chinese word segmenter Ansj is implemented in Java, and the code that removes stop words using the customized stop-word dictionary is as follows:
In this embodiment, the code that calls the customized word segmentation dictionary is as follows:
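The Java listings referred to above are not present in this text. The following is not Ansj code; it is a minimal Python sketch of the same two ideas, dictionary-driven segmentation and stop-word filtering, under stated assumptions. Forward maximum matching is swapped in as a simple stand-in for Ansj's segmentation algorithm, the function names and the `max_len` parameter are hypothetical, and the sketch filters stop words after segmenting, since the order used by the original pipeline is not shown here. In practice the two word sets would be loaded from files such as LegalWord.dic and NewWordFilter.dic, whose format is also not shown.

```python
def segment(text, dictionary, max_len=6):
    """Forward maximum matching: at each position take the longest
    substring found in the dictionary; fall back to a single character."""
    tokens = []
    i = 0
    while i < len(text):
        for size in range(min(max_len, len(text) - i), 0, -1):
            piece = text[i:i + size]
            if size == 1 or piece in dictionary:
                tokens.append(piece)
                i += size
                break
    return tokens


def remove_stop_words(tokens, stop_words):
    """Drop every token that appears in the stop-word set."""
    return [t for t in tokens if t not in stop_words]


# Toy dictionaries: 原告 = plaintiff, 被告 = defendant, 合同 = contract;
# 的 and 和 are common function words used here as stop words.
legal_words = {"原告", "被告", "合同"}
stop_words = {"的", "和"}

tokens = segment("原告和被告的合同", legal_words)
filtered = remove_stop_words(tokens, stop_words)
print(tokens)
print(filtered)
```

The domain dictionary matters because it keeps multi-character legal terms intact as single tokens, so that they can later surface as topic words.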
In this embodiment, using the segmented legal trial document data, a Latent Dirichlet Allocation (LDA) topic model can be applied to classify the legal trial documents by topic and to extract the topic words of each class. The LDA topic model is then applied again to the legal trial documents under each class to classify them by topic and extract the topic words of each class, while the topic words of each class are divided according to the order of the classification levels, until a preset iteration termination condition is met, at which point the iteration stops.
In this embodiment, LDA is a document topic generation model, also known as a three-layer Bayesian probability model, comprising a three-layer structure of words, topics and documents. Each word of each document is obtained by a process of "selecting a topic with a certain probability, and then selecting a word from that topic with a certain probability". The distribution of topics in a document is multinomial, and the distribution of words in a topic is multinomial. The topic words of a document can thus be obtained through its topics. The LDA topic model is therefore used to select, for each legal trial document, the topic with the maximum probability; the legal trial documents are then classified by topic, and the topic words of each class are obtained from the LDA topic model. Next, the LDA topic model is applied separately to the trial documents under each class to classify them by topic and obtain the topic words of each class (subclass). Likewise, after that classification, the LDA topic model continues to be applied within each class (subclass) to classify and obtain the topic words of each class, and the topic words of each class are divided according to the order of the classification levels.
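The iterative procedure described above can be sketched as follows. This is an illustration under stated assumptions rather than the patent's implementation. The LDA fit is abstracted as a hypothetical callable `fit_lda(docs)` that returns a document-topic probability matrix and a list of topic words per topic. The values of `prob_threshold` and `doc_ratio` are arbitrary, and `max_depth` is an extra safeguard that does not come from the source.

```python
def assign_by_max_topic(doc_topic):
    """For each document, return (index of most probable topic, its probability)."""
    result = []
    for probs in doc_topic:
        best = max(range(len(probs)), key=lambda k: probs[k])
        result.append((best, probs[best]))
    return result


def should_stop(doc_topic, prob_threshold, doc_ratio):
    """Termination test: True when at least doc_ratio of the documents
    have a maximum topic probability below prob_threshold."""
    assigned = assign_by_max_topic(doc_topic)
    weak = sum(1 for _, p in assigned if p < prob_threshold)
    return weak >= doc_ratio * len(assigned)


def build_hierarchy(docs, fit_lda, prob_threshold, doc_ratio, max_depth):
    """Recursively classify docs by topic and collect topic words per class.

    fit_lda is a hypothetical callable: docs -> (doc_topic, topic_words).
    Assumption: a round that fails the termination test is discarded.
    """
    if max_depth == 0 or not docs:
        return []
    doc_topic, topic_words = fit_lda(docs)
    if should_stop(doc_topic, prob_threshold, doc_ratio):
        return []
    groups = {}
    for doc, (topic, _) in zip(docs, assign_by_max_topic(doc_topic)):
        groups.setdefault(topic, []).append(doc)
    return [
        {
            "topic_words": topic_words[k],
            "children": build_hierarchy(
                groups[k], fit_lda, prob_threshold, doc_ratio, max_depth - 1
            ),
        }
        for k in sorted(groups)
    ]
```

A real `fit_lda` would wrap an LDA library of the reader's choice. Keeping it as a parameter leaves the assignment and termination logic independent of any particular library.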
In this embodiment, the LDA topic model is used to perform the first classification on 2000 civil, criminal and administrative trial documents and to obtain the topic words of each class. The number of topics is set to 3 (that is, each round of classification produces 3 classes), and the 20 words with the highest probability are taken as the topic words of each class. After classification, the topic words under the three topics obtained, with their probabilities, are as follows:
topic 0:
Appeal 0.020716457203748394
Defendant 0.015358624096518602
Land 0.013313991406572972
Compensation 0.011940953395806564
Management 0.011463374957279117
Removal 0.009224726026681711
People's court 0.009090407090845867
In accordance with the law 0.008612828652318421
Judgement 0.008582979999910454
Trial 0.008299417802034784
Government 0.008045704256567077
Plaintiff 0.00783676368971132
Ruling 0.007791990711099372
Assert 0.007642747449059544
Confirm 0.007045774400900236
The people 0.006702514898208634
Interim 0.00662789326718872
Certain person 0.006553271636168806
RMB 0.006329406743109066
Standard 0.006210012133477204
topic 1:
Company 0.06931993926904303
Lawsuit 0.020827735689207656
This case 0.017832326507314325
Second instance 0.014348395656271011
Review 0.01391129732071576
Contract 0.01370560398633682
Regulation 0.013242793983984202
Civil 0.012240038978886866
Judgement 0.01215004814509608
The first sentence 0.011597247308952674
Law court 0.011584391475553991
Application 0.010825897305031645
Applicant 0.010530213136861918
People's court 0.010183105635097455
Law 0.009643160632352734
Evidence 0.008936089795425125
Agency 0.008370433125883035
Assert 0.007984758123922521
Agreement 0.007907623123530418
Co., Ltd 0.007869055623334367
topic 2:
Trade mark 0.04003179012285558
Company 0.028305286471471677
Appeal 0.027311845132234835
The first sentence 0.021623906484055266
Application 0.01940327290223174
Prosecution 0.016052843287550623
Birth 0.013559500318485606
Evidence 0.01324783244735248
Tourism 0.01194272323698251
It is required that 0.01188428551114505
Regulation 0.011221991284987154
Patent 0.011144074317203873
Administration 0.010910323413854027
Right 0.010715530994395824
Judgement 0.010248029187696133
Technology 0.01018959146185867
Dispute 0.01017011221991285
The committee 0.009975319800454647
Product 0.009605214203484059
Assert 0.009468859509863314
In this embodiment, the entity words obtained by the first classification and their hierarchical relationships in the legal ontology knowledge base are shown in Fig. 2.
In this embodiment, the documents of topic 0, topic 1 and topic 2 are divided into three categories, denoted category 0, category 1 and category 2 respectively, and a second round of classification is then performed on each category.
In this embodiment, in the second classification, topic classification is performed on category 0 obtained by the first classification; the topic words under the three resulting topics and their probabilities are as follows:
topic 0:
Compensation 0.025827294598994424
Management 0.02316208779335283
Appeal 0.022909973636062408
Soil 0.021361272384135536
Removal 0.020857044069554694
Plaintiff 0.017471511100226184
Company 0.016643136011986227
Interim 0.01581476092374627
Pay 0.01549061415008716
The first sentence 0.014518173829109821
Contract 0.012933456268998602
Defendant 0.012825407344445566
Government 0.012465244262602107
Standard 0.012465244262602107
Administration 0.011744918098915189
Administration commission 0.01156483655799346
Agreement 0.010808494086122196
This case 0.010628412545200467
Agreement 0.010052151614250934
Regulation 0.00972800484059182
topic 1:
Defendant 0.039311415137042616
People's court 0.020743457335267113
Ruling 0.020466323636733147
Trial 0.019773489390398243
Criminal 0.01857257669675107
Criminal 0.015662672862144462
Judgement 0.015246972314343517
In accordance with the law 0.014923649666053893
Sentence 0.014646515967519931
Perform 0.014046059620696345
The People's Republic of China (PRC) 0.011828990032424644
Assert 0.01155185633389068
The injured party 0.010443321539754828
Office 0.009750487293419922
Law 0.0096581093939086
Intentional 0.0096581093939086
Appeal 0.00961192044415294
Fixed-term imprisonment 0.009427164645130298
Occur 0.009057653047085015
Judge 0.009011464097329355
topic 2:
Company 0.025217530195625056
Appeal 0.01833414463476224
Bank 0.012971949253095623
Without 0.01137469956493961
Prove 0.010804253247741032
RMB 0.010576074720861603
Testimony 0.010461985457421887
Situation 0.010119717667102741
Project 0.009929568894703216
Engineering 0.009207003559585019
Confirm 0.008902765523745778
Account 0.008560497733426632
Witness 0.008294289452067297
Evidence 0.008028081170707961
Cash 0.007571724116949101
Wuqi County 0.007533694362469196
Work 0.0071533968176701445
Defender 0.0065829505004715684
Post 0.006468861237031854
Assert 0.006392801728072043
In this embodiment, topic classification is performed on category 0 obtained by the first classification; the resulting entity words and their hierarchical relationships are shown in Fig. 3.
In this embodiment, in the second classification, topic classification is performed on category 1 obtained by the first classification; the topic words under the three resulting topics and their probabilities are as follows:
topic 0:
Lawsuit 0.03758776353541387
Second trial 0.03037409869990828
Review 0.03030881666519782
Applicant 0.023927497772250565
This case 0.02302986979498177
Application 0.02226280588713389
Regulation 0.019863691111524563
The first sentence 0.019618883481360348
Civil 0.019178229747064755
Judgement 0.0167464739541002
Law court 0.015783563942120948
The People's Republic of China (PRC) 0.015147064103693984
Company 0.014788012912786466
Timeliness 0.013939346461550515
Appeal 0.013727179848741526
Evidence 0.013319167131801165
Law 0.013221244079735479
Agency 0.012796910854117502
Period 0.01253578271527567
People's court 0.01206248796362485
topic 1:
Company 0.09230282265274267
Engineering 0.016336205438475256
People's court 0.014668999144650686
Contract 0.014209913353597543
There is provided 0.01203529644860897
Prove 0.010295602924618114
Perform 0.010223115694451829
Co., Ltd 0.009739867493343256
Product 0.008894183141403257
Senior 0.0085075845805164
Arbitration 0.007541088178299257
Judgement 0.007444438538077543
Agreement 0.007444438538077543
Assert 0.0073236264878004
Ruling 0.007251139257634114
This case 0.007251139257634114
Sign 0.007178652027467828
House 0.007106164797301543
Evidence 0.007106164797301543
Appeal 0.007082002387246114
topic 2:
Company 0.07835860181090756
Contract 0.019682281289988175
Transfer the possession of 0.01802201273142524
Agreement 0.013324667541344738
Property 0.013243678831170936
Both sides 0.012231319953998414
Pay 0.011785882048042503
Soil 0.011502421562434197
Judgement 0.010530557040348576
Regulation 0.010145860667023018
This case 0.010064871956849216
Technology 0.009740917116154008
Evidence 0.008546333641090432
Common 0.008303367510569027
Law 0.008262873155482126
Agreement 0.00797941266987382
Man and wife 0.007817435249526217
Fund 0.007371997343570307
Prove 0.007108784035505451
Application 0.006724087662179893
In this embodiment, topic classification is performed on category 1 obtained by the first classification; the resulting entity words and their hierarchical relationships are shown in Fig. 4.
In this embodiment, in the second classification, topic classification is performed on category 2 obtained by the first classification; the topic words under the three resulting topics and their probabilities are as follows:
topic 0:
Patent 0.046073634052009656
It is required that 0.03962050471163914
Right 0.037270027137332
Technology 0.03427851022457745
Company 0.023081689779696147
Product 0.01735507168956602
Evidence 0.015132801982948354
Application 0.01457723455629394
Disclose 0.014363554776811472
Feature 0.012825060364537705
Specification 0.012269492937883288
Judgement 0.011030150216884978
Second trial 0.010944678305091991
Determine 0.009833543451783158
Review 0.009705335584093677
Assert 0.009662599628197183
Examine 0.009577127716404196
Design 0.009064296245646274
Effect 0.008807880510267313
Patent right 0.00863693668668134
topic 1:
Trade mark 0.1053547720902788
Company 0.04951387305716853
Application 0.02668937494132951
Dispute 0.020493770869932548
The committee 0.01821400313803323
This case 0.013976317236385094
Evidence 0.013466722096313482
Second trial 0.013252155721546487
Assert 0.012796202175166626
Judgement 0.01239389022247851
Review 0.012045219863482145
Regulation 0.011991578269790396
Ruling 0.01183065348871515
Objection 0.010865104802263676
Constitute 0.010194584881116818
Applicant 0.00957770655366171
The People's Republic of China (PRC) 0.009524064959969962
Correlation 0.009443602569432339
Administration 0.00882672424197723
Submit 0.00845123308613499
topic 2:
Appeal 0.07846932751310445
The first sentence 0.049236989462761865
Prosecution 0.04644614201119917
Birth 0.04186628978299372
Tourism 0.03871764137610247
Administration 0.01696334329212659
Plaintiff 0.016784442814462317
Company 0.015603699661878098
Punishment 0.012741292019249692
Regulation 0.011560548866665474
First trial 0.011417428484534054
Science and technology 0.010916507147074083
Behavior 0.010129345045351273
Defendant 0.009449523230227026
Make 0.009127502370431329
Travel agency 0.007803638835715691
Prove 0.006730235969730039
Legal 0.006336654918868633
Carry out 0.006336654918868633
Determine 0.006300874823335778
In this embodiment, topic classification is performed on category 2 obtained by the first classification; the resulting entity words and their hierarchical relationships are shown in Fig. 5.
In this embodiment, for the third classification, the topic 0 category (category 0) obtained by the second classification is taken as an example: topic classification is performed on category 0 obtained by the second classification, and the classification results are as follows:
topic 0:
Removal 0.02838756716488881
Management 0.02680255503954605
Compensation 0.026327051401943222
Appeal 0.02490054048913474
Interim 0.019987002900572187
Plaintiff 0.019352998050435083
Pay 0.017767985925092326
The first sentence 0.01681697864988667
Company 0.015390467737078186
Contract 0.015390467737078186
Administration commission 0.015231966524543911
Defendant 0.014597961674406808
Standard 0.014439460461872531
Agreement 0.013171450761598324
House 0.013171450761598324
Agreement 0.012378944698926945
Area 0.01142793742372129
Government 0.010952433786118463
Fulfil 0.010635431361049911
This case 0.010635431361049911
topic 1:
Defendant 0.03511978747776071
People's court 0.019034388632984813
In accordance with the law 0.018790670468670025
Trial 0.018303234140040453
Judgement 0.015622334332577804
Ruling 0.015622334332577804
Criminal 0.015134898003948233
Sentence 0.013428870853744728
Assert 0.01269771636080037
The People's Republic of China (PRC) 0.01196656186785601
The injured party 0.01196656186785601
Perform 0.010747971046282079
Criminal 0.010504252881967293
Appeal 0.010260534717652507
RMB 0.010016816553337719
Fixed-term imprisonment 0.009773098389022933
Crime 0.009529380224708147
Opinion 0.009529380224708147
Judge 0.00928566206039336
Confirm 0.009041943896078574
topic 2:
Wuqi County 0.040996668115312176
Engineering 0.03354649117360981
Project 0.030028352062250362
Company 0.027544959748349574
Build 0.015955795616812565
Science and technology 0.01574884625732083
Situation 0.011816808426977918
Post 0.011816808426977918
Without 0.011609859067486185
Agree to 0.011609859067486185
Account 0.010989010989010988
Testimony 0.010989010989010988
Appeal 0.010989010989010988
Contract 0.010989010989010988
Bribe 0.010782061629519257
Hunan 0.009747314832060594
Manager 0.008091719956126736
Investment 0.007884770596635003
Investment 0.007470871877651539
Dividend 0.007263922518159806
In this embodiment, topic classification is performed on category 0 obtained by the second classification; the resulting hierarchical relationships are shown in Fig. 6.
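The three rounds above follow one recursive pattern: classify, record the topic words of each category, then classify each category again. The following Python sketch shows only that recursion skeleton. It is an illustration under stated assumptions: `build_hierarchy` is a hypothetical name, and `toy_classify` is a stand-in that splits toy documents by a label instead of running LDA; a real round would return the LDA categories and their topic words in the same shape.

```python
def build_hierarchy(docs, classify, name="root"):
    """Recursively classify docs and record each category's topic words.

    classify(docs) is a hypothetical stand-in for one LDA round. It returns
    None when no further split should be made, otherwise a dict that maps
    topic_id -> (topic_words, member_docs).
    """
    node = {"name": name, "words": [], "children": []}
    result = classify(docs)
    if result is None:
        return node
    for topic_id in sorted(result):
        words, members = result[topic_id]
        child = build_hierarchy(members, classify, f"{name}.{topic_id}")
        child["words"] = words
        node["children"].append(child)
    return node


def toy_classify(docs):
    """Toy replacement for LDA: group label tuples by their first label."""
    if len(docs) < 2 or not all(docs):
        return None  # too few documents, or no labels left to split on
    groups = {}
    for doc in docs:
        groups.setdefault(doc[0], []).append(doc[1:])
    ordered = sorted(groups.items())
    return {i: ([label], members) for i, (label, members) in enumerate(ordered)}


docs = [("civil", "contract"), ("civil", "property"), ("criminal", "theft")]
tree = build_hierarchy(docs, toy_classify)
for child in tree["children"]:
    print(child["name"], child["words"], [c["words"] for c in child["children"]])
```

Passing the classifier in as a function keeps the recursion independent of how a single round is carried out, so the toy splitter can be swapped for an actual topic model.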
Further, in the foregoing embodiment of the method for automatically constructing a legal ontology knowledge base, stopping the iteration once the preset iteration termination condition is met includes:
In the process of iteratively applying the LDA topic model to classify by topic and extract topic words, judging whether, in the topic classification result, a preset proportion of the data has a maximum topic probability less than a preset threshold;
If so, determining that classification by topic cannot continue, and stopping the iteration.
In this embodiment, in the LDA topic model each document represents a probability distribution over some topics, and each topic represents a probability distribution over many words. A document can therefore be regarded as selecting a topic with a certain probability, with that topic in turn corresponding to a distribution over words; in this embodiment, the topic with the maximum selection probability for a document is taken as the topic of that document. Accordingly, in the process of iteratively applying the LDA topic model to classify by topic and extract topic words, reasonable critical values need to be set. For example, assuming that the preset proportion is 30% and the preset threshold is 0.5, it is judged whether 30% of the data in the topic classification result has a maximum topic probability less than 0.5; if so, it is determined that classification by topic cannot continue, and the iteration is stopped.
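The termination check can be sketched as follows, using the example values of 30% and 0.5 from this embodiment. The function name `should_stop` is hypothetical, and two details are assumptions not stated in the text: the proportion is read as "at least 30%", and an empty category is treated as a reason to stop.

```python
def should_stop(doc_topic, ratio=0.30, threshold=0.5):
    """Return True when enough documents lack a clearly dominant topic.

    doc_topic holds one list of topic probabilities per document. The
    iteration stops when at least `ratio` of the documents have a maximum
    topic probability below `threshold`.
    """
    if not doc_topic:
        return True  # assumption: nothing left to classify
    weak = sum(1 for probs in doc_topic if max(probs) < threshold)
    return weak / len(doc_topic) >= ratio


# Hypothetical document-topic probabilities for two categories.
confident = [[0.8, 0.1, 0.1], [0.6, 0.3, 0.1], [0.7, 0.2, 0.1], [0.4, 0.3, 0.3]]
mixed = [[0.40, 0.30, 0.30], [0.45, 0.35, 0.20], [0.90, 0.05, 0.05]]

print(should_stop(confident))  # 1 of 4 documents is below 0.5
print(should_stop(mixed))      # 2 of 3 documents are below 0.5
```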
Further, in the foregoing embodiment of the method for automatically constructing a legal ontology knowledge base, building the legal ontology knowledge base according to the extracted topic words and the hierarchical relationships obtained between the topic words includes:
Taking the extracted topic words as the ontologies of the legal ontology knowledge base and the hierarchical relationships obtained between the topic words as the sub-parent relations of the legal ontology knowledge base, forming triples of the structure ontology-sub-parent relation-ontology, and thereby completing the construction of the legal-domain ontology knowledge base.
In this embodiment, the topic words extracted during iterative classification are taken as the ontologies of the legal ontology knowledge base, and the hierarchical relationships between the topic words obtained in each round of classification are taken as the sub-parent relations of the legal ontology knowledge base, forming triples of the structure ontology (entity word)-sub-parent relation-ontology (entity word), so that the legal ontology knowledge base is built automatically.
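As a rough illustration of the triple structure, the sketch below flattens a nested word hierarchy into (child word, relation, parent word) triples. The function name `tree_to_triples`, the nested-dictionary representation, the direction of the triple and the toy hierarchy itself are all assumptions made for the example; the text does not specify how the triples are stored or which words are linked to which.

```python
def tree_to_triples(tree, relation="sub-parent relation"):
    """Flatten a nested {parent: {child: {...}}} mapping into triples.

    Each triple is (child_word, relation, parent_word).
    """
    triples = []
    for parent, children in tree.items():
        for child in children:
            triples.append((child, relation, parent))
        # Descend one level so that grandchildren are linked to children.
        triples.extend(tree_to_triples(children, relation))
    return triples


# Hypothetical hierarchy of entity words, for illustration only.
toy = {"Company": {"Contract": {"Agreement": {}}, "Trade mark": {}}}

triples = tree_to_triples(toy)
for triple in triples:
    print(triple)
```

Triples of this kind can be loaded into most graph or RDF-style stores, which is one reason a flat list is a convenient intermediate form.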
The method for automatically constructing a legal ontology knowledge base described in this embodiment is applicable not only to the construction of a legal-domain ontology knowledge base but also to the construction of ontology knowledge bases for other specific domains. Specifically, the professional high-frequency words of the other specific domain are counted, a vocabulary for that domain is built, and the ontology knowledge base of that domain is built automatically on the basis of the vocabulary.
The above is the preferred embodiment of the present invention. It should be noted that those of ordinary skill in the art may make several improvements and modifications without departing from the principle of the present invention, and these improvements and modifications shall also be regarded as falling within the protection scope of the present invention.
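Counting high-frequency words for a new domain could look like the minimal sketch below. It assumes the texts are already tokenized and uses a plain count threshold; the function name `build_domain_vocabulary`, the `min_count` parameter and the sample tokens are hypothetical, and the text does not say how professional words are separated from general high-frequency words.

```python
from collections import Counter


def build_domain_vocabulary(token_lists, min_count=2):
    """Return words seen at least min_count times, most frequent first."""
    counts = Counter(token for tokens in token_lists for token in tokens)
    return [word for word, count in counts.most_common() if count >= min_count]


# Hypothetical tokenized texts from another domain (medicine).
texts = [
    ["patient", "diagnosis", "ward"],
    ["diagnosis", "treatment"],
    ["diagnosis", "patient"],
]

vocabulary = build_domain_vocabulary(texts)
print(vocabulary)
```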

Claims (10)

1. A method for automatically constructing a legal ontology knowledge base, characterized by comprising:
Counting the professional high-frequency words in legal trial documents using online legal text data, and building a legal-domain vocabulary from the professional high-frequency words obtained by the counting;
Removing stop words from the legal trial documents and performing word segmentation on them according to the built legal-domain vocabulary;
Iteratively classifying the legal trial document data on the basis of the segmented legal trial document data, extracting the topic words of each category in each round of classification, and arranging the topic words of each category according to the order of the classification levels to obtain the hierarchical relationships between the topic words;
Building the legal ontology knowledge base according to the extracted topic words and the hierarchical relationships obtained between the topic words.
2. The method for automatically constructing a legal ontology knowledge base according to claim 1, characterized in that counting the professional high-frequency words in legal trial documents using online legal text data, and building a legal-domain vocabulary from the professional high-frequency words obtained by the counting, includes:
Counting the professional high-frequency words of civil, criminal and administrative trial documents using online legal text data;
Building the legal-domain vocabulary from the counted professional high-frequency words of civil, criminal and administrative trial documents, in combination with a preset input method dictionary.
3. Legal ontology knowledge base method for auto constructing according to claim 2, it is characterised in that the civil justice text Book specialist high frequency word includes:Civil justice, civil case, civil disputation, civil rights and interests, property, the person, citizen, minor, Capacity for civil rights, capacity for civil acts, monitoring, adjudication of disappearance, the declaration of death, voting qualification, identification property dereliction, return wealth Production, self-employed entrepreneur, leaseholding farm households, credits, debt, individual partnership, legal person, business entity, corporate body, connection Battalion, act in-the-law, civil right, civil liability, agency, agency by agreement, legal agency, authorized agency, property right, Lien, be sold, leased, mortgaged or transferred, debtor-creditor relationship, unjustified enrichment, intellectual property, copyright, patent right, trade mark it is special Power, personal rights, the right of health, the right of name, portraiture right, the right of fame, the right of discovery, inventor's patent right, reputation right, marriage, divorce, bring up, receiving Support, foster, legacy, succession, testament, bequeath, agreement for legacy in return for support, real estate, house charter in one or more.
4. Legal ontology knowledge base method for auto constructing according to claim 2, it is characterised in that the criminal justice text Book specialist high frequency word includes:It is criminal, criminal justice, criminal case, crime, important cases, economic crime, juvenile delinquency, criminal Analogize, criminal responsibility, object of crime, subject of crime, criminal capacity, age for criminal responsibility, body corporate of crime, self-defence, Urgent danger prevention, completed offence, crime in preparation, attempted crime, discontinuation of a crime, joint crime, criminal group, punishment, control, arrest Labour, fixed-term imprisonment, life imprisonment, death penalty, fine, deprivation of political rights, confiscate property, deport, the measurement of penalty, recidivist, confess one's crime, Concurrence of offences, sentence calculation, prison term folding are supported, reprieved, reducing a penalty, paroling, the limitation of prosecution, absolution, anti-revolutionary, tissue are escaped from prison, spy, Spy, set fire, explode, poisoning, endangering public security, destroy the vehicles, destruction transit equipment, destruction communication apparatus, destruction Power equipment, traffic accident, serious accident involving serious consequences, smuggle, engage in speculation and profiteering, escape set foreign exchange, counterfeit currency, counterfeit valuable securities, Forge value warrant, tax dodging, tax refusal, counterfeit trademark, Holiday culture, unlawful cutting trees, excessive cutting and felling of trees, intentional murder, involuntary homicide, Intentional injury, fault severe injury, prostitution and whoring, extort a confession by torture, falsely charge, making an indecent assault on women, fornicating with an underage girl, prostitution, kidnap and sell people, Kidnapping women and children, trafficking in women and children, illegal detention, bag job, insult, calumny, revenge frame, false wittness, 
invade communication Freely, destruction election, property tort, robbery, grab, blackmail, stealing, hardened thief, swindle, hardened swindler, corrupt, appropriate public funds for personal use, Kidnapping, interference with state functions, refuse to implement judgement and decide, upset the public, rogue, run away, shelter, shielding, manufacturing and peddle Counterfeit drug, swindle by false pretences, gamble, obscene goods, drugs, receive stolen goods, dispose of stolen goods, destroy rare cultural relics, cross state border, crime of insulting the national flag or the national emblem state illegally Emblem, impart criminal method, harm marriage and family, bigamy, destruction soldier marriage, maltreat, abandon, swindle children, malfeasance, accept bribes, Bribe, bribery-pander, leakage state secret, neglect one's duties, bend the law for the benefit of relatives or friends, criminal is put in private, destruction post and telecommunications, soldier violate duty One or more in duty, drug smuggling, drug trafficking, transport drugs.
5. Legal ontology knowledge base method for auto constructing according to claim 2, it is characterised in that the administrative trial text Book specialist high frequency word includes:Administrative trial, administrative case, state administrative organs, national public servants, administrative behavior, administrative responsibility, Disciplinary sanction, administrative penalty, disciplinary management, administrative compensation, administrative reexamination, public security, customs, commodity inspection, land management, geology ore deposit Production, the energy, administrative execution, traffic are raided, wild fauna and flora protection, family planning, fishery, salt industry, water resource, salt political affairs, OK Political affairs compulsory measure, Lu Zheng, technical supervision, patent, herding, housing demolition, river course, post and telecommunications, science and technology, traffic, health, medicine, ring Border protection, industry, economy and trade, agricultural, forestry, culture, education, statistics, physical culture, civil administration, urban planning, town and country construction, metering, thing Valency, industry and commerce, work, historical relic, finance, audit, the tax, water conservancy, business administration, railway, civil aviaton, occurrences in human life, journalism, broadcast shadow Depending on, travel, it is meteorological in one or more.
6. The method for automatically constructing a legal ontology knowledge base according to claim 1, characterized in that removing stop words from the legal trial documents and performing word segmentation on them according to the built legal-domain vocabulary includes:
Customizing a stop-word dictionary and a word segmentation dictionary using the built legal-domain vocabulary;
Removing stop words from the legal trial documents and performing word segmentation on them with the Java Chinese word segmenter Ansj, according to the customized stop-word dictionary and word segmentation dictionary.
7. The method for automatically constructing a legal ontology knowledge base according to claim 1, characterized in that iteratively classifying the legal trial document data on the basis of the segmented legal trial document data, extracting the topic words of each category in each round of classification, and arranging the topic words of each category according to the order of the classification levels to obtain the hierarchical relationships between the topic words, includes:
Using the segmented legal trial document data, applying the LDA topic model to classify the legal trial documents by topic, and extracting the topic words of each category;
Continuing to apply the LDA topic model to the legal trial documents under each category to classify them by topic, extracting the topic words of each category, and arranging the topic words of each category according to the order of the classification levels, until a preset iteration termination condition is met, whereupon the iteration is stopped.
8. The method for automatically constructing a legal ontology knowledge base according to claim 7, characterized in that applying the LDA topic model to classify the legal trial documents by topic and extracting the topic words of each category includes:
Selecting a topic for each legal trial document according to the maximum topic probability;
Classifying the legal trial documents by topic according to the selected topics, and obtaining the topic words of each category from the LDA topic model.
9. The method for automatically constructing a legal ontology knowledge base according to claim 7, characterized in that stopping the iteration once the preset iteration termination condition is met includes:
In the process of iteratively applying the LDA topic model to classify by topic and extract topic words, judging whether, in the topic classification result, a preset proportion of the data has a maximum topic probability less than a preset threshold;
If so, determining that classification by topic cannot continue, and stopping the iteration.
10. The method for automatically constructing a legal ontology knowledge base according to claim 1, characterized in that building the legal ontology knowledge base according to the extracted topic words and the hierarchical relationships obtained between the topic words includes:
Taking the extracted topic words as the ontologies of the legal ontology knowledge base and the hierarchical relationships obtained between the topic words as the sub-parent relations of the legal ontology knowledge base, forming triples of the structure ontology-sub-parent relation-ontology, and thereby completing the construction of the legal ontology knowledge base.
CN201710248747.2A 2017-04-17 2017-04-17 A kind of Legal ontology knowledge base method for auto constructing Pending CN107133283A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201710248747.2A CN107133283A (en) 2017-04-17 2017-04-17 A kind of Legal ontology knowledge base method for auto constructing

Publications (1)

Publication Number Publication Date
CN107133283A true CN107133283A (en) 2017-09-05

Family

ID=59715813

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201710248747.2A Pending CN107133283A (en) 2017-04-17 2017-04-17 A kind of Legal ontology knowledge base method for auto constructing

Country Status (1)

Country Link
CN (1) CN107133283A (en)

Cited By (12)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107679226A (en) * 2017-10-23 2018-02-09 中国科学院重庆绿色智能技术研究院 Tourism body constructing method based on theme
CN107918824A (en) * 2017-11-02 2018-04-17 中交第二公路工程局有限公司 A kind of highway engineering construction Norm Measure method
CN108280149A (en) * 2018-01-04 2018-07-13 东南大学 A kind of doctor-patient dispute class case recommendation method based on various dimensions tag along sort
CN108304488A (en) * 2018-01-04 2018-07-20 上海电机学院 A method of utilizing the automatic study ontology of Topic Model
CN108563630A (en) * 2018-03-21 2018-09-21 上海蔚界信息科技有限公司 A kind of construction method of text analyzing knowledge base
CN109614606A (en) * 2018-10-23 2019-04-12 中山大学 Long article this case fine range classification prediction technique and device based on document insertion
CN110046262A (en) * 2019-06-10 2019-07-23 南京擎盾信息科技有限公司 A kind of Context Reasoning method based on law expert's knowledge base
CN110119473A (en) * 2019-05-23 2019-08-13 北京金山数字娱乐科技有限公司 A kind of construction method and device of file destination knowledge mapping
CN110334337A (en) * 2019-04-24 2019-10-15 北京科技大学 A kind of short phrase picking method and system based on Chinese medical book document
CN110795932A (en) * 2019-09-30 2020-02-14 中国地质大学(武汉) Geological report text information extraction method based on geological ontology
CN110895703A (en) * 2018-09-12 2020-03-20 北京国双科技有限公司 Legal document routing identification method and device
WO2021002800A1 (en) * 2019-07-01 2021-01-07 Intelllex Holdings Private Limited Apparatus and method for tagging electronic legal documents for classification and retrieval

Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6502081B1 (en) * 1999-08-06 2002-12-31 Lexis Nexis System and method for classifying legal concepts using legal topic scheme
CN102402599A (en) * 2011-11-17 2012-04-04 天津大学 Dynamic maintenance system for large-scale semantic knowledge base
CN103049532A (en) * 2012-12-21 2013-04-17 东莞中国科学院云计算产业技术创新与育成中心 Method for creating knowledge base engine on basis of sudden event emergency management and method for inquiring knowledge base engine
CN103324700A (en) * 2013-06-08 2013-09-25 同济大学 Noumenon concept attribute learning method based on Web information
CN103412917A (en) * 2013-08-08 2013-11-27 广西大学 Extensible database system and management method for coordinated management of data in multi-type field
CN105893551A (en) * 2016-03-31 2016-08-24 上海智臻智能网络科技股份有限公司 Method and device for processing data and knowledge graph
CN106407208A (en) * 2015-07-29 2017-02-15 清华大学 Establishment method and system for city management ontology knowledge base

Patent Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6502081B1 (en) * 1999-08-06 2002-12-31 Lexis Nexis System and method for classifying legal concepts using legal topic scheme
CN102402599A (en) * 2011-11-17 2012-04-04 天津大学 Dynamic maintenance system for large-scale semantic knowledge base
CN103049532A (en) * 2012-12-21 2013-04-17 东莞中国科学院云计算产业技术创新与育成中心 Method for creating knowledge base engine on basis of sudden event emergency management and method for inquiring knowledge base engine
CN103324700A (en) * 2013-06-08 2013-09-25 同济大学 Noumenon concept attribute learning method based on Web information
CN103412917A (en) * 2013-08-08 2013-11-27 广西大学 Extensible database system and management method for coordinated management of data in multi-type field
CN106407208A (en) * 2015-07-29 2017-02-15 清华大学 Establishment method and system for city management ontology knowledge base
CN105893551A (en) * 2016-03-31 2016-08-24 上海智臻智能网络科技股份有限公司 Method and device for processing data and knowledge graph

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
甘绍宁: "《专利文献研究2016》", 30 September 2016, 知识产权出版社 *

Cited By (16)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107679226B (en) * 2017-10-23 2019-03-05 中国科学院重庆绿色智能技术研究院 Tourism body constructing method based on theme
CN107679226A (en) * 2017-10-23 2018-02-09 中国科学院重庆绿色智能技术研究院 Tourism body constructing method based on theme
CN107918824B (en) * 2017-11-02 2022-03-08 中交第二公路工程局有限公司 Method for determining construction quota of highway engineering
CN107918824A (en) * 2017-11-02 2018-04-17 中交第二公路工程局有限公司 A kind of highway engineering construction Norm Measure method
CN108280149A (en) * 2018-01-04 2018-07-13 东南大学 A kind of doctor-patient dispute class case recommendation method based on various dimensions tag along sort
CN108304488A (en) * 2018-01-04 2018-07-20 上海电机学院 A method of utilizing the automatic study ontology of Topic Model
CN108563630A (en) * 2018-03-21 2018-09-21 上海蔚界信息科技有限公司 A kind of construction method of text analyzing knowledge base
CN110895703B (en) * 2018-09-12 2023-05-23 北京国双科技有限公司 Legal document case recognition method and device
CN110895703A (en) * 2018-09-12 2020-03-20 北京国双科技有限公司 Legal document routing identification method and device
CN109614606A (en) * 2018-10-23 2019-04-12 中山大学 Long article this case fine range classification prediction technique and device based on document insertion
CN109614606B (en) * 2018-10-23 2023-02-03 中山大学 Document embedding-based long text case penalty range classification prediction method and device
CN110334337A (en) * 2019-04-24 2019-10-15 北京科技大学 A kind of short phrase picking method and system based on Chinese medical book document
CN110119473A (en) * 2019-05-23 2019-08-13 北京金山数字娱乐科技有限公司 A kind of construction method and device of file destination knowledge mapping
CN110046262A (en) * 2019-06-10 2019-07-23 南京擎盾信息科技有限公司 A kind of Context Reasoning method based on law expert's knowledge base
WO2021002800A1 (en) * 2019-07-01 2021-01-07 Intelllex Holdings Private Limited Apparatus and method for tagging electronic legal documents for classification and retrieval
CN110795932A (en) * 2019-09-30 2020-02-14 中国地质大学(武汉) Geological report text information extraction method based on geological ontology

Similar Documents

Publication Publication Date Title
CN107133283A (en) A method for automatically constructing a legal ontology knowledge base
Coleman et al. Surveillance and crime
CN107103087A (en) Blockchain big data market analysis system
CN113011185A (en) Legal field text analysis and identification method, system, storage medium and terminal
Khalyubi et al. Electoral manipulation informationally on hoax production in 2019 presidential and vice presidential election in Indonesia
Trikoz et al. Russian experience of using digital technologies and legal risks of AI
Sahramäki et al. Wasting opportunities: prevention of illicit cross-border waste trafficking
Joutsen The European Union and cooperation in criminal matters: the search for balance
Wood et al. Analysing the multiple dimensions of predictive policing’s techno-social harms
Ugwu Forensic accounting and fraud control in Nigeria: A critical review
Pohoretskyi et al. Detection and proof of cybercrime
Roddy The federal computer systems Protection Act
Zhu et al. Construction and application of knowledge-base in telecom fraud domain
Steel The harms and wrongs of stealing: the harm principle and dishonesty in theft.
Child et al. Criminal Law Reform Now: Proposals & Critique
Leighton-Daly Identity Theft and Tax Crime: Has Technology Made It Easier to Defraud the Revenue
Dewey The Characteristics of Illegal Markets
Demydova et al. Intellectual property: search of the optimum model of legal protection
Bak et al. Application of an ontology-based and rule-based model to selected economic crimes: fraudulent disbursement and money laundering
Genosko et al. Administrative surveillance of alcohol consumption in Ontario, Canada: pre electronic technologies of control
Isibor Using Blockchain Technology to Curb Voter Fraud in Nigeria: Prospects and Challenges
Glynn Computer Abuse: The Emerging Crime and the Need for Legislation
Bak et al. Application of an ontology-based model to a selected fraudulent disbursement economic crime
Liao Review of Big Data Evidence in Criminal Proceedings: Basis of Academic Theory, Practical Pattern and Mode Selection
Li Roles of information systems in socio-legal context.

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication

Application publication date: 20170905