CN109614453A - A kind of data storage, querying method and the device of regulatory information - Google Patents

A kind of data storage, querying method and the device of regulatory information Download PDF

Info

Publication number
CN109614453A
CN109614453A CN201811533428.7A CN201811533428A CN109614453A CN 109614453 A CN109614453 A CN 109614453A CN 201811533428 A CN201811533428 A CN 201811533428A CN 109614453 A CN109614453 A CN 109614453A
Authority
CN
China
Prior art keywords
data
regulation
fine
regulatory information
search engine
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201811533428.7A
Other languages
Chinese (zh)
Inventor
孙海波
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Hangzhou Faxun Information Technology Co Ltd
Original Assignee
Hangzhou Faxun Information Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Hangzhou Faxun Information Technology Co Ltd filed Critical Hangzhou Faxun Information Technology Co Ltd
Priority to CN201811533428.7A priority Critical patent/CN109614453A/en
Publication of CN109614453A publication Critical patent/CN109614453A/en
Pending legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/10Text processing
    • G06F40/166Editing, e.g. inserting or deleting
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q50/00Information and communication technology [ICT] specially adapted for implementation of business processes of specific business sectors, e.g. utilities or tourism
    • G06Q50/10Services
    • G06Q50/18Legal services

Landscapes

  • Engineering & Computer Science (AREA)
  • Business, Economics & Management (AREA)
  • Theoretical Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Tourism & Hospitality (AREA)
  • Health & Medical Sciences (AREA)
  • General Health & Medical Sciences (AREA)
  • Physics & Mathematics (AREA)
  • Technology Law (AREA)
  • General Engineering & Computer Science (AREA)
  • Computational Linguistics (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Artificial Intelligence (AREA)
  • Economics (AREA)
  • Human Resources & Organizations (AREA)
  • Marketing (AREA)
  • Primary Health Care (AREA)
  • Strategic Management (AREA)
  • General Business, Economics & Management (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

The present invention provides a kind of date storage methods of regulatory information, comprising: regulation data of the acquisition about regulatory information;According to preset fractionation rule, the regulation data are split, fine data is obtained;According to preset matching rule, the fine data is matched, forms relationship maps table;The relationship maps table and the fine data are stored to database.Compared to the prior art, the present invention can rapidly establish regulation, case, the incidence relation between attachment, and be stored, and various dimensions, multivariant query function can be provided for user.

Description

A kind of data storage, querying method and the device of regulatory information
Technical field
The present invention relates to data storage and inquiring technology fields, and in particular to a kind of data storage, the inquiry of regulatory information Method and device.
Background technique
Since existing promulgation regulation functional department and functional department, regulatory agency are numerous, each independent functional department is being sent out Cloth relevant laws and regulations regulation or punishment case Shi Douhui are carried out in respective official website, and the content mark of different functional department's publications It is quasi- also different.And the complicated relationship that many regulations can have while promulgating with other regulations, in such relationship It may be related to the revision of other regulations or abrogate, and user is only capable of focusing in regulation itself when browsing a regulation Hold, none very easily inquires entrance to all other regulation users associated therewith, or even other regulations clause with Content user is not known without exception, does not know where inquire yet.
The form for punishing case is even more varied, and wherein much also only very simple reference is different for punishing justification item The different clauses of regulation, also without a convenient inquiry entrance.These some outstanding problems encountered when being all browsing.
It with the continuous revision of regulation, update, is substituted, regulation promulgates the change of organization names, and nonstandard abbreviation is answered With the height of regulation name is similar, and code inconvenience memory etc., these problems all give subsequent inquiry bring greatly inconvenient, most Whole result can not find desired result or have found the time cost also paid at double.
Summary of the invention
For the defects in the prior art, the present invention provides data storage, querying method and the device of a kind of regulatory information, Various dimensions, multivariant query function can be provided for user.
In a first aspect, the present invention provides a kind of date storage methods of regulatory information, comprising:
Acquire the regulation data about regulatory information;
According to preset fractionation rule, the regulation data are split, fine data is obtained;
According to preset matching rule, the fine data is matched, forms relationship maps table;
The relationship maps table and the fine data are stored to database.
Optionally, it after the regulation data of the acquisition about regulatory information the step of, is torn open according to preset Before the step of divider then, splits the regulation data, obtains fine data, further includes:
Verify the correctness of the regulation data;If correct, execute it is described regular according to preset fractionations, to institute The step of stating regulation data to be split, obtaining fine data;If mistake, the regulation data are modified, then executes It is described regular according to preset fractionation, the step of being split to the regulation data, obtain fine data.
Optionally, described that the regulation data are split according to preset fractionation rule, obtain fine number According to, comprising:
The regulation data are split according to natural paragraph;
According to the keyword in natural paragraph, new logic paragraph is established, obtains fine data.
Optionally, described that the fine data is matched according to preset matching rule, form relationship maps Table, comprising:
To in the fine data regulation and logical segment drop into line flag, obtain regulation ID and paragraph ID;
According to preset matching rule, the fine data is matched, is obtained between the fine data Incidence relation;
The incidence relation is indicated using the regulation ID and paragraph ID, forms relationship maps table.
Optionally, the incidence relation, comprising: regulation and regulation, regulation and logic paragraph and logic paragraph and logical segment Incidence relation between falling.
Optionally, incidence relation of the incidence relation between regulation and regulation;
It is described that the fine data is matched according to preset matching rule, it obtains between regulation and regulation Incidence relation, comprising:
In the fine data, according to the regulation name complete match of regulation, if successful match, using matching result as Incidence relation between regulation and regulation;
If it fails to match, matched according to the content of punctuation marks used to enclose the title in regulation name, if successful match, by matching result As the incidence relation between regulation and regulation;
If it fails to match, removing in regulation name and ignore word, the regulation name after word is ignored according to removal is matched, if Successful match, then using matching result as the incidence relation between regulation and regulation;If it fails to match, do not have between corresponding regulation It is relevant.
Second aspect, the present invention provides a kind of data storage devices of regulatory information, comprising:
Data acquisition module, for acquiring the regulation data about regulatory information;
Data split module, for splitting to the regulation data, obtaining essence according to preset fractionation rule Count evidence accurately;
Data match module is formed and is closed for being matched to the fine data according to preset matching rule Join mapping table;
Data memory module, for storing the relationship maps table and the fine data to database.
The third aspect, the present invention provides a kind of data query methods of regulatory information, comprising:
By the relationship maps table and the fine data that are stored using a kind of date storage method of regulatory information synchronize deposit It stores up to search engine database;
Establish query search engine;
Obtain the screening conditions of user's input;
Corresponding pass is transferred from described search engine database using described search engine according to the screening conditions Join data;
The associated data is exported to user.
It is optionally, described to establish query search engine, comprising:
Obtain the keyword and synonym of regulatory information;
The keyword and the synonym are segmented;
According to word segmentation result, it is based on elasticsearch distributed search engine, establishes query search engine.
Fourth aspect, the present invention provide a kind of data query device of regulatory information, comprising:
Data simultaneous module, for by relationship maps table and fine data synchronize store to search engine database;
Engine establishes module, for establishing query search engine;
Condition obtains module, for obtaining the screening conditions of user's input;
Data transfer module, are used for according to the screening conditions, using described search engine, from described search engine data Corresponding associated data is transferred in library;
Data outputting module, for exporting the associated data to user.
A kind of date storage method of regulatory information provided by the invention, comprising: regulation number of the acquisition about regulatory information According to;According to preset fractionation rule, the regulation data are split, fine data is obtained;According to preset Matching rule matches the fine data, forms relationship maps table;By the relationship maps table and the fine data It stores to database.Compared to the prior art, the present invention can rapidly establish regulation, case, the incidence relation between attachment, And stored, various dimensions, multivariant query function can be provided for user.
The data storage device of a kind of regulatory information provided by the invention, with a kind of above-mentioned data storage side of regulatory information Method is for identical inventive concept, beneficial effect having the same.
The data query method of a kind of regulatory information provided by the invention, comprising: will be using a kind of data of regulatory information Relationship maps table and the fine data of storage method storage synchronize store to search engine database;Query search is established to draw It holds up;Obtain the screening conditions of user's input;According to the screening conditions, using described search engine, from described search engine number According to transferring corresponding associated data in library;The associated data is exported to user.In this way, user can rapidly, it is quasi- Associated regulation data really are inquired, and then can be improved user experience.
The data query device of a kind of regulatory information provided by the invention, with a kind of above-mentioned data query side of regulatory information Method is for identical inventive concept, beneficial effect having the same.
Detailed description of the invention
It, below will be to specific in order to illustrate more clearly of the specific embodiment of the invention or technical solution in the prior art Embodiment or attached drawing needed to be used in the description of the prior art are briefly described.In all the appended drawings, similar element Or part is generally identified by similar appended drawing reference.In attached drawing, each element or part might not be drawn according to actual ratio.
Fig. 1 is a kind of flow chart of the date storage method of regulatory information provided in an embodiment of the present invention;
Fig. 2 is a kind of schematic diagram of the data query device of regulatory information provided in an embodiment of the present invention;
Fig. 3 is a kind of flow chart of the date storage method of regulatory information provided in an embodiment of the present invention;
Fig. 4 is a kind of schematic diagram of the data query device of regulatory information provided in an embodiment of the present invention.
Specific embodiment
It is described in detail below in conjunction with embodiment of the attached drawing to technical solution of the present invention.Following embodiment is only used for Clearly illustrate technical solution of the present invention, therefore be intended only as example, and cannot be used as a limitation and limit protection of the invention Range.
It should be noted that unless otherwise indicated, technical term or scientific term used in this application should be this hair The ordinary meaning that bright one of ordinary skill in the art are understood.
The present invention provides a kind of storage of the data of regulatory information, querying method and devices.With reference to the accompanying drawing to this hair Bright embodiment is illustrated.
Referring to FIG. 1, Fig. 1 is a kind of process of the date storage method for regulatory information that the specific embodiment of the invention provides Figure, a kind of date storage method of regulatory information provided in this embodiment, comprising:
Step S101: regulation data of the acquisition about regulatory information.
Wherein, regulation data may include: regulation, case, attachment etc..
When acquiring regulation data, separate sources data can be acquired by professional, input system is sorted out after arrangement, is protected Demonstrate,prove the correctness and integrality of data.
Step S102: according to preset fractionation rule, the regulation data is split, fine data is obtained.
It can also include: the correctness for verifying the regulation data before being split to regulation data;If correct, Then execute it is described according to preset fractionation rule, the step of being split to the regulation data, obtain fine data;If Mistake is then modified the regulation data, then execute it is described according to preset fractionation rule, to the regulation data The step of being split, obtaining fine data.
When the correctness to regulation data is verified, can verify the following aspects: special field cannot be sky; Whether regulation timeliness is correct;Whether data repeat;Whether format standardizes.
(1) before the correctness of verifying regulation data, it is also necessary to standardize to regulation data.For example, " silver hair [2018] No. 296 ", become " silver hair (2018) 296 " after the bracket and space standardization in the code.
(2) fields such as title of code, text, promulgation time, promulgation mechanism, validity cannot be sky.
(3) when the entry-into-force time is later than current time, regulation validity should be " Pending The Entry Into Force ";Entry-into-force time is earlier than current time When, regulation validity should be " effective " or " revision ";Expiration time should be " failure " earlier than current time, regulation validity.
(4) judge whether regulation repeats by title of code and code combination.
It, can also be with if, can be by being artificially modified in amendment it was found that error in data, is modified data It is modified by smart machine, this is all within the scope of the present invention.
By being verified to data correctness, it can be ensured that correctness of the data in typing improves user experience.
After the correctness for having verified regulation data, using preset fractionation rule, the regulation data are torn open Point, obtain fine data, detailed process are as follows: split according to natural paragraph to the regulation data;According in natural paragraph Keyword, establish new logic paragraph, obtain fine data.
Regulation data are first dropped into capable fractionation by paragragh, then, if there are specific keyword, examples in natural paragraph Such as, multiple natural paragraphs between two keywords are merged into as a new logical segment by chapter n has, the N articles, N money etc. It falls, forms fine data.
Fall behind being split as new logical segment, can also be modified by manually being fallen to logical segment, guarantees logic paragraph Correctness.
Step S103: according to preset matching rule, matching the fine data, forms relationship maps Table.
On the basis of fine data, regulation and regulation, regulation and logic paragraph, logic paragraph and logical segment can establish Incidence relation between falling, to provide various dimensions, more fine-grained accurate inquiry for user.
Form the process of relationship maps table are as follows:
To in the fine data regulation and logical segment drop into line flag, obtain regulation ID and paragraph ID;According to preparatory The matching rule of setting matches the fine data, obtains the incidence relation between the fine data;Using described Regulation ID and paragraph ID indicates the incidence relation, forms relationship maps table.
Wherein, a regulation ID can correspond at least one paragraph ID in relation mapping table, and multiple regulation ID can also be right A paragraph ID is answered, this is all within the scope of the present invention.
When incidence relation of the incidence relation between regulation and regulation, matched using following matching rule:
In the fine data, according to the regulation name complete match of regulation, if successful match, using matching result as Incidence relation between regulation and regulation;If it fails to match, matched according to the content of punctuation marks used to enclose the title in regulation name, if matching Success, then using matching result as the incidence relation between regulation and regulation;If it fails to match, ignoring in regulation name is removed Word, the regulation name after word is ignored according to removal is matched, if successful match, using matching result as between regulation and regulation Incidence relation;If it fails to match, there is no incidence relation between corresponding regulation.
Before matching, it should remove additional character, guarantee accurately and efficiently matches.
Such as: 1) first press regulation name complete match.If miss is matched, into step 2;2) by the punctuation marks used to enclose the title in regulation name Matching, if regulation is entitled " about the notice for printing and distributing " actuarial report " ", then matches " the actuarial report " in punctuation marks used to enclose the title.If matching is not Hit, into step 3;3) it is matched again after " the ignoring word " of the hit of removal regulation, as regulation " refers to about efficiency credit is printed and distributed The notice drawn ", " about printing and distributing " therein and " notice " are to ignore word, and entitled " efficiency credit refers to effective regulation after removal Draw ";4) sequence according to 1), 2), 3) is successively matched, if by the rule of 1) step description in entire regulation text With unsuccessful, then matched again by 2) rule;Rule is 2) also without successful match, then again by 3) being matched;5) above-mentioned step Matching in rapid should all remove additional character in advance, such as drawing in " about the notice for printing and distributing " " Shandong blueness benchmark loan " detailed rules for the implementation " " Number.
When incidence relation of the incidence relation between regulation and logic paragraph, matching rule are as follows: in retrieval logic paragraph Whether it is related to regulation, if being related to, the ID of the logic paragraph and regulation ID is associated, forms incidence relation;If not relating to And then the logic paragraph and the regulation do not have incidence relation.
When incidence relation of the incidence relation between logic paragraph and logic paragraph, matching rule are as follows: retrieval two is patrolled It collects in paragraph and whether is related to identical regulation meaning, if being related to, the two logic paragraphs ID is associated, form association and close System;If not being related to, there is no incidence relation between the two logic paragraphs.
Step S104: the relationship maps table and the fine data are stored to database.
After relationship maps table and fine data are all formed, store to database.
It in the present invention, can also include acquiring real-time regulation data in real time, and according to real-time regulation data more new data Data in library guarantee the accuracy of data in database.
Collection and arrangement by using the present invention to regulation data can reduce the cost of labor of data inputting, and energy Enough more fully regulation data are provided for user.
Based on inventive concept identical with a kind of above-mentioned date storage method of regulatory information, corresponding, this hair Bright embodiment additionally provides a kind of data storage device of regulatory information, as shown in Figure 2.Due to Installation practice substantially it is similar with Embodiment of the method, so describing fairly simple, the relevent part can refer to the partial explaination of embodiments of method.
A kind of data storage device of regulatory information provided by the invention, comprising:
Data acquisition module 101, for acquiring the regulation data about regulatory information;
Data split module 102, for splitting, obtaining to the regulation data according to preset fractionation rule Obtain fine data;
Data match module 103, for being matched to the fine data, shape according to preset matching rule At relationship maps table;
Data memory module 104, for storing the relationship maps table and the fine data to database.
In a specific embodiment provided by the invention, described device, further includes:
Authentication module, for verifying the correctness of the regulation data;If correct, execute the data and split module 102 content;If mistake, the regulation data are modified, then execute the content that the data split module 102.
In a specific embodiment provided by the invention, the data split module 102, comprising:
Split cells, for being split according to natural paragraph to the regulation data;
New paragraph establishes unit, for establishing new logic paragraph according to the keyword in natural paragraph, obtains fine number According to.
In a specific embodiment provided by the invention, the data match module 103, comprising:
Marking unit, for in the fine data regulation and logical segment drop into line flag, obtain regulation ID and section Fall ID;
Matching unit, for being matched to the fine data, obtaining the essence according to preset matching rule Count the incidence relation between accurately;
It indicates unit, for indicating the incidence relation using the regulation ID and paragraph ID, forms relationship maps table.
In a specific embodiment provided by the invention, the incidence relation, comprising: regulation and regulation, regulation and patrol Collect the incidence relation between paragraph and logic paragraph and logic paragraph.
In a specific embodiment provided by the invention, the incidence relation is associated with pass between regulation and regulation System;
The matching unit, specifically includes:
In the fine data, according to the regulation name complete match of regulation, if successful match, using matching result as Incidence relation between regulation and regulation;
If it fails to match, matched according to the content of punctuation marks used to enclose the title in regulation name, if successful match, by matching result As the incidence relation between regulation and regulation;
If it fails to match, removing in regulation name and ignore word, the regulation name after word is ignored according to removal is matched, if Successful match, then using matching result as the incidence relation between regulation and regulation;If it fails to match, do not have between corresponding regulation It is relevant.
More than, it is a kind of data storage device of regulatory information provided by the invention.
Based on a kind of above-mentioned date storage method of regulatory information, the present invention also provides a kind of numbers of regulatory information accordingly According to querying method, referring to FIG. 3, Fig. 3 is a kind of data query method for regulatory information that the specific embodiment of the invention provides Flow chart.The data query method is used cooperatively with above-mentioned date storage method, and related place is referring to a kind of above-mentioned regulatory information Date storage method.
A kind of data query method of regulatory information provided by the invention, comprising:
Step S201: by storage relationship maps table in the database and the fine data synchronize store to search engine Database.
Had using the data that the date storage method of regulatory information stores in the database: relationship maps table and fine number According in order to guarantee the rapidity of data query, needing relationship maps table and fine data being synchronized to search in data query In engine database.In synchrodata, it can be synchronized and be serviced using multiple servers, improve the synchronous efficiency of data.
Step S202: query search engine is established.
When establishing query search engine, firstly, obtaining the keyword and synonym of regulatory information;Then, to the pass Keyword and the synonym are segmented;Finally, being based on elasticsearch distributed search engine according to word segmentation result, building Vertical query search engine.
Wherein, ElasticSearch is the search server based on Lucene.It is multi-purpose that it provides a distribution The full-text search engine of family ability is based on RESTful web interface.Elasticsearch is developed with Java, and conduct Open source code publication under Apache license terms, is Enterprise search engine currently popular.Designed in cloud computing, energy Enough reach real-time search, stablizes, it is reliably, quickly, easy to install and use.
Keyword may include: title, promulgate the time, promulgate the fields such as mechanism.Synonym refers to the synonym of keyword. By the way that synonym is arranged, it can be improved the range of search, keep search result more comprehensive.
Regulation is searched for other than traditional search based on elasticsearch distributed search engine+Chinese word segmentation, this The distinctive vocabulary in system combination financial supervision field has carried out depth customization, and synonym search is incorporated.Such as " stock supervisory committee ", " China Securities Regulatory Commission ", " China Securities Regulatory Commission " three are the same meaning in fact, then when user's search " card prison When meeting ", and " China Securities Regulatory Commission ", " China Securities Regulatory Commission " relevant data may also appear in search result In.
By carrying out Chinese word segmentation and index to data automatically, be stored in distributed search engine, with realize a large amount of regulations/ Quick, the accurate search of case data.
Chinese word segmentation refers to a chinese character sequence being cut into individual word one by one.Participle is exactly by continuous word sequence Column are reassembled into the process of word sequence according to certain specification.It is using space as certainly between word in the style of writing of English Right delimiter, and Chinese is that word, sentence and section can simply be demarcated by apparent delimiter, none form of word only On delimiter, although English is also the same, there are the partition problems of phrase, but on this layer of word, the English of Chinese ratio is answered It is much miscellaneous, much more difficult.
Regulation system segments Chinese using well-known IKAnalyzer, and the result after participle is with inverted index structure It is stored in elasticsearch distributed search engine.
Step S203: the screening conditions of user's input are obtained.
Wherein, screening conditions may include: one of fields such as title, promulgation time, promulgation mechanism or a variety of groups It closes.
Step S204: it is transferred from described search engine database according to the screening conditions using described search engine Corresponding associated data.
User is in foregrounding screening conditions, the data that precisely search needs.
Step S205: the associated data is exported to user.
After having searched for associated data, corresponding associated data is exported to user, user is made to check corresponding search result.
By using this way of search, the search engine is established, more comprehensive search result can be provided for user, And then provide user experience.
Based on inventive concept identical with a kind of above-mentioned data query method of regulatory information, corresponding, this hair Bright embodiment additionally provides a kind of data query device of regulatory information, as shown in Figure 4.Due to Installation practice substantially it is similar with Embodiment of the method, so describing fairly simple, the relevent part can refer to the partial explaination of embodiments of method.
A kind of data query device of regulatory information provided by the invention, comprising:
Data simultaneous module 201, for by relationship maps table and fine data synchronize store to search engine database;
Engine establishes module 202, for establishing query search engine;
Condition obtains module 203, for obtaining the screening conditions of user's input;
Data transfer module 204, are used for according to the screening conditions, using described search engine, from described search engine Corresponding associated data is transferred in database;
Data outputting module 205, for exporting the associated data to user.
In a specific embodiment provided by the invention, the engine establishes module 202, comprising:
Word acquiring unit, for obtaining the keyword and synonym of regulatory information;
Participle unit, for being segmented to the keyword and the synonym;
Engine establishes unit, and for being based on elasticsearch distributed search engine according to word segmentation result, foundation is looked into Ask search engine.
More than, it is a kind of data query device of regulatory information provided by the invention.
Those of ordinary skill in the art may be aware that list described in conjunction with the examples disclosed in the embodiments of the present disclosure Member and algorithm steps, can be realized with electronic hardware, computer software, or a combination of the two, in order to clearly demonstrate hardware With the interchangeability of software, each exemplary composition and step are generally described according to function in the above description.This A little functions are implemented in hardware or software actually, the specific application and design constraint depending on technical solution.Specially Industry technical staff can use different methods to achieve the described function each specific application, but this realization is not It is considered as beyond the scope of this invention.
In several embodiments provided herein, it should be understood that disclosed device and method can pass through it Its mode is realized.For example, the apparatus embodiments described above are merely exemplary, for example, the division of the unit, only Only a kind of logical function partition, there may be another division manner in actual implementation, such as multiple units or components can be tied Another system is closed or is desirably integrated into, or some features can be ignored or not executed.In addition, shown or discussed phase Mutually between coupling, direct-coupling or communication connection can be through some interfaces, the INDIRECT COUPLING or communication of device or unit Connection is also possible to electricity, mechanical or other form connections.
The unit as illustrated by the separation member may or may not be physically separated, aobvious as unit The component shown may or may not be physical unit, it can and it is in one place, or may be distributed over multiple In network unit.Some or all of unit therein can be selected to realize the embodiment of the present invention according to the actual needs Purpose.
It, can also be in addition, the functional units in various embodiments of the present invention may be integrated into one processing unit It is that each unit physically exists alone, is also possible to two or more units and is integrated in one unit.It is above-mentioned integrated Unit both can take the form of hardware realization, can also realize in the form of software functional units.
If the integrated unit is realized in the form of SFU software functional unit and sells or use as independent product When, it can store in a computer readable storage medium.Based on this understanding, technical solution of the present invention is substantially The all or part of the part that contributes to existing technology or the technical solution can be in the form of software products in other words It embodies, which is stored in a storage medium, including some instructions are used so that a computer Equipment (can be personal computer, server or the network equipment etc.) executes the complete of each embodiment the method for the present invention Portion or part steps.And storage medium above-mentioned includes: USB flash disk, mobile hard disk, read-only memory (ROM, Read-Only Memory), random access memory (RAM, Random Access Memory), magnetic or disk etc. are various can store journey The medium of sequence code.
The above description is merely a specific embodiment, but scope of protection of the present invention is not limited thereto, any Those familiar with the art in the technical scope disclosed by the present invention, can readily occur in various equivalent modifications or replace It changes, these modifications or substitutions should be covered by the protection scope of the present invention.Therefore, protection scope of the present invention should be with right It is required that protection scope subject to.

Claims (10)

1. a kind of date storage method of regulatory information characterized by comprising
Acquire the regulation data about regulatory information;
According to preset fractionation rule, the regulation data are split, fine data is obtained;
According to preset matching rule, the fine data is matched, forms relationship maps table;
The relationship maps table and the fine data are stored to database.
2. the method according to claim 1, wherein the step of the regulation data in the acquisition about regulatory information After rapid, according to preset fractionations rule, the regulation data are split, the step of acquisition fine data it Before, further includes:
Verify the correctness of the regulation data;If correct, execute it is described regular according to preset fractionations, to the method The step of rule data are split, obtain fine data;If mistake, the regulation data are modified, then are executed described According to preset fractionation rule, the step of being split to the regulation data, obtain fine data.
3. the method according to claim 1, wherein described according to preset fractionation rule, to the method Rule data are split, and fine data is obtained, comprising:
The regulation data are split according to natural paragraph;
According to the keyword in natural paragraph, new logic paragraph is established, obtains fine data.
4. according to the method described in claim 3, it is characterized in that, described according to preset matching rule, to the essence It counts accurately according to being matched, forms relationship maps table, comprising:
To in the fine data regulation and logical segment drop into line flag, obtain regulation ID and paragraph ID;
According to preset matching rule, the fine data is matched, obtains the association between the fine data Relationship;
The incidence relation is indicated using the regulation ID and paragraph ID, forms relationship maps table.
5. according to the method described in claim 4, it is characterized in that, the incidence relation, comprising: regulation and regulation, regulation with Incidence relation between logic paragraph and logic paragraph and logic paragraph.
6. according to the method described in claim 5, it is characterized in that, the incidence relation is associated with pass between regulation and regulation System;
It is described that the fine data is matched according to preset matching rule, obtain the pass between regulation and regulation Connection relationship, comprising:
In the fine data, according to the regulation name complete match of regulation, if successful match, using matching result as regulation Incidence relation between regulation;
If it fails to match, matched according to the content of punctuation marks used to enclose the title in regulation name, if successful match, using matching result as Incidence relation between regulation and regulation;
If it fails to match, remove in regulation name and ignore word, the regulation name after word is ignored according to removal is matched, if matching Success, then using matching result as the incidence relation between regulation and regulation;If it fails to match, do not closed between corresponding regulation Connection relationship.
7. a kind of data storage device of regulatory information characterized by comprising
Data acquisition module, for acquiring the regulation data about regulatory information;
Data split module, for being split to the regulation data according to preset fractionation rule, obtain fine number According to;
Data match module forms association and reflects for being matched to the fine data according to preset matching rule Firing table;
Data memory module, for storing the relationship maps table and the fine data to database.
8. a kind of data query method of regulatory information characterized by comprising
By the relationship maps table of the date storage method storage using regulatory information described in claim 1 to 6 any one and institute It states fine data and synchronizes and store to search engine database;
Establish query search engine;
Obtain the screening conditions of user's input;
Corresponding incidence number is transferred from described search engine database using described search engine according to the screening conditions According to;
The associated data is exported to user.
9. according to the method described in claim 8, it is characterized in that, described establish query search engine, comprising:
Obtain the keyword and synonym of regulatory information;
The keyword and the synonym are segmented;
According to word segmentation result, it is based on elasticsearch distributed search engine, establishes query search engine.
10. a kind of data query device of regulatory information characterized by comprising
Data simultaneous module, for by relationship maps table and fine data synchronize store to search engine database;
Engine establishes module, for establishing query search engine;
Condition obtains module, for obtaining the screening conditions of user's input;
Data transfer module, are used for according to the screening conditions, using described search engine, from described search engine database Transfer corresponding associated data;
Data outputting module, for exporting the associated data to user.
CN201811533428.7A 2018-12-14 2018-12-14 A kind of data storage, querying method and the device of regulatory information Pending CN109614453A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201811533428.7A CN109614453A (en) 2018-12-14 2018-12-14 A kind of data storage, querying method and the device of regulatory information

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201811533428.7A CN109614453A (en) 2018-12-14 2018-12-14 A kind of data storage, querying method and the device of regulatory information

Publications (1)

Publication Number Publication Date
CN109614453A true CN109614453A (en) 2019-04-12

Family

ID=66008582

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201811533428.7A Pending CN109614453A (en) 2018-12-14 2018-12-14 A kind of data storage, querying method and the device of regulatory information

Country Status (1)

Country Link
CN (1) CN109614453A (en)

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110442590A (en) * 2019-08-06 2019-11-12 北京三维天地科技有限公司 It is a kind of for provide examine detection service system and method
CN110737839A (en) * 2019-10-22 2020-01-31 京东数字科技控股有限公司 Short text recommendation method, device, medium and electronic equipment
CN112199466A (en) * 2020-09-08 2021-01-08 深圳价值在线信息科技股份有限公司 Method and device for identifying associated regulation of mail
CN117743390A (en) * 2024-02-20 2024-03-22 证通股份有限公司 Query method and system for financial information and storage medium

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8126884B1 (en) * 1997-01-10 2012-02-28 The Board Of Trustees Of The Leland Stanford Junior University Scoring documents in a linked database
CN104008171A (en) * 2014-06-03 2014-08-27 中国科学院计算技术研究所 Legal database establishing method and legal retrieving service method
CN106815256A (en) * 2015-12-01 2017-06-09 北京国双科技有限公司 Set up the method and device of laws and regulations bar fund incidence relation
CN108132941A (en) * 2016-11-30 2018-06-08 北京国双科技有限公司 The treating method and apparatus of the incidence relation of juristic writing

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8126884B1 (en) * 1997-01-10 2012-02-28 The Board Of Trustees Of The Leland Stanford Junior University Scoring documents in a linked database
CN104008171A (en) * 2014-06-03 2014-08-27 中国科学院计算技术研究所 Legal database establishing method and legal retrieving service method
CN106815256A (en) * 2015-12-01 2017-06-09 北京国双科技有限公司 Set up the method and device of laws and regulations bar fund incidence relation
CN108132941A (en) * 2016-11-30 2018-06-08 北京国双科技有限公司 The treating method and apparatus of the incidence relation of juristic writing

Cited By (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110442590A (en) * 2019-08-06 2019-11-12 北京三维天地科技有限公司 It is a kind of for provide examine detection service system and method
CN110737839A (en) * 2019-10-22 2020-01-31 京东数字科技控股有限公司 Short text recommendation method, device, medium and electronic equipment
CN112199466A (en) * 2020-09-08 2021-01-08 深圳价值在线信息科技股份有限公司 Method and device for identifying associated regulation of mail
CN112199466B (en) * 2020-09-08 2024-04-12 深圳价值在线信息科技股份有限公司 Method and device for identifying associated rule of mail
CN117743390A (en) * 2024-02-20 2024-03-22 证通股份有限公司 Query method and system for financial information and storage medium
CN117743390B (en) * 2024-02-20 2024-05-28 证通股份有限公司 Query method and system for financial information and storage medium

Similar Documents

Publication Publication Date Title
CN109614453A (en) A kind of data storage, querying method and the device of regulatory information
CN103488648B (en) A kind of multilingual mixed index method and system
US8135717B2 (en) Processor for fast contextual matching
CN103020293B (en) A kind of construction method and system of the ontology library of mobile application
Kuzey et al. Extraction of temporal facts and events from Wikipedia
Hadni et al. A new and efficient stemming technique for Arabic Text Categorization
CN103678576A (en) Full-text retrieval system based on dynamic semantic analysis
CN109376202B (en) NLP-based enterprise supply relationship automatic extraction and analysis method
Bjarnadóttir The database of modern Icelandic inflection (Beygingarlýsing íslensks nútímamáls)
CN107357777B (en) Method and device for extracting label information
CN107844493B (en) File association method and system
Strötgen et al. An event-centric model for multilingual document similarity
CN103186556A (en) Method for obtaining and searching structural semantic knowledge and corresponding device
CN110032622B (en) Keyword determination method, keyword determination device, keyword determination equipment and computer readable storage medium
CN112380848B (en) Text generation method, device, equipment and storage medium
Hassan et al. Improving named entity translation by exploiting comparable and parallel corpora
CN111222028B (en) Intelligent data crawling method
Renouf et al. Filling the gaps: Using the WebCorp Linguist’s Search Engine to supplement existing text resources
CN112667866A (en) Test paper generation method and device, electronic equipment and storage medium
CN109885641A (en) A kind of method and system of database Chinese Full Text Retrieval
CN111078839A (en) Structured processing method and processing device for referee document
Sembok et al. Arabic word stemming algorithms and retrieval effectiveness
CN102117285A (en) Search method based on semantic indexing
Abdurakhmonova Formal-Functional Models of The Uzbek Electron Corpus
CN113591476A (en) Data label recommendation method based on machine learning

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination