CN109614453A - A kind of data storage, querying method and the device of regulatory information - Google Patents
A kind of data storage, querying method and the device of regulatory information Download PDFInfo
- Publication number
- CN109614453A CN109614453A CN201811533428.7A CN201811533428A CN109614453A CN 109614453 A CN109614453 A CN 109614453A CN 201811533428 A CN201811533428 A CN 201811533428A CN 109614453 A CN109614453 A CN 109614453A
- Authority
- CN
- China
- Prior art keywords
- data
- regulation
- fine
- regulatory information
- search engine
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
- 230000001105 regulatory effect Effects 0.000 title claims abstract description 62
- 238000000034 method Methods 0.000 title claims abstract description 51
- 238000013500 data storage Methods 0.000 title claims description 12
- 238000005194 fractionation Methods 0.000 claims abstract description 19
- 238000012216 screening Methods 0.000 claims description 16
- 230000011218 segmentation Effects 0.000 claims description 7
- 238000012546 transfer Methods 0.000 claims description 4
- 238000010304 firing Methods 0.000 claims 1
- 230000006870 function Effects 0.000 description 6
- 230000008569 process Effects 0.000 description 4
- 238000010168 coupling process Methods 0.000 description 3
- 238000005859 coupling reaction Methods 0.000 description 3
- 230000007246 mechanism Effects 0.000 description 3
- 238000007639 printing Methods 0.000 description 3
- 230000001360 synchronised effect Effects 0.000 description 3
- BQCADISMDOOEFD-UHFFFAOYSA-N Silver Chemical compound [Ag] BQCADISMDOOEFD-UHFFFAOYSA-N 0.000 description 2
- 230000009286 beneficial effect Effects 0.000 description 2
- 238000004891 communication Methods 0.000 description 2
- 230000008878 coupling Effects 0.000 description 2
- 238000010586 diagram Methods 0.000 description 2
- 238000005516 engineering process Methods 0.000 description 2
- 238000009434 installation Methods 0.000 description 2
- 238000013507 mapping Methods 0.000 description 2
- 238000012986 modification Methods 0.000 description 2
- 230000004048 modification Effects 0.000 description 2
- 238000005192 partition Methods 0.000 description 2
- 229910052709 silver Inorganic materials 0.000 description 2
- 239000004332 silver Substances 0.000 description 2
- 230000008859 change Effects 0.000 description 1
- 230000007547 defect Effects 0.000 description 1
- 238000013461 design Methods 0.000 description 1
- 230000005611 electricity Effects 0.000 description 1
- 239000004744 fabric Substances 0.000 description 1
- 230000008520 organization Effects 0.000 description 1
- 238000012545 processing Methods 0.000 description 1
- 238000000926 separation method Methods 0.000 description 1
- 238000006467 substitution reaction Methods 0.000 description 1
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F40/00—Handling natural language data
- G06F40/10—Text processing
- G06F40/166—Editing, e.g. inserting or deleting
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06Q—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
- G06Q50/00—Information and communication technology [ICT] specially adapted for implementation of business processes of specific business sectors, e.g. utilities or tourism
- G06Q50/10—Services
- G06Q50/18—Legal services
Landscapes
- Engineering & Computer Science (AREA)
- Business, Economics & Management (AREA)
- Theoretical Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Tourism & Hospitality (AREA)
- Health & Medical Sciences (AREA)
- General Health & Medical Sciences (AREA)
- Physics & Mathematics (AREA)
- Technology Law (AREA)
- General Engineering & Computer Science (AREA)
- Computational Linguistics (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Artificial Intelligence (AREA)
- Economics (AREA)
- Human Resources & Organizations (AREA)
- Marketing (AREA)
- Primary Health Care (AREA)
- Strategic Management (AREA)
- General Business, Economics & Management (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
Abstract
The present invention provides a kind of date storage methods of regulatory information, comprising: regulation data of the acquisition about regulatory information;According to preset fractionation rule, the regulation data are split, fine data is obtained;According to preset matching rule, the fine data is matched, forms relationship maps table;The relationship maps table and the fine data are stored to database.Compared to the prior art, the present invention can rapidly establish regulation, case, the incidence relation between attachment, and be stored, and various dimensions, multivariant query function can be provided for user.
Description
Technical field
The present invention relates to data storage and inquiring technology fields, and in particular to a kind of data storage, the inquiry of regulatory information
Method and device.
Background technique
Since existing promulgation regulation functional department and functional department, regulatory agency are numerous, each independent functional department is being sent out
Cloth relevant laws and regulations regulation or punishment case Shi Douhui are carried out in respective official website, and the content mark of different functional department's publications
It is quasi- also different.And the complicated relationship that many regulations can have while promulgating with other regulations, in such relationship
It may be related to the revision of other regulations or abrogate, and user is only capable of focusing in regulation itself when browsing a regulation
Hold, none very easily inquires entrance to all other regulation users associated therewith, or even other regulations clause with
Content user is not known without exception, does not know where inquire yet.
The form for punishing case is even more varied, and wherein much also only very simple reference is different for punishing justification item
The different clauses of regulation, also without a convenient inquiry entrance.These some outstanding problems encountered when being all browsing.
It with the continuous revision of regulation, update, is substituted, regulation promulgates the change of organization names, and nonstandard abbreviation is answered
With the height of regulation name is similar, and code inconvenience memory etc., these problems all give subsequent inquiry bring greatly inconvenient, most
Whole result can not find desired result or have found the time cost also paid at double.
Summary of the invention
For the defects in the prior art, the present invention provides data storage, querying method and the device of a kind of regulatory information,
Various dimensions, multivariant query function can be provided for user.
In a first aspect, the present invention provides a kind of date storage methods of regulatory information, comprising:
Acquire the regulation data about regulatory information;
According to preset fractionation rule, the regulation data are split, fine data is obtained;
According to preset matching rule, the fine data is matched, forms relationship maps table;
The relationship maps table and the fine data are stored to database.
Optionally, it after the regulation data of the acquisition about regulatory information the step of, is torn open according to preset
Before the step of divider then, splits the regulation data, obtains fine data, further includes:
Verify the correctness of the regulation data;If correct, execute it is described regular according to preset fractionations, to institute
The step of stating regulation data to be split, obtaining fine data;If mistake, the regulation data are modified, then executes
It is described regular according to preset fractionation, the step of being split to the regulation data, obtain fine data.
Optionally, described that the regulation data are split according to preset fractionation rule, obtain fine number
According to, comprising:
The regulation data are split according to natural paragraph;
According to the keyword in natural paragraph, new logic paragraph is established, obtains fine data.
Optionally, described that the fine data is matched according to preset matching rule, form relationship maps
Table, comprising:
To in the fine data regulation and logical segment drop into line flag, obtain regulation ID and paragraph ID;
According to preset matching rule, the fine data is matched, is obtained between the fine data
Incidence relation;
The incidence relation is indicated using the regulation ID and paragraph ID, forms relationship maps table.
Optionally, the incidence relation, comprising: regulation and regulation, regulation and logic paragraph and logic paragraph and logical segment
Incidence relation between falling.
Optionally, incidence relation of the incidence relation between regulation and regulation;
It is described that the fine data is matched according to preset matching rule, it obtains between regulation and regulation
Incidence relation, comprising:
In the fine data, according to the regulation name complete match of regulation, if successful match, using matching result as
Incidence relation between regulation and regulation;
If it fails to match, matched according to the content of punctuation marks used to enclose the title in regulation name, if successful match, by matching result
As the incidence relation between regulation and regulation;
If it fails to match, removing in regulation name and ignore word, the regulation name after word is ignored according to removal is matched, if
Successful match, then using matching result as the incidence relation between regulation and regulation;If it fails to match, do not have between corresponding regulation
It is relevant.
Second aspect, the present invention provides a kind of data storage devices of regulatory information, comprising:
Data acquisition module, for acquiring the regulation data about regulatory information;
Data split module, for splitting to the regulation data, obtaining essence according to preset fractionation rule
Count evidence accurately;
Data match module is formed and is closed for being matched to the fine data according to preset matching rule
Join mapping table;
Data memory module, for storing the relationship maps table and the fine data to database.
The third aspect, the present invention provides a kind of data query methods of regulatory information, comprising:
By the relationship maps table and the fine data that are stored using a kind of date storage method of regulatory information synchronize deposit
It stores up to search engine database;
Establish query search engine;
Obtain the screening conditions of user's input;
Corresponding pass is transferred from described search engine database using described search engine according to the screening conditions
Join data;
The associated data is exported to user.
It is optionally, described to establish query search engine, comprising:
Obtain the keyword and synonym of regulatory information;
The keyword and the synonym are segmented;
According to word segmentation result, it is based on elasticsearch distributed search engine, establishes query search engine.
Fourth aspect, the present invention provide a kind of data query device of regulatory information, comprising:
Data simultaneous module, for by relationship maps table and fine data synchronize store to search engine database;
Engine establishes module, for establishing query search engine;
Condition obtains module, for obtaining the screening conditions of user's input;
Data transfer module, are used for according to the screening conditions, using described search engine, from described search engine data
Corresponding associated data is transferred in library;
Data outputting module, for exporting the associated data to user.
A kind of date storage method of regulatory information provided by the invention, comprising: regulation number of the acquisition about regulatory information
According to;According to preset fractionation rule, the regulation data are split, fine data is obtained;According to preset
Matching rule matches the fine data, forms relationship maps table;By the relationship maps table and the fine data
It stores to database.Compared to the prior art, the present invention can rapidly establish regulation, case, the incidence relation between attachment,
And stored, various dimensions, multivariant query function can be provided for user.
The data storage device of a kind of regulatory information provided by the invention, with a kind of above-mentioned data storage side of regulatory information
Method is for identical inventive concept, beneficial effect having the same.
The data query method of a kind of regulatory information provided by the invention, comprising: will be using a kind of data of regulatory information
Relationship maps table and the fine data of storage method storage synchronize store to search engine database;Query search is established to draw
It holds up;Obtain the screening conditions of user's input;According to the screening conditions, using described search engine, from described search engine number
According to transferring corresponding associated data in library;The associated data is exported to user.In this way, user can rapidly, it is quasi-
Associated regulation data really are inquired, and then can be improved user experience.
The data query device of a kind of regulatory information provided by the invention, with a kind of above-mentioned data query side of regulatory information
Method is for identical inventive concept, beneficial effect having the same.
Detailed description of the invention
It, below will be to specific in order to illustrate more clearly of the specific embodiment of the invention or technical solution in the prior art
Embodiment or attached drawing needed to be used in the description of the prior art are briefly described.In all the appended drawings, similar element
Or part is generally identified by similar appended drawing reference.In attached drawing, each element or part might not be drawn according to actual ratio.
Fig. 1 is a kind of flow chart of the date storage method of regulatory information provided in an embodiment of the present invention;
Fig. 2 is a kind of schematic diagram of the data query device of regulatory information provided in an embodiment of the present invention;
Fig. 3 is a kind of flow chart of the date storage method of regulatory information provided in an embodiment of the present invention;
Fig. 4 is a kind of schematic diagram of the data query device of regulatory information provided in an embodiment of the present invention.
Specific embodiment
It is described in detail below in conjunction with embodiment of the attached drawing to technical solution of the present invention.Following embodiment is only used for
Clearly illustrate technical solution of the present invention, therefore be intended only as example, and cannot be used as a limitation and limit protection of the invention
Range.
It should be noted that unless otherwise indicated, technical term or scientific term used in this application should be this hair
The ordinary meaning that bright one of ordinary skill in the art are understood.
The present invention provides a kind of storage of the data of regulatory information, querying method and devices.With reference to the accompanying drawing to this hair
Bright embodiment is illustrated.
Referring to FIG. 1, Fig. 1 is a kind of process of the date storage method for regulatory information that the specific embodiment of the invention provides
Figure, a kind of date storage method of regulatory information provided in this embodiment, comprising:
Step S101: regulation data of the acquisition about regulatory information.
Wherein, regulation data may include: regulation, case, attachment etc..
When acquiring regulation data, separate sources data can be acquired by professional, input system is sorted out after arrangement, is protected
Demonstrate,prove the correctness and integrality of data.
Step S102: according to preset fractionation rule, the regulation data is split, fine data is obtained.
It can also include: the correctness for verifying the regulation data before being split to regulation data;If correct,
Then execute it is described according to preset fractionation rule, the step of being split to the regulation data, obtain fine data;If
Mistake is then modified the regulation data, then execute it is described according to preset fractionation rule, to the regulation data
The step of being split, obtaining fine data.
When the correctness to regulation data is verified, can verify the following aspects: special field cannot be sky;
Whether regulation timeliness is correct;Whether data repeat;Whether format standardizes.
(1) before the correctness of verifying regulation data, it is also necessary to standardize to regulation data.For example, " silver hair
[2018] No. 296 ", become " silver hair (2018) 296 " after the bracket and space standardization in the code.
(2) fields such as title of code, text, promulgation time, promulgation mechanism, validity cannot be sky.
(3) when the entry-into-force time is later than current time, regulation validity should be " Pending The Entry Into Force ";Entry-into-force time is earlier than current time
When, regulation validity should be " effective " or " revision ";Expiration time should be " failure " earlier than current time, regulation validity.
(4) judge whether regulation repeats by title of code and code combination.
It, can also be with if, can be by being artificially modified in amendment it was found that error in data, is modified data
It is modified by smart machine, this is all within the scope of the present invention.
By being verified to data correctness, it can be ensured that correctness of the data in typing improves user experience.
After the correctness for having verified regulation data, using preset fractionation rule, the regulation data are torn open
Point, obtain fine data, detailed process are as follows: split according to natural paragraph to the regulation data;According in natural paragraph
Keyword, establish new logic paragraph, obtain fine data.
Regulation data are first dropped into capable fractionation by paragragh, then, if there are specific keyword, examples in natural paragraph
Such as, multiple natural paragraphs between two keywords are merged into as a new logical segment by chapter n has, the N articles, N money etc.
It falls, forms fine data.
Fall behind being split as new logical segment, can also be modified by manually being fallen to logical segment, guarantees logic paragraph
Correctness.
Step S103: according to preset matching rule, matching the fine data, forms relationship maps
Table.
On the basis of fine data, regulation and regulation, regulation and logic paragraph, logic paragraph and logical segment can establish
Incidence relation between falling, to provide various dimensions, more fine-grained accurate inquiry for user.
Form the process of relationship maps table are as follows:
To in the fine data regulation and logical segment drop into line flag, obtain regulation ID and paragraph ID;According to preparatory
The matching rule of setting matches the fine data, obtains the incidence relation between the fine data;Using described
Regulation ID and paragraph ID indicates the incidence relation, forms relationship maps table.
Wherein, a regulation ID can correspond at least one paragraph ID in relation mapping table, and multiple regulation ID can also be right
A paragraph ID is answered, this is all within the scope of the present invention.
When incidence relation of the incidence relation between regulation and regulation, matched using following matching rule:
In the fine data, according to the regulation name complete match of regulation, if successful match, using matching result as
Incidence relation between regulation and regulation;If it fails to match, matched according to the content of punctuation marks used to enclose the title in regulation name, if matching
Success, then using matching result as the incidence relation between regulation and regulation;If it fails to match, ignoring in regulation name is removed
Word, the regulation name after word is ignored according to removal is matched, if successful match, using matching result as between regulation and regulation
Incidence relation;If it fails to match, there is no incidence relation between corresponding regulation.
Before matching, it should remove additional character, guarantee accurately and efficiently matches.
Such as: 1) first press regulation name complete match.If miss is matched, into step 2;2) by the punctuation marks used to enclose the title in regulation name
Matching, if regulation is entitled " about the notice for printing and distributing " actuarial report " ", then matches " the actuarial report " in punctuation marks used to enclose the title.If matching is not
Hit, into step 3;3) it is matched again after " the ignoring word " of the hit of removal regulation, as regulation " refers to about efficiency credit is printed and distributed
The notice drawn ", " about printing and distributing " therein and " notice " are to ignore word, and entitled " efficiency credit refers to effective regulation after removal
Draw ";4) sequence according to 1), 2), 3) is successively matched, if by the rule of 1) step description in entire regulation text
With unsuccessful, then matched again by 2) rule;Rule is 2) also without successful match, then again by 3) being matched;5) above-mentioned step
Matching in rapid should all remove additional character in advance, such as drawing in " about the notice for printing and distributing " " Shandong blueness benchmark loan " detailed rules for the implementation " "
Number.
When incidence relation of the incidence relation between regulation and logic paragraph, matching rule are as follows: in retrieval logic paragraph
Whether it is related to regulation, if being related to, the ID of the logic paragraph and regulation ID is associated, forms incidence relation;If not relating to
And then the logic paragraph and the regulation do not have incidence relation.
When incidence relation of the incidence relation between logic paragraph and logic paragraph, matching rule are as follows: retrieval two is patrolled
It collects in paragraph and whether is related to identical regulation meaning, if being related to, the two logic paragraphs ID is associated, form association and close
System;If not being related to, there is no incidence relation between the two logic paragraphs.
Step S104: the relationship maps table and the fine data are stored to database.
After relationship maps table and fine data are all formed, store to database.
It in the present invention, can also include acquiring real-time regulation data in real time, and according to real-time regulation data more new data
Data in library guarantee the accuracy of data in database.
Collection and arrangement by using the present invention to regulation data can reduce the cost of labor of data inputting, and energy
Enough more fully regulation data are provided for user.
Based on inventive concept identical with a kind of above-mentioned date storage method of regulatory information, corresponding, this hair
Bright embodiment additionally provides a kind of data storage device of regulatory information, as shown in Figure 2.Due to Installation practice substantially it is similar with
Embodiment of the method, so describing fairly simple, the relevent part can refer to the partial explaination of embodiments of method.
A kind of data storage device of regulatory information provided by the invention, comprising:
Data acquisition module 101, for acquiring the regulation data about regulatory information;
Data split module 102, for splitting, obtaining to the regulation data according to preset fractionation rule
Obtain fine data;
Data match module 103, for being matched to the fine data, shape according to preset matching rule
At relationship maps table;
Data memory module 104, for storing the relationship maps table and the fine data to database.
In a specific embodiment provided by the invention, described device, further includes:
Authentication module, for verifying the correctness of the regulation data;If correct, execute the data and split module
102 content;If mistake, the regulation data are modified, then execute the content that the data split module 102.
In a specific embodiment provided by the invention, the data split module 102, comprising:
Split cells, for being split according to natural paragraph to the regulation data;
New paragraph establishes unit, for establishing new logic paragraph according to the keyword in natural paragraph, obtains fine number
According to.
In a specific embodiment provided by the invention, the data match module 103, comprising:
Marking unit, for in the fine data regulation and logical segment drop into line flag, obtain regulation ID and section
Fall ID;
Matching unit, for being matched to the fine data, obtaining the essence according to preset matching rule
Count the incidence relation between accurately;
It indicates unit, for indicating the incidence relation using the regulation ID and paragraph ID, forms relationship maps table.
In a specific embodiment provided by the invention, the incidence relation, comprising: regulation and regulation, regulation and patrol
Collect the incidence relation between paragraph and logic paragraph and logic paragraph.
In a specific embodiment provided by the invention, the incidence relation is associated with pass between regulation and regulation
System;
The matching unit, specifically includes:
In the fine data, according to the regulation name complete match of regulation, if successful match, using matching result as
Incidence relation between regulation and regulation;
If it fails to match, matched according to the content of punctuation marks used to enclose the title in regulation name, if successful match, by matching result
As the incidence relation between regulation and regulation;
If it fails to match, removing in regulation name and ignore word, the regulation name after word is ignored according to removal is matched, if
Successful match, then using matching result as the incidence relation between regulation and regulation;If it fails to match, do not have between corresponding regulation
It is relevant.
More than, it is a kind of data storage device of regulatory information provided by the invention.
Based on a kind of above-mentioned date storage method of regulatory information, the present invention also provides a kind of numbers of regulatory information accordingly
According to querying method, referring to FIG. 3, Fig. 3 is a kind of data query method for regulatory information that the specific embodiment of the invention provides
Flow chart.The data query method is used cooperatively with above-mentioned date storage method, and related place is referring to a kind of above-mentioned regulatory information
Date storage method.
A kind of data query method of regulatory information provided by the invention, comprising:
Step S201: by storage relationship maps table in the database and the fine data synchronize store to search engine
Database.
Had using the data that the date storage method of regulatory information stores in the database: relationship maps table and fine number
According in order to guarantee the rapidity of data query, needing relationship maps table and fine data being synchronized to search in data query
In engine database.In synchrodata, it can be synchronized and be serviced using multiple servers, improve the synchronous efficiency of data.
Step S202: query search engine is established.
When establishing query search engine, firstly, obtaining the keyword and synonym of regulatory information;Then, to the pass
Keyword and the synonym are segmented;Finally, being based on elasticsearch distributed search engine according to word segmentation result, building
Vertical query search engine.
Wherein, ElasticSearch is the search server based on Lucene.It is multi-purpose that it provides a distribution
The full-text search engine of family ability is based on RESTful web interface.Elasticsearch is developed with Java, and conduct
Open source code publication under Apache license terms, is Enterprise search engine currently popular.Designed in cloud computing, energy
Enough reach real-time search, stablizes, it is reliably, quickly, easy to install and use.
Keyword may include: title, promulgate the time, promulgate the fields such as mechanism.Synonym refers to the synonym of keyword.
By the way that synonym is arranged, it can be improved the range of search, keep search result more comprehensive.
Regulation is searched for other than traditional search based on elasticsearch distributed search engine+Chinese word segmentation, this
The distinctive vocabulary in system combination financial supervision field has carried out depth customization, and synonym search is incorporated.Such as " stock supervisory committee ",
" China Securities Regulatory Commission ", " China Securities Regulatory Commission " three are the same meaning in fact, then when user's search " card prison
When meeting ", and " China Securities Regulatory Commission ", " China Securities Regulatory Commission " relevant data may also appear in search result
In.
By carrying out Chinese word segmentation and index to data automatically, be stored in distributed search engine, with realize a large amount of regulations/
Quick, the accurate search of case data.
Chinese word segmentation refers to a chinese character sequence being cut into individual word one by one.Participle is exactly by continuous word sequence
Column are reassembled into the process of word sequence according to certain specification.It is using space as certainly between word in the style of writing of English
Right delimiter, and Chinese is that word, sentence and section can simply be demarcated by apparent delimiter, none form of word only
On delimiter, although English is also the same, there are the partition problems of phrase, but on this layer of word, the English of Chinese ratio is answered
It is much miscellaneous, much more difficult.
Regulation system segments Chinese using well-known IKAnalyzer, and the result after participle is with inverted index structure
It is stored in elasticsearch distributed search engine.
Step S203: the screening conditions of user's input are obtained.
Wherein, screening conditions may include: one of fields such as title, promulgation time, promulgation mechanism or a variety of groups
It closes.
Step S204: it is transferred from described search engine database according to the screening conditions using described search engine
Corresponding associated data.
User is in foregrounding screening conditions, the data that precisely search needs.
Step S205: the associated data is exported to user.
After having searched for associated data, corresponding associated data is exported to user, user is made to check corresponding search result.
By using this way of search, the search engine is established, more comprehensive search result can be provided for user,
And then provide user experience.
Based on inventive concept identical with a kind of above-mentioned data query method of regulatory information, corresponding, this hair
Bright embodiment additionally provides a kind of data query device of regulatory information, as shown in Figure 4.Due to Installation practice substantially it is similar with
Embodiment of the method, so describing fairly simple, the relevent part can refer to the partial explaination of embodiments of method.
A kind of data query device of regulatory information provided by the invention, comprising:
Data simultaneous module 201, for by relationship maps table and fine data synchronize store to search engine database;
Engine establishes module 202, for establishing query search engine;
Condition obtains module 203, for obtaining the screening conditions of user's input;
Data transfer module 204, are used for according to the screening conditions, using described search engine, from described search engine
Corresponding associated data is transferred in database;
Data outputting module 205, for exporting the associated data to user.
In a specific embodiment provided by the invention, the engine establishes module 202, comprising:
Word acquiring unit, for obtaining the keyword and synonym of regulatory information;
Participle unit, for being segmented to the keyword and the synonym;
Engine establishes unit, and for being based on elasticsearch distributed search engine according to word segmentation result, foundation is looked into
Ask search engine.
More than, it is a kind of data query device of regulatory information provided by the invention.
Those of ordinary skill in the art may be aware that list described in conjunction with the examples disclosed in the embodiments of the present disclosure
Member and algorithm steps, can be realized with electronic hardware, computer software, or a combination of the two, in order to clearly demonstrate hardware
With the interchangeability of software, each exemplary composition and step are generally described according to function in the above description.This
A little functions are implemented in hardware or software actually, the specific application and design constraint depending on technical solution.Specially
Industry technical staff can use different methods to achieve the described function each specific application, but this realization is not
It is considered as beyond the scope of this invention.
In several embodiments provided herein, it should be understood that disclosed device and method can pass through it
Its mode is realized.For example, the apparatus embodiments described above are merely exemplary, for example, the division of the unit, only
Only a kind of logical function partition, there may be another division manner in actual implementation, such as multiple units or components can be tied
Another system is closed or is desirably integrated into, or some features can be ignored or not executed.In addition, shown or discussed phase
Mutually between coupling, direct-coupling or communication connection can be through some interfaces, the INDIRECT COUPLING or communication of device or unit
Connection is also possible to electricity, mechanical or other form connections.
The unit as illustrated by the separation member may or may not be physically separated, aobvious as unit
The component shown may or may not be physical unit, it can and it is in one place, or may be distributed over multiple
In network unit.Some or all of unit therein can be selected to realize the embodiment of the present invention according to the actual needs
Purpose.
It, can also be in addition, the functional units in various embodiments of the present invention may be integrated into one processing unit
It is that each unit physically exists alone, is also possible to two or more units and is integrated in one unit.It is above-mentioned integrated
Unit both can take the form of hardware realization, can also realize in the form of software functional units.
If the integrated unit is realized in the form of SFU software functional unit and sells or use as independent product
When, it can store in a computer readable storage medium.Based on this understanding, technical solution of the present invention is substantially
The all or part of the part that contributes to existing technology or the technical solution can be in the form of software products in other words
It embodies, which is stored in a storage medium, including some instructions are used so that a computer
Equipment (can be personal computer, server or the network equipment etc.) executes the complete of each embodiment the method for the present invention
Portion or part steps.And storage medium above-mentioned includes: USB flash disk, mobile hard disk, read-only memory (ROM, Read-Only
Memory), random access memory (RAM, Random Access Memory), magnetic or disk etc. are various can store journey
The medium of sequence code.
The above description is merely a specific embodiment, but scope of protection of the present invention is not limited thereto, any
Those familiar with the art in the technical scope disclosed by the present invention, can readily occur in various equivalent modifications or replace
It changes, these modifications or substitutions should be covered by the protection scope of the present invention.Therefore, protection scope of the present invention should be with right
It is required that protection scope subject to.
Claims (10)
1. a kind of date storage method of regulatory information characterized by comprising
Acquire the regulation data about regulatory information;
According to preset fractionation rule, the regulation data are split, fine data is obtained;
According to preset matching rule, the fine data is matched, forms relationship maps table;
The relationship maps table and the fine data are stored to database.
2. the method according to claim 1, wherein the step of the regulation data in the acquisition about regulatory information
After rapid, according to preset fractionations rule, the regulation data are split, the step of acquisition fine data it
Before, further includes:
Verify the correctness of the regulation data;If correct, execute it is described regular according to preset fractionations, to the method
The step of rule data are split, obtain fine data;If mistake, the regulation data are modified, then are executed described
According to preset fractionation rule, the step of being split to the regulation data, obtain fine data.
3. the method according to claim 1, wherein described according to preset fractionation rule, to the method
Rule data are split, and fine data is obtained, comprising:
The regulation data are split according to natural paragraph;
According to the keyword in natural paragraph, new logic paragraph is established, obtains fine data.
4. according to the method described in claim 3, it is characterized in that, described according to preset matching rule, to the essence
It counts accurately according to being matched, forms relationship maps table, comprising:
To in the fine data regulation and logical segment drop into line flag, obtain regulation ID and paragraph ID;
According to preset matching rule, the fine data is matched, obtains the association between the fine data
Relationship;
The incidence relation is indicated using the regulation ID and paragraph ID, forms relationship maps table.
5. according to the method described in claim 4, it is characterized in that, the incidence relation, comprising: regulation and regulation, regulation with
Incidence relation between logic paragraph and logic paragraph and logic paragraph.
6. according to the method described in claim 5, it is characterized in that, the incidence relation is associated with pass between regulation and regulation
System;
It is described that the fine data is matched according to preset matching rule, obtain the pass between regulation and regulation
Connection relationship, comprising:
In the fine data, according to the regulation name complete match of regulation, if successful match, using matching result as regulation
Incidence relation between regulation;
If it fails to match, matched according to the content of punctuation marks used to enclose the title in regulation name, if successful match, using matching result as
Incidence relation between regulation and regulation;
If it fails to match, remove in regulation name and ignore word, the regulation name after word is ignored according to removal is matched, if matching
Success, then using matching result as the incidence relation between regulation and regulation;If it fails to match, do not closed between corresponding regulation
Connection relationship.
7. a kind of data storage device of regulatory information characterized by comprising
Data acquisition module, for acquiring the regulation data about regulatory information;
Data split module, for being split to the regulation data according to preset fractionation rule, obtain fine number
According to;
Data match module forms association and reflects for being matched to the fine data according to preset matching rule
Firing table;
Data memory module, for storing the relationship maps table and the fine data to database.
8. a kind of data query method of regulatory information characterized by comprising
By the relationship maps table of the date storage method storage using regulatory information described in claim 1 to 6 any one and institute
It states fine data and synchronizes and store to search engine database;
Establish query search engine;
Obtain the screening conditions of user's input;
Corresponding incidence number is transferred from described search engine database using described search engine according to the screening conditions
According to;
The associated data is exported to user.
9. according to the method described in claim 8, it is characterized in that, described establish query search engine, comprising:
Obtain the keyword and synonym of regulatory information;
The keyword and the synonym are segmented;
According to word segmentation result, it is based on elasticsearch distributed search engine, establishes query search engine.
10. a kind of data query device of regulatory information characterized by comprising
Data simultaneous module, for by relationship maps table and fine data synchronize store to search engine database;
Engine establishes module, for establishing query search engine;
Condition obtains module, for obtaining the screening conditions of user's input;
Data transfer module, are used for according to the screening conditions, using described search engine, from described search engine database
Transfer corresponding associated data;
Data outputting module, for exporting the associated data to user.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201811533428.7A CN109614453A (en) | 2018-12-14 | 2018-12-14 | A kind of data storage, querying method and the device of regulatory information |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201811533428.7A CN109614453A (en) | 2018-12-14 | 2018-12-14 | A kind of data storage, querying method and the device of regulatory information |
Publications (1)
Publication Number | Publication Date |
---|---|
CN109614453A true CN109614453A (en) | 2019-04-12 |
Family
ID=66008582
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201811533428.7A Pending CN109614453A (en) | 2018-12-14 | 2018-12-14 | A kind of data storage, querying method and the device of regulatory information |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN109614453A (en) |
Cited By (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN110442590A (en) * | 2019-08-06 | 2019-11-12 | 北京三维天地科技有限公司 | It is a kind of for provide examine detection service system and method |
CN110737839A (en) * | 2019-10-22 | 2020-01-31 | 京东数字科技控股有限公司 | Short text recommendation method, device, medium and electronic equipment |
CN112199466A (en) * | 2020-09-08 | 2021-01-08 | 深圳价值在线信息科技股份有限公司 | Method and device for identifying associated regulation of mail |
CN117743390A (en) * | 2024-02-20 | 2024-03-22 | 证通股份有限公司 | Query method and system for financial information and storage medium |
Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US8126884B1 (en) * | 1997-01-10 | 2012-02-28 | The Board Of Trustees Of The Leland Stanford Junior University | Scoring documents in a linked database |
CN104008171A (en) * | 2014-06-03 | 2014-08-27 | 中国科学院计算技术研究所 | Legal database establishing method and legal retrieving service method |
CN106815256A (en) * | 2015-12-01 | 2017-06-09 | 北京国双科技有限公司 | Set up the method and device of laws and regulations bar fund incidence relation |
CN108132941A (en) * | 2016-11-30 | 2018-06-08 | 北京国双科技有限公司 | The treating method and apparatus of the incidence relation of juristic writing |
-
2018
- 2018-12-14 CN CN201811533428.7A patent/CN109614453A/en active Pending
Patent Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US8126884B1 (en) * | 1997-01-10 | 2012-02-28 | The Board Of Trustees Of The Leland Stanford Junior University | Scoring documents in a linked database |
CN104008171A (en) * | 2014-06-03 | 2014-08-27 | 中国科学院计算技术研究所 | Legal database establishing method and legal retrieving service method |
CN106815256A (en) * | 2015-12-01 | 2017-06-09 | 北京国双科技有限公司 | Set up the method and device of laws and regulations bar fund incidence relation |
CN108132941A (en) * | 2016-11-30 | 2018-06-08 | 北京国双科技有限公司 | The treating method and apparatus of the incidence relation of juristic writing |
Cited By (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN110442590A (en) * | 2019-08-06 | 2019-11-12 | 北京三维天地科技有限公司 | It is a kind of for provide examine detection service system and method |
CN110737839A (en) * | 2019-10-22 | 2020-01-31 | 京东数字科技控股有限公司 | Short text recommendation method, device, medium and electronic equipment |
CN112199466A (en) * | 2020-09-08 | 2021-01-08 | 深圳价值在线信息科技股份有限公司 | Method and device for identifying associated regulation of mail |
CN112199466B (en) * | 2020-09-08 | 2024-04-12 | 深圳价值在线信息科技股份有限公司 | Method and device for identifying associated rule of mail |
CN117743390A (en) * | 2024-02-20 | 2024-03-22 | 证通股份有限公司 | Query method and system for financial information and storage medium |
CN117743390B (en) * | 2024-02-20 | 2024-05-28 | 证通股份有限公司 | Query method and system for financial information and storage medium |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN109614453A (en) | A kind of data storage, querying method and the device of regulatory information | |
CN103488648B (en) | A kind of multilingual mixed index method and system | |
US8135717B2 (en) | Processor for fast contextual matching | |
CN103020293B (en) | A kind of construction method and system of the ontology library of mobile application | |
Kuzey et al. | Extraction of temporal facts and events from Wikipedia | |
Hadni et al. | A new and efficient stemming technique for Arabic Text Categorization | |
CN103678576A (en) | Full-text retrieval system based on dynamic semantic analysis | |
CN109376202B (en) | NLP-based enterprise supply relationship automatic extraction and analysis method | |
Bjarnadóttir | The database of modern Icelandic inflection (Beygingarlýsing íslensks nútímamáls) | |
CN107357777B (en) | Method and device for extracting label information | |
CN107844493B (en) | File association method and system | |
Strötgen et al. | An event-centric model for multilingual document similarity | |
CN103186556A (en) | Method for obtaining and searching structural semantic knowledge and corresponding device | |
CN110032622B (en) | Keyword determination method, keyword determination device, keyword determination equipment and computer readable storage medium | |
CN112380848B (en) | Text generation method, device, equipment and storage medium | |
Hassan et al. | Improving named entity translation by exploiting comparable and parallel corpora | |
CN111222028B (en) | Intelligent data crawling method | |
Renouf et al. | Filling the gaps: Using the WebCorp Linguist’s Search Engine to supplement existing text resources | |
CN112667866A (en) | Test paper generation method and device, electronic equipment and storage medium | |
CN109885641A (en) | A kind of method and system of database Chinese Full Text Retrieval | |
CN111078839A (en) | Structured processing method and processing device for referee document | |
Sembok et al. | Arabic word stemming algorithms and retrieval effectiveness | |
CN102117285A (en) | Search method based on semantic indexing | |
Abdurakhmonova | Formal-Functional Models of The Uzbek Electron Corpus | |
CN113591476A (en) | Data label recommendation method based on machine learning |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination |