CN107766545A - Scientific and technological data management method and device - Google Patents
Scientific and technological data management method and device
- Publication number
- CN107766545A (application CN201711043893.8A)
- Authority
- CN
- China
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Landscapes
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
Abstract
The invention provides a method and a device for managing scientific and technological data. The method comprises: constructing an overall data model, the overall data model comprising at least two items of scientific and technological data and the association relationships between those items; receiving a query keyword; determining, from the overall data model and according to the query keyword, reference scientific and technological data corresponding to the query keyword; and determining, according to the association relationship corresponding to the reference scientific and technological data, target scientific and technological data corresponding to the query keyword, and outputting the target scientific and technological data. This technical solution can improve the efficiency of searching scientific and technological data.
Description
Technical field
The present invention relates to the field of computer technology, and in particular to a method and a device for managing scientific and technological data.
Background art
With the development of information technology, the volume of scientific and technological data keeps expanding and its sources are varied. How to find the correct scientific and technological data quickly and accurately has become a problem.
Traditional data integration schemes merely gather data of different sources, formats and characteristics together, logically or physically. When such a scheme is used to search for scientific and technological data, a large number of data samples must be traversed before the correct data is found, which makes the search inefficient.
Summary of the invention
The embodiments of the present invention provide a method and a device for managing scientific and technological data, which can improve the efficiency of data search.
In a first aspect, an embodiment of the present invention provides a method for managing scientific and technological data, comprising:
constructing an overall data model, the overall data model comprising at least two items of scientific and technological data and the association relationships between the items;
and further comprising:
receiving a query keyword;
determining, from the overall data model and according to the query keyword, reference scientific and technological data corresponding to the query keyword;
determining, according to the association relationship corresponding to the reference scientific and technological data, target scientific and technological data corresponding to the query keyword, and outputting the target scientific and technological data.
Preferably,
constructing the overall data model comprises:
constructing a conceptual model, the conceptual model comprising at least two entities, the attribute information corresponding to each entity, and the association relationships between the entities, wherein each entity corresponds to at least two items of scientific and technological data;
verifying the association relationships between the entities according to the attribute information of each entity, and constructing a logical model according to the verification result;
constructing the overall data model according to the constructed conceptual model and the logical model.
Preferably,
after constructing the logical model, the method further comprises:
constructing a physical model, the physical model comprising a preset data output mode;
and constructing the overall data model according to the constructed conceptual model and the logical model comprises:
determining a concept-logical mapping relationship between the conceptual model and the logical model, and a concept-physical mapping relationship between the conceptual model and the physical model;
constructing the overall data model according to the determined concept-logical and concept-physical mapping relationships, together with the conceptual model, the logical model and the physical model.
Preferably,
constructing the conceptual model comprises:
determining at least two ontologies and the ontology attributes corresponding to each ontology;
for each ontology: determining at least two items of scientific and technological data corresponding to the ontology attributes, and taking the determined items as the instances corresponding to the ontology;
determining the entities in the conceptual model according to the determined instances, and taking the ontology attributes corresponding to each ontology as the attribute information of the corresponding entity;
determining the association relationships between the entities according to the attribute information corresponding to each entity.
Preferably,
verifying the association relationships between the entities according to the attribute information of each entity comprises:
performing normalization and denormalization on the at least two items of scientific and technological data corresponding to each entity, according to the attribute information of that entity;
determining the association relationships between the items of scientific and technological data according to the processed data.
Preferably,
constructing the overall data model comprises:
grouping the items of scientific and technological data according to the association relationships between them, to obtain at least two groups of scientific and technological data, wherein each group comprises at least one item of scientific and technological data;
constructing the overall data model according to the at least two groups;
and determining, from the overall data model and according to the query keyword, the reference scientific and technological data corresponding to the query keyword comprises:
determining, from the overall data model, at least one group of scientific and technological data corresponding to the query keyword;
determining the reference scientific and technological data from the at least one determined group.
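By way of a non-limiting illustration, the grouping and group-first lookup described above might be sketched as follows. The text does not say how groups are formed or how keywords are matched; this sketch assumes that a group is a set of transitively associated items and that a keyword matches by substring. The function names and the sample data are hypothetical.

```python
def group_by_association(items, associations):
    """Split items into groups of transitively associated items (an assumption)."""
    neighbours = {item: set() for item in items}
    for a, b in associations:
        neighbours[a].add(b)
        neighbours[b].add(a)
    groups, seen = [], set()
    for item in items:
        if item in seen:
            continue
        group, stack = set(), [item]
        while stack:
            current = stack.pop()
            if current not in group:
                group.add(current)
                stack.extend(neighbours[current] - group)
        seen |= group
        groups.append(group)
    return groups


def find_reference(groups, items, keyword):
    """Narrow the search to matching groups, then pick one reference item."""
    key = keyword.lower()
    matching = [g for g in groups if any(key in items[i].lower() for i in g)]
    for group in matching:
        for item in sorted(group):
            if key in items[item].lower():
                return group, item
    return None, None


# Hypothetical sample data: item id -> descriptive text.
items = {
    "project-1": "Graphene battery research project",
    "person-1": "Principal investigator Li",
    "result-1": "Patent on electrode coating",
    "project-2": "Marine sensor project",
}
associations = [("project-1", "person-1"), ("project-1", "result-1")]

groups = group_by_association(items, associations)
group, reference = find_reference(groups, items, "sensor")
```

In this sketch the reference item is simply the first match in identifier order; any other selection rule within the matching group would fit the description equally well.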
In a second aspect, an embodiment of the present invention provides a device for managing scientific and technological data, comprising a construction unit, a receiving unit and a query unit, wherein:
the construction unit is configured to construct an overall data model, the overall data model comprising at least two items of scientific and technological data and the association relationships between the items;
the receiving unit is configured to receive a query keyword;
the query unit is configured to determine, from the overall data model and according to the query keyword, reference scientific and technological data corresponding to the query keyword; and to determine, according to the association relationship corresponding to the reference scientific and technological data, target scientific and technological data corresponding to the query keyword, and output the target scientific and technological data.
Preferably,
the construction unit is configured to construct a conceptual model, the conceptual model comprising at least two entities, the attribute information corresponding to each entity, and the association relationships between the entities, wherein each entity corresponds to at least two items of scientific and technological data; to verify the association relationships between the entities according to the attribute information of each entity and construct a logical model according to the verification result; and to construct the overall data model according to the constructed conceptual model and the logical model.
Preferably,
the construction unit is configured to construct a physical model, the physical model comprising a preset data output mode; to determine a concept-logical mapping relationship between the conceptual model and the logical model, and a concept-physical mapping relationship between the conceptual model and the physical model; and to construct the overall data model according to the determined concept-logical and concept-physical mapping relationships, together with the conceptual model, the logical model and the physical model.
Preferably,
the construction unit is configured to determine at least two ontologies and the ontology attributes corresponding to each ontology; for each ontology, to determine at least two items of scientific and technological data corresponding to the ontology attributes and take the determined items as the instances corresponding to the ontology; to determine the entities in the conceptual model according to the determined instances and take the ontology attributes corresponding to each ontology as the attribute information of the corresponding entity; and to determine the association relationships between the entities according to the attribute information corresponding to each entity.
Preferably,
the construction unit is configured to perform normalization and denormalization on the at least two items of scientific and technological data corresponding to each entity, according to the attribute information of that entity; and to determine the association relationships between the items of scientific and technological data according to the processed data.
The embodiments of the present invention provide a method and a device for managing scientific and technological data. An overall data model is constructed in advance; when a query keyword is received, reference scientific and technological data corresponding to the query keyword is determined from the overall data model, and target scientific and technological data corresponding to the query keyword is determined according to the association relationship corresponding to that reference data. Because the overall data model contains the association relationships between the items of scientific and technological data, it is enough to determine any one item of reference data corresponding to the query keyword; all the target data can then be determined from the association relationships, without traversing a large number of data samples, which improves the efficiency of data search.
Brief description of the drawings
In order to explain the technical solutions of the embodiments of the present invention or of the prior art more clearly, the drawings needed for describing the embodiments or the prior art are briefly introduced below. Obviously, the drawings described below show some embodiments of the present invention, and a person of ordinary skill in the art can obtain other drawings from them without creative effort.
Fig. 1 is a flow chart of a method for managing scientific and technological data provided by an embodiment of the present invention;
Fig. 2 is a schematic structural diagram of a device for managing scientific and technological data provided by an embodiment of the present invention.
Detailed description of the embodiments
To make the purpose, technical solutions and advantages of the embodiments of the present invention clearer, the technical solutions in the embodiments are described clearly and completely below with reference to the accompanying drawings. Obviously, the described embodiments are some rather than all of the embodiments of the present invention. All other embodiments obtained by a person of ordinary skill in the art on the basis of these embodiments without creative effort fall within the scope of protection of the present invention.
As shown in Fig. 1, an embodiment of the present invention provides a method for managing scientific and technological data, which may include the following steps:
Step 101: construct an overall data model, the overall data model comprising at least two items of scientific and technological data and the association relationships between the items;
Step 102: receive a query keyword;
Step 103: determine, from the overall data model and according to the query keyword, reference scientific and technological data corresponding to the query keyword;
Step 104: determine, according to the association relationship corresponding to the reference scientific and technological data, target scientific and technological data corresponding to the query keyword, and output the target scientific and technological data.
In the above embodiment, an overall data model is constructed in advance. When a query keyword is received, reference scientific and technological data corresponding to the query keyword is determined from the overall data model, and target scientific and technological data corresponding to the query keyword is determined according to the association relationship corresponding to that reference data. Because the overall data model contains the association relationships between the items of scientific and technological data, it is enough to determine any one item of reference data corresponding to the query keyword; all the target data can then be determined from the association relationships, without traversing a large number of data samples, which improves the efficiency of data search.
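Steps 101 to 104 might be sketched, purely for illustration, as follows. The class `OverallDataModel`, its method names and the sample items are assumptions rather than part of the described method; associations are treated as symmetric, keyword matching is a plain substring test, and the lookup of the reference item is a simple scan that a real system would presumably replace with an index.

```python
from collections import deque


class OverallDataModel:
    """Items of scientific and technological data plus their associations."""

    def __init__(self):
        self.items = {}          # item id -> descriptive text
        self.associations = {}   # item id -> set of associated item ids

    def add_item(self, item_id, text):
        self.items[item_id] = text
        self.associations.setdefault(item_id, set())

    def associate(self, a, b):
        # Associations are treated as symmetric in this sketch.
        self.associations[a].add(b)
        self.associations[b].add(a)

    def find_reference(self, keyword):
        # Step 103: stop at the first item whose text contains the keyword.
        for item_id, text in self.items.items():
            if keyword.lower() in text.lower():
                return item_id
        return None

    def query(self, keyword):
        # Step 104: follow associations outward from the reference item.
        start = self.find_reference(keyword)
        if start is None:
            return []
        seen, queue = {start}, deque([start])
        while queue:
            current = queue.popleft()
            for neighbour in self.associations[current]:
                if neighbour not in seen:
                    seen.add(neighbour)
                    queue.append(neighbour)
        return sorted(seen)


# Step 101: build the model in advance (hypothetical sample data).
model = OverallDataModel()
model.add_item("project-1", "Graphene battery research project")
model.add_item("person-1", "Principal investigator Li")
model.add_item("result-1", "Patent on electrode coating")
model.add_item("project-2", "Marine sensor project")
model.associate("project-1", "person-1")
model.associate("project-1", "result-1")

# Steps 102 to 104: receive a keyword and collect the associated target data.
targets = model.query("graphene")
```

Once one reference item has been found, the remaining target items are reached through the stored associations alone, so their text is never compared against the keyword.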
In one embodiment of the present invention, step 101 may be implemented as follows:
constructing a conceptual model, the conceptual model comprising at least two entities, the attribute information corresponding to each entity, and the association relationships between the entities, wherein each entity corresponds to at least two items of scientific and technological data;
verifying the association relationships between the entities according to the attribute information of each entity, and constructing a logical model according to the verification result;
constructing the overall data model according to the constructed conceptual model and the logical model.
Here, the overall data model is a multi-level structure that both stores data conveniently and organizes it efficiently. The conceptual model is the framework of the whole model: a comprehensive description of the logical structure and characteristics of all the data in the data model. The logical model is the data view seen by database users, a logical description of user data; it is usually a subset of the conceptual model, and one conceptual model can be described by several logical models. The physical model describes how the database is concretely implemented on physical storage, and usually has to take the deployment environment into account.
The association relationships between entities are generally of the following kinds: part-of represents the relationship between a part and a whole; kind-of represents the inheritance relationship between concepts, similar to the parent class and subclass relationship; instance-of represents the relationship between an instance of a concept and the concept, similar to the relationship between an object and its class; attribute-of represents that one concept is an attribute of another concept.
In addition, several properties of ontological relationships can be used for reasoning. Inverse relationship (inverse), such as managing and being managed: if concept A has relation R to concept B and, at the same time, concept B has relation R⁻¹ to concept A, then relation R is said to have an inverse, or R is a reciprocal relation. Transitive relationship (transitivity): if A has relation R to B and B has relation R to C, then A has relation R to C. Inheritance relationship (kind-of): for concepts C and D, write C′ = {x | x is an instance of C} and D′ = {x | x is an instance of D}; if every x that belongs to D′ also belongs to C′, then C is called the parent concept of D, and D is called a child concept of C. Part relationship (part-of): the relationship between a part and a whole, for example concept C is a part of concept D. Instance relationship (instance-of): the relationship between an instance of a concept and the concept, for example E is an instance of concept C. Attribute relationship (attribute-of): one concept is an attribute of another concept, for example C is an attribute of D.
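The relationship kinds above, together with the inverse and transitive properties used for reasoning, might be represented as in the following illustrative sketch. The enum `Relation`, the helper functions and the sample facts are all hypothetical; the closure is computed by naive repeated joining, which is adequate only for small fact sets.

```python
from enum import Enum


class Relation(Enum):
    PART_OF = "part-of"
    KIND_OF = "kind-of"
    INSTANCE_OF = "instance-of"
    ATTRIBUTE_OF = "attribute-of"


# Each fact is a (subject, relation, object) triple; the data is hypothetical.
facts = {
    ("key project", Relation.KIND_OF, "science project"),
    ("science project", Relation.KIND_OF, "project"),
    ("project-1", Relation.INSTANCE_OF, "key project"),
    ("budget", Relation.ATTRIBUTE_OF, "project"),
}


def transitive_closure(facts, relation):
    """Return all (a, c) pairs reachable through chains of one relation."""
    pairs = {(s, o) for s, r, o in facts if r is relation}
    changed = True
    while changed:
        changed = False
        for a, b in list(pairs):
            for c, d in list(pairs):
                if b == c and (a, d) not in pairs:
                    pairs.add((a, d))
                    changed = True
    return pairs


def inverse(pairs):
    """Swap each pair, giving the inverse relation R^-1."""
    return {(o, s) for s, o in pairs}


kind_of = transitive_closure(facts, Relation.KIND_OF)
has_kind = inverse(kind_of)
```

Here `kind_of` additionally contains the inferred pair ("key project", "project"), and `has_kind` reads the same facts in the parent-to-child direction.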
The logical model is built by expressing the conceptual model with logical methods. Because of the complexity of the business, it is difficult to build the logical model entirely from the conceptual model. During construction, the conceptual model needs to be consolidated and organized, and the entity relationships classified and enumerated, so as to preserve the completeness of the conceptual model as far as possible.
The logical model expresses the data sets abstracted in the conceptual model as concrete entities, sets out the main relationships between business entities, and enumerates all data items for more specific handling. At this stage, summarizing data sets are used to reflect the relationships between entities.
While the logical model is being built, the data structure must be verified against business functions or logic. Whether a data structure is appropriate depends on its composition and on whether it can support the execution of business functions, so the logical data model must be verified by the person responsible for the business. During verification, improperly structured data is improved, or inefficient elements of the business process are removed. The relationships between entities are sorted out; typically these are N:N relationships, exclusive relationships, recursive relationships and parent-child relationships, and they are expressed in an ER diagram. Attribute groups are used during logical modeling so that the content of each entity can be grasped simply and clearly.
In one embodiment of the present invention, after step 101, the method may further include:
constructing a physical model, the physical model comprising a preset data output mode;
and constructing the overall data model according to the constructed conceptual model and the logical model comprises:
determining a concept-logical mapping relationship between the conceptual model and the logical model, and a concept-physical mapping relationship between the conceptual model and the physical model;
constructing the overall data model according to the determined concept-logical and concept-physical mapping relationships, together with the conceptual model, the logical model and the physical model.
In this embodiment, the physical model is built first. The physical model specifies how data is output, the physical organization of data on the medium and how records are addressed, and defines the size of physical storage blocks, the overflow handling method, and so on. The mapping between the logical model and the conceptual model defines the correspondence between them: when the global logical structure changes for some reason, only this correspondence needs to be modified, which gives the data logical independence. The mapping between the conceptual model and the physical model defines the correspondence between the logical data and its physical storage: when the physical structure of the data changes, the corresponding mapping is modified, which gives the data physical independence. Thus, by building the physical, conceptual and logical models, determining the mappings between them, and then building the overall data model from those mappings, the resulting overall data model is more accurate and practical, which further improves the efficiency of data search.
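The independence provided by the two mappings might be illustrated with the following minimal sketch. The dictionaries, the column names such as `prj_ttl` and the function `read_view` are hypothetical; the point shown is only that a change of physical layout is absorbed by editing the concept-physical mapping, leaving the user-facing view untouched.

```python
# Conceptual model: entity name -> attribute names.
conceptual = {"Project": ["title", "total_funding"]}

# Concept-physical mapping: conceptual attribute -> stored column (hypothetical).
concept_to_physical = {
    ("Project", "title"): "prj_ttl",
    ("Project", "total_funding"): "prj_fund",
}

# Concept-logical mapping: field shown to the user -> conceptual attribute.
logical_to_concept = {
    "Project name": ("Project", "title"),
    "Funding": ("Project", "total_funding"),
}


def read_view(row, logical_fields):
    """Resolve user-facing fields through both mappings to stored values."""
    view = {}
    for field_name in logical_fields:
        concept_attr = logical_to_concept[field_name]
        column = concept_to_physical[concept_attr]
        view[field_name] = row[column]
    return view


row = {"prj_ttl": "Graphene battery", "prj_fund": 120}
view = read_view(row, ["Project name", "Funding"])

# The physical layout changes: only the concept-physical mapping is edited.
concept_to_physical[("Project", "title")] = "title_v2"
new_row = {"title_v2": "Graphene battery", "prj_fund": 120}
view_after = read_view(new_row, ["Project name", "Funding"])
```

A change to the logical view would be handled symmetrically, by editing `logical_to_concept` alone.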
When the physical model is built, the system deployment environment and the application structure are taken into account, and the data is decomposed or merged into the smallest sets of data units that perform best in the actual implementation. In this application system the data is stored centrally. Building the physical model involves determining whether a data association is a direct fetch or a circular association; how entities in a parent-child relationship are divided; the set of entities in an exclusive relationship; the division of recursively associated entities; the unique key of each entity; the data type definitions and value constraints of attributes; and so on.
In one embodiment of the present invention, constructing the conceptual model comprises:
determining at least two ontologies and the ontology attributes corresponding to each ontology;
for each ontology: determining at least two items of scientific and technological data corresponding to the ontology attributes, and taking the determined items as the instances corresponding to the ontology;
determining the entities in the conceptual model according to the determined instances, and taking the ontology attributes corresponding to each ontology as the attribute information of the corresponding entity;
determining the association relationships between the entities according to the attribute information corresponding to each entity.
In the conceptual model stage, the set of data may differ depending on the purpose and scope for which the data is used. That is, when the data is viewed from different business perspectives, the notion of what constitutes a set of data may be entirely different. When the conceptual model is built, the ontologies must be defined first. Here, an ontology has to be reconstructed as a cognitive model in the human mind, and a person comes to understand an ontology by browsing the data that describes it. In the RDF model, RDF triples are unordered. Obviously, this lack of order makes it very difficult for users to read and browse RDF data. Entities therefore need a suitable form of organization, one that supports efficient data location and conforms to the patterns of human cognition. The main steps are as follows:
1. Determine the scope: the core purpose of defining this data model is to develop a set of ontologies for use by the national science and technology management information system. The ontologies defined here are scoped to the field of national science and technology management. The specific business processes include science and technology project planning, compilation of project guidelines, project application, project review, periodic evaluation of projects, and project acceptance. The objects involved include scientific and technical personnel, science and technology projects, science and technology value, science and technology resources and conditions, the science and technology environment, science and technology events, and so on.
2. Reuse ontologies: with the widespread use of the Semantic Web, many ontologies already exist, especially in public or cross-domain areas, and already-defined ontologies can be used. A set of existing ontologies can therefore be adopted, with reference to the relevant standards and specifications.
3. Enumerate terms: write out an unstructured list of all relevant terms. When enumerating terms, refer to the general terms of the science and technology field, the terms in published standards and specifications of the information management discipline, and the general terms of project management, and enumerate the terms that appear in sample data. In general, nouns form the basis of class names and verbs form the basis of attributes.
4. Define the classification: once the relevant terms have been identified, they must be organized into a classification hierarchy. Whether a top-down or a bottom-up method is used depends on the actual situation. The important point is to ensure that the hierarchy is accurate: for example, if A is a subclass of B, every instance of A must also be an instance of B; categories at the same level should be clearly delimited and mutually exclusive; and the classification tree should expand in a balanced way, without large differences in depth between branches.
5. Define attributes: this step is usually interleaved with the previous one; while organizing the class hierarchy it is natural to organize the attributes of the classes as well. When A is a subclass of B, the attribute declarations that hold for instances of B must also apply to instances of A. When associating an attribute with a class, it is best to declare the attribute's domain and range, which makes the attribute easier for subclasses to apply. In addition, the finer the granularity of the domain and range the better, since inconsistencies and errors in the ontology can potentially be detected by checking for values that violate them. Specify whether an attribute allows or requires a certain number of distinct values; for example, the attribute "gender" needs a specified set of values: male, female or other. Specify when the value of an attribute is derived from other specific values; for example, the total funding of a science and technology project is the sum of its various funds.
6. Instantiate: in practice, ontology instances are seldom defined directly. Normally an ontology is used to organize a set of instances; in other words, the data in the data sources is organized under different ontologies. The number of instances may exceed the number of ontology classes by several orders of magnitude, so instances are usually not organized by hand but extracted from databases or obtained from text corpora.
7. Build RDF relationships based on UML: through the mapping between the basic relationships of the two, UML is used to describe the relationships and dependencies of triples, so that the database can query RDF directly and existing informatization results are better integrated.
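The domain and range check mentioned in step 5 might look like the following illustrative sketch. The class `AttributeSpec`, the subclass table, the range expressed as an enumerated value set and the sample instances are all assumptions made for the example; the "gender" attribute and its values follow the example in step 5.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class AttributeSpec:
    name: str
    domain: str          # class the attribute applies to
    allowed: frozenset   # range, given here as an enumerated value set


# Hypothetical class hierarchy: child class -> parent class.
subclass_of = {"Researcher": "Person"}


def is_a(cls, ancestor):
    """True if cls equals ancestor or inherits from it."""
    while cls is not None:
        if cls == ancestor:
            return True
        cls = subclass_of.get(cls)
    return False


def violations(instances, spec):
    """Return ids of instances whose value breaks the domain or the range."""
    bad = []
    for inst_id, (cls, values) in instances.items():
        if spec.name not in values:
            continue
        if not is_a(cls, spec.domain) or values[spec.name] not in spec.allowed:
            bad.append(inst_id)
    return sorted(bad)


gender = AttributeSpec("gender", "Person", frozenset({"male", "female", "other"}))
instances = {
    "r1": ("Researcher", {"gender": "female"}),  # subclass inherits the attribute
    "r2": ("Person", {"gender": "unknown"}),     # value outside the range
    "p1": ("Project", {"gender": "male"}),       # class outside the domain
}
bad = violations(instances, gender)
```

Because `Researcher` is declared a subclass of `Person`, the attribute declared on `Person` applies to `r1` without being restated, which mirrors the inheritance rule in step 5.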
After the ontologies are defined, the ontology of the science and technology field can be constructed concretely. People usually describe the concepts of a specific field and their relationships with a set of vocabulary, and this set of vocabulary is called a domain ontology. Domain ontologies are generally built manually by domain experts and match the way humans think. Partitioning the data by domain ontology therefore meets the requirements of subspace partitioning.
First, professionals provide a general conceptual framework for the science and technology management field based on their own understanding of the domain knowledge, and these concepts are analyzed and synthesized. The core technique is to divide things into different themes whose theme spaces are orthogonal, that is, a change to the terms of one theme does not affect another theme space. Categories are then divided within each theme space, and during classification things are described by category according to each theme space.
Second, domain concepts are assembled, which includes collecting candidate concepts and screening domain concepts. Candidate concepts come from the various documents produced in the course of science and technology management and from related literature. According to certain screening rules, the terms that best represent domain concepts are filtered out of the candidate set, and the theme space to which each concept belongs is determined.
Next, the domain concepts are organized. This is the key step in building the domain ontology and includes organizing hierarchical relationships and organizing domain relationships. Hierarchical relationships are organized according to the principle of high cohesion and low coupling, while drawing on existing subject thesauri and classification methods.
Finally, the draft domain ontology is revised, evaluated and formalized (described in RDFS). Professionals are asked to revise and evaluate the draft to ensure that the ontology is correct and logical. Each completed ontology serves as an entity in the model, and the association relationships between entities, such as transitive relationships and inheritance relationships, can be determined from the attribute information of each entity. Thus, by first setting the ontologies and their ontology attributes, then determining the scientific and technological data according to those attributes and building the entities on that basis, the data covered by the constructed entities is more comprehensive and accurate, which benefits the later search for target scientific and technological data.
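The path from ontologies to entities and their associations might be sketched as follows, for illustration only. The text does not specify how data is matched to ontology attributes or how associations are derived from attribute information; this sketch assumes that a record is an instance of an ontology when it carries all of the ontology's attributes, and that an attribute whose type names another entity implies an association. All names and records are hypothetical.

```python
from dataclasses import dataclass, field


@dataclass
class Entity:
    name: str
    attributes: dict                      # attribute name -> type or entity name
    instances: list = field(default_factory=list)


def build_concept_model(ontologies, records):
    entities = {}
    for name, attributes in ontologies.items():
        # Records carrying every ontology attribute become its instances.
        matches = [r for r in records if set(attributes) <= set(r)]
        entities[name] = Entity(name, dict(attributes), matches)
    # Assumption: an attribute typed as another entity implies an association.
    associations = {
        (entity.name, target)
        for entity in entities.values()
        for target in entity.attributes.values()
        if target in entities and target != entity.name
    }
    return entities, associations


ontologies = {
    "Person": {"name": "str", "institution": "str"},
    "Project": {"title": "str", "leader": "Person"},
}
records = [
    {"name": "Li", "institution": "Institute A"},
    {"name": "Wang", "institution": "Institute B"},
    {"title": "Graphene battery", "leader": "Li"},
    {"title": "Marine sensor", "leader": "Wang"},
]
entities, associations = build_concept_model(ontologies, records)
```

Each entity here ends up with two instances, matching the requirement that an entity corresponds to at least two items of scientific and technological data.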
In one embodiment of the present invention, verifying the association relationships between the entities according to the attribute information of each entity comprises:
performing normalization and denormalization on the at least two items of scientific and technological data corresponding to each entity, according to the attribute information of that entity;
determining the association relationships between the items of scientific and technological data according to the processed data.
An entity's attributes are described by attribute words. An attribute word is the linguistic mirror of an attribute and denotes a structure or feature that an entity possesses. An attribute word may at the same time be an entity word: for example, "head" is a part of a "person" and thus an attribute of the person, but it is also an entity with attributes of its own. An attribute word may also denote a purely simple, non-physical attribute: a person's "age", for example, is not a physical thing. In a programming language, an attribute word corresponds to a member variable defined inside a class, whose data type may be another class (an entity) or a basic data type (a simple attribute). An attribute is a structure or feature of an entity; that is, an attribute may be another entity that forms part of the current entity, or it may be some feature of the current entity. From a grammatical point of view, an attribute word, whether it denotes a structural part or a feature of an entity, should be a nominal syntactic unit. Attribute words are found to occur mainly as nouns or noun phrases, with some also occurring as verbs or verb-object constructions.
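The correspondence between attribute words and class member variables can be illustrated with a short sketch. The class names, field names, and the helper `attribute_kinds` are hypothetical and chosen only to mirror the "head" and "age" example in the text.

```python
from dataclasses import dataclass


@dataclass
class Head:
    """'head' is an attribute of a person, but also an entity with its own attributes."""
    hair_color: str


@dataclass
class Person:
    name: str
    head: Head  # entity-valued attribute: the member's type is another class
    age: int    # simple attribute: a basic data type, not a physical thing


def attribute_kinds(obj):
    """Label each member variable as 'simple' or 'entity' (illustrative helper)."""
    simple_types = (int, float, str, bool)
    return {
        name: "simple" if isinstance(value, simple_types) else "entity"
        for name, value in vars(obj).items()
    }


person = Person(name="Zhang", head=Head(hair_color="black"), age=30)
kinds = attribute_kinds(person)
```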
A normal form is "the set of relation schemas that satisfy a certain level, indicating how well the dependencies among the attributes within a relation have been rationalized". Roughly speaking, it is the level of a design standard that the table structure of a data table satisfies. First normal form (1NF): if the domains of all attributes in relation R are simple domains, then the relation schema R is in first normal form. In practice: 1. there is a primary key; 2. the primary key cannot be null; 3. the primary key cannot repeat; 4. fields cannot be subdivided further. Second normal form (2NF): if the relation schema R is in first normal form and no non-prime attribute is partially dependent on the primary key, R is in second normal form. The main task of second normal form is to eliminate partial functional dependencies on the premise that first normal form is satisfied. Third normal form (3NF): no non-prime attribute is transitively or partially dependent on the key. For example, a transitive dependency of a non-prime attribute on the key exists in R(A, B, C) when A is the key and A --> B, B --> C. Put more simply, 1NF: columns cannot be subdivided; 2NF: every table has a primary key; 3NF: associated tables are connected by foreign keys. Data entities and attributes are divided according to the normal-form requirements.
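The R(A, B, C) example can be checked mechanically. The sketch below uses the standard attribute-closure algorithm to find transitive dependencies on the key; the function names are hypothetical, and it assumes a relation with a single candidate key, which is a simplification of the general 3NF test.

```python
def closure(attrs, fds):
    """Return every attribute functionally determined by `attrs`.

    `fds` is a list of (left, right) pairs of attribute sets, read as left -> right.
    """
    result = set(attrs)
    changed = True
    while changed:
        changed = False
        for left, right in fds:
            if left <= result and not right <= result:
                result |= right
                changed = True
    return result


def transitive_dependencies(key, fds):
    """List (via, attr) pairs where key -> via -> attr and `via` is not a key."""
    reachable = closure(key, fds)
    found = []
    for left, right in fds:
        is_part_of_key = left <= key
        is_superkey = key <= closure(left, fds)
        if is_part_of_key or is_superkey or not left <= reachable:
            continue
        for attr in sorted(right - key - left):
            found.append((frozenset(left), attr))
    return found


# R(A, B, C) with key A and dependencies A -> B, B -> C.
fds = [({"A"}, {"B"}), ({"B"}, {"C"})]
violations = transitive_dependencies({"A"}, fds)

# Decomposing into R1(A, B) and R2(B, C) removes the transitive dependency.
r1 = transitive_dependencies({"A"}, [({"A"}, {"B"})])
r2 = transitive_dependencies({"B"}, [({"B"}, {"C"})])
```

Here `violations` reports that C depends on the key only through B, while the two decomposed relations report none.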
Denormalization is the process of improving database read performance by adding redundant data or by grouping data. In some cases, denormalization helps compensate for inefficiencies in relational database software. Even a well-tuned normalized relational database often incurs a heavy access load.
A normalized database design stores different but related pieces of information in separate logical tables. If these tables are also stored physically apart, a query that draws on several tables (for example, a JOIN operation) may be very slow.
The common approach is to denormalize the data design. This can improve query response time, but it is then the database designer, rather than the DBMS, who must ensure data consistency. The designer does so by creating rules in the database, called constraints. This increases the logical complexity of the database design, and the added constraints bring complexity of their own, which makes the approach risky. In addition, while constraints speed up reads (SELECT), they slow down writes (INSERT, UPDATE, and DELETE). A denormalized database may therefore have worse write performance than its normalized counterpart.
A denormalized data model is not the same as a data model that has never been normalized. Denormalization should be carried out only after normalization has reached a satisfactory level and all required constraints and rules have been established, for example, once all relations are in third normal form and relations with join dependencies and multivalued dependencies have been handled properly. Common denormalization techniques include duplicating attributes, merging and splitting entities, and repeating entities in the runtime environment.
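The trade-off between a normalized design and a denormalized one can be seen in a small SQLite sketch. The table and column names are hypothetical, and a trigger stands in here for the "constraints" mentioned in the text; it is one possible way for the designer to keep the redundant column consistent, not the method prescribed by the source.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
    -- Normalized design: related facts live in separate tables.
    CREATE TABLE researcher (id INTEGER PRIMARY KEY, name TEXT NOT NULL);
    CREATE TABLE project (
        id INTEGER PRIMARY KEY,
        title TEXT NOT NULL,
        leader_id INTEGER NOT NULL REFERENCES researcher(id)
    );
    INSERT INTO researcher VALUES (1, 'Li'), (2, 'Wang');
    INSERT INTO project VALUES (10, 'Project A', 1), (11, 'Project B', 2);

    -- Denormalized copy: the leader's name is duplicated to avoid the JOIN.
    CREATE TABLE project_flat (
        id INTEGER PRIMARY KEY,
        title TEXT NOT NULL,
        leader_id INTEGER NOT NULL,
        leader_name TEXT NOT NULL
    );
    INSERT INTO project_flat
        SELECT p.id, p.title, r.id, r.name
        FROM project p JOIN researcher r ON r.id = p.leader_id;

    -- The designer, not the DBMS, must now keep the redundant column consistent.
    CREATE TRIGGER sync_leader_name AFTER UPDATE OF name ON researcher
    BEGIN
        UPDATE project_flat SET leader_name = NEW.name WHERE leader_id = NEW.id;
    END;
""")

# Read path 1: normalized tables need a JOIN.
joined = conn.execute(
    "SELECT p.title, r.name FROM project p "
    "JOIN researcher r ON r.id = p.leader_id ORDER BY p.id"
).fetchall()

# Read path 2: the denormalized table answers the same question without a JOIN.
flat = conn.execute(
    "SELECT title, leader_name FROM project_flat ORDER BY id"
).fetchall()
```

Every update to `researcher.name` now also writes to `project_flat`, which is the extra write cost that the text attributes to denormalized designs.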
In summary, the conceptual data model establishes a set of data collections describing the scientific and technological management domain ontology, together with a metadata model describing services and files. It supports the cataloguing of database data-set resources and provides services with a general ontology data model and specification for the scientific and technological domain, offering support in data governance, data mining, and data planning. The model is also open: users can continuously improve and supplement its data content, so that the data enriches itself and provides a good basis for data mining and knowledge discovery. The model is applicable not only to the scientific and technological domain but can also serve as a reference for building data centers in other industries.
In one embodiment of the invention, a specific implementation of step 101 may include: grouping the pieces of scientific and technological data according to the association relations between them to obtain at least two data groups, where each data group includes at least one piece of scientific and technological data; and building the conceptual data model from the at least two data groups.
A specific implementation of step 103 may include: determining, from the conceptual data model, at least one data group corresponding to the query keyword; and determining the reference scientific and technological data from the at least one data group so determined.
In this embodiment, once the association relations between the pieces of scientific and technological data have been determined, the data is classified, that is, grouped, according to those relations. Grouping makes the data easier to look up. During grouping, each association relation can be enumerated and the data grouped according to the enumerated relations, which helps classify the data comprehensively and accurately. For example, a scientific and technological project has research staff who take part in it and auditors who review it. If the research staff and the auditors are placed in the same data group, then a lookup by the name of any person in the group, or by the title of the project, finds every associated piece of data, which further improves lookup efficiency.
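The grouping and group-based lookup described above can be sketched as follows. This is a minimal illustration, assuming that data items are plain strings, that association relations are given as pairs, and that a keyword matches an item by substring; the function names and sample data are hypothetical.

```python
from collections import defaultdict


def group_by_association(items, relations):
    """Group items so that associated items share a group (union-find)."""
    parent = {item: item for item in items}

    def find(x):
        while parent[x] != x:
            parent[x] = parent[parent[x]]  # path halving
            x = parent[x]
        return x

    for a, b in relations:
        parent[find(a)] = find(b)

    groups = defaultdict(set)
    for item in items:
        groups[find(item)].add(item)
    return list(groups.values())


def query(groups, keyword):
    """Return all items of every group that contains an item matching the keyword."""
    hits = set()
    for group in groups:
        if any(keyword in item for item in group):
            hits |= group
    return hits


# A project, its research staff, and its auditor end up in the same group.
items = ["project:Alpha", "researcher:Li", "auditor:Zhao",
         "project:Beta", "researcher:Wang"]
relations = [
    ("project:Alpha", "researcher:Li"),
    ("project:Alpha", "auditor:Zhao"),
    ("project:Beta", "researcher:Wang"),
]
groups = group_by_association(items, relations)
alpha_data = query(groups, "Li")
```

Looking up a single person's name is enough to retrieve the whole group, so the query never has to scan unrelated groups item by item once a matching group is found.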
As shown in Fig. 2, an embodiment of the invention provides a management device for scientific and technological data, including: a construction unit 201, a receiving unit 202, and a query unit 203, wherein:
the construction unit 201 is configured to build a conceptual data model, the conceptual data model including at least two pieces of scientific and technological data and the association relations between the pieces of scientific and technological data;
the receiving unit 202 is configured to receive a query keyword;
the query unit 203 is configured to determine, according to the query keyword, reference scientific and technological data corresponding to the query keyword from the conceptual data model; to determine, according to the association relations corresponding to the reference scientific and technological data, target scientific and technological data corresponding to the query keyword; and to output the target scientific and technological data.
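The division of work among the three units can be sketched as three small classes. This is an illustrative assumption rather than the device's actual design: the model is represented as an adjacency map, keyword matching is by substring, and the target data is taken to be everything reachable from the reference data through association relations.

```python
from collections import deque


class ConstructionUnit:
    """Builds the model: data items plus undirected association relations."""

    def build(self, items, relations):
        model = {item: set() for item in items}
        for a, b in relations:
            model[a].add(b)
            model[b].add(a)
        return model


class ReceivingUnit:
    """Receives the query keyword and strips surrounding whitespace."""

    def receive(self, raw):
        return raw.strip()


class QueryUnit:
    """Finds reference data by keyword, then follows associations to the targets."""

    def query(self, model, keyword):
        reference = [item for item in model if keyword in item]
        seen = set(reference)
        queue = deque(reference)
        while queue:
            current = queue.popleft()
            for neighbor in model[current]:
                if neighbor not in seen:
                    seen.add(neighbor)
                    queue.append(neighbor)
        return seen


model = ConstructionUnit().build(
    ["project:Alpha", "researcher:Li", "auditor:Zhao", "project:Beta"],
    [("project:Alpha", "researcher:Li"), ("project:Alpha", "auditor:Zhao")],
)
keyword = ReceivingUnit().receive("  Zhao ")
targets = QueryUnit().query(model, keyword)
```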
In one embodiment of the invention, the construction unit 201 is configured to: build a conceptual model, the conceptual model including at least two entities, the attribute information corresponding to each entity, and the association relations between the entities, where each entity corresponds to at least two pieces of scientific and technological data; verify the association relations between the entities according to the attribute information of each entity, and build a logical model according to the verification result; and build the conceptual data model according to the conceptual model and the logical model so built.
In one embodiment of the invention, the construction unit 201 is configured to: build a physical model, the physical model including a preset data output mode; determine the concept-logic mapping relation between the conceptual model and the logical model, and the concept-physical mapping relation between the conceptual model and the physical model; and build the conceptual data model according to the determined concept-logic mapping relation and concept-physical mapping relation, together with the conceptual model, the logical model, and the physical model.
In one embodiment of the invention, the construction unit 201 is configured to: determine at least two ontologies and the ontology attributes corresponding to each ontology; for each ontology, determine at least two pieces of scientific and technological data corresponding to the ontology attributes, and take the determined data as the instances corresponding to that ontology; determine the entities in the conceptual model according to the determined instances, and take the ontology attributes corresponding to each ontology as the attribute information of the corresponding entity; and determine the association relations between the entities according to the attribute information corresponding to each entity.
In one embodiment of the invention, the construction unit 201 is configured to: perform normalization and denormalization processing on the at least two pieces of scientific and technological data corresponding to each entity, according to the attribute information of that entity; and determine the association relations between the pieces of scientific and technological data according to the processed data.
Because the information exchange between the units of the above device and their execution processes are based on the same concept as the method embodiments of the invention, the details can be found in the description of the method embodiments and are not repeated here.
An embodiment of the invention further provides a computer-readable medium including execution instructions. When a processor of a storage controller executes the execution instructions, the storage controller performs the method provided by any of the above embodiments of the invention.
An embodiment of the invention further provides a storage controller, including a processor, a memory, and a bus. The memory is used to store execution instructions, and the processor is connected to the memory through the bus. When the storage controller runs, the processor executes the execution instructions stored in the memory, so that the storage controller performs the method provided by any of the above embodiments of the invention.
In summary, the above embodiments of the invention have at least the following advantages:
1. In the embodiments of the invention, a conceptual data model is built in advance. When a query keyword is received, reference scientific and technological data corresponding to the query keyword is determined from the conceptual data model, and the target scientific and technological data corresponding to the query keyword is then determined according to the association relations corresponding to the reference data. Because the conceptual data model includes the association relations between the pieces of scientific and technological data, it suffices to determine any one piece of reference data corresponding to the query keyword; all target data can then be determined through the association relations, without traversing a large number of data samples, which improves the efficiency of data lookup.
2. In the embodiments of the invention, a conceptual model, a logical model, and a physical model are built; the concept-logic mapping relation between the conceptual model and the logical model and the concept-physical mapping relation between the conceptual model and the physical model are determined; and the conceptual data model is built according to the determined mapping relations together with the conceptual model, the logical model, and the physical model. The resulting conceptual data model is more accurate and practical, which further improves the efficiency of data lookup.
3. In the embodiments of the invention, ontologies and ontology attributes are determined, the corresponding scientific and technological data is taken as instances of each ontology according to its ontology attributes, and the entities in the conceptual model are built on that basis. The scientific and technological data covered by the constructed entities is therefore more comprehensive and accurate, which facilitates later lookup of the target scientific and technological data.
It should be noted that herein, such as first and second etc relational terms are used merely to an entity
Or operation makes a distinction with another entity or operation, and not necessarily require or imply and exist between these entities or operation
Any this actual relation or order.Moreover, term " comprising ", "comprising" or its any other variant be intended to it is non-
It is exclusive to include, so that process, method, article or equipment including a series of elements not only include those key elements,
But also the other element including being not expressly set out, or also include solid by this process, method, article or equipment
Some key elements.In the absence of more restrictions, the key element limited by sentence " including one ", is not arranged
Except other identical factor in the process including the key element, method, article or equipment being also present.
A person of ordinary skill in the art will understand that all or some of the steps of the above method embodiments can be carried out by hardware under the control of program instructions. The program can be stored in a computer-readable storage medium and, when executed, performs the steps of the above method embodiments. The storage medium includes any medium that can store program code, such as a ROM, a RAM, a magnetic disk, or an optical disc.
Finally, it should be noted that the above are only preferred embodiments of the invention, intended merely to illustrate its technical solution and not to limit its scope of protection. Any modification, equivalent substitution, or improvement made within the spirit and principles of the invention falls within the scope of protection of the invention.
Claims (10)
- 1. A management method for scientific and technological data, characterized by comprising: building a conceptual data model, the conceptual data model including at least two pieces of scientific and technological data and the association relations between the pieces of scientific and technological data; and further comprising: receiving a query keyword; determining, according to the query keyword, reference scientific and technological data corresponding to the query keyword from the conceptual data model; and determining, according to the association relations corresponding to the reference scientific and technological data, target scientific and technological data corresponding to the query keyword, and outputting the target scientific and technological data.
- 2. The management method according to claim 1, characterized in that building the conceptual data model comprises: building a conceptual model, the conceptual model including at least two entities, the attribute information corresponding to each entity, and the association relations between the entities, wherein each entity corresponds to at least two pieces of scientific and technological data; verifying the association relations between the entities according to the attribute information of each entity, and building a logical model according to the verification result; and building the conceptual data model according to the conceptual model and the logical model so built.
- 3. The management method according to claim 2, characterized in that, after building the logical model, the method further comprises: building a physical model, the physical model including a preset data output mode; and building the conceptual data model according to the conceptual model and the logical model so built comprises: determining the concept-logic mapping relation between the conceptual model and the logical model, and the concept-physical mapping relation between the conceptual model and the physical model; and building the conceptual data model according to the determined concept-logic mapping relation and concept-physical mapping relation, together with the conceptual model, the logical model, and the physical model.
- 4. The management method according to claim 2, characterized in that building the conceptual model comprises: determining at least two ontologies and the ontology attributes corresponding to each ontology; for each ontology, determining at least two pieces of scientific and technological data corresponding to the ontology attributes, and taking the determined data as the instances corresponding to that ontology; determining the entities in the conceptual model according to the determined instances, and taking the ontology attributes corresponding to each ontology as the attribute information of the corresponding entity; and determining the association relations between the entities according to the attribute information corresponding to each entity.
- 5. The management method according to claim 4, characterized in that verifying the association relations between the entities according to the attribute information of each entity comprises: performing normalization and denormalization processing on the at least two pieces of scientific and technological data corresponding to each entity, according to the attribute information of that entity; and determining the association relations between the pieces of scientific and technological data according to the processed data; and/or building the conceptual data model comprises: grouping the pieces of scientific and technological data according to the association relations between them to obtain at least two data groups, wherein each data group includes at least one piece of scientific and technological data; and building the conceptual data model from the at least two data groups; and determining, according to the query keyword, the reference scientific and technological data corresponding to the query keyword from the conceptual data model comprises: determining, from the conceptual data model, at least one data group corresponding to the query keyword; and determining the reference scientific and technological data from the at least one data group so determined.
- 6. A management device for scientific and technological data, characterized by comprising: a construction unit, a receiving unit, and a query unit; wherein the construction unit is configured to build a conceptual data model, the conceptual data model including at least two pieces of scientific and technological data and the association relations between the pieces of scientific and technological data; the receiving unit is configured to receive a query keyword; and the query unit is configured to determine, according to the query keyword, reference scientific and technological data corresponding to the query keyword from the conceptual data model, to determine, according to the association relations corresponding to the reference scientific and technological data, target scientific and technological data corresponding to the query keyword, and to output the target scientific and technological data.
- 7. The device according to claim 6, characterized in that the construction unit is configured to: build a conceptual model, the conceptual model including at least two entities, the attribute information corresponding to each entity, and the association relations between the entities, wherein each entity corresponds to at least two pieces of scientific and technological data; verify the association relations between the entities according to the attribute information of each entity, and build a logical model according to the verification result; and build the conceptual data model according to the conceptual model and the logical model so built.
- 8. The device according to claim 7, characterized in that the construction unit is configured to: build a physical model, the physical model including a preset data output mode; determine the concept-logic mapping relation between the conceptual model and the logical model, and the concept-physical mapping relation between the conceptual model and the physical model; and build the conceptual data model according to the determined concept-logic mapping relation and concept-physical mapping relation, together with the conceptual model, the logical model, and the physical model.
- 9. The device according to claim 7, characterized in that the construction unit is configured to: determine at least two ontologies and the ontology attributes corresponding to each ontology; for each ontology, determine at least two pieces of scientific and technological data corresponding to the ontology attributes, and take the determined data as the instances corresponding to that ontology; determine the entities in the conceptual model according to the determined instances, and take the ontology attributes corresponding to each ontology as the attribute information of the corresponding entity; and determine the association relations between the entities according to the attribute information corresponding to each entity.
- 10. The device according to claim 9, characterized in that the construction unit is configured to: perform normalization and denormalization processing on the at least two pieces of scientific and technological data corresponding to each entity, according to the attribute information of that entity; and determine the association relations between the pieces of scientific and technological data according to the processed data.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201711043893.8A CN107766545A (en) | 2017-10-31 | 2017-10-31 | Scientific and technological data management method and device |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201711043893.8A CN107766545A (en) | 2017-10-31 | 2017-10-31 | Scientific and technological data management method and device |
Publications (1)
Publication Number | Publication Date |
---|---|
CN107766545A (en) | 2018-03-06 |
Family ID: 61271236
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201711043893.8A Pending CN107766545A (en) | 2017-10-31 | 2017-10-31 | Scientific and technological data management method and device |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN107766545A (en) |
Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN102495860A (en) * | 2011-11-22 | 2012-06-13 | 北京大学 | Expert recommendation method based on language model |
CN103366735A (en) * | 2012-03-29 | 2013-10-23 | 北京中传天籁数字技术有限公司 | A voice data mapping method and apparatus |
CN104424310A (en) * | 2013-09-06 | 2015-03-18 | 中国海洋大学 | Ontology-based smart home semantic query method and ontology-based smart home semantic query device |
US20170024493A1 (en) * | 2015-07-23 | 2017-01-26 | Autodesk, Inc. | System-level approach to goal-driven design |
Cited By (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN112115235A (en) * | 2020-09-28 | 2020-12-22 | 中国建设银行股份有限公司 | Entity attribute data query and configuration method, device and server |
CN113191145A (en) * | 2021-05-21 | 2021-07-30 | 百度在线网络技术(北京)有限公司 | Keyword processing method and device, electronic equipment and medium |
CN113191145B (en) * | 2021-05-21 | 2023-08-11 | 百度在线网络技术(北京)有限公司 | Keyword processing method and device, electronic equipment and medium |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US20210407033A1 (en) | Patent mapping | |
Kharlamov et al. | Ontology based access to exploration data at statoil | |
CN109255031A (en) | The data processing method of knowledge based map | |
Rinaldi et al. | A matching framework for multimedia data integration using semantics and ontologies | |
Jayaram et al. | A review: Information extraction techniques from research papers | |
Wątróbski | Ontology learning methods from text-an extensive knowledge-based approach | |
Ángel et al. | Automated modelling assistance by integrating heterogeneous information sources | |
CN115438199A (en) | Knowledge platform system based on smart city scene data middling platform technology | |
WO2006015110A2 (en) | Patent mapping | |
CN107766545A (en) | Scientific and technological data management method and device | |
Yang et al. | User story clustering in agile development: a framework and an empirical study | |
Wang et al. | Normalized Storage Model Construction and Query Optimization of Book Multi-Source Heterogeneous Massive Data | |
Hu et al. | A classification model of power operation inspection defect texts based on graph convolutional network | |
Vogt et al. | Towards a Rosetta Stone for (meta) data: Learning from natural language to improve semantic and cognitive interoperability | |
CN113127650A (en) | Technical map construction method and system based on map database | |
Tang et al. | Ontology-based semantic retrieval for education management systems | |
CN110046163A (en) | A kind of data retrieval method and system | |
Khazraee et al. | Demystifying ontology | |
Delemazure | A Knowledge Base of Mathematical Results | |
Qiu et al. | An architecture for cell-centric indexing of datasets | |
Laadidi et al. | Simplification of owl ontology sources for data warehousing | |
Kalampokis et al. | Towards interoperable open statistical data | |
KR102605931B1 (en) | Method for processing structured data and unstructured data on a plurality of databases and data processing platform providing the method | |
Leshcheva et al. | Towards a method of ontology population from heterogeneous sources of structured data | |
Hasan et al. | An extensible digital library service to support network science |
Legal Events
Code | Title | Description |
---|---|---|
PB01 | Publication | |
SE01 | Entry into force of request for substantive examination | |
RJ01 | Rejection of invention patent application after publication | Application publication date: 20180306 |