CN107766545A - Scientific and technological data management method and device - Google Patents

Scientific and technological data management method and device

Publication number: CN107766545A
Authority: CN (China)
Prior art keywords: model, data, science, conceptual, entity
Legal status: Pending (the legal status is an assumption and is not a legal conclusion)
Application number: CN201711043893.8A
Other languages: Chinese (zh)
Inventors: 曲翠钰, 王乐, 石园
Current Assignee: Inspur Software Group Co Ltd (the listed assignees may be inaccurate)
Original Assignee: Inspur Software Group Co Ltd
Application filed by: Inspur Software Group Co Ltd
Priority: CN201711043893.8A

Landscapes

  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

The invention provides a method and a device for managing scientific and technological data. The method comprises the following steps: building a conceptual data model, the conceptual data model comprising at least two items of scientific and technological data and the association relationships between them; receiving a query keyword; determining, according to the query keyword, reference scientific and technological data corresponding to the query keyword from the conceptual data model; and determining, according to the association relationships of the reference scientific and technological data, target scientific and technological data corresponding to the query keyword, and outputting the target scientific and technological data. This technical scheme can improve the efficiency of searching scientific and technological data.

Description

A method and device for managing scientific and technological data
Technical field
The present invention relates to the field of computer technology, and in particular to a method and device for managing scientific and technological data.
Background art
With the development of information technology, the volume of scientific and technological data keeps growing and its sources are increasingly varied. How to find the correct scientific and technological data quickly and accurately has become a problem.
Traditional data integration schemes merely bring together, logically or physically, data of different sources, formats, and characteristics. When such a scheme is used to look up scientific and technological data, a large number of data samples must be traversed before the correct data is found, so the search is inefficient.
Summary of the invention
Embodiments of the present invention provide a method and device for managing scientific and technological data that can improve the efficiency of data lookup.
In a first aspect, an embodiment of the present invention provides a method for managing scientific and technological data, comprising:
building a conceptual data model, the conceptual data model comprising at least two items of scientific and technological data and the association relationships between the items;
and further comprising:
receiving a query keyword;
determining, according to the query keyword, reference scientific and technological data corresponding to the query keyword from the conceptual data model;
determining, according to the association relationships of the reference scientific and technological data, target scientific and technological data corresponding to the query keyword, and outputting the target scientific and technological data.
Preferably,
building the conceptual data model comprises:
building a conceptual model, the conceptual model comprising at least two entities, the attribute information of each entity, and the association relationships between the entities, wherein each entity corresponds to at least two items of scientific and technological data;
verifying the association relationships between the entities according to the attribute information of each entity, and building a logical model according to the verification result;
building the conceptual data model according to the conceptual model and the logical model.
Preferably,
after building the logical model, the method further comprises:
building a physical model, the physical model comprising a preset data output mode;
and building the conceptual data model according to the conceptual model and the logical model comprises:
determining a concept-logic mapping between the conceptual model and the logical model, and a concept-physical mapping between the conceptual model and the physical model;
building the conceptual data model according to the determined concept-logic mapping and concept-physical mapping, together with the conceptual model, the logical model, and the physical model.
Preferably,
building the conceptual model comprises:
determining at least two ontologies and the ontology attributes of each ontology;
for each ontology: determining at least two items of scientific and technological data corresponding to its ontology attributes, and taking the determined items as instances of that ontology;
determining the entities in the conceptual model according to the determined instances, and taking the ontology attributes of each ontology as the attribute information of the corresponding entity;
determining the association relationships between the entities according to the attribute information of each entity.
Preferably,
verifying the association relationships between the entities according to the attribute information of each entity comprises:
normalizing and denormalizing the at least two items of scientific and technological data corresponding to each entity, according to the attribute information of that entity;
determining the association relationships between the items of scientific and technological data according to the processed data.
Preferably,
building the conceptual data model comprises:
grouping the items of scientific and technological data according to the association relationships between them to obtain at least two data groups, wherein each data group contains at least one item of scientific and technological data;
building the conceptual data model according to the at least two data groups;
and determining, according to the query keyword, the reference scientific and technological data corresponding to the query keyword from the conceptual data model comprises:
determining at least one data group corresponding to the query keyword from the conceptual data model;
determining the reference scientific and technological data from the at least one determined data group.
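The grouping variant can be illustrated with a minimal Python sketch. It assumes, purely for illustration, that associated items are placed in the same group (connected components) and that an item matches a keyword when the keyword occurs in its title; the function names and sample titles are hypothetical and do not come from the patent.

```python
from collections import defaultdict


def group_by_association(items, associations):
    """Put associated items in the same group (connected components)."""
    neighbours = defaultdict(set)
    for a, b in associations:
        neighbours[a].add(b)
        neighbours[b].add(a)
    groups, seen = [], set()
    for item in items:
        if item in seen:
            continue
        stack, group = [item], set()
        while stack:
            current = stack.pop()
            if current in group:
                continue
            group.add(current)
            stack.extend(neighbours[current] - group)
        seen |= group
        groups.append(group)
    return groups


def find_reference(groups, keyword):
    """Select the groups that match the keyword, then one reference item per group."""
    hits = []
    for group in groups:
        matching = sorted(item for item in group if keyword in item)
        if matching:
            hits.append((matching[0], group))
    return hits


# Hypothetical sample data.
items = ["solar project plan", "solar project review",
         "grid acceptance report", "wind project plan"]
associations = [("solar project plan", "solar project review")]

groups = group_by_association(items, associations)
hits = find_reference(groups, "solar")
```

In this sketch an unassociated item forms a group of its own, which is consistent with each group containing at least one item.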
In a second aspect, an embodiment of the present invention provides a device for managing scientific and technological data, comprising a construction unit, a receiving unit, and a query unit, wherein:
the construction unit is configured to build a conceptual data model, the conceptual data model comprising at least two items of scientific and technological data and the association relationships between the items;
the receiving unit is configured to receive a query keyword;
the query unit is configured to determine, according to the query keyword, reference scientific and technological data corresponding to the query keyword from the conceptual data model, to determine, according to the association relationships of the reference scientific and technological data, target scientific and technological data corresponding to the query keyword, and to output the target scientific and technological data.
Preferably,
the construction unit is configured to build a conceptual model, the conceptual model comprising at least two entities, the attribute information of each entity, and the association relationships between the entities, wherein each entity corresponds to at least two items of scientific and technological data; to verify the association relationships between the entities according to the attribute information of each entity and build a logical model according to the verification result; and to build the conceptual data model according to the conceptual model and the logical model.
Preferably,
the construction unit is configured to build a physical model, the physical model comprising a preset data output mode; to determine a concept-logic mapping between the conceptual model and the logical model, and a concept-physical mapping between the conceptual model and the physical model; and to build the conceptual data model according to the determined concept-logic mapping and concept-physical mapping, together with the conceptual model, the logical model, and the physical model.
Preferably,
the construction unit is configured to determine at least two ontologies and the ontology attributes of each ontology; for each ontology, to determine at least two items of scientific and technological data corresponding to its ontology attributes and take the determined items as instances of that ontology; to determine the entities in the conceptual model according to the determined instances, taking the ontology attributes of each ontology as the attribute information of the corresponding entity; and to determine the association relationships between the entities according to the attribute information of each entity.
Preferably,
the construction unit is configured to normalize and denormalize the at least two items of scientific and technological data corresponding to each entity, according to the attribute information of that entity, and to determine the association relationships between the items of scientific and technological data according to the processed data.
Embodiments of the present invention provide a method and device for managing scientific and technological data. A conceptual data model is built in advance; when a query keyword is received, reference scientific and technological data corresponding to the keyword is determined from the conceptual data model, and target scientific and technological data corresponding to the keyword is determined according to the association relationships of the reference data. Because the conceptual data model contains the association relationships between the items of scientific and technological data, it is enough to determine any one item of reference data corresponding to the query keyword; all the target data can then be determined through the association relationships, without traversing a large number of data samples, which improves the efficiency of data lookup.
Brief description of the drawings
To explain the technical solutions of the embodiments of the present invention or of the prior art more clearly, the drawings needed for describing them are briefly introduced below. The drawings described below show only some embodiments of the present invention; a person of ordinary skill in the art can derive other drawings from them without creative effort.
Fig. 1 is a flow chart of a method for managing scientific and technological data provided by an embodiment of the present invention;
Fig. 2 is a schematic structural diagram of a device for managing scientific and technological data provided by an embodiment of the present invention.
Detailed description of the embodiments
To make the purpose, technical solutions, and advantages of the embodiments of the present invention clearer, the technical solutions in the embodiments are described clearly and completely below with reference to the accompanying drawings. The described embodiments are evidently only some, not all, of the embodiments of the present invention. All other embodiments obtained by a person of ordinary skill in the art on the basis of these embodiments without creative effort fall within the scope of protection of the present invention.
As shown in Fig. 1, an embodiment of the present invention provides a method for managing scientific and technological data, which may include the following steps:
Step 101: build a conceptual data model, the conceptual data model comprising at least two items of scientific and technological data and the association relationships between the items;
Step 102: receive a query keyword;
Step 103: according to the query keyword, determine reference scientific and technological data corresponding to the query keyword from the conceptual data model;
Step 104: according to the association relationships of the reference scientific and technological data, determine target scientific and technological data corresponding to the query keyword, and output the target scientific and technological data.
In the above embodiment, a conceptual data model is built in advance; when a query keyword is received, reference scientific and technological data corresponding to the keyword is determined from the conceptual data model, and target scientific and technological data corresponding to the keyword is determined according to the association relationships of the reference data. Because the conceptual data model contains the association relationships between the items of scientific and technological data, it is enough to determine any one item of reference data corresponding to the query keyword; all the target data can then be determined through the association relationships, without traversing a large number of data samples, which improves the efficiency of data lookup.
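Steps 101 to 104 can be sketched in Python as follows. This is a minimal illustration under stated assumptions, not the patented implementation: the class name `ScienceDataModel`, the word-level keyword index, and the choice to follow associations transitively are all hypothetical.

```python
class ScienceDataModel:
    """Step 101: items plus the association relationships between them."""

    def __init__(self, items, associations):
        self.links = {item: set() for item in items}
        for a, b in associations:
            self.links[a].add(b)
            self.links[b].add(a)
        # Assumption: a word index maps each keyword to one reference item,
        # so a query does not have to scan every item.
        self.index = {}
        for item in items:
            for word in item.lower().split():
                self.index.setdefault(word, item)

    def query(self, keyword):
        # Steps 102 and 103: receive the keyword, pick any one reference item.
        reference = self.index.get(keyword.lower())
        if reference is None:
            return set()
        # Step 104: collect the target items by following associations.
        targets, frontier = {reference}, [reference]
        while frontier:
            for neighbour in self.links[frontier.pop()]:
                if neighbour not in targets:
                    targets.add(neighbour)
                    frontier.append(neighbour)
        return targets


# Hypothetical sample data.
model = ScienceDataModel(
    items=["graphene battery project", "graphene battery patent",
           "battery acceptance report", "rice genome project"],
    associations=[("graphene battery project", "graphene battery patent"),
                  ("graphene battery project", "battery acceptance report")],
)
result = model.query("graphene")
```

Only the items reachable from the reference item are visited, which is the point of storing the associations in the model.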
In one embodiment of the invention, step 101 may be implemented as follows:
build a conceptual model, the conceptual model comprising at least two entities, the attribute information of each entity, and the association relationships between the entities, wherein each entity corresponds to at least two items of scientific and technological data;
verify the association relationships between the entities according to the attribute information of each entity, and build a logical model according to the verification result;
build the conceptual data model according to the conceptual model and the logical model.
Here the conceptual data model is a multi-level structure that both stores data conveniently and organizes data efficiently. The conceptual model is the framework of the whole model: a comprehensive description of the logical structure and characteristics of all the data in the data model. The logical model is the data view of the database user, that is, the logical description of user data; a logical model is usually a subset of the conceptual model, and one conceptual model can be described by several logical models. The physical model is the concrete implementation of the database on physical storage and generally has to take the deployment environment into account.
The association relationships between entities are generally of the following kinds: part-of expresses the relationship between a part and the whole; kind-of expresses inheritance between concepts, like the parent-child relationship between classes; instance-of expresses the relationship between an instance of a concept and the concept, like that between an object and its class; attribute-of expresses that one concept is an attribute of another. In addition, several properties of ontology relations can be used for reasoning. Inverse relation (inverse), for example "manages" and "is managed by": if concept B has relation R⁻¹ to concept A whenever concept A has relation R to concept B, R is said to have an inverse, or to be a reciprocal relation. Transitive relation (transitivity): if A has relation R to B and B has relation R to C, then A has relation R to C. Inheritance relation (kind-of): for concepts C and D, write C′ = {x | x is an instance of C} and D′ = {x | x is an instance of D}; if every x that belongs to D′ also belongs to C′, then C is called the parent concept of D and D the child concept of C. Part relation (part-of): the relationship between part and whole, for example concept C is a part of concept D. Instance relation (instance-of): the relationship between an instance and a concept, for example E is an instance of concept C. Attribute relation (attribute-of): one concept is an attribute of another, for example C is an attribute of D.
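How inverse and transitive relations support reasoning can be shown with a small Python sketch. It assumes relations are stored as (subject, relation, object) triples, that kind-of is treated as transitive, and that "manages" and "managed-by" are declared as inverses; these names and the sample facts are illustrative assumptions only.

```python
# Assumed relation properties for this sketch.
INVERSE = {"manages": "managed-by", "managed-by": "manages"}
TRANSITIVE = {"kind-of"}


def infer(triples):
    """Repeatedly apply the inverse and transitivity rules until nothing new appears."""
    facts = set(triples)
    changed = True
    while changed:
        new = set()
        for s, r, o in facts:
            if r in INVERSE:
                # A r B implies B r^-1 A.
                new.add((o, INVERSE[r], s))
            if r in TRANSITIVE:
                # A r B and B r C imply A r C.
                for s2, r2, o2 in facts:
                    if r2 == r and s2 == o:
                        new.add((s, r, o2))
        changed = not new <= facts
        facts |= new
    return facts


# Hypothetical sample facts.
facts = infer([
    ("key project", "kind-of", "science project"),
    ("science project", "kind-of", "project"),
    ("ministry", "manages", "key project"),
])
```

The loop stops as soon as a pass derives no fact that is not already known.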
The logical model expresses the conceptual data model by logical means. Because of the complexity of the business, it is difficult to build the logical model strictly from the conceptual model. During construction, the conceptual model needs to be merged and sorted out, and the entity relationships classified and enumerated, so as to preserve the completeness of the conceptual model as far as possible.
The logical model presents the data sets abstracted in the conceptual model as concrete entities, sets the main relationships between the business entities, enumerates all data items, and makes the model more concretely operable. This stage requires generalized data sets that reflect the relationships between entities.
While the logical model is being built, the data structures need to be verified against business functions or logic. Whether a data structure is appropriate depends on its composition and on whether it can guarantee the execution of the business functions, so the logical data model must be verified by the person responsible for the business. During verification, inappropriate data structures are corrected and inefficient factors in the business process are removed. The relationships between entities are then sorted out, typically N:N relationships, exclusive relationships, recursive relationships, and parent-child relationships, and expressed in an ER diagram. Attribute groups should be used during logical modeling so that the content of each entity can be grasped simply and clearly.
In one embodiment of the invention, after the logical model is built in step 101, the method may further include:
building a physical model, the physical model comprising a preset data output mode;
and building the conceptual data model according to the conceptual model and the logical model comprises:
determining a concept-logic mapping between the conceptual model and the logical model, and a concept-physical mapping between the conceptual model and the physical model;
building the conceptual data model according to the determined concept-logic mapping and concept-physical mapping, together with the conceptual model, the logical model, and the physical model.
In this embodiment the physical model is built first. The physical model specifies how data is output, the physical organization of the data on the storage medium and how records are addressed, and the size of physical storage blocks and the overflow handling method. The mapping between the logical model and the conceptual model defines the correspondence between them: when the global logical structure changes for some reason, only this correspondence needs to be modified, which gives the data logical independence. The mapping between the conceptual model and the physical model defines the correspondence between the logical data and its physical storage: when the physical structure of the data changes, only the corresponding mapping is modified, which gives the data physical independence. Thus, by building the physical, conceptual, and logical models, determining the mappings between them, and building the conceptual data model from those mappings, the resulting conceptual data model is more accurate and practical, which further improves the efficiency of data lookup.
When building the physical model, the system deployment environment and application structure are taken into account, and the data is decomposed or merged into the smallest sets of data units that perform best in the actual implementation. In this application system the data is stored centrally. Building the physical model involves deciding whether data associations are obtained directly or through circular association; how entities in a parent-child relationship are divided; how the sets of entities in an exclusive relationship are set; how recursively associated entities are divided; the unique key of each entity; the data type definitions and value constraints of the attributes; and so on.
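The role of the two mappings can be illustrated with a short Python sketch. It assumes, for illustration only, that the conceptual model names attributes such as `project.id`, that the logical model is a user-facing view with its own labels, and that the physical model is a list of stored rows; all names and sample values are hypothetical.

```python
def read_view(rows, concept_to_logical, concept_to_physical):
    """Build the user's logical view by going through both mappings."""
    return [
        {label: row[concept_to_physical[concept]]
         for concept, label in concept_to_logical.items()}
        for row in rows
    ]


# Concept-logic mapping: conceptual attribute -> label in the user's view.
concept_to_logical = {"project.id": "Project No.", "project.title": "Title"}

# First physical layout and its concept-physical mapping.
rows_v1 = [{"col_01": "P-001", "col_02": "Graphene battery"}]
physical_v1 = {"project.id": "col_01", "project.title": "col_02"}
view_v1 = read_view(rows_v1, concept_to_logical, physical_v1)

# The physical structure changes: only the concept-physical mapping is updated.
rows_v2 = [{"pid": "P-001", "ptitle": "Graphene battery"}]
physical_v2 = {"project.id": "pid", "project.title": "ptitle"}
view_v2 = read_view(rows_v2, concept_to_logical, physical_v2)
```

Both views are identical, which is what physical independence means here: the storage layout changed, but only one mapping had to be modified.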
In one embodiment of the invention, building the conceptual model includes:
determining at least two ontologies and the ontology attributes of each ontology;
for each ontology: determining at least two items of scientific and technological data corresponding to its ontology attributes, and taking the determined items as instances of that ontology;
determining the entities in the conceptual model according to the determined instances, and taking the ontology attributes of each ontology as the attribute information of the corresponding entity;
determining the association relationships between the entities according to the attribute information of each entity.
In the conceptual modeling stage, the data sets differ according to the purpose and scope for which the data is used. In other words, depending on the business perspective from which the data is viewed, the same data may be grouped in entirely different ways. When building the conceptual model, the ontologies must be defined first. An ontology has to be reconstructed as a cognitive model in a person's mind, and a person comes to understand an ontology by browsing the data that describes it. In the RDF model, RDF triples are unordered, which obviously makes it very difficult for users to read and browse RDF data. The entities therefore need a suitable organization, one that supports efficient data positioning and matches the way people think. The main steps are as follows:
1. Determine the scope: the core purpose of this data model definition is to develop a set of ontologies for use by the national science and technology management information system. The ontologies defined here cover the domain of national science and technology management. The specific business processes include project planning, compilation of project guidelines, project application, project review, periodic project assessment, and project acceptance. The objects involved include scientific and technical personnel, science and technology projects, science and technology value, science and technology condition resources, the science and technology environment, science and technology events, and so on.
2. Reuse ontologies: with the wide use of the Semantic Web, many ontologies have already been defined, especially in public or cross-cutting domains, and can be used directly. Existing ontology sets can therefore be reused, with reference to the relevant standards and specifications.
3. Enumerate terms: write out an unstructured list of all relevant terms. Terms can be drawn from the standards and specifications already issued in the science and technology domain, from general terms in information management and project management, and from sample data. In general, nouns form the basis of class names and verbs form the basis of attributes.
4. Define the classification: once the relevant terms are identified, they must be organized into a taxonomic hierarchy. Whether to work top-down or bottom-up depends on the situation. The important point is that the hierarchy is accurate: if A is a subclass of B, every instance of A must also be an instance of B; categories at the same level should be clearly delimited and mutually exclusive; and the hierarchy should be balanced, without large differences in depth between branches.
5. Define attributes: this step usually interleaves with the previous one, since it is natural to organize the attributes of classes while organizing the class hierarchy. When A is a subclass of B, any attribute declared for instances of B must also apply to instances of A. When associating an attribute with a class, it is best to declare the attribute's domain and range, which makes the attribute easier for subclasses to reuse. On the other hand, the finer the granularity of the domain and range the better, because inconsistencies and errors in the ontology can then be detected by checking for values that violate them. Specify whether an attribute allows or requires a certain number of distinct values; the attribute sex, for example, needs a fixed set of values: male, female, or other. The value of some attributes is derived from specific other values; the total funding of a project, for example, is the sum of its various funds.
6. Define instances: in practice, ontology instances are seldom defined directly. Usually the ontology is used to organize a set of instances, that is, the data in the data sources is organized under the different ontologies. The number of instances may exceed the number of ontology classes by several orders of magnitude, so instances are usually not organized by hand but extracted from databases or obtained from text corpora.
7. Build RDF relations based on UML: through the mapping between the basic relations of the two, UML is used to describe the relations and dependencies of the triples, so that the database can query RDF directly and existing informatization results can be better integrated.
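The subclass rule in step 4 and the domain and range check in step 5 can be sketched in Python as follows. The class names, the `leader` attribute, and the sample instances are hypothetical; the sketch only shows how declared domains and ranges let violations be detected mechanically.

```python
# Hypothetical class hierarchy: child -> parent (None marks a root class).
SUBCLASS_OF = {"KeyProject": "Project", "Project": None, "Person": None}

# Hypothetical attribute declarations: attribute -> (domain class, range class).
ATTRIBUTES = {"leader": ("Project", "Person")}


def is_a(cls, ancestor):
    """True if cls equals ancestor or is a direct or indirect subclass of it."""
    while cls is not None:
        if cls == ancestor:
            return True
        cls = SUBCLASS_OF[cls]
    return False


def check(instance_class, triples):
    """Return the (triple, 'domain' or 'range') pairs that violate a declaration."""
    errors = []
    for triple in triples:
        subject, attribute, value = triple
        domain, rng = ATTRIBUTES[attribute]
        if not is_a(instance_class[subject], domain):
            errors.append((triple, "domain"))
        if not is_a(instance_class[value], rng):
            errors.append((triple, "range"))
    return errors


instance_class = {"p1": "KeyProject", "alice": "Person"}
errors = check(instance_class, [
    ("p1", "leader", "alice"),   # valid: a KeyProject is also a Project
    ("alice", "leader", "p1"),   # invalid: wrong domain and wrong range
])
```

Because `KeyProject` is a subclass of `Project`, the attribute declared for `Project` applies to `p1` without being declared again.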
Once the ontologies are defined, the ontology of the science and technology domain can be constructed. People usually describe the concepts of a specific domain and their relationships with a set of vocabulary, and this vocabulary is called a domain ontology. Domain ontologies are generally built manually by domain experts and match the way people think. Partitioning the data by domain ontology therefore meets the requirements of subspace partitioning.
First, professionals provide a general conceptual framework for the science and technology management domain based on their own understanding of the domain knowledge, and these concepts are analyzed and synthesized. The core technique is to divide things into different themes whose theme spaces are orthogonal, that is, a change to the terms of one theme does not affect another theme space. Categories are then defined within each theme space, and things are classified and described according to each theme space.
Second, domain concepts are collected, which includes gathering candidate concepts and screening domain concepts. Candidate concepts come from the various documents produced during science and technology management and from the related literature. The terms that best represent the domain concepts are filtered out of the candidate set according to certain screening rules, and the theme space to which each concept belongs is determined.
Next, the domain concepts are organized. This is the key step of building the domain ontology and covers both hierarchical relationships and domain relationships. Hierarchical relationships are organized on the principle of high cohesion and low coupling, drawing on existing thesauri and classification methods.
Finally, the draft domain ontology is revised, evaluated, and formalized (described in RDFS). Professionals are asked to revise and evaluate the draft to ensure that the ontology is correct and logical. Each completed ontology serves as an entity in the model, and the association relationships between entities, such as transitive and inheritance relationships, can be determined from the attribute information of each entity. Thus, by first setting the ontologies and their ontology attributes, then determining the scientific and technological data from those attributes and building the entities from that data, the data covered by the resulting entities is more comprehensive and accurate, which helps the later lookup of target scientific and technological data.
In one embodiment of the invention, verifying the association relationships between the entities according to the attribute information of each entity includes:
normalizing and denormalizing the at least two items of scientific and technological data corresponding to each entity, according to the attribute information of that entity;
determining the association relationships between the items of scientific and technological data according to the processed data.
An entity's attributes are described by attribute words. An attribute word is the linguistic image of an attribute and denotes a structure or feature of an entity. An attribute word may at the same time be an entity word: "head", for example, is a part of "person" and so an attribute of a person, but it is also an entity with attributes of its own. An attribute word may also be a simple, non-physical attribute: a person's "age", for example, is not a physical thing. In a programming language, attribute words correspond to the member variables defined in a class, whose data type may be another class (an entity) or a basic data type (a simple attribute). An attribute is a structure or feature of an entity: it may be another entity that forms part of the current entity, or it may be some feature of the current entity. From a grammatical point of view, attribute words are nominal syntactic units, whether they denote a structural part or a feature of an entity. Attribute words are found to occur mainly as nouns or noun phrases, with some verbs and verb-object constructions as well.
A normal form is "a set of relation schemas that satisfy a certain level, indicating how well organized the dependencies between the attributes inside a relation are". It can be roughly understood as the level of design standard that the structure of a data table meets. First normal form: if the domains of all attributes in relation R are simple domains, relation schema R is in first normal form; that is, (1) there is a primary key, (2) the primary key cannot be null, (3) the primary key cannot repeat, and (4) fields cannot be subdivided further. Second normal form: if relation schema R is in first normal form and no non-prime attribute is partially dependent on the primary key, R is in second normal form; the main task of second normal form is to eliminate partial functional dependencies on top of first normal form. Third normal form: no non-prime attribute is transitively or partially dependent on the key. As an example of a transitive dependency, in R(A, B, C) with key A, the dependencies A→B and B→C make C transitively dependent on the key. Put briefly from another angle: first normal form, columns cannot be subdivided; second normal form, every table has a primary key; third normal form, related tables are linked by foreign keys. Data entities and attributes are divided according to these normal forms.
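Removing a transitive dependency can be illustrated with a small Python sketch. It assumes a hypothetical flat table in which `org_name` depends on `org_id`, which in turn depends on the key `project_id`; the column names and rows are invented for illustration.

```python
# Flat table: project_id -> org_id -> org_name is a transitive dependency.
flat_rows = [
    {"project_id": "P1", "title": "Graphene battery", "org_id": "O1", "org_name": "Institute X"},
    {"project_id": "P2", "title": "Rice genome", "org_id": "O1", "org_name": "Institute X"},
]


def to_third_normal_form(rows):
    """Split the flat table so that org_name is stored once per org_id."""
    projects = [
        {"project_id": r["project_id"], "title": r["title"], "org_id": r["org_id"]}
        for r in rows
    ]
    # org_id acts as the foreign key from projects into this table.
    organizations = {r["org_id"]: r["org_name"] for r in rows}
    return projects, organizations


projects, organizations = to_third_normal_form(flat_rows)
```

After the split, the organization name appears once instead of once per project, and the two tables are linked through `org_id`.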
Denormalization is the process of improving database read performance by adding redundant data or by grouping data. In some cases, denormalization helps to compensate for inefficiencies in relational database software. Even an optimized, normalized relational database can often impose a heavy access load.
A normalized database design stores different but related information in different logical tables. If these tables are also stored physically apart, a query that draws on several tables (for example, a JOIN operation) may be very slow.
A common approach is to denormalize the data design. This can also improve query response speed, but responsibility for data consistency then falls on the database designer rather than the DBMS. The database designer ensures consistency by creating rules in the database, called constraints. This increases the logical complexity of the database design, and the complexity of the additional constraints grows as well, which makes the approach hazardous. In addition, the constraints involve a trade-off: read operations (SELECT) become faster while write operations (INSERT, UPDATE and DELETE) become slower. A denormalized database may therefore have worse write performance than its normalized version.
A denormalized data model is not the same as a data model that was never normalized. Denormalization should be carried out only after normalization has reached a satisfactory level and all required constraints and rules have been established: for example, all relations are in third normal form, and relations with join dependencies and multivalued dependencies have been handled properly. Common denormalization techniques include duplicating attributes, merging and splitting entities, and duplicating entities in the runtime environment.
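The read/write trade-off described above can be illustrated with a small in-memory SQLite database. This is only a sketch under assumed names: the tables `person`, `project` and `project_flat`, the column `leader_name` and the helper `rename_person` are hypothetical and do not come from the patent.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
CREATE TABLE person (id INTEGER PRIMARY KEY, name TEXT NOT NULL);
CREATE TABLE project (
    id INTEGER PRIMARY KEY,
    title TEXT NOT NULL,
    leader_id INTEGER NOT NULL REFERENCES person(id)
);
-- Denormalized read table: leader_name is a redundant copy of person.name.
CREATE TABLE project_flat (
    id INTEGER PRIMARY KEY,
    title TEXT NOT NULL,
    leader_id INTEGER NOT NULL,
    leader_name TEXT NOT NULL
);
INSERT INTO person VALUES (1, 'Li');
INSERT INTO project VALUES (10, 'Graph search', 1);
INSERT INTO project_flat VALUES (10, 'Graph search', 1, 'Li');
""")

# Normalized read: the leader name has to be fetched through a JOIN.
joined = conn.execute(
    "SELECT p.title, s.name FROM project p JOIN person s ON s.id = p.leader_id"
).fetchall()

# Denormalized read: a single table is enough.
flat = conn.execute("SELECT title, leader_name FROM project_flat").fetchall()


def rename_person(conn, person_id, new_name):
    """Write path: the redundant copy must change in the same transaction."""
    with conn:  # commits on success, rolls back on error
        conn.execute("UPDATE person SET name = ? WHERE id = ?",
                     (new_name, person_id))
        conn.execute("UPDATE project_flat SET leader_name = ? WHERE leader_id = ?",
                     (new_name, person_id))


rename_person(conn, 1, "Li Wei")
flat_after = conn.execute("SELECT title, leader_name FROM project_flat").fetchall()
```

The read from `project_flat` avoids the JOIN, but every rename now costs two updates, and forgetting the second one would leave the redundant copy inconsistent.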
In summary, the overall data model establishes a data set that describes the ontology of the science and technology management domain, together with a metadata model that describes services and files. It supports the cataloguing of database data resources, provides services with a general ontology data model and specification for the science and technology domain, and supports data governance, data mining and data planning. The model is also open: users can continuously improve and supplement its data content, so that the data becomes self-populating and provides a good basis for data mining and knowledge discovery. The model can be used not only in the science and technology domain but also as a reference for building data centers in other industries.
In one embodiment of the present invention, a specific implementation of step 101 may include: grouping the items of scientific and technological data according to the association relations between them to obtain at least two scientific and technological data groups, where each group includes at least one item of scientific and technological data; and building the overall data model from the at least two scientific and technological data groups.
A specific implementation of step 103 may include: determining, from the overall data model, at least one scientific and technological data group corresponding to the query keyword; and determining the reference scientific and technological data from the at least one group so determined.
In this embodiment, once the association relations between the items of scientific and technological data have been determined, the data is organized, that is, grouped, according to those relations, which makes it easier to look up. During grouping, each association relation can be enumerated and the data grouped according to the enumerated relations, which helps to organize the data comprehensively and accurately. For example, a science and technology project has research staff who take part in it and auditors who review it; the research staff and the auditors are placed in the same scientific and technological data group. When searching, every associated item can then be found from the name of any person in the group or from the title of the project, which further improves the search efficiency for scientific and technological data.
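The grouping and group-based lookup described in this embodiment can be sketched as follows. The patent does not specify a grouping algorithm; this sketch assumes that items connected by any chain of association relations belong to one group (connected components computed with union-find), and the function names and sample items are hypothetical.

```python
from collections import defaultdict


def build_groups(items, relations):
    """Group items into connected components of the association graph."""
    parent = {item: item for item in items}

    def find(x):
        # Follow parent links to the group representative (with path halving).
        while parent[x] != x:
            parent[x] = parent[parent[x]]
            x = parent[x]
        return x

    for a, b in relations:
        parent[find(a)] = find(b)

    groups = defaultdict(set)
    for item in items:
        groups[find(item)].add(item)
    return list(groups.values())


def lookup(groups, keyword):
    """Return all items of every group that has a member containing keyword."""
    result = set()
    for group in groups:
        if any(keyword in item for item in group):
            result |= group
    return result


items = ["project:X", "researcher:Zhang", "auditor:Liu",
         "project:Y", "researcher:Chen"]
relations = [("project:X", "researcher:Zhang"),
             ("project:X", "auditor:Liu"),
             ("project:Y", "researcher:Chen")]

groups = build_groups(items, relations)
found = lookup(groups, "Zhang")
```

Searching for the researcher's name returns the project and the auditor as well, because all three were placed in the same group when the model was built.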
As shown in Fig. 2, an embodiment of the present invention provides a device for managing scientific and technological data, including a construction unit 201, a receiving unit 202 and a query unit 203, wherein:
The construction unit 201 is configured to build an overall data model, the overall data model including at least two items of scientific and technological data and the association relations between the items of scientific and technological data;
The receiving unit 202 is configured to receive a query keyword;
The query unit 203 is configured to determine, from the overall data model and according to the query keyword, reference scientific and technological data corresponding to the query keyword; to determine, according to the association relations of the reference scientific and technological data, target scientific and technological data corresponding to the query keyword; and to output the target scientific and technological data.
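The division of work among the three units can be sketched as a single class. This is an illustrative assumption rather than the patented implementation: the class name `ManagementDevice`, the record identifiers, substring matching of the keyword and following association relations one step from the reference data are all choices made for this example.

```python
class ManagementDevice:
    """Toy counterpart of units 201 (build), 202 (receive) and 203 (query)."""

    def __init__(self):
        self.records = {}   # record id -> descriptive text
        self.related = {}   # record id -> ids of directly associated records
        self.keyword = ""

    def build(self, records, relations):
        """Construction unit: store the data and its association relations."""
        self.records = dict(records)
        self.related = {rid: set() for rid in self.records}
        for a, b in relations:
            self.related[a].add(b)
            self.related[b].add(a)

    def receive(self, keyword):
        """Receiving unit: accept the query keyword."""
        self.keyword = keyword.strip()

    def query(self):
        """Query unit: find reference data, then follow its associations."""
        references = [rid for rid, text in self.records.items()
                      if self.keyword in text]
        targets = set(references)
        for rid in references:
            targets |= self.related[rid]
        return sorted(targets)


device = ManagementDevice()
device.build(
    {"p1": "Project: graph search", "r1": "Researcher: Zhang",
     "a1": "Auditor: Liu", "p2": "Project: sensors"},
    [("p1", "r1"), ("p1", "a1")],
)
device.receive(" graph ")
targets = device.query()
```

Only the records matching the keyword are scanned for; the associated researcher and auditor are reached through the stored relations rather than by matching each record again.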
In one embodiment of the present invention, the construction unit 201 is configured to: build a conceptual model that includes at least two entities, attribute information for each entity, and the association relations between the entities, where each entity corresponds to at least two items of scientific and technological data; verify the association relations between the entities according to the attribute information of each entity, and build a logical model according to the verification result; and build the overall data model from the conceptual model and the logical model.
In one embodiment of the present invention, the construction unit 201 is configured to: build a physical model that includes a preset data output mode; determine the concept-logic mapping between the conceptual model and the logical model and the concept-physical mapping between the conceptual model and the physical model; and build the overall data model from the determined concept-logic mapping and concept-physical mapping together with the conceptual model, the logical model and the physical model.
In one embodiment of the present invention, the construction unit 201 is configured to: determine at least two ontologies and the ontology properties of each ontology; for each ontology, determine at least two items of scientific and technological data corresponding to its ontology properties and take them as the instances of that ontology; determine the entities of the conceptual model from the instances so determined, taking the ontology properties of each ontology as the attribute information of the corresponding entity; and determine the association relations between the entities according to the attribute information of each entity.
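The path from ontologies to entities and their association relations can be sketched as below. The patent does not state how a data item is matched to an ontology or how a relation is derived from attribute information, so two assumptions are made and labeled in the code: a record is an instance of an ontology when it carries all of that ontology's properties, and two entities are associated when they share an attribute name. All names and sample records are hypothetical.

```python
from dataclasses import dataclass, field


@dataclass
class Entity:
    name: str
    attributes: frozenset
    instances: list = field(default_factory=list)


def build_conceptual_model(ontologies, records):
    """ontologies: name -> set of property names; records: list of dicts."""
    entities = []
    for name, props in ontologies.items():
        # Assumption: a record is an instance if it has every ontology property.
        instances = [r for r in records if props.issubset(r)]
        if len(instances) >= 2:  # the embodiment calls for at least two items
            entities.append(Entity(name, frozenset(props), instances))

    # Assumption: entities sharing an attribute name are associated through it.
    relations = {
        (a.name, b.name): a.attributes & b.attributes
        for i, a in enumerate(entities)
        for b in entities[i + 1:]
        if a.attributes & b.attributes
    }
    return entities, relations


ontologies = {
    "Project": {"project_id", "title"},
    "Person": {"person_id", "name", "project_id"},
}
records = [
    {"project_id": 1, "title": "Graph search"},
    {"project_id": 2, "title": "Sensors"},
    {"person_id": 7, "name": "Zhang", "project_id": 1},
    {"person_id": 8, "name": "Liu", "project_id": 1},
]

entities, relations = build_conceptual_model(ontologies, records)
```

Here the shared `project_id` attribute yields one association between the Project and Person entities, which a later verification step could check against the actual data.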
In one embodiment of the present invention, the construction unit 201 is configured to: normalize and denormalize the at least two items of scientific and technological data corresponding to each entity according to the attribute information of that entity; and determine the association relations between the items of scientific and technological data from the processed data.
Details such as the information exchange between the units of the above device and their execution are based on the same concept as the method embodiments of the present invention; for specifics, refer to the description of the method embodiments, which is not repeated here.
An embodiment of the present invention further provides a computer-readable medium containing executable instructions; when a processor of a storage controller executes the instructions, the storage controller performs the method provided by any of the above embodiments of the present invention.
An embodiment of the present invention further provides a storage controller including a processor, a memory and a bus. The memory stores executable instructions, and the processor is connected to the memory through the bus; when the storage controller runs, the processor executes the instructions stored in the memory, so that the storage controller performs the method provided by any of the above embodiments of the present invention.
In summary, the above embodiments of the present invention have at least the following advantages:
1. In the embodiments of the present invention, an overall data model is built in advance. When a query keyword is received, reference scientific and technological data corresponding to the query keyword is determined from the overall data model, and target scientific and technological data corresponding to the query keyword is determined according to the association relations of that reference data. Because the overall data model contains the association relations between the items of scientific and technological data, it is enough to determine any one item of reference data corresponding to the query keyword; all target data can then be determined from the association relations, without traversing a large volume of data samples, which improves the efficiency of data lookup.
2. In the embodiments of the present invention, a conceptual model, a logical model and a physical model are built, the concept-logic mapping between the conceptual model and the logical model and the concept-physical mapping between the conceptual model and the physical model are determined, and the overall data model is built from these mappings together with the conceptual model, the logical model and the physical model. This makes the resulting overall data model more accurate and practical, which further improves the efficiency of data lookup.
3. In the embodiments of the present invention, ontologies and ontology properties are determined, the corresponding scientific and technological data is taken as the instances of each ontology according to its ontology properties, and the entities of the conceptual model are constructed from these instances. The scientific and technological data covered by the resulting entities is therefore more comprehensive and accurate, which facilitates later lookup of target scientific and technological data.
It should be noted that, in this document, relational terms such as first and second are used only to distinguish one entity or operation from another, and do not necessarily require or imply any actual relationship or order between those entities or operations. Moreover, the terms "comprise" and "include" and any of their variants are intended to cover non-exclusive inclusion, so that a process, method, article or device that includes a list of elements includes not only those elements but also other elements not expressly listed, or elements inherent to the process, method, article or device. Without further limitation, an element introduced by "including a ..." does not exclude the presence of further identical elements in the process, method, article or device that includes it.
A person of ordinary skill in the art will understand that all or some of the steps of the above method embodiments can be carried out by hardware under the control of program instructions. The program can be stored in a computer-readable storage medium and, when executed, performs the steps of the above method embodiments. The storage medium includes any medium that can store program code, such as ROM, RAM, a magnetic disk or an optical disc.
Finally, it should be noted that the above are only preferred embodiments of the present invention, intended to illustrate its technical solution and not to limit its scope of protection. Any modification, equivalent substitution or improvement made within the spirit and principles of the present invention falls within its scope of protection.

Claims (10)

  1. A method for managing scientific and technological data, characterized by comprising:
    building an overall data model, the overall data model comprising at least two items of scientific and technological data and the association relations between the items of scientific and technological data;
    and further comprising:
    receiving a query keyword;
    determining, from the overall data model and according to the query keyword, reference scientific and technological data corresponding to the query keyword;
    determining, according to the association relations of the reference scientific and technological data, target scientific and technological data corresponding to the query keyword, and outputting the target scientific and technological data.
  2. The management method according to claim 1, characterized in that
    the building of the overall data model comprises:
    building a conceptual model, the conceptual model comprising at least two entities, attribute information for each entity, and the association relations between the entities, wherein each entity corresponds to at least two items of scientific and technological data;
    verifying the association relations between the entities according to the attribute information of each entity, and building a logical model according to the verification result;
    building the overall data model from the conceptual model and the logical model.
  3. The management method according to claim 2, characterized in that
    after the building of the logical model, the method further comprises:
    building a physical model, the physical model comprising a preset data output mode;
    the building of the overall data model from the conceptual model and the logical model comprises:
    determining the concept-logic mapping between the conceptual model and the logical model, and the concept-physical mapping between the conceptual model and the physical model;
    building the overall data model from the determined concept-logic mapping and concept-physical mapping together with the conceptual model, the logical model and the physical model.
  4. The management method according to claim 2, characterized in that
    the building of the conceptual model comprises:
    determining at least two ontologies and the ontology properties of each ontology;
    for each ontology: determining at least two items of scientific and technological data corresponding to its ontology properties, and taking the at least two items so determined as the instances of that ontology;
    determining the entities of the conceptual model from the instances so determined, and taking the ontology properties of each ontology as the attribute information of the corresponding entity;
    determining the association relations between the entities according to the attribute information of each entity.
  5. The management method according to claim 4, characterized in that
    the verifying of the association relations between the entities according to the attribute information of each entity comprises:
    normalizing and denormalizing the at least two items of scientific and technological data corresponding to each entity according to the attribute information of that entity;
    determining the association relations between the items of scientific and technological data from the processed data;
    and/or
    the building of the overall data model comprises:
    grouping the items of scientific and technological data according to the association relations between them to obtain at least two scientific and technological data groups, wherein each group comprises at least one item of scientific and technological data;
    building the overall data model from the at least two scientific and technological data groups;
    the determining, from the overall data model and according to the query keyword, of reference scientific and technological data corresponding to the query keyword comprises:
    determining, from the overall data model, at least one scientific and technological data group corresponding to the query keyword;
    determining the reference scientific and technological data from the at least one scientific and technological data group so determined.
  6. A device for managing scientific and technological data, characterized by comprising a construction unit, a receiving unit and a query unit, wherein
    the construction unit is configured to build an overall data model, the overall data model comprising at least two items of scientific and technological data and the association relations between the items of scientific and technological data;
    the receiving unit is configured to receive a query keyword;
    the query unit is configured to determine, from the overall data model and according to the query keyword, reference scientific and technological data corresponding to the query keyword; to determine, according to the association relations of the reference scientific and technological data, target scientific and technological data corresponding to the query keyword; and to output the target scientific and technological data.
  7. The device according to claim 6, characterized in that
    the construction unit is configured to: build a conceptual model, the conceptual model comprising at least two entities, attribute information for each entity, and the association relations between the entities, wherein each entity corresponds to at least two items of scientific and technological data; verify the association relations between the entities according to the attribute information of each entity, and build a logical model according to the verification result; and build the overall data model from the conceptual model and the logical model.
  8. The device according to claim 7, characterized in that
    the construction unit is configured to: build a physical model, the physical model comprising a preset data output mode; determine the concept-logic mapping between the conceptual model and the logical model, and the concept-physical mapping between the conceptual model and the physical model; and build the overall data model from the determined concept-logic mapping and concept-physical mapping together with the conceptual model, the logical model and the physical model.
  9. The device according to claim 7, characterized in that
    the construction unit is configured to: determine at least two ontologies and the ontology properties of each ontology; for each ontology, determine at least two items of scientific and technological data corresponding to its ontology properties, and take the at least two items so determined as the instances of that ontology; determine the entities of the conceptual model from the instances so determined, and take the ontology properties of each ontology as the attribute information of the corresponding entity; and determine the association relations between the entities according to the attribute information of each entity.
  10. The device according to claim 9, characterized in that
    the construction unit is configured to: normalize and denormalize the at least two items of scientific and technological data corresponding to each entity according to the attribute information of that entity; and determine the association relations between the items of scientific and technological data from the processed data.

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201711043893.8A CN107766545A (en) 2017-10-31 2017-10-31 Scientific and technological data management method and device

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201711043893.8A CN107766545A (en) 2017-10-31 2017-10-31 Scientific and technological data management method and device

Publications (1)

Publication Number Publication Date
CN107766545A true CN107766545A (en) 2018-03-06

Family

ID=61271236

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201711043893.8A Pending CN107766545A (en) 2017-10-31 2017-10-31 Scientific and technological data management method and device

Country Status (1)

Country Link
CN (1) CN107766545A (en)

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102495860A (en) * 2011-11-22 2012-06-13 北京大学 Expert recommendation method based on language model
CN103366735A (en) * 2012-03-29 2013-10-23 北京中传天籁数字技术有限公司 A voice data mapping method and apparatus
CN104424310A (en) * 2013-09-06 2015-03-18 中国海洋大学 Ontology-based smart home semantic query method and ontology-based smart home semantic query device
US20170024493A1 (en) * 2015-07-23 2017-01-26 Autodesk, Inc. System-level approach to goal-driven design

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112115235A (en) * 2020-09-28 2020-12-22 中国建设银行股份有限公司 Entity attribute data query and configuration method, device and server
CN113191145A (en) * 2021-05-21 2021-07-30 百度在线网络技术(北京)有限公司 Keyword processing method and device, electronic equipment and medium
CN113191145B (en) * 2021-05-21 2023-08-11 百度在线网络技术(北京)有限公司 Keyword processing method and device, electronic equipment and medium

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication
Application publication date: 20180306