CN104182454A - Multi-source heterogeneous data semantic integration model constructed based on domain ontology and method - Google Patents
Multi-source heterogeneous data semantic integration model constructed based on domain ontology and method Download PDFInfo
- Publication number
- CN104182454A CN104182454A CN201410317211.8A CN201410317211A CN104182454A CN 104182454 A CN104182454 A CN 104182454A CN 201410317211 A CN201410317211 A CN 201410317211A CN 104182454 A CN104182454 A CN 104182454A
- Authority
- CN
- China
- Prior art keywords
- semantic
- ontology
- data
- concept
- local
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Links
- 238000000034 method Methods 0.000 title claims abstract description 58
- 230000010354 integration Effects 0.000 title abstract description 6
- 238000013507 mapping Methods 0.000 claims abstract description 20
- 238000010276 construction Methods 0.000 claims abstract description 16
- 230000008901 benefit Effects 0.000 claims abstract description 9
- 230000000295 complement effect Effects 0.000 claims abstract description 6
- 238000006116 polymerization reaction Methods 0.000 claims description 13
- 150000001875 compounds Chemical class 0.000 claims description 12
- 238000004458 analytical method Methods 0.000 claims description 10
- 238000005516 engineering process Methods 0.000 claims description 9
- 238000004422 calculation algorithm Methods 0.000 claims description 6
- 238000005520 cutting process Methods 0.000 claims description 6
- 239000000463 material Substances 0.000 claims description 6
- 239000011159 matrix material Substances 0.000 claims description 6
- 230000000694 effects Effects 0.000 claims description 4
- 238000001914 filtration Methods 0.000 claims description 4
- 230000006870 function Effects 0.000 claims description 4
- 230000008569 process Effects 0.000 claims description 4
- 238000011156 evaluation Methods 0.000 claims description 3
- 238000010801 machine learning Methods 0.000 claims description 3
- 238000005065 mining Methods 0.000 claims description 3
- 238000012545 processing Methods 0.000 claims description 3
- 238000005295 random walk Methods 0.000 claims description 3
- 238000012549 training Methods 0.000 claims description 3
- 238000004220 aggregation Methods 0.000 abstract 3
- 230000002776 aggregation Effects 0.000 abstract 3
- 238000005457 optimization Methods 0.000 abstract 2
- 239000003208 petroleum Substances 0.000 description 5
- 230000008878 coupling Effects 0.000 description 4
- 238000010168 coupling process Methods 0.000 description 4
- 238000005859 coupling reaction Methods 0.000 description 4
- 238000013523 data management Methods 0.000 description 3
- 238000010586 diagram Methods 0.000 description 3
- 238000007726 management method Methods 0.000 description 3
- 238000004364 calculation method Methods 0.000 description 2
- 238000011161 development Methods 0.000 description 2
- 238000009472 formulation Methods 0.000 description 2
- 239000000203 mixture Substances 0.000 description 2
- 238000011160 research Methods 0.000 description 2
- 239000004215 Carbon black (E152) Substances 0.000 description 1
- 101100297738 Danio rerio plekho1a gene Proteins 0.000 description 1
- 238000013459 approach Methods 0.000 description 1
- 238000013473 artificial intelligence Methods 0.000 description 1
- 230000015572 biosynthetic process Effects 0.000 description 1
- 230000008859 change Effects 0.000 description 1
- 239000012141 concentrate Substances 0.000 description 1
- 229930195733 hydrocarbon Natural products 0.000 description 1
- 150000002430 hydrocarbons Chemical class 0.000 description 1
- 230000006872 improvement Effects 0.000 description 1
- 230000007774 longterm Effects 0.000 description 1
- 238000004519 manufacturing process Methods 0.000 description 1
- 230000035800 maturation Effects 0.000 description 1
- 230000007246 mechanism Effects 0.000 description 1
- 238000012986 modification Methods 0.000 description 1
- 230000004048 modification Effects 0.000 description 1
- 238000003058 natural language processing Methods 0.000 description 1
- 230000000704 physical effect Effects 0.000 description 1
- 230000000630 rising effect Effects 0.000 description 1
- 238000007789 sealing Methods 0.000 description 1
- 238000003860 storage Methods 0.000 description 1
- 238000005303 weighing Methods 0.000 description 1
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/30—Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
- G06F16/35—Clustering; Classification
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/20—Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
- G06F16/21—Design, administration or maintenance of databases
- G06F16/211—Schema design and management
- G06F16/212—Schema design and management with details for data modelling support
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/20—Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
- G06F16/24—Querying
- G06F16/242—Query formulation
- G06F16/243—Natural language query formulation
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/20—Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
- G06F16/25—Integrating or interfacing systems involving database management systems
- G06F16/256—Integrating or interfacing systems involving database management systems in federated or virtual databases
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/80—Information retrieval; Database structures therefor; File system structures therefor of semi-structured data, e.g. markup language structured data such as SGML, XML or HTML
- G06F16/83—Querying
- G06F16/835—Query processing
- G06F16/8358—Query translation
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Databases & Information Systems (AREA)
- Physics & Mathematics (AREA)
- Data Mining & Analysis (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Mathematical Physics (AREA)
- Artificial Intelligence (AREA)
- Computational Linguistics (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
- Machine Translation (AREA)
Abstract
The invention discloses a multi-source heterogeneous data semantic integrated model constructed based on domain ontology. The multi-source heterogeneous data semantic integration model comprises a local ontology constructing module, a domain ontology merging module and a semantic inquiring dynamic propagation and protocol module. A multi-source heterogeneous data semantic integrating method comprises the following steps: constructing the domain ontology through an ontology merging technique and establishing a semantic mapping relation between a data source and the local ontology as well as between the local ontology and the domain ontology; combining the complementary advantage of social label and ontology on knowledge representation and performing inquiring protocol and propagation for the semantic inquiring request of the user, thereby generating a formal semantic inquiring statement; respectively inquiring a plurality of data sources, performing duplicate removal and aggregation optimization on the inquiring result, and lastly feeding back to the user. The invention provides an aggregation heterogeneous data semantic integration model constructed based on domain ontology and the method through the construction and mapping of the domain ontology, semantic inquiring propagation and result aggregation optimization.
Description
Technical field
The crossing domain that the invention belongs to the semantic integrated and oil-gas exploration Knowledge Discovery of isomeric data, relates in particular to a kind of multi-source heterogeneous data semantic integrated model and method building based on domain body.
Background technology
Along with industrialization and information-based deepening continuously of merging, oil-gas exploration enterprise in long-term production run, accumulated enrich full and accurate such as reconnoitring, the data such as earthquake, well logging.These are containing abundant geological information with different structure, the data that are stored in diverse location, have obvious potential using value, are the important evidence of the following exploration and development of oil gas field, are important sci-tech innovation resources.Utilize method and the technology such as Knowledge Discovery, artificial intelligence, from the existing oil-gas exploration data that pile up like a mountain, find out effective, novel, that there is potential utility, final intelligible rule and pattern, and integrate the large scale scale heterogeneous data resource of this class and realize data semantic and share, can provide more complete and reliable data, services support for decision maker, thereby instruct the technology decision-making of oil-gas exploration enterprise formulation science, for enterprise creates huge economic benefit.This is from " management data ", to rise to the effective way of " managerial knowledge ", is also important topic and the direction of current and following each field in-depth Information System configuration development.
The informatization of petroleum exploration domain starts from six the seventies, because the specialized fields relating to is many, each specialty all adopts the standard of sealing, do not have common standard to follow, therefore caused the phenomenon that between different departments and system, information sharing difficulty, a large amount of " information island " exist, for the integrated application of petroleum exploration domain data with share and brought challenge.For the difficulty facing in oil-gas exploration Data management and utilizaition, Chinese scholars has proposed some active data integrated approaches and has developed corresponding system.But these methods have still adopted the integrated technology of traditional database and middleware Technology generally, lack real-time semantic support, at the aspects such as dynamic mapping of inquiry request semantic extension, resource, have many limitations, and accuracy rate and efficiency are not satisfactory.
The data relevant to oil-gas exploration comprise underground tectonic structure, lithology, physical property, electrically, the raw storage lid of oil gas etc.These data are from different departments and area, various informative (comprising the image, video, text of relational database, GIS data, CAD figure, various forms etc.), and there is different structure type (comprise structurized, semi-structured with non-structured), be difficult to the multi-source heterogeneous data of this class to manage uniformly integrated and shared with semantic hierarchies.The more important thing is, increasing hydrocarbon resources can the exploration information based on existing be explored and develop, they have become the important data assets in oil company, to improving accuracy and the correct decision-making of auxiliary formulation of the oil-gas exploration level of information technology, data interpretation, all significant, effective utilization of these data, also has important effect to improving the economic benefit of oil company.In addition, the change of decision maker's mode of thinking is also for the knowledge architecture of data has been established ideal basis, the management philosophy of " buy but not build " is advocated in modern data management, i.e. competition is not placed on and how obtains and to occupy in data, but is placed in the careful analysis of data and Study on Interpretation.Therefore, from these multi-source heterogeneous oil-gas exploration data, carrying out the unified of knowledge builds and semantic integrated having great importance.
Body, as a kind of clear and definite Formal Specification of shared ideas model, has good concept hierarchy and reasoning from logic tenability, be applicable to describe very much domain knowledge standard, and to the semanteme of isomeric data, integrated and inquiry has very strong supporting role.Body is introduced in oil-gas exploration data integration, contributed to solve the difficulties that it lacks the aspects such as semantic support and standardization in data integration.Therefore, ontology is incorporated into oil-gas exploration Knowledge Discovery and knowledge engineering field, becoming " data management " is " information management ", contributes to solve the problem that the aspects such as current oil-gas exploration data sharing degree is low, Semantic Heterogeneous exist.In order to seek the breakthrough in the integrated theory of oil-gas exploration data semantic and method, oil-gas exploration data semantic integration mechanism and model that foundation builds based on domain body, can effectively improve the value of available data, the science decision of petroleum engineering relevant departments will be had to important theory significance and practical value.
Summary of the invention
The object of the embodiment of the present invention is to provide a kind of integrated models and methods of multi-source heterogeneous data semantic building based on domain body, is intended to solve the integrated and shared problem of semanteme between oil-gas exploration isomeric data.
The embodiment of the present invention is achieved in that a kind of integrated method of multi-source heterogeneous data semantic building based on domain body, and the integrated method of multi-source heterogeneous data semantic that should build based on domain body comprises:
Utilize ontology construction, automatically, semi-automatically set up the local ontology of multi-source heterogeneous data, by ontology merging technique construction domain body, and set up the Semantic mapping relation between data source and local ontology, local ontology and domain body;
In conjunction with society's mark and the complementary advantage of body in knowledge representation, stipulations and expansion are inquired about in semantic query request to user, and the semantic query statement of generating standard, inquires about respectively a plurality of data sources, then by Query Result duplicate removal and optimizing polymerization, finally return to user.
Another object of the present invention is to provide a kind of integrated model of multi-source heterogeneous data semantic building based on domain body, the multi-source heterogeneous data semantic integrated model that should build based on domain body comprises: local ontology builds module, ontology merging module and semantic query dynamic expansion and stipulations module;
Local ontology builds module, according to data source feature, selects adaptively body construction strategy, thereby constructs oil-gas exploration local ontology;
Ontology merging module, building module with local ontology is connected, the ontology merging method that employing combines concept matching and attributes match, utilize maximum information coefficient (MIC) method to calculate the semantic similarity of Concept Semantic Similarity and concept attribute, realize a plurality of local ontology to the flexible merging of domain body;
Semantic query dynamic expansion and stipulations module, build module with local ontology and be connected, for the optimizing polymerization of validity and the result of inquiry request dynamic expansion.
Further, local ontology builds module, according to data source feature, by self-adaptation body construction strategy, carries out the structure of local ontology, specifically comprises:
Step 1, based on unstructured data sources, build local ontology:
First, applicating text filtrator changes into different file layouts into text-only file form, obtains language material data, and carries out consistency check; Then, adopt reverse maximum classification Chinese word cutting method to carry out preliminary cutting to these language materials and process, obtain word string set; Then, utilize maximum information coefficient (MIC) method to calculate the internal bond strength of word string, obtain compound word set, and judge the domain-specific of compound word and non-compound word, extract concept set; Then, the classification relation on application drawing between Random Walk Algorithm fuzzy filtering word concept, adopts the clustering algorithm based on hidden Markov model (HMM) to extract the classification relation between non-compound word concept; Then, use the method based on association rule mining to obtain the non-categorical relation between concept; Finally, the local ontology of applied ontology the build tool output OWL form;
Step 2, builds local ontology based on structured data source:
First, utilize the Semantic mapping relation between R2O technology building database pattern and ontology model, thereby be the concept in body the relationship map in relational database, attribute is mapped as to OWL attribute accordingly, and the relation table of database is converted into body class, the data in database are converted into example; Then, the initial local ontology extracting from database is done to a series of standardization work, by carrying out semantic similarity calculating with standard body, the ontology information that meets threshold value is set up to semantic relation, the ontology information that does not meet threshold value carries out standardization processing, thereby constructs satisfactory standardization local ontology;
Step 3, builds local ontology based on semi-structured data source
Due to semi-structured data between structuring and unstructured data, there is implicit structure but lack class data of fixing or strict structure, so, ontology construction based on above-mentioned two kinds of data types also can be applied to semi-structured data source, first, extract semi-structured data pattern, given mapping ruler, utilizes XML2RD method, and semi-structured data is converted into structural data; Then, according to structural data, build the local ontology corresponding to method construct semi-structured data source of local ontology.
Further, the method for ontology merging block merging is:
The ontology merging method that employing combines concept matching and attributes match, utilize maximum information coefficient MIC method to calculate the semantic similarity of Concept Semantic Similarity and concept attribute, then, by similarity assessment function, the similarity between concept is assessed, output similar matrix, and use field axiom constraint knowledge further to assess its similarity to similar matrix; Then,, by the method training study sorter of machine learning, utilize learning classification device to calculate the similarity between concept example; Finally, by in conjunction with ISO15926 oil gas body and fuzzy formal concept analysis method, consider symmetry and the transitive relations of semantic similarity, fuzzy set theory is introduced in the setting of semantic similarity, realize a plurality of local ontology to the flexible merging of domain body.
Further, the concrete grammar that semantic query dynamic expansion and stipulations module realize is:
First, the conceptual relation and the inferential capability that by the semantic analysis of society mark and body, comprise, inquiry request is carried out to grammer and stipulations semantically and expansion, the semantic query statement of generating standard, solve between inquiry request and domain body data source different the caused mismatch problems due to expression-form, and automatically recommend the semantic respective labels of cluster according to user's inquiry request, for realizing data source, accurately assemble guiding is provided; Then, by the semantic similarity calculating between expanding query request and domain body concept, quantize the degree of association between request and resource concept; Finally, the Concept Semantic relation of enriching of utilizing society's mark and body to comprise, Query Result pattern is carried out to semantic annotations, according to the semanteme overall situation effect of society's mark, introducing is usingd the most relevant credible mark that statistic analysis result obtains data source pointed as one of Query Result reliability evaluation standard, result set is carried out to duplicate removal and optimizing polymerization, realize believable Top-K inquiry.
The integrated models and methods of multi-source heterogeneous data semantic building based on domain body provided by the invention, by structure and mapping, query semantics expansion and the result optimizing polymerization of domain body, provide a kind of oil-gas exploration isomeric data building based on domain body semantic integrated model, solved integrated and shared between oil-gas exploration isomeric data, this semanteme integrated model also has loose coupling, easily expansion, supports the superperformances such as semantic query.
The present invention has realized the dynamic growth of data source, for newly-increased data source, only need to provide corresponding wrapper, builds corresponding local ontology, can improve dirigibility and the practicality of integrated system.With domain body, domain knowledge is described, local ontology is described the Heterogeneous Information knowledge in a certain field, and the mapping of setting up respectively mapping, local ontology and the data source of domain body and local ontology, domain body, local ontology and data source were both interknited, relatively independent again, can reduce the coupling of semantic integrated system.In order to realize semantic query and ease for use, in conjunction with society's mark and the complementary advantage of body in knowledge representation, stipulations and expansion are inquired about in semantic query request to user, and to Query Result duplicate removal and optimizing polymerization, the result after optimizing the most at last returns to user.
Accompanying drawing explanation
Fig. 1 is the integrated method flow diagram of multi-source heterogeneous data semantic building based on domain body that the embodiment of the present invention provides;
Fig. 2 is the structural representation of the multi-source heterogeneous data semantic integrated model building based on domain body that provides of the embodiment of the present invention;
In figure: 1, local ontology builds module; 2, ontology merging module; 3, semantic query dynamic expansion and stipulations module;
Fig. 3 is the embodiment of the multi-source heterogeneous data semantic integrated model building based on domain body that provides of the embodiment of the present invention;
Fig. 4 is the process flow diagram that the unstructured data sources that provides of the embodiment of the present invention builds local ontology;
Fig. 5 is the simple abstract body schematic diagram that the embodiment of the present invention provides.
Embodiment
In order to make object of the present invention, technical scheme and advantage clearer, below in conjunction with embodiment, the present invention is further elaborated.Should be appreciated that specific embodiment described herein, only in order to explain the present invention, is not intended to limit the present invention.
Below in conjunction with drawings and the specific embodiments, application principle of the present invention is further described.
As shown in Figure 1, the integrated method of multi-source heterogeneous data semantic building based on domain body of the embodiment of the present invention comprises:
S101: utilize ontology construction, automatically, semi-automatically set up the local ontology of multi-source heterogeneous data, by ontology merging technique construction domain body, and set up the Semantic mapping relation of data source and local ontology, local ontology and domain body;
S102: in conjunction with society's mark and the complementary advantage of body in knowledge representation, stipulations and expansion are inquired about in semantic query request to user, and the semantic query statement of generating standard, inquires about respectively a plurality of data sources, then by Query Result duplicate removal and optimizing polymerization, finally return to user.
As shown in Figure 2, the multi-source heterogeneous data semantic integrated model building based on domain body of the embodiment of the present invention mainly by: local ontology builds module 1, ontology merging module 2 and semantic query dynamic expansion and stipulations module 3 forms;
Local ontology builds module 1, according to data source feature, selects adaptively body construction strategy, thereby constructs oil-gas exploration local ontology;
Ontology merging module 2, building module 1 with local ontology is connected, the ontology merging method that employing combines concept matching and attributes match, utilize maximum information coefficient MIC method to calculate the semantic similarity of Concept Semantic Similarity and concept attribute, realize a plurality of local ontology to the flexible merging of domain body;
Semantic query dynamic expansion and stipulations module 3, build module 1 with local ontology and be connected, for the optimizing polymerization of validity and the result of inquiry request dynamic expansion.
3 pairs of multi-source heterogeneous data semantic integrated models that build based on domain body of the present invention are described further by reference to the accompanying drawings:
1, local ontology builds module:
At present, also there is no maturation, unified methodology instructs the structure of body, set up relatively complete domain body, not only need to catch a large amount of field concept knowledge, also need to solve the semantic conflict of these field concepts, the problems such as ambiguity, this work that causes body to build is both dull, difficult thorny again, become the bottleneck of knowledge acquisition, in addition, although the editing environment of existing body the build tool can meet the needs of setting up body, but by the manual relation of collecting between field concept and field concept, build body completely, remain a job of wasting time and energy, make the application based on body be difficult to promote, therefore, research is automatic from multi-source heterogeneous data, semi-automatically build domain body, and realize the fast mapping of domain body and multi-source heterogeneous data, the semanteme during thereby solution body builds and inquires about is inconsistent and time bottleneck problem, this has important theory significance and a researching value to the data semantic based on domain body is integrated,
Petroleum exploration domain body self-adaptation construction strategy can be according to data source feature, select adaptively different body construction strategy, be mainly divided into four aspects: based on unstructured data sources (as document, webpage etc.), build local ontology, based on structured data source (as relational database), build local ontology, based on semi-structured data source structure local ontology and local ontology, merge into domain body.
1) based on unstructured data sources, build local ontology:
When unstructured data sources is built into body, first to carry out data pre-service, the pretreated fundamental purpose of data is by filtering unworthy symbol and word, thereby extract significant term, current, some natural language processing instruments (as OpenNLP, CKIP etc.) can be realized this function preferably, in the present invention, adopt OpenNLP to realize the pre-service work of unstructured data sources.
For unstructured data sources, first, applicating text filtrator changes into different file layouts into text-only file form, obtains language material data, and carries out consistency check; Then, adopt reverse maximum classification Chinese word cutting method to carry out preliminary cutting to these language materials and process, obtain word string set; Then, utilize maximum information coefficient (MIC) to calculate the internal bond strength of word string, obtain compound word set, and judge the domain-specific of compound word and non-compound word, extract concept set; Then, the classification relation on application drawing between Random Walk Algorithm fuzzy filtering word concept, adopts the clustering algorithm based on hidden Markov model (HMM) to extract the classification relation between non-compound word concept; Then, use the method based on association rule mining to obtain the non-categorical relation between concept; Finally, the local ontology of applied ontology the build tool output OWL form.Idiographic flow as shown in Figure 4.
2) based on structured data source, build local ontology:
From structured data source, building local ontology mainly considers by Semantic mapping, automatically to build petroleum exploration domain body from relational database.First, utilize the Semantic mapping relation between R2O technology building database pattern and ontology model, thereby be the concept in body the relationship map in relational database, attribute is mapped as to OWL attribute accordingly, and the relation table of database is converted into body class, the data in database are converted into example; Then, the initial local ontology extracting from database is done to standardization work, by carrying out semantic similarity calculating with standard body, the ontology information that meets threshold value is set up to semantic relation, the ontology information that does not meet threshold value carries out standardization processing, thereby constructs normalized local ontology.
3) based on semi-structured data source, build local ontology:
First, extract semi-structured data pattern, given mapping ruler, utilizes XML2RD method, and semi-structured data is converted into structural data; Then, according to structural data, build the local ontology corresponding to method construct semi-structured data source of local ontology.
2, ontology merging module:
The research of ontology merging at present mainly concentrates in concept matching, by coupling, find two semantic corresponding relations between Ontological concept, this will cause the incomplete situation of ontology merging result, in the present invention, adopt the ontology merging method that concept matching and attributes match are combined, utilize maximum information coefficient MIC method to calculate the semantic similarity of Concept Semantic Similarity and concept attribute, then, by similarity assessment function, the similarity between concept is assessed, output similar matrix, and use field axiom constraint knowledge further to assess its similarity to similar matrix, then,, by the method training study sorter of machine learning, utilize learning classification device to calculate the similarity between concept example, finally, by in conjunction with ISO15926 oil gas body and fuzzy formal concept analysis (FFCA) method, consider symmetry and the transitive relations of semantic similarity, fuzzy set theory is introduced in the setting of semantic similarity, thereby realize a plurality of local ontology to the flexible merging of domain body,
3, semantic query dynamic expansion and stipulations module:
The validity of inquiry request dynamic expansion and the optimizing polymerization of result are the integrated another one key issues of oil-gas exploration data semantic, first, the conceptual relation and the inferential capability that by the semantic analysis of society mark and body, comprise, inquiry request is carried out to grammer and stipulations semantically and expansion, the semantic query statement of generating standard (normally SPARQL statement), solve between inquiry request and domain body data source different the caused mismatch problems due to expression-form, and automatically recommend the semantic respective labels of cluster according to user's inquiry request, for realizing data source, accurately focus on guiding is provided, then, by the semantic similarity calculating between expanding query request and domain body concept, quantize the degree of association between request and resource concept, finally, the Concept Semantic relation of enriching of utilizing society's mark and body to comprise, Query Result pattern is carried out to semantic annotations, according to the semanteme overall situation effect of society's mark, introducing is usingd the most relevant credible mark that statistic analysis result obtains data source pointed as one of Query Result reliability evaluation standard, result set is carried out to duplicate removal and optimizing polymerization, realize believable Top-K inquiry.
Semantic distance is often referred to two close degree of the semanteme between concept, it is a kind of effective means of weighing semantic similarity, semantic distance between two concepts is less, their semanteme is more approaching, otherwise far away, semantic distance and semantic similarity are generally inverse relation, in information retrieval field, semantic distance is less, descriptive information is more approaching with user's inquiry request, when semantic distance is 0, information meets user's request completely, when semantic distance is greater than certain value, information and user inquire about onrelevant, therefore can not as a result of collect and return, for the result set returning, whether the arbitrary result in the own subjective judgement result set of user meets the demands completely, semantic distance based on body is mainly measured in body the length of fillet between concept, two concepts access path in body is shorter, they are just more similar, the quantity of information that the length of every fillet is comprised by it determines, if p (c) is the probability of happening of concept c in whole concept set, if count (c) is the occurrence number of concept c in body O, count (O) is the total number of concept in body O, because concept may appear at the form of different abstraction hierarchies in body, the occurrence number of its all sub-concepts that should add up while calculating the total occurrence number of concept, the computing method of p (c) are:
From formula (1), p (c) is along with the rising monotone increasing of c place level in body O, and p (Root)=1, as shown in Figure 5, and p (c
0)=1, p (c
1)=0.5;
If parent (c) is the father set of concept c in body O:
According to the acyclicity of H relation, easily know:
Wherein, | parent (c) | represent to gather the element number comprising in parent (c);
From information theory, if the frequency of a concept appearance is larger, the quantity of information that it comprises is just fewer; Otherwise if the frequency of a concept appearance is less, the quantity of information that it comprises is just more, therefore, the quantity of information that concept c comprises is:
I(c)=-log(p(c)) (4)
Therefore the quantity of information that, fillet c → parent (c) comprises is:
The length of fillet c → parent (c) is proportional to the quantity of information that it comprises:
length(c→parent(c))∝I(c→parent(c)) (6)
By formula (5) and formula (6), can be obtained:
length(c→parent(c))=β·(I(c→parent(c))) (7)
Wherein, β is constant, is a scale factor, in order to express conveniently, makes β=1, can obtain:
length(c→parent(c))=I(c→parent(c)) (8)
Due to
set up given any two concept c
1and c
2, establish LCA (c
1, c
2) be c
1and c
2the public ancestors of minimum in body O (Least Common Ancestor), LCA (c
1, c
2) be c
1and c
2, for any two concept nodes, necessarily there are the public ancestors of minimum in the concept node of depth capacity on common ancestor path, as shown in Figure 5, and LCA (c
3, c
9)=c
1, LCA (c
1, c
5)=c
0, i.e. the root concept node (Root) of body, therefore:
Definition 1 (access path): due to
set up, and H relation has acyclicity, c
1and c
2access path in body O has and only has one, is designated as path (c
1, c
2), and c
1to LCA (c
1, c
2) path and c
2to LCA (c
1, c
2) union in path is access path, be defined as:
path(c
1,c
2)=path(c
1,LCA(c
1,c
2))∪path(c
2,LCA(c
1,c
2)) (10)
Definition 2 (semantic distances): semantic distance has reflected the semantic association degree between two concepts, and the semantic distance between two concepts is defined as:
From formula (11), in fact the definition of semantic distance is comprised the information of two aspects here: on the one hand, the formation of domain body has determined LCA (c
1, c
2) position, this is when measuring similarity, within the inheritance between concept and concept, in body level, residing position has all been included in semantic distance to the impact of similarity; On the other hand, p (LCA (c
1, c
2)), p (c
1) and p (c
2) value all come from the statistical information of concept set,
From above-mentioned definition, with respect to different domain bodies, semantic distance between concept is different, and different concepts is with respect to same domain body, its semantic distance is also different, this computing method to semantic distance combine semantic relation between concept and the statistical information of objective fact, contribute to simulate more accurately the original appearance of objective world.
The integrated domain body that first need to be participated in creating by domain expert of semanteme of isomeric data of the present invention provides shared knowledge base, next needs ability to express to enrich and have the ontology describing language of certain logical reasoning ability, then by selecting rational mapping method, the Uniform semantic information that becomes integrated system to understand the Semantic Heterogeneous data-switching in different pieces of information source.In addition, system also should be supported semantic query, and has certain ease for use and extensibility.
This model can be realized the dynamic growth of data source, for newly-increased data source, only need to provide corresponding wrapper, builds corresponding local ontology, can improve dirigibility and the practicality of integrated system.With domain body, domain knowledge is described, local ontology is described the Heterogeneous Information knowledge in a certain field, and the mapping of setting up respectively mapping, local ontology and the data source of domain body and local ontology, this both interknited domain body, local ontology and data source, relatively independent again, can reduce the coupling of semantic integrated system.In order to realize semantic query and ease for use, in conjunction with society's mark and the complementary advantage of body in knowledge representation, user's semantic query request is inquired about to stipulations and expansion, and to Query Result duplicate removal and optimizing polymerization, finally return to user.
The foregoing is only preferred embodiment of the present invention, not in order to limit the present invention, all any modifications of doing within the spirit and principles in the present invention, be equal to and replace and improvement etc., within all should being included in protection scope of the present invention.
Claims (5)
1. the integrated method of multi-source heterogeneous data semantic building based on domain body, is characterized in that, the integrated method of multi-source heterogeneous data semantic that should build based on domain body comprises:
Utilize ontology construction, automatically, semi-automatically set up the local ontology of multi-source heterogeneous data, by ontology merging technique construction domain body, and set up the Semantic mapping relation between data source and local ontology, local ontology and domain body;
In conjunction with society's mark and the complementary advantage of body in knowledge representation, stipulations and expansion are inquired about in semantic query request to user, and the semantic query statement of generating standard, inquires about respectively a plurality of data sources, then by Query Result duplicate removal and optimizing polymerization, finally return to user.
2. the integrated model of multi-source heterogeneous data semantic building based on domain body, it is characterized in that, the multi-source heterogeneous data semantic integrated model that should build based on domain body comprises: local ontology builds module, ontology merging module and semantic query dynamic expansion and stipulations module;
Local ontology builds module, according to data source feature, selects adaptively body construction strategy, thereby constructs oil-gas exploration local ontology;
Ontology merging module, building module with local ontology is connected, the ontology merging method that employing combines concept matching and attributes match, utilize maximum information coefficient method to calculate the semantic similarity of Concept Semantic Similarity and concept attribute, realize a plurality of local ontology to the flexible merging of domain body;
Semantic query dynamic expansion and stipulations module, build module with local ontology and be connected, for the optimizing polymerization of validity and the result of inquiry request dynamic expansion.
3. the integrated model of multi-source heterogeneous data semantic building based on domain body as claimed in claim 2, is characterized in that, local ontology builds module, according to data source feature, by self-adaptation body construction strategy, carries out the structure of local ontology, specifically comprises:
Step 1, based on unstructured data sources, build local ontology:
First, applicating text filtrator changes into different file layouts into text-only file form, obtains language material data, and carries out consistency check; Then, adopt reverse maximum classification Chinese word cutting method to carry out preliminary cutting to these language materials and process, obtain word string set; Then, utilize maximum information coefficient method to calculate the internal bond strength of word string, obtain compound word set, and judge the domain-specific of compound word and non-compound word, extract concept set; Then, the classification relation on application drawing between Random Walk Algorithm fuzzy filtering word concept, adopts the clustering algorithm based on hidden Markov model to extract the classification relation between non-compound word concept; Then, use the method based on association rule mining to obtain the non-categorical relation between concept; Finally, the local ontology of applied ontology the build tool output OWL form;
Step 2, builds local ontology based on structured data source:
First, utilize the Semantic mapping relation between R2O technology building database pattern and ontology model, thereby be the concept in body the relationship map in relational database, attribute is mapped as to OWL attribute accordingly, and the relation table of database is converted into body class, the data in database are converted into example; Then, the initial local ontology extracting from database is done to a series of standardization work, by carrying out semantic similarity calculating with standard body, the ontology information that meets threshold value is set up to semantic relation, the ontology information that does not meet threshold value carries out standardization processing, thereby constructs satisfactory standardization local ontology;
Step 3, builds local ontology based on semi-structured data source
Due to semi-structured data between structuring and unstructured data, there is implicit structure but lack class data of fixing or strict structure; So the ontology construction based on above-mentioned two kinds of data types also can be applied to semi-structured data source; First, extract semi-structured data pattern, given mapping ruler, utilizes XML2RD method, and semi-structured data is converted into structural data; Then, according to structural data, build the local ontology corresponding to method construct semi-structured data source of local ontology.
4. the integrated model of multi-source heterogeneous data semantic building based on domain body as claimed in claim 2, is characterized in that, the method for ontology merging block merging is:
The ontology merging method that employing combines concept matching and attributes match, utilize maximum information coefficient method to calculate the semantic similarity of Concept Semantic Similarity and concept attribute, then, by similarity assessment function, the similarity between concept is assessed, output similar matrix, and use field axiom constraint knowledge further to assess its similarity to similar matrix; Then,, by the method training study sorter of machine learning, utilize learning classification device to calculate the similarity between concept example; Finally, by in conjunction with ISO15926 oil gas body and fuzzy formal concept analysis method, consider symmetry and the transitive relations of semantic similarity, fuzzy set theory is introduced in the setting of semantic similarity, realize a plurality of local ontology to the flexible merging of domain body.
5. the integrated model of multi-source heterogeneous data semantic building based on domain body as claimed in claim 2, is characterized in that, the concrete grammar that semantic query dynamic expansion and stipulations module realize is:
First, the conceptual relation and the inferential capability that by the semantic analysis of society mark and body, comprise, inquiry request is carried out to grammer and stipulations semantically and expansion, the semantic query statement of generating standard, solve between inquiry request and domain body data source different the caused mismatch problems due to expression-form, and automatically recommend the semantic respective labels of cluster according to user's inquiry request, for realizing data source, accurately assemble guiding is provided; Then, by the semantic similarity calculating between expanding query request and domain body concept, quantize the degree of association between request and resource concept; Finally, the Concept Semantic relation of enriching of utilizing society's mark and body to comprise, Query Result pattern is carried out to semantic annotations, according to the semanteme overall situation effect of society's mark, introducing is usingd the most relevant credible mark that statistic analysis result obtains data source pointed as one of Query Result reliability evaluation standard, result set is carried out to duplicate removal and optimizing polymerization, realize believable Top-K inquiry.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201410317211.8A CN104182454B (en) | 2014-07-04 | 2014-07-04 | The integrated model of multi-source heterogeneous data semantic based on domain body structure and method |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201410317211.8A CN104182454B (en) | 2014-07-04 | 2014-07-04 | The integrated model of multi-source heterogeneous data semantic based on domain body structure and method |
Publications (2)
Publication Number | Publication Date |
---|---|
CN104182454A true CN104182454A (en) | 2014-12-03 |
CN104182454B CN104182454B (en) | 2018-03-27 |
Family
ID=51963496
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201410317211.8A Expired - Fee Related CN104182454B (en) | 2014-07-04 | 2014-07-04 | The integrated model of multi-source heterogeneous data semantic based on domain body structure and method |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN104182454B (en) |
Cited By (54)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN104679823A (en) * | 2014-12-31 | 2015-06-03 | 智慧城市信息技术有限公司 | Semantic annotation-based association method and system of heterogeneous data |
CN105426358A (en) * | 2015-11-09 | 2016-03-23 | 中国农业大学 | Automatic disease noun identification method |
CN105677766A (en) * | 2015-12-30 | 2016-06-15 | 南京理工大学 | Algebraic-specification to ontology-description conversion and assessment method of Web service semantics |
CN105808853A (en) * | 2016-03-09 | 2016-07-27 | 哈尔滨工程大学 | Engineering application oriented body establishment management and body data automatic obtaining method |
CN106021306A (en) * | 2016-05-05 | 2016-10-12 | 上海交通大学 | Ontology matching based case search system |
CN106055560A (en) * | 2016-05-18 | 2016-10-26 | 上海申腾信息技术有限公司 | Method for collecting data of word segmentation dictionary based on statistical machine learning method |
CN106296343A (en) * | 2016-08-01 | 2017-01-04 | 王四春 | A kind of e-commerce transaction monitoring method based on the Internet and big data |
CN106302680A (en) * | 2016-08-06 | 2017-01-04 | 内蒙古大学 | A kind of data based on Internet of Things display background system |
CN106407208A (en) * | 2015-07-29 | 2017-02-15 | 清华大学 | Establishment method and system for city management ontology knowledge base |
CN106777372A (en) * | 2017-01-26 | 2017-05-31 | 语义(上海)信息科技有限公司 | A kind of honeybee stream device data water conservancy diversion and data method for transformation based on Ontology on Semantic Web |
CN106779146A (en) * | 2016-11-15 | 2017-05-31 | 广州铁路职业技术学院 | A kind of tourism service system for providing recommendation tourism route |
CN107016630A (en) * | 2017-02-09 | 2017-08-04 | 湖南第师范学院 | A kind of novel English teaching learns language system |
CN107045545A (en) * | 2017-03-30 | 2017-08-15 | 山东省农业科学院 | A kind of peanut cultivation information database constructing system |
CN107209754A (en) * | 2014-12-10 | 2017-09-26 | 凯恩迪股份有限公司 | Technology and semantic signal processing in large-scale unstructured data field |
CN107330007A (en) * | 2017-06-12 | 2017-11-07 | 南京邮电大学 | A kind of Method for Ontology Learning based on multi-data source |
CN107357933A (en) * | 2017-08-04 | 2017-11-17 | 刘应波 | A kind of label for multi-source heterogeneous science and technology information resource describes method and apparatus |
CN107491476A (en) * | 2017-06-29 | 2017-12-19 | 中国科学院计算机网络信息中心 | A kind of data model translation and query analysis method suitable for a variety of big data management systems |
CN107545046A (en) * | 2017-08-17 | 2018-01-05 | 北京奇安信科技有限公司 | A kind of fusion method and device of multi-source heterogeneous data |
CN107679484A (en) * | 2017-09-28 | 2018-02-09 | 辽宁工程技术大学 | A kind of Remote Sensing Target automatic detection and recognition methods based on cloud computing storage |
CN107704480A (en) * | 2016-08-08 | 2018-02-16 | 百度(美国)有限责任公司 | Extension and the method and system and computer media for strengthening knowledge graph |
CN107808001A (en) * | 2017-11-13 | 2018-03-16 | 哈尔滨工业大学 | Towards the Mode integrating method and device of magnanimity isomeric data |
CN107958086A (en) * | 2017-12-18 | 2018-04-24 | 北京睿力科技有限公司 | The multi-source heterogeneous database data for solving data semantic Heterogeneity integrates method |
CN108446293A (en) * | 2018-01-22 | 2018-08-24 | 中电海康集团有限公司 | A method of based on urban multi-source isomeric data structure city portrait |
CN108509486A (en) * | 2018-02-08 | 2018-09-07 | 浙江大学 | A kind of safe big data structural management method of intelligent plant multi-source |
CN108536796A (en) * | 2018-04-02 | 2018-09-14 | 北京大学 | A kind of isomery Ontology Matching method and system based on figure |
CN108614821A (en) * | 2016-12-09 | 2018-10-02 | 中国地质调查局发展研究中心 | Geologic information interconnection mutually looks into system |
CN108710663A (en) * | 2018-05-14 | 2018-10-26 | 北京大学 | A kind of data matching method and system based on ontology model |
CN109063114A (en) * | 2018-07-27 | 2018-12-21 | 华南理工大学广州学院 | Heterogeneous data integrating method, device, terminal and the storage medium of energy cloud platform |
CN109271638A (en) * | 2018-10-08 | 2019-01-25 | 东北石油大学 | A kind of personalized Semantic Integration and system based on data space |
CN109541995A (en) * | 2018-11-01 | 2019-03-29 | 湖南城市学院 | A kind of Intelligent indoor finishing absorption dedusting control system |
CN109587125A (en) * | 2018-11-23 | 2019-04-05 | 南方电网科学研究院有限责任公司 | Network security big data analysis method, system and related device |
CN109597925A (en) * | 2018-10-25 | 2019-04-09 | 同济大学 | A kind of supplier data analysis method and analysis system based on ontology |
CN109643308A (en) * | 2016-08-23 | 2019-04-16 | 伊路米纳有限公司 | Determine the semantic distance system and method for relevant ontology data |
CN109635119A (en) * | 2018-10-25 | 2019-04-16 | 同济大学 | A kind of industrial big data integrated system based on ontology fusion |
CN109766906A (en) * | 2018-11-16 | 2019-05-17 | 中国人民解放军海军大连舰艇学院 | Naval battle field situation data fusion method and system based on occurrence diagram |
CN110008288A (en) * | 2019-02-19 | 2019-07-12 | 武汉烽火技术服务有限公司 | The construction method in the knowledge mapping library for Analysis of Network Malfunction and its application |
CN110175197A (en) * | 2019-05-23 | 2019-08-27 | 长沙学院 | A kind of body constructing method and system based on semantic Internet of Things |
CN110197280A (en) * | 2019-05-20 | 2019-09-03 | 中国银行股份有限公司 | A kind of knowledge mapping construction method, apparatus and system |
CN110275919A (en) * | 2019-06-18 | 2019-09-24 | 合肥工业大学 | Data integrating method and device |
CN110489395A (en) * | 2019-07-27 | 2019-11-22 | 西南电子技术研究所(中国电子科技集团公司第十研究所) | Automatically the method for multi-source heterogeneous data knowledge is obtained |
CN110866332A (en) * | 2019-10-29 | 2020-03-06 | 中国电子科技集团公司第三十八研究所 | Complex cable assembly assembling method and system |
CN111400458A (en) * | 2018-12-27 | 2020-07-10 | 上海智臻智能网络科技股份有限公司 | Automatic generalization method and device |
CN111401988A (en) * | 2020-03-02 | 2020-07-10 | 重庆邮电大学 | Product configuration demand response system based on semantics and order generation method |
CN111858948A (en) * | 2019-04-30 | 2020-10-30 | 杭州海康威视数字技术股份有限公司 | Ontology construction method and device, electronic equipment and storage medium |
CN111858649A (en) * | 2020-08-05 | 2020-10-30 | 哈尔滨工业大学(威海) | Heterogeneous data fusion method based on ontology mapping |
CN112000725A (en) * | 2020-08-28 | 2020-11-27 | 哈尔滨工业大学 | Ontology fusion pretreatment method for multi-source heterogeneous resources |
CN112328623A (en) * | 2020-11-06 | 2021-02-05 | 昆山数字城市信息技术有限公司 | Multi-source heterogeneous data management method based on mixed ontology mode |
CN112667606A (en) * | 2021-01-15 | 2021-04-16 | 中国科学院空天信息创新研究院 | Knowledge base system based on multi-source knowledge acquisition technology and construction method thereof |
CN112908441A (en) * | 2021-03-04 | 2021-06-04 | 文华学院 | Data processing method and device for medical platform and processing equipment |
CN113111503A (en) * | 2021-04-01 | 2021-07-13 | 重庆传晟酷德大数据科技有限公司 | Multi-source heterogeneous data construction method based on CAD |
CN113987131A (en) * | 2021-11-11 | 2022-01-28 | 江苏天汇空间信息研究院有限公司 | Heterogeneous multi-source data correlation analysis system and method |
CN114385819A (en) * | 2022-03-23 | 2022-04-22 | 湖南工商大学 | Environment judicial domain ontology construction method and device and related equipment |
CN115082010A (en) * | 2022-06-02 | 2022-09-20 | 云南电网有限责任公司信息中心 | Intelligent management method, storage medium and system for metadata in power field |
CN115098931A (en) * | 2022-07-20 | 2022-09-23 | 江苏艾佳家居用品有限公司 | Small sample analysis method for mining personalized requirements of indoor design of user |
Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN101582073A (en) * | 2008-12-31 | 2009-11-18 | 北京中机科海科技发展有限公司 | Intelligent retrieval system and method based on domain ontology |
CN102542027A (en) * | 2011-12-22 | 2012-07-04 | 北京航空航天大学深圳研究院 | Construction method of data integration system for studying ontology based on relation schema |
CN102682122A (en) * | 2012-05-15 | 2012-09-19 | 北京科技大学 | Method for constructing semantic data model for material science field based on ontology |
US8504511B2 (en) * | 2009-08-05 | 2013-08-06 | Fujifilm Medical Systems Usa, Inc. | System and method for providing localization of radiological information utilizing radiological domain ontology |
US8666982B2 (en) * | 2011-10-06 | 2014-03-04 | GM Global Technology Operations LLC | Method and system to augment vehicle domain ontologies for vehicle diagnosis |
-
2014
- 2014-07-04 CN CN201410317211.8A patent/CN104182454B/en not_active Expired - Fee Related
Patent Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN101582073A (en) * | 2008-12-31 | 2009-11-18 | 北京中机科海科技发展有限公司 | Intelligent retrieval system and method based on domain ontology |
US8504511B2 (en) * | 2009-08-05 | 2013-08-06 | Fujifilm Medical Systems Usa, Inc. | System and method for providing localization of radiological information utilizing radiological domain ontology |
US8666982B2 (en) * | 2011-10-06 | 2014-03-04 | GM Global Technology Operations LLC | Method and system to augment vehicle domain ontologies for vehicle diagnosis |
CN102542027A (en) * | 2011-12-22 | 2012-07-04 | 北京航空航天大学深圳研究院 | Construction method of data integration system for studying ontology based on relation schema |
CN102682122A (en) * | 2012-05-15 | 2012-09-19 | 北京科技大学 | Method for constructing semantic data model for material science field based on ontology |
Non-Patent Citations (3)
Title |
---|
RASHMI CHAUHAN.ETC: "Domain Ontology based Semantic Search for Efficient Information Retrieval through Automatic Query Expansion", 《2013 INTERNATIONAL CONFERENCE ON INTELLIGENT SYSTEMS AND SIGNAL PROCESSING 》 * |
XUAN TIAN.ETC: "Measuring Semantic Associaiton in Domain Ontology", 《 THIRD INTERNATIONAL CONFERENCE ON SEMANTICS, KNOWLEDGE AND GRID》 * |
李鹏: "面向地质勘查的多源异构数据集成关键技术研究", 《中国博士学位论文全文数据库》 * |
Cited By (78)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN107209754A (en) * | 2014-12-10 | 2017-09-26 | 凯恩迪股份有限公司 | Technology and semantic signal processing in large-scale unstructured data field |
CN107209754B (en) * | 2014-12-10 | 2021-07-13 | 凯恩迪股份有限公司 | Techniques and semantic signal processing in large unstructured data fields |
CN104679823A (en) * | 2014-12-31 | 2015-06-03 | 智慧城市信息技术有限公司 | Semantic annotation-based association method and system of heterogeneous data |
CN106407208A (en) * | 2015-07-29 | 2017-02-15 | 清华大学 | Establishment method and system for city management ontology knowledge base |
CN106407208B (en) * | 2015-07-29 | 2019-06-18 | 清华大学 | A kind of construction method and system of city management ontology knowledge base |
CN105426358A (en) * | 2015-11-09 | 2016-03-23 | 中国农业大学 | Automatic disease noun identification method |
CN105426358B (en) * | 2015-11-09 | 2018-08-31 | 中国农业大学 | A kind of disease noun automatic identifying method for magnanimity news |
CN105677766A (en) * | 2015-12-30 | 2016-06-15 | 南京理工大学 | Algebraic-specification to ontology-description conversion and assessment method of Web service semantics |
CN105808853A (en) * | 2016-03-09 | 2016-07-27 | 哈尔滨工程大学 | Engineering application oriented body establishment management and body data automatic obtaining method |
CN105808853B (en) * | 2016-03-09 | 2018-08-17 | 哈尔滨工程大学 | A kind of ontological construction management of Engineering Oriented application and ontology data automatic obtaining method |
CN106021306B (en) * | 2016-05-05 | 2019-03-15 | 上海交通大学 | Case retrieval system based on Ontology Matching |
CN106021306A (en) * | 2016-05-05 | 2016-10-12 | 上海交通大学 | Ontology matching based case search system |
CN106055560A (en) * | 2016-05-18 | 2016-10-26 | 上海申腾信息技术有限公司 | Method for collecting data of word segmentation dictionary based on statistical machine learning method |
CN106296343A (en) * | 2016-08-01 | 2017-01-04 | 王四春 | A kind of e-commerce transaction monitoring method based on the Internet and big data |
CN106302680A (en) * | 2016-08-06 | 2017-01-04 | 内蒙古大学 | A kind of data based on Internet of Things display background system |
CN107704480B (en) * | 2016-08-08 | 2021-06-11 | 百度(美国)有限责任公司 | Method and system for extending and reinforcing knowledge graph and computer medium |
CN107704480A (en) * | 2016-08-08 | 2018-02-16 | 百度(美国)有限责任公司 | Extension and the method and system and computer media for strengthening knowledge graph |
CN109643308B (en) * | 2016-08-23 | 2023-03-28 | 伊路米纳有限公司 | Semantic distance system and method for determining related ontology data |
CN109643308A (en) * | 2016-08-23 | 2019-04-16 | 伊路米纳有限公司 | Determine the semantic distance system and method for relevant ontology data |
CN106779146A (en) * | 2016-11-15 | 2017-05-31 | 广州铁路职业技术学院 | A kind of tourism service system for providing recommendation tourism route |
CN108614821B (en) * | 2016-12-09 | 2020-10-13 | 中国地质调查局发展研究中心 | Geological data interconnection and mutual-checking system |
CN108614821A (en) * | 2016-12-09 | 2018-10-02 | 中国地质调查局发展研究中心 | Geologic information interconnection mutually looks into system |
CN106777372B (en) * | 2017-01-26 | 2019-08-27 | 语义(上海)信息科技有限公司 | A kind of bee stream device data water conservancy diversion and data method for transformation based on Ontology on Semantic Web |
CN106777372A (en) * | 2017-01-26 | 2017-05-31 | 语义(上海)信息科技有限公司 | A kind of honeybee stream device data water conservancy diversion and data method for transformation based on Ontology on Semantic Web |
CN107016630A (en) * | 2017-02-09 | 2017-08-04 | 湖南第师范学院 | A kind of novel English teaching learns language system |
CN107016630B (en) * | 2017-02-09 | 2023-09-01 | 湖南第一师范学院 | Novel english teaching language learning system |
CN107045545A (en) * | 2017-03-30 | 2017-08-15 | 山东省农业科学院 | A kind of peanut cultivation information database constructing system |
CN107330007A (en) * | 2017-06-12 | 2017-11-07 | 南京邮电大学 | A kind of Method for Ontology Learning based on multi-data source |
CN107491476B (en) * | 2017-06-29 | 2021-01-12 | 中国科学院计算机网络信息中心 | Data model conversion and query analysis method suitable for various big data management systems |
CN107491476A (en) * | 2017-06-29 | 2017-12-19 | 中国科学院计算机网络信息中心 | A kind of data model translation and query analysis method suitable for a variety of big data management systems |
CN107357933A (en) * | 2017-08-04 | 2017-11-17 | 刘应波 | A kind of label for multi-source heterogeneous science and technology information resource describes method and apparatus |
CN107545046B (en) * | 2017-08-17 | 2021-05-25 | 北京奇安信科技有限公司 | Fusion method and device for multi-source heterogeneous data |
CN107545046A (en) * | 2017-08-17 | 2018-01-05 | 北京奇安信科技有限公司 | A kind of fusion method and device of multi-source heterogeneous data |
CN107679484A (en) * | 2017-09-28 | 2018-02-09 | 辽宁工程技术大学 | A kind of Remote Sensing Target automatic detection and recognition methods based on cloud computing storage |
CN107808001A (en) * | 2017-11-13 | 2018-03-16 | 哈尔滨工业大学 | Towards the Mode integrating method and device of magnanimity isomeric data |
CN107808001B (en) * | 2017-11-13 | 2019-12-06 | 哈尔滨工业大学 | Massive heterogeneous data oriented mode integration method and device |
CN107958086A (en) * | 2017-12-18 | 2018-04-24 | 北京睿力科技有限公司 | The multi-source heterogeneous database data for solving data semantic Heterogeneity integrates method |
CN108446293B (en) * | 2018-01-22 | 2020-12-15 | 中电海康集团有限公司 | Method for constructing city portrait based on city multi-source heterogeneous data |
CN108446293A (en) * | 2018-01-22 | 2018-08-24 | 中电海康集团有限公司 | A method of based on urban multi-source isomeric data structure city portrait |
CN108509486A (en) * | 2018-02-08 | 2018-09-07 | 浙江大学 | A kind of safe big data structural management method of intelligent plant multi-source |
CN108536796A (en) * | 2018-04-02 | 2018-09-14 | 北京大学 | A kind of isomery Ontology Matching method and system based on figure |
CN108710663A (en) * | 2018-05-14 | 2018-10-26 | 北京大学 | A kind of data matching method and system based on ontology model |
CN109063114B (en) * | 2018-07-27 | 2020-11-24 | 华南理工大学广州学院 | Heterogeneous data integration method and device for energy cloud platform, terminal and storage medium |
CN109063114A (en) * | 2018-07-27 | 2018-12-21 | 华南理工大学广州学院 | Heterogeneous data integrating method, device, terminal and the storage medium of energy cloud platform |
CN109271638B (en) * | 2018-10-08 | 2023-01-06 | 东北石油大学 | Personalized semantic integration method and system based on data space |
CN109271638A (en) * | 2018-10-08 | 2019-01-25 | 东北石油大学 | A kind of personalized Semantic Integration and system based on data space |
CN109597925A (en) * | 2018-10-25 | 2019-04-09 | 同济大学 | A kind of supplier data analysis method and analysis system based on ontology |
CN109635119B (en) * | 2018-10-25 | 2023-08-04 | 同济大学 | Industrial big data integration system based on ontology fusion |
CN109635119A (en) * | 2018-10-25 | 2019-04-16 | 同济大学 | A kind of industrial big data integrated system based on ontology fusion |
CN109541995A (en) * | 2018-11-01 | 2019-03-29 | 湖南城市学院 | A kind of Intelligent indoor finishing absorption dedusting control system |
CN109766906A (en) * | 2018-11-16 | 2019-05-17 | 中国人民解放军海军大连舰艇学院 | Naval battle field situation data fusion method and system based on occurrence diagram |
CN109587125A (en) * | 2018-11-23 | 2019-04-05 | 南方电网科学研究院有限责任公司 | Network security big data analysis method, system and related device |
CN111400458A (en) * | 2018-12-27 | 2020-07-10 | 上海智臻智能网络科技股份有限公司 | Automatic generalization method and device |
CN110008288A (en) * | 2019-02-19 | 2019-07-12 | 武汉烽火技术服务有限公司 | The construction method in the knowledge mapping library for Analysis of Network Malfunction and its application |
CN111858948A (en) * | 2019-04-30 | 2020-10-30 | 杭州海康威视数字技术股份有限公司 | Ontology construction method and device, electronic equipment and storage medium |
CN110197280A (en) * | 2019-05-20 | 2019-09-03 | 中国银行股份有限公司 | A kind of knowledge mapping construction method, apparatus and system |
CN110197280B (en) * | 2019-05-20 | 2021-08-06 | 中国银行股份有限公司 | Knowledge graph construction method, device and system |
CN110175197A (en) * | 2019-05-23 | 2019-08-27 | 长沙学院 | A kind of body constructing method and system based on semantic Internet of Things |
CN110275919A (en) * | 2019-06-18 | 2019-09-24 | 合肥工业大学 | Data integrating method and device |
CN110489395A (en) * | 2019-07-27 | 2019-11-22 | 西南电子技术研究所(中国电子科技集团公司第十研究所) | Automatically the method for multi-source heterogeneous data knowledge is obtained |
CN110489395B (en) * | 2019-07-27 | 2022-07-29 | 西南电子技术研究所(中国电子科技集团公司第十研究所) | Method for automatically acquiring knowledge of multi-source heterogeneous data |
CN110866332A (en) * | 2019-10-29 | 2020-03-06 | 中国电子科技集团公司第三十八研究所 | Complex cable assembly assembling method and system |
CN111401988A (en) * | 2020-03-02 | 2020-07-10 | 重庆邮电大学 | Product configuration demand response system based on semantics and order generation method |
CN111858649A (en) * | 2020-08-05 | 2020-10-30 | 哈尔滨工业大学(威海) | Heterogeneous data fusion method based on ontology mapping |
CN111858649B (en) * | 2020-08-05 | 2022-06-17 | 哈尔滨工业大学(威海) | Heterogeneous data fusion method based on ontology mapping |
CN112000725A (en) * | 2020-08-28 | 2020-11-27 | 哈尔滨工业大学 | Ontology fusion pretreatment method for multi-source heterogeneous resources |
CN112000725B (en) * | 2020-08-28 | 2023-03-21 | 哈尔滨工业大学 | Ontology fusion preprocessing method for multi-source heterogeneous resources |
CN112328623A (en) * | 2020-11-06 | 2021-02-05 | 昆山数字城市信息技术有限公司 | Multi-source heterogeneous data management method based on mixed ontology mode |
CN112667606A (en) * | 2021-01-15 | 2021-04-16 | 中国科学院空天信息创新研究院 | Knowledge base system based on multi-source knowledge acquisition technology and construction method thereof |
CN112908441A (en) * | 2021-03-04 | 2021-06-04 | 文华学院 | Data processing method and device for medical platform and processing equipment |
CN113111503A (en) * | 2021-04-01 | 2021-07-13 | 重庆传晟酷德大数据科技有限公司 | Multi-source heterogeneous data construction method based on CAD |
CN113111503B (en) * | 2021-04-01 | 2024-03-05 | 重庆传晟酷德大数据科技有限公司 | CAD-based multi-source heterogeneous data construction method |
CN113987131A (en) * | 2021-11-11 | 2022-01-28 | 江苏天汇空间信息研究院有限公司 | Heterogeneous multi-source data correlation analysis system and method |
CN114385819A (en) * | 2022-03-23 | 2022-04-22 | 湖南工商大学 | Environment judicial domain ontology construction method and device and related equipment |
CN115082010A (en) * | 2022-06-02 | 2022-09-20 | 云南电网有限责任公司信息中心 | Intelligent management method, storage medium and system for metadata in power field |
CN115082010B (en) * | 2022-06-02 | 2024-05-24 | 云南电网有限责任公司信息中心 | Intelligent management method, storage medium and system for metadata in electric power field |
CN115098931A (en) * | 2022-07-20 | 2022-09-23 | 江苏艾佳家居用品有限公司 | Small sample analysis method for mining personalized requirements of indoor design of user |
CN115098931B (en) * | 2022-07-20 | 2022-12-16 | 江苏艾佳家居用品有限公司 | Small sample analysis method for mining personalized requirements of indoor design of user |
Also Published As
Publication number | Publication date |
---|---|
CN104182454B (en) | 2018-03-27 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN104182454A (en) | Multi-source heterogeneous data semantic integration model constructed based on domain ontology and method | |
CN104699719B (en) | A kind of semantization method of internet-of-things terminal equipment | |
CN102156726B (en) | Geographic element querying and extending method based on semantic similarity | |
CN101093559B (en) | Method for constructing expert system based on knowledge discovery | |
CN109710701A (en) | A kind of automated construction method for public safety field big data knowledge mapping | |
CN104572833B (en) | A kind of mapping ruler creation method and device | |
CN103729432A (en) | Method for analyzing and sequencing academic influence of theme literature in citation database | |
CN106370631B (en) | A kind of automatic assay of sepectrophotofluorometer and data acquisition and recording method | |
CN101667201A (en) | Integration method of Deep Web query interface based on tree merging | |
CN104915396A (en) | Knowledge retrieving method | |
Ren et al. | Long-Term Preservation of Electronic Record Based on Digital Continuity in Smart Cities. | |
CN108416524A (en) | Estate planning based on a figure general framework refines deciphering method | |
Orlandi et al. | Interlinking heterogeneous data for smart energy systems | |
CN102737125B (en) | Web temporal object model-based outdated webpage information automatic discovering method | |
Ceci et al. | Spatial associative classification: propositional vs structural approach | |
CN113254517A (en) | Service providing method based on internet big data | |
Li | Research and analysis of semantic search technology based on knowledge graph | |
CN107045545A (en) | A kind of peanut cultivation information database constructing system | |
CN104598548A (en) | Method and device for analyzing spatial correlation of agricultural product price | |
CN111475604A (en) | Data processing method and device | |
CN107945871A (en) | A kind of blood disease intelligent classification system based on big data | |
Guan et al. | Application prospect of knowledge graph technology in knowledge management of oil and gas exploration and development | |
Xiao et al. | Integration of heterogeneous agriculture information system based on interoperation of domain ontology | |
Guo et al. | Design and implementation of domain ontology-based oilfield non-metallic pipe information retrieval system | |
CN117934209B (en) | Regional power system carbon emission big data analysis method based on knowledge graph |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
C10 | Entry into substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant | ||
CF01 | Termination of patent right due to non-payment of annual fee |
Granted publication date: 20180327 |
|
CF01 | Termination of patent right due to non-payment of annual fee |