CN104182454A - Multi-source heterogeneous data semantic integration model constructed based on domain ontology and method - Google Patents

Multi-source heterogeneous data semantic integration model constructed based on domain ontology and method Download PDF

Info

Publication number
CN104182454A
CN104182454A CN201410317211.8A CN201410317211A CN104182454A CN 104182454 A CN104182454 A CN 104182454A CN 201410317211 A CN201410317211 A CN 201410317211A CN 104182454 A CN104182454 A CN 104182454A
Authority
CN
China
Prior art keywords
semantic
ontology
data
concept
local
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201410317211.8A
Other languages
Chinese (zh)
Other versions
CN104182454B (en
Inventor
葛继科
杨治明
陈祖琴
刘兴华
裴仰军
王成敏
黄永文
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Chongqing University of Science and Technology
Original Assignee
Chongqing University of Science and Technology
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Chongqing University of Science and Technology filed Critical Chongqing University of Science and Technology
Priority to CN201410317211.8A priority Critical patent/CN104182454B/en
Publication of CN104182454A publication Critical patent/CN104182454A/en
Application granted granted Critical
Publication of CN104182454B publication Critical patent/CN104182454B/en
Expired - Fee Related legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/35Clustering; Classification
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/21Design, administration or maintenance of databases
    • G06F16/211Schema design and management
    • G06F16/212Schema design and management with details for data modelling support
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/24Querying
    • G06F16/242Query formulation
    • G06F16/243Natural language query formulation
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/25Integrating or interfacing systems involving database management systems
    • G06F16/256Integrating or interfacing systems involving database management systems in federated or virtual databases
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/80Information retrieval; Database structures therefor; File system structures therefor of semi-structured data, e.g. markup language structured data such as SGML, XML or HTML
    • G06F16/83Querying
    • G06F16/835Query processing
    • G06F16/8358Query translation

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Databases & Information Systems (AREA)
  • Physics & Mathematics (AREA)
  • Data Mining & Analysis (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Mathematical Physics (AREA)
  • Artificial Intelligence (AREA)
  • Computational Linguistics (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
  • Machine Translation (AREA)

Abstract

The invention discloses a multi-source heterogeneous data semantic integrated model constructed based on domain ontology. The multi-source heterogeneous data semantic integration model comprises a local ontology constructing module, a domain ontology merging module and a semantic inquiring dynamic propagation and protocol module. A multi-source heterogeneous data semantic integrating method comprises the following steps: constructing the domain ontology through an ontology merging technique and establishing a semantic mapping relation between a data source and the local ontology as well as between the local ontology and the domain ontology; combining the complementary advantage of social label and ontology on knowledge representation and performing inquiring protocol and propagation for the semantic inquiring request of the user, thereby generating a formal semantic inquiring statement; respectively inquiring a plurality of data sources, performing duplicate removal and aggregation optimization on the inquiring result, and lastly feeding back to the user. The invention provides an aggregation heterogeneous data semantic integration model constructed based on domain ontology and the method through the construction and mapping of the domain ontology, semantic inquiring propagation and result aggregation optimization.

Description

The integrated models and methods of multi-source heterogeneous data semantic building based on domain body
Technical field
The crossing domain that the invention belongs to the semantic integrated and oil-gas exploration Knowledge Discovery of isomeric data, relates in particular to a kind of multi-source heterogeneous data semantic integrated model and method building based on domain body.
Background technology
Along with industrialization and information-based deepening continuously of merging, oil-gas exploration enterprise in long-term production run, accumulated enrich full and accurate such as reconnoitring, the data such as earthquake, well logging.These are containing abundant geological information with different structure, the data that are stored in diverse location, have obvious potential using value, are the important evidence of the following exploration and development of oil gas field, are important sci-tech innovation resources.Utilize method and the technology such as Knowledge Discovery, artificial intelligence, from the existing oil-gas exploration data that pile up like a mountain, find out effective, novel, that there is potential utility, final intelligible rule and pattern, and integrate the large scale scale heterogeneous data resource of this class and realize data semantic and share, can provide more complete and reliable data, services support for decision maker, thereby instruct the technology decision-making of oil-gas exploration enterprise formulation science, for enterprise creates huge economic benefit.This is from " management data ", to rise to the effective way of " managerial knowledge ", is also important topic and the direction of current and following each field in-depth Information System configuration development.
The informatization of petroleum exploration domain starts from six the seventies, because the specialized fields relating to is many, each specialty all adopts the standard of sealing, do not have common standard to follow, therefore caused the phenomenon that between different departments and system, information sharing difficulty, a large amount of " information island " exist, for the integrated application of petroleum exploration domain data with share and brought challenge.For the difficulty facing in oil-gas exploration Data management and utilizaition, Chinese scholars has proposed some active data integrated approaches and has developed corresponding system.But these methods have still adopted the integrated technology of traditional database and middleware Technology generally, lack real-time semantic support, at the aspects such as dynamic mapping of inquiry request semantic extension, resource, have many limitations, and accuracy rate and efficiency are not satisfactory.
The data relevant to oil-gas exploration comprise underground tectonic structure, lithology, physical property, electrically, the raw storage lid of oil gas etc.These data are from different departments and area, various informative (comprising the image, video, text of relational database, GIS data, CAD figure, various forms etc.), and there is different structure type (comprise structurized, semi-structured with non-structured), be difficult to the multi-source heterogeneous data of this class to manage uniformly integrated and shared with semantic hierarchies.The more important thing is, increasing hydrocarbon resources can the exploration information based on existing be explored and develop, they have become the important data assets in oil company, to improving accuracy and the correct decision-making of auxiliary formulation of the oil-gas exploration level of information technology, data interpretation, all significant, effective utilization of these data, also has important effect to improving the economic benefit of oil company.In addition, the change of decision maker's mode of thinking is also for the knowledge architecture of data has been established ideal basis, the management philosophy of " buy but not build " is advocated in modern data management, i.e. competition is not placed on and how obtains and to occupy in data, but is placed in the careful analysis of data and Study on Interpretation.Therefore, from these multi-source heterogeneous oil-gas exploration data, carrying out the unified of knowledge builds and semantic integrated having great importance.
Body, as a kind of clear and definite Formal Specification of shared ideas model, has good concept hierarchy and reasoning from logic tenability, be applicable to describe very much domain knowledge standard, and to the semanteme of isomeric data, integrated and inquiry has very strong supporting role.Body is introduced in oil-gas exploration data integration, contributed to solve the difficulties that it lacks the aspects such as semantic support and standardization in data integration.Therefore, ontology is incorporated into oil-gas exploration Knowledge Discovery and knowledge engineering field, becoming " data management " is " information management ", contributes to solve the problem that the aspects such as current oil-gas exploration data sharing degree is low, Semantic Heterogeneous exist.In order to seek the breakthrough in the integrated theory of oil-gas exploration data semantic and method, oil-gas exploration data semantic integration mechanism and model that foundation builds based on domain body, can effectively improve the value of available data, the science decision of petroleum engineering relevant departments will be had to important theory significance and practical value.
Summary of the invention
The object of the embodiment of the present invention is to provide a kind of integrated models and methods of multi-source heterogeneous data semantic building based on domain body, is intended to solve the integrated and shared problem of semanteme between oil-gas exploration isomeric data.
The embodiment of the present invention is achieved in that a kind of integrated method of multi-source heterogeneous data semantic building based on domain body, and the integrated method of multi-source heterogeneous data semantic that should build based on domain body comprises:
Utilize ontology construction, automatically, semi-automatically set up the local ontology of multi-source heterogeneous data, by ontology merging technique construction domain body, and set up the Semantic mapping relation between data source and local ontology, local ontology and domain body;
In conjunction with society's mark and the complementary advantage of body in knowledge representation, stipulations and expansion are inquired about in semantic query request to user, and the semantic query statement of generating standard, inquires about respectively a plurality of data sources, then by Query Result duplicate removal and optimizing polymerization, finally return to user.
Another object of the present invention is to provide a kind of integrated model of multi-source heterogeneous data semantic building based on domain body, the multi-source heterogeneous data semantic integrated model that should build based on domain body comprises: local ontology builds module, ontology merging module and semantic query dynamic expansion and stipulations module;
Local ontology builds module, according to data source feature, selects adaptively body construction strategy, thereby constructs oil-gas exploration local ontology;
Ontology merging module, building module with local ontology is connected, the ontology merging method that employing combines concept matching and attributes match, utilize maximum information coefficient (MIC) method to calculate the semantic similarity of Concept Semantic Similarity and concept attribute, realize a plurality of local ontology to the flexible merging of domain body;
Semantic query dynamic expansion and stipulations module, build module with local ontology and be connected, for the optimizing polymerization of validity and the result of inquiry request dynamic expansion.
Further, local ontology builds module, according to data source feature, by self-adaptation body construction strategy, carries out the structure of local ontology, specifically comprises:
Step 1, based on unstructured data sources, build local ontology:
First, applicating text filtrator changes into different file layouts into text-only file form, obtains language material data, and carries out consistency check; Then, adopt reverse maximum classification Chinese word cutting method to carry out preliminary cutting to these language materials and process, obtain word string set; Then, utilize maximum information coefficient (MIC) method to calculate the internal bond strength of word string, obtain compound word set, and judge the domain-specific of compound word and non-compound word, extract concept set; Then, the classification relation on application drawing between Random Walk Algorithm fuzzy filtering word concept, adopts the clustering algorithm based on hidden Markov model (HMM) to extract the classification relation between non-compound word concept; Then, use the method based on association rule mining to obtain the non-categorical relation between concept; Finally, the local ontology of applied ontology the build tool output OWL form;
Step 2, builds local ontology based on structured data source:
First, utilize the Semantic mapping relation between R2O technology building database pattern and ontology model, thereby be the concept in body the relationship map in relational database, attribute is mapped as to OWL attribute accordingly, and the relation table of database is converted into body class, the data in database are converted into example; Then, the initial local ontology extracting from database is done to a series of standardization work, by carrying out semantic similarity calculating with standard body, the ontology information that meets threshold value is set up to semantic relation, the ontology information that does not meet threshold value carries out standardization processing, thereby constructs satisfactory standardization local ontology;
Step 3, builds local ontology based on semi-structured data source
Due to semi-structured data between structuring and unstructured data, there is implicit structure but lack class data of fixing or strict structure, so, ontology construction based on above-mentioned two kinds of data types also can be applied to semi-structured data source, first, extract semi-structured data pattern, given mapping ruler, utilizes XML2RD method, and semi-structured data is converted into structural data; Then, according to structural data, build the local ontology corresponding to method construct semi-structured data source of local ontology.
Further, the method for ontology merging block merging is:
The ontology merging method that employing combines concept matching and attributes match, utilize maximum information coefficient MIC method to calculate the semantic similarity of Concept Semantic Similarity and concept attribute, then, by similarity assessment function, the similarity between concept is assessed, output similar matrix, and use field axiom constraint knowledge further to assess its similarity to similar matrix; Then,, by the method training study sorter of machine learning, utilize learning classification device to calculate the similarity between concept example; Finally, by in conjunction with ISO15926 oil gas body and fuzzy formal concept analysis method, consider symmetry and the transitive relations of semantic similarity, fuzzy set theory is introduced in the setting of semantic similarity, realize a plurality of local ontology to the flexible merging of domain body.
Further, the concrete grammar that semantic query dynamic expansion and stipulations module realize is:
First, the conceptual relation and the inferential capability that by the semantic analysis of society mark and body, comprise, inquiry request is carried out to grammer and stipulations semantically and expansion, the semantic query statement of generating standard, solve between inquiry request and domain body data source different the caused mismatch problems due to expression-form, and automatically recommend the semantic respective labels of cluster according to user's inquiry request, for realizing data source, accurately assemble guiding is provided; Then, by the semantic similarity calculating between expanding query request and domain body concept, quantize the degree of association between request and resource concept; Finally, the Concept Semantic relation of enriching of utilizing society's mark and body to comprise, Query Result pattern is carried out to semantic annotations, according to the semanteme overall situation effect of society's mark, introducing is usingd the most relevant credible mark that statistic analysis result obtains data source pointed as one of Query Result reliability evaluation standard, result set is carried out to duplicate removal and optimizing polymerization, realize believable Top-K inquiry.
The integrated models and methods of multi-source heterogeneous data semantic building based on domain body provided by the invention, by structure and mapping, query semantics expansion and the result optimizing polymerization of domain body, provide a kind of oil-gas exploration isomeric data building based on domain body semantic integrated model, solved integrated and shared between oil-gas exploration isomeric data, this semanteme integrated model also has loose coupling, easily expansion, supports the superperformances such as semantic query.
The present invention has realized the dynamic growth of data source, for newly-increased data source, only need to provide corresponding wrapper, builds corresponding local ontology, can improve dirigibility and the practicality of integrated system.With domain body, domain knowledge is described, local ontology is described the Heterogeneous Information knowledge in a certain field, and the mapping of setting up respectively mapping, local ontology and the data source of domain body and local ontology, domain body, local ontology and data source were both interknited, relatively independent again, can reduce the coupling of semantic integrated system.In order to realize semantic query and ease for use, in conjunction with society's mark and the complementary advantage of body in knowledge representation, stipulations and expansion are inquired about in semantic query request to user, and to Query Result duplicate removal and optimizing polymerization, the result after optimizing the most at last returns to user.
Accompanying drawing explanation
Fig. 1 is the integrated method flow diagram of multi-source heterogeneous data semantic building based on domain body that the embodiment of the present invention provides;
Fig. 2 is the structural representation of the multi-source heterogeneous data semantic integrated model building based on domain body that provides of the embodiment of the present invention;
In figure: 1, local ontology builds module; 2, ontology merging module; 3, semantic query dynamic expansion and stipulations module;
Fig. 3 is the embodiment of the multi-source heterogeneous data semantic integrated model building based on domain body that provides of the embodiment of the present invention;
Fig. 4 is the process flow diagram that the unstructured data sources that provides of the embodiment of the present invention builds local ontology;
Fig. 5 is the simple abstract body schematic diagram that the embodiment of the present invention provides.
Embodiment
In order to make object of the present invention, technical scheme and advantage clearer, below in conjunction with embodiment, the present invention is further elaborated.Should be appreciated that specific embodiment described herein, only in order to explain the present invention, is not intended to limit the present invention.
Below in conjunction with drawings and the specific embodiments, application principle of the present invention is further described.
As shown in Figure 1, the integrated method of multi-source heterogeneous data semantic building based on domain body of the embodiment of the present invention comprises:
S101: utilize ontology construction, automatically, semi-automatically set up the local ontology of multi-source heterogeneous data, by ontology merging technique construction domain body, and set up the Semantic mapping relation of data source and local ontology, local ontology and domain body;
S102: in conjunction with society's mark and the complementary advantage of body in knowledge representation, stipulations and expansion are inquired about in semantic query request to user, and the semantic query statement of generating standard, inquires about respectively a plurality of data sources, then by Query Result duplicate removal and optimizing polymerization, finally return to user.
As shown in Figure 2, the multi-source heterogeneous data semantic integrated model building based on domain body of the embodiment of the present invention mainly by: local ontology builds module 1, ontology merging module 2 and semantic query dynamic expansion and stipulations module 3 forms;
Local ontology builds module 1, according to data source feature, selects adaptively body construction strategy, thereby constructs oil-gas exploration local ontology;
Ontology merging module 2, building module 1 with local ontology is connected, the ontology merging method that employing combines concept matching and attributes match, utilize maximum information coefficient MIC method to calculate the semantic similarity of Concept Semantic Similarity and concept attribute, realize a plurality of local ontology to the flexible merging of domain body;
Semantic query dynamic expansion and stipulations module 3, build module 1 with local ontology and be connected, for the optimizing polymerization of validity and the result of inquiry request dynamic expansion.
3 pairs of multi-source heterogeneous data semantic integrated models that build based on domain body of the present invention are described further by reference to the accompanying drawings:
1, local ontology builds module:
At present, also there is no maturation, unified methodology instructs the structure of body, set up relatively complete domain body, not only need to catch a large amount of field concept knowledge, also need to solve the semantic conflict of these field concepts, the problems such as ambiguity, this work that causes body to build is both dull, difficult thorny again, become the bottleneck of knowledge acquisition, in addition, although the editing environment of existing body the build tool can meet the needs of setting up body, but by the manual relation of collecting between field concept and field concept, build body completely, remain a job of wasting time and energy, make the application based on body be difficult to promote, therefore, research is automatic from multi-source heterogeneous data, semi-automatically build domain body, and realize the fast mapping of domain body and multi-source heterogeneous data, the semanteme during thereby solution body builds and inquires about is inconsistent and time bottleneck problem, this has important theory significance and a researching value to the data semantic based on domain body is integrated,
Petroleum exploration domain body self-adaptation construction strategy can be according to data source feature, select adaptively different body construction strategy, be mainly divided into four aspects: based on unstructured data sources (as document, webpage etc.), build local ontology, based on structured data source (as relational database), build local ontology, based on semi-structured data source structure local ontology and local ontology, merge into domain body.
1) based on unstructured data sources, build local ontology:
When unstructured data sources is built into body, first to carry out data pre-service, the pretreated fundamental purpose of data is by filtering unworthy symbol and word, thereby extract significant term, current, some natural language processing instruments (as OpenNLP, CKIP etc.) can be realized this function preferably, in the present invention, adopt OpenNLP to realize the pre-service work of unstructured data sources.
For unstructured data sources, first, applicating text filtrator changes into different file layouts into text-only file form, obtains language material data, and carries out consistency check; Then, adopt reverse maximum classification Chinese word cutting method to carry out preliminary cutting to these language materials and process, obtain word string set; Then, utilize maximum information coefficient (MIC) to calculate the internal bond strength of word string, obtain compound word set, and judge the domain-specific of compound word and non-compound word, extract concept set; Then, the classification relation on application drawing between Random Walk Algorithm fuzzy filtering word concept, adopts the clustering algorithm based on hidden Markov model (HMM) to extract the classification relation between non-compound word concept; Then, use the method based on association rule mining to obtain the non-categorical relation between concept; Finally, the local ontology of applied ontology the build tool output OWL form.Idiographic flow as shown in Figure 4.
2) based on structured data source, build local ontology:
From structured data source, building local ontology mainly considers by Semantic mapping, automatically to build petroleum exploration domain body from relational database.First, utilize the Semantic mapping relation between R2O technology building database pattern and ontology model, thereby be the concept in body the relationship map in relational database, attribute is mapped as to OWL attribute accordingly, and the relation table of database is converted into body class, the data in database are converted into example; Then, the initial local ontology extracting from database is done to standardization work, by carrying out semantic similarity calculating with standard body, the ontology information that meets threshold value is set up to semantic relation, the ontology information that does not meet threshold value carries out standardization processing, thereby constructs normalized local ontology.
3) based on semi-structured data source, build local ontology:
First, extract semi-structured data pattern, given mapping ruler, utilizes XML2RD method, and semi-structured data is converted into structural data; Then, according to structural data, build the local ontology corresponding to method construct semi-structured data source of local ontology.
2, ontology merging module:
The research of ontology merging at present mainly concentrates in concept matching, by coupling, find two semantic corresponding relations between Ontological concept, this will cause the incomplete situation of ontology merging result, in the present invention, adopt the ontology merging method that concept matching and attributes match are combined, utilize maximum information coefficient MIC method to calculate the semantic similarity of Concept Semantic Similarity and concept attribute, then, by similarity assessment function, the similarity between concept is assessed, output similar matrix, and use field axiom constraint knowledge further to assess its similarity to similar matrix, then,, by the method training study sorter of machine learning, utilize learning classification device to calculate the similarity between concept example, finally, by in conjunction with ISO15926 oil gas body and fuzzy formal concept analysis (FFCA) method, consider symmetry and the transitive relations of semantic similarity, fuzzy set theory is introduced in the setting of semantic similarity, thereby realize a plurality of local ontology to the flexible merging of domain body,
3, semantic query dynamic expansion and stipulations module:
The validity of inquiry request dynamic expansion and the optimizing polymerization of result are the integrated another one key issues of oil-gas exploration data semantic, first, the conceptual relation and the inferential capability that by the semantic analysis of society mark and body, comprise, inquiry request is carried out to grammer and stipulations semantically and expansion, the semantic query statement of generating standard (normally SPARQL statement), solve between inquiry request and domain body data source different the caused mismatch problems due to expression-form, and automatically recommend the semantic respective labels of cluster according to user's inquiry request, for realizing data source, accurately focus on guiding is provided, then, by the semantic similarity calculating between expanding query request and domain body concept, quantize the degree of association between request and resource concept, finally, the Concept Semantic relation of enriching of utilizing society's mark and body to comprise, Query Result pattern is carried out to semantic annotations, according to the semanteme overall situation effect of society's mark, introducing is usingd the most relevant credible mark that statistic analysis result obtains data source pointed as one of Query Result reliability evaluation standard, result set is carried out to duplicate removal and optimizing polymerization, realize believable Top-K inquiry.
Semantic distance is often referred to two close degree of the semanteme between concept, it is a kind of effective means of weighing semantic similarity, semantic distance between two concepts is less, their semanteme is more approaching, otherwise far away, semantic distance and semantic similarity are generally inverse relation, in information retrieval field, semantic distance is less, descriptive information is more approaching with user's inquiry request, when semantic distance is 0, information meets user's request completely, when semantic distance is greater than certain value, information and user inquire about onrelevant, therefore can not as a result of collect and return, for the result set returning, whether the arbitrary result in the own subjective judgement result set of user meets the demands completely, semantic distance based on body is mainly measured in body the length of fillet between concept, two concepts access path in body is shorter, they are just more similar, the quantity of information that the length of every fillet is comprised by it determines, if p (c) is the probability of happening of concept c in whole concept set, if count (c) is the occurrence number of concept c in body O, count (O) is the total number of concept in body O, because concept may appear at the form of different abstraction hierarchies in body, the occurrence number of its all sub-concepts that should add up while calculating the total occurrence number of concept, the computing method of p (c) are:
p ( c ) = count ( c ) + Σ H ( c ′ , c ) count ( c ′ ) count ( O ) - - - ( 1 )
From formula (1), p (c) is along with the rising monotone increasing of c place level in body O, and p (Root)=1, as shown in Figure 5, and p (c 0)=1, p (c 1)=0.5;
If parent (c) is the father set of concept c in body O:
According to the acyclicity of H relation, easily know:
∀ c ∈ C , if parent ( c ) ≠ φ , then | parent ( c ) | = 1 - - - ( 3 )
Wherein, | parent (c) | represent to gather the element number comprising in parent (c);
From information theory, if the frequency of a concept appearance is larger, the quantity of information that it comprises is just fewer; Otherwise if the frequency of a concept appearance is less, the quantity of information that it comprises is just more, therefore, the quantity of information that concept c comprises is:
I(c)=-log(p(c)) (4)
Therefore the quantity of information that, fillet c → parent (c) comprises is:
I ( c → parent ( e ) ) = - log ( p ( c → parent ( c ) ) ) = - log ( p ( c | parent ( c ) ) )
= - log ( p ( c ) p ( parent ( c ) ) ) = I ( c ) - I ( parent ( c ) ) - - - ( 5 )
The length of fillet c → parent (c) is proportional to the quantity of information that it comprises:
length(c→parent(c))∝I(c→parent(c)) (6)
By formula (5) and formula (6), can be obtained:
length(c→parent(c))=β·(I(c→parent(c))) (7)
Wherein, β is constant, is a scale factor, in order to express conveniently, makes β=1, can obtain:
length(c→parent(c))=I(c→parent(c)) (8)
Due to set up given any two concept c 1and c 2, establish LCA (c 1, c 2) be c 1and c 2the public ancestors of minimum in body O (Least Common Ancestor), LCA (c 1, c 2) be c 1and c 2, for any two concept nodes, necessarily there are the public ancestors of minimum in the concept node of depth capacity on common ancestor path, as shown in Figure 5, and LCA (c 3, c 9)=c 1, LCA (c 1, c 5)=c 0, i.e. the root concept node (Root) of body, therefore:
Definition 1 (access path): due to set up, and H relation has acyclicity, c 1and c 2access path in body O has and only has one, is designated as path (c 1, c 2), and c 1to LCA (c 1, c 2) path and c 2to LCA (c 1, c 2) union in path is access path, be defined as:
path(c 1,c 2)=path(c 1,LCA(c 1,c 2))∪path(c 2,LCA(c 1,c 2)) (10)
Definition 2 (semantic distances): semantic distance has reflected the semantic association degree between two concepts, and the semantic distance between two concepts is defined as:
dis ( c 1 , c 2 ) = Σ c ∈ { path ( c 1 , c 2 ) - LCA ( c 1 , c 2 ) } length ( c → parent ( c ) ) = Σ c ∈ { path ( c 1 , c 2 ) - LCA ( c 1 , c 2 ) } I ( c ) - I ( parent ( c ) ) = ( I ( c 1 ) - I ( LCA ( c 1 , c 2 ) ) + ( I ( c 2 ) - I ( LCA ( c 1 , c 2 ) ) = 2 log ( p ( LCA ( c 1 , c 2 ) ) ) - ( log ( p ( c 1 ) ) + log ( p ( c 2 ) ) ) - - - ( 11 )
From formula (11), in fact the definition of semantic distance is comprised the information of two aspects here: on the one hand, the formation of domain body has determined LCA (c 1, c 2) position, this is when measuring similarity, within the inheritance between concept and concept, in body level, residing position has all been included in semantic distance to the impact of similarity; On the other hand, p (LCA (c 1, c 2)), p (c 1) and p (c 2) value all come from the statistical information of concept set,
From above-mentioned definition, with respect to different domain bodies, semantic distance between concept is different, and different concepts is with respect to same domain body, its semantic distance is also different, this computing method to semantic distance combine semantic relation between concept and the statistical information of objective fact, contribute to simulate more accurately the original appearance of objective world.
The integrated domain body that first need to be participated in creating by domain expert of semanteme of isomeric data of the present invention provides shared knowledge base, next needs ability to express to enrich and have the ontology describing language of certain logical reasoning ability, then by selecting rational mapping method, the Uniform semantic information that becomes integrated system to understand the Semantic Heterogeneous data-switching in different pieces of information source.In addition, system also should be supported semantic query, and has certain ease for use and extensibility.
This model can be realized the dynamic growth of data source, for newly-increased data source, only need to provide corresponding wrapper, builds corresponding local ontology, can improve dirigibility and the practicality of integrated system.With domain body, domain knowledge is described, local ontology is described the Heterogeneous Information knowledge in a certain field, and the mapping of setting up respectively mapping, local ontology and the data source of domain body and local ontology, this both interknited domain body, local ontology and data source, relatively independent again, can reduce the coupling of semantic integrated system.In order to realize semantic query and ease for use, in conjunction with society's mark and the complementary advantage of body in knowledge representation, user's semantic query request is inquired about to stipulations and expansion, and to Query Result duplicate removal and optimizing polymerization, finally return to user.
The foregoing is only preferred embodiment of the present invention, not in order to limit the present invention, all any modifications of doing within the spirit and principles in the present invention, be equal to and replace and improvement etc., within all should being included in protection scope of the present invention.

Claims (5)

1. the integrated method of multi-source heterogeneous data semantic building based on domain body, is characterized in that, the integrated method of multi-source heterogeneous data semantic that should build based on domain body comprises:
Utilize ontology construction, automatically, semi-automatically set up the local ontology of multi-source heterogeneous data, by ontology merging technique construction domain body, and set up the Semantic mapping relation between data source and local ontology, local ontology and domain body;
In conjunction with society's mark and the complementary advantage of body in knowledge representation, stipulations and expansion are inquired about in semantic query request to user, and the semantic query statement of generating standard, inquires about respectively a plurality of data sources, then by Query Result duplicate removal and optimizing polymerization, finally return to user.
2. the integrated model of multi-source heterogeneous data semantic building based on domain body, it is characterized in that, the multi-source heterogeneous data semantic integrated model that should build based on domain body comprises: local ontology builds module, ontology merging module and semantic query dynamic expansion and stipulations module;
Local ontology builds module, according to data source feature, selects adaptively body construction strategy, thereby constructs oil-gas exploration local ontology;
Ontology merging module, building module with local ontology is connected, the ontology merging method that employing combines concept matching and attributes match, utilize maximum information coefficient method to calculate the semantic similarity of Concept Semantic Similarity and concept attribute, realize a plurality of local ontology to the flexible merging of domain body;
Semantic query dynamic expansion and stipulations module, build module with local ontology and be connected, for the optimizing polymerization of validity and the result of inquiry request dynamic expansion.
3. the integrated model of multi-source heterogeneous data semantic building based on domain body as claimed in claim 2, is characterized in that, local ontology builds module, according to data source feature, by self-adaptation body construction strategy, carries out the structure of local ontology, specifically comprises:
Step 1, based on unstructured data sources, build local ontology:
First, applicating text filtrator changes into different file layouts into text-only file form, obtains language material data, and carries out consistency check; Then, adopt reverse maximum classification Chinese word cutting method to carry out preliminary cutting to these language materials and process, obtain word string set; Then, utilize maximum information coefficient method to calculate the internal bond strength of word string, obtain compound word set, and judge the domain-specific of compound word and non-compound word, extract concept set; Then, the classification relation on application drawing between Random Walk Algorithm fuzzy filtering word concept, adopts the clustering algorithm based on hidden Markov model to extract the classification relation between non-compound word concept; Then, use the method based on association rule mining to obtain the non-categorical relation between concept; Finally, the local ontology of applied ontology the build tool output OWL form;
Step 2, builds local ontology based on structured data source:
First, utilize the Semantic mapping relation between R2O technology building database pattern and ontology model, thereby be the concept in body the relationship map in relational database, attribute is mapped as to OWL attribute accordingly, and the relation table of database is converted into body class, the data in database are converted into example; Then, the initial local ontology extracting from database is done to a series of standardization work, by carrying out semantic similarity calculating with standard body, the ontology information that meets threshold value is set up to semantic relation, the ontology information that does not meet threshold value carries out standardization processing, thereby constructs satisfactory standardization local ontology;
Step 3, builds local ontology based on semi-structured data source
Due to semi-structured data between structuring and unstructured data, there is implicit structure but lack class data of fixing or strict structure; So the ontology construction based on above-mentioned two kinds of data types also can be applied to semi-structured data source; First, extract semi-structured data pattern, given mapping ruler, utilizes XML2RD method, and semi-structured data is converted into structural data; Then, according to structural data, build the local ontology corresponding to method construct semi-structured data source of local ontology.
4. the integrated model of multi-source heterogeneous data semantic building based on domain body as claimed in claim 2, is characterized in that, the method for ontology merging block merging is:
The ontology merging method that employing combines concept matching and attributes match, utilize maximum information coefficient method to calculate the semantic similarity of Concept Semantic Similarity and concept attribute, then, by similarity assessment function, the similarity between concept is assessed, output similar matrix, and use field axiom constraint knowledge further to assess its similarity to similar matrix; Then,, by the method training study sorter of machine learning, utilize learning classification device to calculate the similarity between concept example; Finally, by in conjunction with ISO15926 oil gas body and fuzzy formal concept analysis method, consider symmetry and the transitive relations of semantic similarity, fuzzy set theory is introduced in the setting of semantic similarity, realize a plurality of local ontology to the flexible merging of domain body.
5. the integrated model of multi-source heterogeneous data semantic building based on domain body as claimed in claim 2, is characterized in that, the concrete grammar that semantic query dynamic expansion and stipulations module realize is:
First, the conceptual relation and the inferential capability that by the semantic analysis of society mark and body, comprise, inquiry request is carried out to grammer and stipulations semantically and expansion, the semantic query statement of generating standard, solve between inquiry request and domain body data source different the caused mismatch problems due to expression-form, and automatically recommend the semantic respective labels of cluster according to user's inquiry request, for realizing data source, accurately assemble guiding is provided; Then, by the semantic similarity calculating between expanding query request and domain body concept, quantize the degree of association between request and resource concept; Finally, the Concept Semantic relation of enriching of utilizing society's mark and body to comprise, Query Result pattern is carried out to semantic annotations, according to the semanteme overall situation effect of society's mark, introducing is usingd the most relevant credible mark that statistic analysis result obtains data source pointed as one of Query Result reliability evaluation standard, result set is carried out to duplicate removal and optimizing polymerization, realize believable Top-K inquiry.
CN201410317211.8A 2014-07-04 2014-07-04 The integrated model of multi-source heterogeneous data semantic based on domain body structure and method Expired - Fee Related CN104182454B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201410317211.8A CN104182454B (en) 2014-07-04 2014-07-04 The integrated model of multi-source heterogeneous data semantic based on domain body structure and method

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201410317211.8A CN104182454B (en) 2014-07-04 2014-07-04 The integrated model of multi-source heterogeneous data semantic based on domain body structure and method

Publications (2)

Publication Number Publication Date
CN104182454A true CN104182454A (en) 2014-12-03
CN104182454B CN104182454B (en) 2018-03-27

Family

ID=51963496

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201410317211.8A Expired - Fee Related CN104182454B (en) 2014-07-04 2014-07-04 The integrated model of multi-source heterogeneous data semantic based on domain body structure and method

Country Status (1)

Country Link
CN (1) CN104182454B (en)

Cited By (54)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104679823A (en) * 2014-12-31 2015-06-03 智慧城市信息技术有限公司 Semantic annotation-based association method and system of heterogeneous data
CN105426358A (en) * 2015-11-09 2016-03-23 中国农业大学 Automatic disease noun identification method
CN105677766A (en) * 2015-12-30 2016-06-15 南京理工大学 Algebraic-specification to ontology-description conversion and assessment method of Web service semantics
CN105808853A (en) * 2016-03-09 2016-07-27 哈尔滨工程大学 Engineering application oriented body establishment management and body data automatic obtaining method
CN106021306A (en) * 2016-05-05 2016-10-12 上海交通大学 Ontology matching based case search system
CN106055560A (en) * 2016-05-18 2016-10-26 上海申腾信息技术有限公司 Method for collecting data of word segmentation dictionary based on statistical machine learning method
CN106296343A (en) * 2016-08-01 2017-01-04 王四春 A kind of e-commerce transaction monitoring method based on the Internet and big data
CN106302680A (en) * 2016-08-06 2017-01-04 内蒙古大学 A kind of data based on Internet of Things display background system
CN106407208A (en) * 2015-07-29 2017-02-15 清华大学 Establishment method and system for city management ontology knowledge base
CN106777372A (en) * 2017-01-26 2017-05-31 语义(上海)信息科技有限公司 A kind of honeybee stream device data water conservancy diversion and data method for transformation based on Ontology on Semantic Web
CN106779146A (en) * 2016-11-15 2017-05-31 广州铁路职业技术学院 A kind of tourism service system for providing recommendation tourism route
CN107016630A (en) * 2017-02-09 2017-08-04 湖南第师范学院 A kind of novel English teaching learns language system
CN107045545A (en) * 2017-03-30 2017-08-15 山东省农业科学院 A kind of peanut cultivation information database constructing system
CN107209754A (en) * 2014-12-10 2017-09-26 凯恩迪股份有限公司 Technology and semantic signal processing in large-scale unstructured data field
CN107330007A (en) * 2017-06-12 2017-11-07 南京邮电大学 A kind of Method for Ontology Learning based on multi-data source
CN107357933A (en) * 2017-08-04 2017-11-17 刘应波 A kind of label for multi-source heterogeneous science and technology information resource describes method and apparatus
CN107491476A (en) * 2017-06-29 2017-12-19 中国科学院计算机网络信息中心 A kind of data model translation and query analysis method suitable for a variety of big data management systems
CN107545046A (en) * 2017-08-17 2018-01-05 北京奇安信科技有限公司 A kind of fusion method and device of multi-source heterogeneous data
CN107679484A (en) * 2017-09-28 2018-02-09 辽宁工程技术大学 A kind of Remote Sensing Target automatic detection and recognition methods based on cloud computing storage
CN107704480A (en) * 2016-08-08 2018-02-16 百度(美国)有限责任公司 Extension and the method and system and computer media for strengthening knowledge graph
CN107808001A (en) * 2017-11-13 2018-03-16 哈尔滨工业大学 Towards the Mode integrating method and device of magnanimity isomeric data
CN107958086A (en) * 2017-12-18 2018-04-24 北京睿力科技有限公司 The multi-source heterogeneous database data for solving data semantic Heterogeneity integrates method
CN108446293A (en) * 2018-01-22 2018-08-24 中电海康集团有限公司 A method of based on urban multi-source isomeric data structure city portrait
CN108509486A (en) * 2018-02-08 2018-09-07 浙江大学 A kind of safe big data structural management method of intelligent plant multi-source
CN108536796A (en) * 2018-04-02 2018-09-14 北京大学 A kind of isomery Ontology Matching method and system based on figure
CN108614821A (en) * 2016-12-09 2018-10-02 中国地质调查局发展研究中心 Geologic information interconnection mutually looks into system
CN108710663A (en) * 2018-05-14 2018-10-26 北京大学 A kind of data matching method and system based on ontology model
CN109063114A (en) * 2018-07-27 2018-12-21 华南理工大学广州学院 Heterogeneous data integrating method, device, terminal and the storage medium of energy cloud platform
CN109271638A (en) * 2018-10-08 2019-01-25 东北石油大学 A kind of personalized Semantic Integration and system based on data space
CN109541995A (en) * 2018-11-01 2019-03-29 湖南城市学院 A kind of Intelligent indoor finishing absorption dedusting control system
CN109587125A (en) * 2018-11-23 2019-04-05 南方电网科学研究院有限责任公司 Network security big data analysis method, system and related device
CN109597925A (en) * 2018-10-25 2019-04-09 同济大学 A kind of supplier data analysis method and analysis system based on ontology
CN109643308A (en) * 2016-08-23 2019-04-16 伊路米纳有限公司 Determine the semantic distance system and method for relevant ontology data
CN109635119A (en) * 2018-10-25 2019-04-16 同济大学 A kind of industrial big data integrated system based on ontology fusion
CN109766906A (en) * 2018-11-16 2019-05-17 中国人民解放军海军大连舰艇学院 Naval battle field situation data fusion method and system based on occurrence diagram
CN110008288A (en) * 2019-02-19 2019-07-12 武汉烽火技术服务有限公司 The construction method in the knowledge mapping library for Analysis of Network Malfunction and its application
CN110175197A (en) * 2019-05-23 2019-08-27 长沙学院 A kind of body constructing method and system based on semantic Internet of Things
CN110197280A (en) * 2019-05-20 2019-09-03 中国银行股份有限公司 A kind of knowledge mapping construction method, apparatus and system
CN110275919A (en) * 2019-06-18 2019-09-24 合肥工业大学 Data integrating method and device
CN110489395A (en) * 2019-07-27 2019-11-22 西南电子技术研究所(中国电子科技集团公司第十研究所) Automatically the method for multi-source heterogeneous data knowledge is obtained
CN110866332A (en) * 2019-10-29 2020-03-06 中国电子科技集团公司第三十八研究所 Complex cable assembly assembling method and system
CN111400458A (en) * 2018-12-27 2020-07-10 上海智臻智能网络科技股份有限公司 Automatic generalization method and device
CN111401988A (en) * 2020-03-02 2020-07-10 重庆邮电大学 Product configuration demand response system based on semantics and order generation method
CN111858948A (en) * 2019-04-30 2020-10-30 杭州海康威视数字技术股份有限公司 Ontology construction method and device, electronic equipment and storage medium
CN111858649A (en) * 2020-08-05 2020-10-30 哈尔滨工业大学(威海) Heterogeneous data fusion method based on ontology mapping
CN112000725A (en) * 2020-08-28 2020-11-27 哈尔滨工业大学 Ontology fusion pretreatment method for multi-source heterogeneous resources
CN112328623A (en) * 2020-11-06 2021-02-05 昆山数字城市信息技术有限公司 Multi-source heterogeneous data management method based on mixed ontology mode
CN112667606A (en) * 2021-01-15 2021-04-16 中国科学院空天信息创新研究院 Knowledge base system based on multi-source knowledge acquisition technology and construction method thereof
CN112908441A (en) * 2021-03-04 2021-06-04 文华学院 Data processing method and device for medical platform and processing equipment
CN113111503A (en) * 2021-04-01 2021-07-13 重庆传晟酷德大数据科技有限公司 Multi-source heterogeneous data construction method based on CAD
CN113987131A (en) * 2021-11-11 2022-01-28 江苏天汇空间信息研究院有限公司 Heterogeneous multi-source data correlation analysis system and method
CN114385819A (en) * 2022-03-23 2022-04-22 湖南工商大学 Environment judicial domain ontology construction method and device and related equipment
CN115082010A (en) * 2022-06-02 2022-09-20 云南电网有限责任公司信息中心 Intelligent management method, storage medium and system for metadata in power field
CN115098931A (en) * 2022-07-20 2022-09-23 江苏艾佳家居用品有限公司 Small sample analysis method for mining personalized requirements of indoor design of user

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101582073A (en) * 2008-12-31 2009-11-18 北京中机科海科技发展有限公司 Intelligent retrieval system and method based on domain ontology
CN102542027A (en) * 2011-12-22 2012-07-04 北京航空航天大学深圳研究院 Construction method of data integration system for studying ontology based on relation schema
CN102682122A (en) * 2012-05-15 2012-09-19 北京科技大学 Method for constructing semantic data model for material science field based on ontology
US8504511B2 (en) * 2009-08-05 2013-08-06 Fujifilm Medical Systems Usa, Inc. System and method for providing localization of radiological information utilizing radiological domain ontology
US8666982B2 (en) * 2011-10-06 2014-03-04 GM Global Technology Operations LLC Method and system to augment vehicle domain ontologies for vehicle diagnosis

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101582073A (en) * 2008-12-31 2009-11-18 北京中机科海科技发展有限公司 Intelligent retrieval system and method based on domain ontology
US8504511B2 (en) * 2009-08-05 2013-08-06 Fujifilm Medical Systems Usa, Inc. System and method for providing localization of radiological information utilizing radiological domain ontology
US8666982B2 (en) * 2011-10-06 2014-03-04 GM Global Technology Operations LLC Method and system to augment vehicle domain ontologies for vehicle diagnosis
CN102542027A (en) * 2011-12-22 2012-07-04 北京航空航天大学深圳研究院 Construction method of data integration system for studying ontology based on relation schema
CN102682122A (en) * 2012-05-15 2012-09-19 北京科技大学 Method for constructing semantic data model for material science field based on ontology

Non-Patent Citations (3)

* Cited by examiner, † Cited by third party
Title
RASHMI CHAUHAN.ETC: "Domain Ontology based Semantic Search for Efficient Information Retrieval through Automatic Query Expansion", 《2013 INTERNATIONAL CONFERENCE ON INTELLIGENT SYSTEMS AND SIGNAL PROCESSING 》 *
XUAN TIAN.ETC: "Measuring Semantic Associaiton in Domain Ontology", 《 THIRD INTERNATIONAL CONFERENCE ON SEMANTICS, KNOWLEDGE AND GRID》 *
李鹏: "面向地质勘查的多源异构数据集成关键技术研究", 《中国博士学位论文全文数据库》 *

Cited By (78)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107209754A (en) * 2014-12-10 2017-09-26 凯恩迪股份有限公司 Technology and semantic signal processing in large-scale unstructured data field
CN107209754B (en) * 2014-12-10 2021-07-13 凯恩迪股份有限公司 Techniques and semantic signal processing in large unstructured data fields
CN104679823A (en) * 2014-12-31 2015-06-03 智慧城市信息技术有限公司 Semantic annotation-based association method and system of heterogeneous data
CN106407208A (en) * 2015-07-29 2017-02-15 清华大学 Establishment method and system for city management ontology knowledge base
CN106407208B (en) * 2015-07-29 2019-06-18 清华大学 A kind of construction method and system of city management ontology knowledge base
CN105426358A (en) * 2015-11-09 2016-03-23 中国农业大学 Automatic disease noun identification method
CN105426358B (en) * 2015-11-09 2018-08-31 中国农业大学 A kind of disease noun automatic identifying method for magnanimity news
CN105677766A (en) * 2015-12-30 2016-06-15 南京理工大学 Algebraic-specification to ontology-description conversion and assessment method of Web service semantics
CN105808853A (en) * 2016-03-09 2016-07-27 哈尔滨工程大学 Engineering application oriented body establishment management and body data automatic obtaining method
CN105808853B (en) * 2016-03-09 2018-08-17 哈尔滨工程大学 A kind of ontological construction management of Engineering Oriented application and ontology data automatic obtaining method
CN106021306B (en) * 2016-05-05 2019-03-15 上海交通大学 Case retrieval system based on Ontology Matching
CN106021306A (en) * 2016-05-05 2016-10-12 上海交通大学 Ontology matching based case search system
CN106055560A (en) * 2016-05-18 2016-10-26 上海申腾信息技术有限公司 Method for collecting data of word segmentation dictionary based on statistical machine learning method
CN106296343A (en) * 2016-08-01 2017-01-04 王四春 A kind of e-commerce transaction monitoring method based on the Internet and big data
CN106302680A (en) * 2016-08-06 2017-01-04 内蒙古大学 A kind of data based on Internet of Things display background system
CN107704480B (en) * 2016-08-08 2021-06-11 百度(美国)有限责任公司 Method and system for extending and reinforcing knowledge graph and computer medium
CN107704480A (en) * 2016-08-08 2018-02-16 百度(美国)有限责任公司 Extension and the method and system and computer media for strengthening knowledge graph
CN109643308B (en) * 2016-08-23 2023-03-28 伊路米纳有限公司 Semantic distance system and method for determining related ontology data
CN109643308A (en) * 2016-08-23 2019-04-16 伊路米纳有限公司 Determine the semantic distance system and method for relevant ontology data
CN106779146A (en) * 2016-11-15 2017-05-31 广州铁路职业技术学院 A kind of tourism service system for providing recommendation tourism route
CN108614821B (en) * 2016-12-09 2020-10-13 中国地质调查局发展研究中心 Geological data interconnection and mutual-checking system
CN108614821A (en) * 2016-12-09 2018-10-02 中国地质调查局发展研究中心 Geologic information interconnection mutually looks into system
CN106777372B (en) * 2017-01-26 2019-08-27 语义(上海)信息科技有限公司 A kind of bee stream device data water conservancy diversion and data method for transformation based on Ontology on Semantic Web
CN106777372A (en) * 2017-01-26 2017-05-31 语义(上海)信息科技有限公司 A kind of honeybee stream device data water conservancy diversion and data method for transformation based on Ontology on Semantic Web
CN107016630A (en) * 2017-02-09 2017-08-04 湖南第师范学院 A kind of novel English teaching learns language system
CN107016630B (en) * 2017-02-09 2023-09-01 湖南第一师范学院 Novel english teaching language learning system
CN107045545A (en) * 2017-03-30 2017-08-15 山东省农业科学院 A kind of peanut cultivation information database constructing system
CN107330007A (en) * 2017-06-12 2017-11-07 南京邮电大学 A kind of Method for Ontology Learning based on multi-data source
CN107491476B (en) * 2017-06-29 2021-01-12 中国科学院计算机网络信息中心 Data model conversion and query analysis method suitable for various big data management systems
CN107491476A (en) * 2017-06-29 2017-12-19 中国科学院计算机网络信息中心 A kind of data model translation and query analysis method suitable for a variety of big data management systems
CN107357933A (en) * 2017-08-04 2017-11-17 刘应波 A kind of label for multi-source heterogeneous science and technology information resource describes method and apparatus
CN107545046B (en) * 2017-08-17 2021-05-25 北京奇安信科技有限公司 Fusion method and device for multi-source heterogeneous data
CN107545046A (en) * 2017-08-17 2018-01-05 北京奇安信科技有限公司 A kind of fusion method and device of multi-source heterogeneous data
CN107679484A (en) * 2017-09-28 2018-02-09 辽宁工程技术大学 A kind of Remote Sensing Target automatic detection and recognition methods based on cloud computing storage
CN107808001A (en) * 2017-11-13 2018-03-16 哈尔滨工业大学 Towards the Mode integrating method and device of magnanimity isomeric data
CN107808001B (en) * 2017-11-13 2019-12-06 哈尔滨工业大学 Massive heterogeneous data oriented mode integration method and device
CN107958086A (en) * 2017-12-18 2018-04-24 北京睿力科技有限公司 The multi-source heterogeneous database data for solving data semantic Heterogeneity integrates method
CN108446293B (en) * 2018-01-22 2020-12-15 中电海康集团有限公司 Method for constructing city portrait based on city multi-source heterogeneous data
CN108446293A (en) * 2018-01-22 2018-08-24 中电海康集团有限公司 A method of based on urban multi-source isomeric data structure city portrait
CN108509486A (en) * 2018-02-08 2018-09-07 浙江大学 A kind of safe big data structural management method of intelligent plant multi-source
CN108536796A (en) * 2018-04-02 2018-09-14 北京大学 A kind of isomery Ontology Matching method and system based on figure
CN108710663A (en) * 2018-05-14 2018-10-26 北京大学 A kind of data matching method and system based on ontology model
CN109063114B (en) * 2018-07-27 2020-11-24 华南理工大学广州学院 Heterogeneous data integration method and device for energy cloud platform, terminal and storage medium
CN109063114A (en) * 2018-07-27 2018-12-21 华南理工大学广州学院 Heterogeneous data integrating method, device, terminal and the storage medium of energy cloud platform
CN109271638B (en) * 2018-10-08 2023-01-06 东北石油大学 Personalized semantic integration method and system based on data space
CN109271638A (en) * 2018-10-08 2019-01-25 东北石油大学 A kind of personalized Semantic Integration and system based on data space
CN109597925A (en) * 2018-10-25 2019-04-09 同济大学 A kind of supplier data analysis method and analysis system based on ontology
CN109635119B (en) * 2018-10-25 2023-08-04 同济大学 Industrial big data integration system based on ontology fusion
CN109635119A (en) * 2018-10-25 2019-04-16 同济大学 A kind of industrial big data integrated system based on ontology fusion
CN109541995A (en) * 2018-11-01 2019-03-29 湖南城市学院 A kind of Intelligent indoor finishing absorption dedusting control system
CN109766906A (en) * 2018-11-16 2019-05-17 中国人民解放军海军大连舰艇学院 Naval battle field situation data fusion method and system based on occurrence diagram
CN109587125A (en) * 2018-11-23 2019-04-05 南方电网科学研究院有限责任公司 Network security big data analysis method, system and related device
CN111400458A (en) * 2018-12-27 2020-07-10 上海智臻智能网络科技股份有限公司 Automatic generalization method and device
CN110008288A (en) * 2019-02-19 2019-07-12 武汉烽火技术服务有限公司 The construction method in the knowledge mapping library for Analysis of Network Malfunction and its application
CN111858948A (en) * 2019-04-30 2020-10-30 杭州海康威视数字技术股份有限公司 Ontology construction method and device, electronic equipment and storage medium
CN110197280A (en) * 2019-05-20 2019-09-03 中国银行股份有限公司 A kind of knowledge mapping construction method, apparatus and system
CN110197280B (en) * 2019-05-20 2021-08-06 中国银行股份有限公司 Knowledge graph construction method, device and system
CN110175197A (en) * 2019-05-23 2019-08-27 长沙学院 A kind of body constructing method and system based on semantic Internet of Things
CN110275919A (en) * 2019-06-18 2019-09-24 合肥工业大学 Data integrating method and device
CN110489395A (en) * 2019-07-27 2019-11-22 西南电子技术研究所(中国电子科技集团公司第十研究所) Automatically the method for multi-source heterogeneous data knowledge is obtained
CN110489395B (en) * 2019-07-27 2022-07-29 西南电子技术研究所(中国电子科技集团公司第十研究所) Method for automatically acquiring knowledge of multi-source heterogeneous data
CN110866332A (en) * 2019-10-29 2020-03-06 中国电子科技集团公司第三十八研究所 Complex cable assembly assembling method and system
CN111401988A (en) * 2020-03-02 2020-07-10 重庆邮电大学 Product configuration demand response system based on semantics and order generation method
CN111858649A (en) * 2020-08-05 2020-10-30 哈尔滨工业大学(威海) Heterogeneous data fusion method based on ontology mapping
CN111858649B (en) * 2020-08-05 2022-06-17 哈尔滨工业大学(威海) Heterogeneous data fusion method based on ontology mapping
CN112000725A (en) * 2020-08-28 2020-11-27 哈尔滨工业大学 Ontology fusion pretreatment method for multi-source heterogeneous resources
CN112000725B (en) * 2020-08-28 2023-03-21 哈尔滨工业大学 Ontology fusion preprocessing method for multi-source heterogeneous resources
CN112328623A (en) * 2020-11-06 2021-02-05 昆山数字城市信息技术有限公司 Multi-source heterogeneous data management method based on mixed ontology mode
CN112667606A (en) * 2021-01-15 2021-04-16 中国科学院空天信息创新研究院 Knowledge base system based on multi-source knowledge acquisition technology and construction method thereof
CN112908441A (en) * 2021-03-04 2021-06-04 文华学院 Data processing method and device for medical platform and processing equipment
CN113111503A (en) * 2021-04-01 2021-07-13 重庆传晟酷德大数据科技有限公司 Multi-source heterogeneous data construction method based on CAD
CN113111503B (en) * 2021-04-01 2024-03-05 重庆传晟酷德大数据科技有限公司 CAD-based multi-source heterogeneous data construction method
CN113987131A (en) * 2021-11-11 2022-01-28 江苏天汇空间信息研究院有限公司 Heterogeneous multi-source data correlation analysis system and method
CN114385819A (en) * 2022-03-23 2022-04-22 湖南工商大学 Environment judicial domain ontology construction method and device and related equipment
CN115082010A (en) * 2022-06-02 2022-09-20 云南电网有限责任公司信息中心 Intelligent management method, storage medium and system for metadata in power field
CN115082010B (en) * 2022-06-02 2024-05-24 云南电网有限责任公司信息中心 Intelligent management method, storage medium and system for metadata in electric power field
CN115098931A (en) * 2022-07-20 2022-09-23 江苏艾佳家居用品有限公司 Small sample analysis method for mining personalized requirements of indoor design of user
CN115098931B (en) * 2022-07-20 2022-12-16 江苏艾佳家居用品有限公司 Small sample analysis method for mining personalized requirements of indoor design of user

Also Published As

Publication number Publication date
CN104182454B (en) 2018-03-27

Similar Documents

Publication Publication Date Title
CN104182454A (en) Multi-source heterogeneous data semantic integration model constructed based on domain ontology and method
CN104699719B (en) A kind of semantization method of internet-of-things terminal equipment
CN102156726B (en) Geographic element querying and extending method based on semantic similarity
CN101093559B (en) Method for constructing expert system based on knowledge discovery
CN109710701A (en) A kind of automated construction method for public safety field big data knowledge mapping
CN104572833B (en) A kind of mapping ruler creation method and device
CN103729432A (en) Method for analyzing and sequencing academic influence of theme literature in citation database
CN106370631B (en) A kind of automatic assay of sepectrophotofluorometer and data acquisition and recording method
CN101667201A (en) Integration method of Deep Web query interface based on tree merging
CN104915396A (en) Knowledge retrieving method
Ren et al. Long-Term Preservation of Electronic Record Based on Digital Continuity in Smart Cities.
CN108416524A (en) Estate planning based on a figure general framework refines deciphering method
Orlandi et al. Interlinking heterogeneous data for smart energy systems
CN102737125B (en) Web temporal object model-based outdated webpage information automatic discovering method
Ceci et al. Spatial associative classification: propositional vs structural approach
CN113254517A (en) Service providing method based on internet big data
Li Research and analysis of semantic search technology based on knowledge graph
CN107045545A (en) A kind of peanut cultivation information database constructing system
CN104598548A (en) Method and device for analyzing spatial correlation of agricultural product price
CN111475604A (en) Data processing method and device
CN107945871A (en) A kind of blood disease intelligent classification system based on big data
Guan et al. Application prospect of knowledge graph technology in knowledge management of oil and gas exploration and development
Xiao et al. Integration of heterogeneous agriculture information system based on interoperation of domain ontology
Guo et al. Design and implementation of domain ontology-based oilfield non-metallic pipe information retrieval system
CN117934209B (en) Regional power system carbon emission big data analysis method based on knowledge graph

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant
CF01 Termination of patent right due to non-payment of annual fee

Granted publication date: 20180327

CF01 Termination of patent right due to non-payment of annual fee