CN104866598A - Heterogeneous database integrating method based on configurable templates - Google Patents

Heterogeneous database integrating method based on configurable templates Download PDF

Info

Publication number
CN104866598A
CN104866598A CN201510292059.7A CN201510292059A CN104866598A CN 104866598 A CN104866598 A CN 104866598A CN 201510292059 A CN201510292059 A CN 201510292059A CN 104866598 A CN104866598 A CN 104866598A
Authority
CN
China
Prior art keywords
knowledge
template
user
templet
integrated
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201510292059.7A
Other languages
Chinese (zh)
Other versions
CN104866598B (en
Inventor
徐哲赢
阎艳
郝佳
明振军
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beijing Institute of Technology BIT
Original Assignee
Beijing Institute of Technology BIT
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beijing Institute of Technology BIT filed Critical Beijing Institute of Technology BIT
Priority to CN201510292059.7A priority Critical patent/CN104866598B/en
Publication of CN104866598A publication Critical patent/CN104866598A/en
Application granted granted Critical
Publication of CN104866598B publication Critical patent/CN104866598B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/25Integrating or interfacing systems involving database management systems

Abstract

The present invention solves the problem that an integrated retrieval platform cannot take account of both configuration flexibility and integrating depth, so that the openness of the integrated platform and the integrating degree of resources are difficult to meet user requirements, and provides a heterogeneous database integrating method based on configurable templates. The method comprises the steps of: 1, inputting basic information of data sources to be integrated, newly creating knowledge templates, configuring incidence relation of each field in the data sources, and forming new knowledge organization structures; 2, matching attribute types for each field in the newly created knowledge templates so as to form a showing interfaces of the knowledge templates; and 3, carrying out cross-database retrieval on an index table of each knowledge template, returning a retrieval list, carrying out depth retrieval, on the basis of click of a user and according to the organization structure of the knowledge template to which entries belong, and expressing details by using the appertaining knowledge template as a carrier.

Description

Based on the heterogeneous databases integration method of configurable template
Technical field
The invention belongs to information management and Data Integration field, relate to a kind of heterogeneous databases integration method based on configurable template.
Background technology
Along with the arrival of large data age and the deep development of kownledge economy, the research and development such as enterprise, research institute and production mechanism more pay attention to the management to knowledge, implement many Information softwares, and create a large amount of data, are stored in respectively in different databases.And relatively independent, inorganized knowledge is difficult to play sufficient value, therefore need by these resources effectively being integrated to the integrated of heterogeneous database.
These databases have the features such as system isomery, structural isomerism, are called heterogeneous database.Conventional integrated approach takes the mode customized more, carry out integrated to existing heterogeneous database, and require before integrated, complete the read work to these database structures, there is provided basis for the later stage builds retrieve statement, this step is the pith realizing integrated approach.And when user proposes new demand, then need artificial safeguard platform and improve, be a very long and process for complexity.Current also exist the integrated approach that some support user's importing and management database, but, these integrated approaches are merely able to realize preliminary integrated, the most basic search field is returned to user in Aggregated search, when user has further demand to knowledge detailed content and structure, often take the mode of automatic acquisition database association table to return some information that may be correlated with to user, and these information often have the features such as accuracy is low, inorganizable.For head it off, user often will carry out quadratic search in the corresponding information system at this knowledge place or database.Above phenomenon reflects that existing integrated approach can not take into account configuration flexibility and the integrated degree of depth, makes the opening of integrated platform and the degree of integration of resource be difficult to meet the demand of user.
Summary of the invention
The object of the invention is the defect in order to overcome prior art, solve Aggregated search platform and can not take into account configuration flexibility and the integrated degree of depth, make the opening of integrated platform and the degree of integration of resource be difficult to the problem of meeting consumers' demand, propose a kind of heterogeneous databases integration method based on configurable template.
The inventive method is achieved through the following technical solutions:
Based on a heterogeneous databases integration method for configurable template, comprise the following steps:
Step one, input treat the essential information in integrated data source, and newly-built knowledge templet, configures the incidence relation of each field in data source, forms new knowledge organization structure;
Step 2, be each fields match attribute type in newly-built knowledge templet, form the displaying interface of knowledge templet;
Step 3, by carrying out cross search to the concordance list of each knowledge templet, returning retrieval list, clicking based on user, carrying out depth search according to the institutional framework of knowledge templet belonging to this entry, and with affiliated knowledge templet for vector expression detailed content.
Since then, just complete/achieve the heterogeneous databases integration method based on configurable template.
Further, the structure of configurable template comprises three key elements: data source essential information, knowledge organization structure and knowledge templet; Data source essential information comprises each type treating integrated data base, URL, user name, password, arranges corresponding link method according to the dissimilar of database; Knowledge organization structure, by reading the data structure of each database, intercepts partial information wherein, reconfigures according to user's request, and tissue becomes new Knowledge framework; The template that knowledge templet generates from the configuration of template attributes type list according to knowledge organization structure, be used to user show integrated after knowledge content and structure, and the structure rule of template will to be stored in template base.
Further, in step one, layoutprocedure is as follows:
(1) import need the essential information in integrated data source;
(2) newly-built knowledge templet;
(3) concordance list is selected; The corresponding concordance list of each knowledge templet, by mating of user's request and information in concordance list, returns to the corresponding knowledge entry of user;
(4) newly-built knowledge templet field;
(5) table name in the database selecting this field corresponding and row name;
(6) configure the mapping relations of this field and index master meter: the method adopting iteration to configure progressively is gone forward one by one, and is configured by logical relation, guide user mapping relations to be stored in template base between the two.
Beneficial effect of the present invention:
(1) by the Deep integrating to heterogeneous database, only need carry out single configuration when importing database, the details of degree of depth Extracting Knowledge can be continued on the basis of preliminary search, overcome traditional integration platform and can not meet the defect of user to the further demand of knowledge details in information retrieval.
(2) by the mechanism of flexible configuration and storage, when information change such as the database structures in the integration environment, support that user configures dynamically and changes relevant parameter, overcome the defect that traditional integration platform excessively relies on manual maintenance and debugging, effectively improve practicality and the stability of integrated platform.
(3) by the information of the integrated isomerous database of configurable template, realize the knowledge organization pattern of user interactions, overcome the information redundancy that Knowledge Aggregation causes.
Accompanying drawing explanation
Fig. 1 is knowledge organization structure guiding flow figure;
Fig. 2 is data source information-knowledge organization structural allocation process;
Fig. 3 is knowledge organization structure-knowledge templet layoutprocedure;
Fig. 4 is cross search towards master index table and the results list.
Embodiment
(1) building process of configurable knowledge templet
The structure of configurable knowledge templet relates to three key elements: data source essential information, knowledge organization structure and knowledge templet.Data source essential information comprises each type treating integrated data base (Oracle, Mysql, SQLServer etc.), URL, user name, passwords etc., and this method can arrange corresponding link method according to the dissimilar of database.Knowledge organization structure refers to, by reading the data structure of each database, intercepting partial information wherein, reconfiguring according to user's request, tissue becomes new Knowledge framework.Knowledge templet refers to, according to knowledge organization structure from template attributes type list configuration generate template, be used to user show integrated after knowledge content and structure, the structure rule of these templates will be stored in template base.Wherein template attributes type list defines knowledge templet and configures the attribute type scope that can select, and comprises numeral, single file text, multiline text, time, picture, file, video, form etc.As shown in Figure 2,3, this interactive relationship illustrates the detailed process that template builds to interactive relationship between these three key elements, is divided into 3 steps:
Step 1: newly-built knowledge templet.According to user's request, from each database, obtain corresponding field information, and record corresponding title, become new knowledge organization structure.Detailed layoutprocedure is as shown in Figure 1:
1. import need the essential information in integrated data source;
2. newly-built knowledge templet;
3. select concordance list; The corresponding concordance list of each knowledge templet, by mating of user's request and information in concordance list, returns to the corresponding knowledge entry of user.
4. newly-built knowledge templet field;
5. the table name in the database selecting this field corresponding and row name;
6. configure the mapping relations of this field and index master meter: in knowledge templet, each field and concordance list may derive from different databases, even if in same database, also multiple middle table may be there are, cause the path of link field and concordance list to be uncertain, therefore, this method adopts the method for iteration configuration, as shown in Figure 1, progressively go forward one by one, configured by logical relation, guide user mapping relations to be stored in template base between the two.
Step 2: configuration template attribute.Each field in knowledge organization structure and the attribute type in template attributes type list are matched.As design parameter comprises title, creation-time, description and parameter list, then its attribute type quoted is text, time, multiline text and form.By field name and quote attribute type numbering bind together, form one group of " title-type " sequence, form corresponding knowledge templet.
Step 3: generate knowledge templet interface." title-type " sequence corresponding according to template, generates patterned knowledge representation interface automatically, and this interface is user oriented knowledge details and shows interface.
(2) cross search and knowledge representation
Completed the structure of knowledge templet by above series of steps, next by cross search technology, the demand of user is mated with each database, returns corresponding information, and undertaken organizing and expressing by knowledge templet.
The corresponding concordance list (master meter) of each knowledge templet, the field information in knowledge templet can derive from different databases, but the Search Requirement of user is only mated with concordance list.The mechanism of cross search is exactly the input of user mated with each concordance list, and then, by the concordance list that records in knowledge templet and other incidence relations shown, by other information interceptions out.When showing the incidence relation between table and occurring the situation of " one-to-many ", the entry that user's matching degree is the highest can be returned to, with tense marker and reminding user exists multiple result herein, check that if want other results please arrive target database and continue to search.
Different from the retrieval of centralized database, cross search towards be all integrated databases, the concordance list in each knowledge templet will be retrieved, and they are stored in different data sources.If connect one to analyze one in retrieving, take so-called " serial analysis " method, concerning connection two databases, may also imperceptible response speed, if data source to be retrieved reaches tens even several tens, so the delay of searching system can for a few minutes.This is because the connection of database will consume a large amount of system resource, the method for multithreading therefore to be taked to realize parallel search.The demand of user is linked all concordance lists by multithreading simultaneously, after having mated, returns to user search the results list.
Returned the essential information (being stored in the information in concordance list) of knowledge by cross search to user, when user selects a knowledge to check, two processing procedures will be triggered simultaneously: depth search and attribute type coupling.Depth search reads the knowledge organization structure recorded in template base, carries out quadratic search according to the incidence relation between each field and concordance list; Attribute type coupling is by expression way corresponding in each fields match.Depth search is the particular content obtaining knowledge, and attribute type coupling is the carrier forming these contents.Knowledge representation is inserted by the particular content of depth search in corresponding attribute type displaying control, and the final knowledge instance that generates consults reference for designer.

Claims (3)

1., based on a heterogeneous databases integration method for configurable template, it is characterized in that, comprise the following steps:
Step one, input treat the essential information in integrated data source, and newly-built knowledge templet, configures the incidence relation of each field in data source, forms new knowledge organization structure;
Step 2, be each fields match attribute type in newly-built knowledge templet, form the displaying interface of knowledge templet;
Step 3, by carrying out cross search to the concordance list of each knowledge templet, returning retrieval list, clicking based on user, carrying out depth search according to the institutional framework of knowledge templet belonging to this entry, and with affiliated knowledge templet for vector expression detailed content.
Since then, just complete/achieve the heterogeneous databases integration method based on configurable template.
2. a kind of heterogeneous databases integration method based on configurable template as claimed in claim 1, it is characterized in that, the structure of configurable template comprises three key elements: data source essential information, knowledge organization structure and knowledge templet; Data source essential information comprises each type treating integrated data base, URL, user name, password, arranges corresponding link method according to the dissimilar of database; Knowledge organization structure, by reading the data structure of each database, intercepts partial information wherein, reconfigures according to user's request, and tissue becomes new Knowledge framework; The template that knowledge templet generates from the configuration of template attributes type list according to knowledge organization structure, be used to user show integrated after knowledge content and structure, and the structure rule of template will to be stored in template base.
3. a kind of heterogeneous databases integration method based on configurable template as claimed in claim 1, it is characterized in that, further, in step one, layoutprocedure is as follows:
(1) import need the essential information in integrated data source;
(2) newly-built knowledge templet;
(3) concordance list is selected; The corresponding concordance list of each knowledge templet, by mating of user's request and information in concordance list, returns to the corresponding knowledge entry of user;
(4) newly-built knowledge templet field;
(5) table name in the database selecting this field corresponding and row name;
(6) configure the mapping relations of this field and index master meter: the method adopting iteration to configure progressively is gone forward one by one, and is configured by logical relation, guide user mapping relations to be stored in template base between the two.
CN201510292059.7A 2015-06-01 2015-06-01 Heterogeneous databases integration method based on configurable template Active CN104866598B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201510292059.7A CN104866598B (en) 2015-06-01 2015-06-01 Heterogeneous databases integration method based on configurable template

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201510292059.7A CN104866598B (en) 2015-06-01 2015-06-01 Heterogeneous databases integration method based on configurable template

Publications (2)

Publication Number Publication Date
CN104866598A true CN104866598A (en) 2015-08-26
CN104866598B CN104866598B (en) 2018-05-08

Family

ID=53912424

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201510292059.7A Active CN104866598B (en) 2015-06-01 2015-06-01 Heterogeneous databases integration method based on configurable template

Country Status (1)

Country Link
CN (1) CN104866598B (en)

Cited By (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108170656A (en) * 2017-12-28 2018-06-15 阿里巴巴集团控股有限公司 Template establishment method, document creating method, rendering intent and device
CN108614874A (en) * 2018-04-25 2018-10-02 华中科技大学 A kind of multiple database Flexible Integration Method based on SQL Server
CN109473178A (en) * 2018-11-12 2019-03-15 北京懿医云科技有限公司 Method, system, equipment and the storage medium of medical data integration
CN109492059A (en) * 2019-01-03 2019-03-19 北京理工大学 A kind of multi-source heterogeneous data fusion and Modifying model process management and control method
CN109815109A (en) * 2018-12-11 2019-05-28 口碑(上海)信息技术有限公司 Data pattern alteration detection method, apparatus, equipment and readable storage medium storing program for executing
CN110750973A (en) * 2019-09-02 2020-02-04 北京东软望海科技有限公司 Dynamic template configuration method and system
CN111124805A (en) * 2019-11-25 2020-05-08 中国联合网络通信集团有限公司 Data acquisition method, device, equipment and storage medium

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20060095414A1 (en) * 2004-10-26 2006-05-04 Via Technologies, Inc. System and method for integrating and transmitting data
CN101149748A (en) * 2007-10-29 2008-03-26 浙江大学 Editing method of semantic mapping information between ontology schema and relational database schema
CN101169780A (en) * 2006-10-25 2008-04-30 华为技术有限公司 Semantic ontology retrieval system and method
CN101187937A (en) * 2007-10-30 2008-05-28 北京航空航天大学 Mode multiplexing isomerous database access and integration method under gridding environment
CN102508706A (en) * 2011-11-18 2012-06-20 北京航空航天大学 Multi-source data integrating platform and establishing method thereof
CN103839138A (en) * 2014-03-08 2014-06-04 成都文昊科技有限公司 System for supporting interaction of multiple heterogeneous systems

Patent Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20060095414A1 (en) * 2004-10-26 2006-05-04 Via Technologies, Inc. System and method for integrating and transmitting data
CN101169780A (en) * 2006-10-25 2008-04-30 华为技术有限公司 Semantic ontology retrieval system and method
CN101149748A (en) * 2007-10-29 2008-03-26 浙江大学 Editing method of semantic mapping information between ontology schema and relational database schema
CN101187937A (en) * 2007-10-30 2008-05-28 北京航空航天大学 Mode multiplexing isomerous database access and integration method under gridding environment
CN102508706A (en) * 2011-11-18 2012-06-20 北京航空航天大学 Multi-source data integrating platform and establishing method thereof
CN103839138A (en) * 2014-03-08 2014-06-04 成都文昊科技有限公司 System for supporting interaction of multiple heterogeneous systems

Non-Patent Citations (4)

* Cited by examiner, † Cited by third party
Title
S.C.BRANDT等: "An ontology-based approach to knowledge management in design processes", 《COMPUTERS AND CHEMICAL ENGINEERING》 *
于琦: "基于本体的异构数据源模式集成研究", 《中国优秀硕士学位论文全文数据库 信息科技辑》 *
张军艳等: "基于本体的语义异构数据集成方法研究", 《信息技术》 *
朱利等: "实时异构集成数据自适应模板解析算法", 《计算机工程与应用》 *

Cited By (10)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108170656A (en) * 2017-12-28 2018-06-15 阿里巴巴集团控股有限公司 Template establishment method, document creating method, rendering intent and device
CN108614874A (en) * 2018-04-25 2018-10-02 华中科技大学 A kind of multiple database Flexible Integration Method based on SQL Server
CN108614874B (en) * 2018-04-25 2021-05-18 华中科技大学 Multi-database flexible integration method based on SQL Server
CN109473178A (en) * 2018-11-12 2019-03-15 北京懿医云科技有限公司 Method, system, equipment and the storage medium of medical data integration
CN109473178B (en) * 2018-11-12 2022-04-01 北京懿医云科技有限公司 Method, system, device and storage medium for medical data integration
CN109815109A (en) * 2018-12-11 2019-05-28 口碑(上海)信息技术有限公司 Data pattern alteration detection method, apparatus, equipment and readable storage medium storing program for executing
CN109492059A (en) * 2019-01-03 2019-03-19 北京理工大学 A kind of multi-source heterogeneous data fusion and Modifying model process management and control method
CN109492059B (en) * 2019-01-03 2020-10-27 北京理工大学 Multi-source heterogeneous data fusion and model correction process control method
CN110750973A (en) * 2019-09-02 2020-02-04 北京东软望海科技有限公司 Dynamic template configuration method and system
CN111124805A (en) * 2019-11-25 2020-05-08 中国联合网络通信集团有限公司 Data acquisition method, device, equipment and storage medium

Also Published As

Publication number Publication date
CN104866598B (en) 2018-05-08

Similar Documents

Publication Publication Date Title
US11003645B1 (en) Column lineage for resource dependency system and graphical user interface
CN104866598A (en) Heterogeneous database integrating method based on configurable templates
US9116975B2 (en) Systems and user interfaces for dynamic and interactive simultaneous querying of multiple data stores
US9377936B2 (en) Framework for automated storage processes and flexible workflow
CN107193967A (en) A kind of multi-source heterogeneous industry field big data handles full link solution
KR101312848B1 (en) Browse mode designer
US10579678B2 (en) Dynamic hierarchy generation based on graph data
CN104965886B (en) Data dimension processing method
EP2997513A1 (en) Supporting combination of flow based etl and entity relationship based etl
JP6132698B2 (en) Tabular multidimensional data conversion method and apparatus
CN100474318C (en) Automatic generation system for designing BOM
CN110442620B (en) Big data exploration and cognition method, device, equipment and computer storage medium
CN104102652A (en) Unstructured data storage system and method
CN108536718A (en) A kind of method and system for the IT application in management realized based on input and output semantization
CN114218218A (en) Data processing method, device and equipment based on data warehouse and storage medium
US10552423B2 (en) Semantic tagging of nodes
US10776351B2 (en) Automatic core data service view generator
CN116662441A (en) Distributed data blood margin construction and display method
EP3721354A1 (en) Systems and methods for querying databases using interactive search paths
CN102314514B (en) Scoping method of table data structuration
Delchev et al. Big Data Analysis Architecture
CN104537047B (en) A kind of clothes basic pattern plate searching system based on Lucene
CN112579706A (en) Data warehouse model and application thereof
Madhikerrni et al. Data discovery method for Extract-Transform-Load
CN114661704B (en) Data resource full life cycle management method, system, terminal and medium

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
EXSB Decision made by sipo to initiate substantive examination
SE01 Entry into force of request for substantive examination
CB03 Change of inventor or designer information

Inventor after: Hao Jia

Inventor after: Yan Yan

Inventor after: Xu Zheying

Inventor after: Ming Zhenjun

Inventor before: Xu Zheying

Inventor before: Yan Yan

Inventor before: Hao Jia

Inventor before: Ming Zhenjun

CB03 Change of inventor or designer information
GR01 Patent grant
GR01 Patent grant