CN102023982A - Data integration platform - Google Patents

Publication number
CN102023982A
Authority
CN
China
Prior art keywords
result
data
data integration
module
service
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN2009101619747A
Other languages
Chinese (zh)
Inventor
白晓颖
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Tsinghua University
Original Assignee
Tsinghua University
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Tsinghua University filed Critical Tsinghua University
Priority to CN2009101619747A priority Critical patent/CN102023982A/en
Publication of CN102023982A publication Critical patent/CN102023982A/en
Pending legal-status Critical Current

Landscapes

  • Information Transfer Between Computers (AREA)

Abstract

The invention provides a data integration platform comprising a query processing module and a response collecting module. The query processing module receives queries and translates them into forms suitable for querying various sources; the response collecting module collects results from the various sources according to the translated queries and provides those results. The invention also provides a data integration method based on the data integration platform.

Description

Data integration platform
Technical field
The present disclosure relates generally to the field of data integration platforms.
Background technology
As more and more data becomes available from networks such as the Internet through portal websites, programmable interfaces and the like, Internet-based integration of databases and data is becoming increasingly important. However, heterogeneity in the form, structure and semantics of data makes it difficult to obtain and integrate diverse data. Because processing methods differ, data provided on the Internet by different organizations is difficult to compare, and inconsistencies in technical terms and naming conventions can cause data conflicts. Some data even has serious quality problems, such as noise and unreliability. Such data may also lack the cross-references and annotations needed to support cross-platform integrated analysis in bioinformatics.
Summary of the invention
Present disclosure has been described a kind of embodiment of data processing platform (DPP), comprises query processing module and response collecting module.This query processing module receives inquiry, and this query translation is become to be applicable to the form that various data sources are inquired about.Response collecting module is collected the result from various data sources, and final comprehensive Query Result is provided.
Present disclosure has been described a kind of data integrating method, may further comprise the steps: receive inquiry and this query translation is become to be applicable to the form that various data sources are inquired about; And inquire about according to the query statement of being translated, collect the result from each provenance, and comprehensive result is provided.
More than be summary, therefore as required, ins and outs carried out simplification, summary and omission; This area professional and technical personnel should understand that this summary is exemplary, is not intended to constitute any restriction.Other aspects of described device and/or process, feature, advantage and/or other themes described herein will be by becoming clear gradually in this paper subsequent technology is described.This summary has been selected the part notion, is introduced in a simplified manner, and these notions will further describe in follow-up particular content.This summary clearly shows the key feature or the essential feature of claimed subject unintentionally, also and be not used in the auxiliary scope that limits theme required for protection.
Description of drawings
The above and other features of the present disclosure will become clearer from the following description, the claims and the accompanying drawings. It should be understood that these drawings depict only several embodiments of the present disclosure and are therefore not to be considered limiting of its scope; the disclosure will be described with additional specificity and detail with reference to these drawings.
Fig. 1 shows a schematic overview of a data integration platform (DIP) 100 according to an exemplary embodiment;
Fig. 2 shows a schematic diagram of the data integration platform (DIP) 100 according to an exemplary embodiment;
Fig. 3 shows a schematic diagram of a functional annotation process according to an exemplary embodiment;
Fig. 4 shows a schematic diagram of the three-layer functional structure of the DIP according to an exemplary embodiment;
Fig. 5 shows a schematic diagram of a data integration method based on multiple DIPs according to an exemplary embodiment;
Fig. 6 shows a result presentation view according to an exemplary embodiment.
Detailed description of the embodiments
In the following detailed description, reference is made to the accompanying drawings, which form a part of the present disclosure. In the drawings, similar symbols generally identify similar parts unless the context indicates otherwise. The illustrative embodiments described in the detailed description, drawings and claims are not meant to be limiting. Other embodiments may be utilized, and other changes may be made, without departing from the spirit and scope of the subject matter presented herein. It will be readily understood that the aspects of the present disclosure, as generally described herein and illustrated in the drawings, can be arranged, substituted, combined and designed in a wide variety of configurations, all of which are explicitly contemplated and form part of this disclosure.
The present disclosure relates generally to methods, devices, computer programs, computer-readable media storing computer programs, and systems related to a data integration platform.
Fig. 1 shows a schematic overview of the data integration platform (DIP) 100. The DIP 100 comprises a query processing module 101, which is configured to receive an original query 104 and translate it into forms suitable for querying the various sources 103. The DIP 100 also comprises a response collecting module 102, which is configured to collect results 105 from the various sources 103 according to the translated query and to provide a result 106. The original query may be received from a client computing device 108.
In an exemplary embodiment, as shown in Fig. 2, the query processing module 101 comprises a query translator (QT) 201. The response collecting module 102 comprises a query bridge (QB) 202 and a result aggregator (RA) 203. The query translator 201 receives the original query 104, such as a keyword query, and translates it into an internal representation or unified form, for example by means of a request parser. The translated query is then sent to the query bridge 202.
The query bridge 202 comprises an adapter 2021, a query invoker 2022 and a collector 2023. Based on information about each data source (for example, its basic query schema and its data retrieval methods), the adapter 2021 generates the correct or specific query statement or expression required by the application interface, or a specific query statement suited to the description format and semantics of the particular data source. The generated query statements are sent to the query invoker 2022, which calls the query services provided by the individual database systems. The query invoker 2022 performs distribution; that is, it passes each generated query statement to the data source it targets or corresponds to. The data sources 206, 207 and 208 each receive their corresponding query statement and run the query locally. The collector 2023 collects results in various ways from sources such as the database 206, the API 207 and the web service 208. For example, the collector 2023 may be configured with a web crawling module that crawls results from the Internet, a local area network or a database. The collector 2023 may also be configured to collect results through programmable access points such as the API 207 and the web service 208. The adapter 2021 also converts the structure and form of the heterogeneous query results into the unified internal representation. The adapter 2021 may be configured specifically for each known source. Results obtained from multiple data sources may take various forms, such as text, charts, hyperlink URIs, and data tables that can be converted into HTML table objects.
The result aggregator 203 processes the query results; for example, it analyzes the consistency of cross-source query results, identifies conflicting and contradictory results, filters duplicate records, and generates a consolidated result set, i.e. the result 106.
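The flow through the query bridge and the result aggregation step can be illustrated with a minimal Python sketch. Everything in it is an assumption made for illustration: the two stand-in sources, their row shapes, the table name `expr`, and the rule that a gene reported with different values counts as a conflict. It is not the patent's implementation.

```python
from dataclasses import dataclass

@dataclass(frozen=True)
class Record:
    source: str
    gene: str
    value: float

# Stand-ins for real data sources (hypothetical): each accepts a
# source-specific query and returns rows in its own native shape.
def database_source(query):
    return [{"gene": "PTGS2", "value": 1.5}, {"gene": "PTGS2", "value": 1.5}]

def api_source(query):
    return [{"symbol": "PTGS2", "expr": 2.0}]

# Adapter knowledge per source: how to build its query, how to call it,
# and how to normalize one of its rows into (gene, value).
ADAPTERS = {
    "database": (
        lambda kw: ("SELECT gene, value FROM expr WHERE gene = ?", (kw,)),
        database_source,
        lambda row: (row["gene"], row["value"]),
    ),
    "api": (
        lambda kw: {"symbol": kw},
        api_source,
        lambda row: (row["symbol"], row["expr"]),
    ),
}

def collect(keyword):
    records = []
    for source, (make_query, invoke, normalize) in ADAPTERS.items():
        query = make_query(keyword)        # adapter: source-specific query
        for row in invoke(query):          # invoker dispatches, collector gathers
            gene, value = normalize(row)   # adapter: unified representation
            records.append(Record(source, gene, value))
    return records

def aggregate(records):
    unique = list(dict.fromkeys(records))  # filter exact duplicates, keep order
    values_by_gene = {}
    for record in unique:
        values_by_gene.setdefault(record.gene, set()).add(record.value)
    conflicts = sorted(g for g, vals in values_by_gene.items() if len(vals) > 1)
    return unique, conflicts
```

Calling `aggregate(collect("PTGS2"))` drops the duplicated database row and reports `PTGS2` as conflicting, because the two stand-in sources disagree on its value.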
The DIP may also include a domain knowledge base 205 that assists the DIP in querying. The knowledge base may contain query keyword data, which helps the adapter 2021 generate correct query statements for the different sources. Taking complex ID systems as an example, the knowledge base holds the mapping relations between the different gene IDs used in the various data sources, so that only one ID (a gene identifier) needs to be entered and the adapter 2021 automatically generates database query statements with the correct, source-specific keywords. After the multiple queries, data in different forms is returned to the requester; the domain knowledge base 205 helps to "understand" the semantics of the results and combine them into a standardized result. It will be appreciated that this feature may be implemented using a mapping system provided by a background application. For example, an ontology may be used to provide a model of the unified concepts of the domain. In a training process, keywords are attached to the concept ontology through mappings, and the mappings are kept up to date by learning from the tracked usage history. One keyword may be mapped to multiple ontology concepts.
Thus, for a keyword submitted online, the background mapping system first analyzes the keyword to identify its unified ontology concept, and then maps it to the corresponding keywords for that same ontology concept in the different databases.
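The two-stage mapping, from a submitted keyword to ontology concepts and then to per-database keywords, can be sketched as follows. The concept identifiers, the database names `db_a` and `db_b`, and the terms each database is assumed to use are hypothetical; the sketch only illustrates the lookup structure, including the case where one keyword maps to several concepts.

```python
# Hypothetical mapping tables; concept IDs and database names are invented.
KEYWORD_TO_CONCEPTS = {
    "cox-2": {"concept:PTGS2"},
    "ptgs2": {"concept:PTGS2"},
    "cox": {"concept:PTGS1", "concept:PTGS2"},  # one keyword, several concepts
}

CONCEPT_TO_SOURCE_KEYWORD = {
    "concept:PTGS1": {"db_a": "PTGS1", "db_b": "COX1"},
    "concept:PTGS2": {"db_a": "PTGS2", "db_b": "COX2"},
}

def map_keyword(keyword):
    """Return, per database, the keywords that correspond to the input."""
    concepts = KEYWORD_TO_CONCEPTS.get(keyword.strip().lower(), set())
    per_source = {}
    for concept in sorted(concepts):  # sorted for a deterministic order
        for source, term in CONCEPT_TO_SOURCE_KEYWORD[concept].items():
            per_source.setdefault(source, []).append(term)
    return per_source
```

An unknown keyword yields an empty mapping, which a caller could treat as a signal to fall back to the raw keyword.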
With the aid of this knowledge, the DIP has a degree of intelligent analysis capability: it can handle inputs of keyword-like terms and automatically map them to the corresponding keywords of different websites/databases. The data from these multiple data sources can then be offered to scientists for integrated analysis.
In the above description, the DIP can accept query requests of different kinds, including requests for one data type or for multiple data types, and for a single platform or across platforms. With the support of the knowledge system or domain knowledge base 205, a query can also be posed in the form of a biological model, which represents all related information, including basic gene information, experimental results, functional annotations and derived conclusions. A submitted query may first be converted into a unified representation, for example one in XML format. The unified query is then translated into different forms according to the interface definitions of the different Internet databases.
The DIP contains knowledge of the conversion rules, for example whether a target interface is standard SQL or a parameterized API. One assumption here is that the Internet databases follow certain standard interfaces.
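As a rough illustration of translating a unified XML query into source-specific forms, the sketch below parses a small XML document and emits both a parameterized SQL statement and an API request path. The XML element names, the table name `expression` and the `/search` endpoint are assumptions chosen for the example, not part of the disclosure.

```python
import xml.etree.ElementTree as ET
from urllib.parse import urlencode

# Hypothetical unified representation of a submitted query.
UNIFIED_QUERY = "<query><keyword>PTGS2</keyword><platform>U95</platform></query>"

def parse_unified(xml_text):
    """Read the unified XML query into a plain dictionary."""
    root = ET.fromstring(xml_text)
    return {child.tag: (child.text or "") for child in root}

def to_sql(query):
    """Conversion rule for a source that exposes a standard SQL interface."""
    statement = "SELECT * FROM expression WHERE gene = ? AND platform = ?"
    return statement, (query["keyword"], query["platform"])

def to_api(query):
    """Conversion rule for a source that exposes a parameterized API."""
    params = {"gene": query["keyword"], "platform": query["platform"]}
    return "/search?" + urlencode(params)
```

Keeping one conversion function per interface type means a new source only needs a new rule, while the unified representation stays unchanged.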
During data integration, a controller may be provided to control the operation of one or both modules. The DIP may therefore optionally include a process management module. The process management module receives instructions from an external device and executes them to direct the operation of individual modules such as the query translator 201, the adapter 2021 and the result aggregator 203.
In some embodiments, the adapter may further comprise a service mashup module and may perform process-context-aware data integration. A process consists of data processing and analysis operations. For example, when the data integration platform described above is used in the biological field, data services and functional annotation services can be integrated together into a process by means of a service mashup. As shown in Fig. 3, the service mashup module may comprise three parts: 1) a function service pool 301, in which service descriptions are stored; 2) a process definition 302, which organizes services into a process and provides the process specification; and 3) a process engine 303, which binds and invokes the services specified in the process. When a process is invoked, the process engine 303 loads the process definition 302 and looks up the services in the function service pool 301. Binding and invocation are performed dynamically during process execution. The process definition 302 defines the data flow and control flow between the different function services. For example, two services SA and SB are invoked in sequence, and the output of service SA is the input of the service SB that immediately follows. Several standards exist for such workflow and data specifications, for example BPEL4WS and OWL-S for workflow specification, and SCA/SDO for data flow specification.
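The interplay of service pool, process definition and process engine can be sketched in a few lines of Python. The two services named SA and SB follow the example in the text, but their behavior here (upper-casing gene names, then computing name lengths) is invented purely so the sketch is runnable; a real process definition would be an XML document rather than a Python list.

```python
# Function service pool: service name -> callable. The lambdas are
# hypothetical stand-ins for remote services.
SERVICE_POOL = {
    "SA": lambda genes: [g.upper() for g in genes],
    "SB": lambda genes: {g: len(g) for g in genes},
}

# Process definition: services invoked in sequence, where the output of
# each service is the input of the next.
PROCESS_DEFINITION = ["SA", "SB"]

def run_process(definition, pool, data):
    """Minimal process engine: look up, bind and invoke each service."""
    for name in definition:
        service = pool[name]   # binding happens at execution time
        data = service(data)   # data flow: output becomes next input
    return data
```

Because the engine resolves each name against the pool only when the step runs, a service can be replaced in the pool without changing the process definition.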
The function service pool 301 contains descriptions of functional annotation services and tools. Tools in the pool 301 are wrapped as web services in order to support dynamic binding. Every service, including third-party functional annotation services and wrapped services, can be described using a standard specification, for example OWL-S. In OWL-S, the "ServiceProfile" provides a high-level specification of the service and the service provider, used to publish, advertise, invoke and match services; this specification includes a brief service description, the service functions and functional attributes. The "ServiceGrounding" defines the mapping from the abstract description to the concrete implementation; this service implementation specification gives the details of accessing the service, such as protocol, message format, serialization, transport and addressing. The "ServiceModel" describes the main functions of the service, to enable operations such as invocation, composition and monitoring of the service. OWL-S also includes the description of an "atomic process", which can be used to wrap a single service or a service sequence composed of several services. That is, the function service pool 301 maintains all services and tools as "atomic processes", on the basis of which the process engine 303 performs binding and invocation. For service pool management, the pool 301 may maintain a list of URLs of the available services. It may maintain links, classify services, bind to services on demand, and so on. The process engine 303 interprets the process specification. For each function in the process definition, it finds a concrete service in the pool 301, then binds, invokes and executes that service.
The process definition 302 is written in XML. The process definition 302 extends the OWL-S language to support dynamic binding. Like OWL-S, the process definition 302 describes process composition, and OWL-S provides a good reference for describing composite processes. A composite process can be decomposed into other (non-composite or composite) processes, and the decomposition can be specified using control structures such as Sequence and If-Then-Else. Unlike in OWL-S, a non-decomposable process in the process definition 302 is not an "atomic process". In other words, an OWL-S process specification can be seen as a set of "atomic processes" and "composite processes" together with some control structure information, whereas the process definition 302 comprises control structures, "composite processes" and "AtomicServiceStubs" (instead of the "atomic processes" of OWL-S). An "AtomicServiceStub" specifies a service type, and a service of that type can be provided by any of a group of mutually interchangeable services with the same function. For example, Google and Yahoo both provide Internet search services, so these two applications can be classified under one "AtomicServiceStub".
The process engine 303 is an extended OWL-S process execution engine used to construct and execute service processes. According to the process definition 302, the process engine decides which services need to be bound and invoked. At each step, there are two ways of selecting the service to bind and invoke: 1) external decision, in which the engine receives a service selection command from the process management module; and 2) engine decision. An external decision is regarded as an external preference: the outside party has the privilege of choosing the service, and this choice has higher priority than the engine decision. The engine decision can provide context-aware computation, where the service context includes service load, quality of service and other service-related information. Selecting functional annotation services is an interactive, iterative process with the outside party, which can decide which service to select based on previous results and user preferences.
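The selection rule, in which an external choice takes priority over the engine's own context-aware decision, can be sketched as below. The stub name `search`, the candidate names `service_x` and `service_y`, and the use of load as the only context signal are assumptions for illustration; a real engine could also weigh quality of service and other information.

```python
from dataclasses import dataclass

@dataclass
class Candidate:
    name: str
    load: float  # current load between 0.0 and 1.0 (hypothetical metric)

# One stub groups interchangeable services that offer the same function.
STUBS = {
    "search": [Candidate("service_x", 0.8), Candidate("service_y", 0.3)],
}

def select_service(stub, external_choice=None):
    """Pick the service to bind for a stub.

    An external choice, if given, has priority over the engine decision.
    Otherwise the engine picks the least-loaded candidate.
    """
    candidates = STUBS[stub]
    if external_choice is not None:
        for candidate in candidates:
            if candidate.name == external_choice:
                return candidate.name
        raise ValueError(f"{external_choice!r} does not implement {stub!r}")
    return min(candidates, key=lambda c: c.load).name
```

Rejecting an external choice that is not registered under the stub keeps the binding within the group of interchangeable services.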
The DIP may run on a client computing device. It may also run over the Internet, for example as a Web 2.0/Web 3.0 application.
Fig. 4 shows a schematic diagram of the three-layer functional structure of the DIP according to an exemplary embodiment. The three layers are a data layer 401, used for metadata and vocabulary management; a knowledge layer 402, used for information modeling, analysis and knowledge discovery; and a service layer 403, used to provide external services. In the data layer 401, metadata management and processing are carried out, such as metadata management 4012 and metadata mapping 4013. For example, a metadata-based model is defined against a standard vocabulary 4011. A data provider can register and supply data content compatible with the metadata, or inform the DIP of the mapping rules from its local vocabulary to the central standard. The knowledge layer 402 provides different views of the information, including a direct view 4024 for data retrieval, a model library view 4023 that links and integrates data based on specific "rules" (such as the "central dogma"), and a usage profile view 4025 with statistics and personal information. Throughout the life cycle of the information, quality control mechanisms such as data provenance 4021 and classification 4022 are applied to all information. Based on the extracted information, knowledge 4029 is mined 4026, discovered 4027 and managed 4028. In the service layer 403, services related to data registration 4031 are offered to data providers, together with services such as data classification 4032, modeling 4033 and annotation 4034, so that the data can be better understood. Services related to queries 4035 are provided through a personalization agent 4036, which can identify personal interests and accumulate knowledge for personalized service. Such a service triggers an interpreter 4037 to interpret the query and triggers the integration 4038 of the returned results.
Fig. 5 shows a schematic diagram of a data integration method based on multiple DIPs. As shown in the figure, a provider or registrant 5014 registers a data source 5012 with the DIP 501 for publication and retrieval. The provider can also use a service wrapper 5013 to convert its data into the canonical form defined by the DIP 501. Wrapping is a widely used design pattern in software development. When different software components or subsystems are integrated, inconsistent interface definitions, such as differing function names, parameter names or semantics, make interoperation between them difficult. A simple way to solve this problem is to add an interface wrapper module between the communicating interfaces; this module converts the interfaces into a mutually consistent form.
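The wrapper idea can be shown with a small Python sketch in which a provider's native interface uses its own function and field names and a wrapper exposes the form the platform expects. The class names, the method `fetch_by_symbol` and the field names are all hypothetical.

```python
class ProviderSource:
    """Hypothetical provider with its own function and field names."""

    def fetch_by_symbol(self, sym):
        return [{"sym": sym, "val": 3.2}]


class DipWrapper:
    """Interface wrapper: exposes the provider through a canonical
    query(gene) call and canonical field names (assumed for this sketch)."""

    def __init__(self, provider):
        self._provider = provider

    def query(self, gene):
        rows = self._provider.fetch_by_symbol(gene)
        # Rename provider-specific fields to the canonical ones.
        return [{"gene": row["sym"], "value": row["val"]} for row in rows]
```

The platform only ever calls `query`, so a provider can change its native interface and update its wrapper without affecting other sources.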
The provider or registrant can also submit the conversion rules to the DIP 501 together with its data schema information, leaving the conversion work to the DIP 501. The DIP 501 can also obtain a data source 5015 by crawling the Internet, for example using a web crawling module. In this mode, the data provider is required to expose a standard service to the DIP 501 so that the DIP 501 can understand its data format and semantics.
The DIP 501 can return responses in various ways, including asynchronously. For example, event-based notification and topic subscription are both asynchronous techniques that can be used in constructing a DIP.
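One way to picture an asynchronous, topic-based response is the following sketch built on Python's `asyncio`. The `TopicBus` class, the topic name `query/42` and the shape of the result message are assumptions for illustration; the disclosure does not prescribe a particular messaging mechanism.

```python
import asyncio
from collections import defaultdict

class TopicBus:
    """Tiny in-process publish/subscribe bus (illustrative only)."""

    def __init__(self):
        self._subscribers = defaultdict(list)

    def subscribe(self, topic):
        queue = asyncio.Queue()
        self._subscribers[topic].append(queue)
        return queue

    async def publish(self, topic, message):
        for queue in self._subscribers[topic]:
            await queue.put(message)

async def demo():
    bus = TopicBus()
    inbox = bus.subscribe("query/42")  # requester subscribes to its topic

    async def slow_query():
        await asyncio.sleep(0.01)      # stands in for remote data sources
        await bus.publish("query/42", {"gene": "PTGS2", "rows": 3})

    task = asyncio.create_task(slow_query())
    result = await inbox.get()         # requester is notified when ready
    await task
    return result
```

Running `asyncio.run(demo())` returns the published message once the simulated query completes, without the requester blocking on the data sources directly.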
In Fig. 5, a group of DIPs (501, 502, 503) is combined; the group is dynamic and extensible. The DIPs (501, 502, 503) can divide work and cooperate based on standard protocols. To avoid communication and processing bottlenecks, each DIP can carry out assigned tasks and provide data services on a small scale. For example, the DIPs (501, 502, 503) can provide different services classified by domain: one DIP may focus on high-throughput gene data services, while another focuses on compound data services. Each DIP (501, 502, 503) can hold information about the other DIPs and establish cooperation at run time through negotiation. DIPs can flexibly join or leave the cooperation.
The present disclosure also provides a data integration method. Referring to Fig. 2, the query translator 201 receives a query 104, such as a keyword query, and converts it into an internal representation or canonical form. The adapter 2021 further converts this internal representation into the specific query statements or expressions required by the application interfaces. The query invoker 2022 distributes the specific query statements to the individual data source systems. The collector 2023 collects results from the Internet and local networks, or through programmable interfaces, and the adapter 2021 converts these results into a unified format. The result aggregator 203 then processes the results further; for example, it can analyze their consistency, identify conflicting and contradictory results, filter duplicate records, and produce a consolidated result set. In addition, personalized services can be provided, for example through a personal agent that processes or presents the query or results according to personal interests and transaction history.
User instructions can be accepted in order to manage the operation of each module during data integration. The method also includes providing process-context-aware data integration, for example through the service mashup module described with reference to Fig. 3.
Example
An implementation in the biological field is described below with reference to Fig. 6, which shows a result presentation view.
A user wishes to analyze the therapeutic effect of a drug and the genes it affects, and accesses the system through an external device. Starting from a query, the user can at any time use the external device to select the genes to examine and the functional annotation services to invoke. The data integration platform currently supports a line chart view and a table view of query results. It provides connections to two services: 1) the KEGG pathway service, which helps the user understand behaviors and biological effects in the processes of life activity; and 2) the DAVID/GO service, which provides gene information including chromosome, location, disease, alias, pathway name, etc. The user can query the database with the default keyword "PTGS2" and the default platform filter "U95"; "U95", from
Figure B2009101619747D0000081
Company, is one of the widely used chip series. The formal gene name can also be entered, and the platform selected from a drop-down menu. Query results can be displayed in three views: 1) brief statistics in the result view on the left panel 602, listing gene names and their access counts; 2) tissue and cell line information in the data table view 610, where experimental results are collected and presented in a table so that scientists can obtain standardized data values; and 3) a line chart of the values in the chart view 612.
The latter two views are presented in Fig. 6 under separate tabs. Scientists can compare the different result views to identify important or abnormal points.
In a subsequent step, two tools are offered in the tool view 604. Clicking either button triggers an online service call, and the result of the service is presented in a new tab of the main panel. A call to the KEGG service 606 has two steps. The first step is a pathway query; each pathway has its own hyperlink (for example, Arachidonic Acid Metabolism). The second step presents the actual pathway chart, similar to that shown in Fig. 6. The call is supported by an ID translation process between the different ID systems, which is transparent to the end user. The translation process relies on an ID mapping system in the background. This system can be implemented in various ways, for example as a relational database, an XML database or a multidimensional database. Each ID is mapped, as a keyword, to a unified internal representation, and the system defines the mapping of each ID in each system. A query ID is therefore first translated into the unified internal representation and then translated into the corresponding IDs of the other systems.
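A relational implementation of such an ID mapping system might look like the sketch below, which uses Python's built-in `sqlite3` module. The table layout, the system names `system_a` and `system_b`, and all identifiers are invented for illustration. Translation joins the table with itself on the unified internal ID, which mirrors the two-step translation described above.

```python
import sqlite3

def build_id_map():
    """Create an in-memory mapping table with hypothetical IDs."""
    conn = sqlite3.connect(":memory:")
    conn.execute(
        "CREATE TABLE id_map (system TEXT, external_id TEXT, internal_id TEXT)"
    )
    conn.executemany(
        "INSERT INTO id_map VALUES (?, ?, ?)",
        [
            ("system_a", "A-0001", "gene:1"),
            ("system_b", "B-77", "gene:1"),
            ("system_a", "A-0002", "gene:2"),  # no counterpart in system_b
        ],
    )
    return conn

def translate(conn, external_id, source, target):
    """Translate an ID via the unified internal ID; None if unmapped."""
    row = conn.execute(
        "SELECT t.external_id FROM id_map AS s "
        "JOIN id_map AS t ON s.internal_id = t.internal_id "
        "WHERE s.system = ? AND s.external_id = ? AND t.system = ?",
        (source, external_id, target),
    ).fetchone()
    return row[0] if row else None
```

With this layout, adding a new ID system requires only new rows that point at existing internal IDs, not pairwise mappings between every two systems.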
Each element of the pathway chart can be clicked to generate a new query and open the corresponding web page from the KEGG website, which contains detailed information on enzymes, genes or organisms.
The DAVID Gene Ontology service 608 can also be selected as an analysis operation. For the demonstration, the DAVID service is simplified by restricting the service parameters. In the original DAVID service, users can configure a personalized request by setting preferences for annotation type, gene ID system, keywords and service tool. Here, the system automatically generates the correct ID system and keyword parameters from the previous query. In addition, the function service invoker sets the "tool" to "gene report" and the "annotation type" to "GOTERM BPALL" (which includes all biological process annotations). The result is presented as a data table, likewise displayed in a tab.
There is little difference between hardware and software implementations of many aspects of the system; the use of hardware or software is generally (but not always, since in certain contexts the choice between hardware and software can become significant) a design choice representing a trade-off between cost and efficiency. There are various vehicles by which the processes and/or systems and/or other technologies described herein can be effected, and the preferred vehicle will vary with the context in which they are deployed. For example, if an implementer determines that speed and accuracy are paramount, the implementer may opt for a mainly hardware and/or firmware vehicle; if flexibility is paramount, the implementer may opt for some combination of hardware, software and/or firmware.
The foregoing detailed description has set forth various embodiments of the devices and/or processes by means of block diagrams, flowcharts and/or examples. Insofar as such block diagrams, flowcharts and/or examples contain one or more functions and/or operations, those skilled in the art will understand that each function and/or operation within them can be implemented, individually and/or collectively, by a wide range of hardware, software, firmware, or any combination thereof. In one embodiment, several portions of the subject matter described herein may be implemented via application-specific integrated circuits (ASICs), field-programmable gate arrays (FPGAs), digital signal processors (DSPs) or other integrated formats. However, those of ordinary skill in the art will recognize that some aspects of the embodiments disclosed herein, in whole or in part, can be equivalently implemented in integrated circuits, as one or more computer programs running on one or more computers (e.g., as one or more programs running on one or more computer systems), as one or more programs running on one or more processors (e.g., as one or more programs running on one or more microprocessors), as firmware, or as any combination thereof, and that designing the circuitry and/or writing the code for the software and/or firmware would be well within the ability of one of ordinary skill in the art in light of this disclosure. In addition, those skilled in the art will appreciate that the mechanisms of the subject matter described herein can be distributed as a program product in a variety of forms, and that an illustrative embodiment of the subject matter described herein applies regardless of the particular type of information-bearing medium used to actually carry out the distribution. Examples of an information-bearing medium include, but are not limited to, the following: recordable-type media such as floppy disks, hard disk drives, compact discs (CDs), digital video discs (DVDs), digital tape, computer memory, etc.; and transmission-type media such as digital and/or analog communication media (e.g., fiber-optic cables, waveguides, wired communication links, wireless communication links, etc.).
Persons of ordinary skill in the art will recognize that in this area and come description equipment and/or process with mode given herein, and with engineering practice data handling system is arrived in the equipment and/or the process integration of this description subsequently, is common.That is to say that at least a portion equipment described herein and/or process can be integrated in the data handling system by the experiment of appropriate amount.One of skill in the art will recognize that typical data handling system generally comprises one or more system unit shells, video display apparatus, storer, the processor such as microprocessor and digital signal processor and the computational entity such as operating system of the nonvolatile memory and so on of easily becoming estranged, driver, graphical user interface, and application program, one or more interactive device, as touch pads or screen, and/or control system, comprise that feedback cycle and control engine (as are used for the feedback of sense position and speed; The control engine that is used for mobile and/or adjustment component and/or quantity).Typical data handling system can be utilized any suitable commercial parts, as those commercial parts that can find in data computation/communication and/or network calculations/communication system usually.
The subject matter described herein sometimes illustrates different components contained within, or connected with, other different components. It is to be understood that such depicted architectures are merely exemplary, and that in fact many other architectures can be implemented that achieve the same functionality. In a conceptual sense, any arrangement of components that achieves the same functionality is effectively "associated" such that the desired functionality is achieved. Hence, any two components herein combined to achieve a particular functionality can be seen as "associated with" each other such that the desired functionality is achieved, irrespective of architectures or intermediate components. Likewise, any two components so associated can also be viewed as being "operably connected" or "operably coupled" to each other to achieve the desired functionality. Specific examples of operably couplable components include, but are not limited to, physically mateable and/or physically interacting components, and/or wirelessly interactable and/or wirelessly interacting components, and/or logically interacting and/or logically interactable components.
The present disclosure is not limited to the particular embodiments described in this application, which are intended as illustrations of various aspects. As will be apparent to those of ordinary skill in the art, many modifications and variations can be made without departing from its spirit and scope. Functionally equivalent methods and apparatuses within the scope of the disclosure, in addition to those enumerated herein, will be apparent to those of ordinary skill in the art from the foregoing descriptions. Such modifications and variations are intended to fall within the scope of the appended claims. The present disclosure is to be limited only by the terms of the appended claims, along with the full scope of equivalents to which such claims are entitled. It is also to be understood that the terminology used herein is for the purpose of describing particular embodiments only, and is not intended to be limiting.
With respect to the use of substantially any plural and/or singular terms herein, those of ordinary skill in the art can translate from the plural to the singular and/or from the singular to the plural as is appropriate to the context and/or application. The various singular/plural permutations may be expressly set forth herein for the sake of clarity.
Those of ordinary skill in the art will understand that, in general, terms used herein, and especially in the appended claims, are generally intended as "open" terms (e.g., the term "comprising" should be interpreted as "including but not limited to," and the term "having" should be interpreted as "having at least"). Those skilled in the art will further understand that if a specific number of an introduced claim recitation is intended, such an intent will be explicitly recited in the claim, and in the absence of such recitation no such intent is present. For example, as an aid to understanding, the following appended claims may contain usage of the introductory phrases "at least one" and "one or more" to introduce claim recitations. However, the use of such phrases should not be construed to imply that the introduction of a claim recitation by the indefinite articles "a" or "an" limits any particular claim containing such introduced claim recitation to disclosures containing only one such recitation, even when the same claim includes the introductory phrases "one or more" or "at least one" and indefinite articles such as "a" or "an" (e.g., "a" and/or "an" should typically be interpreted to mean "at least one" or "one or more"); the same holds true for the use of definite articles to introduce claim recitations. In addition, even if a specific number of an introduced claim recitation is explicitly recited, those skilled in the art will recognize that such recitation should typically be interpreted to mean at least the recited number (e.g., the bare recitation of "two recitations," without other modifiers, typically means at least two recitations, or two or more recitations). Furthermore, where a convention analogous to "at least one of A, B, and C, etc." is used, such a construction is intended in the sense in which one of ordinary skill in the art would understand the convention (e.g., "a system having at least one of A, B, and C" would include, but not be limited to, systems that have A alone, B alone, C alone, A and B together, A and C together, B and C together, and/or A, B, and C together, etc.). Where a convention analogous to "at least one of A, B, or C, etc." is used, such a construction is likewise intended in the sense in which one of ordinary skill in the art would understand the convention (e.g., "a system having at least one of A, B, or C" would include, but not be limited to, systems that have A alone, B alone, C alone, A and B together, A and C together, B and C together, and/or A, B, and C together, etc.). Those of ordinary skill in the art will further understand that virtually any disjunctive word and/or phrase presenting two or more alternative terms, whether in the description, claims, or drawings, should be understood to contemplate the possibilities of including one of the terms, either of the terms, or both terms. For example, the phrase "A or B" will be understood to include the possibilities of "A" or "B" or "A and B."
In addition, where features or aspects of the disclosure are described in terms of Markush groups, those skilled in the art will recognize that the disclosure is thereby also described in terms of any individual member or subgroup of members of the Markush group.
As will be understood by those of ordinary skill in the art, for any and all purposes, such as in terms of providing a written description, all ranges disclosed herein also encompass any and all possible subranges and combinations of subranges thereof. Any listed range can be easily recognized as sufficiently describing, and enabling, the same range being broken down into at least equal halves, thirds, quarters, fifths, tenths, etc. As a non-limiting example, each range discussed herein can be readily broken down into a lower third, a middle third, and an upper third, etc. As will also be understood by one of ordinary skill in the art, all language such as "up to," "at least," "greater than," "less than," and the like includes the number recited and refers to ranges that can subsequently be broken down into subranges as discussed above. Finally, as will be understood by those of ordinary skill in the art, a range includes each individual member. Thus, for example, a group having 1-3 cells refers to groups having 1, 2, or 3 cells. Similarly, a group having 1-5 cells refers to groups having 1, 2, 3, 4, or 5 cells, and so forth.
While various aspects and embodiments have been disclosed herein, other aspects and embodiments will be apparent to those of ordinary skill in the art. The various aspects and embodiments disclosed herein are for purposes of illustration and are not intended to be limiting, with the true scope and spirit being indicated by the following claims.

Claims (22)

1. A data integration platform, comprising:
a query processing module configured to receive a query and to translate the query into forms suitable for querying various sources; and
a response collecting module configured to collect results from the various sources according to the translated query and to provide the results.
2. The data integration platform according to claim 1, wherein the response collecting module is further configured to have a web crawler module for crawling results from databases, the Internet, or a local area network.
3. The data integration platform according to claim 1, wherein the response collecting module is further configured to collect results through a programmable interface.
4. The data integration platform according to claim 1, further comprising a query domain knowledge database.
5. The data integration platform according to claim 1, further comprising a proxy module configured to provide personalized services.
6. The data integration platform according to claim 5, wherein the proxy module identifies interests and accumulates knowledge for use in data integration.
7. The data integration platform according to claim 1, further comprising a process management module configured to receive instructions and to direct the operation of each module based on the instructions.
8. The data integration platform according to any one of claims 1-7, wherein the response collecting module comprises an adapter configured to further process the translated query so as to generate specific query statements suited to the description formats and semantics of the different data sources, and to convert the results into a unified form.
9. The data integration platform according to claim 8, wherein the response collecting module comprises a result integrator configured to process the results in the unified form.
10. The data integration platform according to claim 9, wherein the result integrator is further configured to analyze the consistency of the results, identify conflicting and contradictory results, filter out duplicate records, and produce a consolidated result set.
11. The data integration platform according to claim 8, wherein the adapter further comprises a service mixing module for providing process-context-aware data integration.
12. Use of the data integration platform according to any one of claims 1-11 in the field of biology.
13. A data integration method, comprising:
receiving a query;
translating the query into forms suitable for querying various sources; and
collecting results from the various sources according to the translated query, and providing the results.
14. The data integration method according to claim 13, wherein the results are collected by crawling databases, the Internet, or a local area network.
15. The data integration method according to claim 13, wherein the results are collected through a programmable interface.
16. The data integration method according to claim 13, further comprising providing personalized services.
17. The data integration method according to claim 16, wherein the personalized services are provided by a proxy module, and the proxy module identifies interests and accumulates knowledge for use in data integration.
18. The data integration method according to claim 13, further comprising receiving instructions and managing the operation of each module based on the instructions.
19. The data integration method according to any one of claims 13-18, further comprising further processing the translated query so as to generate specific query statements suited to the description formats and semantics of the different data sources, and converting the results into a unified form.
20. The data integration method according to claim 19, further comprising processing the results in the unified form.
21. The data integration method according to claim 20, further comprising analyzing the consistency of the results, identifying conflicting and contradictory results, filtering out duplicate records, and producing a consolidated result set.
22. The data integration method according to claim 19, further comprising providing process-context-aware data integration by means of a service mixing module.
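
As an informal illustration only, the flow recited in claims 1, 8-10, and 13 (translate a query, let per-source adapters issue source-specific statements and convert the results to a unified form, then filter duplicates and flag conflicts) might be sketched as follows. This is a minimal sketch under stated assumptions, not the claimed implementation: every name in it (Record, Adapter, translate_query, integrate, run_query) is hypothetical, the two "sources" are in-memory dictionaries holding toy data, and the claims do not prescribe any particular API or data model.

```python
from dataclasses import dataclass
from typing import Callable, Dict, Iterable, List, Set, Tuple

# All names here are hypothetical; the claims do not define an API.


@dataclass(frozen=True)
class Record:
    """One result in the assumed 'unified form' (claims 8 and 19)."""
    key: str        # identifier shared across sources
    attribute: str  # attribute name
    value: str      # attribute value
    source: str     # name of the source that returned it


def translate_query(raw: str) -> Dict[str, str]:
    """Query processing step: normalize user input into a neutral form."""
    return {"key": raw.strip().upper()}


@dataclass
class Adapter:
    """Builds a source-specific statement and unifies that source's rows."""
    name: str
    make_statement: Callable[[Dict[str, str]], str]
    run: Callable[[str], Iterable[Tuple[str, str, str]]]

    def collect(self, query: Dict[str, str]) -> List[Record]:
        statement = self.make_statement(query)
        return [Record(k, a, v, self.name) for k, a, v in self.run(statement)]


def integrate(records: Iterable[Record]):
    """Result integration: drop duplicate records and report conflicts."""
    unique: Dict[Tuple[str, str, str], Record] = {}
    values: Dict[Tuple[str, str], Set[str]] = {}
    for rec in records:
        # The first source to report a (key, attribute, value) triple wins.
        unique.setdefault((rec.key, rec.attribute, rec.value), rec)
        values.setdefault((rec.key, rec.attribute), set()).add(rec.value)
    # A conflict is one attribute of one key with more than one value.
    conflicts = {k: v for k, v in values.items() if len(v) > 1}
    return list(unique.values()), conflicts


def run_query(raw: str, adapters: List[Adapter]):
    query = translate_query(raw)
    collected = [rec for a in adapters for rec in a.collect(query)]
    return integrate(collected)


# Toy in-memory sources standing in for a database and a web service.
SQL_ROWS = {
    "SELECT * FROM genes WHERE symbol = 'TP53'": [
        ("TP53", "chromosome", "17"),
        ("TP53", "organism", "human"),
    ]
}
WEB_ROWS = {
    "/search?q=TP53": [
        ("TP53", "chromosome", "17"),
        ("TP53", "organism", "mouse"),  # deliberately conflicting toy value
    ]
}

sql_adapter = Adapter(
    "sql_db",
    lambda q: f"SELECT * FROM genes WHERE symbol = '{q['key']}'",
    lambda s: SQL_ROWS.get(s, []),
)
web_adapter = Adapter(
    "web_api",
    lambda q: f"/search?q={q['key']}",
    lambda s: WEB_ROWS.get(s, []),
)

merged, conflicts = run_query(" tp53 ", [sql_adapter, web_adapter])
```

In this toy run the duplicate "chromosome" record is kept once, and the two differing "organism" values are reported as a conflict rather than silently merged. A real platform would replace the dictionaries with actual database, crawler, or programmable-interface calls.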
CN2009101619747A 2009-09-10 2009-09-10 Data integration platform Pending CN102023982A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN2009101619747A CN102023982A (en) 2009-09-10 2009-09-10 Data integration platform

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN2009101619747A CN102023982A (en) 2009-09-10 2009-09-10 Data integration platform

Publications (1)

Publication Number Publication Date
CN102023982A true CN102023982A (en) 2011-04-20

Family

ID=43865290

Family Applications (1)

Application Number Title Priority Date Filing Date
CN2009101619747A Pending CN102023982A (en) 2009-09-10 2009-09-10 Data integration platform

Country Status (1)

Country Link
CN (1) CN102023982A (en)

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103984713A (en) * 2014-05-07 2014-08-13 丽水桉阳生物科技有限公司 Financial data query method based on cloud computing
CN103984713B (en) * 2014-05-07 2017-05-31 珠海横琴跨境说网络科技有限公司 A kind of financial data querying method based on cloud computing
CN108959291A (en) * 2017-05-19 2018-12-07 腾讯科技(深圳)有限公司 Querying method and relevant apparatus
CN108959291B (en) * 2017-05-19 2023-03-24 腾讯科技(深圳)有限公司 Query method and related device

Similar Documents

Publication Publication Date Title
JP2022526242A (en) Methods, devices, and systems for annotating text documents
Jaiswal et al. Plant Ontology (PO): a controlled vocabulary of plant structures and growth stages
CN101739390B (en) Data transformation based on a technical design document
US20050251513A1 (en) Techniques for correlated searching through disparate data and content repositories
Frischmuth et al. Ontowiki–an authoring, publication and visualization interface for the data web
Asghar Shiri et al. Thesauri on the Web: current developments and trends
US8667011B2 (en) Web service discovery via data abstraction model and condition creation
Liang et al. Mapping AGROVOC and the Chinese agricultural thesaurus: definitions, tools, procedures
US8566364B2 (en) Web service discovery via data abstraction model augmented by field relationship identification
CN105723366A (en) Method for preparing a system for searching databases and system and method for executing queries to a connected data source
CN110379472A (en) A kind of clinical research project management system
US20120233195A1 (en) Web Service Discovery Via Data Abstraction Model
Daltio et al. Aondê: An ontology web service for interoperability across biodiversity applications
Constantinescu et al. Towards knowledge capturing and innovative human-system interface in an open-source factory modelling and simulation environment
Steindel A comparison between a SNOMED CT problem list and the ICD-10-CM/PCS HIPAA code sets
Fritter et al. A survey of Life Cycle Inventory database implementations and architectures, and recommendations for new database initiatives
Lin et al. An exploratory study using an openEHR 2-level modeling approach to represent common data elements
US8949280B2 (en) Web service discovery via data abstraction model with input assistance
Richesson et al. Heterogeneous but “standard” coding systems for adverse events: Issues in achieving interoperability between apples and oranges
CN102023982A (en) Data integration platform
Zamite et al. MEDCollector: Multisource epidemic data collector
CN101826108A (en) Data integration platform
Schindler et al. How to teach digital library data to swim into research
Golafshar et al. Utilizing open-source platforms to build and deploy interactive patient-reported quality of life tracking tools for monitoring protocol adherence
Malaverri et al. A Tool based on Web Services to Query Biodiversity Information.

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C12 Rejection of a patent application after its publication
RJ01 Rejection of invention patent application after publication

Application publication date: 20110420