CN102567418A - Methods and devices for integrating and searching data - Google Patents

Methods and devices for integrating and searching data Download PDF

Info

Publication number
CN102567418A
CN102567418A CN2010106205153A CN201010620515A CN102567418A CN 102567418 A CN102567418 A CN 102567418A CN 2010106205153 A CN2010106205153 A CN 2010106205153A CN 201010620515 A CN201010620515 A CN 201010620515A CN 102567418 A CN102567418 A CN 102567418A
Authority
CN
China
Prior art keywords
document
metadata
type
attribute information
search
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN2010106205153A
Other languages
Chinese (zh)
Inventor
万巍
瞿超
雷超
徐剑波
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Peking University Founder Group Co Ltd
Beijing Founder Apabi Technology Co Ltd
Original Assignee
Peking University Founder Group Co Ltd
Beijing Founder Apabi Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Peking University Founder Group Co Ltd, Beijing Founder Apabi Technology Co Ltd filed Critical Peking University Founder Group Co Ltd
Priority to CN2010106205153A priority Critical patent/CN102567418A/en
Publication of CN102567418A publication Critical patent/CN102567418A/en
Pending legal-status Critical Current

Links

Images

Landscapes

  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

The invention discloses methods for integrating and searching data, which are used for realizing the improvement of a data entry way, realizing the integration of a plurality of types of documents and improving the data search efficiency. The method for integrating the data comprises the following steps of: determining the type of a document to be integrated; judging whether the type is an existing type or not; if not, creating a metadata standard document according to the attribute structure of the type; entering attribute information of the document to be integrated into the metadata standard document corresponding to the type; entering the document to be integrated and establishing an index of the attribute information to the document to be integrated. The invention further discloses devices for realizing the two methods.

Description

The method of a kind of data integration, search and device
Technical field
The present invention relates to the computing machine and the communications field, particularly relate to the method and the device of data integration, search.
Background technology
People's life has been come in the development of Along with computer technology, e-book, and has been popularized.People begin to consult e-book through electronic databank.Network technology has further promoted the development of electronic databank and e-book.Information paper miscellaneous is all stored and is consulted through the electronics mode.
Following minute a plurality of sub-searching systems in the present searching system, different sub-searching systems is used to retrieve dissimilar files.It is thus clear that searching system needs compatible a plurality of different sub-searching systems, complexity is higher, is not easy to the maintenance and management in later stage.And stride the retrieval of sub-searching system, its retrieval effectiveness is not good.
For solving the problem that a plurality of sub-searching systems are brought, prior art has proposed the unitized solution of metadata standard.This scheme is to set up a public metadata standard file for dissimilar files, and this metadata standard file comprises dissimilar file all properties.Can retrieve through the metadata standard file like this, no longer need stride a plurality of sub-searching systems, and improve recall precision.But this scheme is not easy to expansion, and dirigibility is not enough, integrates the file of newtype if desired, then need revise the metadata standard file.And, when the metadata standard file is retrieved, need the irrelevant attribute information of file destination that traversal is a lot of and retrieve, influence retrieval effectiveness.
To sum up, the retrieval effectiveness of isomery digital content (file that comprises a plurality of types) is not ideal enough in the prior art.
Summary of the invention
The embodiment of the invention provides the method and the device of a kind of data integration, search, is used to realize the improvement of data-entry-form, realizes polytype integrating documents, and improves data search efficient.
A kind of data integration method is used to improve data typing process, realizes that with polytype integrating documents, so that improve data search efficient, it may further comprise the steps:
Confirm to treat the type of integrating document;
Judge that whether said type is existing type, for not the time, create the metadata standard file according to the attribute structure of the type in judged result;
The said attribute information of integrating document of treating is entered into said type metadata corresponding normative document;
The said integrating document of treating of typing, and set up said attribute information to the said index of treating integrating document.
A kind of data search method is used to improve data search efficient, and it may further comprise the steps:
The attribute information of the keyword coupling of in the type metadata corresponding normative document of appointment, searching and being used to search for;
Through mating successful attribute information and attribute information index, extraction document to file.
A kind of device that is used for data integration comprises:
Type block is used to confirm to treat the type of integrating document;
Create module, be used to judge whether said type is existing type, for not the time, create the metadata standard file according to the attribute structure of the type in judged result;
The typing module is used for the said attribute information of integrating document of treating is entered into said type metadata corresponding normative document, and the said integrating document of treating of typing, and sets up said attribute information to the said index of treating integrating document.
A kind of device that is used for data search comprises:
Matching module is used for searching the attribute information that matees with the keyword that is used to search at the metadata standard file;
Extraction module is used for through mating successful attribute information and the attribute information index to file, extraction document.
The embodiment of the invention is created a metadata standard file to each type, and the attribute information of file of the same type is entered into the metadata corresponding normative document, so that search for through the metadata standard file.So both need not interdepartmental system search, also need not travel through whole metadata standard file (being equivalent to metadata standard files all in the present embodiment), improved data search efficient.
Description of drawings
Fig. 1 is the main method process flow diagram of data integration in the embodiment of the invention;
The method flow diagram of data integration when Fig. 2 creates the metadata standard file for embodiment of the invention indirect;
Fig. 3 in the embodiment of the invention when client-side is confirmed attribute information the method flow diagram of data integration;
Fig. 4 is the main method process flow diagram of data search in the embodiment of the invention;
Fig. 5 is the detailed method process flow diagram of data search in the embodiment of the invention;
Fig. 6 is the primary structure figure of integrating apparatus in the embodiment of the invention;
Fig. 7 is the detailed structure view of integrating apparatus in the embodiment of the invention;
Fig. 8 is the primary structure figure of searcher in the embodiment of the invention;
Fig. 9 is the detailed structure view of searcher in the embodiment of the invention.
Embodiment
The embodiment of the invention is created a metadata standard file to each type, and the attribute information of file of the same type is entered into the metadata corresponding normative document, so that search for through the metadata standard file.So both need not interdepartmental system search, also need not travel through whole metadata standard file (being equivalent to metadata standard files all in the present embodiment), improved data search efficient.
The type difference of a plurality of files is meant that the attribute structure of a plurality of files is different in the present embodiment, and the attribute structure difference comprises that the form of file is different.For example, the form of e-book is PDF, and the form of electronic pictures is JPG, and both forms are different, belong to different types.For another example, the form of e-book and electronic journal all is PDF, but the attribute of electronic journal comprises periodical number, and e-book does not comprise this attribute, and then both also belong to different types.
Referring to Fig. 1, the main method flow process of data integration is following in the present embodiment:
Step 101: the type of confirming to treat integrating document.
Step 102: judge that whether said type is existing type, for not the time, create the metadata standard file according to the attribute structure of the type in judged result.If judge it is existing type, then can directly carry out step 103 and 104.
Step 103: the said attribute information of integrating document of treating is entered into said type metadata corresponding normative document.
Step 104: the said integrating document of treating of typing, and set up said attribute information to the said index of treating integrating document.This step can be carried out with step 103 synchronously.
The metadata standard file is the file in the search system, can directly in search system, create the metadata standard file.Standard and unification for the metadata standard file; And the minimizing maloperation is to the influence of search system; Can directly in search system, not create the metadata standard file; But in the system of client, create description document earlier, and create the metadata standard file by search system according to description document again, concrete implementation procedure is referring to following embodiment.
Referring to Fig. 2, the method flow of data integration was following when the present embodiment indirect was created the metadata standard file:
Step 201: the type of confirming to treat integrating document.This step can be confirmed by manual work, for treating integrating document type is set, if existing type, and then for treating that integrating document is provided with existing type identification, if newtype, then for to treat that integrating document is provided with new type identification.Also can realize automatically, treat integrating document like system and resolve definite its type in back by system.Analysis mode has multiple, as confirming file type through file layout.As confirming file type through file layout and file size, the file of same type, if the size of file then is one type greater than preset threshold value, being not more than preset threshold value then is another kind of type.Other analysis mode can also be arranged, do not enumerate one by one here, everyly can confirm to treat that the analysis mode of the type of integrating document all is applicable to present embodiment.
Step 202: judge that whether said type is existing type, if not existing type then continues step 203, otherwise continues step 206.
Step 203: the description document of creating metadata standard according to the attribute structure of the type.The attribute structure of type can be by the manual work setting.Attribute structure is made up of field, like " author " and " number of the edition " etc.Attribute information is the corresponding content of field, " opens XX " like authors' name, phase number of the edition " 07 phase " etc.Also can confirm the attribute structure of type by system automatically, treat integrating document, obtain title, author, summary, phase number of the edition etc., perhaps according to treating that the head of integrating document confirms attribute structure as attribute structure like parsing.Can also there be alternate manner to confirm attribute information, do not enumerate one by one here.Description document comprises the description to attribute structure, like attribute-bit (metadata standard unique identification), Property Name (can be not unique), data type (data type of attribute information), length (allowing to fill in the maximum length of attribute information), information such as indispensability and default value whether.
Above step can be accomplished in client, by system's realization of client.
Step 204: create the metadata standard file through description document.
Step 205: in the publicly-owned normative document of metadata, set up the corresponding relation of attribute structure in attribute structure and the publicly-owned normative document of metadata in the said metadata standard file.The publicly-owned normative document of metadata is the search for the ease of public information, to improve the search efficiency of public information.If do not consider this point, can omit the publicly-owned normative document of metadata and skip this step.
Step 206: the said attribute information of integrating document of treating is entered into said type metadata corresponding normative document.
If treating integrating document is existing type, then need not create the metadata standard file, as required, can be in the publicly-owned normative document of metadata typing corresponding property information.
Step 207: the said integrating document of treating of typing, and set up said attribute information to the said index of treating integrating document.The typing file is about to file and deposits database in, and this database can be positioned on the file server.
Step 204-207 can be realized by central search system.The system of central authorities' search system and client has constituted distributed system, is convenient to flexible operating and centralized management, makes integration and search procedure unify standard, reduces the influence of maloperation to total system.
Before step 206; Attribute information that can the said type of first verification with treat whether integrating document meets the standard of metadata standard file and database; If meet, then continue step 206 and 207, otherwise stop this attribute information and the typing of treating integrating document; Can further improve the unified standard of operation like this, reduce the influence of maloperation total system.Can check the file that whether is still waiting to integrate,, then repeat flow process shown in Figure 2 if having.
Referring to Fig. 3, in the present embodiment when client-side is confirmed attribute information the method flow of data integration following:
Step 301: the type of confirming to treat integrating document.
Step 302: judge that whether said type is existing type, if not existing type then continues step 303, otherwise continues step 306.
Step 303: the description document of creating metadata standard according to the attribute structure of the type.
Step 304: the said attribute information of integrating document of treating is entered into description document.
Above step can be accomplished in client, by system's realization of client.
Step 305: create the metadata standard file that comprises attribute information through description document.Be equivalent to when creating the metadata standard file, the attribute information in the description document is entered into said type metadata corresponding normative document.
Step 306: in the publicly-owned normative document of metadata, set up the corresponding relation of attribute structure in attribute structure and the publicly-owned normative document of metadata in the said metadata standard file.
If treating integrating document is existing type, then as required, can in the publicly-owned normative document of metadata, increase corresponding property information.
Step 307: the said integrating document of treating of typing, and set up said attribute information to the said index of treating integrating document.The typing file is about to file and deposits database in, and this database can be positioned on the file server.
Step 305-307 can be realized by central search system.The system of central authorities' search system and client has constituted distributed system, is convenient to flexible operating and centralized management, makes integration and search procedure unify standard, reduces the influence of maloperation to total system.
Before step 304 and 307; Attribute information that can the said type of first verification with treat whether integrating document meets the standard of metadata standard file and database; If meet, then continue step 304 and 307, otherwise stop this attribute information and the typing of treating integrating document; Can further improve the unified standard of operation like this, reduce the influence of maloperation total system.Can check the file that whether is still waiting to integrate,, then repeat flow process shown in Figure 3 if having.
The inner structure of metadata standard file has multiple implementation, and for the ease of search, present embodiment provides a kind of preferable implementation, and a plurality of tables that the metadata standard file comprises are as follows:
Table 1, metadata standard table
Figure BSA00000407461300071
Metadata standard ID is used for label table 1, can be generated automatically by system, and ID is unique for this metadata standard, usually by numeral.The metadata standard unique identification can be by the manual work setting, and identifying unique is kind of a preferable mode, usually by textual representation, is the metadata standard file of what type so that get information about.The corresponding data table name claims to comprise the sign of table 3.
Table 2, metadata attributes record sheet
Figure BSA00000407461300072
Field ID is used for uniquely tagged metadata attributes record sheet, can be generated automatically by system.The field title can usually by textual representation, be any field so that get information about by the manual work setting.
Table 3, metadata standard corresponding data table
Figure BSA00000407461300073
Figure BSA00000407461300081
Preserved concrete attribute information in the table 3, record of a digital content ID unique identification.Digital content ID is generated by system automatically, uses numeral usually.The digital content unique identification can be by the manual work setting, be used for representing intuitively be about and so on record.For example, three books of books type are arranged, the digital content ID of these three books is respectively 0,1 and 2, and field one is a title, and field two is the author, and field three is the publication date.Content in the table 3 is so: 0 (digital content ID), and economic type books AA (digital content unique identification), financial crisis (title) is opened XX (author), May 8 (publication date); 1, historical type of books AA, the BB of the Qing Dynasty, king XX, April 27; 2, historical type of books BB, the BB of the Qing Dynasty, king XX, August 27.
The metadata standard file can also comprise mapping table, and is as shown in table 4.
Table 4, publicly-owned attribute mapping table
Figure BSA00000407461300082
Write down the corresponding relation of field in the field mentioned in table 2 and the table 3 and the publicly-owned normative document of metadata in the table 4, can will treat that the publicly-owned attribute information of integrating document is entered into the publicly-owned normative document of metadata through this corresponding relation.
The table that the publicly-owned normative document of metadata comprises is as follows:
Table 5, data content public information table
Figure BSA00000407461300091
For example, the file of a plurality of types all has " author " this attribute field, then the author information of the file of a plurality of types of meeting typing in " author " field of data content public information table.The combination of the full content information of metadata has comprised the information of the field that all can be used for retrieving.In the table 5 all there be to the index of typing file each attribute information.
Present embodiment has been created the metadata standard file respectively for each type in the data integration process, the data search process need is made corresponding improvement so.
Referring to Fig. 4, the main method flow process of data search in the present embodiment is following:
Step 401: in the type metadata corresponding normative document of appointment, search attribute information with the keyword coupling that is used to search for.Wherein, type can be specified by search subscriber.It is a type that the type of appointment is not limited to, and can specify a plurality of or all types.If the user is specified type not, then acquiescence has been specified all types.
Step 402: through mating successful attribute information and attribute information index, extraction document to file.
Type according to appointment is searched in the corresponding metadata normative document, and it has realized personalized search, is applicable to search targetedly.If the type of appointment is more, especially specified all types, its essence is common search, preferable mode is preferentially in the publicly-owned normative document of metadata, to search for, and describes in detail to this situation below.
Referring to Fig. 5, the detailed method flow process of data search in the present embodiment is following:
Step 501: obtain the keyword of user's input and the type of appointment.
Step 502: whether the type of judging appointment is all types, if then continue step 503, otherwise continue step 504.
Step 503: the attribute information of the keyword coupling of search and acquisition in the publicly-owned normative document of metadata.When searching, continue step 505.When not searching, continue step 504.
Step 504: the attribute information of the keyword coupling of search and acquisition in the metadata standard file.When searching, continue step 505, otherwise process ends is perhaps returned the search failure to the user.
Step 505: through mating successful attribute information and attribute information index, extraction document to file.Can also export the file of extraction through output devices such as displays to the user.
For example, the author " opens XX ", and existing books have the photo (a kind of picture) of shooting again, and the file " aaa " of " opening XX " is arranged in the books type, and the file " bbb " of " opening XX " is arranged in picture/mb-type." open XX " if the user imports keyword, and specified all types, then coupling " is opened XX " in the author field of the publicly-owned normative document of metadata, when mating successfully, can extract file " aaa " and " bbb ".Need in the metadata standard file of books type and picture/mb-type, not mate " opening XX " respectively, improve search efficiency.
If the user has imported the keyword about a plurality of fields; And specified more type; The attribute information that then keyword of search and input matees in the publicly-owned normative document of metadata earlier, have the part key speech this moment matees successfully, can be according to corresponding relation as shown in table 4; In the metadata standard file, mated in the pairing scope of field of successful attribute information, continued the attribute information that search is complementary to the keyword that coupling is not successful.This mode had both realized personalized search, had improved search efficiency again.
Understood the implementation procedure of data integration and search through above description, this process can be realized by device, introduces in the face of the inner structure and the function of device down.
Referring to Fig. 6, the device that is used for data integration in the present embodiment comprises: type block 601, establishment module 602 and typing module 603.
Type block 601 is used to confirm to treat the type of integrating document.
Create module 602 and be used to judge whether said type is existing type, for not the time, create the metadata standard file according to the attribute structure of the type in judged result.Creating module 602 comprises: the first establishment unit and the second establishment unit.The first establishment unit is used for creating according to the attribute structure of the type the description document of metadata standard.The second establishment unit is used for creating the metadata standard file through description document.
Typing module 603 is used for the said attribute information of integrating document of treating is entered into said type metadata corresponding normative document, and the said integrating document of treating of typing, and sets up said attribute information to the said index of treating integrating document.
The first establishment unit also is used for the said attribute information of integrating document of treating is entered into description document; The metadata standard file is created through description document in the second establishment unit; And when creating the metadata standard file, typing module 603 is entered into said type metadata corresponding normative document with the attribute information in the description document.
Said integrating apparatus also comprises: publicly-owned module 604, referring to shown in Figure 7, be used in the publicly-owned normative document of metadata, and set up the corresponding relation of attribute structure in attribute structure and the publicly-owned normative document of metadata in the said metadata standard file.Publicly-owned module 604 also is used for increasing corresponding property information in the publicly-owned normative document of metadata.
Said integrating apparatus also comprises: memory module 605 is used to file of storing typing etc.
Referring to Fig. 8, the device that is used for data search in the present embodiment comprises: matching module 801 and extraction module 802.
Matching module 801 is used for searching the attribute information that matees with the keyword that is used to search at the metadata standard file.Matching module 801 also is used to carry out the judgement of step 502, and in the publicly-owned normative document of metadata, searches for the attribute information that matees with the keyword that obtains; And when in the publicly-owned normative document of metadata, not mating successfully, the attribute information of search and the keyword coupling that obtains in the metadata standard file.
Extraction module 802 is used for through mating successful attribute information and the attribute information index to file, extraction document.Extraction module 802 also is used for exporting to the user file of extraction, can output to the link of extraction document earlier, when the user clicks this link, exports the file of extraction again.
Searcher also comprises interface module 803, referring to shown in Figure 9, is used to obtain the keyword of user's input and the type of appointment.
Integrating apparatus and searcher can be same physical entities, and this physical entity both can be realized data integration, can realize data search again.It comprises all modules in integrating apparatus and the searcher.Integrating apparatus also can be positioned at different physical entities with each module in the searcher; For example; Create module 602, typing module 603, matching module 801 and extraction module 802 and be positioned at search server; Memory module 605 is positioned at file server, and type block 601 is positioned at client device with interface module 803.
The embodiment of the invention is created a metadata standard file to each type, and the attribute information of file of the same type is entered into the metadata corresponding normative document, so that search for through the metadata standard file.So both need not interdepartmental system search, also need not travel through whole metadata standard file (being equivalent to metadata standard files all in the present embodiment), improved data search efficient.And; The embodiment of the invention has been created the corresponding relation of publicly-owned attribute in data typing process; And in the publicly-owned normative document of metadata typing attribute information, in the data search process, can in the publicly-owned normative document of metadata, search for earlier, with further raising search efficiency.
Those skilled in the art should understand that embodiments of the invention can be provided as method, system or computer program.Therefore, the present invention can adopt the form of the embodiment of complete hardware embodiment, complete software implementation example or combination software and hardware aspect.And the present invention can be employed in the form that one or more computer-usable storage medium (including but not limited to magnetic disk memory and optical memory etc.) that wherein include computer usable program code go up the computer program of implementing.
The present invention is that reference is described according to the process flow diagram and/or the block scheme of method, equipment (system) and the computer program of the embodiment of the invention.Should understand can be by the flow process in each flow process in computer program instructions realization flow figure and/or the block scheme and/or square frame and process flow diagram and/or the block scheme and/or the combination of square frame.Can provide these computer program instructions to the processor of multi-purpose computer, special purpose computer, Embedded Processor or other programmable data processing device to produce a machine, make the instruction of carrying out through the processor of computing machine or other programmable data processing device produce to be used for the device of the function that is implemented in flow process of process flow diagram or a plurality of flow process and/or square frame of block scheme or a plurality of square frame appointments.
These computer program instructions also can be stored in ability vectoring computer or the computer-readable memory of other programmable data processing device with ad hoc fashion work; Make the instruction that is stored in this computer-readable memory produce the manufacture that comprises command device, this command device is implemented in the function of appointment in flow process of process flow diagram or a plurality of flow process and/or square frame of block scheme or a plurality of square frame.
These computer program instructions also can be loaded on computing machine or other programmable data processing device; Make on computing machine or other programmable devices and to carry out the sequence of operations step producing computer implemented processing, thereby the instruction of on computing machine or other programmable devices, carrying out is provided for being implemented in the step of the function of appointment in flow process of process flow diagram or a plurality of flow process and/or square frame of block scheme or a plurality of square frame.
Obviously, those skilled in the art can carry out various changes and modification to the present invention and not break away from the spirit and scope of the present invention.Like this, belong within the scope of claim of the present invention and equivalent technologies thereof if of the present invention these are revised with modification, then the present invention also is intended to comprise these changes and modification interior.

Claims (12)

1. a data integration method is characterized in that, may further comprise the steps:
Confirm to treat the type of integrating document;
Judge that whether said type is existing type, for not the time, create the metadata standard file according to the attribute structure of the type in judged result;
The said attribute information of integrating document of treating is entered into said type metadata corresponding normative document;
The said integrating document of treating of typing, and set up said attribute information to the said index of treating integrating document.
2. the method for claim 1; It is characterized in that; The step of creating the metadata standard file according to the attribute structure of the type comprises: create the description document of metadata standard according to the attribute structure of the type, create the metadata standard file through description document.
3. method as claimed in claim 2 is characterized in that, creates according to the attribute structure of the type after the description document of metadata standard, also comprises step: the said attribute information of integrating document of treating is entered into description document;
The step that the said attribute information of treating integrating document is entered into said type metadata corresponding normative document comprises: when creating the metadata standard file, the attribute information in the description document is entered into said type metadata corresponding normative document.
4. like claim 1,2 or 3 described methods, it is characterized in that, also comprise step: in the publicly-owned normative document of metadata, set up the corresponding relation of attribute structure in attribute structure and the publicly-owned normative document of metadata in the said metadata standard file.
5. a data search method is characterized in that, may further comprise the steps:
The attribute information of the keyword coupling of in the type metadata corresponding normative document of appointment, searching and being used to search for;
Through mating successful attribute information and attribute information index, extraction document to file.
6. method as claimed in claim 5 is characterized in that, in the metadata standard file, before the attribute information of search and the keyword coupling that obtains, also comprises step: the attribute information of search and the keyword coupling that obtains in the publicly-owned normative document of metadata;
Search comprises with the step of the attribute information of the keyword coupling that obtains in the metadata standard file: when in the publicly-owned normative document of metadata, not mating successfully, in the metadata standard file, search for the attribute information with the keyword coupling that obtains.
7. a device that is used for data integration is characterized in that, comprising:
Type block is used to confirm to treat the type of integrating document;
Create module, be used to judge whether said type is existing type, for not the time, create the metadata standard file according to the attribute structure of the type in judged result;
The typing module is used for the said attribute information of integrating document of treating is entered into said type metadata corresponding normative document, and the said integrating document of treating of typing, and sets up said attribute information to the said index of treating integrating document.
8. device as claimed in claim 7 is characterized in that, creates module and comprises:
The first establishment unit is used for the description document according to the attribute structure establishment metadata standard of the type;
The second establishment unit is used for creating the metadata standard file through description document.
9. device as claimed in claim 8 is characterized in that, the first establishment unit also is used for the said attribute information of integrating document of treating is entered into description document; When creating the metadata standard file, the typing module is entered into said type metadata corresponding normative document with the attribute information in the description document.
10. like claim 7,8 or 9 described devices; It is characterized in that; Also comprise: publicly-owned module, be used in the publicly-owned normative document of metadata, set up the corresponding relation of attribute structure in attribute structure and the publicly-owned normative document of metadata in the said metadata standard file.
11. a device that is used for data search is characterized in that, comprising:
Matching module is used for searching the attribute information that matees with the keyword that is used to search at the metadata standard file;
Extraction module is used for through mating successful attribute information and the attribute information index to file, extraction document.
12. device as claimed in claim 11 is characterized in that, matching module also is used at the attribute information of the publicly-owned normative document search of metadata with the keyword coupling that obtains; And when in the publicly-owned normative document of metadata, not mating successfully, the attribute information of search and the keyword coupling that obtains in the metadata standard file.
CN2010106205153A 2010-12-23 2010-12-23 Methods and devices for integrating and searching data Pending CN102567418A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN2010106205153A CN102567418A (en) 2010-12-23 2010-12-23 Methods and devices for integrating and searching data

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN2010106205153A CN102567418A (en) 2010-12-23 2010-12-23 Methods and devices for integrating and searching data

Publications (1)

Publication Number Publication Date
CN102567418A true CN102567418A (en) 2012-07-11

Family

ID=46412848

Family Applications (1)

Application Number Title Priority Date Filing Date
CN2010106205153A Pending CN102567418A (en) 2010-12-23 2010-12-23 Methods and devices for integrating and searching data

Country Status (1)

Country Link
CN (1) CN102567418A (en)

Cited By (11)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103605715A (en) * 2013-11-14 2014-02-26 北京国双科技有限公司 Method and device used for data integration processing of multiple data sources
CN104298685A (en) * 2013-07-18 2015-01-21 北大方正集团有限公司 Method and device for achieving heterogeneous system unified searching
CN104715359A (en) * 2015-04-03 2015-06-17 广东中建普联科技有限公司 Identity management method for material files and material data of structure construction industry
CN104750853A (en) * 2015-04-14 2015-07-01 浪潮集团有限公司 Method and device for searching heterogeneous data
CN105988996A (en) * 2015-01-27 2016-10-05 腾讯科技(深圳)有限公司 Index file generation method and device
CN107295357A (en) * 2016-04-01 2017-10-24 深圳平安综合金融服务有限公司 Image file data input method, Cloud Server and terminal
CN108304401A (en) * 2017-01-11 2018-07-20 北大方正集团有限公司 E-book searching method and system
CN108415794A (en) * 2018-01-30 2018-08-17 河南职业技术学院 File backup method and file backup device
CN111078977A (en) * 2019-11-30 2020-04-28 深圳市智微智能软件开发有限公司 Client data management method and system
CN111930823A (en) * 2020-09-27 2020-11-13 武汉中科通达高新技术股份有限公司 Data query method and device, data center station and storage medium
CN113297207A (en) * 2020-08-24 2021-08-24 阿里巴巴集团控股有限公司 Data processing method, device and equipment

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1920817A (en) * 2006-09-14 2007-02-28 浙江大学 Method for multiple resources pools integral parallel search in open websites
CN101149750A (en) * 2007-10-29 2008-03-26 浙江大学 Data resource integrated method based on metadata
US20080270385A1 (en) * 2005-07-11 2008-10-30 Airbus Method and Tool For Searching In Several Data Sources For a Selected Community of Users

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20080270385A1 (en) * 2005-07-11 2008-10-30 Airbus Method and Tool For Searching In Several Data Sources For a Selected Community of Users
CN1920817A (en) * 2006-09-14 2007-02-28 浙江大学 Method for multiple resources pools integral parallel search in open websites
CN101149750A (en) * 2007-10-29 2008-03-26 浙江大学 Data resource integrated method based on metadata

Cited By (14)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104298685A (en) * 2013-07-18 2015-01-21 北大方正集团有限公司 Method and device for achieving heterogeneous system unified searching
CN103605715A (en) * 2013-11-14 2014-02-26 北京国双科技有限公司 Method and device used for data integration processing of multiple data sources
CN105988996A (en) * 2015-01-27 2016-10-05 腾讯科技(深圳)有限公司 Index file generation method and device
CN105988996B (en) * 2015-01-27 2020-04-10 腾讯科技(深圳)有限公司 Index file generation method and device
CN104715359B (en) * 2015-04-03 2017-11-17 广东中建普联科技股份有限公司 A kind of structuring construction industry material file and material data identification management method
CN104715359A (en) * 2015-04-03 2015-06-17 广东中建普联科技有限公司 Identity management method for material files and material data of structure construction industry
CN104750853A (en) * 2015-04-14 2015-07-01 浪潮集团有限公司 Method and device for searching heterogeneous data
CN107295357A (en) * 2016-04-01 2017-10-24 深圳平安综合金融服务有限公司 Image file data input method, Cloud Server and terminal
CN107295357B (en) * 2016-04-01 2021-03-16 深圳平安综合金融服务有限公司 Image file data entry method, cloud server and terminal
CN108304401A (en) * 2017-01-11 2018-07-20 北大方正集团有限公司 E-book searching method and system
CN108415794A (en) * 2018-01-30 2018-08-17 河南职业技术学院 File backup method and file backup device
CN111078977A (en) * 2019-11-30 2020-04-28 深圳市智微智能软件开发有限公司 Client data management method and system
CN113297207A (en) * 2020-08-24 2021-08-24 阿里巴巴集团控股有限公司 Data processing method, device and equipment
CN111930823A (en) * 2020-09-27 2020-11-13 武汉中科通达高新技术股份有限公司 Data query method and device, data center station and storage medium

Similar Documents

Publication Publication Date Title
CN102567418A (en) Methods and devices for integrating and searching data
US9569436B2 (en) Computer implemented method and system for annotating a contract
US7882122B2 (en) Remote access of heterogeneous data
CN102483765B (en) File search system and program
US10025782B2 (en) Systems and methods for multiple document version collaboration and management
CN111259006A (en) Universal distributed heterogeneous data integrated physical aggregation, organization, release and service method and system
US20130198221A1 (en) Indexing structures using synthetic document summaries
CN101158958B (en) Fusion enquire method based on MySQL storage engines
BRPI0715523A2 (en) document-centric workflow systems, methods, and software based on document content, metadata, and context
US10095789B2 (en) Method and system of searching composite web page elements and annotations presented by an annotating proxy server
AU2015331030A1 (en) System generator module for electronic document and electronic file
US20200279004A1 (en) Building lineages of documents
CN113190687B (en) Knowledge graph determining method and device, computer equipment and storage medium
CN109446410A (en) Knowledge point method for pushing, device and computer readable storage medium
CN104035993A (en) Memory search method for e-books, e-book management system and reading system
CN110188568A (en) Confidential information identification method, device, equipment and computer readable storage medium
Kurz et al. Semantic enhancement for media asset management systems: Integrating the Red Bull Content Pool in the Web of Data
CN110471892B (en) Revit file data collection method and related device
US9292537B1 (en) Autocompletion of filename based on text in a file to be saved
SG151105A1 (en) System, method and user interface providing customized document portfolio management
CN101374307B (en) Method and apparatus for updating digital content information of mobile equipment
US20080294632A1 (en) Method and System for Sorting/Searching File and Record Media Therefor
Liu et al. A study of entity search in semantic search workshop
CN103853832A (en) Customizable data capturing method in full-text retrieval system
JPH11184924A (en) Scheduling device

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C12 Rejection of a patent application after its publication
RJ01 Rejection of invention patent application after publication

Application publication date: 20120711