CN105426431A - Search system for distributed resource site and implementation method thereof - Google Patents

Search system for distributed resource site and implementation method thereof Download PDF

Info

Publication number
CN105426431A
CN105426431A CN201510741886.XA CN201510741886A CN105426431A CN 105426431 A CN105426431 A CN 105426431A CN 201510741886 A CN201510741886 A CN 201510741886A CN 105426431 A CN105426431 A CN 105426431A
Authority
CN
China
Prior art keywords
resource
service
website
index
module
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201510741886.XA
Other languages
Chinese (zh)
Inventor
胡文彬
李勇波
季统凯
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
G Cloud Technology Co Ltd
Original Assignee
G Cloud Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by G Cloud Technology Co Ltd filed Critical G Cloud Technology Co Ltd
Priority to CN201510741886.XA priority Critical patent/CN105426431A/en
Publication of CN105426431A publication Critical patent/CN105426431A/en
Pending legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/95Retrieval from the web
    • G06F16/951Indexing; Web crawling techniques

Abstract

The invention relates to the technical field of resource searching and sharing, and particularly relates to a search system for a distributed resource site and an implementation method thereof. A resource site module of the invention issues a resource service to a resource service registration module and authorizes according to the resource type at the same time; the resource service registration module registers the resource service of the resource site; a resource sorting module obtains the resource service information of the resource site through the resource service registration module and sends the resource service information to an index generation module; the index generation module carries out a series of operations on the resource service information, such as analysis, calculation and indexing and the like, and stores index information in a resource index database; and a user inquires and accesses the resource service through a resource query module. Meanwhile, a Web Service external interface is provided for enabling an external application program to schedule the search system or directly access the resource site and obtain the resource service, in order to share resources. The search system provided by the invention can meet the resource searching and resource sharing demands of the distributed resource site, and can be applied to resource searching and sharing of the distributed resource site.

Description

A kind of search system of Based on Distributed resource website and its implementation
Technical field
The present invention relates to resource searching and technology of sharing field, particularly a kind of search system of Based on Distributed resource website and its implementation.
Background technology
Along with the development of internet and deepening continuously of IT application in enterprise, the quantity of the various Information application platform of enterprises gets more and more, and the data resource on platform, also in quick growth, defines the data resource website of distributivity; Search system is the critical services of quick obtaining desired data resource; but general search system function of search is single; some special data resource can not be searched for; the safety practice such as authentication and authorization is not taked to the data resource of some secret yet; can not to provide convenience resource searching service efficiently for user, also cannot effectively protect secret data in enterprise resource.
Summary of the invention
One of technical matters that the present invention solves is the search system providing a kind of Based on Distributed resource website; Adopt the mode of resource service, realize resource index and resource sharing by the issue of resource service.
Two of the technical matters that the present invention solves is the search implementation method providing a kind of Based on Distributed resource website; Based on resource service, data resource is classified, by authentication and authorization, while resource searching service is provided, effectively protect the confidential data resource of enterprises.
The technical scheme that the present invention one of solves the problems of the technologies described above is:
Described system is made up of resource website, resource service registration, the generation of resource go-on-go, index, resource index storehouse and the large module of resource query six;
Described resource website is the data source of search system, primary responsibility is issued resource service, mandate and is extracted data resource, the grade that the data resource of resource website can be divided into public resource, shared resource and confidential resources etc. different, for shared resource and confidential resources, must obtain associated authorization could use;
The resource service that each resource website of registration is issued is responsible in described resource service registration, is served by registration resource, the resource service that acquisition resource website specifically provides and access method thereof;
Described resource go-on-go takes different retrieval modes to resource, does not need to authorize to the public resource such as generic web page, video file, and conventional method can be used to carry out data resource crawl; The needs such as shared and confidential document are authorized, then the descriptor returned to document;
Described index generates is responsible for the resource service relevant information collected to integrate, and calculates according to relevancy algorithm, and last generating web page index is saved in resource index storehouse;
The index information storing data resource is responsible in described resource index storehouse;
Described resource query is responsible for decomposing the keyword of user's input and searching for, then from resource index storehouse, carry out matching inquiry and sort, finally being got up by the content integrations such as the chained address of Search Results and content of pages summary feeds back to user with the form of Web page.
The present invention solve the problems of the technologies described above two technical scheme be:
Described method is according to following process step process:
The first step, resource website issues the resource service of this website to resource service Registering modules;
Second step, resource service Registering modules is responsible for registering the resource service of this resource website;
3rd step, resource service information, by the resource service information of resource service Registering modules Gains resources website, is sent to index generation module by resource go-on-go module;
4th step, index generation module is analyzed resource service information, calculate and after the sequence of operations such as index, index information is stored into resource index storehouse;
5th step, when user is inquired about by resource query module, resource query module in charge decomposes the keyword that user inputs and searches for, then from resource index storehouse, carry out matching inquiry and sort, finally being got up by the content integrations such as the chained address of Search Results and content of pages summary feeds back to user with the form of Web page;
6th step, for needing to obtain the arthorization the resource that could access, resource query module, according to the index information returned, is conducted interviews to corresponding resource website by resource go-on-go module.
Described resource service Registering modules provides WebService external interface simultaneously, allows external application can call search system or directly access resources website Gains resources service, realizes resource sharing.
Described resource website mainly provides issue resource service; Site resource mandate and authentication; Extract this website Various types of data resource.
Adopt system and method for the present invention, there is following beneficial effect: (1) is applicable to the resource searching of distributed resource website; (2) resource sharing of distributed resource website is applicable to; (3) adopt the mode of resource service, obtain as required; (4) WebService technology is adopted, not by the restriction of system platform; (5) resource classification and authorization is adopted, available protecting sensitive data; (6) provide external interface, external application can directly call, with Gains resources service or integration search function.
Accompanying drawing explanation
Below in conjunction with accompanying drawing, the present invention is further described:
Fig. 1 is configuration diagram of the present invention.
Fig. 2 is the configuration diagram that the resource service of station resource point module of the present invention is issued.
Embodiment
As shown in Figure 1, system of the present invention is primarily of the large module composition of resource website, resource service registration, the generation of resource go-on-go, index, resource index storehouse and resource query six.
1, resource website: resource website is the data source of search system, primary responsibility is issued resource service, mandate and is extracted data resource, the grade that the data resource of resource website can be divided into public resource, shared resource and confidential resources etc. different, for shared resource and confidential resources, must obtain associated authorization could use;
2, resource service registration: the resource service that each resource website of registration is issued is responsible in resource service registration, is served, just can know the resource service that resource website specifically provides and access method thereof by registration resource;
3, resource go-on-go: because the type of data resource is different, there is dividing of public resource, shared resource and confidential resources, shared resource and confidential resources need to authorize and could use, so different retrieval modes should be taked to resource, such as do not need to authorize to the public resource such as generic web page, video file, conventional method can be used to carry out data resource crawl, and the needs such as shared and confidential document are authorized, then the descriptor returned to document;
4, index generates: index generates is responsible for the resource service relevant information collected to integrate, and calculates according to relevancy algorithm, and last generating web page index is saved in resource index storehouse;
5, resource index storehouse: the index information storing data resource is responsible in resource index storehouse;
6, resource query: resource query is responsible for decomposing the keyword of user's input and searching for, then from resource index storehouse, carry out matching inquiry and sort, finally being got up by the content integrations such as the chained address of Search Results and content of pages summary feeds back to user with the form of Web page.
As shown in Figure 1, the detailed implementing procedure of the search system of Based on Distributed resource website is:
The first step, resource website issues the resource service of this website to resource registering module by resource service release process;
Second step, the resource service of resource registering module in charge to this resource website is registered;
3rd step, resource service information, by the resource service information of resource registering module Gains resources website, is sent to index generation module by resource go-on-go module;
4th step, index generation module is analyzed resource service information, calculate and after the sequence of operations such as index, index information is stored into resource index storehouse;
5th step, when user is inquired about by resource query module, resource query is responsible for decomposing the keyword of user's input and searching for, then from resource index storehouse, carry out matching inquiry and sort, finally being got up by the content integrations such as the chained address of Search Results and content of pages summary feeds back to user with the form of Web page;
6th step, for needing to obtain the arthorization the resource that could access, resource query module, according to the index information returned, is conducted interviews to corresponding resource website by resource go-on-go module.
Described resource service Registering modules provides WebService external interface simultaneously, allows external application can call search system or directly access resources website Gains resources service, realizes resource sharing.
As shown in Figure 2, the resource service of resource website is issued and is mainly provided following functions:
1, resource service is issued;
2, site resource mandate and authentication;
3, this website Various types of data resource is extracted.

Claims (4)

1. a search system for Based on Distributed resource website, is characterized in that: described system is made up of resource website, resource service registration, the generation of resource go-on-go, index, resource index storehouse and the large module of resource query six;
Described resource website is the data source of search system, primary responsibility is issued resource service, mandate and is extracted data resource, the grade that the data resource of resource website can be divided into public resource, shared resource and confidential resources etc. different, for shared resource and confidential resources, must obtain associated authorization could use;
The resource service that each resource website of registration is issued is responsible in described resource service registration, is served by registration resource, the resource service that acquisition resource website specifically provides and access method thereof;
Described resource go-on-go takes different retrieval modes to resource, does not need to authorize to the public resource such as generic web page, video file, and conventional method can be used to carry out data resource crawl; The needs such as shared and confidential document are authorized, then the descriptor returned to document;
Described index generates is responsible for the resource service relevant information collected to integrate, and calculates according to relevancy algorithm, and last generating web page index is saved in resource index storehouse;
The index information storing data resource is responsible in described resource index storehouse;
Described resource query is responsible for decomposing the keyword of user's input and searching for, then from resource index storehouse, carry out matching inquiry and sort, finally being got up by the content integrations such as the chained address of Search Results and content of pages summary feeds back to user with the form of Web page.
2. an implementation method for the search system of Based on Distributed resource website according to claim 1, is characterized in that: described method is according to following process step process:
The first step, resource website issues the resource service of this website to resource service Registering modules;
Second step, resource service Registering modules is responsible for registering the resource service of this resource website;
3rd step, resource service information, by the resource service information of resource service Registering modules Gains resources website, is sent to index generation module by resource go-on-go module;
4th step, index generation module is analyzed resource service information, calculate and after the sequence of operations such as index, index information is stored into resource index storehouse;
5th step, when user is inquired about by resource query module, resource query module in charge decomposes the keyword that user inputs and searches for, then from resource index storehouse, carry out matching inquiry and sort, finally being got up by the content integrations such as the chained address of Search Results and content of pages summary feeds back to user with the form of Web page;
6th step, for needing to obtain the arthorization the resource that could access, resource query module, according to the index information returned, is conducted interviews to corresponding resource website by resource go-on-go module.
3. implementation method according to claim 2, it is characterized in that: described resource service Registering modules provides WebService external interface simultaneously, allow external application can call search system or directly access resources website Gains resources service, realize resource sharing.
4. the implementation method according to Claims 2 or 3, is characterized in that: described resource website mainly provides issue resource service; Site resource mandate and authentication; Extract this website Various types of data resource.
CN201510741886.XA 2015-11-02 2015-11-02 Search system for distributed resource site and implementation method thereof Pending CN105426431A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201510741886.XA CN105426431A (en) 2015-11-02 2015-11-02 Search system for distributed resource site and implementation method thereof

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201510741886.XA CN105426431A (en) 2015-11-02 2015-11-02 Search system for distributed resource site and implementation method thereof

Publications (1)

Publication Number Publication Date
CN105426431A true CN105426431A (en) 2016-03-23

Family

ID=55504643

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201510741886.XA Pending CN105426431A (en) 2015-11-02 2015-11-02 Search system for distributed resource site and implementation method thereof

Country Status (1)

Country Link
CN (1) CN105426431A (en)

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111597254A (en) * 2020-04-14 2020-08-28 口碑(上海)信息技术有限公司 Resource data sharing method, device and equipment
CN111899885A (en) * 2020-06-28 2020-11-06 万达信息股份有限公司 Distributed personnel event index implementation method and system
CN115168690A (en) * 2022-09-06 2022-10-11 深圳市明源云科技有限公司 Data query method and device based on browser plug-in, electronic equipment and medium

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2001091242A1 (en) * 2000-05-25 2001-11-29 Alexandr Gennadievich Protasov Device for preventing access to a male plug
CN101359338A (en) * 2007-07-26 2009-02-04 株式会社理光 Data providing apparatus, data providing method and program
US7568097B2 (en) * 2001-04-05 2009-07-28 International Business Machines Corporation Method for file system security by controlling access to the file system resources using externally stored attributes
CN103745006A (en) * 2014-01-24 2014-04-23 吕书成 Internet information searching system and internet information searching method

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2001091242A1 (en) * 2000-05-25 2001-11-29 Alexandr Gennadievich Protasov Device for preventing access to a male plug
US7568097B2 (en) * 2001-04-05 2009-07-28 International Business Machines Corporation Method for file system security by controlling access to the file system resources using externally stored attributes
CN101359338A (en) * 2007-07-26 2009-02-04 株式会社理光 Data providing apparatus, data providing method and program
CN103745006A (en) * 2014-01-24 2014-04-23 吕书成 Internet information searching system and internet information searching method

Cited By (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111597254A (en) * 2020-04-14 2020-08-28 口碑(上海)信息技术有限公司 Resource data sharing method, device and equipment
CN111597254B (en) * 2020-04-14 2023-07-21 口碑(上海)信息技术有限公司 Resource data sharing method, device and equipment
CN111899885A (en) * 2020-06-28 2020-11-06 万达信息股份有限公司 Distributed personnel event index implementation method and system
CN115168690A (en) * 2022-09-06 2022-10-11 深圳市明源云科技有限公司 Data query method and device based on browser plug-in, electronic equipment and medium
CN115168690B (en) * 2022-09-06 2022-12-27 深圳市明源云科技有限公司 Data query method and device based on browser plug-in, electronic equipment and medium

Similar Documents

Publication Publication Date Title
EP3726411A1 (en) Data desensitising method, server, terminal, and computer-readable storage medium
US11176124B2 (en) Managing a search
CN111382174B (en) Multi-party data joint query method, device, server and storage medium
CN104598631B (en) Distributed data processing platform
CN103984745A (en) Distributed video vertical searching method and system
CN112989412B (en) Data desensitization method and device based on SQL statement analysis
US20090063448A1 (en) Aggregated Search Results for Local and Remote Services
US8909669B2 (en) System and method for locating and retrieving private information on a network
CN103902535A (en) Method, device and system for obtaining associational word
CN105468744A (en) Big data platform for realizing tax public opinion analysis and full text retrieval
CN103092844B (en) A kind of index establishing method and system, searching method and system
CN107491463B (en) Optimization method and system for data query
CN105426431A (en) Search system for distributed resource site and implementation method thereof
Chen et al. An efficient authorization framework for securing industrial Internet of Things
CN105447342B (en) script encryption method, decryption method and engine
CN102508884A (en) Method and device for acquiring hotpot events and real-time comments
CN103198066A (en) Word list based information search method and search system
CN104881398A (en) Method for extracting author affiliation information of English literature published by Chinese authors
CN104636368A (en) Data retrieval method and device and server
CN106326317A (en) Data processing method and device
CN105677745A (en) General efficient self-service data search system and implementation method
CN104954465B (en) One kind is suitable for privacy policy synthetic method under cloud service combine scenes
Turek et al. Extensible web crawler–towards multimedia material analysis
CN102339292A (en) Distributed searching method and system
CN105589863B (en) Searching method, data processing method, device and system

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication
RJ01 Rejection of invention patent application after publication

Application publication date: 20160323