CN101763391A - Distributed website, information searching method and system thereof - Google Patents

Distributed website, information searching method and system thereof Download PDF

Info

Publication number
CN101763391A
CN101763391A CN200810241812A CN200810241812A CN101763391A CN 101763391 A CN101763391 A CN 101763391A CN 200810241812 A CN200810241812 A CN 200810241812A CN 200810241812 A CN200810241812 A CN 200810241812A CN 101763391 A CN101763391 A CN 101763391A
Authority
CN
China
Prior art keywords
http
website
key word
search
web site
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN200810241812A
Other languages
Chinese (zh)
Inventor
侯华锋
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Konka Group Co Ltd
Original Assignee
Konka Group Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Konka Group Co Ltd filed Critical Konka Group Co Ltd
Priority to CN200810241812A priority Critical patent/CN101763391A/en
Publication of CN101763391A publication Critical patent/CN101763391A/en
Pending legal-status Critical Current

Links

Images

Landscapes

  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
  • Information Transfer Between Computers (AREA)

Abstract

The invention discloses a distributed website, an information searching method and a system thereof, wherein the information searching method comprises the steps that: a client organizes keywords input by the user into HTTP-POST request message having the Content-Type of application/Search according to HTTP protocol, and sends the HTTP-POST request message into specified one or more distributed website(s); and website(s) receive(s) the HTTP-POST request message and analyze(s) the keywords, website content matched with the keywords is searched inside the website(s), and the website content is organized into XML text and returned into the client by HTTP response message; wherein the expression of the keywords is in accordance with the XML text format. The invention can be used for realizing distributed searching, and the search result is directly provided by a content owner, so that the real-time property, the accuracy and the objectivity of the content can be ensured.

Description

Distributed website and information search method thereof and system
Technical field
The present invention relates to network information search method, especially relate to a kind of based on HTML (Hypertext Markup Language) (HTTP, Hypertext Transfer Protocol) distributed network is carried out the method and system of information search, and the distributed website of carrying out this searching method.
Background technology
Along with popularizing of internet, the content information sharp increase on the internet, thereby also be used widely by the method that the content that will inquire about searched on the internet in key word.
Yet present way of search all is by several major companies, uses the Web Spider program of oneself, constantly searches for thousands of webpage, and then does further processing by own ordering techniques, the information processing technology.Therefore, the keyword results of user capture is not necessarily with the content match of original web; Search result information may lag behind, and corresponding Search Results does not exist; And the ordering of Search Results also may searched company be revised arbitrarily.
Investigation shows have 95% people to use search engine now, and they is used as " the decision-making consultant " of each side such as health, financing, work, life, so search engine provides complete, just, objective information particularly important to the netizen.
Summary of the invention
The objective of the invention is to propose a kind of based on HTML (Hypertext Markup Language) (HTTP, Hypertext Transfer Protocol) distributed network is carried out the method and system of information search, and the distributed website of carrying out this searching method, the Search Results of objective reality is provided for the user.
For solving technical matters of the present invention, the present invention discloses a kind of information search method of distributed network, its, comprising:
Client is the HTTP-POST request message of application/Search with the key word of user's input by the type that http protocol is organized into Content-Type, and the HTTP-POST request message is sent to one or more distributed websites of appointment;
Website receives described HTTP-POST request message, and the analysis of key word at the web site contents of the inner search of website with keyword matching, is organized into the XML text with web site contents and returns to client with http response message;
Wherein, the expression formula of key word must meet the XML text formatting.
Preferably, the present invention be according to the degree of correlation of key word to the processing of sorting of the web site contents of described coupling.
Preferably, the present invention be according to the significance level of the similarity degree of key word or web site contents to the processing of sorting of the web site contents of described coupling.
In addition, the present invention also discloses a kind of information search system of distributed network, and it comprises:
Client: is the HTTP-POST request message of application/Search with the key word of user's input by the type that http protocol is organized into Content-Type, and the HTTP-POST request message is sent to one or more distributed websites of appointment; Wherein, the expression formula of key word must meet the XML text formatting;
The distributed a plurality of websites that are connected into the internet: receive described HTTP-POST request message, the analysis of key word, at the web site contents of the inner search of website, web site contents is organized into the XML text returns to client with http response message with keyword matching.
Moreover the present invention also discloses a kind of distributed website, and with the distributed internet that is connected into, wherein, each website comprises:
The network receiver module, being used to receive client is the HTTP-POST request message of application/Search with the key word that the user imports by the type that http protocol is organized into Content-Type by network;
Search module is used for analysis of key word among the HTTP-POST request message, at the web site contents of the inner search of website with keyword matching;
The search response module is used for that the web site contents with keyword matching is organized into the XML text and returns to client with http response message;
Wherein, the expression formula of key word must meet the XML text formatting.
Preferably, each website comprises the Search Results order module, be used for according to the degree of correlation of key word to the processing of sorting of the web site contents of described coupling.
Preferably, the Search Results order module be according to the significance level of the similarity degree of key word or web site contents to the processing of sorting of the web site contents of described coupling.
Compared with prior art, the present invention has following beneficial effect:
The present invention utilizes existing HTTP host-host protocol, define a kind of distributed search mode, contain and send the universal standard that searching request, Search Results are returned, can realize distributed search, and Search Results directly provides real-time, the accuracy of assurance content from the content owner; In addition, as long as each website is followed this distributed mode, then all websites can provide the Search Results of oneself, and no matter which kind of technology the website is based on and builds, which kind of storage mode web site contents is based on is preserved, thereby has ensured the comprehensive and extensive applicability of information search.
Description of drawings
Fig. 1 is system topology figure of the present invention;
Fig. 2 is a schematic flow sheet of the present invention;
Fig. 3 is the structural representation of website among Fig. 1.
Embodiment
As shown in Figure 1, plurality of network website 30 is distributed in various places, connects by network 20.Client 10 is then passed through network access equipment (such as router, network node) access network 20, and can be by network 20 any one website 30 of visit.
In conjunction with shown in Figure 2.Any one client 10 can be the HTTP-POST request message of application/Search by the type that http protocol is organized into Content-Type with the key word of user's input all, and the HTTP-POST request message is sent to one or more distributed websites of appointment, wherein, the expression formula of key word must meet extend markup language (Extensible Markup Language, XML) text formatting.
Because http protocol is simple, make the program small scale of http server, thereby communication speed is very fast; In addition, HTTP allows the data object of transmission any type, just in the type of data object or content by Content-Type mark in addition, that is to say, the HTTP content type of Content-Type attribute specified services end response, if do not specify Content-Type, be defaulted as text/html.
And the present invention expands in conventional H TTP agreement a little, and the type that increases a Content-Type is application/Search, and the data object that its expression is being transmitted or the Type C ontent-Type of content search for.
Any one website receives described HTTP-POST request message, the analysis of key word, at the web site contents of the inner search of website with keyword matching, according to the degree of correlation of key word to the processing of sorting of the web site contents of described coupling, even according to the significance level of the similarity degree of key word or web site contents to the processing of sorting of the web site contents of described coupling, the web site contents after ordering handled is organized into the XML text and returns to client with http response message.
In conjunction with shown in Figure 3, the present invention also discloses a kind of distributed website 30, and with the distributed internet 20 that is connected into, wherein, each website 30 comprises: network receiver module 31, search module 32, Search Results order module 33 and search response module.
Wherein, it is the HTTP-POST request message of application/Search with the key word that the user imports by the type that http protocol is organized into Content-Type by network that network receiver module 31 is used to receive client, and the expression formula of key word must meet the XML text formatting; Search module 32 is used for analysis of key word among the HTTP-POST request message, at the web site contents of the inner search of website with keyword matching; Search Results order module 33 be used for according to the degree of correlation of key word to the processing of sorting of the web site contents of described coupling, preferably, the Search Results order module be according to the significance level of the similarity degree of key word or web site contents to the processing of sorting of the web site contents of described coupling; Search response module 34 is used for that the web site contents with keyword matching is organized into the XML text and returns to client with http response message.
To sum up, the present invention utilizes existing HTTP host-host protocol, proposes a kind of distributed search mode, contain and send the universal standard that searching request, Search Results are returned, can realize distributed search, and Search Results directly provides real-time, the accuracy of assurance content from the content owner; In addition, as long as each website is followed this distributed mode, then all websites can provide the Search Results of oneself, and no matter which kind of technology the website is based on and builds, which kind of storage mode web site contents is based on is preserved, thereby has ensured the comprehensive and extensive applicability of information search.

Claims (7)

1. the information search method of a distributed network is characterized in that, comprising:
Client is the HTTP-POST request message of application/Search with the key word of user's input by the type that http protocol is organized into Content-Type, and the HTTP-POST request message is sent to one or more distributed websites of appointment;
Website receives described HTTP-POST request message, and the analysis of key word at the web site contents of the inner search of website with keyword matching, is organized into the XML text with web site contents and returns to client with http response message;
Wherein, the expression formula of key word must meet the XML text formatting.
2. according to the information search method of the described distributed network of claim 1, it is characterized in that, also comprise: according to the degree of correlation of key word to the processing of sorting of the web site contents of described coupling.
3. according to the information search method of the described distributed network of claim 2, it is characterized in that, according to the significance level of the similarity degree of key word or web site contents to the processing of sorting of the web site contents of described coupling.
4. the information search system of a distributed network is characterized in that, comprising:
Client: is the HTTP-POST request message of application/Search with the key word of user's input by the type that http protocol is organized into Content-Type, and the HTTP-POST request message is sent to one or more distributed websites of appointment; Wherein, the expression formula of key word must meet the XML text formatting;
The distributed a plurality of websites that are connected into the internet: receive described HTTP-POST request message, the analysis of key word, at the web site contents of the inner search of website, web site contents is organized into the XML text returns to client with http response message with keyword matching.
5. distributed website, is characterized in that each website comprises with the distributed internet that is connected into:
The network receiver module, being used to receive client is the HTTP-POST request message of application/Search with the key word that the user imports by the type that http protocol is organized into Content-Type by network;
Search module is used for analysis of key word among the HTTP-POST request message, at the web site contents of the inner search of website with keyword matching;
The search response module is used for that the web site contents with keyword matching is organized into the XML text and returns to client with http response message;
Wherein, the expression formula of key word must meet the XML text formatting.
6. according to the described distributed website of claim 5, it is characterized in that each website comprises the Search Results order module, be used for according to the degree of correlation of key word to the processing of sorting of the web site contents of described coupling.
7. according to the described distributed website of claim 6, it is characterized in that, the Search Results order module be according to the significance level of the similarity degree of key word or web site contents to the processing of sorting of the web site contents of described coupling.
CN200810241812A 2008-12-23 2008-12-23 Distributed website, information searching method and system thereof Pending CN101763391A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN200810241812A CN101763391A (en) 2008-12-23 2008-12-23 Distributed website, information searching method and system thereof

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN200810241812A CN101763391A (en) 2008-12-23 2008-12-23 Distributed website, information searching method and system thereof

Publications (1)

Publication Number Publication Date
CN101763391A true CN101763391A (en) 2010-06-30

Family

ID=42494555

Family Applications (1)

Application Number Title Priority Date Filing Date
CN200810241812A Pending CN101763391A (en) 2008-12-23 2008-12-23 Distributed website, information searching method and system thereof

Country Status (1)

Country Link
CN (1) CN101763391A (en)

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102521267A (en) * 2011-11-21 2012-06-27 沈文策 In-station information searching method and system
CN108304421A (en) * 2017-02-24 2018-07-20 腾讯科技(深圳)有限公司 A kind of information search method and device
CN109857958A (en) * 2019-02-13 2019-06-07 杭州孝道科技有限公司 A kind of method that http input point is searched
CN110442619A (en) * 2019-07-29 2019-11-12 新华三大数据技术有限公司 Search result ordering method, device, electronic equipment and storage medium

Cited By (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102521267A (en) * 2011-11-21 2012-06-27 沈文策 In-station information searching method and system
CN102521267B (en) * 2011-11-21 2014-01-22 沈文策 In-station information searching method and system
CN108304421A (en) * 2017-02-24 2018-07-20 腾讯科技(深圳)有限公司 A kind of information search method and device
CN108304421B (en) * 2017-02-24 2021-03-23 腾讯科技(深圳)有限公司 Information searching method and device
CN109857958A (en) * 2019-02-13 2019-06-07 杭州孝道科技有限公司 A kind of method that http input point is searched
CN110442619A (en) * 2019-07-29 2019-11-12 新华三大数据技术有限公司 Search result ordering method, device, electronic equipment and storage medium
CN110442619B (en) * 2019-07-29 2022-02-11 新华三大数据技术有限公司 Search result ordering method and device, electronic equipment and storage medium

Similar Documents

Publication Publication Date Title
CN101452453B (en) A kind of method of input method Web side navigation and a kind of input method system
US20090248818A1 (en) Cooperating system, chat server, program, and cooperating method
JP2004537108A (en) Method and system for transforming an XML document into at least one XML document configured according to a subset of an XML grammar rule set
US20100011025A1 (en) Transfer learning methods and apparatuses for establishing additive models for related-task ranking
EA201300375A1 (en) THE METHOD OF ORGANIZING A SEARCH DATABASE USING FUZZY CRITERIA
KR20090008777A (en) System and method of collecting wish-list service of on-line shoping malls
US9582588B2 (en) Methods and systems for providing custom crawl-time metadata
CN103116635A (en) Field-oriented method and system for collecting invisible web resources
CN104636386A (en) Information monitoring method and device
US20120059926A1 (en) System and method for semantic service
CN101894109A (en) Database building method and device
CN101763391A (en) Distributed website, information searching method and system thereof
KR20090048998A (en) System and method for alarming bad public opinion using keyword and recording medium
KR20110009301A (en) Method for displaying acquaintance review, server and program recording medium
CN106156193A (en) Search and the collection method of network address, browser, server and system
CN103425646A (en) Web service discovery method and device
Malik et al. Ontology and Web Usage Mining towards an Intelligent Web focusing web logs
JP2007207202A (en) Information providing system using web log
KR101363497B1 (en) Method and apparatus for managing foaf data
KR101079802B1 (en) System and Method for Searching Website, Devices for Searching Website and Recording Medium
JP7003481B2 (en) Reinforcing rankings for social media accounts and content
KR20090049507A (en) System and method for analysing public opinion using communication network and recording medium
US7890515B2 (en) Article distribution system and article distribution method used in this system
KR101746594B1 (en) push message providing system based on web crawler by learning and following user search history
US20130046751A1 (en) Method and Arrangement for Control of Web Resources

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C12 Rejection of a patent application after its publication
RJ01 Rejection of invention patent application after publication

Application publication date: 20100630