CN102622402A - Server, method and system for providing information search service by using sheaf of pages - Google Patents

Server, method and system for providing information search service by using sheaf of pages Download PDF

Info

Publication number
CN102622402A
CN102622402A CN2012100082799A CN201210008279A CN102622402A CN 102622402 A CN102622402 A CN 102622402A CN 2012100082799 A CN2012100082799 A CN 2012100082799A CN 201210008279 A CN201210008279 A CN 201210008279A CN 102622402 A CN102622402 A CN 102622402A
Authority
CN
China
Prior art keywords
group
webpage
url
index
web
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN2012100082799A
Other languages
Chinese (zh)
Other versions
CN102622402B (en
Inventor
南世东
愼重熩
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
CHUTNOON Co Ltd
Original Assignee
CHUTNOON Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by CHUTNOON Co Ltd filed Critical CHUTNOON Co Ltd
Publication of CN102622402A publication Critical patent/CN102622402A/en
Application granted granted Critical
Publication of CN102622402B publication Critical patent/CN102622402B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Landscapes

  • Information Transfer Between Computers (AREA)

Abstract

The invention discloses a server, method, and system for providing an information search service by using sheaf of pages. The method comprises the following steps: establishing the position information pattern of these data by analyzing the position information initially positioned by the collected data; grouping the collected data into data groups according to the established position information pattern; and choosing one data related to the keyword from the data group and providing a group searching result.

Description

Use page set and server, the method and system of information search service are provided
The application be that March 3, application number in 2006 are 200680006631.8 the applying date, denomination of invention divides an application for the application for a patent for invention of " using page set that server, the method and system of information search service are provided ".
Technical field
The present invention relates to a kind of information search service, more precisely, is a kind ofly to use page group and method, system and the server of information search service are provided.
Background technology
Along with development of Internet, internet information search techniques has obtained very big development, and makes great deal of information on network, can be processed and pile up, and the user can search not only information soon but also certainly.
Internet information search techniques makes the user can use web browser (web browser) to search various information easily from network, picture for example, sound, film image etc.Yet search technique exists an adverse factors and is exactly, along with network address with geometric growth, they can't provide the user real essential information.Modal a kind of method that addresses this is that just is to use search engine.
Search engine is a kind of program that is designed to help the information of finding, these information stores for example are stored in the WWW in public or private network or the PC in computer system.Search engine is through search utility, and for example search machine people or crawler are created the index of website information, and index information is stored in the database.It allows user inquiring to meet the content (particularly those contain the content of given word or phrase) of ad hoc rules, and returns a reference listing that is complementary with ad hoc rules.
Search engine uses web index method, web directory method and first searching method.Web index method is a most general a kind of searching method.It passes through search utility for example search machine people or crawler; Create the index of website information; And index information is stored in the database, and it allows user inquiring to meet the content of ad hoc rules, and returns a reference listing that is complementary with ad hoc rules.
Web directory method is classified editing a database to the page on the Internet according to theme and level, the path of creating clauses and subclauses then, and its allows the user to select and needs the immediate clauses and subclauses of information, and then the scope of dwindling search gradually.
Unit's searching method is a kind of high-level network indexing means, and it creates a tabulation that the search engine of search service can be provided in web index method, make the user can select a search engine to search for.
But these search engines all exist following deficiency separately.Web directory method can not obtain substantial Search Results, because in Search Results, only comprised the webpage of relatively small amount.In addition, the web directory method search is very consuming time, because it needs a lot of steps to come acquired information.Web index method and first searching method make the user before a large amount of Search Results, feel confused, and its reliability of search result is very low, because they offer all pages of user, comprise query page.
Unit's searching method and web index method at first use their algorithm to provide reliability high webpage.But these pages may not offer their information of wanting of user, all have been provided because comprise all pages of inquiry.
For example, above-mentioned searching method can provide the canned data of one page in the book, and the canned data of or many books can be provided, and makes that complex search is impossible.Therefore, solve the low integrity problem of Search Results, auxiliary content, cybercaf's blog (Internet caf ◎ blog) for example, perhaps information service just is applied to search engine and has suffered.
Summary of the invention
Technical scheme
The invention provides a kind of method that information search service can be provided, system and server, this service can be carried out index to the one group of page that meets ad hoc rules, and in this group page, searches for.
Beneficial effect
According to the present invention; The user can be not only fast but also accurate finding information on the Internet; Because a web pages is analyzed in order to create a patterns of position information; The webpage that the use location information pattern will contain similar information is grouped into many groups, then contains a plurality of pages with query-related information, offers the user again after just the form of a representing pages and some low-level pages is divided into one group.
Description of drawings
Through the detailed description of illustrative examples, above and other characteristics of the present invention will be clearer with advantage, wherein the following accompanying drawing of reference:
Fig. 1 is according to one embodiment of present invention, uses one group of page that the block scheme of the system of information search service is provided;
Fig. 2 according to one embodiment of present invention, the block scheme of a group search server;
Fig. 3 and 4 is synoptic diagram of explanation URL according to an embodiment of the invention (URL) pattern and a URL scheme-tree (UP tree);
Fig. 5 is according to one embodiment of present invention, uses one group of page that the process flow diagram of the method for information search service is provided; And
Fig. 6 is a group result for retrieval according to an embodiment of the invention.
The optimum way that carries out an invention
According to an aspect of the present invention, it provides a kind of method that group search service is provided, and comprising: the patterns of position information of (a) creating these data through the positional information of analyzing the initial location of collected data; (b) according to the patterns of position information of having created collected data are divided into groups; And (c) from data set, select an a data set relevant and group search result is provided with key word.
According to another aspect of the present invention; It provides the method that a kind of group search service is provided in a system; This system comprises a user terminal that sends inquiry and output Search Results; The web server that a plurality of pages are provided, and one receive inquiry and create and send the group search server of Search Results to user terminal from user terminal, this method comprises: (a) receive inquiry and query requests signal from user terminal; (b) reception is from the webpage of web server; (c) analyzing web page to be creating a URL pattern, and assigns to a group of web to these webpages with this URL pattern; (d) from group of web, extract index, create index information, and create the URL information of the group of web of index institute reference; And (e) comparison query and index are created a group search result and this result are sent to user terminal.
According to another aspect of the present invention; It provides a system that group search service is provided; This group search service obtains through search information in a plurality of webpages in Wireless/wired network, and system comprises: a user terminal of on Wireless/wired communication network, realizing surfing the web, and it produces searching request through transmitting inquiry and search request signal; Receive the corresponding group search result of this request, and the output group search result is to display unit; The web server from information, creating webpage and webpage is provided; And reception and analyzing web page to be creating the URL pattern, and use the URL pattern and be grouped into group of web to webpage, and group of web is carried out index, search information and create and transmit the group search server that group search result is given user terminal in group of web.
According to another aspect of the present invention, it provides a group search server, and it comprises: a patterns of position information generation module, and it creates the patterns of position information of these data through the positional information of analyzing the initial location of collected data; A webpage grouping module, it is data set according to the patterns of position information of having created with collected packet; And a controller, it is selected a data set relevant with key word and a group search result is provided from data set.
According to another aspect of the present invention; It provides a group search server; This server is received on the Wireless/wired communication network inquiry and the searching request that the user terminal realizing surfing the web sends, the netpage search information that provides at the web server, and send Search Results and give user terminal; This group search server comprises: a web page collection module; It carries out the collecting web page program, the webpage that obtains in order to receive the Wireless/wired communication network of web server access from the web server, and store these webpages; A URL pattern generation module, the webpage that it receives through the analyzing web page collection module is created the URL pattern; A webpage grouping module, the URL pattern that it utilizes URL pattern generation module to create is grouped into group of web with webpage; An index management module, it extracts index from the group of web that the webpage grouping module is divided into groups, in order to create and to store the URL information of the group of web of index information and the reference of index institute; A searching and managing module, its is according to the inquiry of receiving and search request signal and search index information, and the URL information creating that will have the group of web of index associated with the query is a group search result, and group search result is sent to user terminal; And controller; It controls web page collection module, URL pattern generation module, webpage grouping module; Index management module; The searching and managing module makes group search server can use group of web to accomplish search, and carries out communication through Wireless/wired communication network and client terminal and web server.
Embodiment
With combining accompanying drawing, illustrative examples of the present invention is described in detail now.
Fig. 1 is according to one embodiment of present invention, uses the page to divide into groups and the block scheme of the system of information search service is provided.
According to one embodiment of present invention; Use page grouping and provide the system of information search service to comprise 120, one web servers 130 of 110, one Wireless/wired communication networks of a user terminal; A group search server 140; 141, one index servers 150 of a group search db (after this all representing database) and an index data base 151 with DB.
User terminal 110 sends an inquiry and search request signal, and receives the group search result from group search server 140 through Wireless/wired communication network 120 access group search servers 140, exports group search result again to display unit.
User terminal 110 comprises a wire communication unit; This unit comprises an internet modem; For example high bit rate digital subscriber line (VDSL) modulator-demodular unit and cable modem; And/or a mobile communication unit, this unit comprises a mobile communication modulator-demodular unit, for example CDMA (CDMA) 2000 modulator-demodular units and wideband CDMA (W-CDMA) modulator-demodular unit.User terminal 110 uses the communication unit that comprises to visit group search server 140 through Wireless/wired communication network 120.User terminal further comprises a controller that comprises an internal memory and a microprocessor.Internal memory is deposited network browser program, and these programs are used to receive user inquiring, the solicited message search, and the output Search Results is given display unit.The operation of microprocessor control user terminal 110.
The example of user terminal 110 comprises a personal computer (PC); For example computer or a kneetop computer on the table; And a communicating terminal, for example personal digital assistant (PDA), mobile phone, person-to-person communication Service Phone, palm PC, global system for mobile communications (GSM) phone, W-CDMA mobile phone, CDMA-2000 mobile phone and mobile broadband system (MBS) mobile phone.
Wireless/wired communication network 120 couples together user terminal 110, web server 130, group search server 140, index server 150, makes them can use wired or wireless mode to repeat the data of sending and receiving between them.
Web server 130 is typical webservers, comprises a plurality of computer systems or computer software that various information are provided with form web page.The webserver refers to a computer system and computer software (network server program); It is connected to a sub-cells, and passes through computer network, for example Intranet or the Internet with other webservers; Communicate, receive the operation request and operation result is provided.Yet except network server program, the webserver should be interpreted as and comprise and operate in the application program on the webserver and store superincumbent various database.The webserver is embodied in according to operating system, for example DOS, Windows, Linux, UNIX or MacOS, and use corresponding network server program.
Index server 150 is carried out a data collection procedure, and normally data are collected from the web server 130 that is connected to Wireless/wired communication network 120 by a web robot.Index server 150 is the data of renewal collection regularly, and index data base 151 uses upset files or similar mechanism to deposit the data of collecting.
Group search server 140 communicates to read network data with index server 150 and index data base 151, and the positional information that group search server 140 is gone back the phase-split network data is to create multiple patterns of position information.Positional information is meant the internet paths that comprises the network data of collecting.It preferably includes the URL (URLs) of network data.Its analysis contact between patterns of position information is to carry out division operation.Said process can comprise a URL scheme-tree of use and be created in a contact between a plurality of different URL patterns, comprise that also the webpage to having identical URL mode packet thresholding divides into groups.Selectively or additionally, the process of creating with the URL mode packet can comprise predetermined URL pattern dictionary of reference.
Group search server 140 is extracted in the index in the web page group units, creates index information and URL information by the webpage of index reference, and in group search db 141, stores index information and URL information.When group search server 140 when user terminal receives an inquiry and an information search demand, it will inquire about with searching for and compare with the information of establishment about group search result.Group search result can be transferred into user terminal 110 with other Search Results about inquiry.Group search server 140 will be described in detail with reference to Fig. 2.
Even group search server 140 does not receive the group search result about inquiry from the user, it also can be used to provide a group search result about a definite key word.For example, it can use a higher levels of notion that comprises user inquiring or one about the key word of confirming of user inquiring so that a group search result to be provided.Further, it can use a key word about information so that a group search result to be provided.
Group search db 141 stores the index information and the positional information (comprising URL information) of group of web, and these information are by 140 establishments of group search server.The centre word of its storage group further.Database is meant the data structure that in the memory block of computer system, forms through DBMS (data base management system (DBMS)) program, and data are obtained, delete, edit and add therein.Database can use a relevant DBMS and be adapted to the present invention, for example, and Oracle, Informix, Sybase, MS SQL (Microsoft's SQL), or the data base management system (DBMS) of DB2.Database comprises storage, obtains, deletes, edits and adds required territory of data and element.Further, group search db 141 can be separated from each other with index data base 151, or is complete one.
Fig. 2 according to one embodiment of present invention, the block scheme of a group search server.
Group search server 140 is the webservers that comprise a web page collection module 210, URL pattern generation module 220, webpage grouping module 230, index management module 240, a searching and managing module 250 and a controller 260.
Web page collection module 210 is visited web server 130 to collect data through Wireless/wired communication network.Web page collection module 210 can optionally be included in the group search server 140, and by the variation of the data of positional information institute reference, this positional information is collected and be stored in the index data base 151 by index server 150 with reflection.
The URLs of the webpage that URL pattern generation module 220 analyzer-controllers 260 or web page collection module 210 are required is to create the URL pattern.The URL pattern is meant the preassigned pattern of the URL of webpage, and it is created with management and has a web pages of identical content or a web pages of being write as with same pattern.In the present invention, same web page is grouped and is managed to be used for information search.At this moment, the URL pattern is used as a standard selecting same web page.
The URLs of the webpage that URL pattern generation module 220 analyzer-controllers 260 or web page collection module 210 receives comprises the URL pattern of packet domain with establishment.For example, in the SayClub home page server that is provided by Neowiz company, the URL of the representative page or leaf of each ID (identity) is analyzed, and ID is set to a packet domain, has therefore created a http://hompy.
The URL pattern of sayclub.com/ [ID].The URL pattern will be described in detail with reference to Fig. 3.Except packet domain, the URL pattern can (this masterplate be shared by two webpages or web page contents for HyperText Markup Language, HTML) masterplate and creating based on HTTP.
HTML Templates are meant normally used foundation structure, so that webpage can be easy to be written into.For example, it is write with label form, as<table...><tD>[text number]</TD><tD>[title]</TD>...</TABLE>, it is usually used in writing webpage.
A html file that is written as webpage typically is the combination of a html tag and a text, and it observes the grammer of HTML.Html file is made up of a plurality of functional blocks, like, menu block, be used for contiguous block and a message block that is used for content of linking to each other with other portal sites.Therefore functional block and writes to make things convenient for the user with masterplate in being usually used in webpage.
The webpage of being created by same operating parts can be contained in a plurality of webpages of being managed by the web server, and this server provides board service, blog services, minimized homepage service and analog thereof.That is, a plurality of webpages of sharing identical HTML Templates trend towards being created by the identical operations part, and trend towards comprising identical content.
Because the web server 130 that board service, blog services is provided and has minimized the homepage service uses identical HTML Templates writing maximum webpage of being managed by web server 130, so the webpage of being managed by identical web server 130 is shared same HTML Templates.Correspondingly, the webpage of shared same HTML Templates can have same URL pattern.
Based on the contact of passing through UP tree information between the required URL pattern, 230 pairs of different URL patterns of being created by URL pattern generation module 220 of webpage grouping module are divided into groups, and in the URL modal sets, the webpage with same packet domain are divided into groups.Promptly; 230 pairs of URL patterns of webpage grouping module are divided into groups; This URL pattern is different with the URL pattern of being created by URL pattern generation module 220; But they are the phase simple crosscorrelation again, and based on the contact of passing through UP tree information between the required URL pattern, webpage grouping module 230 is divided into groups to the webpage with identical URL mode packet thresholding in the URL modal sets.
For example, be registered in that the URLs of webpage can be summarised as about 20 kinds of different URL patterns in the SayClub homepage.Based on UP tree information, these 20 kinds different URL patterns are grouped in one single group.In them, the webpage with same ID is grouped in the group of web as a grouping thresholding.Correspondingly, when the webpage that is registered in the SayClub homepage is divided into groups according to ID, the packet count of webpage is equal to the quantity of the ID that is registered in the SayClub homepage.Further, this can be applied to be registered in the webpage of Naver blog with being equal to, so that the packet count of webpage is equal to the quantity of the ID that is registered in the Naver blog.
Yet in the present invention, the standard of the webpage that is used to divide into groups is not limited to the grouping thresholding.For example, can through packet domain is carried out " with " or OR operation and webpage is divided into groups.The present invention can comprise further the contact between an index and respective sets is estimated that page group is segmented or changed, this index is extracted by index management module 240.For example, when the index from the page group extraction related to two or more territory, the page can be integrated into a group or organized by the two or more sons of segmentation based on the territory.When representing its content improperly from an index of one group of page extraction, this group can be by deletion to produce a reliable Search Results.
Index management module 240 is extracted an index from a page group of being divided into groups by webpage grouping module 230, and the index information and the URL information that are stored in webpage in the group search db 141.That is, index management module 240 is extracted an index with the establishment index information from a page group, and in the index data base 151 of group search db 141, stores index information.In addition, index management module 240 use UP tree information with the URL information of creating group of web and group search db 141 in storage URL information.
From inquiry of user terminal 110 receptions or key word the time, searching and managing module 250 search index databases 151 receive the group of web information and establishment group search result with matching inquiry index from group search db 141.Coupling between inquiry or key word and index can be carried out through using specific terms dictionary or total information (MI) value.In addition, can use known algorithm and carry out.
Controller 260 control web page collection module 210, URL pattern generation module 220, webpage grouping module 230, index management module 240 and searching and managing module 250 are so that group search server can use a web pages to inquire about.In addition, controller and index server 150 carry out communication with index data 151, receive query search request signal and send group search result from user terminal 110.
Fig. 3 and Fig. 4 are according to one embodiment of present invention, to the synoptic diagram that makes an explanation of URL pattern and UP tree.
Fig. 3 has explained the URL of the user home page of use Neowiz SayClub homepage (after this http://hompy.sayclub.com is referred to as hompy) service, with and the URL of related pages.User home page comprises that some contain the webpage of its ID at user URL.At SayClub hompy, URL is shown as the inquiry form, like " ◎ " symbol of being followed by " name variable=variate-value ".Correspondingly, in Fig. 3, when the value of following " targetmsr1=" was considered to confirm the standard of ID, the URL pattern was by as shown in Figure 4 and create.In addition, in by private blogs service or board service that the portal provided, service provider's domain name can be by a separator follow in order to difference user and bulletin.
Fig. 4 is the pattern through the tree structure of analyzing the URL that the URL of webpage obtains in hompy.With reference to Fig. 3, each webpage comprises user's ID in its URL.Therefore, in the URL of webpage, the part of " ID " can be converted into the packet domain of [ID], and " bulletin type " part that is included in the hompy can be converted into the packet domain of [bulletin type].Even the grouping thresholding has changed, and when the URL browsed content did not change in fact, packet domain can be set to [ignoring] territory, and [ignoring] territory is left in the basket in the process of grouping URL pattern.Based on the variation of grouping thresholding, can through analyze in respective sets file include and get in touch confirm between the packet domain preferentially.
When the URL pattern is created through said process, the URL pattern that is created can be used to summarize all users' of Neowiz hompy webpage.Packet domain can automatically be formed in the process of analyzing the URL address.In private blogs that is provided by portal or community sites or bulletin, the URL pattern is created according to service provider's strategy uniformly.In this case, create and can be through being performed with reference to predetermined URL pattern and path about packet domain to the process of URL mode packet.
Fig. 5 is according to one embodiment of present invention, uses one group of page and the process flow diagram of the method for information search service is provided.
Internet user uses user terminal importing the inquiry of an information search, and send should inquiry and searching request to group search server 140 (operating S410).Operation S410 can be omitted.That is, a group search server can be performed through analyzing storage data, and need not user input query or query requests.After receiving inquiry and search request signal from user terminal 110; The information (comprising address information) that group search server 140 receives about webpage from index data base 151, and (operation S420) collected and compiled to this index data base 151 in advance by 150 of index servers.Group search server 140 optionally operation web page collection module 210 attaches the material of helping to receive from index data base 151.
During this time, according to a preordering method, the web robot program can be performed to receive web page index server 150 and be stored in the index data base 151.
After receiving webpage from index server 150, group search server 140 analyzing web pages are to create URL pattern (S430).
After creating the URL pattern; Based on setting the URL pattern of information acquisition and getting in touch of group of web through UP; And this group of web has the grouping thresholding (operation S440) of same URL pattern in one group of URL pattern, and 140 pairs of different URL patterns of group search server are divided into groups.
Behind the grouping webpage; Group search server 140 is extracted index from the group of web in the group unit; Creating index information and by the URL information (operation S450) of the group of web of index reference, and in the URL information (operating S460) of group search db 150 stored index informations and group of web.
After the URL information of group search db 150 stored index informations and group of web; 140 pairs of index that receive the inquiry at personal terminal, family 110 and be stored in the group search db 150 of group search server compare; Search for, create and send group search result to user terminal 110 (operation S470).
After receiving Search Results from group search server 140, user terminal 110 output Search Results are to display unit.According to the present invention,, also group search service can be provided even inquiry is not exported from the user.
According to the present invention, group search service is grouped into a group of web with a plurality of webpages, and the search entity relevant with this webpage, rather than search packet is contained in a term in the webpage.Search service can be used with board search service.
Recently, board service is widely used on the webpage, and user's registration therein is about the material of customizing messages, the problem of writing information and answer.Board service can comprise the webpage that contains than user search more information.
Correspondingly, when a user input query with request search, representational webpage and the low-level bulletin webpage of sharing about this Query Information are grouped in together and with predesigned order and are provided, rather than the webpage that comprises this inquiry is provided simply.
According to one embodiment of present invention, group search service takes to be board service in the afternoon.Yet the present invention does not limit to so far, but can be applied to the multiple service of group of web to search for of using.
Fig. 6 is according to one embodiment of present invention, explains the synoptic diagram of group search result.
Provide group search result aspect; Its output can be depending on the number of file in user inquiring and key word, the group in proper order, organize in during reality in creation-time or the contact between the popularization degree of increase, group and group file of number of files, and the quantity of single group of said popularization degree such as user capture.In order to estimate this contact, assessment technique can be used, wherein used in respective sets with predetermined term path in, the user uses the frequency of inquiry and key word.Popularization degree can be depending on the number of file polling in respective sets, the data volume of creating in the number of user capture group and the inherent at the fixed time respective sets.
A group search result 530 is promptly exported in input " psp " inquiry in the input window 510 of a user in webpage, and said webpage exports user terminal 110 to so that group search service and selection " search " to be provided.Group search result 530 is classified in classification menu 520 according to " Neo rank order (newly registering order) ".The user can classify to group search result 530 in " related article order " or " popularization degree order " in the classification menu 520.
But the title of group search result 530 display network files, article name etc. are to provide information effectively.Page group information 540 can further comprise the information about the number of the page group classification and the file of including.The inventory 550 that in addition, single file in the single page group can be provided is to make things convenient for the user.Further, can provide sorting item 560 about single page group source-information so that information to be provided effectively.
Though the present invention is described with reference to its illustrative example, it will be appreciated by those skilled in the art that in the scope of following claim, can make the multiple variation on form and the details, and can not break away from protection scope of the present invention.
Utilizability on the industry
The present invention can be applicable to method, system and the server that information search service is provided effectively.

Claims (5)

1. group search server comprises:
Web page collection module is carried out the collecting web page program, in order to receive webpage and to store said webpage;
URL pattern generation module, the URL of the webpage that receives through the analyzing web page collection module are created the URL pattern that is included as the packet domain that webpage divides into groups;
The webpage grouping module, the URL pattern that it utilizes URL pattern generation module to create is grouped into group of web with webpage;
Index management module is extracted index from the group of web that the webpage grouping module is divided into groups, in order to create and to store the URL information of the group of web of index information and the reference of index institute;
The searching and managing module, search index information when receiving inquiry and search request signal, the URL information creating that will have the group of web of the index relevant with said inquiry is a group search result; And
Controller, the control web page collection module, URL pattern generation module, the webpage grouping module, index management module and searching and managing module make group search server can use group of web to accomplish search;
Wherein, URL pattern generation module uses the URL of webpage to generate packet domain.
2. group search server according to claim 1, wherein, URL pattern generation module is created the URL pattern as standard, and said standard is used for preassigned pattern webpage being divided into groups, and this preassigned pattern is shared by the webpage with identical information.
3. group search server according to claim 1; Wherein, Said webpage grouping module is based on the contact of passing through between the URL pattern of URL pattern count information acquisition; Different URL mode packet creating the group of URL pattern, and are grouped into group of web with the webpage of the grouping thresholding with identical URL pattern in the group of URL pattern.
4. group search server according to claim 1, wherein, the webpage that said webpage grouping module will have equal values is grouped into group of web, said value through the packet domain of URL pattern is carried out " with " or the obtaining of OR operation.
5. group search server according to claim 1; Wherein, said index management module is extracted index from the webpage that is contained in the group of web, to create and the storage index information; And create and storage by the URL information of the group of web of index institute reference, so that URL information is corresponding with index.
CN201210008279.9A 2005-03-04 2006-03-03 Server, method and system for providing information search service by using sheaf of pages Active CN102622402B (en)

Applications Claiming Priority (4)

Application Number Priority Date Filing Date Title
KR10-2005-0018309 2005-03-04
KR20050018309 2005-03-04
KR20060020346A KR100671077B1 (en) 2005-03-04 2006-03-03 Server, Method and System for Providing Information Search Service by Using Sheaf of Pages
KR10-2006-0020346 2006-03-03

Related Parent Applications (1)

Application Number Title Priority Date Filing Date
CN2006800066318A Division CN101133415B (en) 2005-03-04 2006-03-03 Server, method and system for providing information search service by using sheaf of pages

Publications (2)

Publication Number Publication Date
CN102622402A true CN102622402A (en) 2012-08-01
CN102622402B CN102622402B (en) 2014-12-03

Family

ID=37623990

Family Applications (2)

Application Number Title Priority Date Filing Date
CN201210008279.9A Active CN102622402B (en) 2005-03-04 2006-03-03 Server, method and system for providing information search service by using sheaf of pages
CN2006800066318A Active CN101133415B (en) 2005-03-04 2006-03-03 Server, method and system for providing information search service by using sheaf of pages

Family Applications After (1)

Application Number Title Priority Date Filing Date
CN2006800066318A Active CN101133415B (en) 2005-03-04 2006-03-03 Server, method and system for providing information search service by using sheaf of pages

Country Status (3)

Country Link
JP (1) JP4769822B2 (en)
KR (1) KR100671077B1 (en)
CN (2) CN102622402B (en)

Families Citing this family (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
BRPI0916939A2 (en) * 2008-08-01 2016-08-09 Google Inc provision of posts for discussion threads in response to a search query
JP5096619B2 (en) 2009-07-15 2012-12-12 ネオパッド インコーポレーション Homepage integrated service providing system and method
WO2015074455A1 (en) * 2013-11-25 2015-05-28 北京奇虎科技有限公司 Method and apparatus for computing url pattern of associated webpage
WO2015200600A1 (en) * 2014-06-25 2015-12-30 Google Inc. Deep links for native applications
CN104158890B (en) * 2014-08-21 2018-05-22 广州品唯软件有限公司 The advisory feedback method and device of e-commerce website
KR101647596B1 (en) * 2015-04-20 2016-08-10 숭실대학교산학협력단 Method and server for providing contents service
CN105045684B (en) * 2015-07-16 2018-06-15 北京京东尚科信息技术有限公司 Index switching and the method and device of index control

Family Cites Families (10)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPH0934911A (en) * 1995-07-18 1997-02-07 Fuji Xerox Co Ltd Information retrieval device
JP2001134616A (en) * 1999-10-25 2001-05-18 Nec Corp Method and system for constructing web information on specific topic
JP2001306947A (en) * 2000-04-20 2001-11-02 Ntt Data Corp System and method for analyzing access and recording medium
AU2001263500A1 (en) * 2000-05-01 2001-11-12 R.R. Donnelley And Sons Company Methods and apparatus for serving a web page to a client device based on printed publications and publisher controlled links
KR20010104871A (en) * 2000-05-16 2001-11-28 임갑철 System for internet site search service having a function of automatic sorting of search results
KR100643979B1 (en) * 2000-05-18 2006-11-13 엘지전자 주식회사 Information providing method for information searching result in an internet
JP2002288074A (en) * 2001-03-28 2002-10-04 Nec Corp Electronic communication system, electronic communication method, and computer program
JP3922693B2 (en) * 2002-06-17 2007-05-30 Necシステムテクノロジー株式会社 Internet information retrieval system
JP4231298B2 (en) * 2003-01-14 2009-02-25 日本電信電話株式会社 Information extraction rule creation system, information extraction rule creation program, information extraction system, and information extraction program
JP2004341942A (en) * 2003-05-16 2004-12-02 Nippon Telegr & Teleph Corp <Ntt> Content classification method, content classification device, content classification program, and storage medium storing content classification program

Also Published As

Publication number Publication date
KR20060096356A (en) 2006-09-11
CN101133415B (en) 2012-03-21
CN102622402B (en) 2014-12-03
KR100671077B1 (en) 2007-01-17
CN101133415A (en) 2008-02-27
JP4769822B2 (en) 2011-09-07
JP2008537809A (en) 2008-09-25

Similar Documents

Publication Publication Date Title
JP4648455B2 (en) Personalized search method and personalized search system
US7797295B2 (en) User content feeds from user storage devices to a public search engine
US6212522B1 (en) Searching and conditionally serving bookmark sets based on keywords
US6691105B1 (en) System and method for geographically organizing and classifying businesses on the world-wide web
US8224788B2 (en) System and method for bookmarking and auto-tagging a content item based on file type
JP4846922B2 (en) Method and system for accessing information on network
US6311194B1 (en) System and method for creating a semantic web and its applications in browsing, searching, profiling, personalization and advertising
US8010532B2 (en) System and method for automatically organizing bookmarks through the use of tag data
CN102521251A (en) Method for directly realizing personalized search, device for realizing method, and search server
CN101133415B (en) Server, method and system for providing information search service by using sheaf of pages
CN101542482B (en) Bookmarks and ranking
US20080010286A1 (en) Server bookmarks
CN103339597A (en) Transforming search engine queries
WO2009001138A1 (en) Search result ranking
CN101866347A (en) Method, system that structural data is searched for and method, the system that makes data item structured and can search for
US20100169756A1 (en) Automated bookmarking
CN101751422A (en) Method, mobile terminal and server for carrying out intelligent search at mobile terminal
CN100414869C (en) Method and system for implementing message subscription through Internet
JP4430598B2 (en) Information sharing system and information sharing method
US7349892B1 (en) System and method for automatically organizing and classifying businesses on the World-Wide Web
CN101551813A (en) Network connection apparatus, search equipment and method for collecting search engine data source
WO2007064174A1 (en) System, apparatus and method for providing shared information by connecting a tag to the internet resource and computer readable medium processing the method
CN101676901A (en) Search dispatching method and search server
US20080021889A1 (en) Server, method and system for providing information search service by using sheaf of pages
CN101788981A (en) Deep web mobile search method, server and system

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C14 Grant of patent or utility model
GR01 Patent grant