CN102622402B - Server, method and system for providing information search service by using sheaf of pages - Google Patents

Info

Publication number
CN102622402B
Authority
CN
China
Prior art keywords
group
webpage
url
web
index
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201210008279.9A
Other languages
Chinese (zh)
Other versions
CN102622402A (en)
Inventor
南世东
愼重熩
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
CHUTNOON Co Ltd
Original Assignee
CHUTNOON Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by CHUTNOON Co Ltd
Publication of CN102622402A
Application granted
Publication of CN102622402B
Anticipated expiration

Landscapes

  • Information Transfer Between Computers (AREA)

Abstract

The invention discloses a server, method, and system for providing an information search service by using a sheaf of pages. The method comprises the following steps: creating a location information pattern for collected data by analyzing the location information at which the collected data is originally located; grouping the collected data into data groups according to the created location information pattern; and selecting, from the data groups, a data group related to a keyword and providing a group search result.

Description

Server, method, and system for providing an information search service using page groups
This application is a divisional of the invention patent application filed on March 3, 2006, with application number 200680006631.8, entitled "Server, method, and system for providing an information search service using page groups".
Technical Field
The present invention relates to an information search service and, more particularly, to a method, system, and server for providing an information search service using page groups.
Background Art
With the development of the Internet, Internet information search technology has advanced considerably, allowing large amounts of information to be processed and accumulated on the network and allowing users to find information quickly and accurately.
Internet information search technology lets users find various kinds of information on the network with a web browser, for example pictures, sound, and moving images. However, search technology has a drawback: as the number of web addresses grows geometrically, it can no longer give users the information they actually need. The most common way to address this problem is to use a search engine.
A search engine is a program designed to help find information stored in computer systems, for example on the World Wide Web, on a public or private network, or on a PC. A search engine uses a search program, such as a search robot or web spider, to build an index of website information and stores the index information in a database. It lets a user ask for content that meets specific criteria (typically content containing a given word or phrase) and returns a list of references that match those criteria.
Search engines use the web index method, the web directory method, and the meta search method. The web index method is the most common. It uses a search program, such as a search robot or web spider, to build an index of website information, stores the index information in a database, lets a user ask for content that meets specific criteria, and returns a list of matching references.
The web directory method compiles a database by classifying Internet pages by topic and hierarchy and then creates paths to the entries. The user selects the entry closest to the information needed and narrows the search scope step by step.
The meta search method is a higher-level form of the web index method. It compiles a list of search engines that offer search services based on the web index method, so that a user can select one of them to run a search.
However, each of these methods has shortcomings. The web directory method cannot deliver substantial search results, because its results cover only a relatively small number of web pages. Searching with the web directory method is also time-consuming, because obtaining the information takes many steps. The web index method and the meta search method overwhelm users with a large number of search results, and the reliability of those results is low, because they return every page that contains the query.
The meta search method and the web index method each use their own algorithms to return web pages of high reliability. Even so, those pages may not give users the information they want, because every page that contains the query is returned.
For example, the search methods described above can return information stored in a single page of a book, but they cannot return information stored across one or more whole books, which makes complex searches impossible. Therefore, to address the low reliability of search results, auxiliary content such as Internet café blogs or information services has been applied to search engines.
Summary of the Invention
Technical Solution
The present invention provides a method, system, and server for providing an information search service that indexes a group of pages meeting specific criteria and searches within that group of pages.
Advantageous Effects
According to the present invention, a user can find information on the Internet quickly and accurately. A set of web pages is analyzed to create location information patterns, the location information patterns are used to group web pages containing similar information into multiple groups, and the pages containing information related to the query are then presented to the user grouped together in the form of one representative page and several lower-level pages.
Brief Description of the Drawings
The above and other features and advantages of the present invention will become clearer from the detailed description of exemplary embodiments with reference to the following drawings:
Fig. 1 is a block diagram of a system for providing an information search service using a group of pages, according to an embodiment of the present invention;
Fig. 2 is a block diagram of a group search server according to an embodiment of the present invention;
Figs. 3 and 4 are schematic diagrams illustrating a URL (uniform resource locator) pattern and a URL pattern tree (UP tree) according to an embodiment of the present invention;
Fig. 5 is a flowchart of a method for providing an information search service using a group of pages, according to an embodiment of the present invention; and
Fig. 6 shows a group search result according to an embodiment of the present invention.
Best Mode for Carrying Out the Invention
According to one aspect of the present invention, there is provided a method of providing a group search service, comprising: (a) creating a location information pattern for collected data by analyzing the location information at which the collected data is originally located; (b) grouping the collected data according to the created location information pattern; and (c) selecting, from the data groups, a data group related to a keyword and providing a group search result.
According to another aspect of the present invention, there is provided a method of providing a group search service in a system that comprises a user terminal that sends a query and outputs search results, a web server that provides a plurality of web pages, and a group search server that receives the query from the user terminal, creates search results, and sends them to the user terminal. The method comprises: (a) receiving a query and a search request signal from the user terminal; (b) receiving web pages from the web server; (c) analyzing the web pages to create a URL pattern and using the URL pattern to assign the web pages to web page groups; (d) extracting an index from the web page groups, creating index information, and creating URL information for the web page groups that the index references; and (e) comparing the query with the index to create a group search result and sending the result to the user terminal.
According to another aspect of the present invention, there is provided a system for providing a group search service by searching for information among a plurality of web pages on a wired/wireless network, the system comprising: a user terminal that browses the web over the wired/wireless communication network, generates a search request by sending a query and a search request signal, receives the group search result corresponding to the request, and outputs the group search result to a display unit; a web server that creates web pages from information and provides the web pages; and a group search server that receives and analyzes the web pages to create URL patterns, uses the URL patterns to group the web pages into web page groups, indexes the web page groups, searches for information within the web page groups, and creates and sends a group search result to the user terminal.
According to another aspect of the present invention, there is provided a group search server comprising: a location information pattern generation module that creates a location information pattern for collected data by analyzing the location information at which the collected data is originally located; a web page grouping module that groups the collected data into data groups according to the created location information pattern; and a controller that selects, from the data groups, a data group related to a keyword and provides a group search result.
According to another aspect of the present invention, there is provided a group search server that receives a query and a search request sent by a user terminal browsing the web over a wired/wireless communication network, searches for information in the web pages provided by a web server, and sends search results to the user terminal, the group search server comprising: a web page collection module that runs a web page collection program to access the wired/wireless communication network, receive from the web server the web pages it holds, and store those web pages; a URL pattern generation module that creates URL patterns by analyzing the web pages received by the web page collection module; a web page grouping module that uses the URL patterns created by the URL pattern generation module to group the web pages into web page groups; an index management module that extracts an index from the web page groups formed by the web page grouping module in order to create and store index information and the URL information of the web page groups that the index references; a search management module that, in response to the received query and search request signal, searches the index information, creates a group search result from the URL information of the web page groups whose index is associated with the query, and sends the group search result to the user terminal; and a controller that controls the web page collection module, the URL pattern generation module, the web page grouping module, the index management module, and the search management module so that the group search server can complete searches by web page group, and that communicates with the user terminal and the web server through the wired/wireless communication network.
Embodiments
Exemplary embodiments of the present invention will now be described in detail with reference to the accompanying drawings.
Fig. 1 is a block diagram of a system for providing an information search service using page groups, according to an embodiment of the present invention.
According to an embodiment of the present invention, the system for providing an information search service using page groups comprises a user terminal 110, a wired/wireless communication network 120, a web server 130, a group search server 140, a group search database 141 (hereinafter, "DB" denotes a database), an index server 150, and an index database 151.
The user terminal 110 accesses the group search server 140 through the wired/wireless communication network 120, sends a query and a search request signal, receives the group search result from the group search server 140, and outputs the group search result to a display unit.
The user terminal 110 includes a wired communication unit containing an Internet modem, for example a very-high-bit-rate digital subscriber line (VDSL) modem or a cable modem, and/or a mobile communication unit containing a mobile communication modem, for example a code division multiple access (CDMA) 2000 modem or a wideband CDMA (W-CDMA) modem. The user terminal 110 uses these communication units to access the group search server 140 through the wired/wireless communication network 120. The user terminal further includes a controller that comprises a memory and a microprocessor. The memory stores web browser programs, which are used to receive the user's query, request an information search, and output the search results to the display unit. The microprocessor controls the operation of the user terminal 110.
Examples of the user terminal 110 include a personal computer (PC), such as a desktop computer or a laptop computer, and a communication terminal, such as a personal digital assistant (PDA), a mobile phone, a personal communications service (PCS) phone, a handheld PC, a global system for mobile communications (GSM) phone, a W-CDMA phone, a CDMA-2000 phone, or a mobile broadband system (MBS) phone.
The wired/wireless communication network 120 connects the user terminal 110, the web server 130, the group search server 140, and the index server 150, and relays the data sent and received among them by wired or wireless means.
The web server 130 is a typical web server and comprises computer systems or computer software that provide various kinds of information in the form of web pages. A web server here means a computer system and computer software (a web server program) that is connected to a subordinate unit, communicates with other web servers over a computer network such as an intranet or the Internet, receives operation requests, and returns operation results. Beyond the web server program itself, however, the term web server should be understood broadly to include the applications running on the web server and the various databases stored on it. The web server is implemented for its operating system, for example DOS, Windows, Linux, UNIX, or MacOS, using the corresponding web server program.
The index server 150 runs a data collection program, typically a web robot, to collect data from the web servers 130 connected to the wired/wireless communication network 120. The index server 150 updates the collected data periodically, and the index database 151 stores the collected data using an inverted file or a similar mechanism.
The group search server 140 communicates with the index server 150 and the index database 151 to read web data, and it analyzes the location information of the web data to create a number of location information patterns. Location information means the Internet path at which the collected web data resides, and it preferably includes the uniform resource locators (URLs) of the web data. The group search server 140 analyzes the relationships between location information patterns in order to perform grouping. This process may include using a URL pattern tree to establish relationships among a number of different URL patterns and grouping the web pages that share the same URL pattern grouping field value. Alternatively or additionally, creating and grouping URL patterns may include consulting a predetermined URL pattern dictionary.
The group search server 140 extracts an index per web page group, creates index information and URL information for the web pages that the index references, and stores the index information and URL information in the group search database 141. When the group search server 140 receives a query and an information search request from the user terminal, it compares the query with the index to create group search result information. The group search result can be sent to the user terminal 110 together with other search results for the query. The group search server 140 is described in detail with reference to Fig. 2.
Even when the group search server 140 has not received a query from a user, it can provide a group search result for a predetermined keyword. For example, it can provide a group search result using a higher-level concept that encompasses a user's query, or using a predetermined keyword related to a user's query. It can also provide a group search result using a keyword related to particular information.
The group search database 141 stores the index information and location information (including URL information) of the web page groups, which are created by the group search server 140. It additionally stores the central keywords of the groups. A database here means a data structure that a database management system (DBMS) program forms in the storage space of a computer system and in which data can be retrieved, deleted, edited, and added. The database can be implemented for the present invention using a relational DBMS such as Oracle, Informix, Sybase, MS SQL (Microsoft SQL), or DB2. The database includes the fields and elements needed to store, retrieve, delete, edit, and add data. The group search database 141 and the index database 151 can be separate from each other or integrated into one.
Fig. 2 is a block diagram of a group search server according to an embodiment of the present invention.
The group search server 140 is a web server that includes a web page collection module 210, a URL pattern generation module 220, a web page grouping module 230, an index management module 240, a search management module 250, and a controller 260.
The web page collection module 210 accesses the web server 130 through the wired/wireless communication network to collect data. The web page collection module 210 may optionally be included in the group search server 140 so as to reflect changes in the data referenced by the location information that the index server 150 has collected and stored in the index database 151.
The URL pattern generation module 220 analyzes the URLs of the web pages requested by the controller 260 or the web page collection module 210 to create URL patterns. A URL pattern is a predetermined pattern in the URLs of web pages; it is created to manage a set of web pages that have the same content or that are written in the same form. In the present invention, such similar web pages are grouped and managed for information search, and the URL pattern serves as the criterion for selecting them.
The URL pattern generation module 220 analyzes the URLs of the web pages received by the controller 260 or the web page collection module 210 to create URL patterns that contain grouping fields. For example, on the SayClub homepage server provided by Neowiz, the URL of the representative page of each ID (identity) is analyzed and the ID is set as a grouping field, which yields the URL pattern http://hompy.sayclub.com/[ID]. URL patterns are described in detail with reference to Fig. 3. In addition to grouping fields, a URL pattern can be created based on a HyperText Markup Language (HTML) template that two web pages share, or based on web page content.
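The conversion from a concrete URL to a pattern with a grouping field can be sketched as follows. This is only an illustration under stated assumptions: the function name `url_to_pattern` is hypothetical, the user ID is assumed to be the first path segment, and the example IDs and paths in the usage note are invented.

```python
from urllib.parse import urlsplit


def url_to_pattern(url: str) -> tuple[str, str]:
    """Return (URL pattern, grouping field value) for a URL.

    Assumption for illustration: the first path segment is the user ID,
    so it is replaced by the grouping field placeholder [ID].
    """
    parts = urlsplit(url)
    segments = [s for s in parts.path.split("/") if s]
    if not segments:
        raise ValueError("URL has no path segment to use as an ID")
    user_id = segments[0]
    rest = "/".join(segments[1:])
    pattern = f"{parts.scheme}://{parts.netloc}/[ID]"
    if rest:
        pattern += f"/{rest}"
    return pattern, user_id
```

Two pages that belong to different users, such as `http://hompy.sayclub.com/alice` and `http://hompy.sayclub.com/bob`, then map to the same pattern and differ only in the grouping field value.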
An HTML template is a commonly used basic structure that makes web pages easy to write. For example, it is written in tag form, such as <TABLE ...><TD>[text number]</TD><TD>[title]</TD>...</TABLE>, which is commonly used in writing web pages.
An HTML file written as a web page is typically a combination of HTML tags and text that follows HTML syntax. An HTML file consists of several functional blocks, such as a menu block, a link block for connecting to other portal sites, and an information block for content. Functional blocks are therefore used frequently within web pages and are written from templates for the user's convenience.
Web pages created by the same operator may be among the many web pages managed by a web server that provides bulletin board services, blog services, mini homepage services, and the like. That is, web pages that share the same HTML template tend to be created by the same operator and tend to contain the same kind of content.
Because a web server 130 that provides bulletin board, blog, and mini homepage services uses the same HTML template to write most of the web pages it manages, the web pages managed by the same web server 130 share the same HTML template. Accordingly, web pages that share the same HTML template can have the same URL pattern.
Based on the relationships between URL patterns obtained from UP tree information, the web page grouping module 230 groups the different URL patterns created by the URL pattern generation module 220 and, within a URL pattern group, groups the web pages that have the same grouping field value. That is, the web page grouping module 230 groups URL patterns that differ from one another but are nevertheless related, and then, within each URL pattern group, groups the web pages that share the same URL pattern grouping field value.
For example, the URLs of the web pages registered on the SayClub homepage can be summarized into about 20 different URL patterns. Based on UP tree information, these 20 different URL patterns are placed in a single group. Among them, web pages with the same user ID as the grouping field value are placed in one web page group. Accordingly, when the web pages registered on the SayClub homepage are grouped by user ID, the number of web page groups equals the number of user IDs registered on the SayClub homepage. The same applies to web pages registered on Naver blog, where the number of web page groups equals the number of user IDs registered on Naver blog.
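Grouping by grouping field value can be illustrated with a short sketch. It assumes each page has already been reduced to a (URL pattern, user ID, URL) record and that all patterns belong to one URL pattern group; the function name `group_pages` and the sample records are hypothetical.

```python
from collections import defaultdict


def group_pages(pages):
    """Group page URLs by their grouping field value (the user ID).

    `pages` is an iterable of (url_pattern, user_id, url) records whose
    patterns are assumed to belong to the same URL pattern group.
    """
    groups = defaultdict(list)
    for _pattern, user_id, url in pages:
        groups[user_id].append(url)
    return dict(groups)


pages = [
    ("http://hompy.sayclub.com/[ID]", "alice", "http://hompy.sayclub.com/alice"),
    ("http://hompy.sayclub.com/[ID]/photo", "alice", "http://hompy.sayclub.com/alice/photo"),
    ("http://hompy.sayclub.com/[ID]", "bob", "http://hompy.sayclub.com/bob"),
]
web_page_groups = group_pages(pages)
```

In this sketch the number of web page groups equals the number of distinct user IDs, which mirrors the SayClub example in the text.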
In the present invention, however, the criterion for grouping web pages is not limited to the grouping field value. For example, web pages can be grouped by applying AND or OR operations to grouping fields. The present invention may further include evaluating the relationship between an index extracted by the index management module 240 and the corresponding group, and subdividing or changing the page group accordingly. For example, when the index extracted from a page group relates to two or more domains, the pages can be merged into one group or subdivided into two or more subgroups by domain. When the index extracted from a page group does not represent its content properly, the group can be deleted to produce reliable search results.
Index management module 240 is extracted an index from a page group by 230 groupings of webpage grouping module, and the index information and the URL information that are stored in group searching database 141 interior webpages.That is, index management module 240 is extracted an index to create index information from page group, and at the interior storage index information of index data base 151 of group searching database 141.In addition, index management module 240 is used UP tree information with the URL information of establishment group of web with in the interior storage of group searching database 141 URL information.
When a query or keyword is received from the user terminal 110, the search management module 250 searches the index database 151, receives from the group search database 141 the information on web page groups whose index matches the query, and creates a group search result. The query or keyword can be matched against the index using a specialized term dictionary or mutual information (MI) values. Other known algorithms can also be used.
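The lookup step can be sketched as below. Note that this sketch swaps in plain term overlap as the matching rule; it does not implement the term dictionary or mutual information matching mentioned above. The function name `search_groups` and the sample index are hypothetical.

```python
import re


def search_groups(query, index, group_urls):
    """Return web page groups whose index terms match the query.

    `index` maps terms to sets of group IDs and `group_urls` maps group
    IDs to page URLs. Groups are scored by the number of distinct query
    terms they match (an assumed, simplified matching rule).
    """
    terms = dict.fromkeys(re.findall(r"\w+", query.lower()))
    scores = {}
    for term in terms:
        for group_id in index.get(term, ()):
            scores[group_id] = scores.get(group_id, 0) + 1
    # Highest score first; ties broken by group ID for a stable order.
    ranked = sorted(scores, key=lambda gid: (-scores[gid], gid))
    return [
        {"group": gid, "score": scores[gid], "urls": group_urls[gid]}
        for gid in ranked
    ]


sample_index = {"psp": {"alice", "bob"}, "review": {"alice"}}
sample_urls = {
    "alice": ["http://hompy.sayclub.com/alice"],
    "bob": ["http://hompy.sayclub.com/bob"],
}
results = search_groups("PSP review", sample_index, sample_urls)
```

Each result carries the URLs of the whole group, so a representative page and its lower-level pages can be presented together.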
The controller 260 controls the web page collection module 210, the URL pattern generation module 220, the web page grouping module 230, the index management module 240, and the search management module 250 so that the group search server can perform searches using web page groups. In addition, the controller communicates with the index server 150 and the index database 151, receives the query and search request signal from the user terminal 110, and sends the group search result.
Figs. 3 and 4 are schematic diagrams illustrating a URL pattern and a UP tree according to an embodiment of the present invention.
Fig. 3 shows the URL of a user's homepage on the Neowiz SayClub homepage service (http://hompy.sayclub.com, hereinafter "hompy") together with the URLs of its related pages. A user's homepage comprises several web pages whose URLs contain the user's ID. On SayClub hompy, URLs are expressed in query form, that is, a "?" symbol followed by "variable name=variable value". Accordingly, when the value following "targetmsr1=" in Fig. 3 is taken as the criterion for identifying the user ID, the URL patterns are created as shown in Fig. 4. In addition, in a personal blog service or bulletin board service provided by a portal, the service provider's domain name can be followed by a separator that distinguishes users and bulletin boards.
Fig. 4 shows the tree-structured URL patterns obtained by analyzing the URLs of web pages on hompy. As Fig. 3 shows, each web page contains the user's ID in its URL. The "user ID" part of a web page's URL can therefore be converted into the grouping field [ID], and the "bulletin type" part contained in hompy URLs can be converted into the grouping field [bulletin type]. When changing a grouping field value does not change the content viewed at the URL, the grouping field can be set as an [ignore] field, and [ignore] fields are disregarded when URL patterns are grouped. The priority among grouping fields can be determined by analyzing, as grouping field values change, the inclusion relationships among the files in the corresponding group.
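The handling of [ID], [bulletin type], and [ignore] fields in query-form URLs can be sketched as follows. Only the parameter name `targetmsr1` comes from the text; the parameter names `board` and `page`, the path `/list.nwz`, and the function name `query_url_to_pattern` are hypothetical and chosen purely for illustration.

```python
from urllib.parse import parse_qsl, urlsplit

# Assumed mapping from query parameter names to grouping field roles.
FIELD_ROLES = {
    "targetmsr1": "[ID]",
    "board": "[bulletin type]",
    "page": "[ignore]",
}


def query_url_to_pattern(url, roles=FIELD_ROLES):
    """Return (URL pattern, grouping field values) for a query-form URL.

    Parameters with a grouping role are replaced by their placeholder,
    [ignore] parameters are dropped, and all others are kept verbatim.
    """
    parts = urlsplit(url)
    pattern_params = []
    field_values = {}
    for name, value in parse_qsl(parts.query, keep_blank_values=True):
        role = roles.get(name)
        if role == "[ignore]":
            continue
        if role is None:
            pattern_params.append(f"{name}={value}")
        else:
            pattern_params.append(f"{name}={role}")
            field_values[role] = value
    pattern = f"{parts.scheme}://{parts.netloc}{parts.path}"
    if pattern_params:
        pattern += "?" + "&".join(pattern_params)
    return pattern, field_values
```

Under these assumptions, two URLs that differ only in the ignored `page` parameter yield the same pattern and the same grouping field values, so they fall into the same web page group.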
Once URL patterns have been created by the above process, they can be used to summarize the web pages of all users of Neowiz hompy. Grouping fields can be formed automatically while URL addresses are analyzed. In personal blogs or bulletin boards provided by portals or community sites, URL patterns are created uniformly according to the service provider's policy. In such cases, creating and grouping URL patterns can be carried out by consulting predetermined URL patterns and paths for the grouping fields.
Fig. 5 is a flowchart of a method for providing an information search service using a group of pages, according to an embodiment of the present invention.
An Internet user uses the user terminal to enter a query for an information search and sends the query and a search request to the group search server 140 (operation S410). Operation S410 can be omitted; that is, the group search service can be performed by analyzing stored data, without a user entering a query or search request. On receiving the query and search request signal from the user terminal 110, the group search server 140 receives from the index database 151 the information on web pages (including address information) that the index server 150 has collected and compiled in advance (operation S420). The group search server 140 can optionally run the web page collection module 210 to supplement the material received from the index database 151.
Meanwhile, a web robot program can be run according to a predetermined method so that the index server 150 receives web pages and stores them in the index database 151.
On receiving the web pages from the index server 150, the group search server 140 analyzes them to create URL patterns (operation S430).
After creating the URL patterns, the group search server 140 groups the different URL patterns based on the relationships between URL patterns obtained from UP tree information, and groups the web pages that have the same URL pattern grouping field value within a URL pattern group into web page groups (operation S440).
After grouping the web pages, the group search server 140 extracts an index from each web page group to create index information and the URL information of the web page groups that the index references (operation S450), and stores the index information and the URL information of the web page groups in the group search database 141 (operation S460).
After storing the index information and the URL information of the web page groups in the group search database 141, the group search server 140 compares the query received from the user terminal 110 with the index stored in the group search database 141, performs the search, creates a group search result, and sends it to the user terminal 110 (operation S470).
On receiving the search result from the group search server 140, the user terminal 110 outputs it to the display unit. According to the present invention, the group search service can be provided even when no query is input by a user.
According to the present invention, the group search service groups a plurality of web pages into a web page group and searches for the entity associated with those web pages, rather than searching for a term contained in a web page. The search service can be used together with a bulletin board search service.
Recently, bulletin board services have come into wide use on the web; users register material about particular information there, post questions, and write answers. A bulletin board service may contain web pages with more information than a user is searching for.
Accordingly, when a user enters a query to request a search, a representative web page and the lower-level bulletin web pages that share information about the query are grouped together and provided in a predetermined order, rather than simply providing the web pages that contain the query.
According to one embodiment of present invention, in group searching service in the afternoon, take as board service.Yet the present invention does not limit to so far, but can be applied to, use the many services of group of web to search for.
Fig. 6 is a schematic diagram illustrating a group search result according to one embodiment of the present invention.
In providing the group search result, the output order may depend on the relation between the user's query and the keywords, the number of documents in a group, the increase in the number of documents in the group during a given period, the creation time of the group and of the documents in the group, or popularity, such as the number of users who access a single group. To evaluate the relation, an evaluation technique may be used that considers how frequently the user's query and keywords are used in the corresponding group and in a predetermined path. Popularity may depend on the number of queries for documents in the corresponding group, the number of users who access the group, and the amount of data created in the corresponding group within a given time.
When a user enters the query "psp" in the input window 510 of a web page that is output to the user terminal 110 to provide the group search service, and selects "search", a group search result 530 is output. The group search result 530 is sorted according to "Neo rank order (newly registered order)" in the sort menu 520. The user can also sort the group search result 530 by "related article order" or "popularity order" in the sort menu 520.
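The three sort orders of the sort menu 520 can be sketched as interchangeable sort keys. This is an illustrative sketch only: the field names, the precomputed relevance score, and the unweighted popularity sum are assumptions, since the source names the factors but gives no formula.

```python
from dataclasses import dataclass

@dataclass
class GroupResult:
    name: str
    created: int      # creation time as a Unix timestamp (assumed representation)
    relevance: float  # assumed precomputed query/keyword relation score
    visits: int       # number of user visits to the group
    new_docs: int     # documents created in the group in the given period

def popularity(g):
    """Assumed popularity score: unweighted sum of visits and recent documents."""
    return g.visits + g.new_docs

# Hypothetical keys standing in for the three sort menu entries.
SORT_KEYS = {
    "newest": lambda g: g.created,
    "relevance": lambda g: g.relevance,
    "popularity": popularity,
}

def sort_results(results, order="newest"):
    """Sort group results in descending order of the chosen key."""
    return sorted(results, key=SORT_KEYS[order], reverse=True)

results = [
    GroupResult("psp-tips", created=1_100_000_000, relevance=0.9, visits=50, new_docs=5),
    GroupResult("psp-news", created=1_140_000_000, relevance=0.4, visits=300, new_docs=20),
]
```

Calling `sort_results(results, "relevance")` puts the hypothetical "psp-tips" group first, while the default newest-first order and the popularity order both put "psp-news" first.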
The group search result 530 can display web document titles, article names, and the like to provide information effectively. Page group information 540 can further include information about the category of the page group and the number of documents it contains. In addition, a list 550 of the individual documents in a single page group can be provided for the user's convenience. Further, a sort item 560 concerning the source information of a single page group can be provided to present information effectively.
Although the present invention has been described with reference to its illustrative embodiments, those skilled in the art will appreciate that various changes in form and detail may be made within the scope of the following claims without departing from the scope of protection of the present invention.
Industrial Applicability
The present invention is applicable to methods, systems, and servers that provide an information search service effectively.

Claims (5)

1. A group search server, comprising:
A web page collection module that executes a web page collection program to receive web pages and store the web pages;
A URL pattern generation module that creates, by analyzing the URLs of the web pages received by the web page collection module, a URL pattern including a grouping field for grouping the web pages;
A web page grouping module that groups the web pages into web page groups using the URL pattern created by the URL pattern generation module;
An index management module that extracts indexes from the web page groups formed by the web page grouping module, and creates and stores index information and the URL information of the web page groups referenced by the indexes;
A search management module that, on receiving a query and a search request signal, searches the index information and creates, as a group search result, the URL information of the web page groups having indexes related to the query; and
A controller that controls the web page collection module, the URL pattern generation module, the web page grouping module, the index management module, and the search management module so that the group search server performs searches by web page group;
Wherein the URL pattern generation module generates the grouping field using the URLs of the web pages.
2. The group search server according to claim 1, wherein the URL pattern generation module creates the URL pattern as a standard for grouping web pages by a predetermined pattern, the predetermined pattern being shared by web pages having the same information.
3. The group search server according to claim 1, wherein the web page grouping module groups different URL patterns, based on the relations among the URL patterns obtained from URL pattern tree information, to create groups of URL patterns, and groups web pages having the same grouping field value of the URL patterns within a group of URL patterns into a web page group.
4. The group search server according to claim 1, wherein the web page grouping module groups web pages having the same value into a web page group, the value being obtained by performing an AND or OR operation on the grouping fields of the URL pattern.
5. The group search server according to claim 1, wherein the index management module extracts indexes from the web pages contained in a web page group to create and store the index information, and creates and stores the URL information of the web page group referenced by the indexes so that the URL information corresponds to the indexes.
CN201210008279.9A 2005-03-04 2006-03-03 Server, method and system for providing information search service by using sheaf of pages Active CN102622402B (en)

Applications Claiming Priority (4)

Application Number Priority Date Filing Date Title
KR10-2005-0018309 2005-03-04
KR20050018309 2005-03-04
KR10-2006-0020346 2006-03-03
KR20060020346A KR100671077B1 (en) 2005-03-04 2006-03-03 Server, Method and System for Providing Information Search Service by Using Sheaf of Pages

Related Parent Applications (1)

Application Number Title Priority Date Filing Date
CN2006800066318A Division CN101133415B (en) 2005-03-04 2006-03-03 Server, method and system for providing information search service by using sheaf of pages

Publications (2)

Publication Number Publication Date
CN102622402A CN102622402A (en) 2012-08-01
CN102622402B true CN102622402B (en) 2014-12-03

Family

ID=37623990

Family Applications (2)

Application Number Title Priority Date Filing Date
CN2006800066318A Active CN101133415B (en) 2005-03-04 2006-03-03 Server, method and system for providing information search service by using sheaf of pages
CN201210008279.9A Active CN102622402B (en) 2005-03-04 2006-03-03 Server, method and system for providing information search service by using sheaf of pages

Country Status (3)

Country Link
JP (1) JP4769822B2 (en)
KR (1) KR100671077B1 (en)
CN (2) CN101133415B (en)

Families Citing this family (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2010014954A2 (en) * 2008-08-01 2010-02-04 Google Inc. Providing posts to discussion threads in response to a search query
CN102105875B (en) 2009-07-15 2013-05-01 呢哦派豆株式会社 System and method for providing a consolidated service for a homepage
WO2015074455A1 (en) * 2013-11-25 2015-05-28 北京奇虎科技有限公司 Method and apparatus for computing url pattern of associated webpage
EP3161678B1 (en) 2014-06-25 2020-12-16 Google LLC Deep links for native applications
CN104158890B (en) * 2014-08-21 2018-05-22 广州品唯软件有限公司 The advisory feedback method and device of e-commerce website
KR101647596B1 (en) * 2015-04-20 2016-08-10 숭실대학교산학협력단 Method and server for providing contents service
CN105045684B (en) * 2015-07-16 2018-06-15 北京京东尚科信息技术有限公司 Index switching and the method and device of index control

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP1158422A2 (en) * 2000-05-16 2001-11-28 LAS21 Co., Ltd. Internet site search service system and method having an automatic classification function of search results
KR20010105842A (en) * 2000-05-18 2001-11-29 구자홍 Information providing method for information searching result in an internet
CN1439135A (en) * 2000-05-01 2003-08-27 R.R.唐纳利父子公司 Methods and apparatus for serving a web page to a client device based on printed publications and publisher controlled links

Family Cites Families (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPH0934911A (en) * 1995-07-18 1997-02-07 Fuji Xerox Co Ltd Information retrieval device
JP2001134616A (en) * 1999-10-25 2001-05-18 Nec Corp Method and system for constructing web information on specific topic
JP2001306947A (en) * 2000-04-20 2001-11-02 Ntt Data Corp System and method for analyzing access and recording medium
JP2002288074A (en) * 2001-03-28 2002-10-04 Nec Corp Electronic communication system, electronic communication method, and computer program
JP3922693B2 (en) * 2002-06-17 2007-05-30 Necシステムテクノロジー株式会社 Internet information retrieval system
JP4231298B2 (en) * 2003-01-14 2009-02-25 日本電信電話株式会社 Information extraction rule creation system, information extraction rule creation program, information extraction system, and information extraction program
JP2004341942A (en) * 2003-05-16 2004-12-02 Nippon Telegr & Teleph Corp <Ntt> Content classification method, content classification device, content classification program, and storage medium storing content classification program

Also Published As

Publication number Publication date
CN101133415B (en) 2012-03-21
KR100671077B1 (en) 2007-01-17
KR20060096356A (en) 2006-09-11
CN102622402A (en) 2012-08-01
JP2008537809A (en) 2008-09-25
JP4769822B2 (en) 2011-09-07
CN101133415A (en) 2008-02-27

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C14 Grant of patent or utility model
GR01 Patent grant