CN103034662A - Database establishment device, database establishment method, search application integration system and search application integration method - Google Patents

Database establishment device, database establishment method, search application integration system and search application integration method Download PDF

Info

Publication number
CN103034662A
CN103034662A CN2011103048367A CN201110304836A CN103034662A CN 103034662 A CN103034662 A CN 103034662A CN 2011103048367 A CN2011103048367 A CN 2011103048367A CN 201110304836 A CN201110304836 A CN 201110304836A CN 103034662 A CN103034662 A CN 103034662A
Authority
CN
China
Prior art keywords
search
application information
search application
database
domain name
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN2011103048367A
Other languages
Chinese (zh)
Other versions
CN103034662B (en
Inventor
张军
钟朝亮
李邵明
松尾昭彦
邹纲
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Fujitsu Ltd
Original Assignee
Fujitsu Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Fujitsu Ltd filed Critical Fujitsu Ltd
Priority to CN201110304836.7A priority Critical patent/CN103034662B/en
Publication of CN103034662A publication Critical patent/CN103034662A/en
Application granted granted Critical
Publication of CN103034662B publication Critical patent/CN103034662B/en
Expired - Fee Related legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Landscapes

  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

The invention provides a database establishment device and a database establishment method for establishing a search application information database as well as a search application integration device and a search application integration method, which aims at overcoming the problems in the prior art that different application preferences and different application requirements of different users cannot be simultaneously met. The database establishment device comprises a browse session recognition unit, a search session judgment unit and a database establishment unit, wherein the browse session recognition unit is used for recognizing a browse session in a browse history; the search session judgment unit is used for judging whether the browse session is a search session or not; and the database establishment unit is used for acquiring search application information in the obtained search session and establishing the search application information database according to the obtained search application information. The integration device comprises the establishment device and also comprises an application integration unit and an interface unit, and the integration device is used for searching, integrating and displaying. By utilizing the database establishment device and method as well as the search application integration device and method, the application preference and application demand of the user can be more adequately matched.

Description

Integration system and method are used in database construction device and method, search
Technical field
Present invention relates in general to the Web application, more specifically, the present invention relates to a kind of for making up search application information wide area information server construction device and method and search application integration system and method.
Background technology
The integration (Mashup) that Web uses is a kind of technology that is used for several Web set of applications are created altogether new function.Integration can be defined as by extraction and pooled data and function from different Web uses to use the new function of interpolation to Web, to support user's needs and task.
It is that the function of search that will use from several different Web search and/or Search Results combine to support user's search to need and the integration technology of task that search use to be integrated.In traditional method, after deliberation to one group of integration of specifically searching for application.The Search Results that several search commonly used such as (Yahoo), Bing and Ask are used is with the Search Results through integrating that provides these several search to use to the terminal user.In traditional search application integration method and system, usually for comparatively popular, use widely several search using artificial ground to integrate, and different terminal users always uses identical search to use integrated database, can not be according to user's demand, use preference etc. that search is used to integrate to carry out personalized customization.
Summary of the invention
Provided hereinafter about brief overview of the present invention, in order to basic comprehension about some aspect of the present invention is provided.Should be appreciated that this general introduction is not about exhaustive general introduction of the present invention.It is not that intention is determined key of the present invention or pith, neither be intended to limit scope of the present invention.Its purpose only is that the form of simplifying provides some concept, with this as the in greater detail preorder of discussing after a while.
Defects in view of prior art, one of purpose of the present invention provides a kind of for making up search application information wide area information server construction device and method and integration system and method are used in search, to overcome at least the use preference that can not satisfy simultaneously different user that exists in the prior art and the problem of demand.
To achieve these goals, according to an aspect of the present invention, provide a kind of for making up search application information wide area information server construction device, comprise: the browsing session recognition unit, it is arranged to based on user's browsing history and time of origin thereof and identifies browsing session in the browsing histories; The search sessions identifying unit, it is arranged to according to the relevance between the parameter attribute of the record in the browsing session and record judges whether browsing session is search sessions; And the database construction unit, it is arranged to according to the search sessions of judging and obtains search application information in the search sessions, and makes up the search application information database based on the search application information that obtains.
According to another aspect of the present invention, also provide a kind of search to use integration system, comprise aforesaid database construction device, also comprise: use integral unit, it is arranged to and utilizes all search application that relate in the constructed search application information database of database construction device that the keyword of user's input is searched for, and obtains the integration Search Results that the Search Results that all search are used is combined; And interface unit, it is arranged to the demonstration inputting interface, receives the keyword of user's input, and shows above-mentioned integration Search Results.
According to another aspect of the present invention, also provide a kind of for making up search application information wide area information server construction method, having comprised: identified browsing session in the browsing histories based on user's browsing history and time of origin thereof; Judge according to the parameter attribute of the record in the browsing session and the relevance between record whether browsing session is search sessions; And obtain search application information in the search sessions according to the search sessions of judging, and make up the search application information database based on the search application information that obtains.
According to another aspect of the present invention, a kind of search application integration method also is provided, comprise aforesaid database construction method, also comprise: the keyword that receives user's input, and utilize by all search application that relate in the constructed search application information database of database construction method the keyword of user's input is searched for, obtain the integration Search Results that the Search Results that all search are used is combined.
According to other side of the present invention, corresponding computer-readable recording medium also is provided, storing on this computer-readable recording medium can be by the computer program of computing equipment execution, and described program can make described computing equipment carry out above-mentioned database construction method or above-mentioned search application integration method when carrying out.
Database construction device and method and search application integrating apparatus and method according to the invention described above embodiment, can realize one of following at least benefit: the browsing histories by digging user forms an integrated search application information database, this database can comprise popular network search engines, can also comprise that having seldom user's search uses; And, because this database and integration are based on user's browsing histories, thus this database and integrate after Search Results all fully use preference and the demand of match user; In addition, the constructive process of database does not need user's participation, so that the user easily brings into use.
By below in conjunction with the detailed description of accompanying drawing to most preferred embodiment of the present invention, these and other advantage of the present invention will be more obvious.
Description of drawings
The present invention can by with reference to hereinafter by reference to the accompanying drawings given description be better understood, wherein in institute's drawings attached, used same or analogous Reference numeral to represent identical or similar parts.Described accompanying drawing comprises in this manual and forms the part of this instructions together with following detailed description, and is used for further illustrating the preferred embodiments of the present invention and explains principle and advantage of the present invention.In the accompanying drawings:
Fig. 1 be schematically illustrated according to the embodiment of the invention, be used for to make up the block scheme of the structure of search application information wide area information server construction device.
Fig. 2 is the block scheme that schematically shows according to the another kind of structure of the database construction device of the embodiment of the invention.
Fig. 3 show from the network agent daily record, obtain, filtering the schematic diagram of an example of browsing history of record ignored.
Fig. 4 is the block diagram of the structure of the schematically illustrated browsing session recognition unit 110 as shown in Fig. 1 and Fig. 2 according to the embodiment of the invention.
Fig. 5 is the schematic diagram that the time distribution of the browsing history of user within a period of time is shown.
Fig. 6 is the block diagram of the structure of the schematically illustrated search sessions identifying unit 120 as shown in Fig. 1 and Fig. 2 according to the embodiment of the invention.
Fig. 7 is the block diagram of the structure of the schematically illustrated database construction unit 130 as shown in Fig. 1 and Fig. 2 according to the embodiment of the invention.
Fig. 8 is the schematically illustrated block diagram that extracts the structure (omitted search application information and extracted the parts of subelement 710 except clicking clauses and subclauses statistical module 800) of subelement 710 according to the search application information shown in Fig. 7 of the embodiment of the invention.
Fig. 9 shows the schematic diagram of the last set application message that obtains in an example according to the database construction device of the embodiment of the invention.
Figure 10 is the block diagram of the another kind of structure of schematically illustrated database construction device according to the embodiment of the invention.
Figure 11 is the schematically illustrated block diagram of using the structure of integration system according to the search of the embodiment of the invention.
Figure 12 is the block diagram that has schematically shown according to the another kind of structure of the integration system of the embodiment of the invention.
Figure 13 is that at integration system according to an embodiment of the invention uses in the example, utilizes the second sortord to integrating the display interface after Search Results sorts.
Figure 14 is the schematically illustrated schematic diagram for the treatment scheme that makes up search application information wide area information server construction method according to the embodiment of the invention.
The schematic diagram of the treatment scheme of the schematically illustrated search application integration method according to the embodiment of the invention of Figure 15.
Figure 16 shows the structure diagram that can be used to realize according to the hardware configuration of a kind of possible messaging device of the database construction device of the embodiment of the invention and method (or integrating apparatus and integration method are used in search).
It will be appreciated by those skilled in the art that in the accompanying drawing element only for simple and clear for the purpose of and illustrate, and not necessarily draw in proportion.For example, the size of some element may have been amplified with respect to other elements in the accompanying drawing, in order to help to improve the understanding to the embodiment of the invention.
Embodiment
In connection with accompanying drawing example embodiment of the present invention is described hereinafter.For clarity and conciseness, all features of actual embodiment are not described in instructions.Yet, should understand, in the process of any this practical embodiments of exploitation, must make a lot of decisions specific to embodiment, in order to realize developer's objectives, for example, meet those restrictive conditions with system and traffic aided, and these restrictive conditions may change to some extent along with the difference of embodiment.In addition, might be very complicated and time-consuming although will also be appreciated that development, concerning the those skilled in the art that have benefited from present disclosure, this development only is routine task.
At this, what also need to illustrate a bit is, for fear of having blured the present invention because of unnecessary details, only show in the accompanying drawings with according to the closely-related apparatus structure of the solution of the present invention and/or treatment step, and omitted other details little with relation of the present invention.
Fig. 1 be schematically illustrated according to the embodiment of the invention, be used for to make up the block scheme of the structure of search application information wide area information server construction device.As shown in Figure 1, database construction device 100 comprises browsing session recognition unit 110, search sessions identifying unit 120 and database construction unit 130.Wherein, browsing session recognition unit 110 is identified browsing session in user's browsing histories based on user's browsing history and time of origin thereof, search sessions identifying unit 120 judges according to the parameter attribute of the record in the browsing session and the relevance between record whether browsing session is search sessions, database construction unit 130 obtains search application information in this search sessions according to the search sessions of judging, and makes up the search application information database based on the search application information that obtains.
At present, Web browser is widely used as the platform that the user uses Web to use, when the user by the browser browsing page or when using certain Web to use, it is a user's browsing history that each action of user (such as certain link of access, perhaps submitting some data etc. to remote server) all can be recorded into.
In an example, user's browsing history can be the original browsing history that directly obtains.Original browsing history can obtain by user end computer, for example, obtains by being installed in the technology such as browser plug-in on the user end computer or Technology of Network Sniffer.In addition, in the situation that the user uses the network agent online, original browsing history also can obtain by the daily record of network agent.
In another example, user's browsing history also can be the result who obtains after by predetermined filtercondition above-mentioned original browsing history being filtered.
For example, Fig. 2 schematically shows the another kind of structure according to the database construction device of the embodiment of the invention.As shown in Figure 2, database construction device 200 also comprises filter element 140 except comprising browsing session recognition unit 110, search sessions identifying unit 120 and database construction unit 130.Filter element 140 is used for filtering out record ignored from the user's who obtains original browsing history, and the browsing history after will filtering is sent to the browsing session recognition unit and processes.Wherein, record ignored refers to unessential those record clauses and subclauses, data etc. in the practical application of the embodiment of the invention.Utilize filter element 140, for example can obtain browsing history as shown in Figure 3.
Fig. 3 show from the network agent daily record, obtain, filtering the schematic diagram of an example of browsing history of record ignored.The URL (being designated hereinafter simply as referer) that in the browsing history that goes out as shown in Figure 3, can comprise access time (time), access method (method), just accessed URL (URL(uniform resource locator)), points to the webpage of this link URL (referer), the information such as type (content-type) of the data content that returns from distance host, in addition, also comprised original HTML (HTML (Hypertext Markup Language)) page that obtains from remote server.The domain name (hostname), request path (that is, the execution script path on the remote server) that can resolve into remote server to URL by service regeulations expression formula or other known technology (requestpath) and the parameter of this request (parameters).Thus, a browsing history can be expressed as form:
SR=(time,method,hostname,requestpath,
parameters,content-type,referer,body)
Wherein, " body " in the following formula is the body part in the record, and expression is from the response content of remote server, the normally form of html source code.For simplicity with clear for the purpose of, omitted the content of " body " in the browsing history illustrated in fig. 3.In addition, access method can comprise GET, POST, PUT, DELETE etc.
In addition, referer is HTTP Referer, when browser sends request to web server, generally can be with referer, tells server from which page link is come, server can obtain whereby some information for the treatment of.For example, be linked to the website of B from the homepage of A, then the server of B can count the website that has every day how many users to visit B by the link on the homepage of clicking A according to HTTP Referer.
Browsing history shown in Fig. 3 can be to filter out the residue record that obtains behind the record ignored according to Rule-based method from original browsing history.Particularly, filter element 140 can be configured to realize Rule-based method comes the function of filtering record ignored, above-mentioned rule can be: if the content type of record is not text or html, then remove this record; If the access mode of record is not GET or POST, then remove this record; If the request path of record comprises one among suffix " .css ", " .ico " or " .js ", then remove this record; And if the record body be the sky, then remove this record.As long as record satisfies in the above-mentioned rule any one, filter element 140 just should record filtering.Thus, can be from user's browsing history unessential for the present invention, those records that can be counted as the noise files that search uses of filtering, thereby can reduce the quantity of the record that will be processed by browsing session recognition unit 110, therefore help to provide the treatment effeciency of whole device.
Below in conjunction with Fig. 4~Fig. 9 the concrete processing operation of browsing session recognition unit 110, search sessions identifying unit 120 and database construction unit 130 is described.
When the user browses by browser, several active browsing the phase may be arranged, also, the user browses by browser always continuously.For example, the user may use by the Web that browser used 5 minutes, and then the user has stopped browsing, then uses computer in other mode, then the document function of for example using Microsoft office to carry out 10 minutes begins again to carry out web page browsing etc. by browser.Therefore, need to utilize browsing session recognition unit 110 from user's browsing histories, to identify the active phase of browsing, be browsing session, then could utilize the search sessions identifying unit from browsing session, to find out to comprise active that search uses to browse the phase, be search sessions.
Fig. 4 is the block diagram of the structure of the schematically illustrated browsing session recognition unit 110 as shown in Fig. 1 and Fig. 2 according to the embodiment of the invention.As shown in Figure 4, browsing session recognition unit 110 may further include the first judgement subelement 410 and recognin unit 420.
First judges whether the adjacent browsing history that subelement 410 can be arranged in the browsing history of judging the user belongs to same browsing session.For example, first judge subelement 410 can by the time interval between the adjacent browsing history in the browsing history of judging the user whether more than or equal to the Preset Time interval, judge whether described adjacent browsing history belongs to same browsing session.Specifically, between the adjacent browsing history in user's browsing history interval greater than or equal in the situation at Preset Time interval, first judges that the described adjacent browsing history of subelement 410 judgements belongs to respectively different browsing sessions, otherwise first judges that the described adjacent browsing history of subelement 410 judgements belongs to same browsing session.Certainly, can judge in other way also whether adjacent browsing history belongs to same browsing session.
Fig. 5 shows the time distribution map of the browsing history of user within a period of time.In Fig. 5, horizontal ordinate represents the time (supposing that along the axial chronomere of horizontal ordinate be 1 minute) that historical record occurs, and ordinate is illustrated in the quantity of the historical record that produces in each chronomere.
Recognin unit 420 can be arranged to according to first judges that the result of determination of subelement 410 identifies a plurality of browsing sessions in user's browsing history.Thus, can be divided into a plurality of groups to a large amount of browsing histories of user, every group is a browsing session, wherein, can comprise one or more browsing histories in each browsing session.
For user's browsing history as shown in Figure 5, suppose that Preset Time is spaced apart 5 minutes, then first judge subelement 410 to each other interval greater than or equal two adjacent historical records of 5 minutes and be judged to be the browsing session that belongs to different, and to each other time interval is judged to be same browsing session less than two adjacent historical records of 5 minutes.Like this, recognin unit 420 can identify 3 browsing sessions from user's browsing histories as shown in Figure 5.
But the present invention is not limited to this structure, and other can also should be included in the scope of the present invention according to the similar structures that browsing history and time of origin thereof are identified browsing session.For example, by configuration browsing session recognition unit 110, can cut apart browsing histories by the blank time phase in the identification user browsing histories, thereby obtain a plurality of browsing sessions, in other words, the every browsing history of adjacent two blank time between the phase is judged to be a browsing session.
As mentioned above, in user's browsing histories, not only comprise the historical record that the use search is used, also may comprise other historical record, such as using the historical record of reading news or checking and accepting the diverse network Web application of the functions such as mail such as being used for.Therefore, need to from the browsing session that identifies, identify further those sessions that comprise search application information, be search sessions.
Can find by observing, usually comprise the searching key word that highlights among the body of Search Results, and the user may often click Search Results, before the user clicks Search Results and between the record that produces afterwards, there is certain relevance (namely, the referer of the record of clicking is the URL of search operation record), therefore can judge search sessions in the browsing session according to the parameter attribute (for example, the frequency of occurrences of searching key word, highlighted indicating characteristic etc.) of the record in the browsing session and the relevance between record.
Fig. 6 is the block diagram of the structure of the schematically illustrated search sessions identifying unit 120 as shown in Fig. 1 and Fig. 2 according to the embodiment of the invention.As shown in Figure 6, search sessions identifying unit 120 may further include the second judgement subelement 610, the 3rd judgement subelement 620 and the 4th is judged subelement 630.Second judges that subelement 610 can be arranged to judgement and whether have the search operation record in the browsing session of identifying, wherein search operation record is at text, is to have the record that occurrence number surpasses preset value and highlighted parameter value among the body, supposes to represent with SR_search.The 3rd judges subelement 620 can be arranged in the second result of determination of judging subelement 610 as certainly, exist in the situation of search operation record in the browsing session namely identified, judges whether have such record in this browsing session: occur in search operation and record SR_search URL afterwards and that record with search operation and be the record of referer referer.The 4th judge subelement 630 can be arranged to the 3rd result of determination of judging subelement 620 as sure situation under, this browsing session is judged to be search sessions.
Thus, by having the search sessions identifying unit of structure as shown in Figure 6, can further identify which browsing session in the browsing session that identifies is search sessions.
Fig. 7 is the block diagram of the structure of the schematically illustrated database construction unit 130 as shown in Fig. 1 and Fig. 2 according to the embodiment of the invention.As shown in Figure 7, database construction unit 130 can comprise search application information extraction subelement 710 and Database subelement 720.
Wherein, search application information extraction subelement 710 can be arranged in the record that comprises from the search sessions of judging and extract search application information, this search application information can comprise following information at least: the domain name that search is used (that is the hostname that, comprises in the search operation record); The request path corresponding with the domain name of this search application (that is the request path that, comprises in the described search operation record); The searched key word parameter corresponding with domain name and described request path; The search time corresponding with domain name, described request path and described searched key word parameter; And the entry number of clicked mistake in the Search Results corresponding with domain name, described request path, described searched key word parameter and described search time.
Wherein, the searched key word parameter corresponding with domain name and described request path is that occurrence number in the text (being body) of described search operation record surpasses predetermined threshold and highlighted parameter value.Usually, domain name and request path that last set is used can be corresponding at least one keywords, and this shows that the user can successively repeatedly search under same domain name, same request path, and each search can be used identical or different searching key words.
Because the user may carry out the search of one or many to same keyword under same domain name, same path, so correspondingly, the search time corresponding with same domain name, same request path and same searched key word parameter also can be for one or more.
In addition, because the Search Results corresponding with domain name, request path, searching key word and the search time determined is unique, so the entry number of clicked mistake also is well-determined in this Search Results.
In another specific implementation according to the database construction device of the embodiment of the invention, can a click clauses and subclauses statistical module 800 as shown in Figure 8 be set by extracting in the subelement 710 in search application information, determine the entry number of clicked mistake in Search Results.That is, click the entry number that clauses and subclauses statistical module 800 is arranged to clicked mistake in the statistics Search Results corresponding with above-mentioned domain name, above-mentioned request path, above-mentioned searched key word parameter and above-mentioned search time.
Particularly, as shown in Figure 8, click clauses and subclauses statistical module 800 and can comprise definite submodule 810 and statistics submodule 820.Wherein, determine submodule 810 be arranged to determine in the search sessions of judging, have a search operation record that in text occurrence number surpasses preset value and highlighted parameter value.The URL that statistics submodule 820 is arranged in described search sessions is that statistics occurs after described search operation record, record take described search operation is the number of the record of referer, and this number is defined as the entry number of clicked mistake in the Search Results corresponding with domain name, described request path, described searched key word parameter and described search time.Thus, can determine the entry number of clicked mistake in Search Results.
In addition, as shown in Figure 7 Database subelement 720 can be arranged to and extract the search application information that subelement 710 extracts according to search application information and set up the search application information database.In described search application information database, search application information can be divided into groups according to domain name and described request path,, the search application information relevant with same request path with same domain name can be divided into same group of information that is.
For example, Fig. 9 shows the last set application message that obtains in an example according to the database construction device of the embodiment of the invention.As shown in Figure 9, the domain name that search is used is " www.baidu.com ", request path is " s ", the user on Dec 24th, 2010 16:38:35 keyword " Fujitsu " is searched for, and in corresponding Search Results, 3 clauses and subclauses have been clicked, the user on Dec 27th, 2010 15:22:12 keyword " Japan " is searched for, and in corresponding Search Results, clicking 6 clauses and subclauses, etc.
In addition, except the top illustrated information of giving an example, also can be included in the embodiment of the invention related " search application information " such as the information such as search application title, marked graph that obtain by known technology.
Figure 10 is the block diagram of the another kind of structure of schematically illustrated database construction device according to the embodiment of the invention, wherein, the unit that uses solid box to describe in Figure 10 is essential parts, and the unit that the with dashed lines frame is described is non-essential selectable unit (SU), can select as required in actual applications.
As shown in figure 10, in the database construction device 1000 according to the embodiment of the invention, device 1000 can also comprise updating block 150 except comprising browsing session recognition unit 110, search sessions identifying unit 120 and database construction unit 130 and optional filter element 140.Wherein, updating block 150 is arranged to termly and starts the browsing session recognition unit 110, search sessions identifying unit 120 and the database construction unit 130 that are included in the device 1000 and optional filter element 140 rebuilding the search application information database, and replaces original search application information database with the search application information database of new structure.
Particularly, for example, updating block 150 can start browsing session recognition unit 110, search sessions identifying unit 120, database construction unit 130 and optional filter element 140 according to the default cycle in the time interval to be processed separately accordingly, rebuilding the search application information database, and substitute original database with the search application information database of this new structure.Thus, regular update function that can implement device 1000 so that the search application information database that is obtained by this device can be complementary with user's up-to-date browsing histories, and can more meet user current search custom and demand.
Can find out by above description, in the database construction device according to the embodiment of the invention, can be by the history that surfs the web of digging user, create a database that comprises the relevant information that the search relevant with user's browsing histories used, be the search application information database, thereby realize the integration to the search application related information that comprises in user's browsing histories.The search application information database that creates can be supported user's personalized search, this be because, therefore search application in this search application information database and information are to obtain by excavating based on the browsing histories to the specific user, fully use preference and the demand of match user.The integration of the relevant information that the database that traditional search makes up in using and integrating is normally used several fixing search, wherein usually include only relevant information comparatively popular, that use widely several search to use on the network, therefore can not satisfy simultaneously various use preference and the demand of different user; And by the relevant information that can comprise in the database that creates according to the device of the embodiment of the invention that once used all search of user are used, or user's relevant information that used all search are used within recently a period of time, therefore wherein can comprise on the network more unpopular, the relevant information that bright few some search of using is used, and these non-mainstream search application might just be best suited for certain class user's demand and the search of custom is used, therefore, the database that is created by the device according to the embodiment of the invention can be supported the different search needs of different user.
According to embodiments of the invention, also provide a kind of search to use integration system, this integration system comprises that described above being used for makes up search application information wide area information server construction device, is described below in conjunction with Figure 11.
Figure 11 is the schematically illustrated block diagram of using the structure of integration system according to the search of the embodiment of the invention.As shown in figure 11, integration system 1100 comprises above described for making up search application information wide area information server construction device 1110, using integral unit 1120 and interface unit 1130 in conjunction with Fig. 1-10.Wherein, database construction device 1110 can have for example 26S Proteasome Structure and Function shown in Fig. 1,2 and 10, for fear of repetition, has omitted description to the 26S Proteasome Structure and Function of database construction device 1110 at this.
In addition, the structure of each building block also can have for example such as Fig. 4 in the database construction device 1110,6,26S Proteasome Structure and Function shown in 7 and 8, for example, the database construction unit that comprises in the database construction device 1110 can have the 26S Proteasome Structure and Function identical with above database construction unit described in conjunction with Figure 7 130, namely, the database construction unit that comprises in the database construction device 1110 can comprise search application information extraction subelement and Database subelement, wherein, the function of search application information extraction subelement and Database subelement can be extracted referring to above search application information described in conjunction with Figure 7 the function of subelement 710 and Database subelement 720, etc., omit its specific descriptions at this.
Referring to Figure 11, using integral unit 1120 can be arranged to and utilize all search that relate in the constructed search application information database of database construction device 1110 to use the keyword of user's input to be searched for, obtained the integration Search Results that the Search Results that all search are used is combined.
Interface unit 1130 can be arranged to the demonstration inputting interface, receives the keyword of user's input, and shows above-mentioned integration Search Results.
In utilizing an application example of searching for according to the search application integration system of the embodiment of the invention, when the user begins to search for by keyword of interface unit 1130 inputs, using integral unit 1120 utilizes each the involved search in the search application information database that has been created by database construction device 1110 to use, the keyword that comes respectively the user is inputted is at the enterprising line search of network, then use integral unit 1120 each Search Results of searching for application is incorporated into together, and show the current integration Search Results that obtains by interface unit 1130.
In addition, interface unit 1130 can show above-mentioned integration Search Results in a certain order.For example, Figure 12 has schematically shown the another kind of structure according to the integration system of the embodiment of the invention.
As shown in figure 12, integration system 1200 also comprises sequencing unit 1140 except comprising database construction device 1110, using integral unit 1120 and the interface unit 1130.Sequencing unit 1140 is arranged to according to one of following three kinds of modes and sorts to integrating Search Results, and the integration Search Results after will sorting is sent to interface unit 1130, the integration Search Results after this sorts by interface unit 1130 demonstrations afterwards.
The first sortord is: use the number of times that was used according to the search relevant with integrating Search Results and sort.Particularly, can calculate the group number of each self-corresponding search application information of domain name of the search application relevant with integrating Search Results, the group number of the search application information that the domain name of each search application is corresponding is used the number of times that was used as the search of correspondence.
The second sortord is: according to how much sorting of the entry number of clicked mistake in each self-corresponding Search Results of domain name of the search application relevant with integrating Search Results.
The third sortord is: the priority according to the domain name of the search application relevant with integrating Search Results each self-corresponding up-to-date search time sorts, also, and according to sorting its last service time.
Wherein, above-mentioned all sortords can be that the same Search Results of using correspondence is sorted as a whole, also namely, the purpose of ordering is to sorting between each application, using the sortord that corresponding some Search Results then adopt this application itself for one.
For example, illustrate as an example of the second sortord example, use in the example for one at integration system according to an embodiment of the invention, in the search application information database that the browsing histories according to the user makes up, relate to altogether Google, certain section of internal management of a company website, four search application of Nifty and Baidu, also namely above-mentioned " domain name that the search relevant with integrating Search Results used " comprises that above four are searched for the domain name of using separately.For example, as a result cn.fujitsu.com and these two clicked mistakes of result of detail.zol.com.cn of search " Fujitsu " in Google, and search " NEC " in Google, two clicked mistakes of result of nec.com and nec.jp are arranged again, and then the entry number of clicked mistake is 4 in the corresponding history Search Results of Google.Similarly, can obtain the entry number of clicked mistake in Search Results corresponding to certain section of internal management of a company website, Nifty and three search application of Baidu, in this example, these 3 entry number respectively are 2,1 and 3.Then according to above-mentioned the second sortord to the result that the integration Search Results sorts be: Google, Baidu, certain section of internal management of a company website and Nifty.As shown in figure 13, Figure 13 is that at integration system according to an embodiment of the invention uses in the example, utilizes the second sortord to integrating the display interface after Search Results sorts.Wherein, in Figure 13, each use with and corresponding Search Results be positioned at delegation.
It is a kind of for making up search application information wide area information server construction method that embodiments of the invention also provide, and Figure 14 shows the treatment scheme of the method.
As shown in figure 14, the treatment scheme 1400 of this database construction method starts from step S1410, then execution in step S1420.
In step S1420, identify browsing session in the browsing histories, then execution in step S1430 based on user's browsing history and time of origin thereof.
In an example, user's browsing history can be the original browsing history that directly obtains.Wherein, original browsing history can obtain by the mode of the original browsing history of acquisition described hereinbefore, specifically can be referring to above describing.
In another example, user's browsing history also can be by filter out the browsing history after the filtration that obtains behind the record ignored from the user's that obtains original browsing history
In addition, in a specific implementation for the treatment of scheme 1400, the step of the browsing session in the identification browsing histories among the step S1420 can comprise: whether the adjacent browsing history in judgement user's the browsing history belongs to same browsing session; And in user's browsing history, identify a plurality of browsing sessions according to the result who judges.Wherein, whether the adjacent browsing history in above-mentioned judgement user's the browsing history belongs to the concrete decision process of same browsing session can be utilized first to judge that the decision process of subelement 410 is identical with above described in conjunction with Figure 4, specifically describes no longer and repeats.
In step S1430, according to the parameter attribute of the record in the browsing session and the relevance between record, judge whether above-mentioned browsing session is search sessions, then execution in step S1440.
For example, in a specific implementation for the treatment of scheme 1400, in the situation that can judge in the following manner search sessions among the step S1430: satisfy simultaneously following two conditions at browsing session, this browsing session is judged to be search sessions.
Wherein, a condition is: have the search operation record in the browsing session of identifying, wherein, search operation record is to have the record that in text occurrence number surpasses preset value and highlighted parameter value.
Another condition is: have such record in browsing session: occur in after the search operation record and the record take the URL of search operation record as referer.
Thus, can judge which session in the browsing session of having identified based on above two conditions is search sessions.
In step S1440, obtain search application information in the search sessions according to the search sessions of judging, and make up search application information database, then execution in step S1450 by this search application information.
Wherein, in a specific implementation for the treatment of scheme 1400, search application information is extracted in browsing of can comprising in the search sessions of having judged in the record, and then make up the search application information database, wherein, the search application information of extracting can comprise following information at least: the domain name that search is used, the request path corresponding with above-mentioned domain name, the searched key word parameter corresponding with above-mentioned domain name and above-mentioned request path, with above-mentioned domain name, above-mentioned request path and above-mentioned searched key word parameter corresponding search time, and with above-mentioned domain name, above-mentioned request path, the entry number of clicked mistake in the above-mentioned searched key word parameter Search Results corresponding with above-mentioned search time; Wherein, above-mentioned search application information is the information after dividing into groups according to domain name and described request path.
In addition, except the top illustrated information type of giving an example, also can be included in the embodiment of the invention related " search application information " such as the information such as search application title, marked graph that obtain by known technology.
In this explanation, to extract the search application information that subelement 710 extracts identical for mentioned search application information and search application information described in conjunction with Figure 7 above here, and its concrete meaning is referring to above description.The acquisition methods of each information that comprises in the mentioned search application information here in addition, also can be identical with the preparation method of each corresponding informance of above describing.
For example, in a specific implementation for the treatment of scheme 1400, " entry number of clicked mistake in the Search Results corresponding with above-mentioned domain name, above-mentioned request path, above-mentioned searched key word parameter and above-mentioned search time " can obtain in the following manner: determine in the search sessions of judging, have a search operation record that in text occurrence number surpasses preset value and highlighted parameter value; And the URL that statistics occurs after above-mentioned search operation record in above-mentioned search sessions, record take above-mentioned search operation is the number of the record of referer, and this number is defined as the entry number of clicked mistake in the Search Results corresponding with above-mentioned domain name, above-mentioned request path, above-mentioned searched key word parameter and above-mentioned search time.
Treatment scheme 1400 ends at step S1450.
In addition, in another specific implementation for the treatment of scheme 1400, treatment scheme 1400 can also comprise step of updating: rebuild termly the search application information database, and use the new search application information database that makes up to replace original search application information database.For example, in this another specific implementation figure according to treatment scheme 1400, can preset a time interval, and make 1400 these time intervals of every process for the treatment of scheme just re-execute step S1420-1440 one time, thereby the search application information database is upgraded.Step of updating can more meet the nearest browsing histories of user, thereby also more satisfies user current use preference and custom.
The database that creates according to the database construction method of the embodiment of the invention, wherein can comprise used all the search application of user and information or user used all search application and information within recently a period of time, therefore can comprise wherein that more unpopular on the network, bright is that some search of using is used and information, and might these non-mainstream search use the demand that exactly is best suited for certain class user and custom, therefore, the database that creates according to the database construction method of the embodiment of the invention can be supported the different search needs of different user.
Embodiments of the invention also provide a kind of search application integration method, and this integration method comprises above-mentioned database construction method, and Figure 15 shows the treatment scheme of this integration method.The treatment scheme 1500 of this integration method starts from step S1510 as shown in figure 15, then in step S1520 based on user's browsing history and time of origin thereof, browsing session in the identification browsing histories, in step S1530 according to the relevance between the parameter attribute of the record in the browsing session and record, judge whether above-mentioned browsing session is search sessions, in step S1540 according to the search sessions of judging, obtain the search application information in the search sessions, and by search application information structure search application information database, in step S1550, using all search relevant with the search application information database that makes up to use searches for the keyword of user's input, and obtaining integrating Search Results, above-mentioned integration method ends at step S1560.Wherein, included step S1520 in this treatment scheme 1500~S1540 corresponds respectively to the step S1420 that comprises in the above-described treatment scheme 1400~S1440, its specific implementation process can also can obtain similar technique effect referring to above describing, and does not repeat them here.
Each component units in the above-mentioned database construction device according to the embodiment of the invention (or integrating apparatus is used in search), subelement etc. can be configured by the mode of software, firmware, hardware or its combination in any.In the situation that realize by software or firmware, can the program that consist of this software or firmware be installed to the machine with specialized hardware structure (for example general-purpose machinery 1600 shown in Figure 16) from storage medium or network, this machine can be carried out the various functions of above-mentioned each component units, subelement when various program is installed.
Figure 16 shows the structure diagram that can be used to realize according to the hardware configuration of a kind of possible messaging device of the database construction device of the embodiment of the invention and method (or integrating apparatus and integration method are used in search).
In Figure 16, CPU (central processing unit) (CPU) 1601 carries out various processing according to the program of storage in the ROM (read-only memory) (ROM) 1602 or from the program that storage area 1608 is loaded into random access memory (RAM) 1603.In RAM 1603, also store as required data required when CPU 1601 carries out various processing etc.CPU 1601, ROM 1602 and RAM 1603 are connected to each other via bus 1604.Input/output interface 1605 also is connected to bus 1604.
Following parts also are connected to input/output interface 1605: importation 1606 (comprising keyboard, mouse etc.), output 1607 (comprise display, such as cathode-ray tube (CRT) (CRT), liquid crystal display (LCD) etc., and loudspeaker etc.), storage area 1608 (comprising hard disk etc.), communications portion 1609 (comprising such as LAN card, modulator-demodular unit etc. of network interface unit).Communications portion 1609 is via for example the Internet executive communication processing of network.As required, driver 1610 also can be connected to input/output interface 1605.Detachable media 1611 for example disk, CD, magneto-optic disk, semiconductor memory etc. can be installed on the driver 1610 as required, so that the computer program of therefrom reading can be installed in the storage area 1608 as required.
In the situation that realize above-mentioned series of processes by software, can from network for example the Internet or from storage medium for example detachable media 1611 program that consists of softwares is installed.
It will be understood by those of skill in the art that this storage medium is not limited to shown in Figure 16 wherein has program stored therein, distributes separately to provide the detachable media 1611 of program to the user with equipment.The example of detachable media 1611 comprises disk (comprising floppy disk), CD (comprising compact disc read-only memory (CD-ROM) and digital universal disc (DVD)), magneto-optic disk (comprising mini-disk (MD) (registered trademark)) and semiconductor memory.Perhaps, storage medium can be hard disk that comprises in ROM 1602, the storage area 1608 etc., computer program stored wherein, and be distributed to the user with the equipment that comprises them.
In addition, the invention allows for a kind of program product that stores the instruction code that machine readable gets.When described instruction code is read and carried out by machine, can carry out above-mentioned database construction method according to the embodiment of the invention (or search application integration method).Correspondingly, be also included within of the present invention disclosing for the various storage mediums such as disk, CD, magneto-optic disk, semiconductor memory etc. that carry this program product.
Above-mentioned database construction device and method and search application integrating apparatus and method according to the embodiment of the invention, browsing histories by digging user, one be can create by the browsing histories of digging user and the search application relevant with user's browsing histories and the integrated database of relevant information comprised, so that this this database can comprise popular network search engines, can also comprise that having seldom user's search uses; And, because this database and integration are based on user's browsing histories, therefore fully use preference and the demand of match user; In addition, the constructive process of database does not need user's participation, so that the user easily brings into use.
In the above in the description to the specific embodiment of the invention, can in one or more other embodiment, use in same or similar mode for the feature that a kind of embodiment is described and/or illustrated, combined with the feature in other embodiment, or the feature in alternative other embodiment.
Should emphasize, term " comprises/comprise " existence that refers to feature, key element, step or assembly when this paper uses, but does not get rid of the existence of one or more further feature, key element, step or assembly or additional.The term " first " that relates to ordinal number, " second " etc. do not represent enforcement order or the importance degree of feature, key element, step or assembly that these terms limit, and only is for for the purpose of being described clearly and be arranged between these features, key element, step or assembly and identify.
In addition, describe during the method for various embodiments of the present invention is not limited to specifications or accompanying drawing shown in time sequencing carry out, also can be according to other time sequencing, carry out concurrently or independently.The execution sequence of the method for therefore, describing in this instructions is not construed as limiting technical scope of the present invention.
Although the above discloses the present invention by the description to specific embodiments of the invention, but, should be appreciated that, those skilled in the art can design various modifications of the present invention, improvement or equivalent in the spirit and scope of claims.These modifications, improvement or equivalent also should be believed to comprise in protection scope of the present invention.
In addition, obviously, also can realize in the mode that is stored in the computer executable program in the various machine-readable storage mediums according to each operating process of said method of the present invention.
And, purpose of the present invention also can realize by following manner: the storage medium that will store above-mentioned executable program code offers system or equipment directly or indirectly, and the said procedure code is read and carried out to the computing machine in this system or equipment or CPU (central processing unit) (CPU).
At this moment, as long as this system or equipment have the function of executive routine, then embodiments of the present invention are not limited to program, and this program also can be form arbitrarily, for example, the program carried out of target program, interpreter or the shell script that offers operating system etc.
Above-mentioned these machinable mediums include but not limited to: various storeies and storage unit, semiconductor equipment, disc unit be light, magnetic and magneto-optic disk for example, and other is suitable for the medium of the information of storing etc.
In addition, client computer is by being connected to the corresponding website on the Internet, and will download and be installed to according to computer program code of the present invention and then carry out this program in the computing machine, also can realize the present invention.
At last, also need to prove, in this article, only be used for an entity or operation are separated with another entity or operational zone such as relational terms left and right, first and second etc., and not necessarily require or hint and have the relation of any this reality or sequentially between these entities or the operation.And, term " comprises ", " comprising " or its any other variant are intended to contain comprising of nonexcludability, thereby not only comprise those key elements so that comprise process, method, article or the equipment of a series of key elements, but also comprise other key elements of clearly not listing, or also be included as the intrinsic key element of this process, method, article or equipment.In the situation that not more restrictions, the key element that is limited by statement " comprising ... ", and be not precluded within process, method, article or the equipment that comprises described key element and also have other identical element.
To sum up, in an embodiment according to the present invention, the invention provides following scheme:
1. 1 kinds of remarks are used for making up search application information wide area information server construction device, comprising: the browsing session recognition unit, and it is arranged to based on user's browsing history and time of origin thereof and identifies browsing session in the described browsing histories; The search sessions identifying unit, it is arranged to according to the relevance between the parameter attribute of the record in the described browsing session and record judges whether described browsing session is search sessions; And the database construction unit, it is arranged to according to the search sessions of judging and obtains search application information in the described search sessions, and makes up the search application information database based on the search application information that obtains.
Remarks 2. is according to remarks 1 described database construction device, and wherein, described browsing session recognition unit comprises: first judges subelement, and whether its adjacent browsing history that is arranged in the browsing history of judging the user belongs to same browsing session; And the recognin unit, it is arranged to according to first judges that the result of determination of subelement 410 identifies a plurality of browsing sessions in user's browsing history.
Remarks 3. is according to remarks 1 described database construction device, wherein, described search sessions identifying unit comprises: second judges subelement, it is arranged to judges whether there is the search operation record in the browsing session of identifying, wherein, described search operation record is to have to have the record that occurrence number surpasses preset value and highlighted parameter value in text; The 3rd judges subelement, its be arranged to the described second result of determination of judging subelement as sure situation under, judge in described browsing session, whether there is such record: occur in after the described search operation record and with the URL of the described search operation record record as referer; And the 4th judge subelement, its be arranged to the described the 3rd result of determination of judging subelement as sure situation under, described browsing session is judged to be search sessions.
Remarks 4. is according to remarks 1 described database construction device, also comprise: filter element, it is arranged to from the user's who obtains original browsing history and filters out record ignored, and the browsing history after will filtering is sent to the browsing session recognition unit and processes.
Remarks 5. is according to remarks 1 described database construction device, also comprise: updating block, its be arranged to termly start be included in the described database construction device, the miscellaneous part except described updating block, rebuilding the search application information database, and replace original search application information database with the new search application information database that makes up.
Remarks 6. is according to the described database construction device of any one among the remarks 1-5, wherein, described database construction unit comprises: search application information is extracted subelement, it is arranged in the record that comprises from the search sessions of judging and extracts search application information, described search application information comprises following information at least: the domain name that search is used, the request path corresponding with domain name, the searched key word parameter corresponding with domain name and described request path, with domain name, described request path and described searched key word parameter corresponding search time, and and domain name, the described request path, the entry number of clicked mistake in the described searched key word parameter Search Results corresponding with described search time; And Database subelement, it is arranged to the described search application information of extracting according to search application information extraction subelement and sets up the search application information database, and, in described search application information database, described search application information is divided into groups according to domain name and described request path.
Remarks 7. is according to remarks 6 described database construction devices, wherein, in described search application information extraction subelement, comprise and click the clauses and subclauses statistical module, described click clauses and subclauses statistical module is arranged to the described and domain name of statistics, the described request path, the entry number of clicked mistake in the described searched key word parameter Search Results corresponding with described search time, wherein said click clauses and subclauses statistical module comprises: determine submodule, it is arranged to determines in the search sessions of judging, have in text occurrence number and surpass the search operation record of preset value and highlighted parameter value; And statistics submodule, it is arranged in described search sessions URL that statistics occurs, that record take described search operation and is the number of the record of referer after described search operation record, and this number is defined as the entry number of clicked mistake in the Search Results corresponding with domain name, described request path, described searched key word parameter and described search time.
Integration system is used in 8. 1 kinds of search of remarks, comprise such as any one database construction device among the remarks 1-5, also comprise: use integral unit, it is arranged to and utilizes all search application that relate in the constructed search application information database of database construction device that the keyword of user's input is searched for, and obtains the integration Search Results that the Search Results that all search are used is combined; And interface unit, it is arranged to the demonstration inputting interface, receives the keyword of user's input, and shows described integration Search Results.
Remarks 9. is according to remarks 8 described search application integration systems, wherein, the database construction unit that comprises in the described database construction device comprises: search application information is extracted subelement, it is arranged in the record that comprises from the search sessions of judging and extracts search application information, described search application information comprises following information at least: the domain name that search is used, the request path corresponding with domain name, the searched key word parameter corresponding with domain name and described request path, with domain name, described request path and described searched key word parameter corresponding search time, and and domain name, the described request path, the entry number of clicked mistake in the described searched key word parameter Search Results corresponding with described search time; And Database subelement, it is arranged to the described search application information of extracting according to search application information extraction subelement and sets up the search application information database, and, in described search application information database, described search application information is divided into groups according to domain name and described request path.
Remarks 10. is according to remarks 9 described search application integration systems, also comprise sequencing unit, described sequencing unit is arranged to according to one of following three kinds of modes described integration Search Results is sorted, and the integration Search Results after will sorting is sent to interface unit: use the number of times that was used according to the search relevant with described integration Search Results and sort; How much the entry number of clicked mistake sorts in each self-corresponding Search Results of domain name of using according to the search relevant with described integration Search Results; The priority of the domain name of perhaps using according to the search relevant with described integration Search Results each self-corresponding up-to-date search time sorts.
11. 1 kinds of remarks are used for making up search application information wide area information server construction method, comprising: identify browsing session in the described browsing histories based on user's browsing history and time of origin thereof; Judge according to the parameter attribute of the record in the described browsing session and the relevance between record whether described browsing session is search sessions; And obtain search application information in the described search sessions according to the search sessions of judging, and make up the search application information database based on the search application information that obtains.
Remarks 12. is according to remarks 11 described database construction methods, and the browsing session in the described browsing histories of described identification comprises: whether the adjacent browsing history in judgement user's the browsing history belongs to same browsing session; And in user's browsing history, identify a plurality of browsing sessions according to the result who judges.
Remarks 13. is according to remarks 11 described database construction methods, wherein, describedly judge whether described browsing session comprises as search sessions: in the situation that described browsing session satisfies following two conditions simultaneously, judge that described browsing session is search sessions: in the browsing session of identifying, have the search operation record, wherein, described search operation record is to have to have the record that occurrence number surpasses preset value and highlighted parameter value in text; And judge in described browsing session, whether there is such record: occur in after the described search operation record and with the URL of the described search operation record record as referer.
Remarks 14. is according to remarks 11 described database construction methods, and wherein, described user's browsing history is by filter out the browsing history after the filtration that obtains behind the record ignored from the user's that obtains original browsing history.
Remarks 15. also comprises according to remarks 11 described database construction methods: rebuild termly the search application information database, and use the new search application information database that makes up to replace original search application information database.
Remarks 16. is according to the described database construction method of any one among the remarks 11-15, wherein, described search application information comprises following information at least: the domain name that search is used, the request path corresponding with domain name, the searched key word parameter corresponding with domain name and described request path, the search time corresponding with domain name, described request path and described searched key word parameter, and the entry number of clicked mistake in the Search Results corresponding with domain name, described request path, described searched key word parameter and described search time; Wherein, described search application information is the information after dividing into groups according to domain name and described request path.
Remarks 17. is according to remarks 16 described database construction methods, and the entry number of clicked mistake obtains in the following manner in the wherein said Search Results corresponding with domain name, described request path, described searched key word parameter and described search time: determine in the search sessions of judging, have a search operation record that in text occurrence number surpasses preset value and highlighted parameter value; And the URL that statistics occurs after described search operation record in described search sessions, record take described search operation is the number of the record of referer, and described number is defined as the entry number of clicked mistake in the Search Results corresponding with domain name, described request path, described searched key word parameter and described search time.
18. 1 kinds of search of remarks application integration method, comprise such as any one database construction method among the remarks 11-17, also comprise: the keyword that receives user's input, and utilize by all search application that relate in the constructed search application information database of database construction method the keyword of user's input is searched for, obtain the integration Search Results that the Search Results that all search are used is combined.
19. 1 kinds of computer-readable recording mediums of remarks, store the computer program that can be carried out by computing equipment on it, described program can make among the described computing equipment executive basis remarks 11-17 the described database construction method of any one or according to remarks 18 described search application integration methods when carrying out.
Although described the present invention and advantage thereof in detail, be to be understood that and in the situation that does not break away from the spirit and scope of the present invention that limited by appended claim, can carry out various changes, alternative and conversion.And the application's scope is not limited only to the specific embodiment of structure, means, method and the step of the described process of instructions, equipment, manufacturing, material.The one of ordinary skilled in the art will readily appreciate that from disclosure of the present invention, can use according to the present invention and carry out and structure, means, method or the step essentially identical function of corresponding embodiment described herein or acquisition result essentially identical with it, that have and want in the future exploited process, equipment, manufacturing, material now.Therefore, appended claim is intended to comprise in their scope structure, means, method or the step of such process, equipment, manufacturing, material.
Although the above embodiments of the invention of describing in detail by reference to the accompanying drawings should be understood that embodiment described above just is used for explanation the present invention, and are not construed as limiting the invention.For a person skilled in the art, can make various changes and modifications above-mentioned embodiment and do not deviate from the spirit and scope of the invention.Therefore, scope of the present invention is only limited by appended claim and equivalents thereof.

Claims (10)

1. one kind is used for making up search application information wide area information server construction device, comprising:
The browsing session recognition unit, it is arranged to based on user's browsing history and time of origin thereof and identifies browsing session in the described browsing histories;
The search sessions identifying unit, it is arranged to according to the relevance between the parameter attribute of the record in the described browsing session and record judges whether described browsing session is search sessions; And
The database construction unit, it is arranged to according to the search sessions of judging and obtains search application information in the described search sessions, and makes up the search application information database based on the search application information that obtains.
2. database construction device according to claim 1 also comprises:
Filter element, it is arranged to from the user's who obtains original browsing history and filters out record ignored, and the browsing history after will filtering is sent to the browsing session recognition unit and processes.
3. database construction device according to claim 1 also comprises:
Updating block, its be arranged to termly start be included in the described database construction device, the miscellaneous part except described updating block, rebuilding the search application information database, and replace original search application information database with the new search application information database that makes up.
4. the described database construction device of any one according to claim 1-3, wherein, described database construction unit comprises:
Search application information is extracted subelement, and it is arranged in the record that comprises from the search sessions of judging and extracts search application information, and described search application information comprises following information at least: the domain name that search is used; The request path corresponding with domain name; The searched key word parameter corresponding with domain name and described request path; The search time corresponding with domain name, described request path and described searched key word parameter; And the entry number of clicked mistake in the Search Results corresponding with domain name, described request path, described searched key word parameter and described search time; And
The Database subelement, it is arranged to the described search application information of extracting according to search application information extraction subelement and sets up the search application information database, and, in described search application information database, described search application information is divided into groups according to domain name and described request path.
5. the database construction device described in according to claim 4, wherein, in described search application information extraction subelement, comprise and click the clauses and subclauses statistical module, described click clauses and subclauses statistical module is arranged to the entry number of clicked mistake in the described Search Results corresponding with domain name, described request path, described searched key word parameter and described search time of statistics, wherein
Described click clauses and subclauses statistical module comprises:
Determine submodule, its be arranged to determine in the search sessions of judging, have a search operation record that in text occurrence number surpasses preset value and highlighted parameter value; And
The statistics submodule, it is arranged in described search sessions URL that statistics occurs, that record take described search operation and is the number of the record of referer after described search operation record, and this number is defined as the entry number of clicked mistake in the Search Results corresponding with domain name, described request path, described searched key word parameter and described search time.
6. integration system is used in a search, comprises such as any one database construction device among the claim 1-3, also comprises:
Use integral unit, it is arranged to and utilizes all search application that relate in the constructed search application information database of database construction device that the keyword of user's input is searched for, and obtains the integration Search Results that the Search Results that all search are used is combined; And
Interface unit, it is arranged to the demonstration inputting interface, receives the keyword of user's input, and shows described integration Search Results.
7. integration system is used in search according to claim 6, and wherein, the database construction unit that comprises in the described database construction device comprises:
Search application information is extracted subelement, and it is arranged in the record that comprises from the search sessions of judging and extracts search application information, and described search application information comprises following information at least: the domain name that search is used; The request path corresponding with domain name; The searched key word parameter corresponding with domain name and described request path; The search time corresponding with domain name, described request path and described searched key word parameter; And the entry number of clicked mistake in the Search Results corresponding with domain name, described request path, described searched key word parameter and described search time; With
The Database subelement, it is arranged to the described search application information of extracting according to search application information extraction subelement and sets up the search application information database, and, in described search application information database, described search application information is divided into groups according to domain name and described request path.
8. integration system is used in search according to claim 7, also comprises:
Sequencing unit, it is arranged to according to one of following three kinds of modes described integration Search Results is sorted:
Using the number of times that was used according to the search relevant with described integration Search Results sorts;
How much the entry number of clicked mistake sorts in each self-corresponding Search Results of domain name of using according to the search relevant with described integration Search Results; Perhaps
The priority of the domain name of using according to the search relevant with described integration Search Results each self-corresponding up-to-date search time sorts.
9. one kind is used for making up search application information wide area information server construction method, comprising:
Identify browsing session in the described browsing histories based on user's browsing history and time of origin thereof;
Judge according to the parameter attribute of the record in the described browsing session and the relevance between record whether described browsing session is search sessions; And
Obtain search application information in the described search sessions according to the search sessions of judging, and make up the search application information database based on the search application information that obtains.
10. a search application integration method comprises database construction method as claimed in claim 9, also comprises:
Receive the keyword of user's input, and utilize by all search application that relate in the constructed search application information database of database construction method the keyword of user's input is searched for, obtain the integration Search Results that the Search Results that all search are used is combined.
CN201110304836.7A 2011-09-28 2011-09-28 Database sharing apparatus and method, search application integrating system and method Expired - Fee Related CN103034662B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201110304836.7A CN103034662B (en) 2011-09-28 2011-09-28 Database sharing apparatus and method, search application integrating system and method

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201110304836.7A CN103034662B (en) 2011-09-28 2011-09-28 Database sharing apparatus and method, search application integrating system and method

Publications (2)

Publication Number Publication Date
CN103034662A true CN103034662A (en) 2013-04-10
CN103034662B CN103034662B (en) 2016-06-08

Family

ID=48021563

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201110304836.7A Expired - Fee Related CN103034662B (en) 2011-09-28 2011-09-28 Database sharing apparatus and method, search application integrating system and method

Country Status (1)

Country Link
CN (1) CN103034662B (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107295405A (en) * 2017-07-14 2017-10-24 深圳市海云天科技股份有限公司 The compression method and system of a kind of video tour record

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101079033A (en) * 2006-06-30 2007-11-28 腾讯科技(深圳)有限公司 Integrative searching result sequencing system and method
US20100082637A1 (en) * 2008-09-30 2010-04-01 Yahoo; Inc. Web Page and Web Site Importance Estimation Using Aggregate Browsing History
CN102135985A (en) * 2011-01-28 2011-07-27 百度在线网络技术(北京)有限公司 Method and system for searching by calling search result of third-party search engine

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101079033A (en) * 2006-06-30 2007-11-28 腾讯科技(深圳)有限公司 Integrative searching result sequencing system and method
US20100082637A1 (en) * 2008-09-30 2010-04-01 Yahoo; Inc. Web Page and Web Site Importance Estimation Using Aggregate Browsing History
CN102135985A (en) * 2011-01-28 2011-07-27 百度在线网络技术(北京)有限公司 Method and system for searching by calling search result of third-party search engine

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107295405A (en) * 2017-07-14 2017-10-24 深圳市海云天科技股份有限公司 The compression method and system of a kind of video tour record

Also Published As

Publication number Publication date
CN103034662B (en) 2016-06-08

Similar Documents

Publication Publication Date Title
CN100476830C (en) Network resource searching method and system
US7707161B2 (en) Method and system for creating a concept-object database
CN104715064B (en) It is a kind of to realize the method and server that keyword is marked on webpage
CN103023714B (en) The liveness of topic Network Based and cluster topology analytical system and method
US20150032728A1 (en) System and method of generating a set of search results
US9122769B2 (en) Method and system for processing information of a stream of information
KR101463974B1 (en) Big data analysis system for marketing and method thereof
US8321396B2 (en) Automatically extracting by-line information
US20020065857A1 (en) System and method for analysis and clustering of documents for search engine
WO2011063035A1 (en) A method and system to contextualize information being displayed to a user
CN101576928A (en) Method and device for selecting related article
CN104391978A (en) Method and device for storing and processing web pages of browsers
CN1841377A (en) Crawling databases for information
Alkalbani et al. Design and implementation of the hadoop-based crawler for saas service discovery
CN116226494B (en) Crawler system and method for information search
KR20050070955A (en) Method of scientific information analysis and media that can record computer program thereof
CN103034662A (en) Database establishment device, database establishment method, search application integration system and search application integration method
KR20030051577A (en) Display method for research result in internet site
EP2411930A2 (en) A system for automatic semantic-based mining
CN105095324A (en) User classification apparatus, user classification method and electronic device
CN112417248A (en) Recommendation method, device, model, equipment and storage medium for addressing keywords
Li et al. Research of network data mining based on reliability source under big data environment
CN107463570B (en) Document retrieval/analysis method and device
KR101021022B1 (en) Apparatus and method for providing customized search engine
KR101059032B1 (en) Search Schema Setting Device and Method

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C14 Grant of patent or utility model
GR01 Patent grant
CF01 Termination of patent right due to non-payment of annual fee
CF01 Termination of patent right due to non-payment of annual fee

Granted publication date: 20160608

Termination date: 20180928